TY - JOUR
T1 - Deep reinforcement learning-based energy-aware disassembly planning for end-of-life products with stimuli-activated self-disassembly
AU - Wang, Di
AU - Zhao, Jing
AU - Han, Muyue
AU - Li, Lin
PY - 2025/12/1
Y1 - 2025/12/1
N2 - Remanufacturing stands as a cornerstone strategy for end-of-life (EOL) product management, playing a vital role in fostering a circular economy. Despite its significance, the widespread implementation remains difficult, mainly due to challenges such as labor-intensive operations, diminished quality, and time-consuming processes involved in component disassembly. A potential solution emerges in stimuli-activated self-disassembly, offering a non-destructive pathway that encourages seamless human–machine collaboration. This innovative approach facilitates the simultaneous disassembly of multiple components, reducing damage, labor costs, and energy consumption. Notably, limited studies have addressed real-time disassembly planning (DP), especially within self-disassembling workstations. Our research aims to maximize disassembly profit and energy recovery by optimizing disassembly sequences, EOL options, and a hybrid scheme that combines manual and self-disassembly operations. We propose an advanced deep reinforcement learning (DRL) algorithm that incorporates an innovative loss function, a revised training scheme, and parameter embedding to generate the Pareto frontier. Additionally, we propose a compact product representation that captures dynamics and uncertainties, such as product type variations, missing components, potential disassembly failure, and stochastic product quality. The effectiveness of our approach is demonstrated through a case study involving a TV disassembly line, benchmarked against six baselines. Furthermore, a sensitivity analysis is conducted to elucidate the impact of labor expenses and hybrid disassembly schemes on the ultimate profit recovery.
AB - Remanufacturing stands as a cornerstone strategy for end-of-life (EOL) product management, playing a vital role in fostering a circular economy. Despite its significance, the widespread implementation remains difficult, mainly due to challenges such as labor-intensive operations, diminished quality, and time-consuming processes involved in component disassembly. A potential solution emerges in stimuli-activated self-disassembly, offering a non-destructive pathway that encourages seamless human–machine collaboration. This innovative approach facilitates the simultaneous disassembly of multiple components, reducing damage, labor costs, and energy consumption. Notably, limited studies have addressed real-time disassembly planning (DP), especially within self-disassembling workstations. Our research aims to maximize disassembly profit and energy recovery by optimizing disassembly sequences, EOL options, and a hybrid scheme that combines manual and self-disassembly operations. We propose an advanced deep reinforcement learning (DRL) algorithm that incorporates an innovative loss function, a revised training scheme, and parameter embedding to generate the Pareto frontier. Additionally, we propose a compact product representation that captures dynamics and uncertainties, such as product type variations, missing components, potential disassembly failure, and stochastic product quality. The effectiveness of our approach is demonstrated through a case study involving a TV disassembly line, benchmarked against six baselines. Furthermore, a sensitivity analysis is conducted to elucidate the impact of labor expenses and hybrid disassembly schemes on the ultimate profit recovery.
KW - Activated self
KW - Deep reinforcement learning
KW - Disassembly
KW - Disassembly planning
KW - End
KW - Life management
KW - Multi
KW - Objective optimization
KW - Of
KW - Stimuli
UR - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85210511489&origin=inward
UR - https://www.scopus.com/inward/citedby.uri?partnerID=HzOxMe3b&scp=85210511489&origin=inward
U2 - 10.1007/s10845-024-02527-8
DO - 10.1007/s10845-024-02527-8
M3 - Article
SN - 0956-5515
VL - 36
SP - 5475
EP - 5494
JO - Journal of Intelligent Manufacturing
JF - Journal of Intelligent Manufacturing
IS - 8
ER -