Skip to main navigation Skip to search Skip to main content

Deep reinforcement learning-based approach for dynamic disassembly scheduling of end-of-life products with stimuli-activated self-disassembly

  • University of Illinois at Chicago

Research output: Contribution to journalArticlepeer-review

27 Scopus citations

Abstract

Remanufacturing is one of the most critical strategies for end-of-life product management to promote a circular economy; however, it has been seen very limited implementation due to the labor-intensive and time-consuming disassembly processes for component retrieval. The newly emerged 4D printing technology enables the fabrication of stimuli-responsive reconfigurable structures, outlining new ways to achieve non-destructive and simultaneous self-disassembly of components with different geometry. However, large uncertainties and increased process dynamics have also emerged directly pertaining to the real-time scheduling in disassembly lines with self-disassembly workstations, which the existing scheduling methods are not equipped to handle. In this study, a constrained multi-agent deep reinforcement learning approach is proposed to maximize the disassembly profit by dynamically changing the batch mixing ratios of different-sized components in self-disassembly workstations and adapting real-time scheduling to stochastic product quality, changes in operational sequences, and self-disassembly failures. The proposed approach is validated on a disassembly line for hand pulse detectors that contain heat-activated self-disassembly components. Numerical results show that the proposed achieves stable convergence under uncertainties, and the implementation of a dynamic batch mixing scheme in self-disassembly operations yields a substantial improvement in disassembly profit over the scheduling period. In addition, sensitivity analyses are conducted to evaluate the impacts of system uncertainties on the profitability of the disassembly line.
Original languageEnglish
Article number138758
JournalJournal of Cleaner Production
Volume423
DOIs
StatePublished - Oct 15 2023

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy
  2. SDG 8 - Decent Work and Economic Growth
    SDG 8 Decent Work and Economic Growth
  3. SDG 12 - Responsible Consumption and Production
    SDG 12 Responsible Consumption and Production

Keywords

  • Dynamic scheduling
  • End-of-life management
  • Multi-agent deep reinforcement learning
  • Stimuli-activated self-disassembly

Fingerprint

Dive into the research topics of 'Deep reinforcement learning-based approach for dynamic disassembly scheduling of end-of-life products with stimuli-activated self-disassembly'. Together they form a unique fingerprint.

Cite this