Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

An end-to-end deep reinforcement learning method based on graph neural network for distributed job-shop scheduling problem

Published: 27 February 2024 Publication History

Abstract

Distributed Job-shop Scheduling Problem (DJSP) is a hotspot in industrial and academic fields due to its valuable application in the real-life productions. For DJSP, the available methods aways complete the job selection first and then search for an appropriate factory to assign the selected job, which means job selection and job assignment are done independently. This paper proposes an end-to-end Deep Reinforcement Learning (DRL) method to make the two decisions simultaneously. To capture the problem characteristics and realize the objective optimization, the Markov Decision Process (MDP) of DJSP is formulated. Specialised action space made up of operation-factory pairs is designed to achieve the simultaneous decision-making. A stitched disjunctive graph representation of DJSP is specially designed, and a Graph Neural Network (GNN) based feature extraction architecture is proposed to dig the state embedding during problem solving. A Proximal Policy Optimization (PPO) method is applied to train an action-selection policy. To further lead the agent to assign jobs to the factory with smaller makespan, a probability enhancement mechanism is designed. The experimental results on 240 test instances have shown that the proposed method outperforms 8 classical Priority Dispatching Rules (PDRs), 3 closely-related RL methods and 5 metaheuristics in terms of effectiveness, stability and generalization.

References

[1]
J. Błażewicz, E. Pesch, M. Sterna, The disjunctive graph machine representation of the job shop scheduling problem, European Journal of Operational Research 127 (2) (2000) 317–331,.
[2]
H.C. Chang, T.K. Liu, Optimisation of distributed manufacturing flexible job shop scheduling by using hybrid genetic algorithms, Journal of Intelligent Manufacturing 28 (8) (2017) 1973–1986,.
[3]
I. Chaouch, O.B. Driss, K. Ghedira, A novel dynamic assignment rule for the distributed job shop scheduling problem using a hybrid ant-based algorithm, Applied Intelligence 49 (5) (2019) 1903–1924,.
[4]
L. De Giovanni, F. Pezzella, An improved genetic algorithm for the distributed and flexible job-shop scheduling problem, European Journal of Operational Research 200 (2) (2010) 395–408,.
[5]
Y. Du, J.Q. Li, C. Luo, L.L. Meng, A hybrid estimation of distribution algorithm for distributed flexible job shop scheduling with crane transportations, Swarm and Evolutionary Computation 62 (2021),.
[6]
Y.C. Fonseca-Reyna, Y. Martínez-Jiménez, A. Nowé, Q-learning algorithm performance for m-machine, n-jobs flow shop scheduling problems to minimize makespan, Investigación Operacional 38 (3) (2018) 3.
[7]
J.A. Palombarini, E.C. Martinez, End-to-end on-line rescheduling from Gantt chart images using deep reinforcement learning, International Journal of Production Research 60 (14) (2022) 4434–4463,.
[8]
Y.M. Gu, M. Chen, L. Wang, A self-learning discrete salp swarm algorithm based on deep reinforcement learning for dynamic job shop scheduling problem, Applied Intelligence 53 (2023) 18925–18958,.
[9]
W.X. Guo, W.H. Tian, Y.F. Ye, L.X. Xu, K. Wu, Cloud resource scheduling with deep reinforcement learning and imitation learning, IEEE Internet of Things Journal 8 (5) (2021) 3576–3586,.
[10]
B.A. Han, J.J. Yang, Research on adaptive job shop scheduling problems based on dueling double dqn, IEEE Access 8 (2020) 186474–186495,.
[11]
B.A. Han, J.J. Yang, A deep reinforcement learning based solution for flexible job shop scheduling problem, International Journal of Simulation Modelling 20 (2) (2021) 375–386,.
[12]
J.P. Huang, Q.K. Pan, Z.H. Miao, L. Gao, Effective constructive heuristics and discrete bee colony optimization for distributed flowshop with setup times, Engineering Applications of Artificial Intelligence 97 (2021),.
[13]
Y.Y. Huang, Q.K. Pan, J.P. Huang, P. Suganthan, L. Gao, An improved iterated greedy algorithm for the distributed assembly permutation flowshop scheduling problem, Computers & Industrial Engineering 152 (2021),.
[14]
H.Z. Jia, A.Y.C. Nee, J.Y.H. Fuh, Y.F. Zhang, A modified genetic algorithm for distributed scheduling problems, Journal of Intelligent Manufacturing 14 (3) (2003) 351–362.
[15]
E. da Jiang, L. Wang, Peng, Z. Ping, Solving energy-efficient distributed job shop scheduling via multi-objective evolutionary algorithm with decomposition, Swarm and Evolutionary Computation 58 (2020).
[16]
X.L. Jing, Q.K. Pan, L. Gao, Y.L. Wang, An effective Iterated Greedy algorithm for the distributed permutation flowshop scheduling with due windows, Applied Soft Computing 96 (2020).
[17]
Z.Q. Kong, X.H. Jin, Z.G. Xu, B. Zhang, Spatio-temporal fusion attention: A novel approach for remaining useful life prediction based on graph neural network, IEEE Transactions on Instrumentation and Measurement 71 (2022) 3515912.
[18]
K. Lei, P. Guo, W.C. Zhao, Y. Wang, L.M. Qian, X.Y. Meng, L.S. Tang, A multi-action deep reinforcement learning framework for flexible Job-shop scheduling problem, Expert Systems with Applications 205 (2022).
[19]
B. Lin, B. Ghaddar, J. Nathwani, Deep reinforcement learning for the electric vehicle routing problem with time windows, IEEE Transactions on Intelligent Transportation Systems 23 (8) (2022) 11528–11538.
[20]
R. Liu, R. Piplani, C. Toro, Deep reinforcement learning for dynamic scheduling of a flexible job shop, International Journal of Production Research (2022) 1–21.
[21]
Q. Luo, Q.W. Deng, G.L. Gong, L.K. Zhang, W.W. Han, K.X. Li, An efficient memetic algorithm for distributed flexible job shop scheduling problem with transfers, Expert Systems with Applications 160 (2020).
[22]
S. Luo, L.X. Zhang, Y.S. Fan, Real-time scheduling for dynamic partial-no-wait multiobjective flexible job shop by deep reinforcement learning, IEEE Transactions on Automation Science and Engineering (2021) 1–19.
[23]
L.L. Meng, C.Y. Zhang, Y.P. Ren, B. Zhang, C. Lv, Mixed-integer linear programming and constraint programming formulations for solving distributed flexible job shop scheduling problem, Computers & Industrial Engineering 142 (2020).
[24]
T. Meng, Q.K. Pan, L. Wang, A distributed permutation flowshop scheduling problem with the customer order constraint, Knowledge-Based Systems 184 (2019).
[25]
B. Naderi, A. Azab, Modeling and heuristics for scheduling of distributed job shops, Expert Systems with Applications 41 (17) (2014) 7754–7763.
[26]
B. Naderi, A. Azab, An improved model and novel simulated annealing for distributed job shop problems, The International Journal of Advanced Manufacturing Technology 81 (1) (2015) 693–703.
[27]
B. Naderi, R. Ruiz, The distributed permutation flowshop scheduling problem, Computers & Operations Research 37 (4) (2010) 754–768.
[28]
C.E. Okwudire, H.V. Madhyastha, Distributed manufacturing for and by the masses, Science 372 (2021) 341–342.
[29]
Q.K. Pan, L. Gao, L. Wang, An effective cooperative co-evolutionary algorithm for distributed flowshop group scheduling problems, IEEE Transactions on Cybernetics 52 (7) (2022) 5999–6012.
[30]
Q.K. Pan, L. Gao, L. Wang, J. Liang, X.Y. Li, Effective heuristics and metaheuristics to minimize total flowtime for the distributed permutation flowshop problem, Expert Systems with Applications 124 (2019) 309–324.
[31]
Z.X. Pan, L. Wang, J.J. Wang, J.W. Lu, Deep reinforcement learning based optimization algorithm for permutation flow-shop scheduling, TEEE Transactions on Emerging Topics in Computational Intelligence 7 (4) (2021) 983–994.
[32]
J.H. Ruan, Z.X. Wang, F.T.S. Chan, S. Patnaik, M.K. Tiwari, A reinforcement learning-based algorithm for the aircraft maintenance routing problem, Expert Systems with Applications 169 (2021).
[33]
R. Ruiz, Q.K. Pan, B. Naderi, Iterated greedy methods for the distributed permutation flowshop scheduling problem, Omega 83 (2019) 213–222.
[34]
I. Sabuncuoglu, Y. Karpat, Process planning and scheduling for distributed manufacturing, International Journal of Production Research 47 (4) (2009) 1151–1152.
[35]
M.A. Şahman, A discrete spotted hyena optimizer for solving distributed job shop scheduling problems, Applied Soft Computing 106 (2021).
[36]
Schulman, J., Wolski, F., Dhariwal, P., Radford, A., & Klimov, O. (2017). Proximal policy optimization algorithms (arXiv:1707.06347). arXiv.
[37]
V. Sels, N. Gheysen, M. Vanhoucke, A comparison of priority rules for the job shop scheduling problem under different flow time- and tardiness-related objective functions, International Journal of Production Research 50 (15) (2012) 4255–4270.
[38]
D. Wang, H. Deng, Multirobot coordination with deep reinforcement learning in complex environments, Expert Systems with Applications 180 (2021).
[39]
J.J. Wang, L. Wang, A knowledge-based cooperative algorithm for energy-efficient scheduling of distributed flow-shop, IEEE Transactions on Systems, Man, and Cybernetics: Systems 50 (5) (2020) 1805–1819.
[40]
Z.Y. Wang, M. Eisen, A. Ribeiro, Learning decentralized wireless resource allocations with graph neural networks, IEEE Transactions on Signal Processing 70 (2022) 1850–1863.
[41]
Z.Y. Wang, M. Gombolay, Learning scheduling policies for multi-robot coordination with graph attention networks, IEEE Robotics and Automation Letters 5 (3) (2020) 4509–4516.
[42]
Xu, K., Hu, W. H., Leskovec, J., & Jegelka, S. (2019). How powerful are graph neural networks? (arXiv:1810.00826). arXiv.
[43]
X. Yu, W. Luo, Reinforcement learning-based multi-strategy cuckoo search algorithm for 3D UAV path planning, Expert Systems with Applications 223 (2023).
[44]
C. Zhang, W. Song, Z.G. Cao, J. Zhang, P.S. Tan, X. Chi, Learning to dispatch for job shop scheduling via deep reinforcement learning, Advances in Neural Information Processing Systems 33 (2020) 1621–1632.
[45]
G.H. Zhang, X.X. Lu, X. Liu, L.T. Zhang, S.W. Wei, W.Q. Zhang, An effective two-stage algorithm based on convolutional neural network for the bi-objective flexible job shop scheduling problem with machine breakdown, Expert Systems with Applications 203 (2022).
[46]
J.D. Zhang, Z.X. He, W.H. Chan, C.Y. Chow, DeepMGA: Deep reinforcement learning with multi-agent graphs for flexible job shop scheduling, Knowledge-Based Systems 259 (2023).
[47]
Y. Zhang, H. Zhu, D. Tang, T. Zhou, Y. Gui, Dynamic job shop scheduling based on deep reinforcement learning for multi-agent manufacturing systems, Robotics and Computer-Integrated Manufacturing 78 (2022).
[48]
Z. Zhang, H. Liu, M.C. Zhou, J.H. Wang, Solving dynamic traveling salesman problems with deep reinforcement learning, IEEE Transactions on Neural Networks and Learning Systems (2021) 1–14.
[49]
F.Q. Zhao, S.L. Di, L. Wang, A hyperheuristic with Q-learning for the multiobjective energy-efficient distributed blocking flow shop scheduling problem, IEEE Transactions on Cybernetics 53 (5) (2023) 3337–3350.
[50]
F.Q. Zhao, Z.Y. Wang, L. Wang, A reinforcement learning driven artificial bee colony algorithm for distributed heterogeneous no-wait flowshop scheduling problem with sequence-dependent setup times, IEEE Transactions on Automation Science and Engineering (2022) 1–16.
[51]
F.Q. Zhao, H. Zhang, L. Wang, A Pareto-based discrete Jaya algorithm for multiobjective carbon-efficient distributed blocking flow shop scheduling problem, IEEE Transactions on Industrial Informatics 19 (8) (2023) 8588–8599.
[52]
L.L. Zhao, J.X. Fan, C.J. Zhang, W.M. Shen, J. Zhuang, A DRL-based reactive scheduling policy for flexible job shops with random job arrivals, IEEE Transactions on Automation Science and Engineering (2023) 1–12.
[53]
F.Q. Zhao, L.X. Zhang, J. Cao, J.X. Tang, A cooperative water wave optimization algorithm with reinforcement learning for the distributed assembly no-idle flowshop scheduling problem, Computers & Industrial Engineering 153 (2021).

Cited By

View all
  • (2024)Learning to schedule dynamic distributed reconfigurable workshops using expected deep Q-networkAdvanced Engineering Informatics10.1016/j.aei.2023.10230759:COnline publication date: 1-Jan-2024

Index Terms

  1. An end-to-end deep reinforcement learning method based on graph neural network for distributed job-shop scheduling problem
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image Expert Systems with Applications: An International Journal
          Expert Systems with Applications: An International Journal  Volume 238, Issue PA
          Mar 2024
          1584 pages

          Publisher

          Pergamon Press, Inc.

          United States

          Publication History

          Published: 27 February 2024

          Author Tags

          1. Distributed job shop
          2. Scheduling
          3. Distributed scheduling
          4. Reinforcement learning

          Qualifiers

          • Research-article

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)0
          • Downloads (Last 6 weeks)0
          Reflects downloads up to 16 Jan 2025

          Other Metrics

          Citations

          Cited By

          View all
          • (2024)Learning to schedule dynamic distributed reconfigurable workshops using expected deep Q-networkAdvanced Engineering Informatics10.1016/j.aei.2023.10230759:COnline publication date: 1-Jan-2024

          View Options

          View options

          Media

          Figures

          Other

          Tables

          Share

          Share

          Share this Publication link

          Share on social media