Deep reinforcement learning-based multi-agent cooperative communication and task scheduling for smart manufacturing

Fan Zijing; Guo Yinzhang

doi:10.13232/j.cnki.jnju.2025.04.005

Journal of Nanjing University (Natural Sciences), Vol. 61, Issue 4, 583 (2025)


Author Affiliations

Swarm Intelligence and Cloud Computing Laboratory, Taiyuan University of Science and Technology, Taiyuan, 030024, China


References(29)

[1] LeCun Y, Bengio Y, Hinton G. Deep learning. Nature, 2015, 521: 436-444.

[2] Mnih V, Kavukcuoglu K, Silver D, et al. Human-level control through deep reinforcement learning. Nature, 2015, 518(7540): 529-533.

[3] Lowe R, Wu Y, Tamar A, et al. Multi-agent actor-critic for mixed cooperative-competitive environments. In: Proceedings of the 31st International Conference on Neural Information Processing Systems. Long Beach, CA, USA: Curran Associates Inc., 2017: 6382-6393.

[4] Tai L, Paolo G, Liu M. Virtual-to-real deep reinforcement learning: Continuous control of mobile robots for mapless navigation. In: 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems. Vancouver, Canada: IEEE, 2017: 31-36.

[5] Russo L, Terlizzi M, Tipaldi M, et al. A reinforcement learning approach for pedestrian collision avoidance and trajectory tracking in autonomous driving systems. In: 2021 5th International Conference on Control and Fault-Tolerant Systems. Saint-Raphael, France: IEEE, 2021: 44-49.

[6] Zhu C X, Dastani M, Wang S H. A survey of multi-agent deep reinforcement learning with communication. In: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems. Richland, SC, USA: AAMS, 2024: 1500-1525.

[7] Wu X Q, Yan X F, Guan D H, et al. A deep reinforcement learning model for dynamic job-shop scheduling problem with uncertain processing time. Engineering Applications of Artificial Intelligence, 2024, 131: 107790.

[8] Zhang L X, Yan Y, Yang C, et al. Dynamic flexible job-shop scheduling by multi-agent reinforcement learning with reward-shaping. Advanced Engineering Informatics, 2024, 62(Part C): 102872.

[9] Sheng J J, Wang X F, Jin B, et al. Learning structural communication for multi-agent reinforcement learning. In: Proceedings of 2023 International Conference on Autonomous Agents and Multiagent Systems. Richland, SC, USA: AAMS, 2023: 436-438.

[10] Foerster J N, Assael Y M, Freitas N D, et al. Learning to communicate with deep multi-agent reinforcement learning. In: Proceedings of the 30th International Conference on Neural Information Processing Systems. Barcelona, Spain: NIPS, 2016: 2145-2153.

[11] Kong X Y, Xin B, Liu F C, et al. Revisiting the master–slave architecture in multi-agent deep reinforcement learning. 2017, arXiv:1712.07305.

[12] Sukhbaatar S, Szlam A, Fergus R. Learning multi-agent communication with backpropagation. In: Proceedings of the 30th International Conference on Neural Information Processing Systems. Barcelona, Spain: NIPS, 2016: 2252-2260.

[13] Singh A, Jain T, Sukhbaatar S. Learning when to communicate at scale in multiagent cooperative and competitive tasks. 2018, arXiv:1812.09755.

[14] Jiang J C, Lu Z Q. Learning attentional communication for multi-agent cooperation. In: Proceedings of the 32nd International Conference on Neural Information Processing Systems. Montréal, Canada: Curran Associates Inc., 2018: 7265-7275.

[15] Jiang J C, Dun C, Huang T J, et al. Graph convolutional reinforcement learning. 2018, arXiv:1810.09202.

[16] Luo S, Zhang L X, Fan Y S. Real-time scheduling for dynamic partial-no-wait multiobjective flexible job shop by deep reinforcement learning. IEEE Transactions on Automation Science and Engineering, 2022, 19(4): 3020-3038.

[17] Liu R K, Piplani R, Toro C. A deep multi-agent reinforcement learning approach to solve dynamic job shop scheduling problem. Computers and Operations Research, 2023, 159: 106294.

[18] Leng J W, Sha W N, Lin Z S, et al. Blockchained smart contract pyramid-driven multi-agent autonomous process control for resilient individualised manufacturing towards. International Journal of Production Research, 2022, 61(13): 4302-4321.

[19] Hu G Z, Zhu Y H, Zhao D B, et al. Event-triggered communication network with limited-bandwidth constraint for multi-agent reinforcement learning. IEEE Transactions on Neural Networks and Learning Systems, 2023, 34(8): 3966-3978.

[20] Zhang H, Yu H, Wang X M, et al. Knowledge-guided communication preference learning model for multi-agent cooperation. Information Sciences, 2024, 667: 120395.

[21] Ma Z H, Tang Z, Feng J W, et al. Distributed formation containment control for multi-agent systems via dynamic event-triggering communication mechanism. Applied Mathematics and Computation, 2024, 482: 128958.

[22] Zhang Y, Zhu H H, Tang D B, et al. Dynamic job shop scheduling based on deep reinforcement learning for multi-agent manufacturing systems. Robotics and Computer-Integrated Manufacturing, 2022, 78: 102412.

[23] Zhang H, Wang W H, Zhang S S, et al. A novel method based on deep reinforcement learning for machining process route planning. Robotics and Computer-Integrated Manufacturing, 2024, 86: 102688.

[24] Rao Z H, Xu Y Y, Yao Y, et al. DAR-DRL: A dynamic adaptive routing method based on deep reinforcement learning. Computer Communications, 2024, 228: 107983.

[25] Serrano-Ruiz J C, Mula J, Poler R. Job shop smart manufacturing scheduling by deep reinforcement learning. Journal of Industrial Information Integration, 2024, 38: 100582.

[26] Zhang H, Zhang X H, Feng Z, et al. Heterogeneous multi-robot cooperation with asynchronous multi-agent reinforcement learning. IEEE Robotics and Automation Letters, 2024, 9(1): 159-166.

[27] Wang R Q, Wang G, Sun J, et al. Flexible job shop scheduling via dual attention network-based reinforcement learning. IEEE Transactions on Neural Networks and Learning Systems, 2024, 35(3): 3091-3102.

[28] Sun Z, Yu Z W, Guo B, et al. Integrated sensing and communication for effective multi-agent cooperation systems. IEEE Communications Magazine, 2024, 62(9): 68-73.

[29] Li C X, Zheng P, Yin Y, et al. Deep reinforcement learning in smart manufacturing: A review and prospects. CIRP Journal of Manufacturing Science and Technology, 2023, 40: 75-101.


Fan Zijing, Guo Yinzhang. Deep reinforcement learning-based multi-agent cooperative communication and task scheduling for smart manufacturing[J]. Journal of Nanjing University (Natural Sciences), 2025, 61(4): 583


Paper Information

Received: May 29, 2025

Accepted: Aug. 22, 2025

Published Online: Aug. 22, 2025

Author email: Guo Yinzhang (guoyinzhang@tyust.edu.cn)

DOI:10.13232/j.cnki.jnju.2025.04.005
