Journal of Nanjing University(Natural Sciences), Volume. 61, Issue 4, 583(2025)

Deep reinforcement learning⁃based multi⁃agent cooperative communication and task scheduling for smart manufacturing

Fan Zijing and Guo Yinzhang*
Author Affiliations
  • Swarm Intelligence and Cloud Computing Laboratory, Taiyuan University of Science and Technology, Taiyuan, 030024, China
  • show less
    References(29)

    [1] [1] Lecun Y, Bengio Y, Hinton G. Deep learning. Nature, 2015, 521:436-444.

    [2] [2] Mnih V, Kavukcuoglu K, Silver D, et al. Human‐level control through deep reinforcement learning. Nature, 2015, 518(7540):529-533.

    [3] [3] Lowe R, Wu Y, Tamar A, et al. Multi‐agent actor‐critic for mixed cooperative‐competitive environ‐ments∥Proceedings of the 31st International Conference on Neural Information Processing Systems. Long Beach, CA, USA:Curran Associates Inc., 2017:6382-6393.

    [4] [4] Tai L, Paolo G, Liu M. Virtual‐to‐real deep reinforcement learning:Continuous control of Mobile robots for mapless navigation∥2017 IEEE/RSJ International Conference on Intelligent Robots and Systems. Vancouver, Canada:IEEE, 2017:31-36.

    [5] [5] Russo L, Terlizzi M, Tipaldi M, et al. A reinforcement learning approach for pedestrian collision avoidance and trajectory tracking in autonomous driving systems∥2021 5th International Conference on Control and Fault:Tolerant Systems. Saint‐Raphael, France:IEEE, 2021:44-49.

    [6] [6] Zhu C X, Dastani M, Wang S H. A survey of multi‐agent deep reinforcement learning with communication∥Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems. Richland, SC, USA:AAMS, 2024:1500-1525.

    [7] [7] Wu X Q, Yan X F, Guan D H, et al. A deep reinforcement learning model for dynamic job‐shop scheduling problem with uncertain processing time. Engineering Applications of Artificial Intelligence, 2024, 131:107790.

    [8] [8] Zhang L X, Yan Y, Yang C, et al. Dynamic flexible job‐shop scheduling by multi‐agent reinforcement learning with reward‐shaping. Advanced Engineering Informatics, 2024, 62(Part C):102872.

    [9] [9] Sheng J J, Wang X F, Jin B, et al. Learning struc‐tural communication for multi‐agent reinforce‐ment learning∥Proceedings of 2023 International Conference on Autonomous Agents and Multiagent Systems. Richland, SC, USA:AAMS, 2023:436-438.

    [10] [10] Foerster J N, Assael Y M, Freitas N D, et al. Learning to communicate with deep multi‐agent reinforcement learning∥Proceedings of the 30th International Conference on Neural Information Processing Systems. Barcelona, Spain:NIPS, 2016:2145-2153.

    [11] [11] Kong X Y, Xin B, Liu F C, et al. Revisiting the master–slave architecture in multi‐agent deep rein‐forcement learning. 2017, arXiv:1712.07305.

    [12] [12] Sukhbaatar S, Szlam A, Fergus R. Learning multi‐agent communication with backpropagation∥Proceedings of the 30th International Conference on Neural Information Processing Systems. Barcelona, Spain:NIPS, 2016:2252-2260.

    [13] [13] Singh A, Jain T, Sukhbaatar S. Learning when to communicate at scale in multiagent cooperative and competitive tasks. 2018, arXiv:1812.09755.

    [14] [14] Jiang J C, Lu Z Q. Learning attentional communication for multi‐agent cooperation. Barcelona, Spain:Proceedings of the 32nd Inter‐national Conference on Neural Information Processing Systems. Montral, Canada:Curran Associates Inc., 2018:7265-7275.

    [15] [15] Jiang J C, Dun C, Huang T J, et al. Graph convolutional reinforcement learning. 2018, arXiv:1810.09202.

    [16] [16] Luo S, Zhang L X, Fan Y S. Real‐Time scheduling for dynamic Partial‐No‐Wait multiobjective flexible job shop by deep reinforcement learning. IEEE Transactions on Automation Science and Engineering, 2022, 19(4):3020-3038.

    [17] [17] Liu R K, Piplani R, Toro C. A deep multi‐agent reinforcement learning approach to solve dynamic job shop scheduling problem. Computers and Operations Research, 2023, 159:106294.

    [18] [18] Leng J W, Sha W N, Lin Z S, et al. Blockchained smart contract pyramid‐driven multi‐agent auto‐nomous process control for resilient individualised manufacturing towards. International Journal of Production Research, 2022, 61(13):4302-4321.

    [19] [19] Hu G Z, Zhu Y H, Zhao D B, et al. Event‐Triggered communication network with limited‐bandwidth constraint for multi‐agent reinforcement learning. IEEE Transactions on Neural Networks and Learning Systems, 2023, 34(8):3966-3978.

    [20] [20] Zhang H, Yu H, Wang X M, et al. Knowledge‐guided communication preference learning model for multi‐agent cooperation. Information Sciences, 2024, 667:120395.

    [21] [21] Ma Z H, Tang Z, Feng J W, et al. Distributed formation containment control for multi‐agent systems via dynamic event‐triggering communi‐cation mechanism. Applied Mathematics and Computation, 2024, 482:128958.

    [22] [22] Zhang Y, Zhu H H, Tang D B, et al. Dynamic job shop scheduling based on deep reinforcement learning for multi‐agent manufacturing systems. Robotics and Computer‐Integrated Manufacturing, 2022, 78:102412.

    [23] [23] Zhang H, Wang W H, Zhang S S, et al. A novel method based on deep reinforcement learning for machining process route planning. Robotics and Computer‐Integrated Manufacturing, 2024, 86:102688.

    [24] [24] Rao Z H, Xu Y Y, Yao Y, et al. DAR‐DRL:A dynamic adaptive routing method based on deep reinforcement learning. Computer Communications, 2024, 228:107983.

    [25] [25] Serrano‐Ruiz J C, Mula J, Poler R. Job shop smart manufacturing scheduling by deep reinforcement learning. Journal of Industrial Information Integration, 2024, 38:100582.

    [26] [26] Zhang H, Zhang X H, Feng Z, et al. Heterogeneous Multi‐Robot cooperation with asynchronous multi‐agent reinforcement learning. IEEE Robotics and Automation Letters, 2024, 9(1):159-166.

    [27] [27] Wang R Q, Wang G, Sun J, et al. Flexible job shop scheduling via dual attention Network‐Based rein‐forcement learning. IEEE Transactions on Neural Networks and Learning Systems, 2024, 35(3):3091-3102.

    [28] [28] Sun Z, Yu Z W, Guo B, et al. Integrated sensing and communication for effective multi‐agent cooperation systems. IEEE Communications Magazine, 2024, 62(9):68-73.

    [29] [29] Li C X, Zheng P, Yin Y, et al. Deep reinforcement learning in smart manufacturing:A review and prospects. CIRP Journal of Manufacturing Science and Technology, 2023, 40:75-101.

    Tools

    Get Citation

    Copy Citation Text

    Fan Zijing, Guo Yinzhang. Deep reinforcement learning⁃based multi⁃agent cooperative communication and task scheduling for smart manufacturing[J]. Journal of Nanjing University(Natural Sciences), 2025, 61(4): 583

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Received: May. 29, 2025

    Accepted: Aug. 22, 2025

    Published Online: Aug. 22, 2025

    The Author Email: Guo Yinzhang (guoyinzhang@tyust.edu.cn)

    DOI:10.13232/j.cnki.jnju.2025.04.005

    Topics