Self-Supervised Multi-Agent Learning Algorithm for Automated Supply Chain Coordination and Disruption Recovery

Ergashev Bunyod Shokir ugli; Dr.H. Shaheen; Aakansha Soy; Dr.D. Muthusankar; Sanobar Kenjayeva; Kattakul Kinjaev

Authors

Ergashev Bunyod Shokir ugli, Turan International University, Namangan, Uzbekistan.
Dr.H. Shaheen, Course Leader & Sr Lecturer, Department of Computing and Engineering, University of West London, RAK Branch Campus, UAE.
Aakansha Soy, Assistant Professor, Kalinga University, Naya Raipur, Chhattisgarh, India.
Dr.D. Muthusankar, Associate Professor, Department of Computer Science and Engineering, K.S.Rangasamy College of Technology, Tiruchengode, India.
Sanobar Kenjayeva, Teacher, Jizzakh State Pedagogical University, Uzbekistan.
Kattakul Kinjaev, Lecturer, Department of Finance and Tourism, Termez University of Economics and Service, Termez, Uzbekistan.

Keywords:

Multi-Agent Reinforcement Learning, Self-Supervised Learning, Supply Chain Resilience

Abstract

Modern supply chains are sprawling, multi-tiered networks with many links between stages, which makes them susceptible to cascading failures caused by natural disasters, geopolitical shocks, demand variability, and supplier bankruptcies. Heuristic, rule-based, and single-agent reinforcement learning approaches struggle to model global supply chains because these systems are distributed, partially observable, and nonstationary. This paper introduces SSMASC (Self-Supervised Multi-Agent Supply Chain), a decentralized learning algorithm that allows heterogeneous autonomous agents (suppliers, manufacturers, distributors, and retailers) to coordinate without a centralized authority. SSMASC uses a two-phase methodology: contrastive self-supervised pre-training, which builds rich latent representations of supply chain states from unlabelled operational data, followed by a cooperative multi-agent reinforcement learning (MARL) phase with graph-attention-based communication. A novel mechanism, disruption-aware value decomposition with adaptive credit assignment, enables rapid recovery behavior even under partial observability. Experiments across three publicly available benchmark supply chain environments, including a novel 128-node global trade simulation, show that SSMASC achieves higher resilience scores, faster recovery times, and higher total supply chain profit than state-of-the-art baselines, with gains of up to 31.4% in resilience score, a reduction of up to 43.7% in average recovery time, and an increase of up to 22.1% in total profit. Ablation studies confirm that both the self-supervised pre-training and the graph-attention-based communication modules are critical components of SSMASC.
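
The abstract does not specify which contrastive objective the pre-training phase uses. As a rough illustration only, the sketch below computes an InfoNCE-style loss, one common contrastive objective, over toy supply chain state vectors. Everything in it is an assumption made for illustration and is not taken from the paper: the feature layout (inventory, backlog, lead time), the Gaussian-jitter augmentation, the temperature value, the function names, and the use of raw features in place of a learned encoder.

```python
import math
import random


def l2_normalize(vec):
    """Scale a vector to unit length (all-zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def info_nce_loss(anchors, positives, temperature=0.1):
    """Mean InfoNCE loss over a batch of (anchor, positive) embedding pairs.

    For anchor i, positives[i] is its positive; every other positives[j]
    in the batch acts as a negative.
    """
    a = [l2_normalize(v) for v in anchors]
    p = [l2_normalize(v) for v in positives]
    total = 0.0
    for i, ai in enumerate(a):
        # Cosine similarity of anchor i to every candidate, scaled by temperature.
        logits = [sum(x * y for x, y in zip(ai, pj)) / temperature for pj in p]
        # Numerically stable log-sum-exp for the softmax denominator.
        m = max(logits)
        log_denom = m + math.log(sum(math.exp(z - m) for z in logits))
        total += log_denom - logits[i]
    return total / len(a)


def augment(state, rng, noise=0.01):
    """Hypothetical augmentation: small Gaussian jitter on each feature."""
    return [x + rng.gauss(0.0, noise) for x in state]


rng = random.Random(0)
# Toy states for 4 nodes; assumed features: inventory, backlog, lead time.
states = [[10.0, 0.0, 2.0], [0.0, 8.0, 5.0], [3.0, 3.0, 1.0], [7.0, 1.0, 9.0]]
view_a = [augment(s, rng) for s in states]
view_b = [augment(s, rng) for s in states]

# Two views of the same state are paired correctly in the first call and
# deliberately mismatched (rotated by one position) in the second.
aligned = info_nce_loss(view_a, view_b)
shuffled = info_nce_loss(view_a, view_b[1:] + view_b[:1])
print(f"aligned pairs: {aligned:.3f}, mismatched pairs: {shuffled:.3f}")
```

In a real pipeline the vectors passed to the loss would come from a trainable encoder, and minimizing the loss would pull two views of the same operational state together while pushing different states apart. The comparison between correctly paired and mismatched views only shows that the loss responds to pairing as expected.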

Published

2026-05-12

How to Cite

ugli, E. B. S., Shaheen, D., Soy, A., Muthusankar, D., Kenjayeva, S., & Kinjaev, K. (2026). Self-Supervised Multi-Agent Learning Algorithm for Automated Supply Chain Coordination and Disruption Recovery. International Journal of Artificial Intelligence and Machine Learning, 6(2s), 194–203. Retrieved from https://svedbergopen.com/index.php/ijaiml/article/view/197

Issue

Vol. 6 No. 2s (2026): IJAIML_VOL.6_NO.2s 2026

Section

Articles

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Most read articles by the same author(s)

Abdusamiev Dilmurod Abdugani ugli, Dr.D. Muthusankar, E. Shalini, S. Suganya, Ramazon Xurramov, Lazizbek Burkhonov, Optimizing Education Pathways Using Hyperparameter Tuning in Neural Architecture Search (NAS), International Journal of Artificial Intelligence and Machine Learning: Vol. 6 No. 1s (2026): IJAIML_VOL.6_NO.1s 2026
Dr.S. Murali, K. Neppolian, Dr. R. Chithra, Dr. Sanjay Kumar, Bozormurod Abduvakhitov, Kattakul Kinjaev, Federated Meta-Learning Algorithm for Cross-Enterprise Collaborative Business Intelligence Without Data Sharing, International Journal of Artificial Intelligence and Machine Learning: Vol. 6 No. 2s (2026): IJAIML_VOL.6_NO.2s 2026
Ergashev Rasulbek Sokhib ugli, Krishnamurthy Kumar, Dr.C. Nallusamy, Roohee Khan, Yanglish Kosimova, Kattakul Kinjaev, Multi-Objective Evolutionary Optimization Algorithm for Profit-Maximization and Risk Minimization in Business Operations, International Journal of Artificial Intelligence and Machine Learning: Vol. 6 No. 2s (2026): IJAIML_VOL.6_NO.2s 2026
Kosimov Khusniddin Badriddinovich, K. Karthik, Dr.M. Sasikumar, Ashu Nayak, Gulnoz Shertaylakova, Kattakul Kinjaev, Explainable Reinforcement Learning Algorithm for Transparent Human-Centric Business Decision Support Systems, International Journal of Artificial Intelligence and Machine Learning: Vol. 6 No. 2s (2026): IJAIML_VOL.6_NO.2s 2026
Yusufjanov Ulugbek Javlon ugli, Mani Raja Kumar, Dr.K. Poongodi, Dr. Priya Vij, Zukhra Akramova, Kattakul Kinjaev, Hypergraph Neural Network Algorithm for Complex Market Relationship Modeling and Consumer Behavior Prediction, International Journal of Artificial Intelligence and Machine Learning: Vol. 6 No. 2s (2026): IJAIML_VOL.6_NO.2s 2026
Dr.T. Senthil Prakash, Dr.R. Udayakumar, Dr. Megala Rajendran, Dr.H. Shaheen, Dr.T. Abirami, Siyovush Boboyev, Hardware Conscious Architecture Search Algorithms for Specialized AI Accelerators, International Journal of Artificial Intelligence and Machine Learning: Vol. 6 No. 2s (2026): IJAIML_VOL.6_NO.2s 2026
