Game-Theoretic Multi-Agent Reinforcement Learning for Economic Resource Allocation Optimization

Lin Wang; Qizhi Pan

doi:10.31449/inf.v49i22.8426

Contact Editors Europe, Africa:
Matjaz Gams
N. and S. America:
Karthick Gunasekaran
Asia, Australia:
Vinay Singh
Overview papers:
Maria Ganzha
Wiesław Pawlowski
Aleksander Denisiuk Abstacting / Indexing

Informatica is surveyed by:

ACM Digital Library
Citeseer
COBISS
Compendex
Computer & Information Systems Abstracts
Computer Database
Computer Science Index
dLib.si
DBLP Computer Science Bibliography
Directory of Open Access Journals
Google Scholar
InfoTrac OneFile
Inspec
Linguistic and Language Behaviour Abstracts
Mathematical Reviews, MatSciNet, MatSci on SilverPlatter and Current Mathematical Publications
Scopus Publishing

Informatica is published by:

Support

Informatica is supported by:

ACM Slovenia
Slovenian Society for Pattern Recognition
Slovenian Artificial Intelligence Society
Slovenian Society for Cognitive Science
Slovenian Society of Mathematicians, Physicists and Astronomers
Automatic Control Society of Slovenia
Slovenian Academy of Engineering
International Federation for Information Processing

Journal Help

User

Journal Content Search
Browse

Information

Notifications

About The Authors

Lin Wang

Qizhi Pan

China

Support & Indexing

Game-Theoretic Multi-Agent Reinforcement Learning for Economic Resource Allocation Optimization

Lin Wang, Qizhi Pan

Abstract

This paper presents a novel framework for optimizing economic resource allocation by integrating computational game theory with multi-agent reinforcement learning (MARL), addressing the challenges of dynamic, multi-agent interactions in complex economic systems. The framework leverages game-theoretic equilibrium concepts, such as Nash Equilibrium, alongside policy gradient methods and best-response dynamics to enable scalable, efficient, and stable decision-making in high-dimensional environments. An end-to-end experimental pipeline, validated using real-world data from the World Bank Open Data repository, demonstrates the effectiveness of the proposed approach. Quantitative results show that the framework achieves an economic utility score of 92.5,(±3.2), outperforming baseline models including Single-Agent RL (78.3), Non-Cooperative Game Theory (85.1), and Centralized Optimization (88.7). It also reduces convergence time to 750,(±25) steps and improves fairness, with a Gini coefficient of 0.15,(±0.02), indicating balanced resource distribution. Compared to existing models, the proposed method delivers superior policy stability (0.01 ± 0.005) and faster adaptation. These results highlight the framework’s ability to discover equitable, high-utility resource allocations while maintaining long-term equilibrium, making it a powerful tool for applications in market competition, supply chain management, and public policy optimization.

Full Text:

PDF

DOI: https://doi.org/10.31449/inf.v49i22.8426

This work is licensed under a Creative Commons Attribution 3.0 License.

Informatica is financially supported by the Slovenian research agency from the Call for co-financing of scientific periodical publications.

Webmaster: Mario Konecki

Username
Password
Remember me