|
Frontiers of Information Technology & Electronic Engineering
ISSN 2095-9184 (print), ISSN 2095-9230 (online)
2025 Vol.26 No.4 P.479-509
Optimization methods in fully cooperative scenarios: a review of multiagent reinforcement learning
Abstract: Multiagent reinforcement learning (MARL) has become a dazzling new star in the field of reinforcement learning in recent years, demonstrating its immense potential across many application scenarios. The reward function directs agents to explore their environments and make optimal decisions within them by establishing evaluation criteria and feedback mechanisms. Concurrently, cooperative objectives at the macro level provide a trajectory for agents’ learning, ensuring alignment between individual behavioral strategies and the overarching system goals. The interplay between reward structures and cooperative objectives not only bolsters the effectiveness of individual agents but also fosters interagent collaboration, offering both momentum and direction for the development of swarm intelligence and the harmonious operation of multiagent systems. This review delves deeply into the methods for designing reward structures and optimizing cooperative objectives in MARL, along with the most recent scientific advancements in this field. The article meticulously reviews the application of simulation environments in cooperative scenarios and discusses future trends and potential research directions in the field, providing a forward-looking perspective and inspiration for subsequent research efforts.
Key words: Multiagent reinforcement learning (MARL); Cooperative framework; Reward function; Cooperative objective optimization
1北京联合大学北京市信息服务工程重点实验室,中国北京市,100101
2中国人民解放军32178部队科技创新研究中心,中国北京市,100012
摘要:近年来,多智能体强化学习已成为强化学习领域一颗耀眼的新星,展现了其在众多应用场景的巨大潜力。奖励函数通过建立评估标准和反馈机制,引导智能体在其环境中探索并做出最优决策。同时,宏观层面的协作目标为智能体的学习提供了轨迹,确保个体行为策略与整体系统目标的高度一致性。奖励结构与协作目标之间的相互作用,不仅增强了个体智能体的有效性,还促进了智能体之间的协作,为群体智能的发展和多智能体系统的和谐运行提供了动力和方向。本文深入探讨了多智能体强化学习中奖励结构的设计方法及协作目标的优化策略,详细审视了这些领域的最新科学进展。此外,对协作场景中的仿真环境应用进行了深入评述,讨论了该领域的未来发展趋势及潜在研究方向,为后续研究提供了前瞻视角与灵感。
关键词组:
References:
Open peer comments: Debate/Discuss/Question/Opinion
<1>
DOI:
10.1631/FITEE.2400259
CLC number:
TP181
Download Full Text:
Downloaded:
772
Download summary:
<Click Here>Downloaded:
92Clicked:
927
Cited:
0
On-line Access:
2025-05-06
Received:
2024-04-06
Revision Accepted:
2024-09-06
Crosschecked:
2025-05-06