Fact-checked by Grok 2 weeks ago
References
-
[1]
A survey on multi-agent reinforcement learning and its applicationMulti-agent reinforcement learning (MARL) has been a rapidly evolving field. This paper presents a comprehensive survey of MARL and its applications.
-
[2]
NoneSummary of each segment:
-
[3]
[PDF] Markov games as a framework for multi-agent reinforcement learningGiven this definition of optimality, Markov games have several important properties. Like MDP's, every Markov game has a non-empty set of optimal policies ...
-
[4]
[PDF] The Dynamics of Reinforcement Learning in Cooperative Multiagent ...Joint action learners (JALs), in contrast, learn the value of their own actions in conjunction with those of other agents via integration of RL with equilibrium ...
-
[5]
[PDF] Multi-Agent Reinforcement Learning: Independent vs. Cooperative ...The key investigations of this paper are, \Given the same number of reinforcement learning agents, will cooperative agents outperform independent agents who do ...Missing: seminal | Show results with:seminal
- [6]
-
[7]
Cooperative Multi-Agent Reinforcement Learning for Data Gathering ...This study introduces a novel approach to data gathering in energy-harvesting wireless sensor networks (EH-WSNs) utilizing cooperative multi-agent ...
-
[8]
[PDF] An Overview of Cooperative and Competitive Multiagent LearningZero-sum games are games where the rewards of the agents for each joint action sum to zero. General sum games allow for any sum of values for the reward of ...
-
[9]
[PDF] Nash Q-Learning for General-Sum Stochastic GamesLittman (1994) designed a Minimax-Q learning algorithm for zero-sum stochastic games. A con- vergence proof for that algorithm was provided subsequently by ...
-
[10]
[PDF] Algorithms for Sequential Decision Making - Brown CSThe algorithm is called minimax-Q because it is essentially identical to the standard. Q-learning algorithm with a minimax replacing the maximization. It is ...
-
[11]
[PDF] Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near ...Model-based MARL naturally decouples the learning and planning phases, and can be incorporated with any black-box planning algo- rithm that is efficient, e.g., ...<|control11|><|separator|>
-
[12]
Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games ...Oct 7, 2023 · Learning Nash equilibrium (NE) in complex zero-sum games with multi-agent reinforcement learning (MARL) can be extremely computationally ...
-
[13]
[PDF] Multi-agent learning in mixed-motive coordination problemsMar 8, 2021 · Two uncontroversial properties of a welfare function are Pareto-optimality (i.e., its optimizer should be Pareto-optimal) and symmetry (the.
-
[14]
[PDF] Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium ...A correlated equilibrium (CE), is a joint mixed strategy P(a) such that no player p has payoff to gain from unilaterally choosing to play another action ap ...
-
[15]
Google Research Football: A Novel Reinforcement Learning ... - arXivJul 25, 2019 · A new reinforcement learning environment where agents are trained to play football in an advanced, physics-based 3D simulator.
-
[16]
[2301.13812] Learning Roles with Emergent Social Value OrientationsJan 31, 2023 · The multi-agent reinforcement learning community has leveraged ideas from social science, such as social value orientations (SVO), to solve ...
-
[17]
Melting Pot: an evaluation suite for multi-agent reinforcement learningJul 14, 2021 · Melting Pot assesses generalization to novel social situations involving both familiar and unfamiliar individuals, and has been designed to test a broad range ...Missing: mixed- motive
-
[18]
[PDF] Melting Pot Contest: Charting the Future of Generalized Cooperative ...The class of mixed-motive problems also includes bargaining problems, in which players have differing preferences over Pareto-optimal agreements, and may fail ...
-
[19]
Dealing with Non-Stationarity in Multi-Agent Deep Reinforcement ...Jun 11, 2019 · This paper surveys recent works that address the non-stationarity problem in multi-agent deep reinforcement learning.
-
[20]
Deep multiagent reinforcement learning: challenges and directionsOct 19, 2022 · In a multiagent setting, nonstationarity makes learning more challenging as all agents update their policies simultaneously.<|separator|>
-
[21]
[PDF] Decentralized POMDPs - Frans A. OliehoekPeshkin et al (2000) introduced decentralized gradient ascent policy search (DGAPS), a method for MARL in partially observable settings based on gradient ...<|separator|>
-
[22]
Multi-agent Reinforcement Learning: A Comprehensive Survey - arXivJul 3, 2024 · Multi-agent reinforcement learning (MARL) is data-driven decision-making within multi-agent systems, where multiple agents make decisions in a ...
-
[23]
[PDF] Model-Based Opponent ModelingModel-based opponent modeling (MBOM) uses an environment model to adapt to all opponents by simulating their reasoning and imagining policy improvements.
-
[24]
[PDF] Learning to Model Opponent Learning - arXivJun 6, 2020 · We propose the use of a recurrent neural network (RNN) to model an opponent's learning process. By learning an update rule for the state of ...
-
[25]
[PDF] Multi-Agent Reinforcement Learning for smart mobility and traffic ...Sep 7, 2023 · This work investigates Multi-Agent Reinforcement Learning (MARL) to address autonomous vehicles' difficulty in handling traffic scenarios and ...
-
[26]
Credit Assignment with Meta-Policy Gradient for Multi-Agent ... - arXivFeb 24, 2021 · Reward decomposition is a critical problem in centralized training with decentralized execution~(CTDE) paradigm for multi-agent reinforcement ...
-
[27]
Value-Decomposition Networks For Cooperative Multi-Agent LearningJun 16, 2017 · We study the problem of cooperative multi-agent reinforcement learning with a single joint reward signal.
-
[28]
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent ...Mar 30, 2018 · Our solution is QMIX, a novel value-based method that can train decentralised policies in a centralised end-to-end fashion.Missing: decomposition | Show results with:decomposition
-
[29]
Multi-Agent Reinforcement Learning: A Review of Challenges and ...Below, we present an assortment of MARL algorithms that address the above-mentioned challenges of non-stationarity and scalability. We then address partially ...
-
[30]
NoneSummary of each segment:
-
[31]
[PDF] A comprehensive survey of multi-agent reinforcement learningThis paper surveys multi-agent reinforcement learning (MARL), where agents learn through trial-and-error, and focuses on stability and adaptation of agent ...
-
[32]
An Introduction to Centralized Training for Decentralized Execution ...Sep 4, 2024 · This text is an introduction to CTDE in cooperative MARL. It is meant to explain the setting, basic concepts, and common methods.
-
[33]
[1705.08926] Counterfactual Multi-Agent Policy Gradients - arXivMay 24, 2017 · Counterfactual multi-agent (COMA) policy gradients is a multi-agent actor-critic method using a centralized critic and decentralized actors, ...
-
[34]
[1902.04043] The StarCraft Multi-Agent Challenge - arXivFeb 11, 2019 · The StarCraft Multi-Agent Challenge (SMAC) is a benchmark for cooperative multi-agent learning based on StarCraft II, where each unit is ...
-
[35]
Multi-agent Reinforcement Learning in Sequential Social DilemmasFeb 10, 2017 · We analyze the dynamics of policies learned by multiple self-interested independent learning agents, each using its own deep Q-network, on two Markov games.
-
[36]
Multiagent reinforcement learning in the Iterated Prisoner's DilemmaThis paper is an empirical study of reinforcement learning in the Iterated Prisoner's Dilemma (IPD), where the agents' payoffs are neither totally positively ...
-
[37]
Prosocial learning agents solve generalized Stag Hunts better than ...Sep 8, 2017 · We ask how we can change the learning rule of a single agent to improve its outcomes in Stag Hunts that include other reactive learners.
-
[38]
Intrinsic fluctuations of reinforcement learning promote cooperationJan 24, 2023 · A famous example is the Tit-for-Tat strategy, in which you cooperate if your co-player cooperated, and you defect if your co-player defected in ...
-
[39]
Tackling Asymmetric and Circular Sequential Social Dilemmas with ...Jun 26, 2022 · Tackling Asymmetric and Circular Sequential Social Dilemmas with Reinforcement Learning and Graph-based Tit-for-Tat. Authors:Tangui Le Gléau, ...
-
[40]
Social Reward Shaping in the Prisoner's Dilemma - ResearchGateReward shaping is a well-known technique applied to help reinforcement-learning agents converge more quickly to near-optimal behavior.<|separator|>
-
[41]
[PDF] Learning Optimal “Pigovian Tax” in Sequential Social DilemmasJun 2, 2023 · We build a typical reward shaping mechanism to promote social welfare. Our proposed method is called Learning Optimal Pigovian Tax. (LOPT), ...
-
[42]
Evolutionary Multi-agent Reinforcement Learning in Group Social ...Nov 1, 2024 · This paper studies evolutionary multi-agent reinforcement learning in Public Goods Games, exploring the tragedy of the commons and free rider ...
-
[43]
[1909.07528] Emergent Tool Use From Multi-Agent AutocurriculaSep 17, 2019 · Global Survey · Computer Science > Machine Learning · Title:Emergent Tool Use From Multi-Agent Autocurricula · Bibliographic and Citation Tools.
-
[44]
[PDF] Diverse Auto-Curriculum is Critical for Successful Real ... - IFAAMASThe core thesis of this paper is that the development of learning frameworks that can induce behavioural diversity in the policy space is critical for. MARL to ...
-
[45]
[PDF] Emergent Cooperation and Deception in Public Good GamesMay 10, 2023 · Emergent Cooperation and Deception in Public Good Games. Nicole ... by the certain agents to deceive the others. • We also show that ...
-
[46]
A Comprehensive Review of Multi-Agent Reinforcement Learning in ...Sep 3, 2025 · This paper aims to provide a thorough examination of MARL's application from turn-based two-agent games to real-time multi-agent video games ...
-
[47]
[1902.00506] The Hanabi Challenge: A New Frontier for AI ResearchFeb 1, 2019 · Full-text links: Access Paper: View a PDF of the paper titled The Hanabi Challenge: A New Frontier for AI Research, by Nolan Bard and 14 other ...Missing: URL | Show results with:URL
-
[48]
Mastering the Game of No-Press Diplomacy via Human-Regularized ...Oct 11, 2022 · No-press Diplomacy is a complex strategy game involving both cooperation and competition that has served as a benchmark for multi-agent AI research.Missing: superhuman | Show results with:superhuman
-
[49]
[PDF] Distributed Influence-Augmented Local Simulators for ... - NIPS papersIn this paper, we extend the IBA solution to MARL and explain how to build a network of independent IALS such that we can train agents in parallel. One of the ...
-
[50]
Foraging Swarms using Multi-Agent Reinforcement LearningMARL policies were compared with · behavior determined by the three Boids rules and the basic · strategy of always moving in the opposite direction of the.
-
[51]
Understanding Domain Randomization for Sim-to-real Transfer - arXivOct 7, 2021 · In this paper, we propose a theoretical framework for sim-to-real transfers, in which the simulator is modeled as a set of MDPs with tunable parameters.Missing: MARL | Show results with:MARL
-
[52]
Domain Randomization for Sim2Real Transfer | Lil'LogMay 5, 2019 · Domain randomization optimization for task performance match real data distribution guided by data in simulator.DR as Optimization · DR as Meta-Learning · Optimization for Task...Missing: MARL | Show results with:MARL
-
[53]
MARVEL: Multi-Agent Reinforcement Learning for constrained field-of-View multi-robot Exploration in Large-scale environments### Summary of MARVEL Framework for Drone Swarms in Search-and-Rescue or Exploration
- [54]
-
[55]
[PDF] Boosting Sample Efficiency and Generalization in Multi-agent ...Multi-Agent Reinforcement Learning (MARL) struggles with sample inefficiency and poor generalization [1]. These challenges are partially due to a lack of ...
- [56]
-
[57]
Robust Multi-Agent Reinforcement Learning with State UncertaintyJul 30, 2023 · We study the problem of MARL with state uncertainty in this work. We provide the first attempt to the theoretical and empirical analysis of this challenging ...
-
[58]
(PDF) Social coordination perpetuates stereotypic expectations and ...Mar 4, 2025 · Social coordination perpetuates stereotypic expectations and behaviors across generations in deep multi-agent reinforcement learning. March ...
-
[59]
[PDF] Unequal Norms Emerge Under Coordination Uncertainty in Multi ...Using a multi-agent reinforcement learning approach, we explored the emergence of conventions under social inter- action uncertainty and its concurrent outcomes ...
- [60]
- [61]