MARL 論文
モデル
- Planning Problems for Sophisticated Agents with Present Bias(2016)
- Learning Multiagent Communication with Backpropagation(2016)
- Multiagent cooperation and competition with deep reinforcement learning(2017)
- Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning(2018)
- The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games(2021)
- RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning(2022)
- Multi-Agent Reinforcement Learning is a Sequence Modeling Problem(2023)
- HyperMARL: Adaptive Hypernetworks for Multi-Agent RL(2024)