Papers of Multi Agent Reinforcement Learning(MARL)
Papers in Multi-Agent Reinforcement Learning(MARL)
This is my paper lists about Multi-Agent Reinforcement Learning.
What makes this list outstanding?
There is introduction part(or called comment) based my understanding of the papers(if there is some objective mistakes, thanks a lot if you can tell me!).
There is score part to help you quickly find papers that may enlight and accelerate your learning.
-
PS:
- "Score" is range from 1 to 5.The higer score is, the more useful the paper is(i.e. 5 means the higest quanlity and useful to study).
- Note that the point is based on only my personal view.
Book and Reviews
Title | Introduction | Score |
---|---|---|
Reinforcement Learning: state of the art | A comprehensive review including POMDP and Bayesian RL | 5 |
POMDP solution methods | A concise and detailed introduction to POMDP | 4 |
A Concise Introduction to Decentralized POMPDs | A newbie-friendly and comprehensive book to dec-POMPDs | 4 |
A Comprehensive Survey of Multi-agent Reinforcement Learning | An top scope to MARL, inconlusive and comprehensive! | 5 |
Markov Decision Process in Artificial Intelligence and CS294-Sequential Decisions: Planning and Reinforcement Learning | Detailed MDP and beyond MDP | 4 |
Multi-agent Systems:Algorithmic, Game-Theoretic, and Logic Foundations | From the view of game theory, not deep reinforcement learning | 3 |
Deep Dec-POMDPs
Title | Introduction | Score |
---|---|---|
Multiagent Cooperation and Competition with Deep Reinforcement Learning | The first paper looks at MADRL after dqn? | 3 |
Deep Recurrent Q-Learning for Partially Observable MDPs | Dqn has problem: observation != state | 4 |
Cooperative Multi-Agent Control Using Deep Reinforcement Learning | 3 schemes extend DQN、DDPG、TRPO from sing-agent to multi-agent;code avaiable | 4 |
Value-Decomposition Networks for Cooperative Multi-Agent Learning | The first paper apply decomposition in MADRL | 4 |
QMIX: Monotonic Value Function Fatorisation for Deep Multi-agent Reinforcement Learning | Based VDN, more flexible to decomposition global Q | 4 |
Opponent Modeling
Title | Introduction | Score |
---|---|---|
Modeling Others using Oneself in Multi-agent Reinforcement Learning | Using opponent goal as addtional input | 3 |
Learning Policy Representations in Multi-agent Systems | Using policy representation to cluser, classify and RL(using opponent's embedding as addtional input) | 4 |
Communication
Title | Introduction | Score |
---|---|---|
Emergence of Grounded Compositional Language in Multi-Agent Populations | ||
Learning to Communicate with Deep Multi-Agent Reinforcement Learning | Communicate discrete action | 4 |
Learning Multiagent Communication with Backpropagation | Communicate hidden state | 3 |
相關文章
- Reinforcement Learning Basic Notes
- ARS Reinforcement Learning using Gymnasium
- Enhancing Diffusion Models with Reinforcement Learning
- Reinforcement Learning Chapter2APT
- Join Query Optimization with Deep Reinforcement Learning AlgorithmsGo
- [Active Learning] Multi-Criteria-based Active Learning
- Jan 2023-Prioritizing Samples in Reinforcement Learning with Reducible Loss
- RAG-Multi-Modal-Generative-AI-AgentAI
- Multi-Agent Oriented Programming PDF Download
- 吳恩達機器學習第三課 Unsupervised learning recommenders reinforcement learning吳恩達機器學習
- 文章學習29“Crafting a Toolchain for Image Restoration by Deep Reinforcement Learning”RaftAIREST
- 【Coursera GenAI with LLM】 Week 3 Reinforcement Learning from Human Feedback Class NotesAI
- 論文閱讀翻譯之Deep reinforcement learning from human preferences
- 強化學習(Reinforcement Learning)中的Q-Learning、DQN,面試看這篇就夠了!強化學習面試
- Multi-Patch Prediction Adapting LLMs for Time Series Representation LearningAPT
- 《Graph Representation Learning》【4】——Multi-relational Data and Knowledge Graphs
- 論文解讀(MLGCL)《Multi-Level Graph Contrastive Learning》GCAST
- 論文解讀(MVGRL)Contrastive Multi-View Representation Learning on GraphsASTView
- 機器學習:詳解多工學習(Multi-task learning)機器學習
- 到底選誰?五大多智慧體 ( Multi-AI Agent) 框架對比智慧體AI框架
- Visual Tracking Papers and Researchers
- 論文解讀(MERIT)《Multi-Scale Contrastive Siamese Networks for Self-Supervised Graph Representation Learning》AST
- 論文《Learning Properties of Ordered and Disordered Materials from Multi-fidelity Data》中的程式碼實現IDE
- Diffusion model papers, survey, and taxonomy
- 多模態學習之論文閱讀:《Multi-modal Learning with Missing Modality in Predicting Axillary Lymph Node Metastasis 》AST
- 千問AI agent qwan_agent使用AI
- langchain multi modal supportLangChain
- Angular 2 Multi ProvidersAngularIDE
- 智慧體Agent智慧體
- Java Agent(上)Java
- A curated list of Artificial Intelligence (AI) courses, books, video lectures and papersIntelAIIDE
- Call for Papers | IJCNN 2019 Special Section 徵稿通道開啟CNN
- multi-parent genetic algorithmsGo
- Pytorch MNIST Multi-layerPyTorch
- Multi Role的實現
- Multi-path handling for asmASM
- Imitation LearningMIT
- learning sequelize