- Does COMA work better than a simple Policy Gradient with a centralised critic? Why? https://arxiv.org/abs/1705.08926 4 comments reinforcementlearning
Linking pages
- Learning to Cooperate, Compete, and Communicate https://blog.openai.com/learning-to-cooperate-compete-and-communicate/?source=hn 36 comments
- Learning to Cooperate, Compete, and Communicate https://blog.openai.com/learning-to-cooperate-compete-and-communicate/ 1 comment
- GitHub - opendilab/DI-engine: OpenDILab Decision AI Engine https://github.com/opendilab/DI-engine 0 comments
- GitHub - oxwhirl/smac: SMAC: The StarCraft Multi-Agent Challenge https://github.com/oxwhirl/smac 0 comments
Related searches:
Search whole site: site:arxiv.org
Search title: [1705.08926] Counterfactual Multi-Agent Policy Gradients
See how to search.