discu
Newsletters
Mentions
Extension
Pricing
Login
Sign Up
Reddit
Why does MADDPG use action log prob for Q (Critic) instead of sampled action?
https://arxiv.org/pdf/1706.02275.pdf
4 comments
4/2/2022
reinforcementlearning