discu
Newsletters
Mentions
Extension
Pricing
Login
Sign Up
Reddit
Understanding why there isn't a log probability in TRPO and PPO's objective
https://costa.sh/blog-understanding-why-there-isn't-a-log-probability-in-trpo-and-ppo's-objective.html
10 comments
12/4/2020
reinforcementlearning