[1806.07857] RUDDER: Return Decomposition for Delayed Rewards - discu.eu

Reddit

RUDDER -- Reinforcement Learning algorithm that is "exponentially faster than TD, MC, and MC Tree Search (MCTS)" https://arxiv.org/abs/1806.07857 5 comments 21/6/2018 reinforcementlearning

Linking pages

Challenges of real-world reinforcement learning | the morning paper https://blog.acolyer.org/2020/01/13/challenges-of-real-world-rl/ 5 comments

Related searches:

Search whole site: site:arxiv.org

Search title: [1806.07857] RUDDER: Return Decomposition for Delayed Rewards

See how to search.

Submit link to: