- What does the big E (expected value..?) in bellman's equation really mean? https://spinningup.openai.com/en/latest/algorithms/ddpg.html#the-q-learning-side-of-ddpg 14 comments reinforcementlearning
Linking pages
- Which Reinforcement learning-RL algorithm to use where, when and in what scenario? | by Ujwal Tewari | DataDrivenInvestor https://medium.com/datadriveninvestor/which-reinforcement-learning-rl-algorithm-to-use-where-when-and-in-what-scenario-e3e7617fb0b1?amp%3Bsk=ab3658c27431dafc50a276a8b166ba1d&source=friends_link 19 comments
- GitHub - giorgi-o/trackmania-neural-network: A Trackmania (2020) agent trained using either DQN or DDPG. https://github.com/giorgi-o/trackmania-neural-network 0 comments
Related searches:
Search whole site: site:spinningup.openai.com
Search title: Deep Deterministic Policy Gradient — Spinning Up documentation
See how to search.