- Soft Actor Critic - Entropy? https://spinningup.openai.com/en/latest/algorithms/sac.html 3 comments reinforcementlearning
- Can polyak averaging neural networks lead to numerical instability? https://spinningup.openai.com/en/latest/algorithms/sac.html#pseudocode 9 comments reinforcementlearning
- Soft Actor-Critic with Discrete Actions https://spinningup.openai.com/en/latest/algorithms/sac.html 6 comments reinforcementlearning
Linking pages
- Which Reinforcement learning-RL algorithm to use where, when and in what scenario? | by Ujwal Tewari | DataDrivenInvestor https://medium.com/datadriveninvestor/which-reinforcement-learning-rl-algorithm-to-use-where-when-and-in-what-scenario-e3e7617fb0b1?amp%3Bsk=ab3658c27431dafc50a276a8b166ba1d&source=friends_link 19 comments
Related searches:
Search whole site: site:spinningup.openai.com
Search title: Soft Actor-Critic — Spinning Up documentation
See how to search.