- Why is policy assumed to be a probability density function instead of a probability function in Sutton and Barto for continuous actions? http://incompleteideas.net/book/RLbook2020.pdf 10 comments reinforcementlearning
Linking pages
- State of the art in LLMs + Robotics - 2023 - hlfshell https://hlfshell.ai/posts/llms-and-robotics-papers-2023/ 4 comments
- free-programming-books/free-programming-books-subjects.md at main · EbookFoundation/free-programming-books · GitHub https://github.com/EbookFoundation/free-programming-books/blob/main/books/free-programming-books-subjects.md 3 comments
- Real-time machine learning: challenges and solutions https://huyenchip.com/2022/01/02/real-time-machine-learning-challenges-and-solutions.html 1 comment
- RLlib for Deep Hierarchical Multiagent Reinforcement Learning – DeUmbra https://deumbra.com/2022/08/rllib-for-deep-hierarchical-multiagent-reinforcement-learning/ 0 comments
- RLlib for Deep Hierarchical Multiagent Reinforcement Learning | by Jonathan Mugan | Medium https://medium.com/@jmugan/rllib-for-deep-hierarchical-multiagent-reinforcement-learning-6aa96cdee154 0 comments
- GitHub - aikorea/awesome-rl: Reinforcement learning resources curated https://github.com/aikorea/awesome-rl 0 comments
- GitHub - ahmedbahaaeldin/From-0-to-Research-Scientist-resources-guide: Detailed and tailored guide for undergraduate students or anybody want to dig deep into the field of AI with solid foundation. https://github.com/ahmedbahaaeldin/From-0-to-Research-Scientist-resources-guide 0 comments
- An open-source gymnasium for machine learning assisted computer architecture design – Google Research Blog https://ai.googleblog.com/2023/07/an-open-source-gymnasium-for-computer.html 0 comments
- An open-source gymnasium for machine learning assisted computer architecture design – Google Research Blog https://blog.research.google/2023/07/an-open-source-gymnasium-for-computer.html?m=1 0 comments
- Wordle Perfect Play with Generic Methods https://espadrine.github.io/blog/posts/wordle-perfect-play-with-generic-methods.html 0 comments