- OpenAI Retro Contest (Sonic meta-RL) results: AliBaba team wins 1st place, 4,692/10,000; 229 submissions; winners use PPO/DQN w/hyperparameter tuning; next contest launches in a few months https://blog.openai.com/first-retro-contest-retrospective/ 4 comments reinforcementlearning
Linking pages
- Basic Tutorial with TensorFlow.js: Linear Regression | by Tristan Sokol | Medium https://medium.com/@tristansokol/basic-tutorial-with-tensorflow-js-linear-regression-aa68b16e5b8e 0 comments
- OpenAI Retro Contest Report (#15 score, #2 write-up) | by Oleg Mürk | Medium https://medium.com/@olegmrk/openai-retro-contest-report-b870bfd014e0 0 comments
- Song Lyric Toxicity, Commit Assistant, NLP Progress, DensePose, PyTorch Geometric,… | by elvis | DAIR.AI | Medium https://medium.com/dair-ai/song-lyric-toxicity-commit-assistant-nlp-progress-densepose-pytorch-geometric-4cf0c4ea0d25 0 comments
Linked pages
- ChatGPT https://chat.openai.com/ 742 comments
- MarI/O - Machine Learning for Video Games - YouTube https://www.youtube.com/watch?v=qv6UVOQ0F44 375 comments
- 42 (school) - Wikipedia https://en.wikipedia.org/wiki/42_(school) 150 comments
- [1506.02640] You Only Look Once: Unified, Real-Time Object Detection http://arxiv.org/abs/1506.02640 8 comments
- Train a Reinforcement Learning agent to play custom levels of Sonic the Hedgehog with Transfer Learning | Felix Yu https://flyyufelix.github.io/2018/06/11/sonic-rl.html 0 comments
- [1804.02717] DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills https://arxiv.org/abs/1804.02717 0 comments
- OpenAI Retro Contest Report (#15 score, #2 write-up) | by Oleg Mürk | Medium https://medium.com/@olegmrk/openai-retro-contest-report-b870bfd014e0 0 comments
Related searches:
Search whole site: site:blog.openai.com
Search title: Retro Contest: Results
See how to search.