Hacker News
- DeepScaleR: Surpassing O1-Preview with a 1.5B Model by Scaling RL https://pretty-radio-b75.notion.site/DeepScaleR-Surpassing-O1-Preview-with-a-1-5B-Model-by-Scaling-RL-19681902c1468005bed8ca303013a4e2 127 comments
Linking pages
- Scaling up Open Reasoning with OpenThinker-32B | Open Thoughts https://www.open-thoughts.ai/blog/scale 1 comment
- GitHub - agentica-project/deepscaler: Democratizing Reinforcement Learning for LLMs https://github.com/agentica-project/deepscaler 0 comments
- GitHub - zzli2022/Awesome-System2-Reasoning-LLM https://github.com/zzli2022/Awesome-System2-Reasoning-LLM 0 comments
Related searches:
Search whole site: site:pretty-radio-b75.notion.site
Search title: DeepScaleR: Surpassing O1-Preview with a 1.5B Model by Scaling RL | Notion
See how to search.