Hacker News
- Emerging reasoning with reinforcement learning https://hkust-nlp.notion.site/simplerl-reason 211 comments
- [R] New 7B open-source replication of R1 solves math problems https://hkust-nlp.notion.site/simplerl-reason 5 comments machinelearning
Linking pages
- Explainer: What's R1 & Everything Else? - Tim Kellogg https://timkellogg.me/blog/2025/01/25/r1 125 comments
- DeepSeek: The Greatest Growth Hack of All Times meets its David in a Chinese Quant. https://centreforaileadership.org/resources/deepseeks_narrative_attack/ 0 comments
- GitHub - zzli2022/Awesome-System2-Reasoning-LLM https://github.com/zzli2022/Awesome-System2-Reasoning-LLM 0 comments
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.
Related searches:
Search whole site: site:hkust-nlp.notion.site
Search title: 7B Model and 8K Examples: Emerging Reasoning with Reinforcement Learning is Both Effective and Efficient | Notion
See how to search.