7B Model and 8K Examples: Emerging Reasoning with Reinforcement Learning is Both Effective and Efficient | Notion - discu.eu

Hacker News

Emerging reasoning with reinforcement learning https://hkust-nlp.notion.site/simplerl-reason 212 comments 26/1/2025

Reddit

[R] New 7B open-source replication of R1 solves math problems https://hkust-nlp.notion.site/simplerl-reason 5 comments 26/1/2025 machinelearning

Linking pages

Explainer: What's R1 & Everything Else? - Tim Kellogg https://timkellogg.me/blog/2025/01/25/r1 125 comments
DeepSeek: The Greatest Growth Hack of All Times meets its David in a Chinese Quant. https://centreforaileadership.org/resources/deepseeks_narrative_attack/ 0 comments
GitHub - zzli2022/Awesome-System2-Reasoning-LLM https://github.com/zzli2022/Awesome-System2-Reasoning-LLM 0 comments

Would you like to stay up to date with Computer science? Checkout Computer science Weekly.

Related searches:

Search whole site: site:hkust-nlp.notion.site

Search title: 7B Model and 8K Examples: Emerging Reasoning with Reinforcement Learning is Both Effective and Efficient | Notion

See how to search.

Submit link to: