Hacker News
- Emerging reasoning with reinforcement learning https://hkust-nlp.notion.site/simplerl-reason 207 comments
- Emerging Reasoning with Reinforcement Learning Is Both Effective and Efficient https://hkust-nlp.notion.site/simplerl-reason 1 comment
- [R] New 7B open-source replication of R1 solves math problems https://hkust-nlp.notion.site/simplerl-reason 5 comments (machinelearning)
Related searches:
Search whole site: site:hkust-nlp.notion.site
Search title: 7B Model and 8K Examples: Emerging Reasoning with Reinforcement Learning is Both Effective and Efficient | Notion