discu
Newsletters
Mentions
Extension
Pricing
Login
Sign Up
Hacker News
Emerging reasoning with reinforcement learning
https://hkust-nlp.notion.site/simplerl-reason
212 comments
26/1/2025
Reddit
[R] New 7B open-source replication of R1 solves math problems
https://hkust-nlp.notion.site/simplerl-reason
5 comments
26/1/2025
machinelearning