Hacker News
- TinyZero: Reproduction of DeepSeek R1 Zero in countdown and multiplication tasks https://github.com/Jiayi-Pan/TinyZero 27 comments
Linking pages
- Understanding Reasoning LLMs - by Sebastian Raschka, PhD https://magazine.sebastianraschka.com/p/understanding-reasoning-llms 186 comments
- Team Says They've Recreated DeepSeek's OpenAI Killer for Literally $30 https://futurism.com/researchers-deepseek-even-cheaper 26 comments
- GitHub - volcengine/verl: veRL: Volcano Engine Reinforcement Learning for LLM https://github.com/volcengine/verl 1 comment
- GitHub - Unakar/Logic-RL https://github.com/Unakar/Logic-RL 0 comments
- GitHub - zzli2022/Awesome-System2-Reasoning-LLM https://github.com/zzli2022/Awesome-System2-Reasoning-LLM 0 comments
Linked pages
Related searches:
Search whole site: site:github.com
Search title: GitHub - Jiayi-Pan/TinyZero: Clean, minimal, accessible reproduction of DeepSeek R1-Zero
See how to search.