Linking pages
- GitHub - Jiayi-Pan/TinyZero: Clean, minimal, accessible reproduction of DeepSeek R1-Zero https://github.com/Jiayi-Pan/TinyZero 27 comments
- GitHub - AmberLJC/LLMSys-PaperList: Large Language Model (LLM) Systems Paper List https://github.com/AmberLJC/LLMSys-PaperList/ 1 comment
- GitHub - Unakar/Logic-RL https://github.com/Unakar/Logic-RL 0 comments
- Reducing VRAM Footprint in PPO and GRPO Using Selective Log-Softmax https://www.tylerromero.com/posts/2025-02-selective-log-softmax/ 0 comments
- What DeepSeek means for the world - by Mahesh https://madiator.substack.com/p/what-deepseek-means-for-the-world 0 comments
- GitHub - inclusionAI/AReaL: Distributed RL System for LLM Reasoning https://github.com/inclusionAI/AReaL 0 comments
Linked pages
Related searches:
Search whole site: site:github.com
Search title: GitHub - volcengine/verl: veRL: Volcano Engine Reinforcement Learning for LLM
See how to search.