Linking pages
- Together AI Released DeepCoder-14B-Preview: A Fully Open-Source Code Reasoning Model That Rivals o3-Mini With Just 14B Parameters - MarkTechPost https://www.marktechpost.com/2025/04/10/together-ai-released-deepcoder-14b-preview-a-fully-open-source-code-reasoning-model-that-rivals-o3-mini-with-just-14b-parameters/ 2 comments
Linked pages
- DeepScaleR: Surpassing O1-Preview with a 1.5B Model by Scaling RL | Notion https://pretty-radio-b75.notion.site/DeepScaleR-Surpassing-O1-Preview-with-a-1-5B-Model-by-Scaling-RL-19681902c1468005bed8ca303013a4e2 127 comments
- deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B · Hugging Face https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B 17 comments
- DeepCoder: A Fully Open-Source 14B Coder at O3-mini Level | Notion https://pretty-radio-b75.notion.site/DeepCoder-A-Fully-Open-Source-14B-Coder-at-O3-mini-Level-1cf81902c14680b3bee5eb349a512a51 5 comments
- GitHub - volcengine/verl: veRL: Volcano Engine Reinforcement Learning for LLM https://github.com/volcengine/verl 1 comment
Related searches:
Search whole site: site:github.com
Search title: GitHub - agentica-project/rllm: Democratizing Reinforcement Learning for LLMs
See how to search.