Linking pages
- Demystifying Reasoning Models - by Cameron R. Wolfe, Ph.D. https://cameronrwolfe.substack.com/p/demystifying-reasoning-models
- GitHub - zzli2022/Awesome-System2-Reasoning-LLM https://github.com/zzli2022/Awesome-System2-Reasoning-LLM
- Recent reasoning research: GRPO tweaks, base model RL, and data curation https://www.interconnects.ai/p/papers-im-reading-base-model-rl-grpo