Hacker News
- Understanding Reasoning LLMs https://magazine.sebastianraschka.com/p/understanding-reasoning-llms 186 comments
Linked pages
- [2501.12948] DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning https://arxiv.org/abs/2501.12948 1060 comments
- Build a Large Language Model (From Scratch): Raschka, Sebastian: 9781633437166: Amazon.com: Books https://www.amazon.com/dp/1633437167/ 38 comments
- GitHub - Jiayi-Pan/TinyZero: Clean, minimal, accessible reproduction of DeepSeek R1-Zero https://github.com/Jiayi-Pan/TinyZero 27 comments
- LLM Training: RLHF and Its Alternatives https://magazine.sebastianraschka.com/p/llm-training-rlhf-and-its-alternatives 14 comments
- Sky-T1: Train your own O1 preview model within $450 https://novasky-ai.github.io/posts/sky-t1/ 6 comments
- [2408.03314] Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters https://arxiv.org/abs/2408.03314 1 comment
- [2410.18982] O1 Replication Journey: A Strategic Progress Report -- Part 1 https://arxiv.org/abs/2410.18982 0 comments
Related searches:
Search whole site: site:magazine.sebastianraschka.com
Search title: Understanding Reasoning LLMs - by Sebastian Raschka, PhD
See how to search.