Understanding Reasoning LLMs - by Sebastian Raschka, PhD - discu.eu

Hacker News

Understanding Reasoning LLMs https://magazine.sebastianraschka.com/p/understanding-reasoning-llms 188 comments 6/2/2025

Linking pages

Linked pages

[2501.12948] DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning https://arxiv.org/abs/2501.12948 1061 comments
Build a Large Language Model (From Scratch): Raschka, Sebastian: 9781633437166: Amazon.com: Books https://www.amazon.com/dp/1633437167/ 38 comments
GitHub - Jiayi-Pan/TinyZero: Clean, minimal, accessible reproduction of DeepSeek R1-Zero https://github.com/Jiayi-Pan/TinyZero 27 comments
LLM Training: RLHF and Its Alternatives https://magazine.sebastianraschka.com/p/llm-training-rlhf-and-its-alternatives 14 comments
Sky-T1: Train your own O1 preview model within $450 https://novasky-ai.github.io/posts/sky-t1/ 6 comments
[2408.03314] Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters https://arxiv.org/abs/2408.03314 1 comment
[2410.18982] O1 Replication Journey: A Strategic Progress Report -- Part 1 https://arxiv.org/abs/2410.18982 0 comments

Related searches:

Search whole site: site:magazine.sebastianraschka.com

Search title: Understanding Reasoning LLMs - by Sebastian Raschka, PhD

See how to search.

Submit link to: