Demystifying Reasoning Models - by Cameron R. Wolfe, Ph.D.

Linking pages

nanoMoE: Mixture-of-Experts (MoE) LLMs from Scratch in PyTorch https://cameronrwolfe.substack.com/p/nano-moe 0 comments

Linked pages

OpenAI o3 Breakthrough High Score on ARC-AGI-Pub https://arcprize.org/blog/oai-o3-pub-breakthrough 1773 comments
https://openai.com/index/learning-to-reason-with-llms/ 1525 comments
https://openai.com/index/openai-o3-mini/ 904 comments
QwQ: Reflect Deeply on the Boundaries of the Unknown | Qwen https://qwenlm.github.io/blog/qwq-32b-preview/ 421 comments
Elo rating system - Wikipedia https://en.wikipedia.org/wiki/Elo_rating_system 386 comments
Stanford CRFM https://crfm.stanford.edu/2023/03/13/alpaca.html 298 comments
[2502.03387] LIMO: Less is More for Reasoning https://arxiv.org/abs/2502.03387 180 comments
[2305.15717] The False Promise of Imitating Proprietary LLMs https://arxiv.org/abs/2305.15717 119 comments
Lagrange multiplier - Wikipedia https://en.wikipedia.org/wiki/Lagrange_multiplier 27 comments
https://openai.com/index/introducing-openai-o1-preview/ 17 comments
[2206.02336] On the Advance of Making Language Models Better Reasoners https://arxiv.org/abs/2206.02336 12 comments
Monte Carlo tree search - Wikipedia https://en.wikipedia.org/wiki/Monte_Carlo_tree_search 12 comments
https://openai.com/index/introducing-swe-bench-verified/ 10 comments
Vicuna: An Open-Source Chatbot Impressing GPT-4 with 90%* ChatGPT Quality | LMSYS Org https://lmsys.org/blog/2023-03-30-vicuna/ 7 comments
Number theory - Wikipedia http://en.wikipedia.org/wiki/number_theory#quotations 6 comments
Sky-T1: Train your own O1 preview model within $450 https://novasky-ai.github.io/posts/sky-t1/ 6 comments
Terence Tao - Wikipedia http://en.wikipedia.org/wiki/Terence_Tao 4 comments
Koala: A Dialogue Model for Academic Research – The Berkeley Artificial Intelligence Research Blog https://bair.berkeley.edu/blog/2023/04/03/koala/ 4 comments
[2305.20050] Let's Verify Step by Step https://arxiv.org/abs/2305.20050 3 comments
Bespoke Labs https://www.bespokelabs.ai/blog/bespoke-stratos-the-unreasonable-effectiveness-of-reasoning-distillation 2 comments