Hacker News
- Demystifying Reasoning Models https://cameronrwolfe.substack.com/p/demystifying-reasoning-models 0 comments
Linked pages
- OpenAI o3 Breakthrough High Score on ARC-AGI-Pub https://arcprize.org/blog/oai-o3-pub-breakthrough 1773 comments
- https://openai.com/index/learning-to-reason-with-llms/ 1525 comments
- https://openai.com/index/openai-o3-mini/ 902 comments
- QwQ: Reflect Deeply on the Boundaries of the Unknown | Qwen https://qwenlm.github.io/blog/qwq-32b-preview/ 421 comments
- Elo rating system - Wikipedia https://en.wikipedia.org/wiki/Elo_rating_system 386 comments
- Stanford CRFM https://crfm.stanford.edu/2023/03/13/alpaca.html 298 comments
- [2502.03387] LIMO: Less is More for Reasoning https://arxiv.org/abs/2502.03387 180 comments
- [2305.15717] The False Promise of Imitating Proprietary LLMs https://arxiv.org/abs/2305.15717 119 comments
- Lagrange multiplier - Wikipedia https://en.wikipedia.org/wiki/Lagrange_multiplier 27 comments
- https://openai.com/index/introducing-openai-o1-preview/ 17 comments
- [2206.02336] On the Advance of Making Language Models Better Reasoners https://arxiv.org/abs/2206.02336 12 comments
- Monte Carlo tree search - Wikipedia https://en.wikipedia.org/wiki/Monte_Carlo_tree_search 12 comments
- https://openai.com/index/introducing-swe-bench-verified/ 10 comments
- Vicuna: An Open-Source Chatbot Impressing GPT-4 with 90%* ChatGPT Quality | LMSYS Org https://lmsys.org/blog/2023-03-30-vicuna/ 7 comments
- Number theory - Wikipedia http://en.wikipedia.org/wiki/number_theory#quotations 6 comments
- Sky-T1: Train your own O1 preview model within $450 https://novasky-ai.github.io/posts/sky-t1/ 6 comments
- Terence Tao - Wikipedia http://en.wikipedia.org/wiki/Terence_Tao 4 comments
- Koala: A Dialogue Model for Academic Research – The Berkeley Artificial Intelligence Research Blog https://bair.berkeley.edu/blog/2023/04/03/koala/ 4 comments
- [2305.20050] Let's Verify Step by Step https://arxiv.org/abs/2305.20050 3 comments
- Bespoke Labs https://www.bespokelabs.ai/blog/bespoke-stratos-the-unreasonable-effectiveness-of-reasoning-distillation 2 comments
Related searches:
Search whole site: site:cameronrwolfe.substack.com
Search title: Demystifying Reasoning Models - by Cameron R. Wolfe, Ph.D.
See how to search.