Engineering reasoning LLMs: Notes and Observations - discu.eu

Hacker News

Engineering Reasoning LLMs: Notes and Observations https://www.thelis.org/blog/reasoning-model-notes 0 comments 11/3/2025

Linked pages

QwQ: Reflect Deeply on the Boundaries of the Unknown | Qwen https://qwenlm.github.io/blog/qwq-32b-preview/ 421 comments
[2205.11916] Large Language Models are Zero-Shot Reasoners https://arxiv.org/abs/2205.11916 55 comments
https://cdn.openai.com/improving-mathematical-reasoning-with-process-supervision/Lets_Verify_Step_by_Step.pdf 29 comments
Sky-T1: Train your own O1 preview model within $450 https://novasky-ai.github.io/posts/sky-t1/ 6 comments
https://openai.com/o1/ 3 comments
[2408.03314] Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters https://arxiv.org/abs/2408.03314 1 comment
[2412.09413] Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems https://arxiv.org/abs/2412.09413 1 comment
The State of LLM Reasoning Models https://magazine.sebastianraschka.com/p/state-of-llm-reasoning-and-inference-scaling 0 comments

Related searches:

Search whole site: site:thelis.org

Search title: Engineering reasoning LLMs: Notes and Observations

See how to search.

Submit link to: