Hacker News
- Engineering Reasoning LLMs: Notes and Observations https://www.thelis.org/blog/reasoning-model-notes 0 comments
Linked pages
- QwQ: Reflect Deeply on the Boundaries of the Unknown | Qwen https://qwenlm.github.io/blog/qwq-32b-preview/ 421 comments
- [2205.11916] Large Language Models are Zero-Shot Reasoners https://arxiv.org/abs/2205.11916 55 comments
- https://cdn.openai.com/improving-mathematical-reasoning-with-process-supervision/Lets_Verify_Step_by_Step.pdf 29 comments
- Sky-T1: Train your own O1 preview model within $450 https://novasky-ai.github.io/posts/sky-t1/ 6 comments
- https://openai.com/o1/ 3 comments
- [2408.03314] Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters https://arxiv.org/abs/2408.03314 1 comment
- [2412.09413] Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems https://arxiv.org/abs/2412.09413 1 comment
- The State of LLM Reasoning Models https://magazine.sebastianraschka.com/p/state-of-llm-reasoning-and-inference-scaling 0 comments
Related searches:
Search whole site: site:thelis.org
Search title: Engineering reasoning LLMs: Notes and Observations
See how to search.