Hacker News
Linked pages
- Learning to reason with LLMs https://openai.com/index/learning-to-reason-with-llms/ 1525 comments
- [2501.12948] DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning https://arxiv.org/abs/2501.12948 1060 comments
- R1-Zero and R1 Results and Analysis https://arcprize.org/blog/r1-zero-r1-results-analysis 269 comments
- ARC Prize https://arcprize.org/ 3 comments
- Mixture of Experts Explained https://huggingface.co/blog/moe 2 comments
- [2402.03300] DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models https://arxiv.org/abs/2402.03300 2 comments
- [2201.11903] Chain of Thought Prompting Elicits Reasoning in Large Language Models https://arxiv.org/abs/2201.11903 1 comment
Related searches:
Search whole site: site:labs.adaline.ai
Search title: Inside Reasoning Models OpenAI o3 And DeepSeek R1