- [R] A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques. https://github.com/hijkzzz/Awesome-LLM-Strawberry 4 comments machinelearning
Linking pages
Linked pages
- https://openai.com/index/learning-to-reason-with-llms/ 1525 comments
- [2403.09629] Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking https://arxiv.org/abs/2403.09629 271 comments
- https://openai.com/index/finding-gpt4s-mistakes-with-gpt-4/ 228 comments
- GitHub - sindresorhus/awesome: 😎 Awesome lists about all kinds of interesting topics https://github.com/sindresorhus/awesome 69 comments
- [2401.10020] Self-Rewarding Language Models https://arxiv.org/abs/2401.10020 68 comments
- [2009.03393] Generative Language Modeling for Automated Theorem Proving https://arxiv.org/abs/2009.03393 29 comments
- [2109.15316] Scalable Online Planning via Reinforcement Learning Fine-Tuning https://arxiv.org/abs/2109.15316 13 comments
- [2406.07394] Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B https://arxiv.org/abs/2406.07394 11 comments
- [2203.14465] STaR: Bootstrapping Reasoning With Reasoning https://arxiv.org/abs/2203.14465 5 comments
- [2408.16737] Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling https://arxiv.org/abs/2408.16737 5 comments
- https://openai.com/index/openai-o1-mini-advancing-cost-efficient-reasoning/ 5 comments
- [2305.14992] Reasoning with Language Model is Planning with World Model https://arxiv.org/abs/2305.14992 4 comments
- [2404.12253] Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing https://arxiv.org/abs/2404.12253 4 comments
- [2305.20050] Let's Verify Step by Step https://arxiv.org/abs/2305.20050 3 comments
- [2406.14283] Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning https://arxiv.org/abs/2406.14283 3 comments
- [2405.03553] AlphaMath Almost Zero: process Supervision without process https://arxiv.org/abs/2405.03553 2 comments
- [2201.11903] Chain of Thought Prompting Elicits Reasoning in Large Language Models https://arxiv.org/abs/2201.11903 1 comment
- [2404.02078] Advancing LLM Reasoning Generalists with Preference Trees https://arxiv.org/abs/2404.02078 1 comment
- [2408.03314] Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters https://arxiv.org/abs/2408.03314 1 comment
- [2402.03271] Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models https://arxiv.org/abs/2402.03271 0 comments
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.