GitHub - hijkzzz/Awesome-LLM-Strawberry: A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques. - discu.eu

Reddit

[R] A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques. https://github.com/hijkzzz/Awesome-LLM-Strawberry 4 comments 16/9/2024 machinelearning

Linking pages

GitTrends - September 22 2024 - GitTrends https://gitstars.substack.com/p/gittrends-september-22-2024 0 comments

Linked pages

https://openai.com/index/learning-to-reason-with-llms/ 1525 comments
[2403.09629] Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking https://arxiv.org/abs/2403.09629 271 comments
https://openai.com/index/finding-gpt4s-mistakes-with-gpt-4/ 228 comments
GitHub - sindresorhus/awesome: 😎 Awesome lists about all kinds of interesting topics https://github.com/sindresorhus/awesome 69 comments
[2401.10020] Self-Rewarding Language Models https://arxiv.org/abs/2401.10020 67 comments
[2009.03393] Generative Language Modeling for Automated Theorem Proving https://arxiv.org/abs/2009.03393 29 comments
[2109.15316] Scalable Online Planning via Reinforcement Learning Fine-Tuning https://arxiv.org/abs/2109.15316 13 comments
[2406.07394] Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B https://arxiv.org/abs/2406.07394 11 comments
[2203.14465] STaR: Bootstrapping Reasoning With Reasoning https://arxiv.org/abs/2203.14465 5 comments
[2408.16737] Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling https://arxiv.org/abs/2408.16737 5 comments
https://openai.com/index/openai-o1-mini-advancing-cost-efficient-reasoning/ 5 comments
[2305.14992] Reasoning with Language Model is Planning with World Model https://arxiv.org/abs/2305.14992 4 comments
[2404.12253] Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing https://arxiv.org/abs/2404.12253 4 comments
[2305.20050] Let's Verify Step by Step https://arxiv.org/abs/2305.20050 3 comments
[2406.14283] Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning https://arxiv.org/abs/2406.14283 3 comments
[2405.03553] AlphaMath Almost Zero: process Supervision without process https://arxiv.org/abs/2405.03553 2 comments
[2201.11903] Chain of Thought Prompting Elicits Reasoning in Large Language Models https://arxiv.org/abs/2201.11903 1 comment
[2404.02078] Advancing LLM Reasoning Generalists with Preference Trees https://arxiv.org/abs/2404.02078 1 comment
[2408.03314] Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters https://arxiv.org/abs/2408.03314 1 comment
[2402.03271] Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models https://arxiv.org/abs/2402.03271 0 comments

Would you like to stay up to date with Computer science? Checkout Computer science Weekly.

Related searches:

Search whole site: site:github.com

Search title: GitHub - hijkzzz/Awesome-LLM-Strawberry: A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

See how to search.

Submit link to: