[2404.12253] Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing - discu.eu

Reddit

[R] Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing https://arxiv.org/abs/2404.12253 3 comments 21/4/2024 machinelearning

Linking pages

GitHub - hijkzzz/Awesome-LLM-Strawberry: A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques. https://github.com/hijkzzz/Awesome-LLM-Strawberry 4 comments
Tencent AI Lab Developed AlphaLLM: A Novel Machine Learning Framework for Self-Improving Language Models - MarkTechPost https://www.marktechpost.com/2024/04/22/tencent-ai-lab-developed-alphallm-a-novel-machine-learning-framework-for-self-improving-language-models/ 1 comment
How do Large Language Models “think”? - by Thomas Voice https://thomasvoice.substack.com/p/how-do-large-language-models-think 1 comment
How Good Are the Latest Open LLMs? And Is DPO Better Than PPO? https://magazine.sebastianraschka.com/p/how-good-are-the-latest-open-llms 1 comment

Would you like to stay up to date with Computer science? Checkout Computer science Weekly.

Related searches:

Search whole site: site:arxiv.org

Search title: [2404.12253] Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

See how to search.

Submit link to: