- [R] Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing https://arxiv.org/abs/2404.12253 3 comments machinelearning
Linking pages
- GitHub - hijkzzz/Awesome-LLM-Strawberry: A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques. https://github.com/hijkzzz/Awesome-LLM-Strawberry 4 comments
- Tencent AI Lab Developed AlphaLLM: A Novel Machine Learning Framework for Self-Improving Language Models - MarkTechPost https://www.marktechpost.com/2024/04/22/tencent-ai-lab-developed-alphallm-a-novel-machine-learning-framework-for-self-improving-language-models/ 1 comment
- How do Large Language Models “think”? - by Thomas Voice https://thomasvoice.substack.com/p/how-do-large-language-models-think 1 comment
- How Good Are the Latest Open LLMs? And Is DPO Better Than PPO? https://magazine.sebastianraschka.com/p/how-good-are-the-latest-open-llms 1 comment
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.
Related searches:
Search whole site: site:arxiv.org
Search title: [2404.12253] Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
See how to search.