Hacker News
- LIMA: Less Is More for Alignment https://arxiv.org/abs/2305.11206 3 comments
- LIMA: Less Is More for Alignment https://arxiv.org/abs/2305.11206 9 comments
- LIMA, a 65B-Param LLaMa fine-tuned with standard supervised loss on only 1,000 carefully curated prompts & responses, without any RLHF, demonstrates remarkably strong performance, learning to follow specific responses from only a handful of examples in the training data, including complex queries. https://arxiv.org/abs/2305.11206 32 comments machinelearning
Linking pages
- AI and Open Source in 2023 - by Sebastian Raschka, PhD https://magazine.sebastianraschka.com/p/ai-and-open-source-in-2023 67 comments
- Practical Tips for Finetuning LLMs Using LoRA (Low-Rank Adaptation) https://magazine.sebastianraschka.com/p/practical-tips-for-finetuning-llms 37 comments
- Optimizing LLMs From a Dataset Perspective https://sebastianraschka.com/blog/2023/optimizing-LLMs-dataset-perspective.html 24 comments
- 10 Noteworthy AI Research Papers of 2023 https://magazine.sebastianraschka.com/p/10-ai-research-papers-2023 24 comments
- Bringing LLM Fine-Tuning and RLHF to Everyone https://argilla.io/blog/argilla-for-llms/ 11 comments
- GitHub - JUSTSUJAY/ML-Research-Papers https://github.com/JUSTSUJAY/ML-Research-Papers 10 comments
- Apple's chance to remake AI development - by Matthew Lynley https://www.supervised.news/p/apples-chance-to-remake-ai-development 4 comments
- Ahead of AI #9: LLM Tuning & Dataset Perspectives https://magazine.sebastianraschka.com/p/ahead-of-ai-9-llm-tuning-and-dataset 4 comments
- AI Research Highlights In 3 Sentences Or Less (May-June 2023) https://magazine.sebastianraschka.com/p/ai-research-highlights-in-3-sentences-2a1 3 comments
- GitHub - Alpha-VLLM/LLaMA2-Accessory: An Open-source Toolkit for LLM Development https://github.com/Alpha-VLLM/LLaMA2-Accessory 3 comments
- AI Fundamentals: Datasets 101 - Latent Space https://www.latent.space/p/datasets-101 1 comment
- Aman's AI Journal • Primers • Overview of Large Language Models https://aman.ai/primers/ai/LLM/ 1 comment
- Modern AI is Domestification https://thegradient.pub/ai-is-domestification/ 0 comments
- GitHub - RUCAIBox/LLMSurvey: The official GitHub page for the survey paper "A Survey of Large Language Models". https://github.com/RUCAIBox/LLMSurvey 0 comments
- Running Large Language Models Privately - privateGPT and Beyond | Weaviate - vector database https://weaviate.io/blog/private-llm 0 comments
- Extended Guide: Instruction-tune Llama 2 https://www.philschmid.de/instruction-tune-llama-2 0 comments
- Fine-tuning OpenLLaMA-7B with QLoRA for instruction following | Jou-ching (George) Sung https://georgesung.github.io/ai/qlora-ift/ 0 comments
- Reinforcement Learning from Human Feedback: When the Math Ain't Enough https://evalovernite.substack.com/p/rlhf-math-aint-enough 0 comments
- GitHub - nlpfromscratch/nlp-llms-resources: Master list of curated resources on NLP and LLMs https://github.com/nlpfromscratch/nlp-llms-resources 0 comments
- 📝 Guest Post: The Coming Wave of Specialized AI* https://thesequence.substack.com/p/guest-post-the-coming-wave-of-specialized 0 comments