Hacker News
- Distilling Step-by-Step Outperforming Larger Language Models with Less Training https://arxiv.org/abs/2305.02301 34 comments
- [R] Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes https://arxiv.org/abs/2305.02301 18 comments (r/machinelearning)
- Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes https://arxiv.org/abs/2305.02301 4 comments (r/languagetechnology)
Linking pages
- Distilling step-by-step: Outperforming larger language models with less training data and smaller model sizes – Google Research Blog https://blog.research.google/2023/09/distilling-step-by-step-outperforming.html 123 comments
- GitHub - monkeypatch/monkeypatch.py: The easiest way to build scalable LLM-powered applications, which get cheaper and faster over time. https://github.com/monkeypatch/monkeypatch.py 71 comments
- GitHub - Tanuki/tanuki.py: Easily build LLM-powered apps that get cheaper and faster over time. https://github.com/Tanuki/tanuki.py 15 comments
- AI Research Highlights In 3 Sentences Or Less (April-May 2023) https://magazine.sebastianraschka.com/p/ai-research-highlights-in-3-sentences 4 comments
- GitHub - horseee/Awesome-Efficient-LLM: A curated list for Efficient Large Language Models https://github.com/horseee/Awesome-Efficient-LLM 0 comments
- GitHub - AIoT-MLSys-Lab/Efficient-LLMs-Survey: Efficient Large Language Models: A Survey https://github.com/AIoT-MLSys-Lab/Efficient-LLMs-Survey 0 comments