- [R] The case for 4-bit precision: k-bit Inference Scaling Laws - Tim Dettmers and Luke Zettlemoyer - Findings show that 4-bit precision is almost universally optimal for total model bits and zero-shot accuracy! https://arxiv.org/abs/2212.09720 2 comments machinelearning
Linking pages
- What I learned from looking at 900 most popular open source AI tools https://huyenchip.com/2024/03/14/ai-oss.html 40 comments
- Efficient LLM inference - by Finbarr Timbers https://www.artfintel.com/p/efficient-llm-inference 11 comments
- Efficient LLM inference - Artificial Fintelligence https://finbarrtimbers.substack.com/p/efficient-llm-inference-341 0 comments
- GitHub - RUCAIBox/LLMSurvey: The official GitHub page for the survey paper "A Survey of Large Language Models". https://github.com/RUCAIBox/LLMSurvey 0 comments
- Transformer inference tricks - by Finbarr Timbers https://www.artfintel.com/p/transformer-inference-tricks 0 comments
- GitHub - NexaAI/Awesome-LLMs-on-device: Awesome LLMs on Device: A Comprehensive Survey https://github.com/NexaAI/Awesome-LLMs-on-device 0 comments
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.
Related searches:
Search whole site: site:arxiv.org
Search title: [2212.09720] The case for 4-bit precision: k-bit Inference Scaling Laws
See how to search.