[2212.09720] The case for 4-bit precision: k-bit Inference Scaling Laws - discu.eu

Reddit

[R] The case for 4-bit precision: k-bit Inference Scaling Laws - Tim Dettmers and Luke Zettlemoyer - Findings show that 4-bit precision is almost universally optimal for total model bits and zero-shot accuracy! https://arxiv.org/abs/2212.09720 2 comments 20/12/2022 machinelearning

Linking pages

What I learned from looking at 900 most popular open source AI tools https://huyenchip.com/2024/03/14/ai-oss.html 40 comments
Efficient LLM inference - by Finbarr Timbers https://www.artfintel.com/p/efficient-llm-inference 11 comments
Efficient LLM inference - Artificial Fintelligence https://finbarrtimbers.substack.com/p/efficient-llm-inference-341 0 comments
GitHub - RUCAIBox/LLMSurvey: The official GitHub page for the survey paper "A Survey of Large Language Models". https://github.com/RUCAIBox/LLMSurvey 0 comments
Transformer inference tricks - by Finbarr Timbers https://www.artfintel.com/p/transformer-inference-tricks 0 comments
GitHub - NexaAI/Awesome-LLMs-on-device: Awesome LLMs on Device: A Comprehensive Survey https://github.com/NexaAI/Awesome-LLMs-on-device 0 comments

Would you like to stay up to date with Computer science? Checkout Computer science Weekly.

Related searches:

Search whole site: site:arxiv.org

Search title: [2212.09720] The case for 4-bit precision: k-bit Inference Scaling Laws

See how to search.

Submit link to: