[2208.07339] LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale - discu.eu

Hacker News

LLM.int8(): 8-Bit Matrix Multiplication for Transformers at Scale (2022) https://arxiv.org/abs/2208.07339 23 comments 10/6/2023

Reddit

LLM.bit8 - Quantization via Matrices to cut inference memory in half https://arxiv.org/abs/2208.07339 8 comments 10/6/2023 machinelearning
