Pushing the Limits of LLM Quantization via the Linearity Theorem - discu.eu

Hacker News

Pushing the Limits of LLM Quantization via the Linearity Theorem https://arxiv.org/abs/2411.17525 2 comments 20/4/2025

Reddit

We built a data-free method for compressing heavy LLMs https://arxiv.org/pdf/2411.17525 2 comments 19/4/2025 artificial

Linking pages

LLMs No Longer Require Powerful Servers: Researchers from MIT, KAUST, ISTA, and Yandex Introduce a New AI Approach to Rapidly Compress Large Language Models without a Significant Loss of Quality - MarkTechPost https://www.marktechpost.com/2025/04/11/llms-no-longer-require-powerful-servers-researchers-from-mit-kaust-ista-and-yandex-introduce-a-new-ai-approach-to-rapidly-compress-large-language-models-without-a-significant-loss-of-quality/ 66 comments

Related searches:

Search whole site: site:arxiv.org

Search title: Pushing the Limits of LLM Quantization via the Linearity Theorem

See how to search.

Submit link to: