Hacker News
- Pushing the Limits of LLM Quantization via the Linearity Theorem https://arxiv.org/abs/2411.17525 2 comments
- We built a data-free method for compressing heavy LLMs https://arxiv.org/pdf/2411.17525 2 comments artificial
Linking pages
- LLMs No Longer Require Powerful Servers: Researchers from MIT, KAUST, ISTA, and Yandex Introduce a New AI Approach to Rapidly Compress Large Language Models without a Significant Loss of Quality - MarkTechPost https://www.marktechpost.com/2025/04/11/llms-no-longer-require-powerful-servers-researchers-from-mit-kaust-ista-and-yandex-introduce-a-new-ai-approach-to-rapidly-compress-large-language-models-without-a-significant-loss-of-quality/ 66 comments