Hacker News
- [R] Scalable MatMul-free Language Modeling https://arxiv.org/abs/2406.02528 14 comments machinelearning
Linking pages
- Researchers upend AI status quo by eliminating matrix multiplication in LLMs | Ars Technica https://arstechnica.com/information-technology/2024/06/researchers-upend-ai-status-quo-by-eliminating-matrix-multiplication-in-llms/2 92 comments
- Energy-efficient AI model could be a game changer, 50 times better efficiency with no performance hit | TechSpot https://www.techspot.com/news/103561-ai-researchers-rethink-how-neural-networks-work-huge.html 88 comments
- Researchers upend AI status quo by eliminating matrix multiplication in LLMs | Ars Technica https://arstechnica.com/information-technology/2024/06/researchers-upend-ai-status-quo-by-eliminating-matrix-multiplication-in-llms/ 31 comments
- GitHub - ridgerchu/matmulfreellm: Implementation for MatMul-free LM. https://github.com/ridgerchu/matmulfreellm 1 comment
- [AINews] Talaria: Apple's new MLOps Superweapon • Buttondown https://buttondown.email/ainews/archive/ainews-talaria-apples-new-mlops-superweapon-4066/ 0 comments
- New Transformer architecture for powerful LLMs without GPUs | VentureBeat https://venturebeat.com/ai/new-transformer-architecture-could-enable-powerful-llms-without-gpus/ 0 comments
- AI researchers found a way to run LLMs at a lightbulb-esque 13 watts with no loss in performance | Tom's Hardware https://www.tomshardware.com/tech-industry/artificial-intelligence/ai-researchers-found-a-way-to-run-llms-at-a-lightbulb-esque-13-watts-with-no-loss-in-performance 0 comments
- [AINews] Test-Time Training, MobileLLM, Lilian Weng on Hallucination (Plus: Turbopuffer) • Buttondown https://buttondown.email/ainews/archive/ainews-to-be-named-3686/ 0 comments