Hacker News
- QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models https://arxiv.org/abs/2310.16795 12 comments
Linking pages
- GitHub - arpita8/Awesome-Mixture-of-Experts-Papers: Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts. https://github.com/arpita8/Awesome-Mixture-of-Experts-Papers 17 comments
- The GPU Poor strike back - by Omar Sanseviero https://thehackerllama.substack.com/p/the-gpu-poor-strike-back 0 comments
Related searches:
Search whole site: site:arxiv.org
Search title: [2310.16795] QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models
See how to search.