Hacker News
Linking pages
- Large Transformer Model Inference Optimization | Lil'Log https://lilianweng.github.io/posts/2023-01-10-inference-optimization/ 20 comments
- Accelerating Neural Networks on Mobile and Web with Sparse Inference – Google AI Blog https://ai.googleblog.com/2021/03/accelerating-neural-networks-on-mobile.html 1 comment
- Learn how to make BERT smaller and faster | The Rasa Blog | Rasa https://blog.rasa.com/compressing-bert-for-faster-prediction-2/ 0 comments
- Improving Sparse Training with RigL – Google AI Blog https://ai.googleblog.com/2020/09/improving-sparse-training-with-rigl.html 0 comments
Related searches:
Search whole site: site:arxiv.org
Search title: [1902.09574] The State of Sparsity in Deep Neural Networks