Reddit
[R] Sparse is Enough in Scaling Transformers
https://arxiv.org/abs/2111.12763
5 comments
29/11/2021
machinelearning