- [R] Linear Transformers Are Secretly Fast Weight Memory Systems https://arxiv.org/abs/2102.11174 2 comments machinelearning
Linking pages
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.
Related searches:
Search whole site: site:arxiv.org
Search title: [2102.11174] Linear Transformers Are Secretly Fast Weight Programmers
See how to search.