- [R] Hydra Attention: Efficient Attention with Many Heads - Meta AI 2022 - 197x faster than standard attention https://arxiv.org/abs/2209.07484 32 comments machinelearning