Linking pages
Linked pages
- Paving the way to efficient architectures: StripedHyena-7B, open source models offering a glimpse into a world beyond Transformers https://www.together.ai/blog/stripedhyena-7b 72 comments
- [2212.14052] Hungry Hungry Hippos: Towards Language Modeling with State Space Models https://arxiv.org/abs/2212.14052 54 comments
- [2312.00752] Mamba: Linear-Time Sequence Modeling with Selective State Spaces https://arxiv.org/abs/2312.00752 42 comments
- https://arxiv.org/abs/2307.08621 36 comments
- GitHub - state-spaces/mamba https://github.com/state-spaces/mamba 2 comments
- In-context Learning and Induction Heads https://transformer-circuits.pub/2022/in-context-learning-and-induction-heads/index.html 0 comments
Related searches:
Search whole site: site:gonzoml.substack.com
Search title: Mamba: Linear-Time Sequence Modeling with Selective State Spaces
See how to search.