Hacker News
- Mamba: Linear-Time Sequence Modeling with Selective State Spaces https://arxiv.org/abs/2312.00752 37 comments
- Mamba: Linear-Time Sequence Modeling with Selective State Spaces https://arxiv.org/abs/2312.00752 5 comments machinelearning
Linking pages
- GitHub - johnma2006/mamba-minimal: Simple, minimal implementation of Mamba in one file of PyTorch. https://github.com/johnma2006/mamba-minimal 109 comments
- My AI Timelines Have Sped Up (Again) https://www.alexirpan.com/2024/01/10/ai-timelines-2024.html 95 comments
- Mamba Explained | Kola Ayonrinde https://www.kolaayonrinde.com/blog/2024/02/11/mamba.html 93 comments
- How to make LLMs go fast https://vgel.me/posts/faster-inference/ 54 comments
- Mamba Explained https://thegradient.pub/mamba-explained/ 44 comments
- Why we might have superintelligence sooner than most think https://pauseai.info/urgency 4 comments
- GitHub - state-spaces/mamba https://github.com/state-spaces/mamba 2 comments
- Modeling Extremely Large Images with xT – The Berkeley Artificial Intelligence Research Blog https://bair.berkeley.edu/blog/2024/03/21/xt/ 2 comments
- Ring Attention Explained | Coconut Mode https://coconut-mode.com/posts/ring-attention/ 2 comments
- State-space LLMs: Do we need Attention? https://www.interconnects.ai/p/llms-beyond-attention 1 comment
- GitHub - HazyResearch/aisys-building-blocks: Building blocks for foundation models. https://github.com/HazyResearch/aisys-building-blocks 1 comment
- A Visual Guide to Mamba and State Space Models https://maartengrootendorst.substack.com/p/a-visual-guide-to-mamba-and-state 1 comment
- GitHub - havenhq/mamba-chat: Mamba-Chat: A chat LLM based on the state-space model architecture 🐍 https://github.com/havenhq/mamba-chat 0 comments
- GitHub - apapiu/mamba_small_bench: Trying out the Mamba architecture on small examples (cifar-10, shakespeare char level etc.) https://github.com/apapiu/mamba_small_bench 0 comments
- Mamba: Linear-Time Sequence Modeling with Selective State Spaces https://gonzoml.substack.com/p/mamba-linear-time-sequence-modeling 0 comments
- GitHub - AIoT-MLSys-Lab/Efficient-LLMs-Survey: Efficient Large Language Models: A Survey https://github.com/AIoT-MLSys-Lab/Efficient-LLMs-Survey 0 comments
- GitHub - flawedmatrix/mamba-ssm: Implementation of mamba with rust https://github.com/flawedmatrix/mamba-ssm 0 comments
- A Visual Guide to Mamba and State Space Models https://newsletter.maartengrootendorst.com/p/a-visual-guide-to-mamba-and-state 0 comments
- Compute Thresholds are Ineffective - by Dean W. Ball https://hyperdimensional.substack.com/p/compute-thresholds-are-ineffective 0 comments
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.
Related searches:
Search whole site: site:arxiv.org
Search title: [2312.00752] Mamba: Linear-Time Sequence Modeling with Selective State Spaces
See how to search.