Hacker News
- Mamba: Linear-Time Sequence Modeling with Selective State Spaces https://arxiv.org/abs/2312.00752 37 comments
- Mamba: Linear-Time Sequence Modeling with Selective State Spaces https://arxiv.org/abs/2312.00752 5 comments machinelearning
Linking pages
- Codestral Mamba | Mistral AI | Frontier AI in your hands https://mistral.ai/news/codestral-mamba/ 138 comments
- GitHub - johnma2006/mamba-minimal: Simple, minimal implementation of Mamba in one file of PyTorch. https://github.com/johnma2006/mamba-minimal 108 comments
- My AI Timelines Have Sped Up (Again) https://www.alexirpan.com/2024/01/10/ai-timelines-2024.html 95 comments
- Mamba Explained | Kola Ayonrinde https://www.kolaayonrinde.com/blog/2024/02/11/mamba.html 93 comments
- I. From GPT-4 to AGI: Counting the OOMs - SITUATIONAL AWARENESS https://situational-awareness.ai/from-gpt-4-to-agi/ 91 comments
- GitHub - idoh/mamba.np: A pure NumPy implementation of Mamba. https://github.com/idoh/mamba.np 64 comments
- How to make LLMs go fast https://vgel.me/posts/faster-inference/ 54 comments
- Mamba Explained https://thegradient.pub/mamba-explained/ 44 comments
- Fast LLM Inference From Scratch https://andrewkchan.dev/posts/yalm.html 28 comments
- 100M Token Context Windows — Magic https://magic.dev/blog/100m-token-context-windows 22 comments
- Tiny Time Mixers(TTMs): Powerful Zero/Few-Shot Forecasting Models by IBM https://aihorizonforecast.substack.com/p/tiny-time-mixersttms-powerful-zerofew?post_page-reds--= 18 comments
- Tiny Time Mixers(TTMs): Powerful Zero/Few-Shot Forecasting Models by IBM https://aihorizonforecast.substack.com/p/tiny-time-mixersttms-powerful-zerofew?post_page-reml--= 15 comments
- Meta Open-Sources MEGALODON LLM for Efficient Long Sequence Modeling - InfoQ https://www.infoq.com/news/2024/06/meta-llm-megalodon/ 10 comments
- Solomonic learning: Large language models and the art of induction - Amazon Science https://www.amazon.science/blog/solomonic-learning-large-language-models-and-the-art-of-induction 9 comments
- Why we might have superintelligence sooner than most think https://pauseai.info/urgency 4 comments
- Tiny Time Mixers(TTMs): Powerful Zero/Few-Shot Forecasting Models by IBM https://aihorizonforecast.substack.com/p/tiny-time-mixersttms-powerful-zerofew?post_page-redlear--= 3 comments
- GitHub - state-spaces/mamba https://github.com/state-spaces/mamba 2 comments
- Modeling Extremely Large Images with xT – The Berkeley Artificial Intelligence Research Blog https://bair.berkeley.edu/blog/2024/03/21/xt/ 2 comments
- Ring Attention Explained | Coconut Mode https://coconut-mode.com/posts/ring-attention/ 2 comments
- Cartesia https://cartesia.ai/blog/sonic 2 comments