- [R] Hyena Hierarchy: Towards Larger Convolutional Language Models https://arxiv.org/abs/2302.10866 3 comments machinelearning
Linking pages
- Emerging Architectures for LLM Applications | Andreessen Horowitz https://a16z.com/2023/06/20/emerging-architectures-for-llm-applications/ 95 comments
- How to make LLMs go fast https://vgel.me/posts/faster-inference/ 54 comments
- Tiny Time Mixers(TTMs): Powerful Zero/Few-Shot Forecasting Models by IBM https://aihorizonforecast.substack.com/p/tiny-time-mixersttms-powerful-zerofew?post_page-reds--= 18 comments
- Tiny Time Mixers(TTMs): Powerful Zero/Few-Shot Forecasting Models by IBM https://aihorizonforecast.substack.com/p/tiny-time-mixersttms-powerful-zerofew?post_page-reml--= 15 comments
- Tiny Time Mixers(TTMs): Powerful Zero/Few-Shot Forecasting Models by IBM https://aihorizonforecast.substack.com/p/tiny-time-mixersttms-powerful-zerofew?post_page-redlear--= 3 comments
- Aman's AI Journal • Primers • Overview of Large Language Models https://aman.ai/primers/ai/LLM/ 1 comment
- Transformers Revolutionized AI. What Will Replace Them? https://www.forbes.com/sites/robtoews/2023/09/03/transformers-revolutionized-ai-what-will-replace-them/ 1 comment
- State-space LLMs: Do we need Attention? https://www.interconnects.ai/p/llms-beyond-attention 1 comment
- GitHub - HazyResearch/aisys-building-blocks: Building blocks for foundation models. https://github.com/HazyResearch/aisys-building-blocks 1 comment
- Hyena Hierarchy: Towards Larger Convolutional Language Models https://ermongroup.github.io/blog/hyena/ 0 comments
- Ahead of AI #7: Large Language Models 3.0 https://magazine.sebastianraschka.com/p/ahead-of-ai-7-large-language-models 0 comments
- AI Research Highlights In 3 Sentences Or Less (June -July 2023) https://magazine.sebastianraschka.com/p/ai-research-highlights-in-3-sentences-738 0 comments
- GitHub - AIoT-MLSys-Lab/Efficient-LLMs-Survey: Efficient Large Language Models: A Survey https://github.com/AIoT-MLSys-Lab/Efficient-LLMs-Survey 0 comments
- The Four Wars of the AI Stack (Dec 2023 Recap) https://www.latent.space/p/dec-2023 0 comments
- The Four Wars of the AI Stack (Dec 2023 Recap) https://www.latent.space/i/140396949/mixtral-sparks-a-gpuinference-war 0 comments