Hacker News
- [D]: Replacing the LlamaDecoderLayer Class Hugging Face With New LongNet https://arxiv.org/abs/2307.02486 2 comments machinelearning
- [R] LongNet: Scaling Transformers to 1,000,000,000 Tokens https://arxiv.org/abs/2307.02486 28 comments machinelearning
Linking pages
- Alan’s conservative countdown to AGI – Dr Alan D. Thompson – Life Architect https://lifearchitect.ai/agi/ 73 comments
- Ahead of AI #11: New Foundation Models https://magazine.sebastianraschka.com/p/ahead-of-ai-11-new-foundation-models 34 comments
- Big Post About Big Context - by Grigory Sapunov - Gonzo ML https://gonzoml.substack.com/p/big-post-about-big-context 19 comments
- Aman's AI Journal • Primers • Overview of Large Language Models https://aman.ai/primers/ai/LLM/ 1 comment
- Any-to-Any Generative AI Model, Generative AI for 3D game development, Mobile app Development from Text, Context Window of 1 Billion+ tokens, Bark on Discord and more https://aibrews.substack.com/p/any-to-any-generative-ai-model-generative 0 comments
- Microsoft’s LongNet Scales Transformer to One Billion Tokens | Synced https://syncedreview.com/2023/07/10/microsofts-longnet-scales-transformer-to-one-billion-tokens/ 0 comments
- AI Research Highlights In 3 Sentences Or Less (June -July 2023) https://magazine.sebastianraschka.com/p/ai-research-highlights-in-3-sentences-738 0 comments
- GitHub - AIoT-MLSys-Lab/Efficient-LLMs-Survey: Efficient Large Language Models: A Survey https://github.com/AIoT-MLSys-Lab/Efficient-LLMs-Survey 0 comments