Hacker News
- Meta AI Megabyte: Predicting Million-Byte Sequences with Multiscale Transformers https://arxiv.org/abs/2305.07185 3 comments
- [R] MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers https://arxiv.org/abs/2305.07185 91 comments (r/machinelearning)
Linking pages
- Artificial General Intelligence Is Already Here https://www.noemamag.com/artificial-general-intelligence-is-already-here/ 62 comments
- How the BPE tokenization algorithm used by large language models works. | sidsite https://sidsite.com/posts/bpe/ 11 comments
- The Problem with Tokenization in LLMs https://matt-rickard.com/the-problem-with-tokenization-in-llms 1 comment
- 🥇Top ML Papers of the Week - by elvis - NLP Newsletter https://nlp.elvissaravia.com/p/top-ml-papers-of-the-week-dae 0 comments
- It doesn't matter if the robots are sentient or not https://www.mkaic.blog/p/it-doesnt-matter-if-the-robots 0 comments
- GitHub - AIoT-MLSys-Lab/Efficient-LLMs-Survey: Efficient Large Language Models: A Survey https://github.com/AIoT-MLSys-Lab/Efficient-LLMs-Survey 0 comments