- [R] In Search of Needles in a 10M Haystack: Recurrent Memory Finds What LLMs Miss - AIRI, Moscow, Russia 2024 - RMT 137M, a fine-tuned GPT-2 with recurrent memory, is able to find 85% of hidden needles in a 10M Haystack! https://arxiv.org/abs/2402.10790 (26 comments, machinelearning)
Linking pages