- [D] Daily Paper Discussions - FlashAttention 3 https://arxiv.org/abs/2407.08608 6 comments machinelearning
Linking pages
- LLM Research Papers: The 2024 List https://magazine.sebastianraschka.com/p/llm-research-papers-the-2024-list 11 comments
- Outperforming cuBLAS on H100: a Worklog https://cudaforfun.substack.com/p/outperforming-cublas-on-h100-a-worklog 0 comments
- Why large language models struggle with long contexts https://www.understandingai.org/p/why-large-language-models-struggle 0 comments
- Why AI language models choke on too much text - Ars Technica https://arstechnica.com/ai/2024/12/why-ai-language-models-choke-on-too-much-text/ 0 comments