Hacker News
- FlexAttention: The Flexibility of PyTorch with the Performance of FlashAttention https://pytorch.org/blog/flexattention/ 24 comments
Linking pages
- GitHub - Ligo-Biosciences/AlphaFold3: Open source implementation of AlphaFold3 https://github.com/Ligo-Biosciences/AlphaFold3 37 comments
- CUDA-Free Inference for LLMs | PyTorch https://pytorch.org/blog/cuda-free-inference-for-llms/ 0 comments
- PyTorch 2.5 Release Blog | PyTorch https://pytorch.org/blog/pytorch2-5/ 0 comments
Linked pages
- Mistral 7B | Mistral AI | Open source models https://mistral.ai/news/announcing-mistral-7b/ 618 comments
- [2310.06825] Mistral 7B https://arxiv.org/abs/2310.06825 124 comments
- [2108.12409] Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation https://arxiv.org/abs/2108.12409 17 comments
- [2309.06180] Efficient Memory Management for Large Language Model Serving with PagedAttention https://arxiv.org/abs/2309.06180 16 comments
- [1910.10683] Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer https://arxiv.org/abs/1910.10683 1 comment
- Fast and Expressive LLM Inference with RadixAttention and SGLang | LMSYS Org https://lmsys.org/blog/2024-01-17-sglang/ 0 comments
- [2407.07726] PaliGemma: A versatile 3B VLM for transfer https://arxiv.org/abs/2407.07726 0 comments