GitHub - tspeterkim/flash-attention-minimal: Flash Attention in ~100 lines of CUDA (forward pass only)

Hacker News
- Show HN: Flash Attention in ~100 lines of CUDA https://github.com/tspeterkim/flash-attention-minimal 40 comments

Reddit (r/MachineLearning)
- [P] Flash Attention in ~100 lines of CUDA https://github.com/tspeterkim/flash-attention-minimal 2 comments
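The repo's README is not part of this snapshot, but given the title, a rough sketch of the core idea may help: the forward pass streams over K and V with an online-softmax recurrence, so the full N x N attention score matrix is never materialized. The sketch below is not the repo's kernel (which presumably tiles K/V through shared memory per the FlashAttention paper); it is a minimal single-head, one-thread-per-query-row illustration, with a made-up kernel name (attention_forward) and an assumed head dimension d <= 128.

#include <cuda_runtime.h>
#include <math.h>
#include <stdio.h>
#include <stdlib.h>

// Minimal sketch of a FlashAttention-style forward pass: one thread per
// query row, streaming over K/V with an online softmax so the N x N score
// matrix is never materialized. NOT the repo's kernel: no tiling, no
// batching, single head; assumes row-major N x d matrices and d <= 128.
__global__ void attention_forward(const float* Q, const float* K,
                                  const float* V, float* O, int N, int d) {
    int row = blockIdx.x * blockDim.x + threadIdx.x;
    if (row >= N) return;

    float scale = rsqrtf((float)d);  // 1/sqrt(d) softmax scaling
    float m = -INFINITY;             // running row max
    float l = 0.0f;                  // running softmax denominator
    float acc[128];                  // running unnormalized output row
    for (int k = 0; k < d; ++k) acc[k] = 0.0f;

    for (int j = 0; j < N; ++j) {
        // Score s = (q_row . k_j) / sqrt(d)
        float s = 0.0f;
        for (int k = 0; k < d; ++k) s += Q[row * d + k] * K[j * d + k];
        s *= scale;

        // Online-softmax update: rescale the old statistics to the new
        // max, then fold in the new score (expf(-inf) == 0 makes the
        // first iteration a no-op rescale).
        float m_new = fmaxf(m, s);
        float correction = expf(m - m_new);
        float p = expf(s - m_new);
        l = l * correction + p;
        for (int k = 0; k < d; ++k)
            acc[k] = acc[k] * correction + p * V[j * d + k];
        m = m_new;
    }
    for (int k = 0; k < d; ++k) O[row * d + k] = acc[k] / l;
}

int main() {
    const int N = 64, d = 64;  // sequence length, head dimension
    size_t bytes = (size_t)N * d * sizeof(float);
    float *hQ = (float*)malloc(bytes), *hK = (float*)malloc(bytes);
    float *hV = (float*)malloc(bytes), *hO = (float*)malloc(bytes);
    for (int i = 0; i < N * d; ++i) {
        hQ[i] = (float)rand() / RAND_MAX;
        hK[i] = (float)rand() / RAND_MAX;
        hV[i] = (float)rand() / RAND_MAX;
    }
    float *Q, *K, *V, *O;
    cudaMalloc(&Q, bytes); cudaMalloc(&K, bytes);
    cudaMalloc(&V, bytes); cudaMalloc(&O, bytes);
    cudaMemcpy(Q, hQ, bytes, cudaMemcpyHostToDevice);
    cudaMemcpy(K, hK, bytes, cudaMemcpyHostToDevice);
    cudaMemcpy(V, hV, bytes, cudaMemcpyHostToDevice);

    attention_forward<<<(N + 31) / 32, 32>>>(Q, K, V, O, N, d);
    cudaMemcpy(hO, O, bytes, cudaMemcpyDeviceToHost);  // implicit sync
    printf("O[0][0] = %f\n", hO[0]);
    return 0;
}

Each output row matches naive softmax(QK^T / sqrt(d)) V up to float rounding; what the actual kernel adds on top of this recurrence is the block-wise tiling through shared memory that gives FlashAttention its memory-bandwidth win.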