Inside the Matrix: Visualizing Matrix Multiplication, Attention and Beyond | PyTorch - discu.eu

Hacker News

Inside the Matrix: Visualizing Matrix Multiplication, Attention and Beyond https://pytorch.org/blog/inside-the-matrix/ 34 comments 26/9/2023

Linking pages

Low-Rank Pruning of Llama2 https://mobiusml.github.io/low-rank-llama2/ 3 comments
The AI OS (Sept 2023 Recap) - by swyx - Latent Space https://www.latent.space/p/sep-2023 0 comments

Linked pages

GitHub - karpathy/nanoGPT: The simplest, fastest repository for training/finetuning medium-sized GPTs. https://github.com/karpathy/nanoGPT 366 comments
[2106.09685] LoRA: Low-Rank Adaptation of Large Language Models https://arxiv.org/abs/2106.09685 8 comments
[2205.14135] FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness https://arxiv.org/abs/2205.14135 3 comments
mm ref https://bhosmer.github.io/mm/ref.html 1 comment
[2305.19370] Blockwise Parallel Transformer for Long Context Large Models https://arxiv.org/abs/2305.19370 0 comments

