Hacker News
- Fast Multidimensional Matrix Multiplication on CPU from Scratch (2022) https://siboehm.com/articles/22/Fast-MMM-on-CPU 23 comments
Linking pages
- Beating OpenBLAS and MKL in 150 lines of C Code: A Tutorial on High-Performance Matrix Multiplication https://salykova.github.io/matmul-cpu 159 comments
- TFLOPS Matrix Multiplication on an 8-core CPU in C from Scratch https://salykova.github.io/matmul-c 0 comments
- Beating NumPy's Matrix Multiplication in 150 lines of C code https://salykova.github.io/matmul 0 comments
Linked pages
- https://godbolt.org 794 comments
- aarch64_amx.py · GitHub https://gist.github.com/dougallj/7a75a3be1ec69ca550e7c36dc75e0d6f 301 comments
- Beware of fast-math https://simonbyrne.github.io/notes/fastmath/ 189 comments
- Agner Fog https://www.agner.org/ 35 comments
- uops.info https://uops.info/ 5 comments
- GitHub - sirupsen/napkin-math: Techniques and numbers for estimating system's performance from first-principles https://github.com/sirupsen/napkin-math 1 comment
- Math Kernel Library - Wikipedia https://en.wikipedia.org/wiki/Math_Kernel_Library 0 comments
- BLAS (Basic Linear Algebra Subprograms) https://netlib.org/blas/#_blas_routines 0 comments