Hacker News
- The fastest unified matrix multiplication https://www.modular.com/blog/the-worlds-fastest-unified-matrix-multiplication 2 comments
Lobsters
- The world's fastest unified matrix multiplication https://www.modular.com/blog/the-worlds-fastest-unified-matrix-multiplication 2 comments ai, assembly, compilers, math, performance
Linking pages
- Mojo: the point of view of a researcher using Python - Pierre Augier's website https://augierpi.gricad-pages.univ-grenoble-alpes.fr/mojo-the-point-of-view-of-a-researcher-using-python.html 8 comments
- Modular: A unified, extensible platform to superpower your AI https://www.modular.com/blog/a-unified-extensible-platform-to-superpower-your-ai 1 comment
Linked pages
- TensorFlow http://tensorflow.org/ 440 comments
- PyTorch http://pytorch.org/ 100 comments
- Modular: AI's compute fragmentation: what matrix multiplication teaches us https://www.modular.com/blog/ais-compute-fragmentation-what-matrix-multiplication-teaches-us 44 comments
- Modular: AI development starts here https://www.modular.com/ 39 comments
- ONNX Runtime | Home https://www.onnxruntime.ai/ 36 comments
- TensorFlow Lite | ML for Mobile and Edge Devices https://www.tensorflow.org/lite/ 22 comments
- Single-precision floating-point format - Wikipedia https://en.wikipedia.org/wiki/Single-precision_floating-point_format 13 comments
- Just-in-time compilation - Wikipedia https://en.wikipedia.org/wiki/Just-in-time_compilation 11 comments
- Apache TVM https://tvm.apache.org/ 1 comment
- DLRM: An advanced, open source deep learning recommendation model https://ai.facebook.com/blog/dlrm-an-advanced-open-source-deep-learning-recommendation-model/ 1 comment
- Assembly language - Wikipedia https://en.m.wikipedia.org/wiki/Assembly_language 1 comment
- bfloat16 floating-point format - Wikipedia https://en.wikipedia.org/wiki/Bfloat16_floating-point_format 1 comment
- Introducing nvFuser, a deep learning compiler for PyTorch | PyTorch https://pytorch.org/blog/introducing-nvfuser-a-deep-learning-compiler-for-pytorch/ 0 comments
- TensorRT SDK | NVIDIA Developer https://developer.nvidia.com/tensorrt 0 comments
- Math Kernel Library - Wikipedia https://en.wikipedia.org/wiki/Math_Kernel_Library 0 comments
- Ahead-of-time compilation - Wikipedia https://en.wikipedia.org/wiki/Ahead-of-time_compilation 0 comments
- GitHub - ARM-software/ComputeLibrary: The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies. https://github.com/ARM-software/ComputeLibrary 0 comments
- XLA: Optimizing Compiler for Machine Learning | TensorFlow https://www.tensorflow.org/xla 0 comments
- Large language model - Wikipedia https://en.wikipedia.org/wiki/Large_language_model 0 comments