Linking pages
- Nx (Numerical Elixir) is now publicly available - Dashbit Blog https://dashbit.co/blog/nx-numerical-elixir-is-now-publicly-available 31 comments
- GitHub - zhimin-z/awesome-awesome-artificial-intelligence: A curated list of awesome curated lists of many topics closely related to artificial intelligence. https://github.com/zhimin-z/awesome-awesome-artificial-intelligence 15 comments
- GitHub - tensorchord/Awesome-LLMOps: An awesome & curated list of best LLMOps tools for developers https://github.com/tensorchord/Awesome-LLMOps 5 comments
- GitHub - tensorchord/awesome-open-source-mlops: An awesome & curated list of best open source MLOps/LLMOps tools for data scientists. https://github.com/tensorchord/awesome-open-source-mlops 0 comments
- GitHub - zhimin-z/awesome-awesome-machine-learning: A curated list of awesome curated lists of many topics closely related to machine learning. https://github.com/zhimin-z/awesome-awesome-machine-learning 0 comments
Linked pages
- Halide https://halide-lang.org/ 73 comments
- GitHub - facebookincubator/AITemplate: AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference. https://github.com/facebookincubator/AITemplate 71 comments
- TACO: The Tensor Algebra Compiler | Website for the TACO project http://tensor-compiler.org/ 54 comments
- NN-512 https://nn-512.com/ 44 comments
- MLIR https://mlir.llvm.org/ 30 comments
- Dive into Deep Learning Compiler — Dive into Deep Learning Compiler 0.1 documentation https://tvm.d2l.ai/ 13 comments
- [1805.08166] Learning to Optimize Tensor Programs https://arxiv.org/abs/1805.08166 13 comments
- GitHub - microsoft/hummingbird: Hummingbird compiles trained ML models into tensor computation for faster inference. https://github.com/microsoft/hummingbird/ 10 comments
- [2002.03794] The Deep Learning Compiler: A Comprehensive Survey https://arxiv.org/abs/2002.03794 8 comments
- MLC | Home https://mlc.ai/summer22/ 6 comments
- [2203.08069] DISTAL: The Distributed Tensor Algebra Compiler https://arxiv.org/abs/2203.08069 6 comments
- GitHub - openai/triton: Development repository for the Triton language and compiler https://github.com/openai/triton 5 comments
- GitHub - nebuly-ai/nos: Module to Automatically maximize the utilization of GPU resources in a Kubernetes cluster through real-time dynamic partitioning and elastic quotas - Effortless optimization at its finest! https://github.com/nebuly-ai/nebulgym 3 comments
- Apache TVM https://tvm.apache.org/ 1 comment
- [2105.04663] GSPMD: General and Scalable Parallelization for ML Computation Graphs https://arxiv.org/abs/2105.04663 1 comment
- [2202.04305] Compiler Support for Sparse Tensor Computations in MLIR https://arxiv.org/abs/2202.04305 0 comments
- GitHub - pytorch/glow: Compiler for Neural Network hardware accelerators https://github.com/pytorch/glow/ 0 comments
- Fireiron | Proceedings of the ACM International Conference on Parallel Architectures and Compilation Techniques https://dl.acm.org/doi/10.1145/3410463.3414632 0 comments
- [1805.00907] Glow: Graph Lowering Compiler Techniques for Neural Networks https://arxiv.org/abs/1805.00907 0 comments
- [2008.01040] A Learned Performance Model for Tensor Processing Units https://arxiv.org/abs/2008.01040 0 comments