Hacker News
- Making deep learning go brrrr from first principles (2022) https://horace.io/brrr_intro.html 18 comments
Linking pages
- How Nvidia’s CUDA Monopoly In Machine Learning Is Breaking - OpenAI Triton And PyTorch 2.0 https://www.semianalysis.com/p/nvidiaopenaitritonpytorch 112 comments
- JAX vs Julia (vs PyTorch) · Patrick Kidger https://kidger.site/thoughts/jax-vs-julia/ 110 comments
- The Mathematics of Training LLMs — with Quentin Anthony of Eleuther AI https://www.latent.space/p/transformers-math#details 66 comments
- Normcore LLM Reads · GitHub https://gist.github.com/veekaybee/be375ab33085102f9027853128dc5f0e 54 comments
- Optimal Performance without Static Graphs by Fusing Tensor Operation Streams https://burn.dev/blog/fusion-tensor-operation-streams/ 33 comments
- A guide to LLM inference and performance https://www.baseten.co/blog/llm-transformer-inference-guide/ 14 comments
- What Shapes Do Matrix Multiplications Like? [medium] https://www.thonking.ai/p/what-shapes-do-matrix-multiplications 12 comments
- Transformer Inference Arithmetic | kipply's blog https://kipp.ly/blog/transformer-inference-arithmetic/ 4 comments
- Yandex Publishes YaLM 100B. It’s the Largest GPT-Like Neural Network in Open Source | by Mikhail Khrushchev | Yandex | Medium https://medium.com/yandex/yandex-publishes-yalm-100b-its-the-largest-gpt-like-neural-network-in-open-source-d1df53d0e9a6 3 comments
- Transformer Inference Arithmetic | kipply's blog https://carolchen.me/blog/transformer-inference-arithmetic/ 2 comments
- GitHub - HazyResearch/aisys-building-blocks: Building blocks for foundation models. https://github.com/HazyResearch/aisys-building-blocks 1 comment
- Losses Learned https://sebastianraschka.com/blog/2022/losses-learned-part1.html 0 comments
- Every Google vs OpenAI Argument, Dissected - by swyx https://lspace.swyx.io/p/google-vs-openai 0 comments
- Dissecting Batching Effects in GPT Inference https://le.qun.ch/en/blog/2023/05/13/transformer-batching/ 0 comments
- Models generating training data: huge win or fake win? https://dblalock.substack.com/p/models-generating-training-data-huge 0 comments
- Transformer Inference Arithmetic | kipply's blog https://kipp.ly/transformer-inference-arithmetic/ 0 comments
- Modular: Developer Voices: Deep Dive with Chris Lattner on Mojo https://www.modular.com/blog/developer-voices-deep-dive-with-chris-lattner-on-mojo 0 comments
- How To Write A Fast Matrix Multiplication From Scratch With Tensor Cores | Alex Armbruster https://alexarmbr.github.io/2024/08/10/How-To-Write-A-Fast-Matrix-Multiplication-From-Scratch-With-Tensor-Cores.html 0 comments
Related searches:
Search whole site: site:horace.io
Search title: Making Deep Learning go Brrrr From First Principles
See how to search.