Hacker News
- Domain specific architectures for AI inference https://fleetwood.dev/posts/domain-specific-architectures 0 comments
- Domain specific architectures for AI inference https://fleetwood.dev/posts/domain-specific-architectures 0 comments
- Domain specific architectures for AI inference https://fleetwood.dev/posts/domain-specific-architectures 0 comments
Lobsters
- Domain specific architectures for AI inference https://fleetwood.dev/posts/domain-specific-architectures 0 comments ai , hardware
Linked pages
- https://openai.com/index/learning-to-reason-with-llms/ 1525 comments
- Groq http://groq.com 492 comments
- DRAMeXchange - World leading DRAM and NAND Flash market research firm, with more than a decade of most authoritative database https://www.dramexchange.com/ 142 comments
- How has DeepSeek improved the Transformer architecture? | Epoch AI https://epoch.ai/gradient-updates/how-has-deepseek-improved-the-transformer-architecture 68 comments
- Moore's law - Wikipedia https://en.wikipedia.org/wiki/Moore%27s_law 67 comments
- [2304.01433] TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning with Hardware Support for Embeddings https://arxiv.org/abs/2304.01433 55 comments
- https://www.reuters.com/technology/openai-set-finalize-first-custom-chip-design-this-year-2025-02-10/ 50 comments
- Jim Keller (engineer) - Wikipedia https://en.wikipedia.org/wiki/Jim_Keller_(engineer) 42 comments
- How To Scale Your Model https://jax-ml.github.io/scaling-book/ 31 comments
- [1911.05289] The Deep Learning Revolution and Its Implications for Computer Architecture and Chip Design https://arxiv.org/abs/1911.05289 24 comments
- Thinking, Fast and Slow - Wikipedia http://en.m.wikipedia.org/wiki/Thinking,_Fast_and_Slow 24 comments
- Adder (electronics) - Wikipedia https://en.wikipedia.org/wiki/Adder_(electronics) 24 comments
- NVIDIA Hopper Architecture In-Depth | NVIDIA Technical Blog https://developer.nvidia.com/blog/nvidia-hopper-architecture-in-depth/ 20 comments
- Making Deep Learning go Brrrr From First Principles https://horace.io/brrr_intro.html 20 comments
- Timing Technology: Lessons From The Media Lab · Gwern.net https://www.gwern.net/Timing 16 comments
- [2309.06180] Efficient Memory Management for Large Language Model Serving with PagedAttention https://arxiv.org/abs/2309.06180 16 comments
- Direct memory access - Wikipedia https://en.wikipedia.org/wiki/Direct_memory_access 8 comments
- http://www.eecs.harvard.edu/~htk/publication/1982-kung-why-systolic-architecture.pdf 5 comments
- The Golden Opportunity for American AI - Microsoft On the Issues https://blogs.microsoft.com/on-the-issues/2025/01/03/the-golden-opportunity-for-american-ai/ 5 comments
- Transformer Inference Arithmetic | kipply's blog https://kipp.ly/blog/transformer-inference-arithmetic/ 4 comments
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.
Related searches:
Search whole site: site:fleetwood.dev
Search title: Domain specific architectures for AI inference
See how to search.