Linking pages
- Google Gemini Eats The World – Gemini Smashes GPT-4 By 5X, The GPU-Poors https://www.semianalysis.com/p/google-gemini-eats-the-world-gemini 113 comments
- Zen 4c: AMD’s Response to Hyperscale ARM & Intel Atom https://www.semianalysis.com/p/zen-4c-amds-response-to-hyperscale 57 comments
- Nvidia B100, B200, GB200 - COGS, Pricing, Margins, Ramp - Oberon, Umbriel, Miranda https://www.semianalysis.com/p/nvidia-b100-b200-gb200-cogs-pricing 19 comments
- Is Intel Back? Foundry & Product Resurgence Measured https://www.semianalysis.com/p/is-intel-back-foundry-and-product 16 comments
- AMD MI300 – Taming The Hype – AI Performance, Volume Ramp, Customers, Cost, IO, Networking, Software https://www.semianalysis.com/p/amd-mi300-taming-the-hype-ai-performance 2 comments
- Meta Custom Silicon: What's Old Is New https://www.semianalysis.com/p/meta-custom-silicon-whats-old-is 1 comment
- Groq Inference Tokenomics: Speed, But At What Cost? https://www.semianalysis.com/p/groq-inference-tokenomics-speed-but 1 comment
- Multi-Datacenter Training: OpenAI's Ambitious Plan To Beat Google's Infrastructure https://www.semianalysis.com/p/multi-datacenter-training-openais 1 comment
- TSMC’s Heroic Assumption – Low Utilization Rates, Fab Cancellation, 3nm Volumes, Automotive Weakness, AI Advanced Packaging Demands, 2024 Capex Weakness https://www.semianalysis.com/p/tsmcs-heroic-assumption-low-utilization 0 comments
- On Device AI – Double-Edged Sword https://www.semianalysis.com/p/on-device-ai-double-edged-sword 0 comments
- Diminishing Returns in Machine Learning Part 1 https://www.fromthenew.world/p/diminishing-returns-in-machine-learning 0 comments
- TPUv5e: The New Benchmark in Cost-Efficient Inference and Training for <200B Parameter Models https://www.semianalysis.com/p/tpuv5e-the-new-benchmark-in-cost 0 comments
- Broadcom’s Google TPU Revenue Explosion, Networking Boom, VMWare Integration https://www.semianalysis.com/p/broadcoms-google-tpu-revenue-explosion 0 comments
- Amazon Anthropic: Poison Pill or Empire Strikes Back https://www.semianalysis.com/p/amazon-anthropic-poison-pill-or-empire 0 comments
- Nvidia Blackwell Perf TCO Analysis - B100 vs B200 vs GB200NVL72 https://www.semianalysis.com/p/nvidia-blackwell-perf-tco-analysis 0 comments
- Is Intel Back? Foundry & Product Resurgence Measured https://open.substack.com/pub/semianalysis/p/is-intel-back-foundry-and-product?r=6gq23 0 comments
- OpenAI Is Doomed - Et tu, Microsoft? https://www.semianalysis.com/p/openai-is-doomed-et-tu-microsoft 0 comments
- [V]ery [L]ong [I]ncoherent [W]riteup - by Daud's Scout https://irrationalanalysis.substack.com/p/very-long-incoherent-writeup 0 comments
- GB200 Hardware Architecture - Component Supply Chain & BOM https://www.semianalysis.com/p/gb200-hardware-architecture-and-component 0 comments
- The Future of Compute: NVIDIA's Crown is Slipping https://mohitdagarwal.substack.com/p/from-dominance-to-dilemma-nvidia 0 comments
Linked pages
- How Nvidia’s CUDA Monopoly In Machine Learning Is Breaking - OpenAI Triton And PyTorch 2.0 https://www.semianalysis.com/p/nvidiaopenaitritonpytorch 112 comments
- The Inference Cost Of Search Disruption – Large Language Model Cost Analysis https://www.semianalysis.com/p/the-inference-cost-of-search-disruption 47 comments
- Google Apollo: The >$3 Billion Game-Changer in Datacenter Networking https://www.semianalysis.com/p/google-apollo-the-3-billion-game 5 comments
- The AI Brick Wall – A Practical Limit For Scaling Dense Transformer Models, and How GPT 4 Will Break Past It https://www.semianalysis.com/p/the-ai-brick-wall-a-practical-limit 0 comments
- Peeling The Onion’s Layers - Large Language Models Search Architecture And Cost https://www.semianalysis.com/p/peeling-the-onions-layers-large-language 0 comments
- Marvell's DSP Dilemma? Networking’s Tectonic Shift Led By Broadcom, Nvidia, Arista Networks, Microsoft, Meta, Macom, and more https://www.semianalysis.com/p/marvells-dsp-dilemma-networkings 0 comments
- An in-depth look at Google’s first Tensor Processing Unit (TPU) | Google Cloud Blog https://cloud.google.com/blog/products/ai-machine-learning/an-in-depth-look-at-googles-first-tensor-processing-unit-tpu 0 comments
- [2104.05158] Software-Hardware Co-design for Fast and Scalable Training of Deep Learning Recommendation Models https://arxiv.org/abs/2104.05158 0 comments
Related searches:
Search title: Google AI Infrastructure Supremacy: Systems Matter More Than Microarchitecture