Transformer Inference Arithmetic | kipply's blog - discu.eu

Hacker News

Transformer Inference Arithmetic (2022) https://kipp.ly/blog/transformer-inference-arithmetic 4 comments 13/5/2023

Linking pages

Transformer Math 101 | EleutherAI Blog https://blog.eleuther.ai/transformer-math/ 13 comments
Why GPT-3.5 is (mostly) cheaper than Llama 2 https://www.cursor.so/blog/llama-inference 10 comments
GitHub - 152334H/tortoise-tts-fast: Fast TorToiSe inference (5x or your money back!) https://github.com/152334H/tortoise-tts-fast 7 comments
Transformer Inference Arithmetic | kipply's blog https://carolchen.me/blog/transformer-inference-arithmetic/ 2 comments
How fast can we perform a forward pass? https://bounded-regret.ghost.io/how-fast-can-we-perform-a-forward-pass/ 0 comments
Nintil - Set Sail For Fail? On AI risk https://nintil.com/ai-safety 0 comments
Speeding up the GPT - KV cache | Becoming The Unbeatable https://immortal3.github.io/becoming-the-unbeatable/posts/gpt-kvcache/ 0 comments
How is LLaMa.cpp possible? - by Finbarr Timbers https://finbarrtimbers.substack.com/p/how-is-llamacpp-possible 0 comments
On Device AI – Double-Edged Sword https://www.semianalysis.com/p/on-device-ai-double-edged-sword 0 comments
Dissecting Batching Effects in GPT Inference https://le.qun.ch/en/blog/2023/05/13/transformer-batching/ 0 comments
Transformer Memory Arithmetic: Understanding all the Bytes in nanoGPT https://erees.dev/transformer-memory/ 0 comments
The Novice's LLM Training Guide https://rentry.co/llm-training 0 comments
Domain specific architectures for AI inference https://fleetwood.dev/posts/domain-specific-architectures 0 comments

Linked pages

Related searches:

Search whole site: site:kipp.ly

Search title: Transformer Inference Arithmetic | kipply's blog

See how to search.

Submit link to: