Hacker News
- Large Transformer Model Inference Optimization https://lilianweng.github.io/posts/2023-01-10-inference-optimization/ 20 comments
Linking pages
- The Transformer Family Version 2.0 | Lil'Log https://lilianweng.github.io/posts/2023-01-27-the-transformer-family-v2/ 46 comments
- CNN vs. Vision Transformer: A Practitioner’s Guide to Selecting the Right Model | Tobias’ blog https://tobiasvanderwerff.github.io/2024/05/15/cnn-vs-vit.html 9 comments
- GPT-4 architecture: what we can deduce from research literature | Kirill Gadjello's personal blog and website https://kir-gadjello.github.io/posts/gpt4-some-technical-hypotheses/ 6 comments
- Foundation Models: The future (still) isn't happening fast enough https://www.madrona.com/foundation-models/ 1 comment
- Speeding up the GPT - KV cache | Becoming The Unbeatable https://immortal3.github.io/becoming-the-unbeatable/posts/gpt-kvcache/ 0 comments
- How is LLaMa.cpp possible? - by Finbarr Timbers https://finbarrtimbers.substack.com/p/how-is-llamacpp-possible 0 comments
- Some Math behind Neural Tangent Kernel | Lil'Log https://lilianweng.github.io/posts/2022-09-08-ntk/ 0 comments
Linked pages
- [2205.01068] OPT: Open Pre-trained Transformer Language Models https://arxiv.org/abs/2205.01068 318 comments
- The Transformer Family Version 2.0 | Lil'Log https://lilianweng.github.io/posts/2023-01-27-the-transformer-family-v2/ 46 comments
- [2208.07339] LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale https://arxiv.org/abs/2208.07339 33 comments
- How to Train Really Large Models on Many GPUs? | Lil'Log https://lilianweng.github.io/posts/2021-09-25-train-large/ 33 comments
- [1803.03635] The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks https://arxiv.org/abs/1803.03635 32 comments
- [1503.02531] Distilling the Knowledge in a Neural Network https://arxiv.org/abs/1503.02531 5 comments
- [2111.12763] Sparse is Enough in Scaling Transformers https://arxiv.org/abs/2111.12763 5 comments
- [1902.09574] The State of Sparsity in Deep Neural Networks https://arxiv.org/abs/1902.09574 3 comments
- [2106.05974] Scaling Vision with Sparse Mixture of Experts https://arxiv.org/abs/2106.05974 2 comments
- [2209.01667] A Review of Sparse Expert Models in Deep Learning https://arxiv.org/abs/2209.01667 1 comment
- [1911.02150] Fast Transformer Decoding: One Write-Head is All You Need https://arxiv.org/abs/1911.02150 1 comment
- [2001.04451] Reformer: The Efficient Transformer https://arxiv.org/abs/2001.04451 0 comments
- [2009.06732] Efficient Transformers: A Survey https://arxiv.org/abs/2009.06732 0 comments
- [2207.07061] Confident Adaptive Language Modeling https://arxiv.org/abs/2207.07061 0 comments
- [2210.17323] GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers https://arxiv.org/abs/2210.17323 0 comments
- GitHub - mit-han-lab/smoothquant: [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models https://github.com/mit-han-lab/smoothquant 0 comments
- Some Math behind Neural Tangent Kernel | Lil'Log https://lilianweng.github.io/posts/2022-09-08-ntk/ 0 comments