Hacker News
Linking pages
- Representation Engineering Mistral-7B an Acid Trip https://vgel.me/posts/representation-engineering/ 75 comments
- Aman's AI Journal • Primers • Overview of Large Language Models https://aman.ai/primers/ai/LLM/ 1 comment
- The Four Wars of the AI Stack (Dec 2023 Recap) https://www.latent.space/p/dec-2023 0 comments
- The Four Wars of the AI Stack (Dec 2023 Recap) https://www.latent.space/i/140396949/mixtral-sparks-a-gpuinference-war 0 comments
Linked pages
- GitHub - ggerganov/llama.cpp: Port of Facebook's LLaMA model in C/C++ https://github.com/ggerganov/llama.cpp 286 comments
- [2305.13048] RWKV: Reinventing RNNs for the Transformer Era https://arxiv.org/abs/2305.13048 171 comments
- The Best GPUs for Deep Learning in 2023 — An In-depth Analysis https://timdettmers.com/2023/01/30/which-gpu-for-deep-learning/ 145 comments
- I made a transformer by hand (no training!) https://vgel.me/posts/handmade-transformer/ 95 comments
- GitHub - 1rgs/jsonformer https://github.com/1rgs/jsonformer 83 comments
- How to Optimize a CUDA Matmul Kernel for cuBLAS-like Performance: a Worklog https://siboehm.com/articles/22/CUDA-MMM 49 comments
- Compiling ML models to C for fun | Max Bernstein https://bernsteinbear.com/blog/compiling-ml-models/ 47 comments
- [2312.00752] Mamba: Linear-Time Sequence Modeling with Selective State Spaces https://arxiv.org/abs/2312.00752 42 comments
- [2309.06180] Efficient Memory Management for Large Language Model Serving with PagedAttention https://arxiv.org/abs/2309.06180 16 comments
- LLM.int8() and Emergent Features — Tim Dettmers https://timdettmers.com/2022/08/17/llm-int8-and-emergent-features/ 15 comments
- GitHub - artidoro/qlora: QLoRA: Efficient Finetuning of Quantized LLMs https://github.com/artidoro/qlora 5 comments
- https://twitter.com/voooooogel/status/1730726744314069190 4 comments
- [2205.14135] FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness https://arxiv.org/abs/2205.14135 3 comments
- [2302.10866] Hyena Hierarchy: Towards Larger Convolutional Language Models https://arxiv.org/abs/2302.10866 3 comments
- k-quants by ikawrakow · Pull Request #1684 · ggerganov/llama.cpp · GitHub https://github.com/ggerganov/llama.cpp/pull/1684 3 comments
- Break the Sequential Dependency of LLM Inference Using Lookahead Decoding | LMSYS Org https://lmsys.org/blog/2023-11-21-lookahead-decoding/ 2 comments
- [1911.02150] Fast Transformer Decoding: One Write-Head is All You Need https://arxiv.org/abs/1911.02150 1 comment
- GitHub - karpathy/micrograd: A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API https://github.com/karpathy/micrograd 0 comments
- https://proceedings.neurips.cc/paper_files/paper/2022/file/47e288629a6996a17ce50b90a056a0e1-Paper-Conference.pdf 0 comments
- GitHub - TimDettmers/bitsandbytes: 8-bit CUDA functions for PyTorch https://github.com/TimDettmers/bitsandbytes 0 comments
Article: How to make LLMs go fast (vgel.me)