GitHub - mit-han-lab/streaming-llm: Efficient Streaming Language Models with Attention Sinks - discu.eu

Hacker News

Efficient streaming language models with attention sinks https://github.com/mit-han-lab/streaming-llm 65 comments 2/10/2023

Linking pages

The New Kings of Open Source AI (Oct 2023 Recap) https://www.latent.space/p/oct-2023 3 comments
GitHub - oscinis-com/Awesome-LLM-Productization: Awesome-LLM-Productization: a curated list of tools/tricks/news/regulations about AI and Large Language Model (LLM) productization https://github.com/oscinis-com/Awesome-LLM-Productization 1 comment
I've picked the top GitHub repos for you https://hackerpulse.substack.com/p/ive-picked-the-top-github-repos-for 1 comment
GitHub - intel/intel-extension-for-transformers: ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡ https://github.com/intel/intel-extension-for-transformers 0 comments
GitHub - AIoT-MLSys-Lab/Efficient-LLMs-Survey: Efficient Large Language Models: A Survey https://github.com/AIoT-MLSys-Lab/Efficient-LLMs-Survey 0 comments
ICLR 2024 — Best Papers & Talks (Benchmarks, Reasoning & Agents) — ft. Graham Neubig, Aman Sanger, Moritz Hardt) https://www.latent.space/p/iclr-2024-benchmarks-agents 0 comments

Linked pages

https://arxiv.org/abs/2309.17453 12 comments
