GitHub - FMInference/FlexGen: Running large language models on a single GPU for throughput-oriented scenarios. - discu.eu

Hacker News

FlexGen: Running large language models on a single GPU https://github.com/FMInference/FlexGen 43 comments 26/3/2023

Reddit

FlexGen: Running large language models like OPT-175B/GPT-3 on a single GPU. Up to 100x faster than other offloading systems https://github.com/FMInference/FlexGen 2 comments 21/2/2023 coolgithubprojects

Linking pages

What I learned from looking at 900 most popular open source AI tools https://huyenchip.com/2024/03/14/ai-oss.html 41 comments
Mini-post: first look at LLaMA. Background | by Enryu | Mar, 2023 | Medium https://medium.com/@enryu9000/mini-post-first-look-at-llama-4403517d41a1 27 comments
GitHub - taishi-i/awesome-ChatGPT-repositories: A curated list of resources dedicated to open source GitHub repositories related to ChatGPT https://github.com/taishi-i/awesome-ChatGPT-repositories 5 comments
GitHub - tensorchord/Awesome-LLMOps: An awesome & curated list of best LLMOps tools for developers https://github.com/tensorchord/Awesome-LLMOps 5 comments
awesome-marketing-datascience/awesome-ai.md at master · underlines/awesome-marketing-datascience · GitHub https://github.com/underlines/awesome-marketing-datascience/blob/master/awesome-ai.md 1 comment
GitHub - NeuralCoder3/transpilation: A summary of ideas about transpilation -- work in progress https://github.com/NeuralCoder3/transpilation 1 comment
GitHub - AIoT-MLSys-Lab/Efficient-LLMs-Survey: Efficient Large Language Models: A Survey https://github.com/AIoT-MLSys-Lab/Efficient-LLMs-Survey 0 comments

Linked pages

Related searches:

Search whole site: site:github.com

Search title: GitHub - FMInference/FlexGen: Running large language models on a single GPU for throughput-oriented scenarios.

See how to search.

Submit link to: