Hacker News
- FlexGen: Running large language models on a single GPU https://github.com/FMInference/FlexGen 43 comments
- FlexGen: Running large language models like OPT-175B/GPT-3 on a single GPU. Up to 100x faster than other offloading systems https://github.com/FMInference/FlexGen 2 comments coolgithubprojects
Linking pages
- What I learned from looking at 900 most popular open source AI tools https://huyenchip.com/2024/03/14/ai-oss.html 40 comments
- Mini-post: first look at LLaMA. Background | by Enryu | Mar, 2023 | Medium https://medium.com/@enryu9000/mini-post-first-look-at-llama-4403517d41a1 27 comments
- GitHub - taishi-i/awesome-ChatGPT-repositories: A curated list of resources dedicated to open source GitHub repositories related to ChatGPT https://github.com/taishi-i/awesome-ChatGPT-repositories 5 comments
- GitHub - tensorchord/Awesome-LLMOps: An awesome & curated list of best LLMOps tools for developers https://github.com/tensorchord/Awesome-LLMOps 5 comments
- awesome-marketing-datascience/awesome-ai.md at master · underlines/awesome-marketing-datascience · GitHub https://github.com/underlines/awesome-marketing-datascience/blob/master/awesome-ai.md 1 comment
- GitHub - NeuralCoder3/transpilation: A summary of ideas about transpilation -- work in progress https://github.com/NeuralCoder3/transpilation 1 comment
- GitHub - AIoT-MLSys-Lab/Efficient-LLMs-Survey: Efficient Large Language Models: A Survey https://github.com/AIoT-MLSys-Lab/Efficient-LLMs-Survey 0 comments
Linked pages
Related searches:
Search whole site: site:github.com
Search title: GitHub - FMInference/FlexGen: Running large language models on a single GPU for throughput-oriented scenarios.
See how to search.