Hacker News
- High-Throughput Generative Inference of Large Language Models with a Single GPU https://arxiv.org/abs/2303.06865 36 comments
Linking pages
- GitHub - HazyResearch/aisys-building-blocks: Building blocks for foundation models. https://github.com/HazyResearch/aisys-building-blocks 1 comment
- Ahead of AI #7: Large Language Models 3.0 https://magazine.sebastianraschka.com/p/ahead-of-ai-7-large-language-models 0 comments
- GitHub - AIoT-MLSys-Lab/Efficient-LLMs-Survey: Efficient Large Language Models: A Survey https://github.com/AIoT-MLSys-Lab/Efficient-LLMs-Survey 0 comments
Related searches:
Search whole site: site:arxiv.org
Search title: [2303.06865] High-throughput Generative Inference of Large Language Models with a Single GPU
See how to search.