Hacker News
- PowerInfer: Fast Large Language Model Serving with a Consumer-Grade GPU [pdf] https://ipads.se.sjtu.edu.cn/_media/publications/powerinfer-20231219.pdf 9 comments