Hacker News
- High-Speed Large Language Model Serving on PCs with Consumer-Grade GPUs (https://github.com/SJTU-IPADS/PowerInfer), 83 comments