Hacker News
- A Python Library to 6-7x the inference speed of your HF models https://github.com/MDK8888/GPTFast 15 comments
- [N] LLM models up to 7 times acceleration. http://github.com/MDK8888/GPTFast 3 comments machinelearning
Linked pages
- GitHub - MDK8888/GPTFast: Accelerate your Hugging Face Transformers 6-7x. Native to Hugging Face and PyTorch.