site:mobiusml.github.io - discu.eu

Hacker News

Faster and Smaller Whisper: A Deep Dive into Quantization and Torch Compilation
https://mobiusml.github.io/whisper-static-cache-blog/
0 comments, 4/6/2024