Hacker News
- OpenAI releases Whisper v3, new generation open source ASR model https://github.com/openai/whisper 58 comments
- How can I get word-level timestamps in OpenAI's Whisper ASR? https://github.com/openai/whisper 22 comments languagetechnology
- [N] OpenAI's Whisper released https://github.com/openai/whisper/ 46 comments machinelearning
Linking pages
- Introducing Whisper https://openai.com/blog/whisper/ 547 comments
- GitHub - ggerganov/whisper.cpp: Port of OpenAI's Whisper model in C/C++ https://github.com/ggerganov/whisper.cpp 212 comments
- GitHub - abus-aikorea/voice-pro: Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downloading, vocal isolation(UVR5), Text-to-Speech (Edge-TTS), and multi-language translation. Perfect for content creators and developers. https://github.com/abus-aikorea/voice-pro 192 comments
- GitHub - pluja/awesome-privacy: Awesome Privacy - A curated list of services and alternatives that respect your privacy because PRIVACY MATTERS. https://github.com/pluja/awesome-privacy 124 comments
- GitHub - jianchang512/pyvideotrans: Translate the video from one language to another and add dubbing. https://github.com/jianchang512/pyvideotrans 116 comments
- GitHub - McCloudS/subgen: Autogenerate subtitles using OpenAI Whisper Model via Jellyfin, Plex, and Tautulli https://github.com/McCloudS/subgen 105 comments
- GitHub - schibsted/WAAS: Whisper as a Service (Basic WIP API for transcribing speech) https://github.com/schibsted/WAAS 92 comments
- Underjord | Why do ML on the Erlang VM? https://underjord.io/why-ml-on-erlang.html 87 comments
- GitHub - elanmart/cbp-translate https://github.com/elanmart/cbp-translate 80 comments
- GitHub - niedev/RTranslator: Open source real-time translation app for Android that runs locally https://github.com/niedev/RTranslator 64 comments
- GitHub - modal-labs/quillman https://github.com/modal-labs/quillman 60 comments
- Year of the Voice - Chapter 2: Let's talk - Home Assistant https://www.home-assistant.io/blog/2023/04/27/year-of-the-voice-chapter-2/ 57 comments
- GitHub - innovatorved/whisper.api: This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model. https://github.com/innovatorved/whisper.api 50 comments
- GitHub - Vaibhavs10/insanely-fast-whisper https://github.com/Vaibhavs10/insanely-fast-whisper 43 comments
- Speech to Text is finally ready - Whisper review. - Circus Scientist https://www.circusscientist.com/2022/09/28/speech-to-text-is-finally-ready-whisper-review/ 36 comments
- smoores.dev - Phonetic Matching https://smoores.dev/post/phonetic_matching/ 36 comments
- The rise of self-hosted apps https://chromakode.com/post/the-rise-of-self-hosted-apps/ 35 comments
- Putting a full power search engine in Ecto https://moosie.us/parade_db_ecto 25 comments
- Hacker Public Radio ~ The Technology Community Podcast http://hackerpublicradio.org/eps.php?id=1428 23 comments
- SOTA ASR Tooling: Long-form Transcription - by Amgad Hasan https://amgadhasan.substack.com/p/sota-asr-tooling-long-form-transcription 21 comments
Linked pages
- FFmpeg http://ffmpeg.org/index.html#message 1113 comments
- Rust Programming Language https://www.rust-lang.org/ 595 comments
- Introducing Whisper https://openai.com/blog/whisper/ 547 comments
- PyTorch http://pytorch.org/ 100 comments
- GitHub - openai/tiktoken https://github.com/openai/tiktoken 74 comments