Linking pages
- SOTA ASR Tooling: Long-form Transcription - by Amgad Hasan https://amgadhasan.substack.com/p/sota-asr-tooling-long-form-transcription 21 comments
- GitHub - transitive-bullshit/yt-semantic-search: OpenAI-powered semantic search for any YouTube playlist – featuring the All-In Podcast. 💪 https://github.com/transitive-bullshit/yt-semantic-search 5 comments
- Speaker Diarization for Whisper-Generated Transcripts https://www.ufarooqi.com/speaker-diarization-for-whisper-transcripts/ 5 comments
- Using AI to turn Youtube videos into Karaoke https://www.jaxgeller.com/using-ai-to-turn-youtube-videos-into-karaoke/ 2 comments
- GitHub - AakashKumarNain/annotated_research_papers: This repo contains annotated research papers that I found really good and useful https://github.com/AakashKumarNain/annotated_research_papers 0 comments
- awesome-whisper/readme.md at main · sindresorhus/awesome-whisper · GitHub https://github.com/sindresorhus/awesome-whisper/blob/main/readme.md 0 comments
- GitHub - sindresorhus/awesome-whisper: 🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI https://github.com/sindresorhus/awesome-whisper 0 comments
- Announcing the most cost-effective audio transcription API - Sieve Blog https://www.sievedata.com/blog/commoditizing-audio-transcription 0 comments
- GitHub - Huanshere/VideoLingo: Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组 https://github.com/Huanshere/VideoLingo 0 comments
- Yay Emacs: Tweaking my video workflow with WhisperX and subed-record :: Sacha Chua https://sachachua.com/blog/2024/10/yay-emacs-tweaking-my-video-workflow-with-whisperx-and-subed-record/ 0 comments
Linked pages
- GitHub - openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision https://github.com/openai/whisper/ 126 comments
- GitHub - guillaumekln/faster-whisper: Faster Whisper transcription with CTranslate2 https://github.com/guillaumekln/faster-whisper 16 comments
- Previous PyTorch Versions | PyTorch https://pytorch.org/get-started/previous-versions/ 9 comments
- pyannote/speaker-diarization · Hugging Face https://huggingface.co/pyannote/speaker-diarization 6 comments
- GitHub - OpenNMT/CTranslate2: Fast inference engine for Transformer models https://github.com/OpenNMT/CTranslate2 3 comments
- [2303.00747] WhisperX: Time-Accurate Speech Transcription of Long-Form Audio https://arxiv.org/abs/2303.00747 0 comments
- pyannote/speaker-diarization-3.1 · Hugging Face https://huggingface.co/pyannote/speaker-diarization-3.1 0 comments
Related searches:
Search whole site: site:github.com
Search title: GitHub - m-bain/whisperX: WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
See how to search.