Hacker News
- A speech-to-text practitioner’s criticisms of industry and academia https://thegradient.pub/a-speech-to-text-practitioners-criticisms-of-industry-and-academia/ 53 comments
Linking pages
- GitHub - snakers4/silero-models: Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple https://github.com/snakers4/silero-models 99 comments
- Silero Speech-To-Text Models | PyTorch https://pytorch.org/hub/snakers4_silero-models_stt/ 9 comments
- GitHub - amrzv/awesome-colab-notebooks: Collection of google colaboratory notebooks for fast and easy experiments https://github.com/amrzv/awesome-colab-notebooks 0 comments
- GitHub - snakers4/open_stt: Open STT https://github.com/snakers4/open_stt 0 comments
Linked pages
- Common Voice https://voice.mozilla.org/ 234 comments
- Goodhart's law - Wikipedia http://en.wikipedia.org/wiki/Goodhart%27s_law 221 comments
- The End of Starsky Robotics. In 2015, I got obsessed with the idea… | by Stefan Seltz-Axmacher | Starsky Robotics 10–4 Labs | Medium https://medium.com/starsky-robotics-blog/the-end-of-starsky-robotics-acb8a6a8a5f5 157 comments
- Bevor Sie zu YouTube weitergehen https://www.youtube.com/channel/UCYO_jab_esuFRV4b17AJtAw 128 comments
- Ways to think about machine learning — Benedict Evans https://www.ben-evans.com/benedictevans/2018/06/22/ways-to-think-about-machine-learning-8nefy 71 comments
- GitHub - espnet/espnet: End-to-End Speech Processing Toolkit https://github.com/espnet/espnet 28 comments
- [1512.02595] Deep Speech 2: End-to-End Speech Recognition in English and Mandarin http://arxiv.org/abs/1512.02595 19 comments
- More Efficient NLP Model Pre-training with ELECTRA – Google AI Blog https://ai.googleblog.com/2020/03/more-efficient-nlp-model-pre-training.html 12 comments
- Towards an ImageNet Moment for Speech-to-Text https://thegradient.pub/towards-an-imagenet-moment-for-speech-to-text/ 10 comments
- openslr.org http://www.openslr.org/12/ 4 comments
- Sequence Modeling with CTC https://distill.pub/2017/ctc/ 1 comment
- [1904.05862] wav2vec: Unsupervised Pre-training for Speech Recognition https://arxiv.org/abs/1904.05862 1 comment
- [1808.00158] Speaker Recognition from Raw Waveform with SincNet https://arxiv.org/abs/1808.00158 0 comments
- GitHub - snakers4/open_stt: Open STT https://github.com/snakers4/open_stt 0 comments
- GitHub - facebookresearch/fairseq: Facebook AI Research Sequence-to-Sequence Toolkit written in Python. https://github.com/pytorch/fairseq 0 comments
Related searches:
Search whole site: site:thegradient.pub
Search title: A Speech-To-Text Practitioner’s Criticisms of Industry and Academia
See how to search.