Hacker News
- Towards an ImageNet Moment for Speech-to-Text https://thegradient.pub/towards-an-imagenet-moment-for-speech-to-text/ 10 comments
Linking pages
- GitHub - snakers4/silero-models: Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple https://github.com/snakers4/silero-models 99 comments
- A Speech-To-Text Practitioner’s Criticisms of Industry and Academia https://thegradient.pub/a-speech-to-text-practitioners-criticisms-of-industry-and-academia/ 53 comments
- Silero Speech-To-Text Models | PyTorch https://pytorch.org/hub/snakers4_silero-models_stt/ 9 comments
- GitHub - theblackcat102/edgedict: Working online speech recognition based on RNN Transducer. ( Trained model release available in release ) https://github.com/theblackcat102/Online-Speech-Recognition 2 comments
- GitHub - amrzv/awesome-colab-notebooks: Collection of google colaboratory notebooks for fast and easy experiments https://github.com/amrzv/awesome-colab-notebooks 0 comments
- GitHub - snakers4/open_stt: Open STT https://github.com/snakers4/open_stt 0 comments
Linked pages
- A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet https://people.xiph.org/~jm/demo/lpcnet_codec/ 73 comments
- NLP's ImageNet moment has arrived https://thegradient.pub/nlp-imagenet/ 42 comments
- Goodhart’s Law: Are Academic Metrics Being Gamed? https://thegradient.pub/over-optimization-of-academic-publishing-metrics/ 29 comments
- [1512.02595] Deep Speech 2: End-to-End Speech Recognition in English and Mandarin http://arxiv.org/abs/1512.02595 19 comments
- [1905.11946] EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks https://arxiv.org/abs/1905.11946 10 comments
- [1512.03385] Deep Residual Learning for Image Recognition http://arxiv.org/abs/1512.03385 6 comments
- Sequence Modeling with CTC https://distill.pub/2017/ctc/ 1 comment
- Common Voice https://voice.mozilla.org/en/datasets 1 comment
- NLP's ImageNet moment has arrived http://ruder.io/nlp-imagenet/ 1 comment
- [1909.13719] RandAugment: Practical automated data augmentation with a reduced search space https://arxiv.org/abs/1909.13719 0 comments
- [1409.0473] Neural Machine Translation by Jointly Learning to Align and Translate http://arxiv.org/abs/1409.0473 0 comments
- GitHub - snakers4/open_stt: Open STT https://github.com/snakers4/open_stt 0 comments
- GitHub - google/sentencepiece: Unsupervised text tokenizer for Neural Network-based text generation. https://github.com/google/sentencepiece 0 comments