Hacker News
- Amphion: An open-source audio, music, and speech generation toolkit https://github.com/open-mmlab/Amphion 2 comments
Linked pages
- GitHub - openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision https://github.com/openai/whisper/ 126 comments
- Official Drivers | NVIDIA http://www.nvidia.com/Download/index.aspx 103 comments
- Installation Guide — NVIDIA Cloud Native Technologies documentation https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html 31 comments
- https://arxiv.org/pdf/2006.11239.pdf 25 comments
- https://arxiv.org/abs/2301.02111 15 comments
- CUDA Toolkit 12.1 Downloads | NVIDIA Developer https://developer.nvidia.com/cuda-downloads 5 comments
- GitHub - CompVis/latent-diffusion: High-Resolution Image Synthesis with Latent Diffusion Models https://github.com/CompVis/latent-diffusion 5 comments
- [2106.06103] Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech https://arxiv.org/abs/2106.06103 2 comments
- [2010.02502] Denoising Diffusion Implicit Models https://arxiv.org/abs/2010.02502 1 comment
- GitHub - jik876/hifi-gan: HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis https://github.com/jik876/hifi-gan 0 comments
- [1910.06711] MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis https://arxiv.org/abs/1910.06711 0 comments
- GitHub - resemble-ai/Resemblyzer: A python package to analyze and compare voices with deep learning https://github.com/resemble-ai/Resemblyzer 0 comments
- GitHub - facebookresearch/encodec: State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio. https://github.com/facebookresearch/encodec 0 comments