- [R] Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers https://arxiv.org/abs/2301.02111 15 comments machinelearning
Linking pages
- Microsoft’s new AI can simulate anyone’s voice with 3 seconds of audio | Ars Technica https://arstechnica.com/information-technology/2023/01/microsofts-new-ai-can-simulate-anyones-voice-with-3-seconds-of-audio/?comments=1&comments-page=3 1293 comments
- Microsoft’s new AI can simulate anyone’s voice with 3 seconds of audio | Ars Technica https://arstechnica.com/information-technology/2023/01/microsofts-new-ai-can-simulate-anyones-voice-with-3-seconds-of-audio/ 124 comments
- GitHub - microsoft/unilm: Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities https://github.com/microsoft/unilm 104 comments
- GitHub - suno-ai/bark: 🔊 Text-prompted Generative Audio Model https://github.com/suno-ai/bark 93 comments
- bark/README.md at main · suno-ai/bark · GitHub https://github.com/suno-ai/bark/blob/main/README.md 60 comments
- GitHub - aimerou/top-ai-papers: A curated list of the most impressive AI papers https://github.com/aimerou/top-ai-papers 9 comments
- Microsoft To Launch VALL-E, A Voice DALL-E https://www.theinsaneapp.com/2023/01/microsoft-launched-a-voice-based-dall-e-called-vall-e.html 7 comments
- Microsoft VALL-E AI Can Clone Your Voice From 3-Second Audio Clip https://www.businessinsider.com/microsoft-chatgpt-vall-e-valle-voice-text-clone-listen-clip 7 comments
- This new AI can simulate your voice from just 3 seconds of audio | Fox News https://www.foxnews.com/tech/new-ai-simulate-voice-3-seconds-audio 1 comment
- New Microsoft AI can accurately mimic a human voice after analyzing a 3-second sample | TechSpot https://www.techspot.com/news/97217-new-microsoft-ai-can-accurately-mimic-human-voice.html 0 comments
- GitHub - serp-ai/bark-with-voice-clone: 🔊 Text-prompted Generative Audio Model - With the ability to clone voices https://github.com/serp-ai/bark-with-voice-clone 0 comments