[R] Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers - discu.eu

Reddit

[R] Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers https://arxiv.org/abs/2301.02111 15 comments 6/1/2023 machinelearning

Linking pages

Microsoft’s new AI can simulate anyone’s voice with 3 seconds of audio | Ars Technica https://arstechnica.com/information-technology/2023/01/microsofts-new-ai-can-simulate-anyones-voice-with-3-seconds-of-audio/?comments=1&comments-page=3 1293 comments
VALL-E https://valle-demo.github.io/ 151 comments
Microsoft’s new AI can simulate anyone’s voice with 3 seconds of audio | Ars Technica https://arstechnica.com/information-technology/2023/01/microsofts-new-ai-can-simulate-anyones-voice-with-3-seconds-of-audio/ 124 comments
GitHub - microsoft/unilm: Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities https://github.com/microsoft/unilm 104 comments
GitHub - suno-ai/bark: 🔊 Text-prompted Generative Audio Model https://github.com/suno-ai/bark 93 comments
GitHub - 2noise/ChatTTS: TTS https://github.com/2noise/ChatTTS 82 comments
bark/README.md at main · suno-ai/bark · GitHub https://github.com/suno-ai/bark/blob/main/README.md 60 comments
GitHub - aimerou/top-ai-papers: A curated list of the most impressive AI papers https://github.com/aimerou/top-ai-papers 9 comments
Microsoft To Launch VALL-E, A Voice DALL-E https://www.theinsaneapp.com/2023/01/microsoft-launched-a-voice-based-dall-e-called-vall-e.html 7 comments
Microsoft VALL-E AI Can Clone Your Voice From 3-Second Audio Clip https://www.businessinsider.com/microsoft-chatgpt-vall-e-valle-voice-text-clone-listen-clip 7 comments
This new AI can simulate your voice from just 3 seconds of audio | Fox News https://www.foxnews.com/tech/new-ai-simulate-voice-3-seconds-audio 1 comment
ⓍTTS - TTS 0.19.0 documentation https://tts.readthedocs.io/en/dev/models/xtts.html#training 1 comment
New Microsoft AI can accurately mimic a human voice after analyzing a 3-second sample | TechSpot https://www.techspot.com/news/97217-new-microsoft-ai-can-accurately-mimic-human-voice.html 0 comments
GitHub - serp-ai/bark-with-voice-clone: 🔊 Text-prompted Generative Audio Model - With the ability to clone voices https://github.com/serp-ai/bark-with-voice-clone 0 comments
VALL-E https://plachtaa.github.io/ 0 comments

