Microsoft unveils text-to-speech AI model VALL-E, which was trained on English speech data and can simulate a person's voice with three seconds of sample audio (Damir Yalalov/Metaverse Post)

[ad_1]


Damir Yalalov / Metaverse Post:

Microsoft unveils text-to-speech AI model VALL-E, which was trained on English speech data and can simulate a person’s voice with three seconds of sample audio  —  IN BRIEF  —  With just a three-second sample of any voice, the transformer-based TTS model VALL-E can produce speech in every voice.

Source link

The post Microsoft unveils text-to-speech AI model VALL-E, which was trained on English speech data and can simulate a person's voice with three seconds of sample audio (Damir Yalalov/Metaverse Post) appeared first on The Alike.

[ad_2]

Source link

Comments are closed.