Microsoft VALL-E Clones Your Voice in 3 Seconds

Microsoft VALL-E Clones Your Voice in 3 Seconds

Kaustubh Katdare
@thebigk

Updated: Oct 27, 2024

Views: 479

AI's turned into a mimicry artist. Microsoft's VALL-E, a neural code language model can learn to clone any voice in just 3 seconds. The AI model can work on a small audio clip of the target speaker and train itself to synthesize high-quality, personalised speech.

Microsoft engineers have trained VALL-E on 60K hours of data, which is 100x larger than any existing system used for text to speech synthesis (TTS).

The research paper indicates that VALL-E can preserve the naturalness and emotions and acoustic environment of the target speaker.

0

Replies

Howdy guest!

Dear guest, you must be logged-in to participate on CrazyEngineers. We would love to have you as a member of our community. Consider creating an account or login.

About CrazyEngineers

The official CrazyEngineers Community

Founded on: Nov 26, 2005

Members online

Currently there are no users online.

Activity feed

Eagle Spa has joined CrazyEngineers as a new member

9h
Md Salma Sulthana has joined CrazyEngineers as a new member

13h
CHANCHAL MONDAL has joined CrazyEngineers as a new member

1d
Nishanth Sakthi has joined CrazyEngineers as a new member

1d
Sanjay Kokate has joined CrazyEngineers as a new member

1d
Greta replied on Cam for Inclined...

1d
Greta replied on Need a Name...

1d
Greta Blake has joined CrazyEngineers as a new member

1d
Cristiana has posted Kenapa Semua Pemula...

1d
Code replied on Starting From Nashik...

1d

CrazyEngineers is ⚡ powered by
Jatra Community Software

Home Channels Search Login Register