Meta Introduces Generative AI Model For Speech ‘Voicebox’

0
27
Meta Introduces Generative AI Model For Speech ‘Voicebox’


New Delhi: Meta has developed a cutting-edge generative AI mannequin ‘Voicebox’, designed to revolutionise the sphere of speech technology. “We’ve developed Voicebox, the first model that can generalise to speech-generation tasks it was not specifically trained to accomplish with state-of-the-art performance,” Meta stated in a weblog publish.

According to the corporate, Voicebox generates photos and textual content in a wide range of kinds, and it might probably create outputs from scratch or modify samples offered to it. However, as an alternative of making an image or a passage of textual content, Voicebox produces high-quality audio clips.

The mannequin helps speech synthesis throughout six languages, together with English, French, German, Spanish, Polish, and Portuguese, in addition to performs noise removing, content material enhancing, fashion conversion, and numerous pattern technology.

Moreover, Meta stated that Voicebox makes use of a brand new strategy to studying simply from uncooked audio and an accompanying transcription. Unlike autoregressive fashions for audio technology, Voicebox can modify any a part of a given pattern, not simply the top of an audio clip it’s given.

Further, the tech big stated that Voicebox is educated to foretell a speech section when given the encompassing speech and the transcript of the section.

Once the mannequin has realized to infill speech from context, it may be utilized throughout a variety of speech technology duties, together with producing parts of an audio recording with out re-creating the complete recording.

This versatility permits Voicebox to carry out effectively throughout a wide range of duties, together with — in-context text-to-speech synthesis, cross-lingual fashion switch, speech denoising and enhancing, and numerous speech sampling.





Source hyperlink