OpenAI’s voice cloning AI model only needs a 15-second sample to work

A rendition of OpenAI’s logo, which looks like a stylized whirlpool.
Illustration: The Verge

OpenAI is offering limited access to a text-to-voice generation platform it developed called Voice Engine, which can create a synthetic voice based on a 15-second clip of someone’s voice. The AI-generated voice can read out text prompts on command in the same language as the speaker or in a number of other languages. “These small scale deployments are helping to inform our approach, safeguards, and thinking about how Voice Engine could be used for good across various industries,” OpenAI said in its blog post.

Companies with access include the education technology company Age of Learning, visual storytelling platform HeyGen, frontline health software maker Dimagi, AI communication app creator Livox, and health system Lifespan.

In these...

Continue reading…



from The Verge - All Posts https://ift.tt/cFThD8s

Comments

Popular posts from this blog

Gemini app finally expands to audio files

Amazon is offering a like-new Kindle Paperwhite 2024 for just $107

Apple Watch Series 8, SE 2, and Ultra hands-on: triple the fun