OpenAI has introduced Voice Engine, a model that generates synthetic speech from text input and a single 15-second audio sample. Anticipating potential misuse, OpenAI is limiting access for now and weighing the challenges of a broader release.

OpenAI Voice Engine
OpenAI has introduced Voice Engine, a model that uses text input and a single 15-second audio sample to generate natural-sounding speech that closely resembles the original speaker. The voice cloning model is currently available in preview to a small group of testers who have agreed to follow safety guidelines. Because synthetic voice technology can be misused, OpenAI says it is taking a cautious, informed approach to any broader release.
Voice Engine was first developed in late 2022 and can create emotive, realistic voices from that single 15-second sample. Based on results from small-scale tests, OpenAI sees potential for Voice Engine to be used for good across various industries.
According to OpenAI's official Voice Engine announcement, the model was developed in late 2022 and has been used to power the preset voices available in the text-to-speech API as well as ChatGPT Voice and Read Aloud. OpenAI writes: "At the same time, we are taking a cautious and informed approach to a broader release due to the potential for synthetic voice misuse. We hope to start a dialogue on the responsible deployment of synthetic voices, and how society can adapt to these new capabilities. Based on these conversations and the results of these small-scale tests, we will make a more informed decision about whether and how to deploy this technology at scale."
[Audio samples: current voice, reference audio, generated audio]
OpenAI is working closely with researchers, policymakers, and others to understand the challenges and possible outcomes of making Voice Engine available to a broader audience.
This post was last modified on March 29, 2024 11:56 pm