OpenAI launches new voice intelligence features in its API
Key takeaways
- OpenAI said Thursday that its API will now include a number of new voice intelligence features designed to help developers create apps that can talk, transcribe, and translate conversations with users.
- The company's new GPT‑Realtime‑2 is another voice model, built to create a realistic vocal simulation that can converse with users.
- The company is also launching GPT‑Realtime‑Translate, which, as the name suggests, is designed to provide real-time translation that keeps pace with the user's conversation.
OpenAI said Thursday that its API will now include a number of new voice intelligence features designed to help developers create apps that can talk, transcribe, and translate conversations with users.
The company's new GPT‑Realtime‑2 is another voice model, built to create a realistic vocal simulation that can converse with users. However, unlike its predecessor (GPT-Realtime-1.5), this one is built with GPT‑5‑class reasoning that OpenAI says was created to deal with more complicated requests from users.
The company is also launching GPT‑Realtime‑Translate, which, as the name suggests, is designed to provide real-time translation that keeps pace with the user's conversation. The feature supports more than 70 input languages (that is, the languages it can comprehend) and 13 output languages (the languages it relays to the speaker).