OpenAI launches GPT-Realtime-2 with live translation and transcription features
Key takeaways
- The company’s latest GPT‑Realtime‑2 is an additional voice model designed to produce realistic vocal simulations for conversational interactions.
- Additionally, the company is introducing GPT‑Realtime‑Translate, which is intended to offer real-time translation services that keep pace with ongoing conversations.
- Furthermore, a new transcription feature called GPT-Realtime-Whisper has been launched, providing users with live speech-to-text capabilities during interactions.
Why this matters: local context for readers following news across Pakistan and the region.
Add ARY News on Google AAResize Open AI announced on Thursday that its API now incorporates several new voice intelligence features to help developers build applications that can talk, transcribe, and translate conversations with users.
The company’s latest GPT‑Realtime‑2 is an additional voice model designed to produce realistic vocal simulations for conversational interactions. Unlike its predecessor, GPT-Realtime-1.5, this version is built with GPT‑5‑level reasoning, which Open AI claims enables it to handle more complex user requests.
Additionally, the company is introducing GPT‑Realtime‑Translate, which is intended to offer real-time translation services that keep pace with ongoing conversations. This feature supports over 70 input languages and 13 output languages.