OpenAI unveils three audio models for real-time voice tasks
Key takeaways
- OpenAI introduced three audio models for its developer platform on Thursday, aiming to make voice-based software agents more conversational and capable of completing tasks in real time.
- The launch of the application programming interface (API) moves the ChatGPT maker beyond transcription and chat toward agents that can listen, translate and act during live conversations.
- The new models are GPT-Realtime-2, GPT-Realtime-Translate and GPT-Realtime-Whisper.
Why this matters: local context for readers following news across Pakistan and the region.
OpenAI introduced three audio models for its developer platform on Thursday, aiming to make voice-based software agents more conversational and capable of completing tasks in real time.
The launch of the application programming interface (API) moves the ChatGPT maker beyond transcription and chat toward agents that can listen, translate and act during live conversations.
The new models are GPT-Realtime-2, GPT-Realtime-Translate and GPT-Realtime-Whisper. OpenAI said they are available to test in its developer playground.
Article preview — originally published by ARY News. Full story at the source.
Also covered by
Investing.com
OpenAI unveils three audio models for real-time voice tasks
9to5Mac
OpenAI has new voice models that reason, translate, and transcribe as you speak
TechCrunch
OpenAI launches new voice intelligence features in its API
Aggregated and edited by the Scoop newsroom. We surface news from ARY News alongside other reporting so you can compare coverage in one place.