OpenAI launched three real-time voice models, bringing GPT-5-class reasoning, 70-language translation, and live transcription ...
OpenAI announced its most advanced speech-to-speech AI model yet, GPT-Realtime. The new model, now available through OpenAI’s updated Realtime API, is said to be more reliable and cheaper than the ...
The new features could be handy for customer service systems, but OpenAI says they have applications that work across a ...
What’s new: OpenAI released three voice AI models with real-time reasoning, translation, and transcription capabilities, aiming to make conversations more interactive and task-oriented. Who’s testing: ...
The three are GPT-Realtime-2, a successor to the company’s existing realtime voice model with what OpenAI describes as GPT-5-class reasoning; GPT-Realtime-Translate, a live translation model with more ...
AI thrives on data but feeding it the right data is harder than it seems. As enterprises scale their AI initiatives, they face the challenge of managing diverse data pipelines, ensuring proximity to ...
What if your next phone call with customer support didn’t feel like a frustrating maze of robotic prompts but instead like a natural, empathetic conversation? Imagine an AI that not only understands ...
OpenAI's Realtime API is now optimized and generally available. You can try its latest speech-to-speech model, gpt-realtime. The upgrades improve OpenAI's voice ...
Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with content, and download exclusive resources. Dany Lepage discusses the architectural ...
Agora's Conversational AI Engine offers key enhancements to the Realtime API for more natural communication and interaction. This milestone builds on Agora's partnership with OpenAI, as the Realtime ...
OpenAI unveiled new API updates at its Dev Day on Monday, introducing GPT-5 Pro, its latest language model, its new video generation model Sora 2, and a smaller, cheaper voice model. The addition of ...