Built by former Meta and Microsoft engineers, KittenTTS is a tiny open-weight voice AI model designed to run locally on CPUs ...
Xiaomi has open-sourced OmniVoice, a multilingual AI voice cloning model supporting hundreds of languages with fast speech ...
A fan-favorite alien from Andy Weir’s sci-fi novel Project Hail Mary has come to ...
The three are GPT-Realtime-2, a successor to the company’s existing realtime voice model with what OpenAI describes as GPT-5-class reasoning; GPT-Realtime-Translate, a live translation model with more ...
The new features could be handy for customer service systems, but OpenAI says they have applications that work across a ...
GPT‑Realtime‑Whisper is a new streaming transcription model built for low-latency speech-to-text. It transcribes audio as ...
OpenAI launched three new audio models that can reason, translate across 70+ languages, and transcribe speech in real time, ...
OpenAI has introduced three new audio models through its API, expanding its push into ...
OpenAI launches GPT Realtime 2 for advanced voice reasoning alongside a new Codex Chrome extension to automate browser ...
The launch ⁠of ⁠the application programming interface (API) moves the ChatGPT-maker beyond ​transcription and chat toward ...
There has always been one glaring issue with Voice AI demos. It seems like magic until something too complicated is thrown at ...