Key takeaways
The ChatGPT creator has reorganized several internal teams over the past two months to overhaul its audio AI capabilities, consolidating engineering, product, and research functions under a single initiative led by Kundan Kumar, a researcher recruited from Character.AI.
The move signals OpenAI's strategic shift toward audio-first interfaces and dedicated hardware.
The company is targeting a Q1 2026 launch for its new audio model, with internal deadlines set for the end of March.
The upcoming model aims to produce more natural-sounding speech, handle real-time back-and-forth interactions more effectively, and even speak while users are talking—capabilities that current models cannot manage.
OpenAI will reportedly base the model on a new architecture, though it remains unclear whether this represents an entirely different algorithm design or a new transformer implementation.
Current and former employees have indicated that OpenAI's audio models currently lag behind its text-based models in accuracy and response speed, making these improvements crucial for the company's planned consumer device launch.
Jony Ive leads hardware vision
The audio AI improvements are being developed in preparation for OpenAI's first line of consumer devices, expected to launch approximately one year from now.
The hardware initiative is being shaped by legendary Apple designer Jony Ive, whose firm io Products was acquired by OpenAI for $6.5 billion in May 2025.
In a joint statement announcing the acquisition, OpenAI CEO Sam Altman wrote on X: "thrilled to be partnering with Jony, imo the greatest designer in the world. excited to try to create a new generation of AI-powered computers."
Altman and Ive elaborated on their collaboration in a statement posted on OpenAI's website. "AI is an incredible technology, but great tools require work at the intersection of technology, design, and understanding people and the world," Altman said.
Ive expressed his enthusiasm for the project, stating: "I have a growing sense that everything I have learned over the last 30 years has led me to this moment.
While I am both anxious and excited about the responsibility of the substantial work ahead, I am so grateful for the opportunity to be part of such an important collaboration."
The acquisition brought approximately 55 engineers, scientists, researchers, and product development specialists from io Products into OpenAI. Ive and his design firm LoveFrom maintain independence while assuming deep design and creative responsibilities across OpenAI.
Multiple device concepts in development
OpenAI is reportedly exploring a family of audio-focused devices with various form factors, including smart glasses, screenless speakers, and a voice-operated pen-like device.
The devices are being designed to act as persistent audio interfaces rather than traditional screen-based gadgets, with an emphasis on reducing device addiction.
Industry reports indicate that Ive has made reducing screen dependence a priority, viewing audio-first design as an opportunity to address shortcomings of previous consumer electronics.
The devices under development aim to enable voice-first control and contextual awareness through cameras and microphones, allowing users to interact with AI capabilities without screens.
Manufacturing for the first device is reportedly being handled by Foxconn, with production likely taking place in Vietnam or the United States rather than China.
The project carries the internal codename "Gumdrop," according to supply chain sources.
Industry-wide shift to audio interfaces
OpenAI's move into audio-first hardware aligns with broader industry trends where voice is becoming the primary interface for AI interactions.
Smart speakers are already present in more than one-third of U.S. homes, while companies like Meta, Google, and others are developing audio-centric AI experiences across various form factors, including smart glasses and wearables.
The company faces challenges in encouraging widespread adoption of voice interaction with AI products, as many users remain unaccustomed to this mode of engagement.
However, the combination of improved audio models, sophisticated hardware design, and OpenAI's established AI capabilities positions the company to potentially define the reference experience for conversational AI devices.
Read more:
Microsoft CEO Predicts 2026 Will Mark Critical Shift In AI Adoption
Google Expands AI Overviews To 2 Billion Users Worldwide
Bytedance Turns To Huawei AI Chips Amid U.S. Restrictions