OpenAI Slashes Realtime API Costs and Expands Voice Capabilities

2 Sources

OpenAI announces significant cost reductions for its Realtime API and introduces new voice options, potentially revolutionizing AI-powered voice assistants and chatbots.

News article

OpenAI Announces Major Cost Reductions for Realtime API

OpenAI, the leading AI company, has unveiled significant cost reductions and new features for its Realtime API at a developers' event in London. The company plans to implement automatic caching of audio and text inputs, which could slash the cost of long conversations by up to 80 percent 1.

New Pricing Structure

The new pricing structure aims to make the Realtime API more accessible to developers:

  • Cached text inputs will see a 50% price reduction
  • Cached audio inputs will enjoy an 80% discount

This move addresses concerns from developers who previously found the API pricing prohibitively expensive for many use cases 2.

Expanded Voice Capabilities

In addition to cost reductions, OpenAI has introduced five new voices for speech-to-speech applications on its platform. The company showcased three of these voices - Ash, Verse, and the British-sounding Ballad - in a post on X 2. These new voices are designed to be more expressive and easier to control than previous iterations.

Applications and Potential Impact

The Realtime API, released in early October, is designed for creating applications featuring voice assistants and AI agents. It's already being utilized by companies such as Healthify, Speak, and Twilio 1. The API enables developers to build bots that can interact through voice or text and perform actions like ordering food or scheduling appointments.

With the new pricing structure and enhanced voice capabilities, OpenAI is positioning itself to revolutionize various industries:

  1. Customer Service: Companies can develop more responsive and cost-effective voice-based customer service platforms.
  2. Voice-overs: Users can generate voice-overs using AI-generated voices, similar to platforms like Replica and ElevenLabs 2.
  3. Real-time Communication: The improvements in latency and expressiveness of the voices could lead to more natural-sounding AI interactions.

Challenges and Considerations

While these advancements are promising, OpenAI acknowledges some challenges:

  1. Network Dependency: The company warns that network conditions heavily affect real-time audio processing, which can be challenging when conditions are unpredictable 2.
  2. Authentication: As the API is still in beta, OpenAI cannot offer client-side authentication at this time 2.
  3. Ethical Concerns: OpenAI's history with AI-powered speech has been controversial, as evidenced by the pause in using one of their voices after actress Scarlett Johansson spoke out about its similarity to her voice 2.

As OpenAI continues to innovate in the realm of AI-powered speech and text interactions, these latest developments in the Realtime API represent a significant step forward in making advanced AI capabilities more accessible and affordable for developers and businesses alike.

Explore today's top stories

AI-Designed Antibiotics Show Promise in Fighting Drug-Resistant Superbugs

MIT researchers use generative AI to create novel antibiotics effective against drug-resistant bacteria, including gonorrhea and MRSA, potentially ushering in a new era of antibiotic discovery.

IEEE Spectrum logoMassachusetts Institute of Technology logoBBC logo

8 Sources

Science and Research

19 hrs ago

AI-Designed Antibiotics Show Promise in Fighting

Cohere Raises $500 Million, Hires Meta's AI Research Head in Bid to Challenge AI Giants

Canadian AI startup Cohere secures $500 million in funding, reaching a $6.8 billion valuation, and appoints former Meta AI research head Joelle Pineau as Chief AI Officer, positioning itself as a secure enterprise AI solution provider.

TechCrunch logoFinancial Times News logoReuters logo

13 Sources

Business and Economy

19 hrs ago

Cohere Raises $500 Million, Hires Meta's AI Research Head

Brain Implant Decodes Inner Speech with Password Protection, Advancing AI-Assisted Communication

Scientists have developed a brain-computer interface that can decode inner speech with up to 74% accuracy, using a password system to protect user privacy. This breakthrough could revolutionize communication for people with severe speech impairments.

Nature logoNew Scientist logoNews-Medical logo

9 Sources

Science and Research

19 hrs ago

Brain Implant Decodes Inner Speech with Password

AI-Generated Errors in Australian Murder Case Highlight Legal Risks of Artificial Intelligence

A senior Australian lawyer apologizes for submitting AI-generated fake quotes and non-existent case judgments in a murder trial, causing a 24-hour delay and raising concerns about AI use in legal proceedings.

AP NEWS logoeuronews logoCBS News logo

9 Sources

Technology

3 hrs ago

AI-Generated Errors in Australian Murder Case Highlight

TeraWulf Secures $3.7B AI Hosting Deal Backed by Google, Pivoting from Bitcoin Mining

TeraWulf, a Bitcoin mining company, has signed a major AI infrastructure hosting deal with Fluidstack, backed by Google. This pivot could significantly boost the company's revenue and marks a shift in strategy for cryptocurrency miners facing challenges.

Cointelegraph logoEconomic Times logoBenzinga logo

7 Sources

Business and Economy

19 hrs ago

TeraWulf Secures $3.7B AI Hosting Deal Backed by Google,
TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2025 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo