Hume AI Unveils Voice Control: A Breakthrough in Customizable AI Voices

Hume AI launches Voice Control, an innovative tool allowing users to create custom AI voices by adjusting 10 distinct vocal dimensions, offering a new level of personalization in voice AI technology.

Hume AI Introduces Voice Control: A New Era of Customizable AI Voices

Hume AI, a New York-based artificial intelligence firm, has unveiled Voice Control, a tool for building custom AI-generated voices. The experimental feature, launched on Monday, lets users and developers create custom AI voices without coding, AI prompt engineering, or sound design skills 1, 2.

The Technology Behind Voice Control

Voice Control provides granular control over 10 dimensions of voice character: gender, assertiveness, buoyancy, confidence, enthusiasm, nasality, relaxedness, smoothness, tepidity, and tightness 1. Each dimension is set with a slider that ranges from -100 to +100, allowing fine-tuning of individual vocal attributes 1, 2.
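
To make the slider model concrete, here is a minimal Python sketch of how ten dimensions, each limited to the -100 to +100 range, might be represented. The dimension names and the range come from the reporting above; the class name, the method name, the neutral default of 0, and the clamping behavior are all assumptions for illustration and do not reflect Hume's actual API or interface.

```python
from dataclasses import dataclass, field

# Dimension names and slider range as reported in the article.
DIMENSIONS = (
    "gender", "assertiveness", "buoyancy", "confidence", "enthusiasm",
    "nasality", "relaxedness", "smoothness", "tepidity", "tightness",
)
SLIDER_MIN, SLIDER_MAX = -100, 100


@dataclass
class VoiceSettings:
    """Hypothetical container holding one slider value per dimension."""

    # Assumption: every slider starts at 0, treated here as the neutral midpoint.
    values: dict = field(default_factory=lambda: {d: 0 for d in DIMENSIONS})

    def set_slider(self, dimension: str, value: int) -> None:
        """Store a slider value, clamping it into the allowed range."""
        if dimension not in DIMENSIONS:
            raise KeyError(f"unknown dimension: {dimension}")
        self.values[dimension] = max(SLIDER_MIN, min(SLIDER_MAX, value))


settings = VoiceSettings()
settings.set_slider("assertiveness", 40)
settings.set_slider("nasality", -250)  # out of range, so it is clamped to -100
```

Clamping is only one possible design choice; a real interface could just as plausibly reject out-of-range values outright.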

Addressing Industry Challenges

The introduction of Voice Control addresses a significant pain point in the AI industry: the reliance on preset voices that often fail to meet specific brand or application needs. By offering this level of customization, Hume AI aims to provide a safer and more flexible alternative to voice cloning, a practice that has raised ethical and practical concerns 2.

Integration and Accessibility

Voice Control is currently available in beta to anyone registered on Hume's platform. The tool works with Hume's Empathic Voice Interface (EVI), and this experimental feature likely uses the EVI-2 model 1, 2. This integration means customized voices can be deployed in applications ranging from customer service chatbots to digital assistants and accessibility features.

The Science Behind the Technology

Hume's approach is rooted in emotion science and uses a proprietary model based on cross-cultural voice recordings paired with emotional survey data. The company claims to have developed a new "unsupervised approach" that preserves most characteristics of each base voice when specific parameters are varied 1, 2. According to the company, this disentangles the voice dimensions so that adjusting one produces an audible, distinct change.

Future Developments and Industry Impact

Looking ahead, Hume plans to expand the range of base voices, introduce additional interpretable dimensions, and develop advanced tools for analyzing and visualizing voice characteristics 1. These developments could potentially reshape the landscape of voice AI technology, offering new possibilities for personalized and emotionally intelligent voice interfaces across various industries.

Competitive Landscape

Hume's focus on voice customization and emotional intelligence positions it as a strong competitor in the voice AI space. While companies like OpenAI and ElevenLabs offer libraries of pre-set voices, Hume's approach to granular customization sets it apart in the market 2. This innovative tool could have far-reaching implications for industries relying on AI-driven voice solutions, from customer service to entertainment and beyond.

As Voice Control enters the market, it gives developers and users finer-grained control over AI voices than libraries of preset voices provide.

TheOutpost.ai
© 2025 Triveous Technologies Private Limited