Hume AI Unveils Voice Control: A Breakthrough in Customizable AI Voices

Hume AI, a New York-based artificial intelligence firm, has unveiled Voice Control, an experimental tool for customizing AI-generated voices. Launched on Monday, the feature lets users and developers create custom AI voices without coding, AI prompt engineering, or sound design skills [1].

The Technology Behind Voice Control

Voice Control provides granular control over 10 dimensions of voice characteristics: gender, assertiveness, buoyancy, confidence, enthusiasm, nasality, relaxedness, smoothness, tepidity, and tightness [1]. Each dimension is adjusted through a slider that ranges from -100 to +100, allowing precise fine-tuning of vocal attributes [1].
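To make the slider model concrete, the ten dimensions and their -100 to +100 range can be sketched as a small data structure. This is only an illustration under stated assumptions: the class name `VoiceSettings`, the `adjusted` helper, the default of 0 for each slider, and the choice to reject out-of-range values are hypothetical, and none of this reflects Hume's actual API or internal representation.

```python
from dataclasses import dataclass, fields, replace

# Slider bounds as reported for each dimension.
SLIDER_MIN, SLIDER_MAX = -100, 100


@dataclass(frozen=True)
class VoiceSettings:
    """Hypothetical container for the ten reported voice dimensions.

    A default of 0 is assumed here to mean "leave the base voice unchanged".
    """

    gender: int = 0
    assertiveness: int = 0
    buoyancy: int = 0
    confidence: int = 0
    enthusiasm: int = 0
    nasality: int = 0
    relaxedness: int = 0
    smoothness: int = 0
    tepidity: int = 0
    tightness: int = 0

    def __post_init__(self) -> None:
        # Reject any slider value outside the reported -100..+100 range.
        for f in fields(self):
            value = getattr(self, f.name)
            if not SLIDER_MIN <= value <= SLIDER_MAX:
                raise ValueError(
                    f"{f.name} must be between {SLIDER_MIN} and {SLIDER_MAX}, got {value}"
                )

    def adjusted(self, **changes: int) -> "VoiceSettings":
        # Return a new settings object with some sliders changed;
        # replace() re-runs __post_init__, so the range check still applies.
        return replace(self, **changes)


base = VoiceSettings(confidence=40, nasality=-25)
livelier = base.adjusted(enthusiasm=100)
```

Treating the settings as immutable and validating on construction keeps every stored value inside the slider range, which mirrors how a bounded slider behaves in a user interface.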

Addressing Industry Challenges

Voice Control addresses a significant pain point in the AI industry: the reliance on preset voices that often fail to meet specific brand or application needs. By offering this level of customization, Hume AI aims to provide a safer and more flexible alternative to voice cloning, a practice that has raised ethical and practical concerns [2].

Integration and Accessibility

Voice Control is currently available in beta and can be accessed by anyone registered on Hume's platform. The tool integrates with Hume's Empathic Voice Interface (EVI) AI model, likely utilizing the EVI-2 model for this experimental feature [1]. This integration means that customized voices can be deployed in various applications, from customer service chatbots to digital assistants and accessibility features.

The Science Behind the Technology

Hume's approach is rooted in emotion science and utilizes a proprietary model based on cross-cultural voice recordings paired with emotional survey data. The company claims to have developed a new "unsupervised approach" that preserves most characteristics of each base voice when specific parameters are varied [1]. This methodology allows for the disentanglement of different voice dimensions, resulting in audible and distinct changes when adjustments are made.

Future Developments and Industry Impact

Looking ahead, Hume plans to expand the range of base voices, introduce additional interpretable dimensions, and develop advanced tools for analyzing and visualizing voice characteristics [1]. These developments could reshape the landscape of voice AI technology, offering new possibilities for personalized and emotionally intelligent voice interfaces across various industries.

Competitive Landscape

Hume's focus on voice customization and emotional intelligence positions it as a strong competitor in the voice AI space. While companies like OpenAI and ElevenLabs offer libraries of pre-set voices, Hume's approach to granular customization sets it apart in the market [2]. This tool could have far-reaching implications for industries relying on AI-driven voice solutions, from customer service to entertainment and beyond.

As Voice Control enters the market, it represents a significant step forward in the evolution of AI-driven voice technology, offering unprecedented levels of customization and control to developers and users alike.

References

[1] This AI Tool Will Let You Customise Voices for AI Systems

[2] Hume launches Voice Control allowing users and developers to make custom AI voices