Maya Research raises $1.9M to build voice AI that speaks like a local across languages

2 Sources

Share

Maya Research, a Bengaluru and San Francisco-based AI startup, has raised $1.9 million in a seed round led by South Park Commons to develop conversational AI models designed for native speakers across languages and dialects. The company's emotive voice model Maya 1 ranks sixth globally among open-weight models on Speech Arena, placing it on par with OpenAI's GPT-Realtime-2.

Maya Research Secures Funding to Build Voice AI for Billions

Maya Research has raised $1.9 million in a seed round led by South Park Commons, marking a significant step for the AI startup focused on building voice interfaces that speak and respond like native speakers across languages and cultural contexts

1

2

. Founded in 2025 by New York University graduates Dheemanth Reddy and Bharath Kumar Kakumani, the Bengaluru and San Francisco-based company aims to serve the next four to five billion internet users who will interact with technology primarily through voice rather than text.

Source: ET

Source: ET

"The next four to five billion people will not use AI the way today's power users do," Dheemanth Reddy, co-founder and CEO, told ET. "For them, voice is not a feature. It is how they live. The interface has to think and speak like them"

1

. The company plans to use the capital to train larger conversational models, expand deployment infrastructure, and deepen its understanding of how users in voice-first markets interact with AI systems.

Conversational AI Models That Think Beyond Text-to-Speech

While most Voice AI companies treat voice as a layer built on top of text—where users speak, a large language model generates a response, and a text-to-speech engine reads it back—Maya Research believes this approach misses the larger opportunity

1

. The startup is building conversational AI models designed to speak, think, and respond like native speakers across local languages, dialects, and cultural contexts.

"Models today know how to talk, but they don't know what to talk about," Reddy explained. "Humans carry hesitation, affirmation, uncertainty and emotion inside conversations. The challenge is not generating speech. The challenge is deciding what to say, when to say it and how to say it"

1

. This philosophy sets Maya apart in a crowded market that includes startups like ElevenLabs and Cartesia, as well as Indian companies such as Gnani.ai, Skit.ai, and Yellow.ai.

Maya 1 Competes with OpenAI on Speech Arena

Maya 1, the company's emotive voice model released under Apache 2.0, currently ranks sixth globally among open-weight models on Speech Arena, a benchmark used to evaluate conversational speech systems

2

. The model holds a Quality Elo score of 1,051, placing it on par with OpenAI's GPT-Realtime-2

2

. Maya Research is the only Indian company represented on the Speech Arena leaderboard.

The startup has crossed 440,000 model downloads on Hugging Face, while its consumer application has surpassed 3 million downloads across India, Southeast Asia, and the Middle East and North Africa region

2

. Telugu-speaking users form Maya's largest market today, followed by users in Uttar Pradesh and West Bengal, with significant usage among women discussing topics ranging from shopping and family life to devotion and parenting

1

.

Building AI Sovereignty Through Regional Dialect Support

Maya operates both a model platform and a consumer application. Its models are commercially available through FAL, while its consumer app creates a data flywheel that allows the company to observe how users interact with conversational AI across different languages and regions

1

. The company employs people to travel to villages and towns across India to record conversations and understand how people naturally speak, collecting regional variations and dialects rather than relying on standardized language datasets.

"India is not a text economy, it never was," Reddy stated. "While the conversation around AI sovereignty has focused on large language models and text interfaces, the more urgent question is: whose voice models are Indians talking to? Right now, that answer is almost entirely foreign. Maya exists to change that"

2

. This focus on regional dialect support reflects a broader belief that voice AI adoption in emerging markets will depend as much on cultural familiarity as technological capability.

Targeting Voice-First Markets and Discovery Layer Opportunities

"The internet was built around English and text, which quietly left most of the world outside the interface," said Prateek Mehta, general partner at South Park Commons. "As voice becomes the way billions of people interact with technology, the company with the richest multilingual speech data and the strongest conversational models will have a defining advantage"

1

.

While many consumer AI companies experiment with subscription models, Reddy said the larger opportunity lies in becoming a discovery and navigation layer that helps users access products, services, and information they may not otherwise find. The company eventually expects to generate revenue by helping users discover relevant financial products and services

1

. Bharath Kumar Kakumani, co-founder and CTO, emphasized that "technology should feel magical to people" and belong to them rather than feeling foreign

2

.

Today's Top Stories

© 2026 TheOutpost.AI All rights reserved