Gnani.ai unveils Inya VoiceOS, India's first 5-billion-parameter voice-to-voice AI model

2 Sources

Share

Indian AI startup Gnani.ai launched Inya VoiceOS at the India AI Impact Summit 2026, marking a significant milestone as India's first voice-to-voice AI system. The 5-billion-parameter model processes speech directly without converting to text, trained on 14 million hours of multilingual Indian speech data. Prime Minister Narendra Modi formally unveiled the system, which supports over 15 Indian languages and handles code-mixed conversations.

Gnani.ai Introduces India's First Voice-to-Voice AI System

Gnani.ai unveiled Inya VoiceOS at the India AI Impact Summit 2026 held at Bharat Mandapam, where Prime Minister Narendra Modi formally introduced what the company describes as India's first voice-to-voice AI system

1

. The launch represents a significant step forward under the IndiaAI Mission, showcasing the country's expanding capabilities in artificial intelligence development. Unlike conventional voice assistants that rely on speech-to-text and text-to-speech conversion layers, Inya VoiceOS processes and generates spoken responses directly, working natively with speech without intermediate transformations

2

.

Source: Analytics Insight

Source: Analytics Insight

Technical Architecture Behind the 5-Billion-Parameter Voice-to-Voice AI

The AI foundational model operates on 5 billion parameters and has been pre-trained on over 14 million hours of multilingual Indian speech data, with additional fine-tuning on more than 1.2 million hours of task-specific audio

2

. Training also incorporated trillions of text tokens to strengthen reasoning and linguistic grounding. By processing speech directly, Inya VoiceOS retains conversational nuances such as tone, pauses, and emotion that typically get lost in traditional conversion methods. The company claims sub-second response times and 24 kHz audio output with natural-sounding prosody, delivering natural conversations that feel more human-like

2

.

Multilingual Capabilities and Code-Mixed Conversations

Inya VoiceOS supports more than 15 Indian languages and handles code-mixed conversations, a common communication pattern across India where speakers blend multiple languages within single exchanges

2

. The system manages interruptions and overlapping speech during real-time interactions, making it suitable for authentic conversational scenarios. Gnani.ai trained the model using one of the largest sovereign voice datasets assembled for Indian languages, ensuring the system understands regional linguistic variations and cultural communication styles

2

. The company built and deployed the entire system within India, emphasizing data sovereignty and local innovation

1

.

Applications Across Government and Private Sectors

The voice-to-voice AI model opens pathways for multilingual government helplines, grievance redressal systems, and emergency response platforms where quick, accurate communication proves critical

2

. For private sector deployment, Inya VoiceOS could power hands-free, voice-driven workflows in banking, insurance, healthcare, and logistics operations. The current release exists as a research preview, with Gnani.ai indicating that a larger 14-billion-parameter version is already in development

2

. This progression suggests the company aims to expand capabilities and accuracy as the technology matures, potentially positioning India as a competitive player in voice AI development alongside global counterparts.

Source: Digit

Source: Digit

Today's Top Stories

TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2026 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo