India AI Impact Summit 2026: Gnani.ai unveils India’s first 5B voice-to-voice AI model, Inya VoiceOS

HIGHLIGHTS

Inya VoiceOS processes and generates speech directly, without traditional speech-to-text or text-to-speech layers.

Trained on 14+ million hours of multilingual speech data, it supports over 15 Indian languages with sub-second latency.

The model is currently in research preview, and a larger 14B-parameter version is already in development.

Indian AI startup Gnani.ai has introduced what it describes as the country’s first 5-billion-parameter voice-to-voice foundational model under the India AI Mission. The model, called Inya VoiceOS, was showcased at the India AI Impact Summit 2026 at Bharat Mandapam, where Prime Minister Narendra Modi formally unveiled it.

Unlike conventional voice assistants that convert speech into text and then back into audio, Inya VoiceOS works directly with speech. The system processes and generates spoken responses natively, without relying on intermediate speech-to-text or text-to-speech layers. According to the company, this allows it to better retain tone, pauses, emotion and other subtle cues that shape human conversations.

Gnani.ai said the model has been trained on one of the largest sovereign voice datasets assembled for Indian languages. The model has 5 billion parameters, was pre-trained on over 14 million hours of multilingual speech data and was further fine-tuned on more than 1.2 million hours of task-specific audio. The training also incorporated trillions of text tokens to strengthen reasoning and linguistic grounding.

The company claims sub-second response times and 24 kHz audio output with natural-sounding prosody. It supports more than 15 Indian languages and is designed to handle code-mixed conversations, a common pattern in India, along with interruptions and overlapping speech during real-time interactions.
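To put the 24 kHz figure in perspective, here is a rough back-of-the-envelope sketch in Python. Only the sample rate comes from the company's claim. The 16-bit mono PCM format and the 20 ms frame size are assumptions made for illustration, since Gnani.ai has not published the output format.

```python
SAMPLE_RATE_HZ = 24_000  # from the article: 24 kHz audio output
BYTES_PER_SAMPLE = 2     # assumption: 16-bit PCM (not stated by Gnani.ai)
CHANNELS = 1             # assumption: mono output


def samples_in(duration_ms: int) -> int:
    """Number of audio samples in a frame of the given length."""
    return SAMPLE_RATE_HZ * duration_ms // 1000


def pcm_bytes(duration_ms: int) -> int:
    """Uncompressed size in bytes of audio of the given length."""
    return samples_in(duration_ms) * BYTES_PER_SAMPLE * CHANNELS


# Highest frequency a 24 kHz stream can represent (Nyquist limit)
nyquist_hz = SAMPLE_RATE_HZ // 2

print(samples_in(20))   # samples in a hypothetical 20 ms frame
print(pcm_bytes(1000))  # bytes per second of uncompressed audio
print(nyquist_hz)
```

Under these assumptions, one second of output is about 48 KB of raw audio, and the stream can carry frequencies up to 12 kHz, which is above the roughly 4 kHz ceiling of traditional 8 kHz telephone audio.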

Potential public-sector uses include multilingual government helplines, grievance redressal systems and emergency response platforms. In the private sector, it could power hands-free, voice-driven workflows in banking, insurance, healthcare and logistics.

The current release is a research preview, and Gnani.ai has indicated that a larger 14-billion-parameter version is in development. The company emphasised that the model has been built and deployed entirely within India.

Ashish Singh

Ashish Singh is the Chief Copy Editor at Digit. He's been wrangling tech jargon since 2020 (Times Internet, Jagran English '22). When not policing commas, he's likely fueling his gadget habit with coffee, strategising his next virtual race, or plotting a road trip to test the latest in-car tech. He speaks fluent Geek.