Sarvam AI Launches Sarvam-1: A Breakthrough LLM for Indian Languages

5 Sources

Sarvam AI, an Indian startup, has introduced Sarvam-1, a large language model optimized for 10 Indian languages and English. This 2-billion-parameter model outperforms larger competitors and addresses key challenges in processing Indic languages.

News article

Sarvam-1: A Groundbreaking LLM for Indian Languages

Bengaluru-based startup Sarvam AI has unveiled Sarvam-1, a pioneering large language model (LLM) designed specifically for Indian languages 1. This 2-billion-parameter model supports 10 major Indian languages alongside English, marking a significant advancement in natural language processing for the region 2.

Technical Specifications and Performance

Sarvam-1 operates on a specialized tokenizer developed by Sarvam AI, trained on 4 trillion tokens using NVIDIA's H100 Tensor Core GPUs 1. The model demonstrates exceptional performance, outperforming larger models like Gemma-2-2B and Llama-3.2-3B on standard benchmarks 3.

Key achievements include:

  • Improved token efficiency: 1.4-2.1 tokens per word for Indian languages, compared to 4-8 in existing models 4
  • Superior performance on benchmarks like MMLU, ARC-Challenge, and IndicGenBench 3
  • 4-6 times faster inference speeds compared to larger models 3

Training Data and Methodology

The model's training corpus, Sarvam-2T, consists of approximately 2 trillion tokens evenly distributed across the supported languages, with Hindi making up about 20% of the data 2. Sarvam AI employed advanced synthetic-data-generation techniques to create high-quality training datasets, addressing the lack of depth in existing web-crawled Indic language data 2.

Applications and Availability

Sarvam-1 is designed to power a range of applications, including:

  • Voice and messaging agents
  • Automated customer support
  • Voice recognition
  • Language translation tools 1

The base model is available for download on Hugging Face, allowing developers to create AI applications for Indic language users 4.

Industry Collaboration and Infrastructure

Sarvam AI partnered with Yotta Data Services for the model's development, utilizing Yotta's Shakti Cloud infrastructure 4. The training process involved 1,024 GPUs over a five-day period, leveraging NVIDIA's NeMo framework 5.

Significance and Future Implications

Sarvam-1 represents a milestone in India's AI journey, potentially positioning the country as a leader in AI innovation 5. By addressing the technological gap faced by billions of Indic language speakers, the model could democratize access to advanced NLP capabilities across various sectors, including legal, public, finance, and others 5.

As the first LLM trained entirely with data, research, and compute from India, Sarvam-1 marks the beginning of Sarvam AI's mission to build full-stack sovereign AI for the country 5. This development aligns with the growing emphasis on localized AI solutions and could significantly impact the AI landscape in India and beyond.

Explore today's top stories

NVIDIA Unveils Major GeForce NOW Upgrade with RTX 5080 Performance and Expanded Game Library

NVIDIA announces significant upgrades to its GeForce NOW cloud gaming service, including RTX 5080-class performance, improved streaming quality, and an expanded game library, set to launch in September 2025.

CNET logoengadget logoPCWorld logo

9 Sources

Technology

6 hrs ago

NVIDIA Unveils Major GeForce NOW Upgrade with RTX 5080

Space: The New Frontier of 21st Century Warfare

As nations compete for dominance in space, the risk of satellite hijacking and space-based weapons escalates, transforming outer space into a potential battlefield with far-reaching consequences for global security and economy.

AP NEWS logoTech Xplore logoeuronews logo

7 Sources

Technology

22 hrs ago

Space: The New Frontier of 21st Century Warfare

OpenAI Tweaks GPT-5 to Be 'Warmer and Friendlier' Amid User Backlash

OpenAI updates GPT-5 to make it more approachable following user feedback, sparking debate about AI personality and user preferences.

ZDNet logoTom's Guide logoFuturism logo

6 Sources

Technology

14 hrs ago

OpenAI Tweaks GPT-5 to Be 'Warmer and Friendlier' Amid User

Russian Disinformation Campaign Exploits AI to Spread Fake News

A pro-Russian propaganda group, Storm-1679, is using AI-generated content and impersonating legitimate news outlets to spread disinformation, raising concerns about the growing threat of AI-powered fake news.

Rolling Stone logoBenzinga logo

2 Sources

Technology

22 hrs ago

Russian Disinformation Campaign Exploits AI to Spread Fake

AI in Healthcare: Patients Trust AI Medical Advice Over Doctors, Raising Concerns and Challenges

A study reveals patients' increasing reliance on AI for medical advice, often trusting it over doctors. This trend is reshaping doctor-patient dynamics and raising concerns about AI's limitations in healthcare.

ZDNet logoMedscape logoEconomic Times logo

3 Sources

Health

14 hrs ago

AI in Healthcare: Patients Trust AI Medical Advice Over
TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2025 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo