Sarvam AI Launches Sarvam-1: A Breakthrough LLM for Indian Languages

5 Sources

Sarvam AI, an Indian startup, has introduced Sarvam-1, a large language model optimized for 10 Indian languages and English. This 2-billion-parameter model outperforms larger competitors and addresses key challenges in processing Indic languages.

News article

Sarvam-1: A Groundbreaking LLM for Indian Languages

Bengaluru-based startup Sarvam AI has unveiled Sarvam-1, a pioneering large language model (LLM) designed specifically for Indian languages 1. This 2-billion-parameter model supports 10 major Indian languages alongside English, marking a significant advancement in natural language processing for the region 2.

Technical Specifications and Performance

Sarvam-1 operates on a specialized tokenizer developed by Sarvam AI, trained on 4 trillion tokens using NVIDIA's H100 Tensor Core GPUs 1. The model demonstrates exceptional performance, outperforming larger models like Gemma-2-2B and Llama-3.2-3B on standard benchmarks 3.

Key achievements include:

  • Improved token efficiency: 1.4-2.1 tokens per word for Indian languages, compared to 4-8 in existing models 4
  • Superior performance on benchmarks like MMLU, ARC-Challenge, and IndicGenBench 3
  • 4-6 times faster inference speeds compared to larger models 3

Training Data and Methodology

The model's training corpus, Sarvam-2T, consists of approximately 2 trillion tokens evenly distributed across the supported languages, with Hindi making up about 20% of the data 2. Sarvam AI employed advanced synthetic-data-generation techniques to create high-quality training datasets, addressing the lack of depth in existing web-crawled Indic language data 2.

Applications and Availability

Sarvam-1 is designed to power a range of applications, including:

  • Voice and messaging agents
  • Automated customer support
  • Voice recognition
  • Language translation tools 1

The base model is available for download on Hugging Face, allowing developers to create AI applications for Indic language users 4.

Industry Collaboration and Infrastructure

Sarvam AI partnered with Yotta Data Services for the model's development, utilizing Yotta's Shakti Cloud infrastructure 4. The training process involved 1,024 GPUs over a five-day period, leveraging NVIDIA's NeMo framework 5.

Significance and Future Implications

Sarvam-1 represents a milestone in India's AI journey, potentially positioning the country as a leader in AI innovation 5. By addressing the technological gap faced by billions of Indic language speakers, the model could democratize access to advanced NLP capabilities across various sectors, including legal, public, finance, and others 5.

As the first LLM trained entirely with data, research, and compute from India, Sarvam-1 marks the beginning of Sarvam AI's mission to build full-stack sovereign AI for the country 5. This development aligns with the growing emphasis on localized AI solutions and could significantly impact the AI landscape in India and beyond.

Explore today's top stories

OpenAI Challenges Court Order to Preserve Deleted ChatGPT Conversations Amid NYT Lawsuit

OpenAI appeals a court order requiring it to indefinitely store deleted ChatGPT conversations as part of The New York Times' copyright lawsuit, citing user privacy concerns and setting a precedent for AI data retention.

The Verge logoengadget logoGizmodo logo

9 Sources

Technology

16 hrs ago

OpenAI Challenges Court Order to Preserve Deleted ChatGPT

Anysphere's Cursor AI Coding Assistant Secures $900M Funding, Reaches $9.9B Valuation

Anysphere, the company behind the AI coding assistant Cursor, has raised $900 million in funding, reaching a $9.9 billion valuation. The startup has surpassed $500 million in annual recurring revenue, making it potentially the fastest-growing software startup ever.

TechCrunch logoBloomberg Business logoSiliconANGLE logo

4 Sources

Technology

16 hrs ago

Anysphere's Cursor AI Coding Assistant Secures $900M

US-UAE AI Data Campus Deal Faces Security Hurdles Despite High-Profile Announcement

A multi-billion dollar deal to build one of the world's largest AI data center hubs in the UAE, involving major US tech companies, is far from finalized due to persistent security concerns and geopolitical complexities.

Reuters logoEconomic Times logoInvesting.com logo

4 Sources

Technology

8 hrs ago

US-UAE AI Data Campus Deal Faces Security Hurdles Despite

PwC Report Reveals AI's Positive Impact on Job Market: Workers Become 'More Valuable'

A new PwC study challenges common fears about AI's impact on jobs, showing that AI is actually creating jobs, boosting wages, and increasing worker value across industries.

CNBC logoEconomic Times logo

2 Sources

Business and Economy

8 hrs ago

PwC Report Reveals AI's Positive Impact on Job Market:

AI Film Festival Showcases the Future of Movie-Making Technology

Runway's AI Film Festival in New York highlights the growing role of artificial intelligence in filmmaking, showcasing innovative short films and sparking discussions about AI's impact on the entertainment industry.

AP NEWS logoABC News logoThe Seattle Times logo

5 Sources

Technology

8 hrs ago

AI Film Festival Showcases the Future of Movie-Making
TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

Β© 2025 Triveous Technologies Private Limited
Twitter logo
Instagram logo
LinkedIn logo