Meta Releases Omnilingual ASR: Open-Source Speech Recognition for 1,600+ Languages

Reviewed byNidhi Govil

2 Sources

Share

Meta's Superintelligence Labs debuts an open-source automatic speech recognition system supporting over 1,600 languages, including 500 previously untranscribed low-resource languages. The system uses zero-shot learning to extend coverage to potentially 5,400+ languages.

News article

Meta Unveils Comprehensive Speech Recognition System

Meta's newly formed Superintelligence Labs has released Omnilingual Automatic Speech Recognition (ASR), an open-source transcription system that represents a significant leap in multilingual AI capabilities. The system supports over 1,600 languages and can transcribe 500 "low-resourced" languages for the first time, including Hwana from Nigeria, Rotokas from Papua New Guinea, and Güilá Zapotec from Mexico

1

.

The release marks a substantial expansion beyond existing solutions like OpenAI's Whisper model, which supports only 99 languages. Meta's system achieves character error rates under 10% in 78% of supported languages, according to the company's research paper

2

.

Technical Architecture and Zero-Shot Learning

The Omnilingual ASR suite includes multiple model families trained on more than 4.3 million hours of audio data. The system features wav2vec 2.0 models ranging from 300 million to 7 billion parameters, CTC-based ASR models for efficient transcription, and LLM-ASR models that combine speech encoders with Transformer-based text decoders

2

.

A key innovation is the zero-shot in-context learning capability, which allows users to provide just a few paired examples of audio and text in a new language at inference time. This feature enables the model to transcribe additional utterances in that language without requiring retraining, potentially expanding coverage to more than 5,400 languages

2

.

Addressing Digital Language Divide

Meta emphasizes that these "low-resourced" languages are not "well represented on the internet," creating barriers for speakers of less widely represented languages. The company hopes to bridge this digital divide by making high-quality transcriptions available for previously underserved communities

1

.

The base model, Omnilingual wav2vec 2.0, was trained to understand diverse languages and speech patterns without requiring extensive specific examples for each language. Meta has also released the Omnilingual ASR Corpus, containing transcribed speech in 350 underserved languages

1

.

Open Source Licensing and Availability

Unlike Meta's previous AI releases, Omnilingual ASR is distributed under a standard Apache 2.0 license rather than the restrictive Llama license that limited enterprise use. This change allows researchers and developers to implement the technology freely in commercial and enterprise projects without licensing fees

2

.

The complete suite is available on Meta's website, GitHub, and Hugging Face, including demonstration spaces and technical documentation. Meta worked with local organizations to recruit and compensate native speakers from remote or under-documented regions to ensure authentic language representation

1

.

Performance Limitations and Future Applications

Meta acknowledges that the system is "experimental software" and that transcriptions are not perfect, recommending users double-check outputs for accuracy. The company notes that performance may not match fully trained systems but provides a scalable approach for bringing new languages into digital platforms

1

.

The technology supports various applications including voice assistants, transcription tools, subtitles, oral archive digitization, and accessibility features. This release positions Meta competitively against other tech companies leveraging large language models for translation, including Google Translate's zero-shot capabilities and real-time translation features in Apple's AirPods

1

.

TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2025 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo