The Outpost is a comprehensive collection of curated artificial intelligence software tools that cater to the needs of small business owners, bloggers, artists, musicians, entrepreneurs, marketers, writers, and researchers.
© 2025 TheOutpost.AI All rights reserved
Curated by THEOUTPOST
On Thu, 3 Apr, 12:02 AM UTC
2 Sources
[1]
Gladia launches Solaria as AI-based multi-lingual speech recognition model for speech-to-text transcription
Gladia, an AI transcription and audio intelligence provider, launched Solaria, a next-gen automatic speech recognition (ASR) model designed to redefine real-time communications for call centers and other voice-first platforms. Solaria now empowers businesses to enhance and expand their customer service operations with AI-powered voice technology that delivers unmatched language coverage -- supporting 40+ languages previously inaccessible with other solutions -- without compromising quality or speed. While outsourcing has long been a cost reduction strategy in the call center industry, businesses now face a new, critical challenge: providing seamless, multilingual support at scale. With 49% of global executives reporting financial losses due to language barriers, the demand for scalable, high-quality multilingual solutions has never been greater. "We've seen in the market a huge surge in voice AI. It's like voice is part of our life again, and we are introducing a new product called Solaria, which is a model that is real time with advanced capabilities," said Jean-Louis Queguiner, CEO of Gladia, in an interview with GamesBeat. "And it's going to be the fastest on the market, and the most accurate in the market, covering 100 languages." The product also has features like real time sentiment analysis and real time translation, he said. It handles speech to text translation and transcription. This is important to do in real time for voice agents or call centers, where someone may have to answer a question that comes in with a different language. Solaria: An enterprise-ready model for global customer experience Solaria is a speech-to-text (STT) engine built for global scalability. Solaria was designed to meet the demands of today's contact centers, where both AI automation and human agents need high-accuracy, low-latency, and real-time multilingual support to succeed. The model achieves industry-leading results in speech recognition, delivering both accuracy and fast processing speed. Recent benchmarks show Solaria has reached an unmatched 94% Word Accuracy Rate (WAR) average in English, Spanish, French and other common languages, while maintaining an ultra-low latency of 270 millisecond, making the conversation feel natural and responsive. While real-time speech-to-text is often measured by speed alone, accuracy and language coverage are equally crucial for businesses providing seamless services across regions. Unlike other speech-to-text models that prioritize speed over usability, Solaria balances industry-leading accuracy and speed with unmatched language coverage -- 100 languages in total, with exclusive support for 42 languages not matched by competitors. For high-population markets and key outsourcing hubs like Bangladesh, India, and The Philippines, native-level accuracy in regional languages is now offered through Solaria. With native-level transcription, real-time code-switching, and translation across all supported languages, businesses can expand into global markets without constraints. Designed for enterprise-scale voice automation, Solaria delivers: Best-in-class accuracy in high-population languages such as Tagalog, Bengali, Punjabi, Tamil, Urdu, Persian, and Marathi. Ability to adapt the model to industry-specific terminology (like medical or financial jargon) and have it extract critical data, like names, addresses, and numericals. Adaptive speech processing, ensuring high accuracy in noisy call center environments. Enterprise-grade data security, in full compliance with GDRP, HIPPA, and SOC 2. With the addition of Solaria to its product portfolio, Gladia allows businesses to enhance customer service by improving AI-powered voice agents, making IVRs and virtual assistants more reliable across multiple languages, while also optimizing human-assisted workflows with real-time transcriptions and translations to help agents provide more effective assistance. "Speech is the most natural way to connect with the world -- for the first time, automated speech recognition is closing the divide, enabling humans and AI to truly speak the same language," said Jean-Louis Quéguiner, CEO of Gladia, in a statement. "With Solaria, we have made a breakthrough in AI-powered voice technology that unlocks new opportunities for businesses, driving efficiency and delivering more seamless, impactful customer experiences across diverse languages and markets. Solaria is built for next-generation voice platforms ready to lead this transformation on a global scale." Serving more than 700 enterprise customers worldwide, including Attention, Circleback, Method Financial, and VEED.IO, Gladia delivers enterprise-grade service and scalability, backed by dedicated support and infrastructure in the U.S. and Europe, guaranteeing reliable performance for mission-critical applications. Companies looking to scale globally, optimize operational costs, and enhance customer experiences can start building with Gladia's API today. As part of the Solaria launch, Gladia has partnered with LiveKit, a leading open-source developer framework for real-time AI voice agents, to power real-time, multilingual translation within AI-driven applications. This gives developers global language capabilities out of the box through seamless integration with Gladia's API. Following its $16 million Series A round in 2024 and today's rollout of Solaria, Gladia has taken another critical step toward establishing itself as a leading end-to-end API audio infrastructure provider -- combining speech recognition, generative AI, and voice generation capabilities to help enterprise users and developers tap into the full potential of real-time audio data. Paris-based Gladia was founded in 2022 by Jean-Louis Queguiner (ex-OVHCloud) and Jonathan Soto (ex-MIT/Sigfox). Gladia's product has been adopted by over 150,000 users and 700 enterprise clients -- including industry leaders like Attention, Circleback, Method Financial, and VEED.IO. There is a 300 millisecond delay between the moment you start speaking and the moment you receive the first event of voice being activated. It takes 100 milliseconds to do the transcription and so you have near instant results. To improve the accuracy further, Queguiner said the company needs to train on more data. And it needs to work with the data augmentations to make the data more robust. The company has enterprise pricing in price but has not disclosed it yet. He said it will be among the most affordable solutions in the market.
[2]
French startup Gladia launches next-generation multilingual speech-to-text AI model Solaria - SiliconANGLE
French startup Gladia launches next-generation multilingual speech-to-text AI model Solaria Paris-based artificial intelligence startup Gladia SAS, developer of AI transcription and audio intelligence, today announced the launch of Solaria, a state-of-the-art AI model designed for real-time multilingual communications. Although many businesses outsource transcription and translation for call centers and other business uses to save on cost, it is becoming increasingly necessary to build real-time support to handle global customer bases. According to a 2023 market report from language industry analyst Slator 49% of executives surveyed worldwide stated that they saw financial losses due to language barriers. Gladia said it built Solaria to deliver industry-leading results in speech recognition with high accuracy at ultra-fast speeds compared to the competition in the market. Company benchmarks have shown the AI is capable of an average word accuracy rate of 94% -- the highest in the industry -- for English, Spanish, French and other common languages. When a user starts talking, its fastest time to the first word is around 270 milliseconds, making it one of the most responsive speech-to-text models in the industry. This is like the time it takes when speaking to Apple Inc.'s Siri or "Hey, Google," and how long the user has to wait for the first words to appear on the screen. This also demonstrates how quickly the AI reacts when it's interrupted mid-sentence, allowing it to quickly adjust and react. The lower the latency, the more fluid a conversation it can have with the user. Deepgram Inc.'s platform is the only competitor with a shorter latency, at a 223-millisecond response. The AI delivers complete transcripts in just 698 milliseconds, which is almost half a second faster than most competitors. Deepgram takes an average of 1040 ms, while Speechmatics takes around 1158 ms. "Speech is the most natural way to connect with the world -- for the first time, automated speech recognition is closing the divide, enabling humans and AI to truly speak the same language," said Jean-Louis Quéguiner, chief executive and co-founder of Gladia. Gladia said Solaria is built to handle 100 languages including support for 42 underserved languages not matched by its competitors. The company's team included native-level accuracy for high-population markets and regional languages common to call-center outsourcing hubs such as Tagalog, Bengali, Punjabi, Tamil, Urdu, Persian and Marathi. It also covers emerging voice markets such as Hatian Creole, Maori, Javanese and Malagasy. The company built the AI to adapt and learn industry-specific terminology so that it can fit into business-critical operations and understand employee speech patterns including medical or financial jargon. The AI is also able to process speech in loud or noisy environments, such as those that exist in cluttered call centers ensuring high accuracy. "With Solaria, we have made a breakthrough in AI-powered voice technology that unlocks new opportunities for businesses, driving efficiency and delivering more seamless, impactful customer experiences across diverse languages and markets," added Quéguiner. As part of its launch, Gladia announced a strategic partnership with LiveKit, an open-source developer framework for real-time AI voice agents. This will enable developers to use Gladia's application programming interface to build voice conversational agents with built-in multilingual translation capabilities for AI-powered applications. Since launching its first transcription and audio intelligence API in 2023, Gladia has gained notable traction in the enterprise market, particularly for meeting recorders and note-taking assistants. The company's platform is now used by more than 700 customers globally, including Attention Inc., Circleback Inc., Method Financial Inc., Recall AI Inc., Sana Labs AB and VEED.IO Ltd.
Share
Share
Copy Link
Gladia, a French AI startup, introduces Solaria, a next-generation automatic speech recognition model offering real-time, multilingual support for global businesses, particularly in call center operations.
French AI startup Gladia has launched Solaria, a cutting-edge automatic speech recognition (ASR) model designed to revolutionize real-time communications for call centers and voice-first platforms. This next-generation AI-powered solution aims to address the growing demand for scalable, high-quality multilingual support in global business operations 12.
Solaria boasts an impressive array of features that set it apart in the competitive speech recognition market:
The model's ability to handle a wide range of languages with high accuracy addresses a critical need in the global market, where 49% of executives report financial losses due to language barriers 2.
Solaria is designed to meet the demands of today's enterprise-scale voice automation:
These features make Solaria particularly valuable for businesses looking to expand into global markets and optimize their customer service operations.
As part of the Solaria launch, Gladia has partnered with LiveKit, an open-source developer framework for real-time AI voice agents. This collaboration will enable developers to integrate Gladia's API for building voice conversational agents with built-in multilingual translation capabilities 12.
Gladia has already gained significant traction in the enterprise market, serving over 700 customers worldwide, including notable companies such as Attention, Circleback, Method Financial, and VEED.IO 12.
Gladia was founded in 2022 by Jean-Louis Queguiner (ex-OVHCloud) and Jonathan Soto (ex-MIT/Sigfox). The Paris-based startup recently secured a $16 million Series A funding round in 2024, positioning itself as a leading end-to-end API audio infrastructure provider 1.
The launch of Solaria represents a significant step forward in AI-powered voice technology. By combining speech recognition, generative AI, and voice generation capabilities, Gladia is poised to help enterprise users and developers tap into the full potential of real-time audio data 1.
As businesses continue to face challenges in providing seamless, multilingual support at scale, solutions like Solaria are likely to play an increasingly important role in shaping the future of global customer experience and communication technologies.
Gladia, a French AI startup, has secured $16 million in Series A funding to develop an advanced multilingual real-time audio transcription and analytics engine, aiming to revolutionize voice-first platforms across various industries.
4 Sources
4 Sources
ElevenLabs, an AI startup valued at $3.3 billion, has introduced Scribe, a new speech-to-text model claiming 97% accuracy in English and support for over 99 languages, positioning itself as a strong competitor in the AI transcription market.
4 Sources
4 Sources
Deepgram launches Aura-2, a new text-to-speech AI model designed for enterprise use, outperforming competitors in blind tests and offering cost-effective, high-quality voice solutions for business applications.
2 Sources
2 Sources
OpenAI introduces new AI models for speech-to-text and text-to-speech, offering improved accuracy, customization, and potential for building AI agents with voice capabilities.
7 Sources
7 Sources
Mistral AI introduces Saba, a 24-billion-parameter language model tailored for the Middle East and South Asia, excelling in Arabic and South Indian languages like Tamil and Malayalam.
4 Sources
4 Sources