DeepL Voice Translation Launches in 40+ Languages

DeepL Expands Beyond Text With Voice-to-Voice Translation Suite

DeepL, the Cologne-based language AI company renowned for its text translation tools, has launched DeepL Voice-to-Voice, a real-time translation suite designed to handle live business communications across more than 40 languages [1]. The product suite addresses four distinct use cases: virtual meetings, mobile and web conversations, group settings for frontline workers, and enterprise applications through an API. Supported languages include all 24 official EU languages plus Vietnamese, Thai, Arabic, Norwegian, Hebrew, Bengali, and Tagalog [2]. Jarek Kutylowski, DeepL's founder and CEO, described the launch as reaching "another frontier in translation," emphasizing that the technology allows everyone to speak naturally in their own language without the friction or cost of interpreters [2].

Real-Time Translation for Meetings Targets Zoom and Microsoft Teams

DeepL is releasing add-ons for platforms like Zoom and Microsoft Teams, where listeners can either hear real-time translation while others speak in their native languages or follow real-time translated text on screen [1]. Voice for Meetings, which enables participants to speak in their native language while others hear simultaneous translation, is opening an early access program in June, and the company is inviting organizations to join a waitlist for it [2]. Voice for Conversations, which enables AI-powered spoken translation across mobile and web without requiring app installation, is now generally available [2]. The speech translation engine also supports group conversations in settings like training sessions or workshops, allowing participants to join through a QR code [1].

Enterprise API Opens Translation for Business Communications

The Voice-to-Voice API, which lets businesses embed DeepL's translation engine into their own customer-facing applications such as call centers, is in ongoing early access [2]. Kutylowski noted that AI is reimagining what customer service will look like in the coming years, explaining that a translation layer helps companies provide support in languages where qualified staff are scarce and expensive to hire [1]. A customization feature called Spoken Terms, which allows the system to learn industry-specific vocabulary, company names, and personal names, is scheduled to become generally available on 7 May [2]. DeepL has positioned the product as an enterprise tool, emphasizing that its voice technology never uses customer data to train its models and does not permanently store transcription or translation data after a call ends, a data security framing aimed at regulated industries [2].

Latency Challenges Persist Despite Translation Quality Edge

Kutylowski acknowledged that the challenge in creating a real-time translation product is striking a balance between reducing latency (the delay between someone speaking and the translated audio playing back) and maintaining accurate results [1]. A live demonstration by Chief Product Officer Gonzalo Gaiolas at DeepL Connect Seoul on 15 April exposed the system's current limitation: a visible delay of one to two sentences between the speaker finishing and the translation being delivered [2]. Gaiolas acknowledged the lag directly, stating that "different languages have different word orders and sentence structures, which causes delays in real-time interpretation," according to Seoul Economic Daily [2]. The current system works through a three-step pipeline: speech is converted to text, the text is translated using DeepL's established translation engine, and the translated text is then converted back to speech through speech synthesis [2]. Going forward, DeepL wants to develop an end-to-end voice translation model that skips the text step entirely [1].
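The three-step pipeline described above can be pictured with a minimal Python sketch. Everything in it is an illustrative assumption rather than DeepL's API or implementation: the `CascadedTranslator` class, the `fake_*` stage functions, and the `TOY_GLOSSARY` word list are hypothetical stand-ins that pass text around as bytes so the example runs without any speech or translation service.

```python
import time
from dataclasses import dataclass, field
from typing import Callable, Dict


@dataclass
class CascadedTranslator:
    """Chains speech-to-text, text translation, and text-to-speech stages.

    Hypothetical illustration only; not DeepL's actual architecture or API.
    """
    transcribe: Callable[[bytes], str]
    translate: Callable[[str], str]
    synthesize: Callable[[str], bytes]
    timings: Dict[str, float] = field(default_factory=dict)

    def run(self, audio: bytes) -> bytes:
        # Step 1: speech is converted to text.
        start = time.perf_counter()
        text = self.transcribe(audio)
        self.timings["transcribe"] = time.perf_counter() - start

        # Step 2: the text is translated.
        start = time.perf_counter()
        translated = self.translate(text)
        self.timings["translate"] = time.perf_counter() - start

        # Step 3: the translated text is converted back to speech.
        start = time.perf_counter()
        speech = self.synthesize(translated)
        self.timings["synthesize"] = time.perf_counter() - start
        return speech


# Toy stand-ins (assumptions): "audio" is just UTF-8 text encoded as bytes.
def fake_transcribe(audio: bytes) -> str:
    return audio.decode("utf-8")


TOY_GLOSSARY = {"hello": "hallo", "world": "welt"}


def fake_translate(text: str) -> str:
    # Word-by-word lookup; real translation engines do far more than this.
    return " ".join(TOY_GLOSSARY.get(word, word) for word in text.lower().split())


def fake_synthesize(text: str) -> bytes:
    return text.encode("utf-8")


pipeline = CascadedTranslator(fake_transcribe, fake_translate, fake_synthesize)
output = pipeline.run(b"Hello world")
total_latency = sum(pipeline.timings.values())
```

In a cascade like this, each stage waits for the previous one to finish, so the per-stage delays recorded in `timings` add up. That is one plausible contributor to the kind of lag seen in the demonstration, and an end-to-end model would collapse the three calls into one.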

Competing Against Well-Funded Rivals and Platform Giants

DeepL faces competition from several well-funded startups working in adjacent corners of the space. Sanas, which last year raised $65 million from Quadrille Capital and Teleperformance, uses AI to modify a speaker's accent in real time, a tool aimed primarily at call center agents [1]. Dubai-based Camb.AI focuses on speech synthesis and translation for media and entertainment companies, helping them dub and localize video content at scale [1]. Palabra, backed by Reddit co-founder Alexis Ohanian's firm Seven Seven Six, is building a real-time speech translation engine designed to preserve both the meaning and the speaker's original voice, putting it in more direct competition with what DeepL is now building [1]. Google, Microsoft, and Zoom, the platform owners DeepL is simultaneously challenging and integrating with, all offer their own meeting translation features. In blind evaluations commissioned by DeepL and conducted independently by Slator, a language industry research firm, 96% of professional linguists preferred DeepL Voice over the native translation solutions in Google Meet, Microsoft Teams, and Zoom, citing superior fluency and contextual accuracy [2]. The current system translates using a fixed synthetic voice, but DeepL plans to release a voice-preservation feature by the end of 2026 that maintains the speaker's original voice characteristics in the translated output.

References

[1] DeepL, known for text translation, now wants to translate your voice | TechCrunch

[2] DeepL launches real-time voice-to-voice translation in 40+ languages
