5 Sources
[1]
Google announces Gemini 3.5 Live Translate for instant voice-to-voice translation
Google has been chasing real-time translation for years, which it says has been one of its "pioneering machine learning experiments." We've seen numerous demos on stage at Google events in the past, but you needed Google phones, earbuds, or some other specific setup. Last year, Google brought real-time translation to more users in the Translate app, and now it's expanding availability more. With the release of Gemini 3.5 Live Translate, you'll have access to instant translation in more places and with lower latency than ever before. The new AI model is part of the version 3.5 family that launched at I/O. Before today, Google had only rolled out the Flash version, but we're expecting a Pro model to drop in the coming weeks. Gemini 3.5 Live Translate is a speech-to-speech model tuned to automatically detect and translate in more than 70 languages. Google says Gemini 3.5 Live Translate is fast enough to keep up with a normal conversation, following just a few seconds behind the speaker while also matching intonation, pacing, and pitch. In short, the voice sounds more like you than a generic robot. The demos, which are all being recorded under controlled conditions, do sound impressive. You won't have to wait long to verify the model's abilities for yourself, though. Gemini 3.5 Live Translate is rolling out across several parts of the Google ecosystem. Developers can begin building with a public preview in the Gemini Live API or AI Studio. The model processes speech continuously and handles all the multilingual inputs automatically, saving developers from manually configuring settings. It also filters out background noise in busy environments. Select enterprise customers will also get access to the new translation model in Google Meet starting this month in advance of a wider rollout. Google says it's tweaking the Meet interface to bring the live translate feature to the front, too. Most notably, 3.5 Live Translate will come to the Google Translate app on both Android and iOS soon. At the tail end of last year, Google began testing Gemini-based live translation in the app with any earbuds (and in the iOS app); previously, you needed to have the company's Pixel Buds with an Android phone. The pending update will expand further with the addition of the latest 3.5 model. Not only can you use any earbuds, you don't need earbuds at all. If you don't have any handy, you can hold the phone up to your ear like you're on a call to hear the spoken translation. However, this "listening mode" only works on Android at this time. The audio streams from Gemini 3.5 Live Translate are intended to sound lifelike even if they don't exactly mimic the user's voice. However, Google is still proceeding cautiously. All Gemini 3.5 Live Translate audio streams will have SynthID watermarks integrated into the waveform data. This will mark the speech as AI-generated, and there is (currently) no way to remove that.
[2]
Google's Gemini 3.5 Live Translate enables realistic real-time translation at the speed of natural conversations
Google's Gemini 3.5 Live Translate enables realistic real-time translation at the speed of natural conversations Google LLC's newest artificial intelligence tool promises to bring real-time translation to every smartphone user, enabling more natural and fluid conversations between speakers of different languages. That's according to a new blog post which announced the arrival of Gemini 3.5 Live Translate, which explained that it's the company's most advanced audio model for speech-to-speech translation released to date. Whereas traditional translation tools have always been cumbersome because of the way speech is processed and then translated in turns, Gemini 3.5 Live Translate is much speedier. According to Google, it's able to listen continuously as someone is talking, translate what they're saying and then speak to the other person in their own language. What this means is that non-multilinguals will be able to engage in almost-natural conversations, with only a couple of seconds of delay - similar perhaps, to long-distance telephone calls back in the days of rotary telephones. Google Product Manager Anuda Weerasinghe and Senior Staff Software Engineer Tony Lu said in the co-authored blog post that Gemini 3.5 Live Translate is able to detect which language a person is speaking automatically, so there's no need to set anything up first. It supports more than 70 languages at launch, and that means it can support "thousands" of different language pairings. The company is making it available to developers and enterprises, so we can expect the capability to be integrated with third-party communication platforms in the near future. Of course, it's also being rolled out to everyone directly in the Google Translate application. This isn't Google's first attempt at real-time translation, but earlier efforts have always required specific hardware such as the company's own smartphones and earbuds. Gemini 3.5 Live Translate is different in that it can work on any smartphone. It's also based on a new architecture that changes how the translation process works. It relies on "continuous stream translation" which means that it doesn't have to wait until one person has finished speaking before it starts generating a response. It results in much more fluid translated conversations, as the video below demonstrates: Weerasinghe and Lu said Gemini 3.5 Live Translate is designed for the realities of the real world, meaning it can perform well in noisy environments and handle overlapping voices and informal speech. This means it's suitable for more practical use cases, including customer support calls, classrooms, guided tours, ride-sharing services, live broadcasts and so on, they said. They also emphasized the quality of the model's voices. Rather than the robotic, synthetic voices found on the standard Google Translate app, it tries to preserve the speaker's authenticity by matching their pacing, intonation and emotional tone. As such, the translated speech sounds a lot more natural, enhancing the flow of the conversation. Google's long-term goal with Gemini 3.5 Live Translate is to change the world by enabling people to converse naturally with anyone in the world, regardless of the languages they speak. By the sounds of it, it has a lot of potential to make life easier for travelers and anyone trying to do business with foreign entities.
[3]
Gemini 3.5 Live Translate Debuts With Real-Time Speech Translation
It is rolling out across supported Google products starting today Google on Wednesday rolled out its latest speech-to-speech translation model called Gemini 3.5 Live Translate. Google claims it is designed to enable more natural multilingual conversations. As per the company, the new AI model can detect more than 70 languages and generate translated speech. The result is claimed to preserve a speaker's tone, pacing, and intonation while continuously delivering near real-time translations. Gemini 3.5 Live Translate is rolling out across Google Translate, Google Meet, Google AI Studio, and the Gemini Live API for developers. Gemini 3.5 Live Translate Processes Audio As It is Streamed in Real-Time In a blog post, Google emphasised that Gemini 3.5 Live Translate is the next step in its translation efforts, which began nearly two decades ago as one of the company's earliest machine learning experiments. The company claims it is suitable for multilingual meetings, live broadcasts, lessons, customer support interactions, and real-time interpretation. Gemini 3.5 Live Translate handles live translation with minimal delay, according to Google, and it is said to process audio as it is being streamed in real-time, generating translated audio a few seconds behind the original speaker. The company said it also avoids the pauses that are often associated with traditional turn-based translation systems. The new speech-to-speech translation model can identify and switch between more than 70 languages, according to Google. It also eliminates the need to manually configure language settings. Gemini 3.5 Live Translate is said to remain effective in noisy and unpredictable environments. Upgraded translation capabilities are also coming to Google Meet, the company announced. Like Live Translate, the app will support translation in more than 70 languages, which is a significant increase from the previous limit of five languages. Apart from this, the tech giant is bringing Gemini 3.5 Live Translate to the Google Translate app on Android and iOS. It will be available through Live Translate, and the translated speech can be heard through any paired compatible headphones. On Android, Google is rolling out a Listening Mode that allows translated audio to be played directly through a smartphone's earpiece. Tackling concerns around AI-generated content, the company noted that all audio generated by Gemini 3.5 Live Translate is embedded with SynthID, the company's watermarking technology. It is claimed to be integrated directly into generated audio and is designed to help identify AI-generated content. Developers can access Gemini 3.5 Live Translate in public preview through the Gemini Live API and Google AI Studio. Google highlighted integrations with platforms including Agora, Fishjam, LiveKit, Pipecat, and Vision Agents, to help them build and deploy voice translation applications more easily. The AI model is rolling out globally across supported Google products starting today.
[4]
Google Gemini 3.5 Live Translate: AI Now Listens, Translates & Replies Instantly in 70 Languages
Google's Gemini 3.5 can now translate spoken conversations in real time across 70 languages. The feature makes it easier for people who speak different languages to communicate naturally. Google has rolled out a new Live Translate feature for Gemini 3.5. It can listen to a person speaking, translate the conversation, and talk back in another language within seconds. On June 9, the model was launched with support for more than 70 languages. The live-translation feature is designed to make conversations easier between people who do not speak the same language. Instead of typing text into a translator, users can simply talk and hear the translated response. The update is one of Google's biggest steps yet in real-time language translation.
[5]
Gemini 3.5 Live Translate Debuts with Natural-Sounding Voice Translation
Google has introduced Gemini 3.5 Live Translate for real-time voice translation. The new feature is designed to make conversations across different languages feel more natural and less interrupted. Google has announced Gemini 3.5 Live Translate, a new feature that can translate spoken conversations in real time. The new AI tool supports 70 languages and can be accessed through the , the Gemini Live API for developers, and Google Meet for enterprise users. The new translation tool is designed to make people feel as if they are speaking to each other directly, regardless of the language they use. Unlike the older translation models, Gemini 3.5 translates whatever the speaker says almost instantly, keeping a few seconds gap. It doesn't require the user to complete a sentence. Google has assured that can preserve parts of a speaker's tone and speaking style. This helps the translated voice sound less mechanical and more like a real conversation. The goal is to reduce the pauses and awkward moments that often happen during translated discussions. While the tool can be accessed through multiple ways, Google Meet users will benefit the most. Previously, speech translation was limited to 5 languages; but it has now expanded to 70 languages, and 2,000 language pairings within a single meeting. Also Read: Gemini 3.5's rollout can help users during travel, video calls, and conversations between friends or colleagues who speak different languages. Instead of switching between apps or typing messages, users can continue speaking normally while Gemini handles the translation. The launch arrives at a time when tech companies are competing with each other to build better communication tools. with companies such as Microsoft and OpenAI, which are also adding voice features to their AI products. The real contest may not be about translation alone. It may be about which platform becomes the easiest way for people to communicate across languages and locations.
Share
Copy Link
Google has released Gemini 3.5 Live Translate, its most advanced speech-to-speech AI translation model yet. The tool supports more than 70 languages and enables natural multilingual conversations with just seconds of delay. It's rolling out across Google Translate, Google Meet, and the Gemini Live API, marking a significant expansion in accessibility for real-time translation technology.
Google has officially launched Gemini 3.5 Live Translate, positioning it as the company's most advanced speech-to-speech AI translation model to date
1
. The new AI model represents a significant leap in making real-time speech translation accessible beyond the company's proprietary hardware. While Google has demonstrated translation capabilities at various events over the years, previous implementations required specific setups like Google phones or Pixel Buds with Android devices1
. Gemini 3.5 Live Translate changes this dynamic entirely, working on any smartphone and eliminating hardware barriers that previously limited adoption.
Source: Analytics Insight
The technology behind Gemini 3.5 Live Translate relies on a fundamentally different architecture than traditional translation tools. Instead of processing speech in turns, the AI model uses "continuous stream translation" that listens as someone speaks, translates their words, and delivers output in real time
2
. This approach means the system doesn't wait for a speaker to finish before generating a response, resulting in much more fluid conversations2
. According to Google Product Manager Anuda Weerasinghe and Senior Staff Software Engineer Tony Lu, the model processes audio as it streams, generating translated audio just a few seconds behind the original speaker3
. This minimal delay creates an experience similar to long-distance telephone calls, enabling what Google describes as natural multilingual conversations2
.Gemini 3.5 Live Translate launches with support for more than 70 languages, automatically detecting which language a person is speaking without requiring manual configuration
2
. This capability enables thousands of different language pairings, significantly expanding the practical applications for instant voice-to-voice translation2
. The AI model automatically identifies and switches between supported languages, eliminating the need for users to manually configure settings3
. For Google Meet specifically, this represents a dramatic improvement from the previous limit of five languages to more than 70 languages and 2,000 language pairings within a single meeting5
.Google emphasizes that Gemini 3.5 Live Translate maintains conversational flow by matching the speaker's intonation, pacing, and pitch
1
. Rather than producing robotic, synthetic voices typical of standard translation apps, the AI-driven translation technology attempts to preserve the speaker's authenticity by matching their emotional tone and speaking style2
. This focus on natural-sounding output helps reduce the awkward pauses and mechanical delivery that often characterize translated discussions5
. The model also handles real-world challenges effectively, performing well in noisy environments while managing overlapping voices and informal speech patterns2
.
Source: Analytics Insight
Gemini 3.5 Live Translate is rolling out globally across multiple Google products starting today
3
. Developers can access the model through a public preview in the Gemini Live API and Google AI Studio, with integrations already available through platforms including Agora, Fishjam, LiveKit, Pipecat, and Vision Agents3
. The model processes speech continuously and handles multilingual inputs automatically, saving developers from manual configuration while filtering out background noise in busy environments1
. Select enterprise customers gain access to the translation model in Google Meet this month ahead of a wider rollout1
.Related Stories
The Google Translate app on both Android and iOS will receive Gemini 3.5 Live Translate soon, building on last year's expansion that enabled Gemini-based live translation with any earbuds
1
. Users can hear translated speech through any paired compatible headphones, and notably, earbuds aren't required at all1
. Android users gain access to a "Listening Mode" that plays translated audio directly through the smartphone's earpiece, allowing users to hold the phone to their ear like a regular call1
. This feature is currently exclusive to Android1
.Google is proceeding cautiously with safeguards for real-time spoken conversation translation. All audio generated by Gemini 3.5 Live Translate includes SynthID watermarks embedded directly into the waveform data
1
. These watermarks identify the speech as AI-generated, and there is currently no way to remove them1
. Google highlighted that SynthID is integrated directly into generated audio and is designed to help identify AI-generated content3
.
Source: Ars Technica
Google positions Gemini 3.5 Live Translate as suitable for diverse use cases including multilingual meetings, live broadcasts, classroom lessons, customer support interactions, guided tours, ride-sharing services, and real-time interpretation
3
. The launch arrives as tech companies compete to build better communication tools, with Microsoft and OpenAI also adding voice features to their AI products5
. Google's long-term goal centers on changing how people communicate globally by enabling natural conversations with anyone regardless of the languages they speak2
. For travelers and businesses engaging with foreign entities, the technology promises to simplify communication without requiring users to switch between apps or type messages5
.Summarized by
Navi
[2]
[4]
[5]
27 Aug 2025•Technology

03 Oct 2024•Technology

26 Mar 2026•Technology

1
Policy and Regulation

2
Policy and Regulation

3
Policy and Regulation
