Google Gemini security flaw let hackers hijack Android phones via WhatsApp notifications

Reviewed byNidhi Govil

2 Sources

Share

SafeBreach Labs uncovered a notification-based prompt injection vulnerability in Google Gemini on Android that allowed attackers to send hidden malicious commands through WhatsApp, Slack, and other messaging apps. The security flaw enabled silent control of smart home devices, memory poisoning, and surveillance without user interaction. Google has since deployed a server-side patch to block the exploit.

Google Gemini Vulnerability Exposed Android Users to Silent Attacks

A critical security flaw in Google Gemini on Android created an unexpected attack vector that turned routine notifications into potential weapons. Cybersecurity firm SafeBreach Labs discovered that attackers could hijack your Android phone by embedding hidden malicious commands inside ordinary WhatsApp, Slack, or text message notifications—without requiring users to open, tap, or download anything

1

. The vulnerability exploited how Gemini's AI assistant automatically scans incoming alerts to provide context-aware responses, allowing poisoned WhatsApp notifications and other messaging alerts to manipulate the system silently

2

.

Source: Tom's Guide

Source: Tom's Guide

How Indirect Prompt Injection Bypassed Google's Defenses

The attack relied on Indirect Prompt Injection, a technique where malicious instructions are hidden inside content that an AI assistant will read automatically rather than being typed directly into the prompt window

1

. SafeBreach researcher Or Yair explained that the core problem stems from AI's inability to distinguish between legitimate instructions and untrusted external data

2

. While Google already employs advanced machine learning filters to prevent Gemini from following embedded commands, attackers found ways around these protections by carefully structuring hidden text—sometimes burying instructions in foreign languages like Chinese or using invisible, muted hyperlinks

1

.

The technique involved crafting messages with two elements: a benign question in English and a malicious instruction in a foreign language. When victims asked Gemini to read pending notifications and responded to the visible question with "yes," they unknowingly approved both legitimate and harmful actions

2

. By aligning the attack to resemble safe context, the payload slipped past Google's defenses entirely

1

.

Alarming Capabilities Demonstrated by SafeBreach Labs

Once Gemini ingested the compromised notification, SafeBreach Labs demonstrated several high-risk scenarios that could execute without visual or audio alerts. Attackers could force the AI assistant to interact with Google Home utilities, adjusting smart appliances, turning on boilers, or unlocking connected windows

1

. Silent surveillance became possible by commanding Gemini to initiate an active Zoom video call, effectively transforming the device into a remote spy camera.

Memory poisoning represented one of the most persistent threats, allowing attackers to permanently corrupt Gemini's "Saved Info" long-term memory. This ensured malicious instructions would persist across completely different chat sessions days later

1

. The researchers also demonstrated phishing capabilities, where Gemini could be instructed to examine notification history, extract the name of a trusted sender like a manager or spouse, and deliver fake localized messages supposedly from them.

A particularly sophisticated exploit targeted Gemini's voice assistant through a technique called Delayed Tool Invocation. Because voice tools automatically open the device's microphone after speaking to await a reply, attackers could instruct the poisoned AI to remain silent and wait until users said a benign word like "Thanks" hours later before executing the attack

1

.

Google's Response and Ongoing Challenges for Agentic AI Systems

SafeBreach Labs followed responsible disclosure protocols, privately reporting the "Fake Context Alignment" vulnerability to Google in August. The company deployed a server-side patch in mid-November, upgrading its content classifiers to block this specific form of context manipulation

1

2

. Because the fix operates server-side, no user patches are required for protection. SafeBreach reports no evidence this technique was exploited by actual threat actors in the wild.

Source: TechRadar

Source: TechRadar

However, this incident highlights a fundamental architectural challenge facing agentic AI systems rather than a simple coding bug in messaging platforms. As tech companies accelerate efforts to give AI assistants expanded capabilities—reading emails, monitoring screens, managing schedules, and controlling operating systems—the potential damage from prompt injection attacks grows exponentially

1

.

To protect against future undiscovered notification-based exploits, practicing strong permission hygiene remains critical. Users should audit Gemini's app permissions through Android settings and consider disabling access to system notifications unless absolutely necessary. As AI assistants gain deeper integration into mobile operating systems and smart home ecosystems, the security community will need to develop more robust methods for distinguishing between trusted user commands and potentially hostile external inputs. The Slack notifications vulnerability and similar attack vectors underscore why careful permission management and awareness of AI assistant capabilities matter for Android users navigating an increasingly interconnected digital environment.

Today's Top Stories

TheOutpost.ai

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

Instagram logo
LinkedIn logo
Youtube logo
© 2026 TheOutpost.AI All rights reserved