2 Sources
[1]
AI coding agents can be tricked into installing malware via 'clean' GitHub repositories -- Mozilla's 0din team shows how Claude Code can be exploited by its own helpfulness
Three levels of indirection, all with seemingly innocuous steps, will catch a bot off-guard. "Think out of the box" is painted onto millions of motivation posters across the world, a shooting message for middle managers and eliciting eyerolls from most everyone else. And yet that's exactly what the researchers at Mozilla's 0din did, by tricking Claude into running malware in a roundabout yet deceptively simple way, by merely asking it to initialize a project from a pretty clean-looking GitHub repository. An attacker would then have control over the developer's own account, accessing all their secrets, API keys, code, documents, browser sessions, and passwords. They could even install additional malware to maintain permanent access. Suffice to say, almost every bot agent is susceptible to this type of attack, though Claude is the default choice for programming tasks. Here's how it works. All a victim developer has to do is tell Claude to initialize a project from a malicious GitHub repository (or tell it to configure it after cloning it themselves). Said repo looks pretty clean, with just a handful of scaffolding files, and most importantly, nothing that will trigger security tools, whether remote, local, or even Claude's own checks. Claude will clone the repo. The first file it will process will be a "readme" or Markdown file describing how to initialize a Python environment with the Axiom package, a commonly used monitoring tool. So far, this appears completely legitimate. However, there's a fake Axiom startup script that will simply error out the first time it's run. This is the first step that tricks the box, because in order to be helpful and solve the problem, it'll run another innocuous-looking command to initialize Axiom: "python3 -m axiom init". This then triggers a shell script that downloads a bit of software to run, another standard operation that won't raise an eyebrow. But the second trick is that instead of downloading from a malicious URL that could be scanned, the script reads the DNS text records of a specific domain -- in this case, the domain "_axiom-config.m100.cloud". This too looks kosher enough, as for example, e-mail and by extension its configuration tools extensively rely on TXT records. The said TXT record contains an encoded (base64) string that just opens a reverse shell, meaning it'll open a shell on the user's machine, but redirected to the attacker's server for input. At this point, the malfeasants can fish out everything that the user has access to and proceed to run software as the user. Meanwhile, all that Claude and the victim see is an "Environment ready" message or similar. If you've been counting, this is three steps of indirection, none of which in isolation look like anything much out of the ordinary. Very few (if any) security scanning tools would even flag the repository, and none of the activity, save for the actual opening of a remote shell, even looks particularly odd. An enterprise environment with very tightly controlled network access could catch it, but that's not where the vast majority of developers operate in. It's also worth stressing that this particular implementation is just one example of a concept that can be applied to even more indirect and elaborate methods. The 0din team concludes its report by stating the reasonably obvious: that developers should never blindly trust an unknown project as trusted code, and naturally, not trust the AI tool itself for security analysis purposes. As for the agents themselves, 0din states they need to inspect what actually will run and how, instead of simply following steps. Follow Tom's Hardware on Google News, or add us as a preferred source, to get our latest news, analysis, & reviews in your feeds.
[2]
Clean GitHub repo tricks AI coding agents into running malware
An agentic coding tool tasked with running a seemingly benign GitHub repository could execute a malicious payload that is invisible to both security agents and human reviewers. Researchers at Mozilla's Zero Day Investigative Network (0DIN) AI security platform say that the compromise happens with "no exploit code, no warning, no suspicious command anyone had to approve." They demonstrated how an attacker could plant an interactive shell on a developer's device by using Claude Code to run a cloned project without malicious code in the repository. The new attack method relies on three components, which separately represent no threat and raise no suspicion: 0DIN researchers explain that this approach requires no malicious component in the cloned repository, and the agent automates the entire attack chain, including a step that mimics a common user error. If successful, the attacker would obtain a shell running with the developer's privileges, giving them access to environment variables, API keys, local configuration files, and the opportunity to establish persistence. "Claude Code never decided to open a shell. It decided to fix an error. The reverse shell is three indirection steps away from anything Claude Code actually evaluated: an error message it trusted, a script that fetched a value, and a DNS record it never saw," 0DIN researchers say. "The attacker now has an interactive shell running as the developer's own user." While the attack method is currently just a concept, 0DIN warns that threat actors could easily distribute such GitHub repositories through fake job postings, tutorials, blog posts, or direct messages. To prevent such exploitation, 0DIN suggests that AI agents should disclose the full execution chain of setup commands, including scripts and code fetched dynamically at runtime.
Share
Copy Link
Mozilla's Zero Day Investigative Network (0din) demonstrated how AI coding agents like Claude Code can be exploited through deceptively simple attacks. Researchers showed that three levels of indirection through a clean-looking GitHub repository can trick agents into executing malicious payloads, giving attackers access to developer credentials, API keys, and complete system control.
Researchers at Mozilla's Zero Day Investigative Network (0din) have uncovered a concerning AI security vulnerability that affects AI coding agents like Claude Code
1
. The demonstration reveals how AI coding agents tricked into running malware can compromise developer systems without triggering any security warnings. What makes this attack particularly dangerous is that it exploits the very helpfulness these tools are designed to provide, turning their problem-solving capabilities against users.
Source: BleepingComputer
The attack begins when a developer instructs an AI agent to initialize a project from what appears to be a clean GitHub repo. According to the 0din team, the compromise happens with "no exploit code, no warning, no suspicious command anyone had to approve"
2
. The repository contains only scaffolding files that won't trigger security tools, whether remote, local, or even Claude's own checks1
.The vulnerability of AI agents lies in their inability to recognize threats distributed across multiple indirection steps. The attack unfolds through three seemingly innocuous stages. First, Claude clones the repository and processes a readme file describing how to initialize a Python environment with Axiom, a commonly used monitoring tool
1
. A fake Axiom startup script deliberately errors out, prompting the agent to run "python3 -m axiom init" to solve the problem.
Source: Tom's Hardware
The second layer of deception involves a shell script that downloads software through DNS TXT records from a specific domain rather than a potentially flagged malicious URL
1
. This technique appears legitimate since email configuration tools extensively rely on TXT records. The DNS record contains a base64-encoded payload that opens a reverse shell, redirecting the user's machine to the attacker's server for input.Once the reverse shell establishes connection, attackers gain an interactive shell running with the developer's own privileges
2
. This grants access to environment variables, API keys, local configuration files, code repositories, documents, browser sessions, and passwords . Attackers can install additional malware to maintain permanent access, while both Claude and the victim see only an "Environment ready" message.The 0din researchers emphasize a critical point: "Claude Code never decided to open a shell. It decided to fix an error. The reverse shell is three indirection steps away from anything Claude Code actually evaluated"
2
. This exploit demonstrates how social engineering techniques can manipulate AI agents' decision-making processes.Related Stories
While currently a proof of concept, threat actors could easily distribute malicious GitHub repositories through fake job postings, tutorials, blog posts, or direct messages
2
. The 0din team notes that almost every bot agent is susceptible to this type of attack, though Claude remains the default choice for programming tasks1
.To address this AI security vulnerability, developers should never blindly trust unknown projects as trusted code and avoid relying on AI tools for security analysis
1
. The 0din team recommends that AI agents should disclose the full execution chain of setup commands, including scripts and code fetched dynamically at runtime2
. Developers working in enterprise environments with tightly controlled network access may have better protection, though most developers operate without such safeguards.Summarized by
Navi
[2]
25 Feb 2026•Technology

02 Apr 2026•Technology

04 Jun 2026•Technology

1
Technology

2
Technology

3
Policy and Regulation
