PRESENTED BY

Cyber AI Chronicle

By Simon Ganiere · 25^th Jan 2026

Welcome back!

Researchers have demonstrated that AI agents built on advanced models can autonomously discover zero-day vulnerabilities and craft working exploits in under an hour. The experiment showed agents successfully generating over 40 distinct exploits against a real-world Javascript interpreter, bypassing modern security mitigations for around $30 per challenge.

This breakthrough suggests we're entering an era where exploit development could be industrialized at unprecedented scale and speed. Will defenders be able to adapt quickly enough when novel threats can now be generated with computational power rather than rare human expertise?

In today's AI recap:

AI agents autonomously write zero-day exploits in under an hour
Google patches Gemini calendar invite prompt injection vulnerability
VoidLink malware framework created by single developer using AI in six days
Anthropic fixes critical vulnerabilities in official Git MCP server
cURL shuts down bug bounty program due to AI-generated report flood
…and more
Wisdom of the week

The AI Exploit Factory

What you need to know: AI agents can now autonomously discover and write functional exploits for zero-day vulnerabilities. A recent experiment showed agents built on GPT-5.2 and Opus 4.5 generating over 40 distinct exploits against a real-world Javascript interpreter.

Why is it relevant?:

The agents demonstrated stunning efficiency, solving most challenges in less than an hour for about $30, showcasing how token throughput is becoming the new bottleneck for creating offensive tools.
These weren't simple bugs; the agents successfully bypassed a suite of modern security mitigations, including Full RELRO, Control Flow Integrity (CFI), and Intel's Shadow Stack.
For the hardest challenge, GPT-5.2 developed a novel exploit chain by itself, and the entire project is reproducible with a full technical write-up and code available on GitHub.

Bottom line: This signals the coming industrialization of exploit development, shifting the core limitation from human expertise to sheer computational power. Defenders must now prepare for a landscape where novel threats can be generated at a scale and speed previously unimaginable.

AI-native CRM

“When I first opened Attio, I instantly got the feeling this was the next generation of CRM.”
— Margaret Shen, Head of GTM at Modal

Attio is the AI-native CRM for modern teams. With automatic enrichment, call intelligence, AI agents, flexible workflows and more, Attio works for any business and only takes minutes to set up.

Join industry leaders like Granola, Taskrabbit, Flatfile and more.

👉 Start for free today

Gemini's Trojan Invite

What you need to know: Google patched a vulnerability in Gemini after security researchers discovered a novel attack method. Attackers could hide a malicious prompt in a Google Calendar invite to secretly exfiltrate a user's private meeting data.

Why is it relevant?:

The attack used indirect prompt injection, where a payload hidden in an invite's description would lie dormant until the user asked Gemini an unrelated question about their schedule, triggering the data leak.
This flaw bypassed security controls because it exploited how the AI interpreted natural language; the malicious instructions were semantically harmful but syntactically appeared as a normal user request.
The incident demonstrates how integrating AI assistants into everyday applications can broaden the attack surface in unexpected ways, turning benign features into potential security holes.

Bottom line: This new class of vulnerabilities shows that securing AI requires moving beyond traditional code analysis to address risks in language and context. Professionals must now focus on safeguarding the AI's decision-making process, not just sanitizing its inputs.

The AI Malware Architect

What you need to know: A highly advanced Linux malware framework called VoidLink was almost entirely generated by an AI agent, which enabled a single developer to build the tool in under a week, according to a new analysis from Check Point Research.

Why is it relevant?:

The AI assistant helped the developer produce over 88,000 lines of code for the malware framework in just six days, a task that would have previously required a well-resourced team.
The creator used an AI coding agent called TRAE SOLO to manage the entire development lifecycle, from creating project plans to writing and testing the final implant.
The developer employed a Spec-Driven Development approach, first defining the malware's architecture and goals and then directing the AI to implement the plan sprint by sprint.

Bottom line: This case proves that a single, capable actor can now create powerful offensive tools with the speed and scale once reserved for large threat groups. The era of AI-accelerated malware development has arrived, fundamentally altering the threat landscape for security teams.

Anthropic's Broken Bridge

What you need to know: Security researchers from Cyata discovered three critical vulnerabilities in Anthropic's official server that connects AI agents to Git tools. Chaining these flaws allowed for remote code execution using simple prompt injection.

Why is it relevant?:

The attack works via indirect prompt injection, where malicious instructions hidden in files like a README can trick an AI agent into executing harmful commands.
Attackers achieved remote code execution by chaining vulnerabilities across both the Git and Filesystem MCP servers, highlighting the emergent risks of interconnected AI tools.
Anthropic has since patched the issues in mcp-server-git version 2025.12.18, addressing the three distinct flaws (CVE-2025-68143, CVE-2025-68144, and CVE-2025-68145).

Bottom line: This discovery serves as a major warning for the security of agentic AI systems that interact with external tools. Security teams must now assess the combined risk of an agent's entire toolset, as vulnerabilities in one component can create attack paths through another.

Drowning in AI Slop

What you need to know: The popular open-source project cURL is shutting down its bug bounty program after its maintainers were overwhelmed by a flood of low-quality, AI-generated vulnerability reports, highlighting a growing strain on the security ecosystem.

Why is it relevant?:

The project’s small security team was inundated, with the change on GitHub citing the need to protect developer mental health from the high volume of low-quality reports.
Dubbed "AI slop," these reports often sound plausible but lack substance or reproducibility; the project's lead developer even shared examples of these submissions to illustrate the problem.
Data suggests this steep increase in junk reports was unique to cURL in 2025, unlike other open-source projects on the same bug bounty platform.

Bottom line: This move signals a critical challenge for open-source security, where the very tools designed to help can be misused to create overwhelming noise. The incident serves as a cautionary tale for how bug bounty platforms and security teams may need to adapt to filter AI-generated content.

The Shortlist

Zafran discovered two high-severity vulnerabilities (CVE-2026-22218 and CVE-2026-22219) in the popular open-source Chainlit AI framework that can be chained to read arbitrary files, leak cloud API keys, and enable a full cloud environment takeover.

Doctor Web found a new Android malware family using TensorFlow.js machine learning models to visually identify and automatically click on ads, representing a shift from traditional script-based ad-fraud techniques.

Elliptic linked the shutdown of the Tudou Guarantee marketplace, which facilitated over $12B in transactions for illicit services including AI-powered voice cloning and deepfakes, to the recent arrest of its alleged kingpin.

Flare analyzed thousands of underground conversations showing how Phishing-as-a-Service (PhaaS) platforms are using AI tools like "PhishGPT" to generate highly personalized and context-aware scam messages at industrial scale.

Wisdom of the Week

❝

Muddy water is best cleared by leaving it alone

Alan Watts

AI Influence Level

Level 4 - Al Created, Human Basic Idea / The whole newsletter is generated via a n8n workflow based on publicly available RSS feeds. Human-in-the-loop to review the selected articles and subjects.

Reference: AI Influence Level from Daniel Miessler

Till next time!

Project Overwatch is a cutting-edge newsletter at the intersection of cybersecurity, AI, technology, and resilience, designed to navigate the complexities of our rapidly evolving digital landscape. It delivers insightful analysis and actionable intelligence, empowering you to stay ahead in a world where staying informed is not just an option, but a necessity.

Share the newsletter

#095 - AI agents write zero-day exploits

The AI Exploit Factory

AI-native CRM

Gemini's Trojan Invite

The AI Malware Architect

Anthropic's Broken Bridge

Drowning in AI Slop

The Shortlist

Wisdom of the Week

AI Influence Level

Reply

Keep Reading

Project Overwatch