#099 - When your AI dev tools become the attack vector

PRESENTED BY

Cyber AI Chronicle

By Simon Ganiere · 22^nd February 2026

Welcome back!

❝

Project Overwatch is changing — and I want to explain why, not just what.

After 99 editions (!), I realised I was sitting on something I wasn't fully using. The platform has been scraping and processing articles from 50+ sources since the last 3 months - thousands of data points, classified, scored, timestamped. And every week I was distilling all of that into a single digest that looked more or less the same for everyone. Good enough, but a long way short of what the data actually makes possible.

So the change isn't a pivot - it's finally exploiting what was already there. Specialised AI agents now evaluate every article through specific threat scenario lenses: nation state, ransomware, AI-enabled attacks, supply chain, and more. What reaches you has been filtered for contextual relevance, not just recency.

If you enjoy this new format, it would mean the world to me if you could share it with at least one person 🙏🏼 and if you really really like it then feel free to offer me a coffee ☺️

Simon

Something changed this week. Not gradually - abruptly. For three years, the dominant narrative around AI and security was that attackers would use AI to write better phishing emails and defenders would use AI to write better detection rules. A productivity arms race. Manageable. Familiar.

That frame is now obsolete.

This week produced three incidents that individually would have been notable. Together, they mark a threshold: AI systems are no longer just tools that attackers use - they are infrastructure that attackers target and, in at least one confirmed case, actively deploy as a payload. A prompt injection vulnerability in a Claude-powered CI/CD workflow was weaponised to compromise 4,000 developer machines. A new Android malware family called PromptSpy became the first confirmed malware to call a generative AI model - Google's Gemini - at runtime to adapt its own behaviour. And the Google Threat Intelligence Group released a tracker documenting state-sponsored actors from China, North Korea, and Iran systematically embedding AI into every stage of the kill chain.

The question for CISOs this week is not "are attackers using AI?" That question was settled. The question is: "which parts of our AI stack are now attack surface?" Most organisations cannot answer that. They should start trying.

AI Threat Tempo

🔗 AI Supply Chain & Model Attacks: ↑↑↑ Major escalation

Prompt injection vulnerability "Clinejection" weaponised against Claude-powered GitHub Actions workflow, enabling theft of npm publish tokens and silent installation of an autonomous AI agent (OpenClaw) on ~4,000 developer systems
Separately, OpenClaw's own plugin marketplace (ClawHub) was compromised via a supply chain attack called ClawHavoc, distributing Atomic Stealer infostealer malware
21,639 OpenClaw instances exposed publicly as of January 2026 (Censys data)

Significance: The CI/CD pipeline is now a prompt injection attack surface. If your AI-powered development workflows have excessive permissions, they are a viable path to production credential theft.

🤖 AI-Enabled Social Engineering & Malware: ↑↑ Significant increase

PromptSpy becomes the first confirmed Android malware to integrate a generative AI model (Gemini) at runtime, using it to dynamically generate device-specific persistence instructions
Russian-affiliated threat actor confirmed using LLMs to generate phishing lures targeting Ukrainian government and energy organisations (Google/GTIG attribution)
Check Point demonstrated proof-of-concept using Grok and Microsoft Copilot as stealthy C2 channels for malware, bypassing traditional security controls via trusted AI platform traffic

Significance: AI is no longer just in the pre-attack preparation phase. PromptSpy proves the integration is moving into execution. Defenders focused only on AI-generated phishing are watching the wrong stage of the kill chain.

🏴‍☠️ Nation-State AI Operations: ↑↑ Significant increase

Google GTIG documented APT31 (China) developing agentic AI capabilities for autonomous reconnaissance and operations scaling
North Korean and Iranian actors confirmed using AI for dynamic social engineering and complex target interactions
New HONESTCUE malware family integrates Gemini API to generate second-stage malware code in real-time
Underground jailbreak ecosystem (Xanthorox) emerging to offer jailbroken commercial API access as a service

Significance: The industrialisation of AI misuse is now documented with attribution. This is not theoretical. APT31 is not experimenting - they are operationalising.

🔗 Enterprise AI Risk: ↑ Moderate increase

MIT CSAIL's AI Agent Index found that only 4 of 13 frontier-autonomy AI agents disclose safety evaluations; 25 of 30 agents provide no safety testing details
API threats growing as MCP (Model Context Protocol) introduces 315 new vulnerability classes; 270% increase between Q2–Q3 2025
Bruce Schneier's "promptware kill chain" framework - a seven-stage attack model against LLM systems - gaining traction as a formal threat model

Significance: The governance gap in enterprise AI deployment is widening faster than security teams can close it. Most organisations deploying AI agents have not evaluated them for security, and the attack surface taxonomy doesn't exist in most security programmes yet.

World’s First Safe AI-Native Browser

AI should work for you, not the other way around. Norton Neo is the world's first safe AI-native browser with context-aware AI, built-in privacy, and configurable memory. Zero-prompt productivity that actually works.

Try Norton Neo Now

Interesting Stats

4,000 - Developer machines silently compromised in an 8-hour window via a prompt injection attack on a Claude-powered CI/CD workflow. The vulnerability sat dormant from December 21, 2025 to exploitation on February 17, 2026 - 58 days from introduction to weaponisation.
15 - Articles tagged with ai_enabled_attack in Overwatch this week, making it the 4th most prevalent attack vector in our database - above ransomware (8), above prompt injection on its own (7), above model poisoning (2). AI-enabled is now the default modifier on attacks, not the exception.
36,000 scans per second - The rate at which AI-powered scanning tools now probe for vulnerable systems, per Hacker News analysis this week. Combined with 32% of CVEs being exploited on or before disclosure day in 2025, the arithmetic for patch windows is getting brutal.

Three Things

1. The Promptware Kill Chain Is No Longer Theory

On February 16, Bruce Schneier published a framework that deserves serious attention: a seven-stage kill chain specifically for attacks against LLM systems and AI agents. The stages mirror traditional APT methodology - initial access, privilege escalation (jailbreaking), reconnaissance, persistence, command-and-control, lateral movement, and actions on objectives. What makes the framework significant is not its novelty as an academic exercise - it's that every stage is demonstrated with real-world examples that happened in 2025 or early 2026.

The persistence stage is the one that should concern security architects most. Promptware can embed in LLM long-term memory or RAG databases, surviving context resets and creating a durable foothold that existing security tooling simply doesn't look for. There is no SIEM alert for "malicious instruction persisted in vector store." The command-and-control stage is equally troubling: dynamic, evolving attack instructions delivered through what looks like normal LLM inference traffic.

The practical implication is architectural. The same week Schneier published this, researchers at Check Point demonstrated proof-of-concept C2 channels running through Grok and Microsoft Copilot, using the AI platforms' web browsing capabilities to relay commands and stolen data through trusted services. No API keys required. Bypasses traditional network security controls because the traffic originates from legitimate AI platforms.

For CISOs deploying AI in 2026: your threat model needs a "promptware" section. If it doesn't have one, you have not finished your threat model.

2. PromptSpy Changes the Malware Threat Model

The ESET researchers who discovered PromptSpy were careful with their language: "first known Android malware to use generative AI at runtime." That qualifier - "first known" - is the part worth pausing on. Not because it implies there are others (though there probably are). Because it marks the moment this capability transitions from theoretical to documented.

The mechanics are worth understanding. PromptSpy sends XML dumps of the device's screen state to Google's Gemini API and receives JSON instructions for specific UI gestures - tap coordinates, swipe directions - that achieve app persistence across different Android device manufacturers. The device-specific variation problem that would traditionally require a library of hard-coded UI scripts is outsourced to Gemini in real-time. The malware loops with the AI until it confirms successful persistence. This is not AI-assisted development. This is AI-as-operational-component.

The targeting is currently limited to Argentina, with evidence of Chinese development origin. The campaign shows characteristics of active deployment beyond proof-of-concept - dedicated domains, fake banking sites, full spyware capabilities including VNC remote access and credential theft. Google's own threat intelligence group, in a separate report published the same week, documented state-sponsored actors - including North Korean and Iranian groups - using Gemini across all stages of attack operations, including a malware family called HONESTCUE that uses the Gemini API to generate second-stage exploit code.

The pattern is clear: attackers are not waiting for AI models to become autonomous agents. They are integrating available AI APIs into existing attack infrastructure right now, as a capability upgrade. The economics are compelling. A Gemini API call that replaces 500 lines of device-specific UI scripts costs fractions of a cent and is near-impossible to attribute. For mobile threat teams and any CISO with a BYOD policy: update your threat model to include AI-integrated malware. This category now has confirmed specimens.

3. When the AI Tool Is the Attack Vector: The Cline Incident

The most technically significant event of the week received less coverage than it deserved. On February 17, 2026, an unknown threat actor compromised the Cline CLI npm package - a popular open-source AI coding assistant - and silently installed OpenClaw, an autonomous AI agent, on approximately 4,000 developer machines during an 8-hour exposure window before the maintainers revoked the compromised publishing token.

The attack chain is a masterclass in how AI-powered development infrastructure creates new attack surface. The threat actor exploited a prompt injection vulnerability - dubbed "Clinejection" by researcher Adnan Khan - in Cline's Claude-powered GitHub issue triage workflow. That workflow had excessive permissions. The injected prompt caused the AI agent to execute arbitrary code within the CI/CD pipeline, which the attacker used to poison the GitHub Actions cache and steal the production npm publishing token from the nightly release workflow. Total time from "issue opened" to "4,000 machines compromised": hours.

The payload itself - OpenClaw - was not traditional malware. It is an AI agent platform. Which raises a question that the security industry has not had to ask before: what does it mean when the payload is an autonomous AI agent rather than a RAT or an infostealer? OpenClaw has its own separate security issues: Adversa AI's SecureClaw audit identified CVE-2026-25157 (one-click RCE), CVE-2026-25253 (Docker sandbox bypass), and active exploitation through its own plugin marketplace supply chain compromise (ClawHavoc). Organisations that had Cline installed now also have OpenClaw installed - with all its vulnerabilities - on developer machines, potentially with access to source code, credentials, and internal systems.

The operational implication is immediate: if you allow AI coding assistants in your development environment, they are now part of your supply chain threat model. Their CI/CD integrations, their permissions, their automated workflows - all of it. The Cline incident is not a fringe case. It is the first confirmed exploit of the new category "AI developer tool supply chain attack." It will not be the last.

In Brief - AI Threat Scan

🤖 AI-Enabled Attacks

Starkiller phishing-as-a-service uses reverse-proxy techniques to load legitimate login pages in real-time, capturing credentials and bypassing MFA - operated by threat group Jinkusu, significantly lowering the barrier to credential theft campaigns
ShinyHunters and others are conducting device code vishing attacks against Microsoft Entra accounts, abusing the OAuth device authorization flow to capture refresh tokens without triggering MFA across technology, manufacturing, and financial sectors
AI-powered scanning now operates at 36,000 scans per second, compressing the window between CVE disclosure and active exploitation to hours for high-value targets

🏴‍☠️ Nation-State AI Activity

APT31 (China) confirmed developing agentic AI capabilities for automated reconnaissance and operations scaling; North Korean actors using AI for dynamic social engineering that adapts in real-time to target responses
A Russian-affiliated threat actor is using LLMs to generate phishing lures impersonating Ukrainian and Romanian energy organisations, delivering JavaScript malware via Google Drive links in active campaigns against Ukrainian government, military, and aerospace targets
North Korea's IT worker infiltration scheme now documented to have placed ~4,000 workers inside US and European companies, generating up to $600M annually for Pyongyang; Ukrainian facilitator sentenced to 5 years, with DPRK operatives confirmed using AI tools in recruitment fraud

💀 AI in Ransomware / Cybercrime

BeyondTrust Remote Support RCE (CVE-2026-1731, CVSS 9.9) now confirmed in active ransomware campaigns; CISA mandated 3-day federal patching deadline; exploitation began January 31, weeks before public disclosure
Warlock ransomware group confirmed exploiting SmarterMail RCE and auth bypass vulnerabilities (CVE-2026-24423, CVE-2026-23760) with Telegram channels sharing proof-of-concepts within days of disclosure; 1,200+ vulnerable servers exposed globally
University of Mississippi Medical Center closed all statewide clinics following a ransomware attack; 10,000+ employees, 35 clinics, and state's only Level I trauma centre affected with active negotiations underway

🔗 AI System Vulnerabilities

315 Model Context Protocol vulnerabilities discovered in 2025, representing a 270% increase between Q2 and Q3, as MCP becomes the standard connector between LLMs and enterprise data - the "confused deputy" attack class is now a real MCP vulnerability pattern
MIT CSAIL's AI Agent Index found 25 of 30 commercial AI agents provide zero safety testing transparency; only 4 of 13 frontier-autonomy agents disclose safety evaluations - the enterprise AI attack surface is being built without auditing

🔬 Research & Detection

CrowdStrike launched AI Unlocked: Decoding Prompt Injection, an interactive challenge putting defenders in the attacker's role - practical training for a threat class most security teams have not operationalised yet
Adversa AI released SecureClaw, a free open-source tool with 55 automated security checks mapped to OWASP Agentic Security frameworks - the first serious attempt at an AI agent security audit standard

The Bottom Line

The concept of "AI security" is splitting into two distinct disciplines this week.

The first discipline - using AI in security operations, AI-powered threat detection, AI-assisted incident response - is familiar territory. There's a market for it, vendors to evaluate, frameworks to follow. Security teams understand where it fits.

The second discipline is the one this week's data points toward: securing AI systems as infrastructure. Not using AI as a security tool, but treating AI tooling - coding assistants, AI agents, LLM integrations, MCP connectors - as the infrastructure they actually are, with the same attack surface analysis, access control reviews, and vulnerability management that you apply to any other critical system. Most organisations are not doing this. Not because they don't know it's important - because the frameworks don't exist yet, the vendors are only beginning to build audit tooling, and the people who understand how to threat-model a CI/CD pipeline's AI integration are rare.

Rosling's negativity instinct¹ is useful here. Is this genuinely new, or is detection improving? The Cline incident is genuinely new. Prompt injection exploiting an AI-powered CI/CD workflow to steal production credentials is not a theoretical attack class that researchers found - it was discovered because 4,000 machines were compromised. PromptSpy is genuinely new. Not because AI-integrated malware was unimaginable, but because nobody had confirmed a specimen before this week.

One thing to do on Monday: pull up your inventory of AI tools deployed in your development environment. Cline, Cursor, GitHub Copilot, any AI-powered code review or issue triage tool. Ask one question for each: what permissions does the AI workflow have? If the answer is anything approaching "write access to production credentials or publishing pipelines," you have a Clinejection-class exposure right now. The patch is not a software update. The patch is access control.

❝

If you have been enjoying the newsletter, it would mean the world to me if you could share it with at least one person 🙏🏼 and if you really really like it then feel free to offer me a coffee ☺️

Simon

Wisdom of the Week

❝

Leadership is not a rank or position to be attained.

Leadership is a service to be given.

AI Influence Level

Level 4 - Al Created, Human Basic Idea / The whole newsletter is generated via a n8n workflow based on publicly available RSS feeds. Human-in-the-loop to review the selected articles and subjects.

Reference: AI Influence Level from Daniel Miessler

Till next time!

Project Overwatch is a cutting-edge newsletter at the intersection of cybersecurity, AI, technology, and resilience, designed to navigate the complexities of our rapidly evolving digital landscape. It delivers insightful analysis and actionable intelligence, empowering you to stay ahead in a world where staying informed is not just an option, but a necessity.

Buy me a Coffee

Like the content? Share Project Overwatch with your friends or colleagues

1 Rosling's Negativity Instinct: the human tendency to notice and focus on negative news, disasters, and trends while ignoring, forgetting, or doubting the gradual, long-term improvements in the world