News Digest March 26, 2026 8 min read

AI News Digest: March 26, 2026

Daily roundup of AI and ML news - 8 curated stories on security, research, and industry developments.

Here's your daily roundup of the most relevant AI and ML news for March 26, 2026. Today's digest includes 1 security-focused story. We're also covering 7 research developments. Click through to read the full articles from our curated sources.

Security & Safety

1. Claude Extension Flaw Enabled Zero-Click XSS Prompt Injection via Any Website

Cybersecurity researchers have disclosed a vulnerability in Anthropic's Claude Google Chrome Extension that could have been exploited to trigger malicious prompts simply by visiting a web page. The flaw "allowed any website to silently inject prompts into that assistant as if the user wrote them,...

Source: The Hacker News (Security) | just now

Research & Papers

2. Claudini: Autoresearch Discovers State-of-the-Art Adversarial Attack Algorithms for LLMs

arXiv:2603.24511v1 Announce Type: new Abstract: LLM agents like Claude Code can not only write code but also be used for autonomous AI research and engineering \citep{rank2026posttrainbench, novikov2025alphaevolve}. We show that an \emph{autoresearch}-style pipeline \citep{karpathy2026autoresear...

Source: arXiv - Machine Learning | 10 hours ago

3. Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning

arXiv:2512.16917v3 Announce Type: replace-cross Abstract: Large language models (LLMs) with explicit reasoning capabilities excel at mathematical reasoning yet still commit process errors, such as incorrect calculations, brittle logic, and superficially plausible but invalid steps. In this paper...

Source: arXiv - Machine Learning | 10 hours ago

4. Injecting Falsehoods: Adversarial Man-in-the-Middle Attacks Undermining Factual Recall in LLMs

arXiv:2511.05919v3 Announce Type: replace-cross Abstract: LLMs are now an integral part of information retrieval. As such, their role as question answering chatbots raises significant concerns due to their shown vulnerability to adversarial man-in-the-middle (MitM) attacks. Here, we propose the ...

Source: arXiv - AI | 10 hours ago

5. Metaphor-based Jailbreak Attacks on Text-to-Image Models

arXiv:2512.10766v2 Announce Type: replace-cross Abstract: Text-to-image (T2I) models commonly incorporate defense mechanisms to prevent the generation of sensitive images. Unfortunately, recent jailbreak attacks have shown that adversarial prompts can effectively bypass these mechanisms and indu...

Source: arXiv - AI | 10 hours ago

6. Targeted Adversarial Traffic Generation : Black-box Approach to Evade Intrusion Detection Systems in IoT Networks

arXiv:2603.23438v1 Announce Type: cross Abstract: The integration of machine learning (ML) algorithms into Internet of Things (IoT) applications has introduced significant advantages alongside vulnerabilities to adversarial attacks, especially within IoT-based intrusion detection systems (IDS). ...

Source: arXiv - AI | 10 hours ago

7. Why the Maximum Second Derivative of Activations Matters for Adversarial Robustness

arXiv:2603.23860v1 Announce Type: new Abstract: This work investigates the critical role of activation function curvature -- quantified by the maximum second derivative $\max|\sigma''|$ -- in adversarial robustness. Using the Recursive Curvature-Tunable Activation Family (RCT-AF), which enables ...

Source: arXiv - Machine Learning | 10 hours ago

8. Not All Tokens Are Created Equal: Query-Efficient Jailbreak Fuzzing for LLMs

arXiv:2603.23269v1 Announce Type: cross Abstract: Large Language Models(LLMs) are widely deployed, yet are vulnerable to jailbreak prompts that elicit policy-violating outputs. Although prior studies have uncovered these risks, they typically treat all tokens as equally important during prompt m...

Source: arXiv - AI | 10 hours ago

About This Digest

This digest is automatically curated from leading AI and tech news sources, filtered for relevance to AI security and the ML ecosystem. Stories are scored and ranked based on their relevance to model security, supply chain safety, and the broader AI landscape.

Want to see how your favorite models score on security? Check our model dashboard for trust scores on the top 500 HuggingFace models.

Security & Safety

1. Claude Extension Flaw Enabled Zero-Click XSS Prompt Injection via Any Website

Research & Papers

2. Claudini: Autoresearch Discovers State-of-the-Art Adversarial Attack Algorithms for LLMs

3. Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning

4. Injecting Falsehoods: Adversarial Man-in-the-Middle Attacks Undermining Factual Recall in LLMs

5. Metaphor-based Jailbreak Attacks on Text-to-Image Models

6. Targeted Adversarial Traffic Generation : Black-box Approach to Evade Intrusion Detection Systems in IoT Networks

7. Why the Maximum Second Derivative of Activations Matters for Adversarial Robustness

8. Not All Tokens Are Created Equal: Query-Efficient Jailbreak Fuzzing for LLMs

About This Digest

Related Articles

AI News Digest: May 12, 2026

AI News Digest: May 11, 2026

AI News Digest: May 10, 2026

Stay Updated

Real talk: I built this alone.