Here's your daily roundup of the most relevant AI and ML news for May 20, 2026. We're also covering 8 research developments. Click through to read the full articles from our curated sources.
Research & Papers
1. Quantum Adversarial Machine Learning: From Classical Adaptations to Quantum-Native Methods
arXiv:2605.18821v1 Announce Type: new Abstract: Machine learning has revolutionized numerous industrial domains. Despite recent advances, machine learning models remain vulnerable to adversarial threats. Adversarial machine learning is a field that studies these vulnerabilities to build robust m...
Source: arXiv - Machine Learning | 10 hours ago
2. Be Kind, Rewrite: Benign Projections via Rewriting Defend Against LLM Data Poisoning Attacks
arXiv:2605.19147v1 Announce Type: cross Abstract: Large language models (LLMs) are highly susceptible to backdoor attacks (BAs), wherein training samples are poisoned using trigger-based harmful content. Furthermore, existing defenses have proven ineffective when extensively tested across BA pat...
Source: arXiv - Machine Learning | 10 hours ago
3. Detecting Fluent Optimization-Based Adversarial Prompts via Sequential Entropy Changes
arXiv:2605.19966v1 Announce Type: new Abstract: Optimization-based adversarial suffixes can jailbreak aligned large language models (LLMs) while remaining fluent, weakening static and windowed perplexity-based detectors. We cast adversarial suffix detection as an online change-point detection pr...
Source: arXiv - Machine Learning | 10 hours ago
4. Faster-GCG: Efficient Discrete Optimization Jailbreak Attacks against Aligned Large Language Models
arXiv:2410.15362v2 Announce Type: replace Abstract: Aligned Large Language Models (LLMs) have attracted significant attention for their safety, particularly in the context of jailbreak attacks that attempt to bypass guardrails via adversarial prompts. Among existing approaches, the Greedy Coordi...
Source: arXiv - Machine Learning | 10 hours ago
5. Learning Rate Matters: Vanilla LoRA May Suffice for LLM Fine-tuning
arXiv:2602.04998v2 Announce Type: replace Abstract: Low-Rank Adaptation (LoRA) is the prevailing approach for efficient large language model (LLM) fine-tuning. Building on this paradigm, recent studies have proposed alternative initialization strategies, architectural modifications, and optimiza...
Source: arXiv - Machine Learning | 10 hours ago
6. DarkLLM: Learning Language-Driven Adversarial Attacks with Large Language Models
arXiv:2605.18868v1 Announce Type: cross Abstract: While vision and multimodal foundation models underpin critical tasks from perception to complex reasoning, they remain highly vulnerable to adversarial attacks. However, traditional adversarial attacks are typically limited to single, predefined...
Source: arXiv - Machine Learning | 10 hours ago
7. MoCo-EA: Exploiting Adversarial Mode Connectivity for Efficient Evolutionary Attacks
arXiv:2605.18919v1 Announce Type: cross Abstract: Evolutionary algorithms for adversarial attacks leverage population-based search to discover perturbations without gradient information, but suffer from inefficient crossover operations that destroy adversarial properties through discrete interpo...
Source: arXiv - Machine Learning | 10 hours ago
8. Robust Linear Dueling Bandits with Post-serving Context under Unknown Delays and Adversarial Corruptions
arXiv:2605.01752v3 Announce Type: replace Abstract: We study linear dueling bandits in volatile environments characterized by the simultaneous presence of post-serving contexts, delayed feedback, and adversarial corruption. Feedback is subject to unknown stochastic or adversarial delays and a cu...
Source: arXiv - Machine Learning | 10 hours ago
About This Digest
This digest is automatically curated from leading AI and tech news sources, filtered for relevance to AI security and the ML ecosystem. Stories are scored and ranked based on their relevance to model security, supply chain safety, and the broader AI landscape.
Want to see how your favorite models score on security? Check our model dashboard for trust scores on the top 500 HuggingFace models.