A Survey of Recent Advances in Adversarial Attack and Defense on Vision-Language Models

Abstract

In the rapidly advancing domain of artificial intelligence, Vision-Language Models (VLMs) have emerged as critical tools that combine visual and textual processing to support applications such as automated image captioning, accessibility enhancements, and intelligent responses to multimodal queries. This survey examines the pre-training, fine-tuning, and inference paradigm that has markedly enhanced the capabilities of VLMs, allowing them to perform effectively across diverse downstream tasks and even make zero-shot predictions. Despite these advances, VLMs remain vulnerable to adversarial attacks, largely because of their reliance on large-scale, internet-sourced pre-training datasets. Such attacks can severely undermine model integrity by manipulating how inputs are interpreted, posing serious security risks and eroding user trust. We examine these adversarial threats in detail, ranging from single-modal perturbations to sophisticated multimodal strategies, and highlight the urgent need for robust defense mechanisms. We then discuss defense strategies that adapt model architectures, integrate adversarially robust training objectives, and employ fine-tuning techniques to counteract these vulnerabilities. This paper provides a comprehensive overview of current challenges and future directions in the adversarial landscape of VLMs, emphasizing the importance of securing these models for safe integration into real-world applications.
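
To make the notion of manipulating a VLM's input interpretation concrete, the sketch below shows a minimal PGD-style image perturbation against a CLIP-style image-text matching model. This is an illustration only, not a method drawn from the surveyed papers: the checkpoint name, captions, file path, and budget values (`eps`, `alpha`, `steps`) are assumed placeholders, and the perturbation budget is applied in normalized pixel space for simplicity.

```python
# Illustrative sketch (not from the survey): a PGD-style image perturbation that
# pushes a CLIP-style VLM's interpretation of an image toward an attacker-chosen caption.
import torch
from transformers import CLIPModel, CLIPProcessor
from PIL import Image

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32").eval()
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

def pgd_attack(pixel_values, input_ids, attention_mask, target_idx,
               eps=8 / 255, alpha=2 / 255, steps=10):
    """Maximize image-text similarity with the target caption under an L-inf budget.

    Note: the budget is applied directly to CLIP's normalized pixel tensor here,
    which keeps the sketch short; a faithful attack would work in raw pixel space.
    """
    adv = pixel_values.clone().detach()
    for _ in range(steps):
        adv.requires_grad_(True)
        out = model(input_ids=input_ids, attention_mask=attention_mask, pixel_values=adv)
        # logits_per_image: (num_images, num_captions); raise the target caption's score
        loss = torch.log_softmax(out.logits_per_image, dim=-1)[0, target_idx]
        grad = torch.autograd.grad(loss, adv)[0]
        adv = adv.detach() + alpha * grad.sign()                      # gradient ascent step
        adv = pixel_values + (adv - pixel_values).clamp(-eps, eps)    # project into eps-ball
    return adv.detach()

# Usage (placeholder image path and captions; caption index 1 is the attacker's target)
image = Image.open("example.jpg")
captions = ["a photo of a dog", "a photo of a cat"]
inputs = processor(text=captions, images=image, return_tensors="pt", padding=True)
adv_pixels = pgd_attack(inputs["pixel_values"], inputs["input_ids"],
                        inputs["attention_mask"], target_idx=1)
```

After the loop, feeding `adv_pixels` back through the model typically assigns the highest image-text score to the attacker's caption even though the image looks unchanged to a human, which is the kind of input manipulation the defenses surveyed here aim to resist.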
