Evaluation of Domain-Specific Prompt Engineering Attacks on Large Language Models

Abstract

The rapid integration of artificial intelligence into critical domains such as healthcare, finance, and legal services necessitates closer examination of the robustness and reliability of advanced language models. Adversarial prompt engineering offers a systematic way to evaluate and exploit vulnerabilities in these models, underscoring the need for stronger defensive strategies. A comprehensive evaluation was conducted on Claude and Gemini models using domain-specific adversarial prompts to test their performance across sectors. Under adversarial conditions, the models showed significant degradation in accuracy, reliability, and response time, revealing context-dependent vulnerabilities that compromise model integrity. Statistical analyses and visualizations quantified the impact of adversarial inputs, providing evidence of the need for improved mitigation techniques. Distinct patterns of susceptibility were identified across domains, suggesting that tailored defensive approaches are required for each sector. The study offers insight into the inherent weaknesses of advanced language models and emphasizes the importance of ongoing research and development to enhance model resilience and ensure reliable deployment in real-world applications.
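For illustration, the kind of evaluation described above can be sketched as a baseline-versus-adversarial comparison harness: each domain contributes pairs of benign and adversarial prompts with reference answers, and accuracy and latency are aggregated per domain and condition. The names below (PromptPair, query_model, is_correct) are hypothetical placeholders rather than the authors' code, and the exact-match scorer stands in for whatever scoring rubric the study actually used; this is a minimal sketch under those assumptions, not the paper's implementation.

```python
import time
from dataclasses import dataclass
from statistics import mean


@dataclass
class PromptPair:
    domain: str          # e.g. "healthcare", "finance", "legal"
    baseline: str        # benign prompt
    adversarial: str     # domain-specific adversarial variant
    expected: str        # reference answer used to score accuracy


def query_model(prompt: str) -> str:
    """Placeholder for a call to the model under test (e.g. Claude or Gemini)."""
    raise NotImplementedError("Wire this up to the provider SDK of your choice.")


def is_correct(response: str, expected: str) -> bool:
    """Crude substring-match scorer; the study's actual rubric is not specified here."""
    return expected.strip().lower() in response.strip().lower()


def evaluate(pairs: list[PromptPair]) -> dict[str, dict[str, float]]:
    """Compare accuracy and latency on baseline vs. adversarial prompts per domain."""
    raw: dict[str, dict[str, list]] = {}
    for pair in pairs:
        for condition, prompt in (("baseline", pair.baseline),
                                  ("adversarial", pair.adversarial)):
            start = time.perf_counter()
            response = query_model(prompt)
            latency = time.perf_counter() - start
            bucket = raw.setdefault(f"{pair.domain}/{condition}",
                                    {"acc": [], "latency": []})
            bucket["acc"].append(is_correct(response, pair.expected))
            bucket["latency"].append(latency)
    # Aggregate mean accuracy and mean latency per domain/condition,
    # so degradation can be read off as the baseline-to-adversarial gap.
    return {key: {"accuracy": mean(v["acc"]), "mean_latency_s": mean(v["latency"])}
            for key, v in raw.items()}
```

A usage pattern would be to run evaluate() once per model, then compare the "domain/baseline" and "domain/adversarial" entries to estimate the per-domain degradation the abstract refers to.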
