Fine-grained Insider Threat Detection with Large Language Models: A Comparative Study
Abstract
Insider threats remain a significant challenge in cybersecurity, demanding more effective and efficient detection strategies. The advent of Large Language Models (LLMs) presents new opportunities for Insider Threat Detection (ITD), particularly in monitoring and analyzing behavioral patterns indicative of potential threats. However, LLMs also have limitations, such as a tendency to generate inaccurate or misleading outputs owing to their generative nature. This study explores the use of LLMs for ITD on the CERT r4.2 dataset. We perform a comprehensive comparative analysis of fine-tuned models (specifically BERT, LLaMA 3, and Phi 3) used as classifiers in both binary and multi-class classification tasks, as well as of generative models applied through in-context learning (ICL) techniques. Our findings demonstrate that fine-tuned LLMs achieve high accuracy and stability in detecting insider threats, even in complex multi-class scenarios. These models consistently outperform baseline methods, effectively capturing subtle behavioral cues associated with insider risk. Additionally, we introduce a refined Chain-of-Thought (CoT) prompting method that significantly improves ICL performance, particularly for scenario-specific threat identification. We also investigate the models' ability to handle previously unseen insider behaviors by incorporating a dedicated “Unknown” class. Results reveal that LLMs frequently misclassify these unknown behaviors as benign, especially in high-risk contexts, underscoring the difficulty of detecting novel threats in practical ITD applications.
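As a concrete, simplified illustration of the fine-tuned-classifier setting the abstract describes, the sketch below fine-tunes a BERT model for binary insider-threat classification with Hugging Face transformers. The serialized activity strings, field names, and label scheme are illustrative placeholders, not the paper's actual CERT r4.2 preprocessing or training configuration.

```python
# Minimal sketch: fine-tuning a BERT-style classifier for binary ITD.
# The "text"/"label" fields and the toy activity records are hypothetical
# stand-ins for serialized CERT r4.2 user-activity sequences.
from datasets import Dataset
from transformers import (
    AutoModelForSequenceClassification,
    AutoTokenizer,
    Trainer,
    TrainingArguments,
)

# Toy examples: 1 = insider-threat behavior, 0 = benign behavior.
records = [
    {"text": "logon 08:02; email sent x3; usb connected; file copy to removable media", "label": 1},
    {"text": "logon 09:00; http browsing; email sent x1; logoff 17:30", "label": 0},
]
dataset = Dataset.from_list(records)

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

def tokenize(batch):
    # Serialize each activity sequence into fixed-length token inputs.
    return tokenizer(batch["text"], truncation=True, padding="max_length", max_length=128)

dataset = dataset.map(tokenize, batched=True)

# num_labels=2 for the binary task; a larger value would cover the
# multi-class (scenario-specific) setting, including an "Unknown" class.
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2
)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="itd-bert",
        num_train_epochs=1,
        per_device_train_batch_size=2,
    ),
    train_dataset=dataset,
)
trainer.train()
```

The same pattern would apply to the decoder-only models (LLaMA 3, Phi 3) with a classification head, though the paper's exact fine-tuning recipe is not specified in the abstract.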