MalGTA: Large language Model-based Guided Malware Tactical Analysis

Wenjie Guo
Jingfeng Xue
Zeyang Liu
Weijie Han
Jingjing Hu

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

In High-Performance Computing (HPC) environments, a comprehensive understanding of cybersecurity threats and their underlying attack strategies is essential. However, current research predominantly focuses on maliciousness determination, typically emphasizing the code's operational behaviors rather than the attack strategies employed. The advancements in multimedia computing, particularly Large Language Models (LLMs), have paved the way for innovative solutions to the aforementioned bottleneck. This work proposes MalGTA (Guided Malware Tactical Analysis), an LLM-based system that automates ATT\&CK (Adversarial Tactics, Techniques, and Common Knowledge)-aligned malware tactical analysis through Cuckoo Sandbox-driven dynamic profiling. Specifically, we construct a multi-source knowledge base integrated with Retrieval-Augmented Generation (RAG), which mitigates hallucinations in LLMs through context-sensitive threat intelligence retrieval. In addition, we propose a query optimization strategy to address challenges related to input information overload and attention dispersion in LLMs, enabling context-aware data refinement from Cuckoo reports. Finally, this study conducts dynamic analysis on classical VirusShare and Advanced Persistent Threat (APT) samples and constructs an evaluation dataset based on the authoritative malware analysis platform HybridAnalysis. Experimental results show the effectiveness of the method.

Version published to 10.21203/rs.3.rs-6226366/v1 on Research Square
Mar 31, 2025

AI-Powered Automated Bug Bounty Platform

This article has 5 authors:
1. Tahir Naquash
2. Zeeshan Yalakpalli
3. Shania Margaret Saini
4. Shivshankar -
5. Ayesha Siddiqua
This article has no evaluationsLatest version Jun 17, 2025
Benchmarking Large Language Models for Data Pipeline Code Generation and Execution

This article has 4 authors:
1. Chiara Rucco
2. Motaz Saad
3. Tobia Martina
4. Antonella Longo
This article has no evaluationsLatest version Jul 2, 2025
Unknown Vulnerability Mining for Power Monitoring Systems Aided by Large Language Modeling

This article has 4 authors:
1. Manpo Li
2. Xuerui Yang
3. Xiaochen Yang
4. Shugui Zhang
This article has no evaluationsLatest version Jun 18, 2025

Listed in

Abstract

Article activity feed

Related articles

AI-Powered Automated Bug Bounty Platform

Benchmarking Large Language Models for Data Pipeline Code Generation and Execution

Unknown Vulnerability Mining for Power Monitoring Systems Aided by Large Language Modeling