Machine Learning-Based Vulnerability Detection in Rust Code Using LLVM IR and Transformer Model

Young Lee
Syeda Jannatul Boshra
Jeong Yang
Zechun Cao
Gongbo Liang

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Rust’s growing popularity in high-integrity systems requires automated vulnerability detection in order to maintain its strong safety guarantees. Although Rust’s ownership model and compile-time checks prevent many errors, sometimes unexpected bugs may occasionally pass analysis, underlining the necessity for automated safe and unsafe code detection. This paper presents Rust-IR-BERT, a machine learning approach to detect security vulnerabilities in Rust code by analyzing its compiled LLVM intermediate representation (IR) instead of the raw source code. Using LLVM IR provides a language-neutral, semantically rich view of the program, capturing data and control flow, and reducing the noise of high-level syntax differences. Our method leverages a transformer model, GraphCodeBERT, to embed the IR and CatBoost classifier to classify code as vulnerable or safe. When evaluated on a mix of known buggy and safe code, this method obtained 98.11% overall accuracy, with a recall of 99.31% for safe code and 93.67% for vulnerable code. Our evaluation utilizes a diverse dataset of over 2,300 CVE-linked and Rust snippets compiled to LLVM IR, facilitating wide-range of coverage across real-world crates.

Version published to 10.20944/preprints202506.0788.v1
Jun 10, 2025

Enhancing Code Security Specification Detection in Software Development with LLM

This article has 4 authors:
1. Shanqi Zhan
2. Ying Lin
3. Yao Yao
4. Junlin Zhu
This article has no evaluationsLatest version Jun 3, 2025
TestLock: A Testability Logic Locking method against Machine Learning-based Oracle-less attacks

This article has 3 authors:
1. Marziye Pandi
2. Mostafa Moghaddas
3. Hakem Beitollahi
This article has no evaluationsLatest version Jun 16, 2025
AI-Powered Automated Bug Bounty Platform

This article has 5 authors:
1. Tahir Naquash
2. Zeeshan Yalakpalli
3. Shania Margaret Saini
4. Shivshankar -
5. Ayesha Siddiqua
This article has no evaluationsLatest version Jun 17, 2025

Listed in

Abstract

Article activity feed

Related articles

Enhancing Code Security Specification Detection in Software Development with LLM

TestLock: A Testability Logic Locking method against Machine Learning-based Oracle-less attacks

AI-Powered Automated Bug Bounty Platform