Automated CVE Severity Prediction Using Deep Learning and Explainable AI

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Cybersecurity vulnerabilities represent a critical threat to information systems, often leading to data breaches and operational disruptions. Accurate assessment of vulnerability severity is therefore essential for effective risk prioritization. The Common Vulnerabilities and Exposures (CVE) system maintains a catalog of such vulnerabilities, each accompanied by a brief textual description and a severity score, typically assigned using the Common Vulnerability Scoring System (CVSS). However, assigning severity scores is time-consuming and resource-intensive, underscoring the need for automated prediction methods. In this study, we explore the automatic prediction of CVE severity levels directly from textual descriptions using machine learning. To address class imbalance, we leverage GPT-Neo, a generative language model, to synthetically augment underrepresented categories. We fine-tune a DeBERTa-based deep learning model for classification, achieving high accuracy in predicting severity levels from text alone. To enhance the interpretability of our model, we employ Local Interpretable Model-agnostic Explanations (LIME) to identify key terms and phrases that most strongly influenced model decisions. This approach demonstrates strong predictive performance and provides insight into the linguistic patterns associated with vulnerability severity.

Article activity feed