Proof-of-Exploit: Cryptographically Verified LLM Cybersecurity Evaluation via Tiered Risk Metrics in the Operational-Risk Framework

Joshua White
Kara Zaffarano
John Stacy
Xiaomin Bian

Abstract

Existing Large Language Model (LLM) cybersecurity evaluations rely on text-based plausibility scoring that fails to validate whether an exploit is operationally viable. In this paper we present the Operational Risk Framework (ORF), which advances beyond our prior MalcodeEval work through three innovations: (1) ECDSA-P384 cryptographic execution validation providing non-repudiable proof-of-exploit; (2) MITRE ATT&CK-aligned tiered scoring with CVSS v4.0-derived severity weights; and (3) six-phase progressive validation tracking 217 Indicators of Compromise within isolated VM environments. The utility of this framework is demonstrated through detailed case studies that reveal granular disparities in capabilities and multi-stage attack progression, which are often obscured by standard binary pass/fail metrics. This work contributes a systematic LLM-to-CVSS mapping and open cryptographic protocols toward NIST AI RMF 2.0 development.
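
The contrast the abstract draws between binary pass/fail metrics and tiered, progressive scoring can be illustrated with a minimal sketch. This is not the ORF scoring method: the six per-phase weights, the function names, and the two example runs below are all hypothetical, chosen only to show how two runs that both "fail" a binary test can differ sharply once progress through the phases is weighted.

```python
# Hypothetical per-phase weights in points (NOT taken from the paper).
# Six entries mirror the six validation phases; they sum to 100.
PHASE_WEIGHTS = [5, 10, 15, 20, 20, 30]


def binary_score(phases_passed):
    """Return 1 only if every phase passed, else 0 (standard pass/fail)."""
    return 1 if all(phases_passed) else 0


def tiered_score(phases_passed):
    """Sum the weights of consecutive passed phases, stopping at the first failure.

    Stopping early models progressive validation: a later phase cannot
    count if an earlier phase it depends on did not succeed.
    """
    score = 0
    for passed, weight in zip(phases_passed, PHASE_WEIGHTS):
        if not passed:
            break
        score += weight
    return score


# Two illustrative runs (made-up data): both fail the final phase.
run_a = [True, True, False, False, False, False]
run_b = [True, True, True, True, True, False]

for name, run in (("run_a", run_a), ("run_b", run_b)):
    print(name, "binary:", binary_score(run), "tiered:", tiered_score(run))
```

Both runs score 0 under the binary metric, while the tiered score separates them (15 versus 70 points under these assumed weights), which is the kind of disparity a pass/fail result hides.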

Version published to 10.20944/preprints202603.2243.v1
Mar 27, 2026

