Addressing the Deployment Gap: Hybrid Symbolic-Statistical Vulnerability Detection in Safety-Critical C/C++ Systems

Jude E. Ameh
Abayomi Otebolaku
Augustine Ikpehai
Alex Shenfield
Dauda Sule

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Machine learning for vulnerability detection presents a persistent paradox: while academic benchmarks report 95–97% accuracy, production deployment remains below 15% due to fundamental failures under real-world distribution shifts and asymmetric error costs. This research identifies three primary drivers of this gap: class imbalance collapse, multiclass instability across sparse categories, and brittleness under adversarial code obfuscation. To address these, we present a hybrid symbolic-statistical architecture that utilizes a staged decision system to route inputs through deterministic template matching, statistical machine learning fallback, and explicit safety-rule overrides. The system was evaluated through a large-scale empirical study on the C3-VULMAP corpus, consisting of 6.3 million C/C + + functions. Quantitative results demonstrate that the hybrid system achieves 99.11% binary accuracy with an overall false negative rate of 0.89% and an average latency of 264 ms, making it suitable for CI/CD integration. Critically, the architecture maintains robustness under adversarial conditions, yielding an 8% reduction in false negatives on obfuscated code compared to pure machine learning baselines. A formative practitioner study with seven security engineers utilizing their own production codebases found an 85% preference for the system’s pattern-based explanations over black-box confidence scores, citing increased trustworthiness for regulatory audit requirements. By trading marginal benchmark optimization for production reliability and mechanistic interpretability, this hybrid approach provides a viable pathway for deploying automated code analysis in safety-critical domains.

Version published to 10.21203/rs.3.rs-9370984/v1 on Research Square
Apr 10, 2026

A Survey on Distributed System Testing Techniques

This article has 1 author:
1. Mahitha Geddavalasa
This article has no evaluationsLatest version Apr 10, 2026
Bridging Developer–QA Gaps Using Large Language Models and Automation: A Pilot Evaluation of AutoVisQA

This article has 1 author:
1. Tanvir Hasan
This article has no evaluationsLatest version Apr 17, 2026
Merging LoRA Adapters for Multi-Task Code Analysis: An Empirical Study of Linear Combination and Task Interference

This article has 2 authors:
1. Sankalp Pathak
2. Sanjay Garg
This article has no evaluationsLatest version Apr 16, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Survey on Distributed System Testing Techniques

Bridging Developer–QA Gaps Using Large Language Models and Automation: A Pilot Evaluation of AutoVisQA

Merging LoRA Adapters for Multi-Task Code Analysis: An Empirical Study of Linear Combination and Task Interference