A Middleware System for Detecting and Mitigating Unsafe Tool Use in Large Language Models

Abstract

As Large Language Models (LLMs) increasingly integrate with external tools and APIs, the risk of hallucinated or unsafe tool invocations poses significant challenges for production deployments. We present HGuard (HallucinationGuard), a middleware system designed to detect, prevent, and mitigate dangerous tool use in LLM-powered applications. Our system employs a multi-stage validation pipeline incorporating schema validation, fuzzy matching, and configurable policy enforcement to intercept potentially harmful tool calls before execution. Through a comprehensive evaluation on 100 diverse test scenarios, we demonstrate that HGuard achieves 98% accuracy in detecting unsafe tool calls with minimal latency overhead (<10 ms median). The system successfully prevents unauthorized API calls, parameter hallucinations, and phantom tool invocations while maintaining high throughput (>5,000 requests/second). These results establish HGuard as a practical safety layer for production AI systems that require reliable tool-use capabilities.
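To make the described pipeline concrete, the following is a minimal illustrative sketch (in Python) of how a middleware guard might intercept a tool call and apply the three stages named in the abstract: fuzzy matching against registered tools, schema validation of parameters, and configurable policy enforcement. All names here (ToolCallGuard, ToolSpec, Verdict, check) are hypothetical and are not taken from the HGuard implementation; consult the full article for the actual design.

# Illustrative sketch of a multi-stage tool-call validation pipeline.
# All class and function names are hypothetical, not from HGuard itself.
import difflib
from dataclasses import dataclass, field
from typing import Any


@dataclass
class ToolSpec:
    """A registered tool and the parameters it accepts."""
    name: str
    required_params: set[str] = field(default_factory=set)
    allowed_params: set[str] = field(default_factory=set)


@dataclass
class Verdict:
    allowed: bool
    reason: str = ""
    suggestion: str | None = None  # e.g. closest known tool name


class ToolCallGuard:
    """Intercepts a tool call before execution and applies three checks:
    1. phantom-tool detection via fuzzy matching against registered tools,
    2. schema validation of the supplied parameters,
    3. allow/deny policy enforcement from configuration.
    """

    def __init__(self, tools: list[ToolSpec], denied_tools: set[str] | None = None):
        self.tools = {t.name: t for t in tools}
        self.denied_tools = denied_tools or set()

    def check(self, tool_name: str, params: dict[str, Any]) -> Verdict:
        # Stage 1: does the tool exist? If not, suggest the closest match.
        spec = self.tools.get(tool_name)
        if spec is None:
            close = difflib.get_close_matches(tool_name, self.tools, n=1, cutoff=0.6)
            return Verdict(False, f"unknown tool '{tool_name}'",
                           suggestion=close[0] if close else None)

        # Stage 2: schema validation, rejecting missing or hallucinated parameters.
        missing = spec.required_params - params.keys()
        unknown = params.keys() - spec.allowed_params
        if missing or unknown:
            return Verdict(False, f"missing={sorted(missing)} unknown={sorted(unknown)}")

        # Stage 3: policy enforcement, blocking tools denied by configuration.
        if tool_name in self.denied_tools:
            return Verdict(False, f"tool '{tool_name}' blocked by policy")

        return Verdict(True)


# Example: the model hallucinates a "send_emails" tool; the guard rejects
# the call and suggests the registered "send_email" tool instead.
guard = ToolCallGuard(
    tools=[ToolSpec("send_email", {"to", "body"}, {"to", "body", "subject"})],
    denied_tools={"delete_account"},
)
print(guard.check("send_emails", {"to": "a@b.com", "body": "hi"}))

Ordering the cheap lexical and schema checks before policy lookup keeps per-call work small, which is consistent with the low median latency overhead the abstract reports; the specific staging inside HGuard, however, is an assumption here.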
