GraphSentry: Contract-Checked Graph Surgery for Budgeted LLM Reasoning DAGs

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Budgeted inference-time optimization can improve large language model (LLM) reasoning without weight updates, yet multi-step pipelines are brittle: small edits or crossover between partially correct procedures often violate interface assumptions, triggering late failures and retries. We propose GraphSentry, a certificate-driven topology search framework that turns those implicit assumptions into explicit, auditable interface contracts. GraphSentry represents reasoning procedures as grammar-generated typed DAGs; each node emits an (artifact, certificate) pair, where certificates are deterministic predicates evaluated from logged evidence. During search, GraphSentry performs contract-checked graph surgery (mutation and crossover), rejects incompatible splices at module boundaries, and applies bounded repair (K=3) with downstream-only re-execution. Across seven benchmarks, GraphSentry improves accuracy by +5.3–+10.8 points over baselines while reducing token usage by 29–45% and improving latency by 1.6–2.0×. Across tasks, gains correlate with the prevalence of structural invalidity. Finally, discovered topologies transfer zero-shot across models and outperform native baselines without additional search.

Article activity feed