Claim Check + Semaphore-Based Queuing: A Fault-Tolerant Pattern for Distributed OCR/LLM Inference without Managed Infrastructure

Alejandro Jaime

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

(CCSQI) pattern: a formally specified, production-validated architecture for fault-tolerant GPU inference that replaces \$1,480--\$2,100/month of managed cloud infrastructure with five open-source components on commodity hardware. The pattern targets \emph{temporally decoupled} workloads — systems where producers and consumers operate in separate time windows and the SLA is completeness before a deadline, not real-time latency. For this workload class, reactive autoscaling solves a problem the architecture does not generate. and \texttt{asyncio.shield} produce deadlock under 30+ concurrent tasks with no exception and no log entry. The resolution follows from the actor model~\cite{hewitt1973}: liveness belongs to the broker, exclusion belongs to the semaphore. We additionally prove that a six-operation pipeline has exactly four structurally reachable failure modes, three automatically repairable and one preventable by operation ordering. The pattern is validated over 90 days, 4,000 documents, 24 classification types, with zero document loss and \$0 recurring infrastructure cost.

Version published to 10.21203/rs.3.rs-8960646/v1 on Research Square
Mar 3, 2026

Symbolic Typestate Inference for Guard Reduction: A Bytecode Optimization Pass for Faster Java FFM

This article has 1 author:
1. Arnab Karmakar
This article has no evaluationsAppears in 1 listLatest version Feb 17, 2026
<p class="MDPI12title">The Missing Layer in Modern IT: Governance of Commitments, Not Just Compute and Data

This article has 2 authors:
1. Rao Mikkilineni
2. W. Patrick Kelly
This article has no evaluationsLatest version Mar 25, 2026
GraphSentry: Contract-Checked Graph Surgery for Budgeted LLM Reasoning DAGs

This article has 5 authors:
1. Rui Li
2. Shuang Cao
3. Ruihua Liu
4. Alexandre Duprey
5. Shuchen Ge
This article has no evaluationsLatest version Feb 20, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Symbolic Typestate Inference for Guard Reduction: A Bytecode Optimization Pass for Faster Java FFM

<p class="MDPI12title">The Missing Layer in Modern IT: Governance of Commitments, Not Just Compute and Data

GraphSentry: Contract-Checked Graph Surgery for Budgeted LLM Reasoning DAGs