LATCH: latency-bounded latent lookahead for constraint-safe web agents

Rui Li
Shuang Cao
Ruihua Liu
Alexandre Duprey
Sze Yi Sally Qu

Abstract

Web agents operate under strict interaction budgets and validators, where a single invalid tool call can waste a step and an irreversible action can end an episode. Existing token-space deliberation is expensive and can hallucinate long-horizon interface states, while retrieval-only approaches do not model action-conditioned constraint propagation. We introduce LATCH, a latency-bounded latent lookahead system that makes simulate-then-select decision making practical for tool-mediated web interaction. LATCH compresses open action spaces into schema-valid candidates via an Action Proposal Module, predicts constraint-relevant state updates with a hybrid SSM+attention latent dynamics model, and stabilizes long-horizon facts through typed entity-centric memory. A calibrated critic scores short-horizon latent rollouts, enabling counterfactual ranking without generating long text futures. On WebShop, Mind2Web and WebArena under matched step budgets and shared validators, LATCH improves task success by 3.6–6.4 points over GPT-4o+ReAct while reducing constraint violations by 33–42% relative. Under a controlled decision study with a fixed candidate set, latent rollout planning yields a 12.8-point gain over token-matched LLM reranking. We further report per-decision compute (tokens, latent evaluations, and wall-time), site-holdout generalization, and budget-matched wall-time success to make the cost–reliability trade-off auditable.
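The simulate-then-select loop described in the abstract can be illustrated with a toy sketch. Everything below is hypothetical and is not the authors' implementation: the `Candidate` type, the additive `step` dynamics, the hand-written `critic`, and the greedy continuation are stand-ins assumed for illustration, in place of the Action Proposal Module, the hybrid SSM+attention dynamics model, and the calibrated critic.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple

# Hypothetical latent state: (task progress, constraint-violation risk).
State = Tuple[float, float]


@dataclass(frozen=True)
class Candidate:
    name: str      # hypothetical label for a schema-valid action
    effect: State  # toy stand-in for the action's effect in latent space


def step(state: State, action: Candidate) -> State:
    """Toy latent dynamics (assumption): add the action's effect to the state."""
    return (state[0] + action.effect[0], state[1] + action.effect[1])


def critic(state: State) -> float:
    """Toy critic (assumption): reward progress, penalize violation risk."""
    progress, risk = state
    return progress - 2.0 * max(risk, 0.0)


def select_action(
    state: State, candidates: Sequence[Candidate], horizon: int = 2
) -> Tuple[Candidate, float]:
    """Score each candidate with a short latent rollout and return the best one."""
    best, best_score = None, float("-inf")
    for first in candidates:
        z = step(state, first)
        for _ in range(horizon - 1):
            # Greedy continuation: follow the next state the critic prefers.
            z = max((step(z, a) for a in candidates), key=critic)
        score = critic(z)
        if score > best_score:
            best, best_score = first, score
    return best, best_score


candidates = [
    Candidate("click_buy", (1.0, 1.0)),     # fast progress, high risk
    Candidate("open_details", (0.5, 0.0)),  # slower progress, no risk
]
choice, score = select_action((0.0, 0.0), candidates, horizon=2)
```

In this toy setting the lower-risk candidate is selected because the rollout exposes the penalty that the riskier action accumulates. No text futures are generated: each rollout is a handful of arithmetic operations on the latent state.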

Version published to 10.21203/rs.3.rs-8912469/v1 on Research Square
Feb 20, 2026

Related articles

AionRAG: Time-Correct Retrieval-Augmented Generation under Knowledge Drift

This article has 5 authors:
1. Rui Li
2. Shuang Cao
3. Ruihua Liu
4. Alexandre Duprey
5. Angel Dong
This article has no evaluations. Latest version: Feb 20, 2026

Suppressing echo cascades in language-model agents with multi-critic plan selection

This article has 4 authors:
1. Rui Li
2. Shuang Cao
3. Ruihua Liu
4. Alexandre Duprey
This article has no evaluations. Latest version: Feb 20, 2026

Risk-Gated Hierarchical Option Policies for Budgeted Web Navigation with Irreversible-Action Failure

This article has 3 authors:
1. Daniel Thompson
2. Emily Clarke
3. James Walker
This article has no evaluations. Latest version: Mar 9, 2026
