Harness Engineering for Language Agents: The Harness Layer as Control, Agency, and Runtime
Abstract
Language agents that act through tools, files, browsers, APIs, and persistent sessions are shaped by more than the base model or a single prompt. Their reliability depends on a harness layer that determines which instructions remain authoritative, what actions are available, how state is carried forward, and how failures are handled over time. This position paper argues that recent practice has made this layer visible enough to warrant explicit treatment in NLP. We propose and operationalize a working decomposition of the harness layer into control, agency, and runtime (CAR); situate harness engineering in the arc from software engineering through prompt and context engineering; and provide a lightweight audit of 40 harness-relevant works in our selected evidence base, suggesting a visibility gap between academic papers and public engineering notes. We further argue that many reported agent gains may be partly harness-sensitive rather than purely model-driven, and propose HarnessCard as a lightweight reporting artifact, including a filled example. Grounded in papers, benchmarks, protocols, and engineering notes through March 20, 2026, we argue that reports of progress in language agents should specify not only the model, but also the harness layer that turns capability into governed action.