Large Language Model Agents: A Comprehensive Survey on Architectures, Capabilities, and Applications

Abstract

Large Language Model (LLM) agents represent a paradigm shift in artificial intelligence, combining the remarkable reasoning capabilities of foundation models with the ability to perceive environments, make decisions, and take actions autonomously. This comprehensive survey provides an in-depth examination of LLM-based agents across multiple dimensions. We first establish a formal definition of LLM agents and trace their evolution from early language models to today's sophisticated autonomous systems. We then present a novel taxonomy that organizes the field into four fundamental categories: reasoning-enhanced agents that leverage chain-of-thought and tree-structured deliberation; tool-augmented agents that extend LLM capabilities through external APIs and knowledge bases; multi-agent systems that enable collaborative problem-solving through inter-agent communication; and memory-augmented agents that maintain persistent context across interactions. For each category, we analyze representative architectures, discuss key innovations, and evaluate their relative strengths and limitations. We further examine diverse applications spanning software engineering, scientific research, embodied robotics, and web automation, supported by systematic comparisons on established benchmarks including SWE-bench, WebArena, and AgentBench. Our analysis reveals that while current agents achieve impressive performance on structured tasks, significant challenges remain in areas such as long-horizon planning, hallucination mitigation, and safe deployment. We conclude by identifying promising research directions, including neuro-symbolic integration, multi-modal perception, and human-agent collaboration frameworks, providing a roadmap for advancing this rapidly evolving field.