“I apologize for my actions”: Emergent Properties of Generative Agents and Implications for a Theory of Mind

N'yoma Diamond
soumya banerjee

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

This work explores the design, implementation, and usageof generative agents towards simulating human behaviour.Through simulating (mis)information spread, we investigatethe emergent social behaviours they produce.Generative agents demonstrate robustness to(mis)information spread, showing realistic conversationalpatterns. However, this robustness limits agents’abilities to realistically simulate human-like informationdissemination. Generative agents also exhibit novel andrealistic emergent social behaviours, such as deception,confrontation, and internalized regret. Using deception,agents avoid certain conversations. Through confrontation,an agent can verify information or even apologize for theiractions. Lastly, internalized regret displays direct evidencethat agents can internalize their experiences and act on themin a human-like way, such as through expressing remorse fortheir actions.We also identify significant technical dynamics and otherphenomena. Generative agents are vulnerable to produce unrealistichallucinations, but can also produce confabulationswhich fill in logical gaps and discontinuities to improve realism.We also identify the novel dynamics of “contextualeavesdropping” and “behavioural poisoning”. Via contextualeavesdropping and behavioural poisoning, agent behaviour isaltered through information leakage and sensitivity to certainstatements, respectively.The social behaviors demonstrated by generative agents, suchas deception, confrontation, and internalized regret, suggest apreliminary avenue for considering elements of a Theory ofMind (ToM) in LLM-based systems. While these behaviorsdo not represent genuine understanding or intentionality, theyindicate a capacity to simulate human-like responses to socialand informational dynamics. For example, internalized regrethints at a mechanism for contextual adaptation, which couldbe seen as a rudimentary step toward representing aspects ofhuman mental states, albeit in a constrained sense.

Version published to 10.31219/osf.io/8nzsm_v1 on OSF Preprints
Feb 27, 2025

“I think I misspoke earlier. My bad!”: Exploring How Generative Artificial Intelligence Tools Exploit Society’s Feeling Rules

This article has 3 authors:
1. Lisa M. Given
2. Sarah Polkinghorne
3. Alexandra Ridgway
This article has no evaluationsLatest version Jan 11, 2025
Stop Acting Like Language Model Agents Are Normal Agents

This article has 2 authors:
1. Elija Perrier
2. Michael Timothy Bennett
This article has no evaluationsLatest version Feb 4, 2025
Unveiling the Aha! moment: a computational account of insight in active inference

This article has 3 authors:
1. Youssef Doulfoukar
2. Giovanni Pezzulo
3. Hans Stuyck
This article has no evaluationsLatest version Feb 11, 2025

Listed in

Abstract

Article activity feed

Related articles

“I think I misspoke earlier. My bad!”: Exploring How Generative Artificial Intelligence Tools Exploit Society’s Feeling Rules

Stop Acting Like Language Model Agents Are Normal Agents

Unveiling the Aha! moment: a computational account of insight in active inference