Addressing Trust Requirements in the Design of an Open-Source Multi-Agent LLM-Based Domain-Specific Chatbot
Abstract
Large Language Models (LLMs) have the potential to automate knowledge-intensive interactions in enterprise systems, yet their adoption is often limited by a lack of user trust. This study examines how trust can be systematically engineered into an LLM-driven, multi-agent chatbot that handles routine human-resources (HR) queries. We follow a two-cycle Design Science Research methodology. Cycle 1 triangulated a systematic literature review with a thematic analysis of semi-structured interviews with six employees at a global firm and a confirmatory workshop with five AI experts to elicit and validate trust requirements. Cycle 2 instantiated these requirements in a multi-agent LLM chatbot prototype and evaluated whether the artifact satisfies them through controlled user sessions and expert walkthroughs, emphasizing perceived usefulness and trust captured in post-task interviews (n = 11) and operationalizing trust via alignment-oriented measures (faithfulness, answer relevancy, and adversarial robustness). The study yields a refined taxonomy of external (transparency, organizational safeguards, third-party security) and internal (model provenance, bias risk, reliability) trust factors, identifying reliability as the primary determinant of adoption. The implemented design achieved ≥ 0.86 on the trust-aligned metrics and was endorsed by 9 of 11 participants as ready for field deployment. These findings demonstrate that trust can be proactively addressed through design and offer prescriptive guidelines for software engineers seeking to embed LLMs safely and responsibly in socio-technical contexts.