Experiments of A Diagnostic Framework for Addressee Recognition and Response Selection in Ideologically Diverse Conversations with Large Language Models

David Segod
Ricardo Alvarez
Patrick McAllister
Michael Peterson

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

The increasing deployment of conversational AI systems in real-world applications has brought significant attention to the challenges posed by ideological biases embedded in their outputs. The concept of a "multi-ideology hangover" addresses how conflicting ideological influences in training data persist and impact the relevance and neutrality of responses during dialogue generation. This research presents a diagnostic framework for evaluating the effects of ideological bias on addressee recognition and response selection in LLMs, using a combination of coreference resolution, topic modeling, and contextual embeddings. Through experiments involving ideologically diverse conversations, the results reveal that LLMs exhibit inconsistent behavior in ideologically charged contexts, leading to potential bias amplification and reduced accuracy in addressee recognition. The findings demonstrate the limitations of current automated evaluation techniques, demonstrating the need for more advanced bias mitigation strategies and robust evaluation methods to ensure neutrality in conversational AI systems. The study provides key insights into the underlying difficulties faced by LLMs in handling ideologically conflicting dialogues, offering a foundation for improving future conversational systems in politically and culturally sensitive environments.

Version published to 10.31219/osf.io/j69sz on OSF Preprints
Oct 4, 2024

When Corporate Chatbots Show Bias: A Multi-Dimensional Analysis of LLMs in Enterprise Settings

This article has 3 authors:
1. Shreya Bhattacharya
2. Vincent Hagenow
3. Marco Di Gennaro
This article has no evaluationsLatest version May 16, 2025
Advancing Conversational Diagnostic AI with Multimodal Reasoning

This article has 36 authors:
1. Ryutaro Tanno
2. Khaled Saab
3. Jan Freyberg
4. Chunjong Park
5. Tim Strother
6. Yong Cheng
7. Wei-Hung Weng
8. David Barrett
9. David Stutz
10. Nenad Tomasev
11. Anil Palepu
12. Valentin Liévin
13. Yash Sharma
14. Abdullah Ahmed
15. Elahe Vedadi
16. Roma Ruparel
17. Kimberly Kanada
18. Cian Hughes
19. Yun Liu
20. Geoff Brown
21. Yang Gao
22. Sean Li
23. S. Sara Mahdavi
24. James Manyika
25. Katherine Chou
26. Yossi Matias
27. Avinatan Hassidim
28. Dale Webster
29. Pushmeet Kohli
30. Ali Eslami
31. Joelle Barral
32. Adam Rodman
33. Vivek Natarajan
34. Mike Schaekermann
35. Tao Tu
36. Alan Karthikesalingam
This article has no evaluationsLatest version Jun 4, 2025
MultiLLM – Self Reflect Iterative Prompt Methodology based Automated Essay Scoring System

This article has 2 authors:
1. R. Johnsi
2. G. Bharadwaja Kumar
This article has no evaluationsLatest version Jun 18, 2025

Listed in

Abstract

Article activity feed

Related articles

When Corporate Chatbots Show Bias: A Multi-Dimensional Analysis of LLMs in Enterprise Settings

Advancing Conversational Diagnostic AI with Multimodal Reasoning

MultiLLM – Self Reflect Iterative Prompt Methodology based Automated Essay Scoring System