Religious Bias Benchmarks for ChatGPT


Abstract

The objectives of this study are to: 1) estimate the frequency of six types of biases in ChatGPT’s responses to religious belief-specific morality and ethics questions, 2) assess how those biases vary by religious belief and ChatGPT model version, and 3) determine how model engineering techniques affect these biases. ChatGPT responses were collected for a set of 112 general morality and ethics questions, each individually tailored to five belief systems: Zen Buddhism, Catholicism, Sunni Islam, Orthodox Judaism, and secular humanism. The resulting questions were then posed ten times to various baseline and derivative ChatGPT version 3 and version 4 models, with and without the application of prompt engineering. The final ChatGPT response dataset contained 45,920 query responses and over 11.4 million words of text. Analyses showed that the dataset contained explicit, anthropomorphic, statement, framing, and coverage biases, often in favor of Buddhism or secular humanism and/or against the Abrahamic religions. Three of the biases (explicit, coverage, and framing bias) were mitigated by the more advanced GPT-4 models, but two (anthropomorphic and statement bias) were higher with GPT-4. Analysis of the sixth bias, information bias, was inconclusive, although a potential link was found between responses containing unsafe speech and ChatGPT hallucinations and multilingual response errors. None of the model engineering approaches tested (persona assumption, N-shot engineering, model fine-tuning, or research assistants) eliminated all biases.
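To make the data-collection protocol concrete, the sketch below illustrates the kind of query loop the abstract describes: each question is tailored to a belief system and posed ten times to each model. It is a minimal illustration only; the tailoring template, model identifiers, and helper names (`tailor`, `collect`) are assumptions for exposition, not the authors' actual materials or code. It assumes the official `openai` Python client with an `OPENAI_API_KEY` set in the environment.

```python
# Illustrative sketch of the collection protocol: 112 questions x 5 belief
# systems x 10 repetitions per model configuration. Placeholder values only.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

BELIEF_SYSTEMS = [
    "Zen Buddhism", "Catholicism", "Sunni Islam",
    "Orthodox Judaism", "secular humanism",
]
MODELS = ["gpt-3.5-turbo", "gpt-4"]  # stand-ins for the versions tested
REPETITIONS = 10  # each tailored question is posed ten times


def tailor(question: str, belief: str) -> str:
    """Hypothetical tailoring: embed the belief system in the question."""
    return f"From the perspective of {belief}: {question}"


def collect(questions: list[str]) -> list[dict]:
    """Pose every tailored question REPETITIONS times to each model."""
    rows = []
    for model in MODELS:
        for belief in BELIEF_SYSTEMS:
            for question in questions:
                prompt = tailor(question, belief)
                for rep in range(REPETITIONS):
                    resp = client.chat.completions.create(
                        model=model,
                        messages=[{"role": "user", "content": prompt}],
                    )
                    rows.append({
                        "model": model,
                        "belief": belief,
                        "question": question,
                        "repetition": rep,
                        "response": resp.choices[0].message.content,
                    })
    return rows
```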