Prompt Engineering for Scale Development in Generative Psychometrics

Lara Lee Russell-Lasalandra
Hudson Golino

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

This Monte Carlo simulation examines how prompt engineering strategies shape the quality of large language model (LLM)–generated personality assessment items within the AI-GENIE framework for generative psychometrics. Item pools targeting the Big Five traits were generated using multiple prompting designs (zero-shot, few-shot, persona-based, and adaptive), model temperatures, and LLMs, then evaluated and reduced using network psychometric methods. Across all conditions, AI-GENIE reliably improved structural validity following reduction, with the magnitude of its incremental contribution inversely related to the quality of the incoming item pool. Prompt design exerted a substantial influence on both pre- and post-reduction item quality. Adaptive prompting consistently outperformed non-adaptive strategies by sharply reducing semantic redundancy, elevating pre-reduction structural validity, and preserving substantially larger item pool, particularly when paired with newer, higher-capacity models. These gains were robust across temperature settings for most models, indicating that adaptive prompting mitigates common trade-offs between creativity and psychometric coherence. An exception was observed for the GPT-4o model at high temperatures, suggesting model-specific sensitivity to adaptive constraints at elevated stochasticity. Overall, the findings demonstrate that adaptive prompting is the strongest approach in this context, and that its benefits scale with model capability, motivating continued investigation of model–prompt interactions in generative psychometric pipelines.

Version published to 10.31234/osf.io/znqkm_v2 on OSF Preprints
Mar 17, 2026
Version published to 10.31234/osf.io/znqkm_v1 on OSF Preprints
Mar 16, 2026

Prompt Engineering for Scale Development in Generative Psychometrics

This article has 2 authors:
1. Lara Lee Russell-Lasalandra
2. Hudson Golino
This article has no evaluationsLatest version Mar 17, 2026
Leveraging AI for Automatic Item Generation for Psychological Scales

This article has 3 authors:
1. Xijuan Zhang
2. Kentaro Suzuki
3. Kai Wen Zhou
This article has no evaluationsLatest version Apr 7, 2026
Generative Psychometrics via AI-GENIE: Automatic Item Generation and Validation with Network-Integrated Evaluation

This article has 3 authors:
1. Lara Lee Russell-Lasalandra
2. Alexander P. Christensen
3. Hudson Golino
This article has no evaluationsLatest version Apr 20, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Prompt Engineering for Scale Development in Generative Psychometrics

Leveraging AI for Automatic Item Generation for Psychological Scales

Generative Psychometrics via AI-GENIE: Automatic Item Generation and Validation with Network-Integrated Evaluation