Characterizing Documented Psychosocial Stressors in Pediatric Psychiatric Emergencies with an Open-Weight Large Language Model

Carson S Hartlage
Erika Rasnick Manning
Jonathan Bernard
Sia Vaish
James Gray
Melissa Young
Teresa Pestian
Patricia Tachinardi
Eneida A Mendonca
Alonzo T Folger
Cole Brokamp

Abstract

Objective

To evaluate whether a locally hosted open-weight large language model (LLM) can extract documented psychosocial factors from pediatric psychiatric intake notes, and to apply the validated extraction to a large emergency psychiatry cohort.

Materials and Methods

We identified emergency department presentations at Cincinnati Children’s Hospital Medical Center from January 1, 2016, through December 31, 2024, among patients <18 years with psychiatric billing diagnoses. Using full-text intake notes, gpt-oss:120b classified peer conflict, sleep disruption, and school-related academic, attendance, and disciplinary issues as detected, negated, or indeterminate. Four human raters independently reviewed 50 notes. We compared Fleiss’ κ among humans alone versus humans plus the LLM, assessed repeated-query stability across 50 independent calls per note, and applied the workflow to all eligible notes.
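The reliability comparison described above can be illustrated with a minimal sketch of Fleiss' κ. This is not the authors' code. The helper names (`binarize`, `with_llm`, `fleiss_kappa`) are hypothetical, and two steps are assumptions: collapsing negated and indeterminate into a single not-detected class, and treating the LLM as one additional rater in the human-plus-LLM condition.

```python
from collections import Counter


def binarize(label):
    """Collapse the three-way label into detected vs. not detected.

    Assumption: "negated" and "indeterminate" both count as not detected.
    """
    return "detected" if label == "detected" else "not_detected"


def with_llm(human_rows, llm_labels):
    """Append the LLM label to each note's human ratings (LLM as an extra rater)."""
    return [list(row) + [llm] for row, llm in zip(human_rows, llm_labels)]


def fleiss_kappa(ratings):
    """Fleiss' kappa for a list of notes, each a list with one label per rater.

    Every note must be rated by the same number of raters (at least two).
    """
    n_items = len(ratings)
    n_raters = len(ratings[0])
    if n_raters < 2 or any(len(row) != n_raters for row in ratings):
        raise ValueError("each note needs the same number (>= 2) of ratings")

    categories = sorted({label for row in ratings for label in row}, key=str)
    counts = [Counter(row) for row in ratings]

    # Observed agreement: share of agreeing rater pairs, averaged over notes.
    per_item = [
        (sum(c[cat] ** 2 for cat in categories) - n_raters)
        / (n_raters * (n_raters - 1))
        for c in counts
    ]
    p_bar = sum(per_item) / n_items

    # Chance agreement from the overall category proportions.
    proportions = [
        sum(c[cat] for c in counts) / (n_items * n_raters) for cat in categories
    ]
    p_e = sum(p * p for p in proportions)

    if p_e == 1.0:
        # Only one category was ever used, so kappa is undefined.
        return float("nan")
    return (p_bar - p_e) / (1.0 - p_e)
```

Under these assumptions, the human-only value for one factor would be `fleiss_kappa(human_rows)` and the human-plus-LLM value `fleiss_kappa(with_llm(human_rows, llm_labels))`, each computed after passing the labels through `binarize`.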

Results

Among 37,315 eligible admissions, 22,284 had eligible intake notes; 22,270 produced parseable JSON. In detected-vs-not-detected coding, human-plus-LLM reliability did not differ significantly from human-only reliability across measures (human κ, 0.71–0.94; human-plus-LLM κ, 0.70–0.93). Stability was associated with human agreement: mean LLM-human agreement increased from 42.6% for classifications with <80% stability to 82.7% for 100% stability (Pearson r=0.36). Full-cohort extraction showed frequent and overlapping documented factors: sleep disruption was most frequently detected (57.7%), followed by peer conflict (47.2%), academic issues (43.4%), disciplinary issues (43.3%), and attendance issues (16.9%).
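The abstract does not say how parseable output was checked or how stability was computed. The sketch below shows one plausible reading, under stated assumptions: a response counts as parseable when it is valid JSON carrying one of the three labels for every factor, and stability is the share of repeated calls that return the most common label. The JSON key names in `FACTORS` and the function names are hypothetical.

```python
import json
from collections import Counter

LABELS = {"detected", "negated", "indeterminate"}

# Hypothetical key names; the actual output schema is not given in the abstract.
FACTORS = ("peer_conflict", "sleep_disruption", "academic", "attendance", "disciplinary")


def parse_response(raw):
    """Return a {factor: label} dict, or None if the response is unusable."""
    try:
        data = json.loads(raw)
    except (json.JSONDecodeError, TypeError):
        return None
    if not isinstance(data, dict):
        return None
    for factor in FACTORS:
        value = data.get(factor)
        if not isinstance(value, str) or value not in LABELS:
            return None
    return {factor: data[factor] for factor in FACTORS}


def stability(labels):
    """Return (modal label, fraction of repeated calls that gave it).

    Assumption: stability is the modal-label share across repeated queries.
    """
    if not labels:
        raise ValueError("need at least one label")
    label, count = Counter(labels).most_common(1)[0]
    return label, count / len(labels)
```

With 50 calls per note, a factor labeled detected in 45 calls and negated in 5 would have a stability of 0.9 under this definition. Classifications could then be grouped by stability and compared against the human labels.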

Discussion

Agreement varied by construct and was strongest when repeated model outputs were stable.

Conclusion

Locally hosted open-weight LLMs can support scalable structured extraction of documented psychosocial factors from pediatric psychiatric intake notes after local validation.

Version published to 10.64898/2026.06.08.26354931 on medRxiv
Jun 9, 2026

