Multimodal Physiological Assessment for Clinical Competency Classification in Simulation-Based Medical Education: A Machine Learning Approach

Solomon Prince Teye-Lartey
Jacob Schmieder
Umesh Yadav
Shaza Aouthmany
Cristina Alvarado
Kimberly Jenkins
Thomas J. Papadimos
Serkan Toy
Kris Brickman
Anthony Braida
Brent Altenhof
Scott M. Pappada

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Background Medical errors remain a leading cause of preventable harm, yet current competency assessments often rely on subjective evaluations that overlook critical performance indicators, particularly learners' responses to clinical stress. Although physiological stress markers have been linked to performance outcomes, no widely adopted or scalable framework has integrated these biomarkers with performance data to identify learners requiring additional training before real-world practice. Methods This prospective observational study developed machine learning models to classify clinical competency using multimodal data from healthcare learners. Data were collected from 152 learners (74 Emergency Medicine residents, 70 Anesthesiology residents, 8 Emergency Medical Services students) across 470 high-fidelity simulation scenarios. A multimodal assessment platform synchronized physiological signals (electrodermal activity, heart rate, skin temperature) from Empatica E4 wristbands with expert evaluations. A genetic algorithm was employed for feature selection, and neural network models were evaluated using multiple leave-N-out strategies to assess generalizability across learners and scenarios. Results The neural network achieved 84–85% balanced accuracy across thresholds 0.45–0.70, with sensitivity 93.3–95.4% and specificity 72.9–76.2%. Despite class imbalance (80.6% competent, 19.4% novice), performance remained robust, with Matthew's correlation coefficients of 0.687–0.706 and precision–recall area-under-the-curve (PR-AUC) values of 0.969–0.970 across thresholds. Conclusions This study demonstrates that integrating physiological metrics with machine learning supports objective, data-driven competency assessment. By capturing stress-performance relationships that traditional evaluations often overlook, this framework may provide an early warning system to identify learners who may require additional training and lay the foundation for more precise, data-informed medical education.

Version published to 10.21203/rs.3.rs-8842924/v1 on Research Square
Feb 22, 2026

Using Virtual Learning Environments to Improve the Quality and Availability of Clinical Examination Education for Medical Students

This article has 2 authors:
1. David Hewitt
2. Michael Ratcliffe
This article has no evaluationsLatest version Feb 17, 2026
Quantifying Learning Curves in Ultrasound Training: A Real-Time Consultation Analysis Using a Novel Half-Life Metric

This article has 5 authors:
1. Ying Wang
2. Yahong Wang
3. Xiao Yang
4. Sheng Cai
5. Li Jianchu
This article has no evaluationsLatest version Jan 9, 2026
Risk Prediction in Spine Surgery: Traditional Models, Artificial Intelligence, and the Challenge of Clinical Translation

This article has 5 authors:
1. Samer Salman
2. Rohan Phadke
3. Rahul Kumar
4. Arbaz Momin
5. Alireza Tavakkoli
This article has no evaluationsLatest version Jan 8, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Using Virtual Learning Environments to Improve the Quality and Availability of Clinical Examination Education for Medical Students

Quantifying Learning Curves in Ultrasound Training: A Real-Time Consultation Analysis Using a Novel Half-Life Metric

Risk Prediction in Spine Surgery: Traditional Models, Artificial Intelligence, and the Challenge of Clinical Translation