Natural Language Processing for Phenotyping: A Feasibility Study in Predicting ASA Physical Status from Preoperative Clinical Narratives (Motivated by the Study on ASA Classification Prediction from Preoperative Notes by Chung et al.)

Ricardo Pietrobon
Aline Machiavelli
Luiza Paulsen Rodrigues
Amit Agrey
Lizzy Nkeangnyi
Giselle Zechia
Victor Galvão
Lucas Teixeira

Abstract

This report explores the use of natural language processing to automate clinical phenotyping by predicting the American Society of Anesthesiologists physical status classification based on preoperative evaluation notes. Motivated by a recent study that demonstrated the feasibility of text-based severity prediction, we assess how unstructured clinical narratives can serve as rich, standalone inputs for risk stratification. Using a large, standardized dataset of preoperative notes, we trained and evaluated multiple machine learning models, including random forests, linear classifiers, word embedding models, and transformer-based deep learning architectures. Performance was benchmarked using standard classification metrics, and interpretability was examined through Shapley value analysis of influential clinical terms. The results showed that models using the full narrative outperformed those using isolated sections, with the highest accuracy achieved by domain-adapted deep learning models. Most misclassifications occurred between adjacent severity categories, and reviewer analyses indicated the models often made clinically plausible predictions. Our findings demonstrate that natural language processing can effectively extract phenotypic signals from clinical text, reducing reliance on structured data and manual review. We highlight essential dataset characteristics, model evaluation strategies, and future directions for improving accuracy, generalizability, and clinical adoption of automated text-based phenotyping methods.
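The observation that most misclassifications fell between adjacent severity categories can be checked with a small calculation. The sketch below is illustrative only: it assumes ASA classes are encoded as integers, and the function name `adjacent_error_share` and the label lists are hypothetical, not taken from the study's code or data.

```python
from collections import Counter


def adjacent_error_share(y_true, y_pred):
    """Return (error_rate, share of errors that are off by exactly one class).

    Assumes class labels are ordinal integers, e.g. ASA class 1..6.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("inputs must be non-empty")

    # Count how far each prediction is from the true class.
    gaps = Counter(abs(t - p) for t, p in zip(y_true, y_pred))
    n_errors = sum(count for gap, count in gaps.items() if gap > 0)

    error_rate = n_errors / len(y_true)
    # With no errors at all, report 0.0 rather than dividing by zero.
    adjacent_share = gaps[1] / n_errors if n_errors else 0.0
    return error_rate, adjacent_share


# Hypothetical labels for illustration only (not study data).
y_true = [1, 2, 2, 3, 3, 3, 4, 4]
y_pred = [1, 2, 3, 3, 2, 3, 4, 2]

error_rate, adjacent_share = adjacent_error_share(y_true, y_pred)
print(f"error rate: {error_rate:.3f}, adjacent share of errors: {adjacent_share:.3f}")
```

A high adjacent share would suggest that errors are mostly near-misses on an ordinal scale, which is one way to read the abstract's claim that predictions were often clinically plausible.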

Version published to 10.31219/osf.io/qk7av_v1 on OSF Preprints
May 29, 2025

Related articles

Benchmarking large language models for cardiovascular risk stratification using clinical vignettes

This article has 11 authors:
1. José Ferreira Santos
2. Regina Brito Duarte
3. Inês Mota
4. Rita Carvalheira Santos
5. José Maria Moreira
6. Joana Campos
7. Nuno André Silva
8. Bernardo Neves
9. Ricardo Ladeiras-Lopes
10. Francisca Leite
11. Helder Dores
This article has no evaluations
Latest version Dec 30, 2025

Development and internal validation of a machine learning–based prediction model and simplified screening score for in-hospital falls: a retrospective cohort study

This article has 9 authors:
1. Onishi Tatsuki
2. Tatsuyoshi Ikenoue
3. Norihide Itoh
4. Takumi Nishioka
5. Keima Nagasaka
6. Ryo Okochi
7. Haru Adachi
8. Naoko Matsuo
9. Yoshiya Ueno
This article has no evaluations
Latest version Jan 23, 2026

Benchmarking Ensemble Machine Learning Algorithms for the Early Prediction of Stroke in Imbalanced Clinical Cohorts: A Comparative Analysis and Decision Curve Assessment

This article has 2 authors:
1. Ibrahim Ibrahim Shuaibu
2. Yousaf Hussain
This article has no evaluations
Latest version Jan 22, 2026
