Language Model Applications for Early Diagnosis of Childhood Epilepsy

Jitse Loyens
Geertruida Slinger
Nynke Doornebal
Kees P.J. Braun
Willem M. Otte
Eric van Diessen


Abstract

Objective

Accurate and timely epilepsy diagnosis is crucial to reduce delayed or unnecessary treatment. While language serves as an indispensable source of information for diagnosing epilepsy, its computational analysis remains relatively unexplored. This study assessed and compared the diagnostic value of different language model applications in extracting information and identifying overlooked language patterns from first-visit documentation to improve the early diagnosis of childhood epilepsy.

Methods

We analyzed 1,561 patient letters from two independent first seizure clinics. The dataset was divided into training and test sets to evaluate performance and generalizability. We employed two approaches: an established Naïve Bayes model, a classical natural language processing technique, and a sentence-embedding model based on the Bidirectional Encoder Representations from Transformers (BERT) architecture. Both models analyzed anamnesis data only. Within the training sets, we identified predictive features consisting of keywords indicative of ‘epilepsy’ or ‘no epilepsy’. Model outputs were compared to the clinician’s final diagnosis after follow-up, which served as the gold standard. We computed accuracy, sensitivity, and specificity for both models.
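To make the first approach concrete, the following is a minimal sketch of a bag-of-words Naïve Bayes text classifier, assuming whitespace tokenization and Laplace smoothing. It is not the study's implementation: the class name `NaiveBayesText`, the helper `tokenize`, and the four English training phrases are hypothetical and invented purely for illustration.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on whitespace (an illustrative simplification)."""
    return text.lower().split()


class NaiveBayesText:
    """Multinomial Naive Bayes over word counts with Laplace smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha  # smoothing constant (assumed value)

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        self.vocab = set()
        for text, label in zip(texts, labels):
            tokens = tokenize(text)
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        return self

    def log_scores(self, text):
        """Return the unnormalized log posterior for each class."""
        total_docs = sum(self.class_counts.values())
        vocab_size = len(self.vocab)
        scores = {}
        for label, n_docs in self.class_counts.items():
            total_words = sum(self.word_counts[label].values())
            score = math.log(n_docs / total_docs)  # class prior
            for token in tokenize(text):
                if token not in self.vocab:
                    continue  # ignore words never seen in training
                count = self.word_counts[label][token]
                score += math.log(
                    (count + self.alpha)
                    / (total_words + self.alpha * vocab_size)
                )
            scores[label] = score
        return scores

    def predict(self, text):
        scores = self.log_scores(text)
        return max(scores, key=scores.get)


# Hypothetical toy phrases, NOT taken from the study's patient letters.
train_texts = [
    "rhythmic jerking of both arms with tongue bite",
    "staring spell followed by confusion and jerking",
    "fainted after standing up quickly felt dizzy",
    "dizzy and pale before collapse after standing",
]
train_labels = ["epilepsy", "epilepsy", "no epilepsy", "no epilepsy"]

model = NaiveBayesText().fit(train_texts, train_labels)
print(model.predict("jerking with tongue bite"))
print(model.predict("bite tongue with jerking"))  # same words, shuffled
print(model.predict("felt dizzy after standing"))
```

Because the score is a sum over individual word counts, shuffling the words of a text leaves the prediction unchanged. This is the sense in which a Naïve Bayes model ignores word order, whereas a sentence-embedding model can in principle use it.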

Results

The Naïve Bayes model achieved an accuracy of 0.73 (95% CI: 0.68-0.78), with a sensitivity of 0.79 (95% CI: 0.74-0.85) and a specificity of 0.62 (95% CI: 0.52-0.72). The sentence-embedding model demonstrated comparable performance with an accuracy of 0.74 (95% CI: 0.68-0.79), sensitivity of 0.74 (95% CI: 0.68-0.80), and specificity of 0.73 (95% CI: 0.61-0.84).
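For readers unfamiliar with these metrics, the sketch below shows how accuracy, sensitivity, and specificity follow from confusion-matrix counts, each with a 95% confidence interval. The abstract does not state how the intervals were computed, so the Wilson score interval used here is an assumption, and the counts in the example are hypothetical rather than the study's data. The function names `wilson_interval` and `diagnostic_metrics` are illustrative.

```python
import math


def wilson_interval(successes, total, z=1.96):
    """Wilson score interval for a proportion (z=1.96 gives about 95%)."""
    if total <= 0:
        raise ValueError("total must be positive")
    p = successes / total
    denom = 1 + z ** 2 / total
    center = (p + z ** 2 / (2 * total)) / denom
    half_width = (
        z * math.sqrt(p * (1 - p) / total + z ** 2 / (4 * total ** 2)) / denom
    )
    return center - half_width, center + half_width


def diagnostic_metrics(tp, fp, tn, fn):
    """Return {metric: (point estimate, (lower, upper))} from raw counts."""
    n = tp + fp + tn + fn
    return {
        # correct predictions among all patients
        "accuracy": ((tp + tn) / n, wilson_interval(tp + tn, n)),
        # epilepsy cases the model correctly flags
        "sensitivity": (tp / (tp + fn), wilson_interval(tp, tp + fn)),
        # non-epilepsy cases the model correctly clears
        "specificity": (tn / (tn + fp), wilson_interval(tn, tn + fp)),
    }


# Hypothetical counts for illustration only.
metrics = diagnostic_metrics(tp=40, fp=20, tn=30, fn=10)
for name, (value, (low, high)) in metrics.items():
    print(f"{name}: {value:.2f} (95% CI: {low:.2f}-{high:.2f})")
```

Sensitivity and specificity are computed on subsets of the test set (patients with and without epilepsy, respectively), so their intervals are generally wider than the interval for accuracy, as in the reported results.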

Conclusion

Both models demonstrated relatively good performance in diagnosing childhood epilepsy based solely on first-visit patient anamnesis text. Notably, the more advanced sentence-embedding model showed no significant improvement over the computationally simpler Naïve Bayes model. Because the Naïve Bayes model ignores word order, this suggests that modeling of anamnesis data may not depend strongly on word order for this particular classification task. Further refinement and exploration of language models and computational linguistic approaches are necessary to enhance diagnostic accuracy in clinical practice.

Version published to 10.1101/2025.01.31.25321308 on medRxiv
Feb 3, 2025
