From manual search to artificial intelligence: A methodological framework for developing patient journey models from electronic health records using natural language processing

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Improving care pathways for chronic conditions requires a holistic, data-driven understanding of the patient journey. This study introduces a novel framework that reconstructs and simulates patient journeys using structured medication data and probabilistic modeling. Chronic spontaneous urticaria (CSU) was selected as a use case, with symptom onset estimated via natural language processing (NLP) and treatment cycles identified through Anatomical Therapeutic Chemical (ATC) code sequences—an approach not previously applied in this context. A Markov model was built to simulate transitions across health states using de-identified EHR data from 74,664 CSU patients. Results revealed significant delays in diagnosis and variability in treatment initiation, with 29.7% of patients receiving no therapy. The model enables simulation of targeted interventions to streamline care, reduce delays, and optimize resource use. While intentionally simple, the framework is scalable across chronic conditions and supports value-based care through data-driven decision-making.

Article activity feed