Medical Diagnosis Coding Automation: Similarity Search vs. Generative AI

Vanessa Klotzman

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Objective

This study aims to predict ICD-10-CM codes for medical diagnoses from short diagnosis descriptions and compare two distinct approaches: similarity search and using a generative model with few-shot learning.

Materials and Methods

The text-embedding-ada-002 model was used to embed textual descriptions of 2023 ICD-10-CM diagnosis codes, provided by the Centers provided for Medicare & Medicaid Services. GPT-4 used few-shot learning. Both models underwent performance testing on 666 data points from the eICU Collaborative Research Database.

Results

The text-embedding-ada-002 model successfully identified the relevant code from a set of similar codes 80% of the time, while GPT-4 achieved a 50 % accuracy in predicting the correct code.

Discussion

The work implies that text-embedding-ada-002 could automate medical coding better than GPT-4, highlighting potential limitations of generative language models for complicated tasks like this.

Conclusion

The research shows that text-embedding-ada-002 outperforms GPT-4 in medical coding, highlighting embedding models’ usefulness in the domain of medical coding.

Version published to 10.1101/2024.04.26.24306470v1 on medRxiv
Apr 29, 2024

Fine-Tuning for Accuracy: Evaluation of GPT for Automatic Assignment of ICD Codes to Clinical Documentation

This article has 9 authors:
1. Khalid Nawab
2. Madalyn Fernbach
3. Sayuj Atreya
4. Samina Asfandiyar
5. Gulalai Khan
6. Riya Arora
7. Iqbal Hussain
8. Shadi Hijjawi
9. Richard Schreiber
This article has no evaluationsLatest version May 10, 2024
Benchmarking Large Language Models for Extraction of International Classification of Diseases Codes from Clinical Documentation

This article has 17 authors:
1. Ashley Simmons
2. Kullaya Takkavatakarn
3. Megan McDougal
4. Brian Dilcher
5. Jami Pincavitch
6. Lukas Meadows
7. Justin Kauffman
8. Eyal Klang
9. Rebecca Wig
10. Gordon Smith
11. Ali Soroush
12. Robert Freeman
13. Donald J Apakama
14. Alexander W Charney
15. Roopa Kohli-Seth
16. Girish N Nadkarni
17. Ankit Sakhuja
This article has no evaluationsLatest version May 3, 2024
Developing and Testing a Framework for Coding General Practitioners' Free-Text Diagnoses in Electronic Medical Records - A Reliability Study for Generating Training Data in Natural Language Processing

This article has 6 authors:
1. Audrey Wallnöfer
2. Jakob M. Burgstaller
3. Katja Weiss
4. Thomas Rosemann
5. Oliver Senn
6. Stefan Markun
This article has no evaluationsLatest version Apr 12, 2024

Listed in

Abstract

Objective

Materials and Methods

Results

Discussion

Conclusion

Article activity feed

Related articles

Fine-Tuning for Accuracy: Evaluation of GPT for Automatic Assignment of ICD Codes to Clinical Documentation

Benchmarking Large Language Models for Extraction of International Classification of Diseases Codes from Clinical Documentation

Developing and Testing a Framework for Coding General Practitioners' Free-Text Diagnoses in Electronic Medical Records - A Reliability Study for Generating Training Data in Natural Language Processing