Case-level artificial intelligence for multi-photo teledermatology submissions: development and internal validation using patient-submitted dermatology images

Vatsal Pravinbhai Patel
Nishi Seth
Abhijeet Patel
Yash Jayeshbhai Patel

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Background

Store-and-forward teledermatology commonly relies on several patient-submitted photographs of the same concern, but most dermatology artificial intelligence models classify single images independently.

Objective

To develop and internally validate a case-level diagnostic-support model that aggregates multiple patient-submitted photographs for common dermatologic conditions.

Methods

We conducted a retrospective diagnostic-modeling study using the Skin Condition Image Network, a public dataset of deidentified selftaken dermatology images from US adults. We curated 2,336 cases comprising 5,041 images across 10 common inflammatory, allergic, and infectious conditions. Cases were split at the submission level into training, validation, and held-out test sets. Frozen general-purpose and dermatology-specific encoders were compared with image-level classifiers and a gated-attention multiple instance learning model that generated one case-level output from 1–3 images.

Results

The strongest image-level baseline, dermatology-specific embeddings with random forest classification, achieved macro/micro ROCAUCs of 0.797/0.854. Case-level aggregation improved discrimination, with dermatology-specific embeddings plus multiple instance learning achieving mean macro/micro ROC-AUCs of 0.819/0.863 across repeated stratified experiments. The locked final model achieved macro/micro ROCAUCs of 0.800/0.849 on the held-out test set. Balanced-threshold sensitivity/specificity examples were 0.702/0.688 for eczema and 0.818/0.826 for urticaria.

Limitations

Internal validation used a 10-condition subset from a US volunteer dataset; external validation, calibration, subgroup performance analysis, and prospective workflow studies are required.

Conclusion

Modeling the teledermatology submission as a multi-image case better reflects asynchronous dermatology workflow than single-image classification. The model is preliminary clinician-facing support for structured review and triage, not autonomous diagnosis.

Key Points

Store-and-forward teledermatology submissions usually contain multiple patient-submitted photographs, whereas most dermatology AI models classify single images independently.
This study developed a case-level multiple instance learning model that aggregates 1–3 photographs from the same SCIN submission and produces one clinician-facing diagnostic-support output.
Case-level aggregation modestly improved discrimination over the strongest image-level baseline and produced threshold-specific sensitivity/specificity outputs suitable for structured review and triage research.

Version published to 10.64898/2026.05.21.26353816 on medRxiv
Jun 1, 2026

Deep learning-based recognition model for surgical phases of minimally invasive hysterectomy: A multicentre retrospective study

This article has 13 authors:
1. Ryo Koike
2. Shin Takenaka
3. Yukio Suzuki
4. Hiroki Matsuzaki
5. Yuichi Harada
6. Makoto Nakabayashi
7. Yusuke Hirose
8. Kenro Chikazawa
9. Kanae Shimada
10. Eri Yoshiizumi
11. Hiroaki Komatsu
12. Hiroshi Tanabe
13. Koji Matsumoto
This article has no evaluationsLatest version May 17, 2026
AI Decision Support for Challenging Teledermatology Cases: MedGemma Performance in the Dermatology ECHO Program

This article has 5 authors:
1. Jeffrey B. Appiagyei
2. Ruth O. Otu
3. Mollie Henry
4. Benjamin W. Casterline
5. Mirna Becevic
This article has no evaluationsLatest version May 26, 2026
Pixel-Based Skin Tone Estimation on Dermoscopy: A Dual-Rater MST Benchmark and Feasibility Study

This article has 3 authors:
1. Amindu Kumarasinghe
2. Vinh Bui
3. Reza Ghanbarzadeh
This article has no evaluationsLatest version May 17, 2026

Discuss this preprint

Listed in

Abstract

Background

Objective

Methods

Results

Limitations

Conclusion

Key Points

Article activity feed

Related articles

Deep learning-based recognition model for surgical phases of minimally invasive hysterectomy: A multicentre retrospective study

AI Decision Support for Challenging Teledermatology Cases: MedGemma Performance in the Dermatology ECHO Program

Pixel-Based Skin Tone Estimation on Dermoscopy: A Dual-Rater MST Benchmark and Feasibility Study