LUCID: Intelligent Informative Frame Selection in Otoscopy for Enhanced Diagnostic Utility
Abstract
Accurate diagnosis of middle ear diseases, such as acute otitis media (AOM), remains a clinical challenge due to the reliance on subjective visual assessment through otoscopy. While deep learning has shown promise in improving diagnostic accuracy using digital otoscopy videos, existing models often rely on manually selected still frames, a step that reduces their practicality in real-world clinical workflows. In this study, we present LUCID, the first systematic method for automatic most-informative-frame (MIF) selection in otoscopy videos. Through analysis of 713 videos, we identified three key factors that determine frame informativeness: eardrum visibility, eardrum coverage, and image clarity. We then developed a novel MIF pipeline that integrates (1) a ResNet-50 classifier trained on over 38,000 labeled frames to assess eardrum visibility, (2) a binary-adversarial CAM (BC-AdvCAM) method for weakly supervised eardrum segmentation and coverage estimation, and (3) a specialized blur and focus detection algorithm tailored to otoscope imagery. These components are combined into an "informative score" used to rank frames automatically. Comparative evaluations using human reviewers and diagnostic AI models show that frames selected by our AI method perform comparably to expert-selected frames, achieving similar classification accuracy across multiple deep learning architectures. Notably, using the top four frames per video identified by our method significantly improves diagnostic accuracy over using a single expert-selected frame. This framework offers a scalable, expert-level tool for automating key frame selection and enhancing AI-based otoscopy diagnosis. The code is available at https://github.com/CAIR-LAB-WFUSM/informatic_frame_selction.git
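To make the ranking step concrete, the sketch below shows one way the three per-frame factors could be fused into a single score and used to pick the top frames. The function names, the simple unweighted-sum fusion, and the toy values are illustrative assumptions; the abstract does not specify how LUCID combines the components.

```python
def informative_score(visibility, coverage, clarity):
    """Fuse the three per-frame factors into one score.

    Assumption: each input is normalized to [0, 1] (e.g. classifier
    probability for visibility, segmented-area fraction for coverage,
    a sharpness measure for clarity) and the fusion is an unweighted
    sum; the actual combination used by LUCID may differ.
    """
    return visibility + coverage + clarity


def rank_frames(scores, top_k=4):
    """Return the indices of the top_k frames, highest score first."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return order[:top_k]


# Toy example: five frames, each with (visibility, coverage, clarity).
frames = [(0.9, 0.8, 0.7), (0.2, 0.5, 0.9), (0.95, 0.9, 0.85),
          (0.6, 0.4, 0.3), (0.8, 0.85, 0.9)]
scores = [informative_score(*f) for f in frames]
top4 = rank_frames(scores, top_k=4)  # indices of the four best frames
```

Selecting the top four frames this way mirrors the evaluation in the abstract, where using the four highest-ranked frames per video outperformed a single expert-selected frame.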