Automated Cough Detection System based on Vision Transformers

Keming Tan
Jacky Smith
Patrick Gaydecki

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Cough is a common symptom with significant public health implications. Objective cough detection is crucial disease monitoring, the development of effective therapies and improving patient care. This study aimed to develop an automated, accurate, and generalisable system for detecting cough events using machine learning. We developed an automated cough detection model based on the Vision Transformer (ViT) architecture. The dataset comprising 232 24-hour recordings across nine diagnostic categories was used for model training and evaluation. Recordings were segmented into one-second clips, converted into spectrograms, and classified using the ViT model. We evaluated the model using sensitivity, specificity, precision, and F1 score as key metrics. Our model achieved a sensitivity of 91.5%, specificity of 99.0%, precision of 73.9%, and F1 score of 0.82 on the test dataset, demonstrating strong performance. Bland-Altman analysis revealed an average difference (bias) of 97 cough events per 24h recording. The model effectively automates cough detection, offering a significant efficiency improvement over manual method. Future improvements may involve enhancing preprocessing techniques, reducing model complexity, and refining the labelling process to align better with the model's detection approach.

Version published to 10.21203/rs.3.rs-6268391/v1 on Research Square
May 5, 2025

HealthGuard: AI-powered Early Detection of Tuberculosis using Machine Learning

This article has 5 authors:
1. Rupsa Chakraborty
2. Aditi Chaurasia
3. Sandipan Chatterjee
4. Soumyadeep Mukherjee
5. Prasun Chowdhury
This article has no evaluationsLatest version Jan 6, 2026
Radiographic Pneumonia Detection and Multiclass Classification Using Deep Learning Models

This article has 4 authors:
1. Ayenew Walle Kebede
2. Melesew Mossie Beyene
3. Addisu Taye Tamene
4. Amare Gedif Yalew
This article has no evaluationsLatest version Feb 3, 2026
Voice as a Digital Biomarker: Foundation Model-Based COPD Assessment

This article has 9 authors:
1. Sang Mee Lee
2. Hyein Ryu
3. Sunga Kong
4. Sun Hye Shin
5. Wooseong Huh
6. Myung Jin Chung
7. Juhee Cho
8. Taeyoung Kim
9. Hye Yun Park
This article has no evaluationsLatest version Dec 18, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

HealthGuard: AI-powered Early Detection of Tuberculosis using Machine Learning

Radiographic Pneumonia Detection and Multiclass Classification Using Deep Learning Models

Voice as a Digital Biomarker: Foundation Model-Based COPD Assessment