Systematic Evaluation of Label Noise Effects on Accuracy and Calibration in Deep Neural Networks
Abstract
Label noise is a pervasive issue in real-world datasets and can degrade both the accuracy and calibration of deep neural networks. In this study, we systematically examine how symmetric (random) and asymmetric (class-dependent) label noise affect model accuracy and confidence calibration in image classification, using the CIFAR-10 dataset and a ResNet-18 architecture. We apply five levels of label noise (0%, 10%, 20%, 40%, 60%) and evaluate their effects with test accuracy, Expected Calibration Error (ECE), and predictive entropy. Our findings show that increasing noise levels significantly degrade classification accuracy and impair model calibration. In particular, asymmetric noise at a 60% corruption level causes test accuracy to drop to approximately 38.7% while ECE surges above 35%, indicating extreme overconfidence in incorrect predictions. By contrast, symmetric noise at the same level yields higher predictive entropy (uncertainty) and comparatively modest miscalibration (ECE ∼9%). These results highlight the importance of distinguishing between noise types when assessing model robustness and reliability. All experiments are reproducible, with code and data publicly available to facilitate further investigation.
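For concreteness, the sketch below illustrates the two noise models and the evaluation metrics named in the abstract: uniform (symmetric) label flipping, class-dependent (asymmetric) flipping, binned Expected Calibration Error, and mean predictive entropy. It is a minimal illustration under stated assumptions, not the authors' released code; in particular, the asymmetric class-flip mapping shown (e.g. truck→automobile, cat↔dog) is a hypothetical example, since the abstract does not specify the exact mapping used.

```python
import numpy as np

NUM_CLASSES = 10  # CIFAR-10


def inject_symmetric_noise(labels, noise_rate, rng):
    """Flip a fraction `noise_rate` of labels uniformly to any other class."""
    labels = labels.copy()
    n = len(labels)
    flip_idx = rng.choice(n, size=int(noise_rate * n), replace=False)
    for i in flip_idx:
        other_classes = [c for c in range(NUM_CLASSES) if c != labels[i]]
        labels[i] = rng.choice(other_classes)
    return labels


def inject_asymmetric_noise(labels, noise_rate, rng, class_map=None):
    """Flip a fraction `noise_rate` of labels to a fixed, confusable class.
    The default `class_map` is an assumed example (truck->automobile, bird->airplane,
    cat<->dog, deer->horse); the paper's exact mapping is not given in the abstract."""
    if class_map is None:
        class_map = {9: 1, 2: 0, 3: 5, 5: 3, 4: 7}
    labels = labels.copy()
    n = len(labels)
    flip_idx = rng.choice(n, size=int(noise_rate * n), replace=False)
    for i in flip_idx:
        labels[i] = class_map.get(int(labels[i]), labels[i])
    return labels


def expected_calibration_error(confidences, predictions, targets, n_bins=15):
    """Standard binned ECE: weighted mean |accuracy - confidence| over confidence bins."""
    bins = np.linspace(0.0, 1.0, n_bins + 1)
    n = len(confidences)
    ece = 0.0
    for lo, hi in zip(bins[:-1], bins[1:]):
        mask = (confidences > lo) & (confidences <= hi)
        if mask.any():
            acc = (predictions[mask] == targets[mask]).mean()
            conf = confidences[mask].mean()
            ece += (mask.sum() / n) * abs(acc - conf)
    return ece


def predictive_entropy(probs, eps=1e-12):
    """Mean Shannon entropy of the predicted class distributions (higher = more uncertain)."""
    return float(-np.sum(probs * np.log(probs + eps), axis=1).mean())


# Usage example with placeholder labels standing in for the CIFAR-10 training set.
rng = np.random.default_rng(0)
clean_labels = rng.integers(0, NUM_CLASSES, size=50_000)
noisy_sym = inject_symmetric_noise(clean_labels, noise_rate=0.6, rng=rng)
noisy_asym = inject_asymmetric_noise(clean_labels, noise_rate=0.6, rng=rng)
```

At evaluation time, `confidences` and `predictions` would come from the softmax outputs of the trained ResNet-18 on the clean test set, so ECE and predictive entropy measure how well confidence tracks accuracy after training on the corrupted labels.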