Effect of Features Burdened with Classical Error, Misclassifications, Berkson Error, and Group-Summary Assignments on Machine Learning Performance and Interpretability
Abstract
Measurement error is pervasive in modern data pipelines, yet its influence on predictive modeling and on variable-importance assessments in machine learning is often underestimated. Our study examines how input measurement error affects both prediction accuracy and interpretability across four common error types: classical error, misclassification of predictive features, Berkson error (arising from algorithmic or model-based assignments), and group-summary (aggregation) assignment errors. Using controlled Monte Carlo simulation experiments covering regression and classification, we evaluate five popular machine learning models (generalized linear models, support vector machines, random forests, XGBoost, and multilayer perceptrons) across a range of error levels. To distinguish the effect of the error itself from optimization issues, hyperparameters are tuned separately for each dataset, and performance is measured using standard predictive metrics and permutation feature importance. Results show that measurement error generally reduces predictive performance and can substantially alter the ranking and estimated contribution of highly predictive features, with degradation patterns that depend on both the error type and the model. These findings clarify when performance declines might be mistaken for other causes, such as distributional shifts, and highlight the importance of diagnosing and correcting measurement error before drawing conclusions about model robustness, generalization, or variable relevance.
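To make the four error mechanisms concrete, the sketch below injects each type into a synthetic feature with NumPy. This is an illustrative reconstruction, not the authors' simulation code: all function names, noise levels, and the group structure are our own assumptions. It also demonstrates the well-known attenuation effect of classical error, where noise in a feature weakens its observed correlation with the outcome.

```python
import numpy as np

rng = np.random.default_rng(0)

def add_classical_error(x, sigma):
    """Classical error: observed = true + independent additive noise."""
    return x + rng.normal(0.0, sigma, size=x.shape)

def misclassify(z, flip_prob):
    """Misclassification: flip binary feature values with probability flip_prob."""
    flips = rng.random(z.shape) < flip_prob
    return np.where(flips, 1 - z, z)

def add_berkson_error(x_assigned, sigma):
    """Berkson error: the TRUE value scatters around the assigned value;
    the model only ever sees x_assigned."""
    return x_assigned + rng.normal(0.0, sigma, size=x_assigned.shape)

def group_summary(x, groups):
    """Group-summary assignment: replace each value with its group mean."""
    out = np.empty_like(x, dtype=float)
    for g in np.unique(groups):
        mask = groups == g
        out[mask] = x[mask].mean()
    return out

# Synthetic data: outcome depends linearly on the true feature.
n = 2000
x_true = rng.normal(size=n)
y = 2.0 * x_true + rng.normal(scale=0.5, size=n)

# Classical error attenuates the observed feature-outcome correlation.
x_classical = add_classical_error(x_true, sigma=1.0)
r_true = np.corrcoef(y, x_true)[0, 1]
r_classical = np.corrcoef(y, x_classical)[0, 1]

# Misclassification of a binary feature.
z = rng.integers(0, 2, size=n)
z_obs = misclassify(z, flip_prob=0.2)

# Group-summary assignment (10 hypothetical groups), a setting in which
# subsequent Berkson error is natural: true values scatter around the
# assigned group mean.
groups = rng.integers(0, 10, size=n)
x_grouped = group_summary(x_true, groups)
x_true_berkson = add_berkson_error(x_grouped, sigma=0.5)
```

Under this setup, `r_classical` falls visibly below `r_true`, previewing the paper's point that error in a highly predictive feature can shrink its apparent contribution and reshuffle importance rankings.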