Classifying with the Fine Structure of Distributions: Leveraging Distributional Information for Robust and Plausible Naïve Bayes

Quirin Stier
Jörg Hoffmann
Michael C. Thrun

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

In machine learning, the Bayes classifier represents the theoretical optimum for minimizing classifica-tion errors. Since estimating high-dimensional probability densities is impractical, simplified approxima-tions such as naïve Bayes and k-nearest neighbor are widely used as baseline classifiers. Despite their simplicity, these methods require design choices—such as the distance measures in kNN, or the feature independence in naïve Bayes. In particular, naïve Bayes relies on implicit assumptions by using Gaussi-an mixtures or univariate kernel density estimators. Such design choices, however, often fail to capture heterogeneous distributional structures across features. We propose a flexible naïve Bayes classifier that leverages Pareto Density Estimation (PDE), a parame-ter-free, non-parametric approach shown to outperform standard kernel methods in exploratory statis-tics (Thrun et al., 2020). PDE avoids prior distributional assumptions and supports interpretability through visualization of class-conditional likelihoods. In addition, we address a recently described pit-fall of Bayes’ theorem: the misclassification of observations with low evidence. Building on the con-cept of plausible Bayes (Ultsch & Lötsch, 2022), we introduce a safeguard to handle uncertain cases more reliably. While not aiming to surpass state-of-the-art classifiers, our results show that PDE-flexible naïve Bayes with uncertainty handling provides a robust, scalable, and interpretable baseline that can be applied across diverse data scenarios.

Version published to 10.20944/preprints202509.2207.v1
Sep 26, 2025

Bayesian Statistical Hypothesis Testing in the Era of Big Data

This article has 7 authors:
1. carlos barrera-causil
2. juan carlos correa
3. Johny Javier Pambabay Calero
4. Sergio Alex Bauz Olvera
5. Daniel Andres Dıaz-Pachon
6. julian tejada
7. Fernando Marmolejo-Ramos
This article has no evaluationsLatest version Aug 28, 2025
Geometric Mixture Classifier A Discriminative Per-Class Mixture of Hyperplanes for Fast, Transparent Classification

This article has 1 author:
1. Prasanth K K
This article has no evaluationsLatest version Oct 1, 2025
Clustering by non-parametric smoothing

This article has 1 author:
1. David Hofmeyr
This article has no evaluationsLatest version Oct 1, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Bayesian Statistical Hypothesis Testing in the Era of Big Data

Geometric Mixture Classifier A Discriminative Per-Class Mixture of Hyperplanes for Fast, Transparent Classification

Clustering by non-parametric smoothing