Geometric Mixture Classifier: A Discriminative Per-Class Mixture of Hyperplanes for Fast, Transparent Classification
Abstract
Many real-world categories are multimodal, with a single class occupying several disjoint regions of feature space. Classical linear models such as logistic regression and linear SVMs impose a single global hyperplane and therefore fail on such data, while kernel SVMs and deep networks can capture multimodality but often sacrifice interpretability, require extensive tuning, or incur high computational cost. We introduce the Geometric Mixture Classifier (GMC), a discriminative model that represents each class as a mixture of hyperplanes. Within a class, GMC combines plane scores using a temperature-controlled soft-OR (log-sum-exp) that smoothly approximates the maximum; across classes, it applies a standard softmax to yield probabilistic posteriors. An optional Random Fourier Features (RFF) mapping equips GMC with nonlinear capacity while keeping inference linear in the number of planes and lifted dimensions. To make GMC practical, we develop a training recipe comprising geometry-aware initialization via k-means, automatic plane budgeting with the silhouette score, alpha-annealing, usage-aware L2 regularization, label smoothing, and early stopping. Experiments on synthetic multimodal benchmarks (moons, circles, anisotropic blobs, two-spirals) and real tabular/image datasets (iris, wine, WDBC breast cancer, digits) show that GMC consistently outperforms linear baselines and k-NN, matches or exceeds RBF-SVM, Random Forests, and compact MLPs, and enables transparent geometric introspection via plane- and class-level responsibility visualizations. Because inference scales linearly in the number of planes and features, GMC is CPU-efficient, requiring only microseconds per example, comparable to or faster than RBF-SVM and compact MLPs. With post-hoc temperature scaling, calibration improves (ECE reduced from 0.06 to 0.02). GMC thus offers a favorable trade-off between accuracy, interpretability, and efficiency: more expressive than linear models, and lighter and more transparent than kernel or deep models.
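For readers who want the scoring rule in executable form, the sketch below illustrates the inference path described above: an optional RFF lift, per-class plane scores combined by a temperature-controlled log-sum-exp soft-OR, and a softmax across classes. This is a minimal NumPy illustration under assumed shapes and parameter names (rff_lift, gmc_predict_proba, W, b, alpha are ours), not the authors' implementation.

```python
import numpy as np

def rff_lift(X, n_features=200, gamma=1.0, seed=0):
    """Optional Random Fourier Features lift approximating an RBF kernel
    exp(-gamma * ||x - y||^2). Names and defaults here are illustrative."""
    rng = np.random.default_rng(seed)
    d = X.shape[1]
    Omega = rng.normal(0.0, np.sqrt(2.0 * gamma), size=(d, n_features))
    tau = rng.uniform(0.0, 2.0 * np.pi, size=n_features)
    return np.sqrt(2.0 / n_features) * np.cos(X @ Omega + tau)

def gmc_predict_proba(X, W, b, alpha=4.0):
    """GMC inference sketch: per-class soft-OR over planes, then softmax.

    X : (n, d)    inputs (optionally already lifted by rff_lift)
    W : (C, K, d) K hyperplane normals for each of C classes
    b : (C, K)    per-plane biases
    alpha : soft-OR temperature; alpha -> inf recovers the hard max
    """
    # Per-plane linear scores s[i, c, k] = <w_{c,k}, x_i> + b_{c,k}
    s = np.einsum('nd,ckd->nck', X, W) + b                       # (n, C, K)
    # Temperature-controlled log-sum-exp over each class's planes,
    # stabilized by subtracting the per-class max before exponentiating
    m = s.max(axis=2, keepdims=True)
    class_logits = m[..., 0] + np.log(
        np.exp(alpha * (s - m)).sum(axis=2)) / alpha             # (n, C)
    # Standard softmax across classes gives probabilistic posteriors
    z = class_logits - class_logits.max(axis=1, keepdims=True)
    p = np.exp(z)
    return p / p.sum(axis=1, keepdims=True)

# Tiny smoke test with random parameters (2 classes, 4 planes each)
if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rff_lift(rng.normal(size=(5, 3)), n_features=64)
    W = rng.normal(size=(2, 4, X.shape[1]))
    b = rng.normal(size=(2, 4))
    print(gmc_predict_proba(X, W, b, alpha=4.0))
```

As alpha grows, the soft-OR approaches a hard max, so each example is effectively scored by its best-matching plane; smaller alpha blends a class's planes more evenly, which is presumably what the alpha-annealing step in the training recipe exploits.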