Feature Selection by Mutual Information
Abstract
Mutual information (MI), a crucial quantity in statistical inference and an essential tool for data analysis, has been largely overlooked in the statistical literature for seven decades. The working MI formulas that emerged from the analysis of data in the biological, engineering, and physical sciences involve asymmetric expressions of terms appearing in both MI and Shannon entropy, which weakens their effectiveness for statistical inference. The observation that the three principles of maximum entropy, maximum likelihood, and minimum MI are equivalent offered new insight into the geometry of data likelihood and established a new framework for statistical inference (Cheng et al., 2008, 2010). In contrast to existing methods, the resulting approach to data analysis is built on MI identities and the fundamental Pythagorean law of conditional MI. This article presents the new methodology by elaborating its application to feature selection in genetics for predicting patients with depressive disorders.
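To make the feature-selection setting concrete, the sketch below ranks discrete candidate features by their empirical mutual information with a class label. This is a minimal illustration of MI-based feature scoring in general, not the MI-identity methodology of the article; the variable names and toy data are invented for the example.

```python
# Minimal sketch (not the article's method): rank discrete features
# by empirical mutual information with a binary class label.
import numpy as np

def mutual_information(x, y):
    """Empirical MI (in nats) between two discrete 1-D arrays,
    computed from the plug-in joint and marginal frequencies."""
    x = np.asarray(x)
    y = np.asarray(y)
    mi = 0.0
    for xv in np.unique(x):
        for yv in np.unique(y):
            p_xy = np.mean((x == xv) & (y == yv))  # joint frequency
            p_x = np.mean(x == xv)                 # marginal of x
            p_y = np.mean(y == yv)                 # marginal of y
            if p_xy > 0:
                mi += p_xy * np.log(p_xy / (p_x * p_y))
    return mi

# Toy data: feature A determines the label; feature B is independent noise.
label  = np.array([0, 0, 0, 0, 1, 1, 1, 1])
feat_a = np.array([0, 0, 0, 0, 1, 1, 1, 1])  # perfectly informative
feat_b = np.array([0, 1, 0, 1, 0, 1, 0, 1])  # carries no label information

scores = {"A": mutual_information(feat_a, label),
          "B": mutual_information(feat_b, label)}
best = max(scores, key=scores.get)
print(best, round(scores["A"], 4), round(scores["B"], 4))
# → A 0.6931 0.0  (MI of A equals log 2; MI of B is zero)
```

A genetics application would apply the same scoring to (discretized) marker values against a diagnosis label and retain the top-ranked features; the article's contribution concerns the MI identities and the conditional-MI Pythagorean law that make such conditioning steps rigorous.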