TabularGRPO: Modern Mixture-of-Experts Transformer with Group Relative Policy Optimization (GRPO) for Tabular Data Learning

Abstract

Tabular data remains the cornerstone of decision-making in healthcare, finance, and industrial analytics. We propose TabularGRPO, a novel reinforcement learning framework that synergizes Mixture-of-Experts (MoE) architectures with variance-reduced policy gradients. TabularGRPO addresses three fundamental challenges in tabular learning: 1) feature-type heterogeneity, through dynamic expert routing; 2) class imbalance, via group-wise advantage normalization; and 3) sample inefficiency, with KL-regularized policy updates. Evaluations on challenging datasets demonstrate TabularGRPO's superiority over currently dominant models such as XGBoost and CatBoost, with 6.0% higher precision and 13.0% higher F1 score, establishing new state-of-the-art performance. The code and benchmarks we used to train and evaluate our models are publicly available at https://github.com/enkhtogtokh/tabulargrpo
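To make the GRPO components named above concrete, the following is a minimal NumPy sketch of the two generic ingredients the abstract mentions: group-wise advantage normalization and a KL-regularized policy update. It reflects the standard GRPO formulation, not the authors' released implementation; the function names, the clipping range, and the `beta` coefficient are illustrative assumptions.

```python
import numpy as np

def group_relative_advantages(rewards, group_size):
    # Group-wise advantage normalization: within each sampled group,
    # A_i = (r_i - mean(group)) / std(group).
    groups = np.asarray(rewards, dtype=float).reshape(-1, group_size)
    mean = groups.mean(axis=1, keepdims=True)
    std = groups.std(axis=1, keepdims=True) + 1e-8  # guard against zero variance
    return ((groups - mean) / std).reshape(-1)

def grpo_loss(log_probs, old_log_probs, advantages, ref_log_probs, beta=0.04):
    # Clipped policy-gradient surrogate plus a KL penalty toward a
    # reference policy (the "KL-regularized policy update").
    ratio = np.exp(log_probs - old_log_probs)
    clipped = np.clip(ratio, 0.8, 1.2)  # illustrative clip range
    surrogate = np.minimum(ratio * advantages, clipped * advantages)
    # Unbiased per-sample KL estimator: exp(dr) - dr - 1, dr = ref - current.
    delta = ref_log_probs - log_probs
    kl = np.exp(delta) - delta - 1.0
    return -(surrogate - beta * kl).mean()
```

Normalizing advantages within each group rather than across the whole batch is what makes the scheme robust to class imbalance: minority-class groups are scored relative to their own baseline instead of being swamped by majority-class rewards.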
