Overcoming the Curse of Dimensionality with Synolitic AI
Abstract
In this study, we present a systematic evaluation of Synolitic Graph Neural Networks (SGNNs), a novel framework that transforms high-dimensional tabular data into sample-specific graphs using ensembles of low-dimensional pairwise classifiers. We demonstrate that augmenting these graphs with topology-aware node descriptors (such as degree, strength, closeness, and betweenness centrality) and applying graph sparsification, either via minimum spanning connectivity or fixed-probability edge retention, can significantly improve classification performance. We evaluate both convolution-based (GCN) and attention-based (GATv2) graph neural networks across two training regimes: a foundation-model setting in which multiple datasets are concatenated, and dataset-specific training. Results show that attention-based models generally achieve superior performance across classification tasks: in the foundation regime, dense (non-sparsified) graphs with node features yield 92.83 ROC-AUC for GATv2 and 92.34 for GCN (vs. 90.80 for XGBoost), and in the dataset-specific regime, GATv2 with minimal connectivity and node features reaches 88.96 ROC-AUC (vs. 86.84 for XGBoost). A leave-one-dataset-out evaluation further indicates out-of-domain transfer to previously unseen datasets (mean ROC-AUC: 0.78 with node features; 0.71 with maximum-threshold sparsification; 0.70 without features). Importantly, we demonstrate that the SGNN framework can overcome the curse of dimensionality, outperforming traditional machine learning models such as XGBoost in scenarios where the number of features exceeds the number of training samples, maintaining ROC-AUC above 80% with only 5% of the training data while XGBoost drops to 60%. Furthermore, SGNNs are robust to feature redundancy and correlation: duplicating all features and adding noise produce only minor deviations in performance, reducing the need for manual feature engineering or dimensionality reduction. Across all settings, SGNNs enhanced with node features consistently outperform XGBoost baselines, underscoring the effectiveness of integrating graph-based structural representations, topology-aware augmentation, and controlled sparsification in classification tasks.
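To make the graph-construction step concrete, a minimal Python sketch is given below, assuming scikit-learn and NetworkX. It fits one logistic-regression classifier per feature pair and uses that classifier's predicted positive-class probability as the edge weight for a given sample, then computes the degree, strength, closeness, and betweenness descriptors mentioned above. The function name build_synolitic_graph, the choice of logistic regression as the pairwise classifier, and the maximum-spanning-tree variant of sparsification are illustrative assumptions, not the authors' exact implementation.

import itertools
import numpy as np
import networkx as nx
from sklearn.linear_model import LogisticRegression

def build_synolitic_graph(X_train, y_train, x_sample, sparsify_mst=False):
    """Build one sample-specific graph plus topology-aware node features."""
    n_features = X_train.shape[1]
    G = nx.Graph()
    G.add_nodes_from(range(n_features))

    # One low-dimensional (two-feature) classifier per feature pair; in practice
    # these would be fitted once and reused for every sample.
    for i, j in itertools.combinations(range(n_features), 2):
        clf = LogisticRegression(max_iter=1000).fit(X_train[:, [i, j]], y_train)
        # Edge weight: the pairwise classifier's positive-class probability
        # evaluated on this particular sample.
        w = clf.predict_proba(x_sample[[i, j]].reshape(1, -1))[0, 1]
        G.add_edge(i, j, weight=w)

    if sparsify_mst:
        # Minimal-connectivity sparsification: keep a maximum spanning tree,
        # i.e. the strongest edges that still connect every node.
        G = nx.maximum_spanning_tree(G, weight="weight")

    # Topology-aware node descriptors used as GNN node features.
    degree = dict(G.degree())
    strength = dict(G.degree(weight="weight"))
    closeness = nx.closeness_centrality(G)
    betweenness = nx.betweenness_centrality(G)
    node_features = np.array(
        [[degree[v], strength[v], closeness[v], betweenness[v]] for v in G.nodes()]
    )
    return G, node_features

In such a sketch, each resulting graph and node-feature matrix would then be fed to a GCN or GATv2 classifier; dense graphs correspond to sparsify_mst=False, while fixed-probability edge retention would instead keep only edges whose weights exceed a chosen threshold.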