Augmentative Semi-Supervised Learning for Autism Screening: A Novel Framework

Rabia Naseer Rao
Hiran Thabrew
Seyed Reza Shahamiri

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Autism Spectrum Disorder (ASD) is a neurodevelopmental condition for which early identification is essential to provide appropriate support and effective treatment. However, current diagnostic methods are resource-intensive and often inaccessible. Artificial Intelligence offers a promising alternative, but its effectiveness is hindered by algorithmic bias arising from data scarcity and imbalanced, largely unlabeled datasets. Such bias can lead to model overfitting, impaired learning, and poor generalization. While semi-supervised learning (SSL) can reduce reliance on manual labels through pseudo-label generation, conventional SSL approaches perform poorly under severe class imbalance, often amplifying label noise and bias. To address these challenges, we propose a novel Augmentative Semi-supervised Learning (ASSL) framework designed for robust learning in the presence of class imbalance and label scarcity. ASSL first applies pattern-based sampling to construct a balanced labeled dataset. It then employs a Collaborative Decision Labeling (CDL) strategy, where two heterogeneous models assign pseudo-labels using Dynamic Dual Thresholding (DDT), retaining only samples jointly and confidently labeled by both models. The framework was applied to the Autism AI dataset (over 12,000 participants), most of whom lacked diagnostic labels, producing severe class imbalance. ASSL improved sensitivity by 15.3%, specificity by 30.2%, and accuracy by 15.9% over conventional screening methods. Next, in external validation on the NHANES diabetes dataset, ASSL achieved a 7.9% gain in sensitivity and better discriminatory performance under imbalance. These results demonstrate that ASSL is a scalable and generalizable approach for limited and imbalanced health data tasks, offering a pathway to reduce algorithmic bias across screening applications.

Version published to 10.21203/rs.3.rs-8600100/v1 on Research Square
Feb 4, 2026

Beyond Transfer Learning: A Generative Self-Supervised Framework for fMRI-Based Diagnosis on Small and Imbalanced Datasets

This article has 4 authors:
1. Ershad Hassanpour Golagani
2. Saeed Masoudnia
3. Ahmad Kalhor
4. Hamid Soltanian-Zadeh
This article has no evaluationsLatest version Mar 16, 2026
Mapping the Landscape of ASD-AI: Multimodal Gains, XAI Adoption, and Fairness Gaps - A Systematic Review

This article has 3 authors:
1. Waqas Ahmad
2. Ashraf Zia
3. Muhammad Zakarya
This article has no evaluationsLatest version Mar 20, 2026
Learning with Imbalance Noisy Labels via Confidence-guided Sample Mixing and Negative Learning

This article has 3 authors:
1. Yu Liu
2. Guanjia Zhang
3. Xiang Wei
This article has no evaluationsLatest version Feb 13, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Beyond Transfer Learning: A Generative Self-Supervised Framework for fMRI-Based Diagnosis on Small and Imbalanced Datasets

Mapping the Landscape of ASD-AI: Multimodal Gains, XAI Adoption, and Fairness Gaps - A Systematic Review

Learning with Imbalance Noisy Labels via Confidence-guided Sample Mixing and Negative Learning