Silent collapse in large neural networks: standard evaluation conceals systematic reasoning failure
Abstract
Fine-tuned neural networks can achieve near-perfect scores on standard benchmarks while systematically relying on spurious shortcuts rather than genuine reasoning—a phenomenon we term ‘silent collapse’. Through controlled experiments across four architecture families (86M–14B parameters), six tasks, and two modalities, we show that silent collapse becomes more severe with increasing model scale: larger models require progressively tighter training constraints to maintain genuine reasoning capability, with the optimal trainable fraction falling from ~50% at 160M to ~15% at 6.9B parameters. We experimentally tested two prospective predictions on models of up to 14 billion parameters; the results were largely consistent with the predicted trends. Evaluation of widely deployed models reveals that a leading NLI classifier achieves 90% on standard benchmarks yet performs at chance level under adversarial evaluation (I_wild = 0.37). Together, these results show that standard benchmarks can be non-diagnostic for shortcut reliance at scale, and that calibrated constraint provides a practical way to make fine-tuning outcomes reliably reproducible.