Why Grounded Large Language Models Fail Without Domain-Specialized Retrieval: An Experimental Scientometric Study in Solar Physics
Abstract
Recent applications of large language models in scientometric analysis often assume that the underlying evidence space is neutral and given. In this study, we challenge this assumption by explicitly modeling information retrieval as a causal component that shapes model-based analytical outputs. We propose a three-phase experimental framework that separates the construction of the evidence space via semantic retrieval, the analytical evaluation of that space prior to text generation, and model-based analytical agency under controlled grounding and structural enforcement conditions. We develop and release SciBERT-SolarPhysics-Search, a domain-specialized semantic retriever trained through domain-adaptive pretraining and supervised contrastive fine-tuning, and compare generic and specialized retrieval strategies, showing that specialized retrieval increases domain semantic coverage from 48.2% to 71.6% and conceptual connectivity density from 0.19 to 0.37. We further observe that improvements in retrieval quality and grounding alone do not ensure coherent scientometric outputs. Only configurations combining specialized retrieval, explicit grounding, and structural enforcement reduce the proportion of unsupported analytical statements from 0.62 to 0.08 in the agent-level evaluation. These results indicate that reliable integration of language models into scientometrics depends on explicit control of retrieval infrastructure, evaluation criteria, and analytical constraints.