A Case Study on the Stability of Neural Network Climate Prediction Models with Different Training Stop Criteria

Xiangjun Shi
Ping Zhou
Sirui He

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Due to randomness factors in the machine learning model construction process, reproducibility is compromised. This study investigates the impact of randomness on model stability and evaluates techniques for reducing this impact using the widely adopted shallow neural network model as a testbed. Randomness in this neural network model arises from three events: randomly initializing model parameters, randomly selecting a validation subset, and randomly sampling batches for parameter updates. Among these, batch randomness exerts a much weaker impact than the other two factors. In this study, the model training is stopped when the validation performance fails to improve or when a preset threshold for loss or epoch number is met. The final model stability is considerably better when using threshold criteria than when using validation criterion, as the former avoids the randomness associated with selecting a validation subset. Sensitivity experiments show that scaling the model’s initial parameters (i.e., weights) to 0.1 times their original values can mitigate the impact of initialization randomness, thereby markedly improving model stability while also substantially enhancing predictive skill. Furthermore, weight decay and multi-model ensembles, which are two commonly used techniques, can also markedly enhance model stability. From the perspective of this case study, the compression of model initial parameters yields better improvements in stability compared to weight decay, and unlike multi-model ensemble methods that entail substantial increases in computational cost, it serves as a preferable technique for improving model stability.

Version published to 10.3390/atmos17050523
May 20, 2026
Version published to 10.20944/preprints202604.0821.v1
Apr 13, 2026

Bayesian-Optimized Neural Networks with High-Fidelity FEM for Intelligent Residual Strength Prediction in Damaged Ships

This article has 4 authors:
1. Jianxiao Deng
2. Fei Peng
3. Jinlei Mu
4. Hailiang Hou
This article has no evaluationsLatest version Apr 30, 2026
Performance assessment of statistical and deep learning models for predicting chloride trends in island groundwater systems influenced by saltwater intrusion and sea level rise

This article has 4 authors:
1. Yong Sang Kim
2. Meejoung Kim
3. Ujwalkumar D. Patil
4. Byoungyong Lee
This article has no evaluationsLatest version Apr 9, 2026
Impact of Input Feature Scenarios on Metaheuristic-Optimized LSTM and GRU Networks Applied to Load Forecasting for Building Energy Management

This article has 3 authors:
1. Aline S Lima
2. Mayron P Cardoso
3. Lúcia V R Arruda
This article has no evaluationsLatest version Apr 8, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Bayesian-Optimized Neural Networks with High-Fidelity FEM for Intelligent Residual Strength Prediction in Damaged Ships

Performance assessment of statistical and deep learning models for predicting chloride trends in island groundwater systems influenced by saltwater intrusion and sea level rise

Impact of Input Feature Scenarios on Metaheuristic-Optimized LSTM and GRU Networks Applied to Load Forecasting for Building Energy Management