Optimization Methods for Solving Deep Learning Problems: A Case Study of Adaptive Learning Rate Optimizers
Abstract
In this project, we study the impact of optimization methods on deep learning tasks, focusing on adaptive learning rate optimizers (e.g., AdaGrad, RMSProp, and Adam). We describe each optimizer, stating its strengths, weaknesses, and the scenarios in which it excels or underperforms. We take an experimental approach to analyze performance, generalization, computational efficiency, and hyperparameter sensitivity, comparing the adaptive optimizers against a traditional method (stochastic gradient descent, SGD) and a machine learning model with no tunable learning rate (linear discriminant analysis, LDA). Our empirical results show that Adam performs best on both the training and test sets in terms of accuracy, convergence speed, generalization, and computational efficiency.
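To make the abstract's focus concrete, the following is a minimal, self-contained sketch of the Adam update rule discussed in the paper, applied to a simple one-dimensional quadratic. The function name `adam_step`, the learning rate, and the toy objective are illustrative choices, not the authors' experimental setup; the default hyperparameters (beta1 = 0.9, beta2 = 0.999, eps = 1e-8) follow the values commonly cited for Adam.

```python
import numpy as np

def adam_step(theta, grad, m, v, t, lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update. `t` is the 1-indexed step count (hypothetical helper)."""
    # Exponential moving averages of the gradient and its square
    m = beta1 * m + (1 - beta1) * grad
    v = beta2 * v + (1 - beta2) * grad ** 2
    # Bias correction compensates for zero initialization of m and v
    m_hat = m / (1 - beta1 ** t)
    v_hat = v / (1 - beta2 ** t)
    # Per-parameter adaptive step: large second moment -> smaller step
    theta = theta - lr * m_hat / (np.sqrt(v_hat) + eps)
    return theta, m, v

# Toy example: minimize f(x) = x^2, whose gradient is 2x, starting at x = 5
theta, m, v = 5.0, 0.0, 0.0
for t in range(1, 1001):
    theta, m, v = adam_step(theta, 2 * theta, m, v, t, lr=0.1)
print(theta)  # converges toward the minimum at 0
```

The adaptive denominator `sqrt(v_hat) + eps` is what distinguishes Adam (and AdaGrad/RMSProp) from plain SGD: each parameter effectively gets its own learning rate scaled by the recent magnitude of its gradients.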