Application of Machine Learning and Data Augmentation Algorithms in the Discovery of Metal Hydrides for Hydrogen Storage

Giancarlo Beltrame
Erika Michela Dematteis
Vitalie Stavila
Paola Rizzi
Marcello Baricco
Mauro Palumbo

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

The development of efficient and sustainable hydrogen storage materials is a key challenge for realizing hydrogen as a clean and flexible energy carrier. Among various options, metal hydrides offer high volumetric storage density and operational safety, yet their application is limited by thermodynamic, kinetic, and compositional constraints. In this work, we investigate the potential of machine learning (ML) to predict key thermodynamic properties—equilibrium plateau pressure, enthalpy, and entropy of hydride formation—based solely on alloy composition using Magpie-generated descriptors. We significantly expand an existing experimental dataset from ~400 to 806 entries and assess the impact of dataset size and data augmentation, using the PADRE algorithm, on model performance. Models including Support Vector Machines and Gradient Boosted Random Forests were trained and optimized via grid search and cross-validation. Results show a marked improvement in predictive accuracy with increased dataset size, while data augmentation benefits are limited to smaller datasets and do not improve accuracy in underrepresented pressure regimes. Furthermore, clustering and cross-validation analyses highlight the limited generalizability of models across different material classes, though high accuracy is achieved when training and testing within a single hydride family (e.g., AB2). The study demonstrates the viability and limitations of ML for accelerating hydride discovery, emphasizing the importance of dataset diversity and representation for robust property prediction.

Version published to 10.3390/met15111221
Nov 4, 2025
Version published to 10.20944/preprints202508.1673.v1
Aug 22, 2025

Prediction of Thermodynamic Properties of Sacred Pepper (Piper auritum) Using Machine Learning

This article has 5 authors:
1. Cesar Cesar Pérez-Alonso
2. José Alvarez-Ramirez
3. Reyna Natividad
4. Ever Peralta-Reyes
5. Alejandro Regalado-Méndez
This article has no evaluationsLatest version Jan 9, 2026
Inverse Design of High-Entropy Superalloys Using Machine Learning and Generative Artificial Intelligence

This article has 4 authors:
1. François Rousseau
2. Thierry Belmonte
3. Frédéric Sur
4. Alexandre Nominé
This article has no evaluationsLatest version Dec 25, 2025
Machine learning force field molecular dynamics simulation of SEI formation on lithium metal

This article has 4 authors:
1. Atsuo Yamada
2. Norio Takenaka
3. Taiga Iwata
4. Théophane Bernhard
This article has no evaluationsLatest version Dec 12, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Prediction of Thermodynamic Properties of Sacred Pepper (Piper auritum) Using Machine Learning

Inverse Design of High-Entropy Superalloys Using Machine Learning and Generative Artificial Intelligence

Machine learning force field molecular dynamics simulation of SEI formation on lithium metal