Discovering Chemical Space from First Principles with Reinforcement Learning

Bjarke Hastrup
Francois Cornet
Tejs Vegge
Arghya Bhowmik

Abstract

Discovering novel stable molecules without training data remains a grand scientific challenge. Current molecular generative models are trained on large, pre-curated datasets, which introduce biases and limit exploration of novel chemistry. In contrast, we propose a new paradigm: autonomous, generalized agents capable of mapping vast, unknown chemical spaces without any pretraining. For the first time, we present a self-guided agent that autonomously constructs valid 3D isomers under stoichiometric constraints and is trained exclusively online using reinforcement learning. Unlike existing approaches that generally overfit to a specific chemical formula, we establish a multi-composition training scheme that enables broad generalization across diverse chemistries, guided by energy- and validity-based rewards. Our agent can discover up to an order of magnitude more valid isomers on unseen test formulas than the baseline. These results fulfil the promise of online RL as a powerful paradigm for scalable tabula rasa exploration of the chemical configuration space.

Version published to 10.21203/rs.3.rs-6900238/v1 on Research Square
Jun 17, 2025

Related articles

PURE: Policy-guided Unbiased REpresentations for structure-constrained molecular generation

This article has 7 authors:
1. Abhor Gupta
2. Barathi Lenin
3. Sean Current
4. Rohit Batra
5. Balaraman Ravindran
6. Karthik Raman
7. Srinivasan Parthasarathy
This article has no evaluations. Latest version: May 24, 2025

Large Language Models as Materials Science Adapted Learners

This article has 11 authors:
1. Tong Xie
2. Yuwei Wan
3. Yixuan Liu
4. Yuchen Zeng
5. Shaozhou Wang
6. Wenjie Zhang
7. Clara Grazian
8. Chunyu Kit
9. Wanli Ouyang
10. Dongzhan Zhou
11. Bram Hoex
This article has no evaluations. Latest version: Jul 7, 2025

Learning to Explore Tree Neighbourhoods for Phylogenetic Inference

This article has 4 authors:
1. Federico Julian Camerota Verdù
2. Andrea Gasparin
3. Luca Bortolussi
4. Lorenzo Castelli
This article has no evaluations. Latest version: Jun 25, 2025