Scaling Molecular Representation Learning with Hierarchical Mixture-of-Experts

Abstract

Recent advances in large-scale self-supervised pretraining have significantly improved molecular representation learning, yet challenges persist, particularly under distributional shift (e.g., scaffold-based splits). Drawing inspiration from the success of Mixture-of-Experts (MoE) networks in NLP, we introduce H-MoE, a hierarchical MoE model tailored for molecular representation learning. Conventional routing strategies struggle to capture global molecular information, such as scaffold structures, that is crucial for generalization; we therefore propose a hierarchical routing mechanism that first applies scaffold-level structural guidance and then refines molecular characteristics at the atomic level. To optimize expert assignment, we incorporate a scaffold routing contrastive loss that enforces scaffold-consistent routing while preserving discriminability across molecular categories. Furthermore, a curriculum learning approach and a dynamic expert allocation strategy are employed to enhance adaptability. Extensive experiments on molecular property prediction tasks demonstrate the effectiveness of our method in capturing molecular diversity and improving generalization across tasks.
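
The abstract describes the routing design only at a high level. The sketch below illustrates one possible PyTorch realization of a two-stage (scaffold-then-atom) gate over a set of feed-forward experts, plus an InfoNCE-style surrogate for a scaffold-consistent routing contrastive loss. All names (HierarchicalMoE, scaffold_routing_contrastive_loss), the pooled scaffold/atom embeddings, and every hyperparameter are illustrative assumptions, not the paper's implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class HierarchicalMoE(nn.Module):
    """Toy hierarchical MoE layer: a coarse scaffold-level gate is combined
    with a fine atom-level gate to weight a set of feed-forward experts.
    (Assumed structure; not the paper's exact architecture.)"""

    def __init__(self, dim: int, num_experts: int = 8):
        super().__init__()
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(dim, dim), nn.GELU(), nn.Linear(dim, dim))
            for _ in range(num_experts)
        )
        self.scaffold_gate = nn.Linear(dim, num_experts)  # coarse routing from scaffold features
        self.atom_gate = nn.Linear(dim, num_experts)      # fine-grained refinement from atom features

    def forward(self, scaffold_emb, atom_emb):
        # scaffold_emb, atom_emb: (batch, dim) pooled molecule-level representations
        logits = self.scaffold_gate(scaffold_emb) + self.atom_gate(atom_emb)
        gate = F.softmax(logits, dim=-1)                                       # (batch, num_experts)
        expert_out = torch.stack([e(atom_emb) for e in self.experts], dim=1)   # (batch, E, dim)
        return (gate.unsqueeze(-1) * expert_out).sum(dim=1), gate


def scaffold_routing_contrastive_loss(gate, scaffold_ids, temperature=0.1):
    """InfoNCE-style surrogate: molecules sharing a scaffold should receive
    similar routing distributions; molecules with different scaffolds should not."""
    n = gate.size(0)
    g = F.normalize(gate, dim=-1)
    sim = g @ g.t() / temperature                                   # pairwise routing similarity
    eye = torch.eye(n, dtype=torch.bool, device=gate.device)
    pos = (scaffold_ids.unsqueeze(0) == scaffold_ids.unsqueeze(1)) & ~eye
    log_prob = F.log_softmax(sim.masked_fill(eye, float("-inf")), dim=-1)
    # Average log-probability of same-scaffold pairs per anchor molecule.
    per_anchor = -(log_prob.masked_fill(~pos, 0.0).sum(-1) / pos.sum(-1).clamp(min=1))
    has_pos = pos.any(dim=-1)
    return per_anchor[has_pos].mean() if has_pos.any() else gate.new_zeros(())


# Usage with random stand-in embeddings and hypothetical scaffold cluster ids.
moe = HierarchicalMoE(dim=64, num_experts=4)
scaffold_emb, atom_emb = torch.randn(16, 64), torch.randn(16, 64)
scaffold_ids = torch.randint(0, 3, (16,))
out, gate = moe(scaffold_emb, atom_emb)
aux_loss = scaffold_routing_contrastive_loss(gate, scaffold_ids)
```

In the full model, the scaffold-level signal presumably drives a distinct routing stage rather than a simple additive gate; summing the two gate logits here is only a stand-in for that hierarchy, and the contrastive term is shown as an auxiliary loss added to the task objective.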