Multimodal Feature Fusion for Molecular Property Classification

Jing Liu
Yin Wang
Li Xue
Qiaorong Wu
Wenwei Tao
Yiwei Wang
Jianming Wu
Jiesi Luo

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Accurate molecular property prediction is a cornerstone of modern chemical science, driving progress in drug discovery, materials design, and environmental research. Yet, most existing models remain unimodal, while multimodal approaches often rely on simple aggregation, leaving much of the complementary chemical information underexploited. In this work, we present a multimodal feature fusion framework that unites the strengths of deep chemical language processing (CLP) models and molecular fingerprints, integrating sequential and structural representations for more comprehensive molecular characterization. Unlike previous heuristic combinations, our framework systematically investigates the principles of effective cross-modal fusion. We benchmark ten CLP architectures and eight fingerprint types through exhaustive combinatorial search to identify the most synergistic configurations. This exploration shows that aggregating multiple models does not necessarily improve performance; instead, successful fusion requires data-aware design guided by feature integration and complementarity. The proposed strategy effectively couples sequential features learned from SMILES with structural information captured by molecular fingerprints, resulting in a coherent and chemically interpretable molecular representation. Evaluated across 60 datasets from MoleculeNet and TOXRIC, our fusion models deliver consistent and substantial gains over state-of-the-art baselines. Beyond outperforming existing architectures, this work provides conceptual insights and practical guidelines for multimodal fusion in molecular property prediction, highlighting the importance of efficient fusion strategies in building robust and generalizable molecular models.

Version published to 10.21203/rs.3.rs-9239998/v1 on Research Square
Apr 10, 2026

ArcMol Enables Task-Adaptive Spherical Representation Learning for Molecular Property Prediction

This article has 7 authors:
1. Lijuan Chen
2. yurong zou
3. Zhongning Guo
4. Zihan zou
5. Duanyang Qin
6. Dingguo Xu
7. Taijin Wang
This article has no evaluationsLatest version Apr 9, 2026
MDSLabChemBridge: Multi-Engine Molecular Descriptor Generation and ML-Ready Feature Engineering

This article has 1 author:
1. Yogesh Kumar
This article has no evaluationsLatest version Mar 31, 2026
A Multimodal Semi-Supervised Learning Framework for Pharmaceutical Cocrystals Prediction

This article has 3 authors:
1. Sohrab Rohani
2. Mohammad Ghanavati
3. Seyed Mohamad Moosavi
This article has no evaluationsLatest version Mar 30, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

ArcMol Enables Task-Adaptive Spherical Representation Learning for Molecular Property Prediction

MDSLabChemBridge: Multi-Engine Molecular Descriptor Generation and ML-Ready Feature Engineering

A Multimodal Semi-Supervised Learning Framework for Pharmaceutical Cocrystals Prediction