arXiv Open Access 2025

Structure-Aware Fusion with Progressive Injection for Multimodal Molecular Representation Learning

Zihao Jing Yan Sun Yan Yi Li Sugitha Janarthanan Alana Deng +1 lainnya
Lihat Sumber

Abstrak

Multimodal molecular models often suffer from 3D conformer unreliability and modality collapse, limiting their robustness and generalization. We propose MuMo, a structured multimodal fusion framework that addresses these challenges in molecular representation through two key strategies. To reduce the instability of conformer-dependent fusion, we design a Structured Fusion Pipeline (SFP) that combines 2D topology and 3D geometry into a unified and stable structural prior. To mitigate modality collapse caused by naive fusion, we introduce a Progressive Injection (PI) mechanism that asymmetrically integrates this prior into the sequence stream, preserving modality-specific modeling while enabling cross-modal enrichment. Built on a state space backbone, MuMo supports long-range dependency modeling and robust information propagation. Across 29 benchmark tasks from Therapeutics Data Commons (TDC) and MoleculeNet, MuMo achieves an average improvement of 2.7% over the best-performing baseline on each task, ranking first on 22 of them, including a 27% improvement on the LD50 task. These results validate its robustness to 3D conformer noise and the effectiveness of multimodal fusion in molecular representation. The code is available at: github.com/selmiss/MuMo.

Topik & Kata Kunci

Penulis (6)

Z

Zihao Jing

Y

Yan Sun

Y

Yan Yi Li

S

Sugitha Janarthanan

A

Alana Deng

P

Pingzhao Hu

Format Sitasi

Jing, Z., Sun, Y., Li, Y.Y., Janarthanan, S., Deng, A., Hu, P. (2025). Structure-Aware Fusion with Progressive Injection for Multimodal Molecular Representation Learning. https://arxiv.org/abs/2510.23640

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓