arXiv Open Access 2024

ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area

Junxian Li Di Zhang Xunzhi Wang Zeying Hao Jingdi Lei +14 lainnya

Lihat Sumber

Abstrak

Large Language Models (LLMs) have achieved remarkable success and have been applied across various scientific fields, including chemistry. However, many chemical tasks require the processing of visual information, which cannot be successfully handled by existing chemical LLMs. This brings a growing need for models capable of integrating multimodal information in the chemical domain. In this paper, we introduce \textbf{ChemVLM}, an open-source chemical multimodal large language model specifically designed for chemical applications. ChemVLM is trained on a carefully curated bilingual multimodal dataset that enhances its ability to understand both textual and visual chemical information, including molecular structures, reactions, and chemistry examination questions. We develop three datasets for comprehensive evaluation, tailored to Chemical Optical Character Recognition (OCR), Multimodal Chemical Reasoning (MMCR), and Multimodal Molecule Understanding tasks. We benchmark ChemVLM against a range of open-source and proprietary multimodal large language models on various tasks. Experimental results demonstrate that ChemVLM achieves competitive performance across all evaluated tasks. Our model can be found at https://huggingface.co/AI4Chem/ChemVLM-26B.

Topik & Kata Kunci

cs.LG cs.CV

Penulis (19)

Junxian Li

Di Zhang

Xunzhi Wang

Zeying Hao

Jingdi Lei

Qian Tan

Cai Zhou

Wei Liu

Yaotian Yang

Xinrui Xiong

Weiyun Wang

Zhe Chen

Wenhai Wang

Wei Li

Shufei Zhang

Mao Su

Wanli Ouyang

Yuqiang Li

Dongzhan Zhou

Format Sitasi

APA MLA BibTeX

Li, J., Zhang, D., Wang, X., Hao, Z., Lei, J., Tan, Q. et al. (2024). ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area. https://arxiv.org/abs/2408.07246

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2024
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓