arXiv Open Access 2026

AdditiveLLM2: A Multi-modal Large Language Model for Additive Manufacturing

Peter Pak Amir Barati Farimani

Lihat Sumber

Abstrak

This work presents AdditiveLLM2 a multi-modal, domain adapted large language model built upon the instruction tuned variant of the Gemma 3 model using a relatively small dataset of around 50 million tokens. The dataset (AdditiveLLM2-OA) consists of open-access additive manufacturing journal articles with data extracted for the domain adaptive pretraining and visual instruction tuning processes. Various stages of the developed model are evaluated with the Additive-Manufacturing-Benchmark which consists of additive manufacturing domain specific tasks compiled published resources. AdditiveLLM2 exhibits proficiency in both language and vision based tasks, achieving accuracies upwards of 90% in general additive manufacturing knowledge. This domain adaptive pretraining and instruction tuning strategy outline an accessible specialization method for large language models to a domain such as additive manufacturing.

Topik & Kata Kunci

cs.LG

Penulis (2)

Peter Pak

Amir Barati Farimani

Format Sitasi

APA MLA BibTeX

Pak, P., Farimani, A.B. (2026). AdditiveLLM2: A Multi-modal Large Language Model for Additive Manufacturing. https://arxiv.org/abs/2603.22017

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2026
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓