arXiv Open Access 2025

ixi-GEN: Efficient Industrial sLLMs through Domain Adaptive Continual Pretraining

Seonwu Kim Yohan Na Kihun Kim Hanhee Cho Geun Lim +5 lainnya
Lihat Sumber

Abstrak

The emergence of open-source large language models (LLMs) has expanded opportunities for enterprise applications; however, many organizations still lack the infrastructure to deploy and maintain large-scale models. As a result, small LLMs (sLLMs) have become a practical alternative despite inherent performance limitations. While Domain Adaptive Continual Pretraining (DACP) has been explored for domain adaptation, its utility in commercial settings remains under-examined. In this study, we validate the effectiveness of a DACP-based recipe across diverse foundation models and service domains, producing DACP-applied sLLMs (ixi-GEN). Through extensive experiments and real-world evaluations, we demonstrate that ixi-GEN models achieve substantial gains in target-domain performance while preserving general capabilities, offering a cost-efficient and scalable solution for enterprise-level deployment.

Topik & Kata Kunci

Penulis (10)

S

Seonwu Kim

Y

Yohan Na

K

Kihun Kim

H

Hanhee Cho

G

Geun Lim

M

Mintae Kim

S

Seongik Park

K

Ki Hyun Kim

Y

Youngsub Han

B

Byoung-Ki Jeon

Format Sitasi

Kim, S., Na, Y., Kim, K., Cho, H., Lim, G., Kim, M. et al. (2025). ixi-GEN: Efficient Industrial sLLMs through Domain Adaptive Continual Pretraining. https://arxiv.org/abs/2507.06795

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓