arXiv Open Access 2025

QianfanHuijin Technical Report: A Novel Multi-Stage Training Paradigm for Finance Industrial LLMs

Shupeng Li, Weipeng Lu, Linyun Liu, Chen Lin, Shaofei Li, +14 others


Abstract

Domain-specific enhancement of Large Language Models (LLMs) within the financial context has long been a focal point of industrial application. While previous models such as BloombergGPT and Baichuan-Finance primarily focused on knowledge enhancement, the deepening complexity of financial services has driven a growing demand for models that possess not only domain knowledge but also robust financial reasoning and agentic capabilities. In this paper, we present QianfanHuijin, a financial domain LLM, and propose a generalizable multi-stage training paradigm for industrial model enhancement. Our approach begins with Continual Pre-training (CPT) on financial corpora to consolidate the knowledge base. This is followed by a fine-grained Post-training pipeline designed with increasing specificity: starting with Financial SFT, progressing to Finance Reasoning RL and Finance Agentic RL, and culminating in General RL aligned with real-world business scenarios. Empirical results demonstrate that QianfanHuijin achieves superior performance across various authoritative financial benchmarks. Furthermore, ablation studies confirm that the targeted Reasoning RL and Agentic RL stages yield significant gains in their respective capabilities. These findings validate our motivation and suggest that this fine-grained, progressive post-training methodology is poised to become a mainstream paradigm for various industrial-enhanced LLMs.
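The stage ordering described in the abstract can be sketched as a simple sequential pipeline. This is only an illustrative outline, not the authors' implementation: the names `Stage`, `PIPELINE`, `run_pipeline`, and `describe` are hypothetical, and the training step is a placeholder that records which stages were applied and in what order.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass(frozen=True)
class Stage:
    name: str
    phase: str  # "pre-training" or "post-training"


# Stage order as listed in the abstract; everything else here is assumed.
PIPELINE = [
    Stage("Continual Pre-training (CPT)", "pre-training"),
    Stage("Financial SFT", "post-training"),
    Stage("Finance Reasoning RL", "post-training"),
    Stage("Finance Agentic RL", "post-training"),
    Stage("General RL", "post-training"),
]


def run_pipeline(checkpoint: str,
                 stages: Sequence[Stage],
                 train_fn: Callable[[str, Stage], str]) -> str:
    """Apply each stage in order, feeding one stage's output into the next."""
    for stage in stages:
        checkpoint = train_fn(checkpoint, stage)
    return checkpoint


def describe(checkpoint: str, stage: Stage) -> str:
    # Placeholder for a real training step: only records the stage name.
    return f"{checkpoint} -> {stage.name}"


if __name__ == "__main__":
    print(run_pipeline("base", PIPELINE, describe))
```

Swapping `describe` for a real training function would keep the same control flow: each stage starts from the checkpoint produced by the previous one, which is the progressive structure the abstract describes.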

Topics & Keywords

cs.CL

Authors (19)

Shupeng Li

Weipeng Lu

Linyun Liu

Chen Lin

Shaofei Li

Zhendong Tan

Hanjun Zhong

Yucheng Zeng

Chenghao Zhu

Mengyue Liu

Daxiang Dong

Jianmin Wu

Yunting Xiao

Annan Li

Danyu Liu

Jingnan Zhang

Licen Liu

Dawei Yin

Dou Shen

Citation Format


Li, S., Lu, W., Liu, L., Lin, C., Li, S., Tan, Z., et al. (2025). QianfanHuijin Technical Report: A Novel Multi-Stage Training Paradigm for Finance Industrial LLMs. https://arxiv.org/abs/2512.24314


Journal Information

Publication Year: 2025
Language: en
Source Database: arXiv
Access: Open Access ✓