arXiv Open Access 2026

FiMI: A Domain-Specific Language Model for Indian Finance Ecosystem

Aboli Kathar, Aman Kumar, Anusha Kamath, Araveeti Srujan, Ashish Sharma, +35 others

Abstract

We present FiMI (Finance Model for India), a domain-specialized financial language model developed by the National Payments Corporation of India (NPCI) for Indian digital payment systems. We develop two model variants: FiMI Base and FiMI Instruct. FiMI adapts the Mistral Small 24B architecture through a multi-stage training pipeline, beginning with continuous pre-training on 68 billion tokens of curated financial, multilingual (English, Hindi, Hinglish), and synthetic data. This is followed by instruction fine-tuning and domain-specific supervised fine-tuning focused on multi-turn, tool-driven conversations that model real-world workflows, such as transaction disputes and mandate lifecycle management. Evaluations show that FiMI Base achieves a 20% improvement over the Mistral Small 24B Base model on a finance reasoning benchmark, while FiMI Instruct outperforms the Mistral Small 24B Instruct model by 87% on domain-specific tool-calling. Moreover, FiMI achieves these significant domain gains while maintaining performance comparable to models of similar size on general benchmarks.

Authors (40)

Aboli Kathar
Aman Kumar
Anusha Kamath
Araveeti Srujan
Ashish Sharma
Chandra Bhushan
Divya Sorate
Duddu Prasanth Kumar
Evan Acharya
Harsh Sharma
Hrithik Kadam
Kanishk Singla
Keyur Doshi
Kiran Praveen
Kolisetty Krishna SK
Krishanu Adhikary
Lokesh MPT
Mayurdeep Sonowal
Nadeem Shaikh
Navya Prakash
Nimit Kothari
Nitin Kukreja
Prashant Devadiga
Rakesh Paul
Ratanjeet Pratap Chauhan
Raunak Kalani
Raviraj Joshi
Shamanth MH
Shantanu Pandey
Shubham Soni
Siddharth Dixit
Smriti Jopat
Sunil Patel
Suraj Singh
Suvradip Paul
Tulasi Pilla
Utkarsh Vaidya
Vineeth Nambiar
Vishal Kanvaty
Yatharth Dedhia

Citation Format

Kathar, A., Kumar, A., Kamath, A., Srujan, A., Sharma, A., Bhushan, C. et al. (2026). FiMI: A Domain-Specific Language Model for Indian Finance Ecosystem. https://arxiv.org/abs/2602.05794

Journal Information
Publication Year: 2026
Language: en
Source Database: arXiv
Access: Open Access ✓