arXiv Open Access 2023

German FinBERT: A German Pre-trained Language Model

Moritz Scherrmann

Lihat Sumber

Abstrak

This study presents German FinBERT, a novel pre-trained German language model tailored for financial textual data. The model is trained through a comprehensive pre-training process, leveraging a substantial corpus comprising financial reports, ad-hoc announcements and news related to German companies. The corpus size is comparable to the data sets commonly used for training standard BERT models. I evaluate the performance of German FinBERT on downstream tasks, specifically sentiment prediction, topic recognition and question answering against generic German language models. My results demonstrate improved performance on finance-specific data, indicating the efficacy of German FinBERT in capturing domain-specific nuances. The presented findings suggest that German FinBERT holds promise as a valuable tool for financial text analysis, potentially benefiting various applications in the financial domain.

Topik & Kata Kunci

cs.CL stat.ML

Penulis (1)

Moritz Scherrmann

Format Sitasi

APA MLA BibTeX

Scherrmann, M. (2023). German FinBERT: A German Pre-trained Language Model. https://arxiv.org/abs/2311.08793

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2023
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓