arXiv Open Access 2024

Economy Watchers Survey Provides Datasets and Tasks for Japanese Financial Domain

Masahiro Suzuki Hiroki Sakaji
Lihat Sumber

Abstrak

Natural language processing (NLP) tasks in English and general domains are widely available and are often used to evaluate pre-trained language models. In contrast, fewer tasks are available for languages other than English and in the financial domain. Particularly, tasks in the Japanese and financial domains are limited. We develop two large datasets using data published by a Japanese central government agency. The datasets provide three Japanese financial NLP tasks, including 3- and 12-class classifications for categorizing sentences, along with a 5-class classification task for sentiment analysis. Our datasets are designed to be comprehensive and updated by leveraging an automatic update framework that ensures that the latest task datasets are publicly always available.

Topik & Kata Kunci

Penulis (2)

M

Masahiro Suzuki

H

Hiroki Sakaji

Format Sitasi

Suzuki, M., Sakaji, H. (2024). Economy Watchers Survey Provides Datasets and Tasks for Japanese Financial Domain. https://arxiv.org/abs/2407.14727

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2024
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓