
Transformer Encoder for Social Science

Haosen Ge, In Young Park, Xuancheng Qian, Grace Zeng

Abstract

High-quality text data has become an important data source for social scientists. We have witnessed the success of pretrained deep neural network models, such as BERT and RoBERTa, in recent social science research. In this paper, we propose a compact pretrained deep neural network, Transformer Encoder for Social Science (TESS), explicitly designed to tackle text processing tasks in social science research. Using two validation tests, we demonstrate that TESS outperforms BERT and RoBERTa by 16.7% on average when the number of training samples is limited (<1,000 training instances). The results show the superiority of TESS over BERT and RoBERTa on social science text processing tasks. Lastly, we discuss the limitations of our model and offer advice for future researchers.
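The abstract's central claim concerns fine-tuning pretrained transformer encoders on small labeled samples. The Python sketch below illustrates that evaluation setting using the Hugging Face Transformers library; it is a minimal illustration under stated assumptions, not the paper's code. "bert-base-uncased" stands in for TESS (no public checkpoint name appears on this page), and the texts, labels, and hyperparameters are hypothetical.

# Minimal sketch: fine-tune a pretrained transformer encoder for text
# classification with a small labeled sample, as in the paper's validation setting.
import torch
from torch.optim import AdamW
from transformers import AutoTokenizer, AutoModelForSequenceClassification

MODEL = "bert-base-uncased"  # stand-in; swap in a TESS checkpoint if one is released

# Hypothetical toy data; the paper's low-resource setting uses <1,000 instances.
texts = ["The senator endorsed the trade bill.",
         "Protests erupted after the election results."]
labels = torch.tensor([0, 1])

tokenizer = AutoTokenizer.from_pretrained(MODEL)
model = AutoModelForSequenceClassification.from_pretrained(MODEL, num_labels=2)

# Tokenize once; real use would batch a few hundred labeled examples.
batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
optimizer = AdamW(model.parameters(), lr=2e-5)

model.train()
for epoch in range(3):  # a few epochs; tiny samples overfit quickly
    optimizer.zero_grad()
    out = model(**batch, labels=labels)  # cross-entropy loss computed internally
    out.loss.backward()
    optimizer.step()
    print(f"epoch {epoch}: loss = {out.loss.item():.4f}")

In this regime, the paper's argument is that a compact encoder pretrained on domain-relevant text needs fewer labeled examples to reach usable accuracy than general-purpose BERT or RoBERTa.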



Citation Format

Ge, H., Park, I. Y., Qian, X., & Zeng, G. (2022). Transformer Encoder for Social Science. arXiv. https://arxiv.org/abs/2208.08005

Journal Information

Year Published: 2022
Language: English
Source Database: arXiv
Access: Open Access