arXiv · Open Access · 2022

Pre-trained Sentence Embeddings for Implicit Discourse Relation Classification

Murali Raghu Babu Balusu · Yangfeng Ji · Jacob Eisenstein

Abstract

Implicit discourse relations bind smaller linguistic units into coherent texts. Automatic sense prediction for implicit relations is hard, because it requires understanding the semantics of the linked arguments. Furthermore, annotated datasets contain relatively few labeled examples, due to the scale of the phenomenon: on average each discourse relation encompasses several dozen words. In this paper, we explore the utility of pre-trained sentence embeddings as base representations in a neural network for implicit discourse relation sense classification. We present a series of experiments using both supervised end-to-end trained models and pre-trained sentence encoding techniques: SkipThought, Sent2Vec, and InferSent. The pre-trained embeddings are competitive with the end-to-end model, and the approaches are complementary, with combined models yielding significant performance improvements on two of the three evaluations.
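The abstract does not spell out the classifier architecture, but the general recipe it describes (frozen pre-trained sentence embeddings feeding a neural sense classifier) can be sketched as below. This is a hypothetical illustration, not the paper's exact model: the InferSent-style pair features, the layer sizes, and the RelationSenseClassifier name are all assumptions introduced here.

import torch
import torch.nn as nn

class RelationSenseClassifier(nn.Module):
    """Hypothetical sketch: an MLP over pre-computed (frozen) sentence
    embeddings of the two discourse arguments. Not the paper's exact model."""

    def __init__(self, embed_dim: int, num_senses: int, hidden_dim: int = 300):
        super().__init__()
        # InferSent-style pair features: [u; v; |u - v|; u * v]
        self.mlp = nn.Sequential(
            nn.Linear(4 * embed_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, num_senses),
        )

    def forward(self, arg1: torch.Tensor, arg2: torch.Tensor) -> torch.Tensor:
        # arg1, arg2: (batch, embed_dim) vectors from a frozen pre-trained
        # encoder such as SkipThought, Sent2Vec, or InferSent.
        pair = torch.cat([arg1, arg2, torch.abs(arg1 - arg2), arg1 * arg2], dim=-1)
        return self.mlp(pair)  # unnormalized logits over relation senses

# Toy usage: random vectors stand in for encoder outputs
# (4096 matches InferSent's default embedding size).
model = RelationSenseClassifier(embed_dim=4096, num_senses=4)
u, v = torch.randn(2, 4096), torch.randn(2, 4096)
logits = model(u, v)  # shape: (2, 4)

Because the encoder stays frozen, only the small classifier head is trained, which suits the limited labeled data the abstract points to; the combined models in the abstract pair such pre-trained representations with an end-to-end trained encoder.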


Authors (3)

Murali Raghu Babu Balusu
Yangfeng Ji
Jacob Eisenstein

Citation Format

Balusu, M. R. B., Ji, Y., & Eisenstein, J. (2022). Pre-trained Sentence Embeddings for Implicit Discourse Relation Classification. arXiv:2210.11005. https://arxiv.org/abs/2210.11005

Journal Information
Publication Year: 2022
Language: English (en)
Source Database: arXiv
Access: Open Access ✓