arXiv · Open Access · 2022

Pre-trained Sentence Embeddings for Implicit Discourse Relation Classification

Murali Raghu Babu Balusu · Yangfeng Ji · Jacob Eisenstein

Abstract

Implicit discourse relations bind smaller linguistic units into coherent texts. Automatic sense prediction for implicit relations is hard, because it requires understanding the semantics of the linked arguments. Furthermore, annotated datasets contain relatively few labeled examples, due to the scale of the phenomenon: on average each discourse relation encompasses several dozen words. In this paper, we explore the utility of pre-trained sentence embeddings as base representations in a neural network for implicit discourse relation sense classification. We present a series of experiments using both supervised end-to-end trained models and pre-trained sentence encoding techniques: SkipThought, Sent2Vec, and InferSent. The pre-trained embeddings are competitive with the end-to-end model, and the approaches are complementary, with combined models yielding significant performance improvements on two of the three evaluations.
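The abstract does not spell out the classifier architecture, but the general recipe it describes (frozen pre-trained sentence embeddings feeding a neural sense classifier) can be sketched as below. This is a hypothetical illustration, not the paper's exact model: the InferSent-style pair features, the layer sizes, and the RelationSenseClassifier name are all assumptions introduced here.

import torch
import torch.nn as nn

class RelationSenseClassifier(nn.Module):
    """Hypothetical sketch: an MLP over pre-computed (frozen) sentence
    embeddings of the two discourse arguments. Not the paper's exact model."""

    def __init__(self, embed_dim: int, num_senses: int, hidden_dim: int = 300):
        super().__init__()
        # InferSent-style pair features: [u; v; |u - v|; u * v]
        self.mlp = nn.Sequential(
            nn.Linear(4 * embed_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, num_senses),
        )

    def forward(self, arg1: torch.Tensor, arg2: torch.Tensor) -> torch.Tensor:
        # arg1, arg2: (batch, embed_dim) vectors from a frozen pre-trained
        # encoder such as SkipThought, Sent2Vec, or InferSent.
        pair = torch.cat([arg1, arg2, torch.abs(arg1 - arg2), arg1 * arg2], dim=-1)
        return self.mlp(pair)  # unnormalized logits over relation senses

# Toy usage: random vectors stand in for encoder outputs
# (4096 matches InferSent's default embedding size).
model = RelationSenseClassifier(embed_dim=4096, num_senses=4)
u, v = torch.randn(2, 4096), torch.randn(2, 4096)
logits = model(u, v)  # shape: (2, 4)

Because the encoder stays frozen, only the small classifier head is trained, which suits the limited labeled data the abstract points to; the combined models in the abstract pair such pre-trained representations with an end-to-end trained encoder.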


Authors (3)

Murali Raghu Babu Balusu
Yangfeng Ji
Jacob Eisenstein

Citation Format

Balusu, M. R. B., Ji, Y., & Eisenstein, J. (2022). Pre-trained Sentence Embeddings for Implicit Discourse Relation Classification. arXiv:2210.11005. https://arxiv.org/abs/2210.11005

Journal Information
Publication Year: 2022
Language: English (en)
Source Database: arXiv
Access: Open Access ✓