Semantic Scholar Open Access 2023 69 sitasi

A survey on narrative extraction from textual data

Brenda Salenave Santana Ricardo Campos Evelin Amorim A. Jorge Purificação Silvano +1 lainnya

Abstrak

Narratives are present in many forms of human expression and can be understood as a fundamental way of communication between people. Computational understanding of the underlying story of a narrative, however, may be a rather complex task for both linguists and computational linguistics. Such task can be approached using natural language processing techniques to automatically extract narratives from texts. In this paper, we present an in depth survey of narrative extraction from text, providing a establishing a basis/framework for the study roadmap to the study of this area as a whole as a means to consolidate a view on this line of research. We aim to fulfill the current gap by identifying important research efforts at the crossroad between linguists and computer scientists. In particular, we highlight the importance and complexity of the annotation process, as a crucial step for the training stage. Next, we detail methods and approaches regarding the identification and extraction of narrative components, their linkage and understanding of likely inherent relationships, before detailing formal narrative representation structures as an intermediate step for visualization and data exploration purposes. We then move into the narrative evaluation task aspects, and conclude this survey by highlighting important open issues under the domain of narratives extraction from texts that are yet to be explored.

Topik & Kata Kunci

Penulis (6)

B

Brenda Salenave Santana

R

Ricardo Campos

E

Evelin Amorim

A

A. Jorge

P

Purificação Silvano

S

Sérgio Nunes

Format Sitasi

Santana, B.S., Campos, R., Amorim, E., Jorge, A., Silvano, P., Nunes, S. (2023). A survey on narrative extraction from textual data. https://doi.org/10.1007/s10462-022-10338-7

Akses Cepat

Lihat di Sumber doi.org/10.1007/s10462-022-10338-7
Informasi Jurnal
Tahun Terbit
2023
Bahasa
en
Total Sitasi
69×
Sumber Database
Semantic Scholar
DOI
10.1007/s10462-022-10338-7
Akses
Open Access ✓