Semantic Scholar Open Access 2021 64 sitasi

Local Interpretations for Explainable Natural Language Processing: A Survey

Siwen Luo Hamish Ivison S. Han Josiah Poon

Lihat Sumber DOI

Abstrak

As the use of deep learning techniques has grown across various fields over the past decade, complaints about the opaqueness of the black-box models have increased, resulting in an increased focus on transparency in deep learning models. This work investigates various methods to improve the interpretability of deep neural networks for Natural Language Processing (NLP) tasks, including machine translation and sentiment analysis. We provide a comprehensive discussion on the definition of the term interpretability and its various aspects at the beginning of this work. The methods collected and summarised in this survey are only associated with local interpretation and are specifically divided into three categories: (1) interpreting the model’s predictions through related input features; (2) interpreting through natural language explanation; (3) probing the hidden states of models and word representations.

Topik & Kata Kunci

Computer Science

Penulis (4)

Siwen Luo

Hamish Ivison

S. Han

Josiah Poon

Format Sitasi

APA MLA BibTeX

Luo, S., Ivison, H., Han, S., Poon, J. (2021). Local Interpretations for Explainable Natural Language Processing: A Survey. https://doi.org/10.1145/3649450

Akses Cepat

Lihat di Sumber doi.org/10.1145/3649450

Informasi Jurnal

Tahun Terbit: 2021
Bahasa: en
Total Sitasi: 64×
Sumber Database: Semantic Scholar
DOI: 10.1145/3649450
Akses: Open Access ✓