arXiv Open Access 2021

The Human Evaluation Datasheet 1.0: A Template for Recording Details of Human Evaluation Experiments in NLP

Anastasia Shimorina Anya Belz
Lihat Sumber

Abstrak

This paper introduces the Human Evaluation Datasheet, a template for recording the details of individual human evaluation experiments in Natural Language Processing (NLP). Originally taking inspiration from seminal papers by Bender and Friedman (2018), Mitchell et al. (2019), and Gebru et al. (2020), the Human Evaluation Datasheet is intended to facilitate the recording of properties of human evaluations in sufficient detail, and with sufficient standardisation, to support comparability, meta-evaluation, and reproducibility tests.

Topik & Kata Kunci

Penulis (2)

A

Anastasia Shimorina

A

Anya Belz

Format Sitasi

Shimorina, A., Belz, A. (2021). The Human Evaluation Datasheet 1.0: A Template for Recording Details of Human Evaluation Experiments in NLP. https://arxiv.org/abs/2103.09710

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2021
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓