Semantic Scholar Open Access 2017 784 sitasi

Crowdsourcing Multiple Choice Science Questions

Johannes Welbl Nelson F. Liu Matt Gardner

Lihat Sumber DOI

Abstrak

We present a novel method for obtaining high-quality, domain-targeted multiple choice questions from crowd workers. Generating these questions can be difficult without trading away originality, relevance or diversity in the answer options. Our method addresses these problems by leveraging a large corpus of domain-specific text and a small set of existing questions. It produces model suggestions for document selection and answer distractor choice which aid the human question generation process. With this method we have assembled SciQ, a dataset of 13.7K multiple choice science exam questions. We demonstrate that the method produces in-domain questions by providing an analysis of this new dataset and by showing that humans cannot distinguish the crowdsourced questions from original questions. When using SciQ as additional training data to existing questions, we observe accuracy improvements on real science exams.

Topik & Kata Kunci

Computer Science Mathematics

Penulis (3)

Johannes Welbl

Nelson F. Liu

Matt Gardner

Format Sitasi

APA MLA BibTeX

Welbl, J., Liu, N.F., Gardner, M. (2017). Crowdsourcing Multiple Choice Science Questions. https://doi.org/10.18653/v1/W17-4413

Akses Cepat

Lihat di Sumber doi.org/10.18653/v1/W17-4413

Informasi Jurnal

Tahun Terbit: 2017
Bahasa: en
Total Sitasi: 784×
Sumber Database: Semantic Scholar
DOI: 10.18653/v1/W17-4413
Akses: Open Access ✓