arXiv Open Access 2020

GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing

Tao Yu Chien-Sheng Wu Xi Victoria Lin Bailin Wang Yi Chern Tan +4 lainnya

Lihat Sumber

Abstrak

We present GraPPa, an effective pre-training approach for table semantic parsing that learns a compositional inductive bias in the joint representations of textual and tabular data. We construct synthetic question-SQL pairs over high-quality tables via a synchronous context-free grammar (SCFG) induced from existing text-to-SQL datasets. We pre-train our model on the synthetic data using a novel text-schema linking objective that predicts the syntactic role of a table field in the SQL for each question-SQL pair. To maintain the model's ability to represent real-world data, we also include masked language modeling (MLM) over several existing table-and-language datasets to regularize the pre-training process. On four popular fully supervised and weakly supervised table semantic parsing benchmarks, GraPPa significantly outperforms RoBERTa-large as the feature representation layers and establishes new state-of-the-art results on all of them.

Topik & Kata Kunci

cs.CL cs.AI

Penulis (9)

Tao Yu

Chien-Sheng Wu

Xi Victoria Lin

Bailin Wang

Yi Chern Tan

Xinyi Yang

Dragomir Radev

Richard Socher

Caiming Xiong

Format Sitasi

APA MLA BibTeX

Yu, T., Wu, C., Lin, X.V., Wang, B., Tan, Y.C., Yang, X. et al. (2020). GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing. https://arxiv.org/abs/2009.13845

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2020
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓