arXiv Open Access 2020

GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing

Tao Yu Chien-Sheng Wu Xi Victoria Lin Bailin Wang Yi Chern Tan +4 lainnya
Lihat Sumber

Abstrak

We present GraPPa, an effective pre-training approach for table semantic parsing that learns a compositional inductive bias in the joint representations of textual and tabular data. We construct synthetic question-SQL pairs over high-quality tables via a synchronous context-free grammar (SCFG) induced from existing text-to-SQL datasets. We pre-train our model on the synthetic data using a novel text-schema linking objective that predicts the syntactic role of a table field in the SQL for each question-SQL pair. To maintain the model's ability to represent real-world data, we also include masked language modeling (MLM) over several existing table-and-language datasets to regularize the pre-training process. On four popular fully supervised and weakly supervised table semantic parsing benchmarks, GraPPa significantly outperforms RoBERTa-large as the feature representation layers and establishes new state-of-the-art results on all of them.

Topik & Kata Kunci

Penulis (9)

T

Tao Yu

C

Chien-Sheng Wu

X

Xi Victoria Lin

B

Bailin Wang

Y

Yi Chern Tan

X

Xinyi Yang

D

Dragomir Radev

R

Richard Socher

C

Caiming Xiong

Format Sitasi

Yu, T., Wu, C., Lin, X.V., Wang, B., Tan, Y.C., Yang, X. et al. (2020). GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing. https://arxiv.org/abs/2009.13845

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2020
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓