Single-dataset Experts for Multi-dataset Question Answering

Dan Friedman; Ben Dodge; Danqi Chen

doi:10.18653/v1/2021.emnlp-main.495

Single-dataset Experts for Multi-dataset Question Answering

Abstract

Many datasets have been created for training reading comprehension models, and a natural question is whether we can combine them to build models that (1) perform better on all of the training datasets and (2) generalize and transfer better to new datasets. Prior work has addressed this goal by training one network simultaneously on multiple datasets, which works well on average but is prone to over- or under-fitting different sub- distributions and might transfer worse compared to source models with more overlap with the target dataset. Our approach is to model multi-dataset question answering with an ensemble of single-dataset experts, by training a collection of lightweight, dataset-specific adapter modules (Houlsby et al., 2019) that share an underlying Transformer model. We find that these Multi-Adapter Dataset Experts (MADE) outperform all our baselines in terms of in-distribution accuracy, and simple methods based on parameter-averaging lead to better zero-shot generalization and few-shot transfer performance, offering a strong and versatile starting point for building new reading comprehension systems.

Anthology ID:: 2021.emnlp-main.495
Volume:: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
Month:: November
Year:: 2021
Address:: Online and Punta Cana, Dominican Republic
Editors:: Marie-Francine Moens, Xuanjing Huang, Lucia Specia, Scott Wen-tau Yih
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 6128–6137
Language:
URL:: https://aclanthology.org/2021.emnlp-main.495/
DOI:: 10.18653/v1/2021.emnlp-main.495
Bibkey:
Cite (ACL):: Dan Friedman, Ben Dodge, and Danqi Chen. 2021. Single-dataset Experts for Multi-dataset Question Answering. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pages 6128–6137, Online and Punta Cana, Dominican Republic. Association for Computational Linguistics.
Cite (Informal):: Single-dataset Experts for Multi-dataset Question Answering (Friedman et al., EMNLP 2021)
Copy Citation:
PDF:: https://aclanthology.org/2021.emnlp-main.495.pdf
Video:: https://aclanthology.org/2021.emnlp-main.495.mp4

PDF Cite Search Video Fix data