Chain-of-Skills: A Configurable Model for Open-Domain Question Answering

Kaixin Ma; Hao Cheng; Yu Zhang; Xiaodong Liu; Eric Nyberg; Jianfeng Gao

doi:10.18653/v1/2023.acl-long.89

Chain-of-Skills: A Configurable Model for Open-Domain Question Answering

Kaixin Ma, Hao Cheng, Yu Zhang, Xiaodong Liu, Eric Nyberg, Jianfeng Gao

Abstract

The retrieval model is an indispensable component for real-world knowledge-intensive tasks, e.g., open-domain question answering (ODQA). As separate retrieval skills are annotated for different datasets, recent work focuses on customized methods, limiting the model transfer- ability and scalability. In this work, we propose a modular retriever where individual modules correspond to key skills that can be reused across datasets. Our approach supports flexible skill configurations based on the target domain to boost performance. To mitigate task interference, we design a novel modularization parameterization inspired by sparse Transformer. We demonstrate that our model can benefit from self-supervised pretraining on Wikipedia and fine-tuning using multiple ODQA datasets, both in a multi-task fashion. Our approach outperforms recent self-supervised retrievers in zero-shot evaluations and achieves state-of-the-art fine-tuned retrieval performance on NQ, HotpotQA and OTT-QA.

Anthology ID:: 2023.acl-long.89
Volume:: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2023
Address:: Toronto, Canada
Editors:: Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 1599–1618
Language:
URL:: https://aclanthology.org/2023.acl-long.89/
DOI:: 10.18653/v1/2023.acl-long.89
Bibkey:
Cite (ACL):: Kaixin Ma, Hao Cheng, Yu Zhang, Xiaodong Liu, Eric Nyberg, and Jianfeng Gao. 2023. Chain-of-Skills: A Configurable Model for Open-Domain Question Answering. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1599–1618, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):: Chain-of-Skills: A Configurable Model for Open-Domain Question Answering (Ma et al., ACL 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.acl-long.89.pdf
Video:: https://aclanthology.org/2023.acl-long.89.mp4

PDF Cite Search Video Fix data