ZS4IE: A toolkit for Zero-Shot Information Extraction with simple Verbalizations

Oscar Sainz, Haoling Qiu, Oier Lopez de Lacalle, Eneko Agirre, Bonan Min


Abstract
The current workflow for Information Extraction (IE) analysts involves the definition of the entities/relations of interest and a training corpus with annotated examples. In this demonstration we introduce a new workflow where the analyst directly verbalizes the entities/relations, which are then used by a Textual Entailment model to perform zero-shot IE. We present the design and implementation of a toolkit with a user interface, as well as experiments on four IE tasks that show that the system achieves very good performance at zero-shot learning using only 5–15 minutes per type of a user’s effort. Our demonstration system is open-sourced at https://github.com/BBN-E/ZS4IE. A demonstration video is available at https://vimeo.com/676138340.
Anthology ID:
2022.naacl-demo.4
Volume:
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: System Demonstrations
Month:
July
Year:
2022
Address:
Hybrid: Seattle, Washington + Online
Editors:
Hannaneh Hajishirzi, Qiang Ning, Avi Sil
Venue:
NAACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
27–38
Language:
URL:
https://aclanthology.org/2022.naacl-demo.4
DOI:
10.18653/v1/2022.naacl-demo.4
Bibkey:
Cite (ACL):
Oscar Sainz, Haoling Qiu, Oier Lopez de Lacalle, Eneko Agirre, and Bonan Min. 2022. ZS4IE: A toolkit for Zero-Shot Information Extraction with simple Verbalizations. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: System Demonstrations, pages 27–38, Hybrid: Seattle, Washington + Online. Association for Computational Linguistics.
Cite (Informal):
ZS4IE: A toolkit for Zero-Shot Information Extraction with simple Verbalizations (Sainz et al., NAACL 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.naacl-demo.4.pdf
Code
 bbn-e/zs4ie +  additional community code
Data
ANLIFEVERMultiNLISNLI