New Frontiers of Information Extraction

Muhao Chen, Lifu Huang, Manling Li, Ben Zhou, Heng Ji, Dan Roth


Abstract
This tutorial targets researchers and practitioners who are interested in AI and ML technologies for structural information extraction (IE) from unstructured textual sources. Particularly, this tutorial will provide audience with a systematic introduction to recent advances of IE, by answering several important research questions. These questions include (i) how to develop an robust IE system from noisy, insufficient training data, while ensuring the reliability of its prediction? (ii) how to foster the generalizability of IE through enhancing the system’s cross-lingual, cross-domain, cross-task and cross-modal transferability? (iii) how to precisely support extracting structural information with extremely fine-grained, diverse and boundless labels? (iv) how to further improve IE by leveraging indirect supervision from other NLP tasks, such as NLI, QA or summarization, and pre-trained language models? (v) how to acquire knowledge to guide the inference of IE systems? We will discuss several lines of frontier research that tackle those challenges, and will conclude the tutorial by outlining directions for further investigation.
Anthology ID:
2022.naacl-tutorials.3
Volume:
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Tutorial Abstracts
Month:
July
Year:
2022
Address:
Seattle, United States
Editors:
Miguel Ballesteros, Yulia Tsvetkov, Cecilia O. Alm
Venue:
NAACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
14–25
Language:
URL:
https://aclanthology.org/2022.naacl-tutorials.3
DOI:
10.18653/v1/2022.naacl-tutorials.3
Bibkey:
Cite (ACL):
Muhao Chen, Lifu Huang, Manling Li, Ben Zhou, Heng Ji, and Dan Roth. 2022. New Frontiers of Information Extraction. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Tutorial Abstracts, pages 14–25, Seattle, United States. Association for Computational Linguistics.
Cite (Informal):
New Frontiers of Information Extraction (Chen et al., NAACL 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.naacl-tutorials.3.pdf
Data
CoNLL++