Measurement Extraction with Natural Language Processing: A Review

Jan Göpfert, Patrick Kuckertz, Jann Weinand, Leander Kotzur, Detlef Stolten


Abstract
Quantitative data is important in many domains. Information extraction methods draw structured data from documents. However, the extraction of quantities and their contexts has received little attention in the history of information extraction. In this review, an overview of prior work on measurement extraction is presented. We describe different approaches to measurement extraction and outline the challenges posed by this task. The review concludes with an outline of potential future research. Research strains in measurement extraction tend to be isolated and lack a common terminology. Improvements in numerical reasoning, more extensive datasets, and the consideration of wider contexts may lead to significant improvements in measurement extraction.
Anthology ID:
2022.findings-emnlp.161
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2022
Month:
December
Year:
2022
Address:
Abu Dhabi, United Arab Emirates
Editors:
Yoav Goldberg, Zornitsa Kozareva, Yue Zhang
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2191–2215
Language:
URL:
https://aclanthology.org/2022.findings-emnlp.161
DOI:
10.18653/v1/2022.findings-emnlp.161
Bibkey:
Cite (ACL):
Jan Göpfert, Patrick Kuckertz, Jann Weinand, Leander Kotzur, and Detlef Stolten. 2022. Measurement Extraction with Natural Language Processing: A Review. In Findings of the Association for Computational Linguistics: EMNLP 2022, pages 2191–2215, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics.
Cite (Informal):
Measurement Extraction with Natural Language Processing: A Review (Göpfert et al., Findings 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.findings-emnlp.161.pdf