TextGraphs 2022 Shared Task on Natural Language Premise Selection

Marco Valentino, Deborah Ferreira, Mokanarangan Thayaparan, André Freitas, Dmitry Ustalov


Abstract
The Shared Task on Natural Language Premise Selection (NLPS) asks participants to retrieve the set of premises that are most likely to be useful for proving a given mathematical statement from a supporting knowledge base. While previous editions of the TextGraphs shared tasks series targeted multi-hop inference for explanation regeneration in the context of science questions (Thayaparan et al., 2021; Jansen and Ustalov, 2020, 2019), NLPS aims to assess the ability of state-of-the-art approaches to operate on a mixture of natural and mathematical language and model complex multi-hop reasoning dependencies between statements. To this end, this edition of the shared task makes use of a large set of approximately 21k mathematical statements extracted from the PS-ProofWiki dataset (Ferreira and Freitas, 2020a). In this summary paper, we present the results of the 1st edition of the NLPS task, providing a description of the evaluation data, and the participating systems. Additionally, we perform a detailed analysis of the results, evaluating various aspects involved in mathematical language processing and multi-hop inference. The best-performing system achieved a MAP of 15.39, improving the performance of a TF-IDF baseline by approximately 3.0 MAP.
Anthology ID:
2022.textgraphs-1.11
Volume:
Proceedings of TextGraphs-16: Graph-based Methods for Natural Language Processing
Month:
October
Year:
2022
Address:
Gyeongju, Republic of Korea
Editors:
Dmitry Ustalov, Yanjun Gao, Alexander Panchenko, Marco Valentino, Mokanarangan Thayaparan, Thien Huu Nguyen, Gerald Penn, Arti Ramesh, Abhik Jana
Venue:
TextGraphs
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
105–113
Language:
URL:
https://aclanthology.org/2022.textgraphs-1.11
DOI:
Bibkey:
Cite (ACL):
Marco Valentino, Deborah Ferreira, Mokanarangan Thayaparan, André Freitas, and Dmitry Ustalov. 2022. TextGraphs 2022 Shared Task on Natural Language Premise Selection. In Proceedings of TextGraphs-16: Graph-based Methods for Natural Language Processing, pages 105–113, Gyeongju, Republic of Korea. Association for Computational Linguistics.
Cite (Informal):
TextGraphs 2022 Shared Task on Natural Language Premise Selection (Valentino et al., TextGraphs 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.textgraphs-1.11.pdf
Code
 ai-systems/tg2022task_premise_retrieval