INNOVATORS at SemEval-2021 Task-11: A Dependency Parsing and BERT-based model for Extracting Contribution Knowledge from Scientific Papers

Hardik Arora; Tirthankar Ghosal; Sandeep Kumar; Suraj Patwal; Phil Gooch

doi:10.18653/v1/2021.semeval-1.61

INNOVATORS at SemEval-2021 Task-11: A Dependency Parsing and BERT-based model for Extracting Contribution Knowledge from Scientific Papers

Hardik Arora, Tirthankar Ghosal, Sandeep Kumar, Suraj Patwal, Phil Gooch

Abstract

In this work, we describe our system submission to the SemEval 2021 Task 11: NLP Contribution Graph Challenge. We attempt all the three sub-tasks in the challenge and report our results. Subtask 1 aims to identify the contributing sentences in a given publication. Subtask 2 follows from Subtask 1 to extract the scientific term and predicate phrases from the identified contributing sentences. The final Subtask 3 entails extracting triples (subject, predicate, object) from the phrases and categorizing them under one or more defined information units. With the NLPContributionGraph Shared Task, the organizers formalized the building of a scholarly contributions-focused graph over NLP scholarly articles as an automated task. Our approaches include a BERT-based classification model for identifying the contributing sentences in a research publication, a rule-based dependency parsing for phrase extraction, followed by a CNN-based model for information units classification, and a set of rules for triples extraction. The quantitative results show that we obtain the 5th, 5th, and 7th rank respectively in three evaluation phases. We make our codes available at https://github.com/HardikArora17/SemEval-2021-INNOVATORS.

Anthology ID:: 2021.semeval-1.61
Volume:: Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021)
Month:: August
Year:: 2021
Address:: Online
Editors:: Alexis Palmer, Nathan Schneider, Natalie Schluter, Guy Emerson, Aurelie Herbelot, Xiaodan Zhu
Venue:: SemEval
SIG:: SIGLEX
Publisher:: Association for Computational Linguistics
Note:
Pages:: 502–510
Language:
URL:: https://aclanthology.org/2021.semeval-1.61/
DOI:: 10.18653/v1/2021.semeval-1.61
Bibkey:
Cite (ACL):: Hardik Arora, Tirthankar Ghosal, Sandeep Kumar, Suraj Patwal, and Phil Gooch. 2021. INNOVATORS at SemEval-2021 Task-11: A Dependency Parsing and BERT-based model for Extracting Contribution Knowledge from Scientific Papers. In Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021), pages 502–510, Online. Association for Computational Linguistics.
Cite (Informal):: INNOVATORS at SemEval-2021 Task-11: A Dependency Parsing and BERT-based model for Extracting Contribution Knowledge from Scientific Papers (Arora et al., SemEval 2021)
Copy Citation:
PDF:: https://aclanthology.org/2021.semeval-1.61.pdf

PDF Cite Search Fix data