Using Sentence-level Classification Helps Entity Extraction from Material Science Literature

Ankan Mullick, Shubhraneel Pal, Tapas Nayak, Seung-Cheol Lee, Satadeep Bhattacharjee, Pawan Goyal


Abstract
In the last few years, several attempts have been made on extracting information from material science research domain. Material Science research articles are a rich source of information about various entities related to material science such as names of the materials used for experiments, the computational software used along with its parameters, the method used in the experiments, etc. But the distribution of these entities is not uniform across different sections of research articles. Most of the sentences in the research articles do not contain any entity. In this work, we first use a sentence-level classifier to identify sentences containing at least one entity mention. Next, we apply the information extraction models only on the filtered sentences, to extract various entities of interest. Our experiments for named entity recognition in the material science research articles show that this additional sentence-level classification step helps to improve the F1 score by more than 4%.
Anthology ID:
2022.lrec-1.483
Volume:
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
4540–4545
Language:
URL:
https://aclanthology.org/2022.lrec-1.483
DOI:
Bibkey:
Cite (ACL):
Ankan Mullick, Shubhraneel Pal, Tapas Nayak, Seung-Cheol Lee, Satadeep Bhattacharjee, and Pawan Goyal. 2022. Using Sentence-level Classification Helps Entity Extraction from Material Science Literature. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 4540–4545, Marseille, France. European Language Resources Association.
Cite (Informal):
Using Sentence-level Classification Helps Entity Extraction from Material Science Literature (Mullick et al., LREC 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.lrec-1.483.pdf