ParsSimpleQA: The Persian Simple Question Answering Dataset and System over Knowledge Graph

Hamed Babaei Giglou, Niloufar Beyranvand, Reza Moradi, Amir Mohammad Salehoof, Saeed Bibak


Abstract
The simple question answering over the knowledge graph concerns answering single-relation questions by querying the facts in the knowledge graph. This task has drawn significant attention in recent years. However, there is a demand for a simple question dataset in the Persian language to study open-domain simple question answering. In this paper, we present the first Persian single-relation question answering dataset and a model that uses a knowledge graph as a source of knowledge to answer questions. We create the ParsSimpleQA dataset semi-automatically in two steps. First, we build single-relation question templates. Next, we automatically create simple questions and answers using templates, entities, and relations from Farsbase. To present the reliability of the presented dataset, we proposed a simple question-answering system that receives questions and uses deep learning and information retrieval techniques for answering questions. The experimental results presented in this paper show that the ParsSimpleQA dataset is very promising for the Persian simple question-answering task.
Anthology ID:
2022.nlp4dh-1.9
Volume:
Proceedings of the 2nd International Workshop on Natural Language Processing for Digital Humanities
Month:
November
Year:
2022
Address:
Taipei, Taiwan
Editors:
Mika Hämäläinen, Khalid Alnajjar, Niko Partanen, Jack Rueter
Venue:
NLP4DH
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
59–68
Language:
URL:
https://aclanthology.org/2022.nlp4dh-1.9
DOI:
Bibkey:
Cite (ACL):
Hamed Babaei Giglou, Niloufar Beyranvand, Reza Moradi, Amir Mohammad Salehoof, and Saeed Bibak. 2022. ParsSimpleQA: The Persian Simple Question Answering Dataset and System over Knowledge Graph. In Proceedings of the 2nd International Workshop on Natural Language Processing for Digital Humanities, pages 59–68, Taipei, Taiwan. Association for Computational Linguistics.
Cite (Informal):
ParsSimpleQA: The Persian Simple Question Answering Dataset and System over Knowledge Graph (Babaei Giglou et al., NLP4DH 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.nlp4dh-1.9.pdf
Dataset:
 2022.nlp4dh-1.9.Dataset.zip