The Effect of Data Encoding on Relation Triplet Identification

Steinunn Friðriksdóttir, Hafsteinn Einarsson


Abstract
This paper presents a novel method for creating relation extraction data for low-resource languages. Relation extraction (RE) is a task in natural language processing that involves identifying and extracting meaningful relationships between entities in text. Despite the increasing need to extract relationships from unstructured text, the limited availability of annotated data in low-resource languages presents a significant challenge to the development of high-quality relation extraction models. Our method leverages existing methods for high-resource languages to create training data for low-resource languages. The proposed method is simple, efficient and has the potential to significantly improve the performance of relation extraction models for low-resource languages, making it a promising avenue for future research.
Anthology ID:
2023.nodalida-1.50
Volume:
Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)
Month:
May
Year:
2023
Address:
Tórshavn, Faroe Islands
Editors:
Tanel Alumäe, Mark Fishel
Venue:
NoDaLiDa
SIG:
Publisher:
University of Tartu Library
Note:
Pages:
500–507
Language:
URL:
https://aclanthology.org/2023.nodalida-1.50
DOI:
Bibkey:
Cite (ACL):
Steinunn Friðriksdóttir and Hafsteinn Einarsson. 2023. The Effect of Data Encoding on Relation Triplet Identification. In Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa), pages 500–507, Tórshavn, Faroe Islands. University of Tartu Library.
Cite (Informal):
The Effect of Data Encoding on Relation Triplet Identification (Friðriksdóttir & Einarsson, NoDaLiDa 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.nodalida-1.50.pdf