D-Terminer: Online Demo for Monolingual and Bilingual Automatic Term Extraction

Ayla Rigouts Terryn, Veronique Hoste, Els Lefever


Abstract
This contribution presents D-Terminer: an open access, online demo for monolingual and multilingual automatic term extraction from parallel corpora. The monolingual term extraction is based on a recurrent neural network, with a supervised methodology that relies on pretrained embeddings. Candidate terms can be tagged in their original context and there is no need for a large corpus, as the methodology will work even for single sentences. With the bilingual term extraction from parallel corpora, potentially equivalent candidate term pairs are extracted from translation memories and manual annotation of the results shows that good equivalents are found for most candidate terms. Accompanying the release of the demo is an updated version of the ACTER Annotated Corpora for Term Extraction Research (version 1.5).
Anthology ID:
2022.term-1.7
Volume:
Proceedings of the Workshop on Terminology in the 21st century: many faces, many places
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Rute Costa, Sara Carvalho, Ana Ostroški Anić, Anas Fahad Khan
Venue:
TERM
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
33–40
Language:
URL:
https://aclanthology.org/2022.term-1.7
DOI:
Bibkey:
Cite (ACL):
Ayla Rigouts Terryn, Veronique Hoste, and Els Lefever. 2022. D-Terminer: Online Demo for Monolingual and Bilingual Automatic Term Extraction. In Proceedings of the Workshop on Terminology in the 21st century: many faces, many places, pages 33–40, Marseille, France. European Language Resources Association.
Cite (Informal):
D-Terminer: Online Demo for Monolingual and Bilingual Automatic Term Extraction (Rigouts Terryn et al., TERM 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.term-1.7.pdf