An Adaptable Lexical Simplification Architecture for Major Ibero-Romance Languages

Daniel Ferrés, Horacio Saggion, Xavier Gómez Guinovart


Abstract
Lexical Simplification is the task of reducing the lexical complexity of textual documents by replacing difficult words with easier to read (or understand) expressions while preserving the original meaning. The development of robust pipelined multilingual architectures able to adapt to new languages is of paramount importance in lexical simplification. This paper describes and evaluates a modular hybrid linguistic-statistical Lexical Simplifier that deals with the four major Ibero-Romance Languages: Spanish, Portuguese, Catalan, and Galician. The architecture of the system is the same for the four languages addressed, only the language resources used during simplification are language specific.
Anthology ID:
W17-5406
Volume:
Proceedings of the First Workshop on Building Linguistically Generalizable NLP Systems
Month:
September
Year:
2017
Address:
Copenhagen, Denmark
Editors:
Emily Bender, Hal Daumé III, Allyson Ettinger, Sudha Rao
Venue:
WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
40–47
Language:
URL:
https://aclanthology.org/W17-5406
DOI:
10.18653/v1/W17-5406
Bibkey:
Cite (ACL):
Daniel Ferrés, Horacio Saggion, and Xavier Gómez Guinovart. 2017. An Adaptable Lexical Simplification Architecture for Major Ibero-Romance Languages. In Proceedings of the First Workshop on Building Linguistically Generalizable NLP Systems, pages 40–47, Copenhagen, Denmark. Association for Computational Linguistics.
Cite (Informal):
An Adaptable Lexical Simplification Architecture for Major Ibero-Romance Languages (Ferrés et al., 2017)
Copy Citation:
PDF:
https://aclanthology.org/W17-5406.pdf