The Simplification of the Language of Public Administration: The Case of Ombudsman Institutions

Gabriel Gonzalez-Delgado, Borja Navarro-Colorado


Abstract
Language produced by Public Administrations has crucial implications in citizens’ lives. However, its syntactic complexity and the use of legal jargon, among other factors, make it difficult to be understood for laypeople and certain target audiences. The NLP task of Automatic Text Simplification (ATS) can help to the necessary simplification of this technical language. For that purpose, specialized parallel datasets of complex-simple pairs need to be developed for the training of these ATS systems. In this position paper, an on-going project is presented, whose main objectives are (a) to extensively analyze the syntactical, lexical, and discursive features of the language of English-speaking ombudsmen, as samples of public administrative language, with special attention to those characteristics that pose a threat to comprehension, and (b) to develop the OmbudsCorpus, a parallel corpus of complex-simple supra-sentential fragments from ombudsmen’s case reports that have been manually simplified by professionals and annotated with standardized simplification operations. This research endeavor aims to provide a deeper understanding of the simplification process and to enhance the training of ATS systems specialized in administrative texts.
Anthology ID:
2024.determit-1.12
Volume:
Proceedings of the Workshop on DeTermIt! Evaluating Text Difficulty in a Multilingual Context @ LREC-COLING 2024
Month:
May
Year:
2024
Address:
Torino, Italia
Editors:
Giorgio Maria Di Nunzio, Federica Vezzani, Liana Ermakova, Hosein Azarbonyad, Jaap Kamps
Venues:
DeTermIt | WS
SIG:
Publisher:
ELRA and ICCL
Note:
Pages:
125–133
Language:
URL:
https://aclanthology.org/2024.determit-1.12
DOI:
Bibkey:
Cite (ACL):
Gabriel Gonzalez-Delgado and Borja Navarro-Colorado. 2024. The Simplification of the Language of Public Administration: The Case of Ombudsman Institutions. In Proceedings of the Workshop on DeTermIt! Evaluating Text Difficulty in a Multilingual Context @ LREC-COLING 2024, pages 125–133, Torino, Italia. ELRA and ICCL.
Cite (Informal):
The Simplification of the Language of Public Administration: The Case of Ombudsman Institutions (Gonzalez-Delgado & Navarro-Colorado, DeTermIt-WS 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.determit-1.12.pdf