A Novel Methodology for Enhancing Cross-language and Domain Adaptability in Temporal Expression Normalization

Alejandro Sánchez de Castro; Lourdes Araujo; Juan Martinez-Romo

doi:10.1162/coli.a.12

A Novel Methodology for Enhancing Cross-language and Domain Adaptability in Temporal Expression Normalization

Alejandro Sánchez de Castro, Lourdes Araujo, Juan Martinez-Romo

Abstract

Accurate temporal expression normalization, the process of assigning a numerical value to a temporal expression, is essential for tasks such as timeline creation and temporal reasoning. While rule-based normalization systems are limited in adaptability across different domains and languages, deep-learning solutions in this area have not been extensively explored. An additional challenge is the scarcity of manually annotated corpora with temporal annotations. To address the adaptability limitations of current systems, we propose a highly adaptable methodology that can be applied to multiple domains and languages. This can be achieved by leveraging a multilingual Pre-trained Language Model (PTLM) with a fill-mask architecture, using a Value Intermediate Representation (VIR) where the temporal expression value format is adjusted to the fill-mask representation. Our approach involves a two-phase training process. Initially, the model is trained with a novel masking policy on a large English biomedical corpus that is automatically annotated with normalized temporal expressions, along with a complementary hand-crafted temporal expressions corpus. This addresses the lack of manually annotated data and helps to achieve sufficient capacity for adaptation to diverse domains or languages. In the second phase, we show how the model can be tailored to different domains and languages using various techniques, showcasing the versatility of the proposed methodology. This approach significantly outperforms existing systems.

Anthology ID:: 2025.cl-4.7
Volume:: Computational Linguistics, Volume 51, Issue 4 - December 2025
Month:: December
Year:: 2025
Address:: Cambridge, MA
Venue:: CL
SIG:
Publisher:: MIT Press
Note:
Pages:: 1303–1335
Language:
URL:: https://aclanthology.org/2025.cl-4.7/
DOI:: 10.1162/coli.a.12
Bibkey:
Cite (ACL):: Alejandro Sánchez de Castro, Lourdes Araujo, and Juan Martinez-Romo. 2025. A Novel Methodology for Enhancing Cross-language and Domain Adaptability in Temporal Expression Normalization. Computational Linguistics, 51(4):1303–1335.
Cite (Informal):: A Novel Methodology for Enhancing Cross-language and Domain Adaptability in Temporal Expression Normalization (de Castro et al., CL 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.cl-4.7.pdf

PDF Cite Search Fix data