Generation of named entities

Marisa Jiménez


Abstract
In this paper we present an overview of an approach developed at Microsoft Research to generate strings for named entities such as places and dates. This approach uses abstract representations as input. We first provide an overview of our system for identifying named entities in text. Next, we present our approach to generating these entities from abstract representations, known as “logical forms” in our system. We then focus on the generation of place names in Spanish. We discuss our technique for generating Spanish place names from a logical form in which language-specific features, such as word order or capitalization conventions, do not exist. Finally, we present the details of a study that we carried out to help us make sound linguistic decisions in the generation of place names in Spanish.
Anthology ID:
2001.mtsummit-papers.32
Volume:
Proceedings of Machine Translation Summit VIII
Month:
September 18-22
Year:
2001
Address:
Santiago de Compostela, Spain
Editor:
Bente Maegaard
Venue:
MTSummit
URL:
https://aclanthology.org/2001.mtsummit-papers.32
Cite (ACL):
Marisa Jiménez. 2001. Generation of named entities. In Proceedings of Machine Translation Summit VIII, Santiago de Compostela, Spain.
Cite (Informal):
Generation of named entities (Jiménez, MTSummit 2001)
PDF:
https://aclanthology.org/2001.mtsummit-papers.32.pdf