@inproceedings{dourado-sa-etal-2022-enhancing,
title = "Enhancing Geocoding of Adjectival Toponyms With Heuristics",
author = "Dourado S{\'a}, Breno and
Coelho da Silva, Ticiana and
Fernandes de Macedo, Jose Antonio",
editor = "Afli, Haithem and
Alam, Mehwish and
Bouamor, Houda and
Casagran, Cristina Blasi and
Boland, Colleen and
Ghannay, Sahar",
booktitle = "Proceedings of the LREC 2022 workshop on Natural Language Processing for Political Sciences",
month = jun,
year = "2022",
address = "Marseille, France",
publisher = "European Language Resources Association",
url = "https://aclanthology.org/2022.politicalnlp-1.6/",
pages = "37--45",
abstract = "Unstructured text documents such as news and blogs often present references to places. Those references, called toponyms, can be used in various applications like disaster warning and touristic planning. However, obtaining the correct coordinates for toponyms, called geocoding, is not easy since it`s common for places to have the same name as other locations. The process becomes even more challenging when toponyms appear in adjectival form, as they are different from the place`s actual name. This paper addresses the geocoding task and aims to improve, through a heuristic approach, the process for adjectival toponyms. So first, a baseline geocoder is defined through experimenting with a set of heuristics. After that, the baseline is enhanced by adding a normalization step to map adjectival toponyms to their noun form at the beginning of the geocoding process. The results show improved performance for the enhanced geocoder compared to the baseline and other geocoders."
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="dourado-sa-etal-2022-enhancing">
<titleInfo>
<title>Enhancing Geocoding of Adjectival Toponyms With Heuristics</title>
</titleInfo>
<name type="personal">
<namePart type="given">Breno</namePart>
<namePart type="family">Dourado Sá</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Ticiana</namePart>
<namePart type="family">Coelho da Silva</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jose</namePart>
<namePart type="given">Antonio</namePart>
<namePart type="family">Fernandes de Macedo</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2022-06</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Proceedings of the LREC 2022 workshop on Natural Language Processing for Political Sciences</title>
</titleInfo>
<name type="personal">
<namePart type="given">Haithem</namePart>
<namePart type="family">Afli</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Mehwish</namePart>
<namePart type="family">Alam</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Houda</namePart>
<namePart type="family">Bouamor</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Cristina</namePart>
<namePart type="given">Blasi</namePart>
<namePart type="family">Casagran</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Colleen</namePart>
<namePart type="family">Boland</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Sahar</namePart>
<namePart type="family">Ghannay</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>European Language Resources Association</publisher>
<place>
<placeTerm type="text">Marseille, France</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
</relatedItem>
<abstract>Unstructured text documents such as news and blogs often present references to places. Those references, called toponyms, can be used in various applications like disaster warning and touristic planning. However, obtaining the correct coordinates for toponyms, called geocoding, is not easy since it‘s common for places to have the same name as other locations. The process becomes even more challenging when toponyms appear in adjectival form, as they are different from the place‘s actual name. This paper addresses the geocoding task and aims to improve, through a heuristic approach, the process for adjectival toponyms. So first, a baseline geocoder is defined through experimenting with a set of heuristics. After that, the baseline is enhanced by adding a normalization step to map adjectival toponyms to their noun form at the beginning of the geocoding process. The results show improved performance for the enhanced geocoder compared to the baseline and other geocoders.</abstract>
<identifier type="citekey">dourado-sa-etal-2022-enhancing</identifier>
<location>
<url>https://aclanthology.org/2022.politicalnlp-1.6/</url>
</location>
<part>
<date>2022-06</date>
<extent unit="page">
<start>37</start>
<end>45</end>
</extent>
</part>
</mods>
</modsCollection>
%0 Conference Proceedings
%T Enhancing Geocoding of Adjectival Toponyms With Heuristics
%A Dourado Sá, Breno
%A Coelho da Silva, Ticiana
%A Fernandes de Macedo, Jose Antonio
%Y Afli, Haithem
%Y Alam, Mehwish
%Y Bouamor, Houda
%Y Casagran, Cristina Blasi
%Y Boland, Colleen
%Y Ghannay, Sahar
%S Proceedings of the LREC 2022 workshop on Natural Language Processing for Political Sciences
%D 2022
%8 June
%I European Language Resources Association
%C Marseille, France
%F dourado-sa-etal-2022-enhancing
%X Unstructured text documents such as news and blogs often present references to places. Those references, called toponyms, can be used in various applications like disaster warning and touristic planning. However, obtaining the correct coordinates for toponyms, called geocoding, is not easy since it‘s common for places to have the same name as other locations. The process becomes even more challenging when toponyms appear in adjectival form, as they are different from the place‘s actual name. This paper addresses the geocoding task and aims to improve, through a heuristic approach, the process for adjectival toponyms. So first, a baseline geocoder is defined through experimenting with a set of heuristics. After that, the baseline is enhanced by adding a normalization step to map adjectival toponyms to their noun form at the beginning of the geocoding process. The results show improved performance for the enhanced geocoder compared to the baseline and other geocoders.
%U https://aclanthology.org/2022.politicalnlp-1.6/
%P 37-45
Markdown (Informal)
[Enhancing Geocoding of Adjectival Toponyms With Heuristics](https://aclanthology.org/2022.politicalnlp-1.6/) (Dourado Sá et al., PoliticalNLP 2022)
ACL
- Breno Dourado Sá, Ticiana Coelho da Silva, and Jose Antonio Fernandes de Macedo. 2022. Enhancing Geocoding of Adjectival Toponyms With Heuristics. In Proceedings of the LREC 2022 workshop on Natural Language Processing for Political Sciences, pages 37–45, Marseille, France. European Language Resources Association.