A Text Embedding Model with Contrastive Example Mining for Point-of-Interest Geocoding

Hibiki Nakatani, Hiroki Teranishi, Shohei Higashiyama, Yuya Sawada, Hiroki Ouchi, Taro Watanabe


Abstract
Geocoding is a fundamental technique that links location mentions to their geographic positions, which is important for understanding texts in terms of where the described events occurred. Unlike most geocoding studies that targeted coarse-grained locations, we focus on geocoding at a fine-grained point-of-interest (POI) level. To address the challenge of finding appropriate geo-database entries from among many candidates with similar POI names, we develop a text embedding-based geocoding model and investigate (1) entry encoding representations and (2) hard negative mining approaches suitable for enhancing the model’s disambiguation ability. Our experiments show that the second factor significantly impact the geocoding accuracy of the model.
Anthology ID:
2025.coling-main.486
Volume:
Proceedings of the 31st International Conference on Computational Linguistics
Month:
January
Year:
2025
Address:
Abu Dhabi, UAE
Editors:
Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert
Venue:
COLING
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
7279–7291
Language:
URL:
https://aclanthology.org/2025.coling-main.486/
DOI:
Bibkey:
Cite (ACL):
Hibiki Nakatani, Hiroki Teranishi, Shohei Higashiyama, Yuya Sawada, Hiroki Ouchi, and Taro Watanabe. 2025. A Text Embedding Model with Contrastive Example Mining for Point-of-Interest Geocoding. In Proceedings of the 31st International Conference on Computational Linguistics, pages 7279–7291, Abu Dhabi, UAE. Association for Computational Linguistics.
Cite (Informal):
A Text Embedding Model with Contrastive Example Mining for Point-of-Interest Geocoding (Nakatani et al., COLING 2025)
Copy Citation:
PDF:
https://aclanthology.org/2025.coling-main.486.pdf