Geo-Spatially Informed Models for Geocoding Unstructured Addresses

Uddeshya Singh, Devanapalli Ravi Shankar, Gowtham Bellala, Vikas Goel


Abstract
Geocoding customer addresses and determining precise locations is a crucial component for any e-commerce company. Shipment delivery costs make up a significant portion of overall expenses, and having exact customer locations not only improves operational efficiency but also reduces costs and enhances the customer experience. While state-of-the-art geocoding systems are well-suited for developed countries with structured city layouts and high-quality reference corpora, they are less effective in developing countries like India, where addresses are highly unstructured and reliable reference data is scarce. Recent research has focused on creating geocoding systems tailored for developing nations such as India. In this work, we propose a method to geocode addresses in such environments. We explored various approaches to incorporate geo-spatial relationships using an LLM backbone, which provided insights into how the model learns these relationships both explicitly and implicitly. Our proposed approach outperforms the current state-of-the-art system by 20% in drift accuracy within 100 meters, and the state-of-the-art commercial system by 54%. This has a potential to reduce the incorrect delivery hub assignments by 8% which leads to significant customer experience improvements and business savings.
Anthology ID:
2025.coling-industry.19
Volume:
Proceedings of the 31st International Conference on Computational Linguistics: Industry Track
Month:
January
Year:
2025
Address:
Abu Dhabi, UAE
Editors:
Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert, Kareem Darwish, Apoorv Agarwal
Venue:
COLING
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
236–242
Language:
URL:
https://aclanthology.org/2025.coling-industry.19/
DOI:
Bibkey:
Cite (ACL):
Uddeshya Singh, Devanapalli Ravi Shankar, Gowtham Bellala, and Vikas Goel. 2025. Geo-Spatially Informed Models for Geocoding Unstructured Addresses. In Proceedings of the 31st International Conference on Computational Linguistics: Industry Track, pages 236–242, Abu Dhabi, UAE. Association for Computational Linguistics.
Cite (Informal):
Geo-Spatially Informed Models for Geocoding Unstructured Addresses (Singh et al., COLING 2025)
Copy Citation:
PDF:
https://aclanthology.org/2025.coling-industry.19.pdf