Generating titles for millions of browse pages on an e-Commerce site

Prashant Mathur, Nicola Ueffing, Gregor Leusch


Abstract
We present two approaches to generate titles for browse pages in five different languages, namely English, German, French, Italian and Spanish. These browse pages are structured search pages in an e-commerce domain. We first present a rule-based approach to generate these browse page titles. In addition, we also present a hybrid approach which uses a phrase-based statistical machine translation engine on top of the rule-based system to assemble the best title. For the two languages English and German we have access to a large amount of already available rule-based generated and curated titles. For these languages we present an automatic post-editing approach which learns how to post-edit the rule-based titles into curated titles.
Anthology ID:
W17-3525
Volume:
Proceedings of the 10th International Conference on Natural Language Generation
Month:
September
Year:
2017
Address:
Santiago de Compostela, Spain
Venue:
INLG
SIG:
SIGGEN
Publisher:
Association for Computational Linguistics
Note:
Pages:
158–167
Language:
URL:
https://aclanthology.org/W17-3525
DOI:
10.18653/v1/W17-3525
Bibkey:
Cite (ACL):
Prashant Mathur, Nicola Ueffing, and Gregor Leusch. 2017. Generating titles for millions of browse pages on an e-Commerce site. In Proceedings of the 10th International Conference on Natural Language Generation, pages 158–167, Santiago de Compostela, Spain. Association for Computational Linguistics.
Cite (Informal):
Generating titles for millions of browse pages on an e-Commerce site (Mathur et al., INLG 2017)
Copy Citation:
PDF:
https://aclanthology.org/W17-3525.pdf