Supervised Neural Clustering via Latent Structured Output Learning: Application to Question Intents

Iryna Haponchyk, Alessandro Moschitti


Abstract
Previous pre-neural work on structured prediction has produced very effective supervised clustering algorithms using linear classifiers, e.g., structured SVM or perceptron. However, these cannot exploit the representation learning ability of neural networks, which would make supervised clustering even more powerful, i.e., general clustering patterns can be learned automatically. In this paper, we design neural networks based on latent structured prediction loss and Transformer models to approach supervised clustering. We tested our methods on the task of automatically recreating categories of intents from publicly available question intent corpora. The results show that our approach delivers 95.65% of F1, outperforming the state of the art by 17.24%.
Anthology ID:
2021.naacl-main.263
Volume:
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Month:
June
Year:
2021
Address:
Online
Editors:
Kristina Toutanova, Anna Rumshisky, Luke Zettlemoyer, Dilek Hakkani-Tur, Iz Beltagy, Steven Bethard, Ryan Cotterell, Tanmoy Chakraborty, Yichao Zhou
Venue:
NAACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
3364–3374
Language:
URL:
https://aclanthology.org/2021.naacl-main.263
DOI:
10.18653/v1/2021.naacl-main.263
Bibkey:
Cite (ACL):
Iryna Haponchyk and Alessandro Moschitti. 2021. Supervised Neural Clustering via Latent Structured Output Learning: Application to Question Intents. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 3364–3374, Online. Association for Computational Linguistics.
Cite (Informal):
Supervised Neural Clustering via Latent Structured Output Learning: Application to Question Intents (Haponchyk & Moschitti, NAACL 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.naacl-main.263.pdf
Video:
 https://aclanthology.org/2021.naacl-main.263.mp4
Code
 ikernels/intent-qa