Structural Contrastive Representation Learning for Zero-shot Multi-label Text Classification

Tianyi Zhang, Zhaozhuo Xu, Tharun Medini, Anshumali Shrivastava


Abstract
Zero-shot multi-label text classification (ZMTC) is a fundamental task in natural language processing with applications in the cold start problem of recommendation systems. Ideally, one would learn an expressive representation of both input text and label features so that ZMTC is transformed into a nearest neighbor search problem. However, the existing representation learning approaches for ZMTC struggle with accuracy as well as poor training efficiency. Firstly, the input text is structural, consisting of both short title sentences and long content paragraphs. It is challenging to model the correlation between short label descriptions and long structural input documents. Secondly, the enormous label space in ZMTC forces the existing approaches to perform multi-stage learning with label engineering. As a result, the training overhead is significant. In this paper, we address both problems by introducing an end-to-end structural contrastive representation learning approach. We propose a randomized text segmentation (RTS) technique to generate high-quality contrastive pairs. This RTS technique allows us to model title-content correlation. Additionally, we simplify the multi-stage ZMTC learning strategy by avoiding label engineering. Extensive experiments demonstrate that our approach leads to up to 2.33% improvement in precision@1 and 5.94x speedup in training time on publicly available datasets. Our code is available publicly.
Anthology ID:
2022.findings-emnlp.362
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2022
Month:
December
Year:
2022
Address:
Abu Dhabi, United Arab Emirates
Editors:
Yoav Goldberg, Zornitsa Kozareva, Yue Zhang
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
4937–4947
Language:
URL:
https://aclanthology.org/2022.findings-emnlp.362
DOI:
10.18653/v1/2022.findings-emnlp.362
Bibkey:
Cite (ACL):
Tianyi Zhang, Zhaozhuo Xu, Tharun Medini, and Anshumali Shrivastava. 2022. Structural Contrastive Representation Learning for Zero-shot Multi-label Text Classification. In Findings of the Association for Computational Linguistics: EMNLP 2022, pages 4937–4947, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics.
Cite (Informal):
Structural Contrastive Representation Learning for Zero-shot Multi-label Text Classification (Zhang et al., Findings 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.findings-emnlp.362.pdf