@inproceedings{nguyen-etal-2021-combining,
title = "Combining Shallow and Deep Representations for Text-Pair Classification",
author = "Nguyen, Vincent and
Karimi, Sarvnaz and
Xing, Zhenchang",
editor = "Rahimi, Afshin and
Lane, William and
Zuccon, Guido",
booktitle = "Proceedings of the 19th Annual Workshop of the Australasian Language Technology Association",
month = dec,
year = "2021",
address = "Online",
publisher = "Australasian Language Technology Association",
url = "https://aclanthology.org/2021.alta-1.7/",
pages = "68--78",
abstract = "Text-pair classification is the task of determining the class relationship between two sentences. It is embedded in several tasks such as paraphrase identification and duplicate question detection. Contemporary methods use fine-tuned transformer encoder semantic representations of the classification token in the text-pair sequence from the transformer's final layer for class prediction. However, research has shown that earlier parts of the network learn shallow features, such as syntax and structure, which existing methods do not directly exploit. We propose a novel convolution-based decoder for transformer-based architecture that maximizes the use of encoder hidden features for text-pair classification. Our model exploits hidden representations within transformer-based architecture. It outperforms a transformer encoder baseline on average by 50{\%} (relative F1-score) on six datasets from the medical, software engineering, and open-domains. Our work shows that transformer-based models can improve text-pair classification by modifying the fine-tuning step to exploit shallow features while improving model generalization, with only a slight reduction in efficiency."
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
  <mods ID="nguyen-etal-2021-combining">
    <titleInfo>
      <title>Combining Shallow and Deep Representations for Text-Pair Classification</title>
    </titleInfo>
    <name type="personal">
      <namePart type="given">Vincent</namePart>
      <namePart type="family">Nguyen</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Sarvnaz</namePart>
      <namePart type="family">Karimi</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Zhenchang</namePart>
      <namePart type="family">Xing</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <originInfo>
      <dateIssued>2021-12</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
      <titleInfo>
        <title>Proceedings of the 19th Annual Workshop of the Australasian Language Technology Association</title>
      </titleInfo>
      <name type="personal">
        <namePart type="given">Afshin</namePart>
        <namePart type="family">Rahimi</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">William</namePart>
        <namePart type="family">Lane</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Guido</namePart>
        <namePart type="family">Zuccon</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <originInfo>
        <publisher>Australasian Language Technology Association</publisher>
        <place>
          <placeTerm type="text">Online</placeTerm>
        </place>
      </originInfo>
      <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>Text-pair classification is the task of determining the class relationship between two sentences. It is embedded in several tasks such as paraphrase identification and duplicate question detection. Contemporary methods use fine-tuned transformer encoder semantic representations of the classification token in the text-pair sequence from the transformer's final layer for class prediction. However, research has shown that earlier parts of the network learn shallow features, such as syntax and structure, which existing methods do not directly exploit. We propose a novel convolution-based decoder for transformer-based architecture that maximizes the use of encoder hidden features for text-pair classification. Our model exploits hidden representations within transformer-based architecture. It outperforms a transformer encoder baseline on average by 50% (relative F1-score) on six datasets from the medical, software engineering, and open-domains. Our work shows that transformer-based models can improve text-pair classification by modifying the fine-tuning step to exploit shallow features while improving model generalization, with only a slight reduction in efficiency.</abstract>
    <identifier type="citekey">nguyen-etal-2021-combining</identifier>
    <location>
      <url>https://aclanthology.org/2021.alta-1.7/</url>
    </location>
    <part>
      <date>2021-12</date>
      <extent unit="page">
        <start>68</start>
        <end>78</end>
      </extent>
    </part>
  </mods>
</modsCollection>
%0 Conference Proceedings
%T Combining Shallow and Deep Representations for Text-Pair Classification
%A Nguyen, Vincent
%A Karimi, Sarvnaz
%A Xing, Zhenchang
%Y Rahimi, Afshin
%Y Lane, William
%Y Zuccon, Guido
%S Proceedings of the 19th Annual Workshop of the Australasian Language Technology Association
%D 2021
%8 December
%I Australasian Language Technology Association
%C Online
%F nguyen-etal-2021-combining
%X Text-pair classification is the task of determining the class relationship between two sentences. It is embedded in several tasks such as paraphrase identification and duplicate question detection. Contemporary methods use fine-tuned transformer encoder semantic representations of the classification token in the text-pair sequence from the transformer's final layer for class prediction. However, research has shown that earlier parts of the network learn shallow features, such as syntax and structure, which existing methods do not directly exploit. We propose a novel convolution-based decoder for transformer-based architecture that maximizes the use of encoder hidden features for text-pair classification. Our model exploits hidden representations within transformer-based architecture. It outperforms a transformer encoder baseline on average by 50% (relative F1-score) on six datasets from the medical, software engineering, and open-domains. Our work shows that transformer-based models can improve text-pair classification by modifying the fine-tuning step to exploit shallow features while improving model generalization, with only a slight reduction in efficiency.
%U https://aclanthology.org/2021.alta-1.7/
%P 68-78
Markdown (Informal)
[Combining Shallow and Deep Representations for Text-Pair Classification](https://aclanthology.org/2021.alta-1.7/) (Nguyen et al., ALTA 2021)
ACL
Vincent Nguyen, Sarvnaz Karimi, and Zhenchang Xing. 2021. Combining Shallow and Deep Representations for Text-Pair Classification. In Proceedings of the 19th Annual Workshop of the Australasian Language Technology Association, pages 68–78, Online. Australasian Language Technology Association.
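The abstract describes, at a high level, fine-tuning an encoder with a convolution-based decoder over hidden states from all transformer layers, rather than classifying from the final layer's classification token alone. The sketch below is one illustrative reading of that idea, not the authors' implementation: the encoder checkpoint (`bert-base-uncased`), the kernel size, the mean pooling, and the linear head are all assumptions made here for the example.

```python
# Hypothetical sketch of classifying a text pair from every encoder layer's
# [CLS] state, mixed by a small 1-D convolution (not the paper's exact model).
import torch
import torch.nn as nn
from transformers import AutoModel, AutoTokenizer


class ConvDecoderClassifier(nn.Module):
    """Text-pair classifier combining shallow and deep encoder features."""

    def __init__(self, encoder_name="bert-base-uncased", num_classes=2):
        super().__init__()
        self.encoder = AutoModel.from_pretrained(encoder_name)
        hidden = self.encoder.config.hidden_size  # 768 for bert-base
        # Convolve across the layer axis so shallow (syntax/structure) and
        # deep (semantic) features are mixed before classification.
        self.conv = nn.Conv1d(hidden, hidden, kernel_size=3, padding=1)
        self.classifier = nn.Linear(hidden, num_classes)

    def forward(self, input_ids, attention_mask, token_type_ids=None):
        out = self.encoder(
            input_ids=input_ids,
            attention_mask=attention_mask,
            token_type_ids=token_type_ids,
            output_hidden_states=True,  # expose all layers, not just the last
        )
        # out.hidden_states holds (num_layers + 1) tensors of shape
        # (batch, seq_len, hidden); keep the [CLS] position from each layer
        # and stack to (batch, hidden, num_layers + 1).
        cls_stack = torch.stack([h[:, 0] for h in out.hidden_states], dim=2)
        feats = torch.relu(self.conv(cls_stack)).mean(dim=2)  # pool over layers
        return self.classifier(feats)


tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = ConvDecoderClassifier()
# Encode the text pair as one sequence, as in standard BERT-style fine-tuning.
batch = tokenizer("How do I sort a list?", "What sorts a list in Python?",
                  return_tensors="pt")
logits = model(**batch)
print(logits.shape)  # torch.Size([1, 2])
```

Note that only the fine-tuning head changes; the encoder itself is untouched. Materializing every layer's hidden states costs some extra memory and compute, which is consistent with the abstract's remark about "only a slight reduction in efficiency."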