Learning Semantic Sentence Embeddings using Sequential Pair-wise Discriminator

Badri Narayana Patro, Vinod Kumar Kurmi, Sandeep Kumar, Vinay Namboodiri


Abstract
In this paper, we propose a method for obtaining sentence-level embeddings. While the problem of securing word-level embeddings is very well studied, we propose a novel method for obtaining sentence-level embeddings. This is obtained by a simple method in the context of solving the paraphrase generation task. If we use a sequential encoder-decoder model for generating paraphrase, we would like the generated paraphrase to be semantically close to the original sentence. One way to ensure this is by adding constraints for true paraphrase embeddings to be close and unrelated paraphrase candidate sentence embeddings to be far. This is ensured by using a sequential pair-wise discriminator that shares weights with the encoder that is trained with a suitable loss function. Our loss function penalizes paraphrase sentence embedding distances from being too large. This loss is used in combination with a sequential encoder-decoder network. We also validated our method by evaluating the obtained embeddings for a sentiment analysis task. The proposed method results in semantic embeddings and outperforms the state-of-the-art on the paraphrase generation and sentiment analysis task on standard datasets. These results are also shown to be statistically significant.
Anthology ID:
C18-1230
Erratum e1:
C18-1230e1
Volume:
Proceedings of the 27th International Conference on Computational Linguistics
Month:
August
Year:
2018
Address:
Santa Fe, New Mexico, USA
Editors:
Emily M. Bender, Leon Derczynski, Pierre Isabelle
Venue:
COLING
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2715–2729
Language:
URL:
https://aclanthology.org/C18-1230/
DOI:
Bibkey:
Cite (ACL):
Badri Narayana Patro, Vinod Kumar Kurmi, Sandeep Kumar, and Vinay Namboodiri. 2018. Learning Semantic Sentence Embeddings using Sequential Pair-wise Discriminator. In Proceedings of the 27th International Conference on Computational Linguistics, pages 2715–2729, Santa Fe, New Mexico, USA. Association for Computational Linguistics.
Cite (Informal):
Learning Semantic Sentence Embeddings using Sequential Pair-wise Discriminator (Patro et al., COLING 2018)
Copy Citation:
PDF:
https://aclanthology.org/C18-1230.pdf