Semi-supervised Deep Embedded Clustering with Anomaly Detection for Semantic Frame Induction

Zheng Xin Yong, Tiago Timponi Torrent


Abstract
Although FrameNet is recognized as one of the most fine-grained lexical databases, its coverage of lexical units is still limited. To tackle this issue, we propose a two-step frame induction process: for a set of lexical units not yet present in Berkeley FrameNet data release 1.7, first remove those that cannot fit into any existing semantic frame in FrameNet; then, assign the remaining lexical units to their correct frames. We also present the Semi-supervised Deep Embedded Clustering with Anomaly Detection (SDEC-AD) model—an algorithm that maps high-dimensional contextualized vector representations of lexical units to a low-dimensional latent space for better frame prediction and uses reconstruction error to identify lexical units that cannot evoke frames in FrameNet. SDEC-AD outperforms the state-of-the-art methods in both steps of the frame induction process. Empirical results also show that definitions provide contextual information for representing and characterizing the frame membership of lexical units.
Anthology ID:
2020.lrec-1.431
Volume:
Proceedings of the Twelfth Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
3509–3519
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.431
DOI:
Bibkey:
Cite (ACL):
Zheng Xin Yong and Tiago Timponi Torrent. 2020. Semi-supervised Deep Embedded Clustering with Anomaly Detection for Semantic Frame Induction. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 3509–3519, Marseille, France. European Language Resources Association.
Cite (Informal):
Semi-supervised Deep Embedded Clustering with Anomaly Detection for Semantic Frame Induction (Yong & Torrent, LREC 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.lrec-1.431.pdf