Watset: Local-Global Graph Clustering with Applications in Sense and Frame Induction

Dmitry Ustalov, Alexander Panchenko, Chris Biemann, Simone Paolo Ponzetto


Abstract
We present a detailed theoretical and computational analysis of the Watset meta-algorithm for fuzzy graph clustering, which has been found to be widely applicable in a variety of domains. This algorithm creates an intermediate representation of the input graph, which reflects the “ambiguity” of its nodes. Then, it uses hard clustering to discover clusters in this “disambiguated” intermediate graph. After outlining the approach and analyzing its computational complexity, we demonstrate that Watset shows competitive results in three applications: unsupervised synset induction from a synonymy graph, unsupervised semantic frame induction from dependency triples, and unsupervised semantic class induction from a distributional thesaurus. Our algorithm is generic and can also be applied to other networks of linguistic data.
Anthology ID:
J19-3002
Volume:
Computational Linguistics, Volume 45, Issue 3 - September 2019
Month:
September
Year:
2019
Address:
Cambridge, MA
Venue:
CL
SIG:
Publisher:
MIT Press
Note:
Pages:
423–479
Language:
URL:
https://aclanthology.org/J19-3002
DOI:
10.1162/coli_a_00354
Bibkey:
Cite (ACL):
Dmitry Ustalov, Alexander Panchenko, Chris Biemann, and Simone Paolo Ponzetto. 2019. Watset: Local-Global Graph Clustering with Applications in Sense and Frame Induction. Computational Linguistics, 45(3):423–479.
Cite (Informal):
Watset: Local-Global Graph Clustering with Applications in Sense and Frame Induction (Ustalov et al., CL 2019)
Copy Citation:
PDF:
https://aclanthology.org/J19-3002.pdf
Code
 nlpub/watset-java +  additional community code
Data
FrameNet