Dee Ann Reisinger
2020
The Universal Decompositional Semantics Dataset and Decomp Toolkit
Aaron Steven White
|
Elias Stengel-Eskin
|
Siddharth Vashishtha
|
Venkata Subrahmanyan Govindarajan
|
Dee Ann Reisinger
|
Tim Vieira
|
Keisuke Sakaguchi
|
Sheng Zhang
|
Francis Ferraro
|
Rachel Rudinger
|
Kyle Rawlins
|
Benjamin Van Durme
Proceedings of the Twelfth Language Resources and Evaluation Conference
We present the Universal Decompositional Semantics (UDS) dataset (v1.0), which is bundled with the Decomp toolkit (v0.1). UDS1.0 unifies five high-quality, decompositional semantics-aligned annotation sets within a single semantic graph specification—with graph structures defined by the predicative patterns produced by the PredPatt tool and real-valued node and edge attributes constructed using sophisticated normalization procedures. The Decomp toolkit provides a suite of Python 3 tools for querying UDS graphs using SPARQL. Both UDS1.0 and Decomp0.1 are publicly available at http://decomp.io.
2019
Deep Generalized Canonical Correlation Analysis
Adrian Benton
|
Huda Khayrallah
|
Biman Gujral
|
Dee Ann Reisinger
|
Sheng Zhang
|
Raman Arora
Proceedings of the 4th Workshop on Representation Learning for NLP (RepL4NLP-2019)
We present Deep Generalized Canonical Correlation Analysis (DGCCA) – a method for learning nonlinear transformations of arbitrarily many views of data, such that the resulting transformations are maximally informative of each other. While methods for nonlinear two view representation learning (Deep CCA, (Andrew et al., 2013)) and linear many-view representation learning (Generalized CCA (Horst, 1961)) exist, DGCCA combines the flexibility of nonlinear (deep) representation learning with the statistical power of incorporating information from many sources, or views. We present the DGCCA formulation as well as an efficient stochastic optimization algorithm for solving it. We learn and evaluate DGCCA representations for three downstream tasks: phonetic transcription from acoustic & articulatory measurements, recommending hashtags and recommending friends on a dataset of Twitter users.