Francis Dutil
2017
Plan, Attend, Generate: Character-Level Neural Machine Translation with Planning
Caglar Gulcehre
|
Francis Dutil
|
Adam Trischler
|
Yoshua Bengio
Proceedings of the 2nd Workshop on Representation Learning for NLP
We investigate the integration of a planning mechanism into an encoder-decoder architecture with attention. We develop a model that can plan ahead when it computes alignments between the source and target sequences not only for a single time-step but for the next k time-steps as well by constructing a matrix of proposed future alignments and a commitment vector that governs whether to follow or recompute the plan. This mechanism is inspired by strategic attentive reader and writer (STRAW) model, a recent neural architecture for planning with hierarchical reinforcement learning that can also learn higher level temporal abstractions. Our proposed model is end-to-end trainable with differentiable operations. We show that our model outperforms strong baselines on character-level translation task from WMT’15 with fewer parameters and computes alignments that are qualitatively intuitive.
Adversarial Generation of Natural Language
Sandeep Subramanian
|
Sai Rajeswar
|
Francis Dutil
|
Chris Pal
|
Aaron Courville
Proceedings of the 2nd Workshop on Representation Learning for NLP
Generative Adversarial Networks (GANs) have gathered a lot of attention from the computer vision community, yielding impressive results for image generation. Advances in the adversarial generation of natural language from noise however are not commensurate with the progress made in generating images, and still lag far behind likelihood based methods. In this paper, we take a step towards generating natural language with a GAN objective alone. We introduce a simple baseline that addresses the discrete output space problem without relying on gradient estimators and show that it is able to achieve state-of-the-art results on a Chinese poem generation dataset. We present quantitative results on generating sentences from context-free and probabilistic context-free grammars, and qualitative language modeling results. A conditional version is also described that can generate sequences conditioned on sentence characteristics.
Search
Fix data
Co-authors
- Yoshua Bengio 1
- Aaron Courville 1
- Çağlar Gu̇lçehre 1
- Christopher Pal 1
- Sai Rajeswar 1
- show all...