Deep Bayesian Natural Language Processing

Jen-Tzung Chien


Abstract
This introductory tutorial addresses the advances in deep Bayesian learning for natural language with ubiquitous applications ranging from speech recognition to document summarization, text classification, text segmentation, information extraction, image caption generation, sentence generation, dialogue control, sentiment classification, recommendation system, question answering and machine translation, to name a few. Traditionally, “deep learning” is taken to be a learning process where the inference or optimization is based on the real-valued deterministic model. The “semantic structure” in words, sentences, entities, actions and documents drawn from a large vocabulary may not be well expressed or correctly optimized in mathematical logic or computer programs. The “distribution function” in discrete or continuous latent variable model for natural language may not be properly decomposed or estimated. This tutorial addresses the fundamentals of statistical models and neural networks, and focus on a series of advanced Bayesian models and deep models including hierarchical Dirichlet process, Chinese restaurant process, hierarchical Pitman-Yor process, Indian buffet process, recurrent neural network, long short-term memory, sequence-to-sequence model, variational auto-encoder, generative adversarial network, attention mechanism, memory-augmented neural network, skip neural network, stochastic neural network, predictive state neural network and policy neural network. We present how these models are connected and why they work for a variety of applications on symbolic and complex patterns in natural language. The variational inference and sampling method are formulated to tackle the optimization for complicated models. The word and sentence embeddings, clustering and co-clustering are merged with linguistic and semantic constraints. A series of case studies and domain applications are presented to tackle different issues in deep Bayesian processing, learning and understanding. At last, we will point out a number of directions and outlooks for future studies.
Anthology ID:
P19-4006
Volume:
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts
Month:
July
Year:
2019
Address:
Florence, Italy
Editors:
Preslav Nakov, Alexis Palmer
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
25–30
Language:
URL:
https://aclanthology.org/P19-4006
DOI:
10.18653/v1/P19-4006
Bibkey:
Cite (ACL):
Jen-Tzung Chien. 2019. Deep Bayesian Natural Language Processing. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, pages 25–30, Florence, Italy. Association for Computational Linguistics.
Cite (Informal):
Deep Bayesian Natural Language Processing (Chien, ACL 2019)
Copy Citation:
PDF:
https://aclanthology.org/P19-4006.pdf