Xiyuan Zhang
2019
D-NET: A Pre-Training and Fine-Tuning Framework for Improving the Generalization of Machine Reading Comprehension
Hongyu Li
|
Xiyuan Zhang
|
Yibing Liu
|
Yiming Zhang
|
Quan Wang
|
Xiangyang Zhou
|
Jing Liu
|
Hua Wu
|
Haifeng Wang
Proceedings of the 2nd Workshop on Machine Reading for Question Answering
In this paper, we introduce a simple system Baidu submitted for MRQA (Machine Reading for Question Answering) 2019 Shared Task that focused on generalization of machine reading comprehension (MRC) models. Our system is built on a framework of pretraining and fine-tuning, namely D-NET. The techniques of pre-trained language models and multi-task learning are explored to improve the generalization of MRC models and we conduct experiments to examine the effectiveness of these strategies. Our system is ranked at top 1 of all the participants in terms of averaged F1 score. Our codes and models will be released at PaddleNLP.
Proactive Human-Machine Conversation with Explicit Conversation Goal
Wenquan Wu
|
Zhen Guo
|
Xiangyang Zhou
|
Hua Wu
|
Xiyuan Zhang
|
Rongzhong Lian
|
Haifeng Wang
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
Though great progress has been made for human-machine conversation, current dialogue system is still in its infancy: it usually converses passively and utters words more as a matter of response, rather than on its own initiatives. In this paper, we take a radical step towards building a human-like conversational agent: endowing it with the ability of proactively leading the conversation (introducing a new topic or maintaining the current topic). To facilitate the development of such conversation systems, we create a new dataset named Konv where one acts as a conversation leader and the other acts as the follower. The leader is provided with a knowledge graph and asked to sequentially change the discussion topics, following the given conversation goal, and meanwhile keep the dialogue as natural and engaging as possible. Konv enables a very challenging task as the model needs to both understand dialogue and plan over the given knowledge graph. We establish baseline results on this dataset (about 270K utterances and 30k dialogues) using several state-of-the-art models. Experimental results show that dialogue models that plan over the knowledge graph can make full use of related knowledge to generate more diverse multi-turn conversations. The baseline systems along with the dataset are publicly available.
Search
Fix data
Co-authors
- Haifeng Wang 2
- Hua Wu (吴华) 2
- Xiangyang Zhou 2
- Zhen Guo 1
- Hongyu Li 1
- show all...