Durashi Langappuli
2020
Dialog policy optimization for low resource setting using Self-play and Reward based Sampling
Tharindu Madusanka
|
Durashi Langappuli
|
Thisara Welmilla
|
Uthayasanker Thayasivam
|
Sanath Jayasena
Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation
Search