ACL Anthology
News
(current)
FAQ
(current)
Corrections
(current)
Submissions
(current)
GitHub
Durashi
Langappuli
2020
pdf
bib
Dialog policy optimization for low resource setting using Self-play and Reward based Sampling
Tharindu Madusanka
|
Durashi Langappuli
|
Thisara Welmilla
|
Uthayasanker Thayasivam
|
Sanath Jayasena
Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation
Search
Co-authors
Sanath Jayasena
1
Tharindu Madusanka
1
Uthayasanker Thayasivam
1
Thisara Welmilla
1
Venues
paclic
1
Fix author