%0 Conference Proceedings
%T On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems
%A Su, Pei-Hao
%A Gašić, Milica
%A Mrkšić, Nikola
%A Rojas-Barahona, Lina M.
%A Ultes, Stefan
%A Vandyke, David
%A Wen, Tsung-Hsien
%A Young, Steve
%Y Erk, Katrin
%Y Smith, Noah A.
%S Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
%D 2016
%8 August
%I Association for Computational Linguistics
%C Berlin, Germany
%F su-etal-2016-line
%R 10.18653/v1/P16-1230
%U https://aclanthology.org/P16-1230/
%U https://doi.org/10.18653/v1/P16-1230
%P 2431-2441