Efficient (Soft) Q-Learning for Text Generation with Limited Good Data Han Guo author Bowen Tan author Zhengzhong Liu author Eric Xing author Zhiting Hu author 2022-12 text Findings of the Association for Computational Linguistics: EMNLP 2022 Yoav Goldberg editor Zornitsa Kozareva editor Yue Zhang editor Association for Computational Linguistics Abu Dhabi, United Arab Emirates conference publication guo-etal-2022-efficient 10.18653/v1/2022.findings-emnlp.518 https://aclanthology.org/2022.findings-emnlp.518/ 2022-12 6969 6991