Xuekai Zhu, Biqing Qi, Kaiyan Zhang, Xinwei Long, Zhouhan Lin, and Bowen Zhou. 2024. PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), edited by Kevin Duh, Helena Gomez, and Steven Bethard, pages 2571-2597, Mexico City, Mexico, June 2024. Association for Computational Linguistics. Conference publication. Citation key: zhu-etal-2024-pad. DOI: 10.18653/v1/2024.naacl-long.142. URL: https://aclanthology.org/2024.naacl-long.142/