Using Natural Language Explanations to Improve Robustness of In-context Learning

Xuanli He; Yuxiang Wu; Oana-Maria Camburu; Pasquale Minervini; Pontus Stenetorp

doi:10.18653/v1/2024.acl-long.728

Using Natural Language Explanations to Improve Robustness of In-context Learning

Xuanli He, Yuxiang Wu, Oana-Maria Camburu, Pasquale Minervini, Pontus Stenetorp

Abstract

Recent studies demonstrated that large language models (LLMs) can excel in many tasks via in-context learning (ICL). However, recentworks show that ICL-prompted models tend to produce inaccurate results when presented with adversarial inputs. In this work, we investigate whether augmenting ICL with natural language explanations (NLEs) improves the robustness of LLMs on adversarial datasets covering natural language inference and paraphrasing identification. We prompt LLMs with a small set of human-generated NLEs to produce further NLEs, yielding more accurate results than both a zero-shot-ICL setting and using only human-generated NLEs. Our results on five popular LLMs (GPT3.5-turbo, Llama2, Vicuna, Zephyr, and Mistral) show that our approach yields over 6% improvement over baseline approaches for eight adversarial datasets: HANS, ISCS, NaN, ST, PICD, PISP, ANLI, and PAWS. Furthermore, previous studies have demonstrated that prompt selection strategies significantly enhance ICL on in-distribution test sets. However, our findings reveal that these strategies do not match the efficacy of our approach for robustness evaluations, resulting in an accuracy drop of 8% compared to the proposed approach.

Anthology ID:: 2024.acl-long.728
Volume:: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: August
Year:: 2024
Address:: Bangkok, Thailand
Editors:: Lun-Wei Ku, Andre Martins, Vivek Srikumar
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 13477–13499
Language:
URL:: https://aclanthology.org/2024.acl-long.728/
DOI:: 10.18653/v1/2024.acl-long.728
Bibkey:
Cite (ACL):: Xuanli He, Yuxiang Wu, Oana-Maria Camburu, Pasquale Minervini, and Pontus Stenetorp. 2024. Using Natural Language Explanations to Improve Robustness of In-context Learning. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 13477–13499, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):: Using Natural Language Explanations to Improve Robustness of In-context Learning (He et al., ACL 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.acl-long.728.pdf

PDF Cite Search Fix data