@inproceedings{li-etal-2025-reinforcement,
title = "A Reinforcement Learning Framework for Cross-Lingual Stance Detection Using {Chain-of-Thought} Alignment",
author = "Li, Binghui and
Zou, Minghui and
Zhang, Xiaowang and
Chen, Shizhan and
Feng, Zhiyong",
editor = "Che, Wanxiang and
Nabende, Joyce and
Shutova, Ekaterina and
Pilehvar, Mohammad Taher",
booktitle = "Findings of the Association for Computational Linguistics: ACL 2025",
month = jul,
year = "2025",
address = "Vienna, Austria",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.findings-acl.1115/",
doi = "10.18653/v1/2025.findings-acl.1115",
pages = "21674--21688",
ISBN = "979-8-89176-256-5",
abstract = "Cross-lingual stance detection identifies users' attitudes toward specific targets in texts by transferring knowledge from source languages to target languages. Previous studies have typically facilitated this transfer by translating and aligning labels or targets. However, these methods cannot effectively perform cross-lingual transfer of the complex reasoning processes in stance detection. To address this challenge, we propose a reinforcement learning framework using cross-lingual Chain-of-Thought (CoT) alignment, referred to as RCCA. Specifically, we adopt a cross-lingual CoT alignment strategy to obtain the high-quality CoTs generated from target language inputs. After that, we leverage reinforcement learning by sampling CoTs and assigning rewards according to predefined rules, aiming to enhance the model{'}s generalization capabilities in the target language. Experimental results on four multilingual datasets demonstrate that our approach outperforms competitive methods."
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="li-etal-2025-reinforcement">
<titleInfo>
<title>A Reinforcement Learning Framework for Cross-Lingual Stance Detection Using Chain-of-Thought Alignment</title>
</titleInfo>
<name type="personal">
<namePart type="given">Binghui</namePart>
<namePart type="family">Li</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Minghui</namePart>
<namePart type="family">Zou</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Xiaowang</namePart>
<namePart type="family">Zhang</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Shizhan</namePart>
<namePart type="family">Chen</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Zhiyong</namePart>
<namePart type="family">Feng</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2025-07</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Findings of the Association for Computational Linguistics: ACL 2025</title>
</titleInfo>
<name type="personal">
<namePart type="given">Wanxiang</namePart>
<namePart type="family">Che</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Joyce</namePart>
<namePart type="family">Nabende</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Ekaterina</namePart>
<namePart type="family">Shutova</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Mohammad</namePart>
<namePart type="given">Taher</namePart>
<namePart type="family">Pilehvar</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>Association for Computational Linguistics</publisher>
<place>
<placeTerm type="text">Vienna, Austria</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
<identifier type="isbn">979-8-89176-256-5</identifier>
</relatedItem>
<abstract>Cross-lingual stance detection identifies users’ attitudes toward specific targets in texts by transferring knowledge from source languages to target languages. Previous studies have typically facilitated this transfer by translating and aligning labels or targets. However, these methods cannot effectively perform cross-lingual transfer of the complex reasoning processes in stance detection. To address this challenge, we propose a reinforcement learning framework using cross-lingual Chain-of-Thought (CoT) alignment, referred to as RCCA. Specifically, we adopt a cross-lingual CoT alignment strategy to obtain the high-quality CoTs generated from target language inputs. After that, we leverage reinforcement learning by sampling CoTs and assigning rewards according to predefined rules, aiming to enhance the model’s generalization capabilities in the target language. Experimental results on four multilingual datasets demonstrate that our approach outperforms competitive methods.</abstract>
<identifier type="citekey">li-etal-2025-reinforcement</identifier>
<identifier type="doi">10.18653/v1/2025.findings-acl.1115</identifier>
<location>
<url>https://aclanthology.org/2025.findings-acl.1115/</url>
</location>
<part>
<date>2025-07</date>
<extent unit="page">
<start>21674</start>
<end>21688</end>
</extent>
</part>
</mods>
</modsCollection>
%0 Conference Proceedings
%T A Reinforcement Learning Framework for Cross-Lingual Stance Detection Using Chain-of-Thought Alignment
%A Li, Binghui
%A Zou, Minghui
%A Zhang, Xiaowang
%A Chen, Shizhan
%A Feng, Zhiyong
%Y Che, Wanxiang
%Y Nabende, Joyce
%Y Shutova, Ekaterina
%Y Pilehvar, Mohammad Taher
%S Findings of the Association for Computational Linguistics: ACL 2025
%D 2025
%8 July
%I Association for Computational Linguistics
%C Vienna, Austria
%@ 979-8-89176-256-5
%F li-etal-2025-reinforcement
%X Cross-lingual stance detection identifies users’ attitudes toward specific targets in texts by transferring knowledge from source languages to target languages. Previous studies have typically facilitated this transfer by translating and aligning labels or targets. However, these methods cannot effectively perform cross-lingual transfer of the complex reasoning processes in stance detection. To address this challenge, we propose a reinforcement learning framework using cross-lingual Chain-of-Thought (CoT) alignment, referred to as RCCA. Specifically, we adopt a cross-lingual CoT alignment strategy to obtain the high-quality CoTs generated from target language inputs. After that, we leverage reinforcement learning by sampling CoTs and assigning rewards according to predefined rules, aiming to enhance the model’s generalization capabilities in the target language. Experimental results on four multilingual datasets demonstrate that our approach outperforms competitive methods.
%R 10.18653/v1/2025.findings-acl.1115
%U https://aclanthology.org/2025.findings-acl.1115/
%U https://doi.org/10.18653/v1/2025.findings-acl.1115
%P 21674-21688
Markdown (Informal)
[A Reinforcement Learning Framework for Cross-Lingual Stance Detection Using Chain-of-Thought Alignment](https://aclanthology.org/2025.findings-acl.1115/) (Li et al., Findings 2025)
ACL