Vishwa Pardeshi
2026
Do VLMs Have a Moral Backbone? A Study on the Fragile Morality of Vision-Language Models
Zhining Liu | Tianyi Wang | Xiao Lin | Penghao Ouyang | Gaotang Li | Ze Yang | Hui Liu | Sumit Keswani | Vishwa Pardeshi | Huijun Zhao | Wei Fan | Hanghang Tong
Findings of the Association for Computational Linguistics: ACL 2026
Zhining Liu | Tianyi Wang | Xiao Lin | Penghao Ouyang | Gaotang Li | Ze Yang | Hui Liu | Sumit Keswani | Vishwa Pardeshi | Huijun Zhao | Wei Fan | Hanghang Tong
Findings of the Association for Computational Linguistics: ACL 2026
Despite substantial efforts toward improving the moral alignment of Vision-Language Models (VLMs), it remains unclear whether their ethical judgments are stable in realistic settings. This work studies moral robustness in VLMs, defined as the ability to preserve moral judgments under textual and visual perturbations that do not alter the underlying moral context. We systematically probe VLMs with a diverse set of model-agnostic multimodal perturbations and find that their moral stances are highly fragile, frequently flipping under simple manipulations. Our analysis reveals systematic vulnerabilities across perturbation types, moral domains, and model scales, including a sycophancy trade-off where stronger instruction-following models are more susceptible to persuasion. We further show that lightweight inference-time interventions can partially restore moral stability. These results demonstrate that moral alignment alone is insufficient and that moral robustness is a necessary criterion for the responsible deployment of VLMs.
LegalDrill: Diagnosis-Driven Synthesis for Legal Reasoning in Small Language Models
Tianchun Li | Haochen Liu | Vishwa Pardeshi | Xingchen Wang | Tianci Liu | Huijun Zhao | Wei Fan | Jing Gao
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Tianchun Li | Haochen Liu | Vishwa Pardeshi | Xingchen Wang | Tianci Liu | Huijun Zhao | Wei Fan | Jing Gao
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Small language models (SLMs) are promising for real-world deployment due to their efficiency and low operational cost. However, their limited capacity struggles with high-stakes legal reasoning tasks that require coherent statute interpretation and logically consistent deduction. Furthermore, training SLMs for such tasks demands high-quality, concise reasoning trajectories, which are prohibitively expensive to manually collect and difficult to curate via standard rejection sampling, which lacks granularity beyond final verdicts. To address these challenges, we propose LegalDrill, a diagnosis-driven synthesis framework that extracts and iteratively refines reasoning trajectories from a capable teacher via fine-grained prompting, then a self-reflective verification is employed to adaptively select the most effective data for the SLM student. The resulting data empower SLM training through supervised fine-tuning and direct preference optimization. Extensive experiments on several legal benchmarks demonstrate that LegalDrill significantly bolsters the legal reasoning capabilities of representative SLMs while bypassing the need for scarce expert annotations, paving a scalable path toward practical legal reasoning systems.