%0 Conference Proceedings %T Steering Llama 2 via Contrastive Activation Addition %A Rimsky, Nina %A Gabrieli, Nick %A Schulz, Julian %A Tong, Meg %A Hubinger, Evan %A Turner, Alexander %Y Ku, Lun-Wei %Y Martins, Andre %Y Srikumar, Vivek %S Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) %D 2024 %8 August %I Association for Computational Linguistics %C Bangkok, Thailand %F rimsky-etal-2024-steering %R 10.18653/v1/2024.acl-long.828 %U https://aclanthology.org/2024.acl-long.828/ %U https://doi.org/10.18653/v1/2024.acl-long.828 %P 15504-15522