PSST: A Benchmark for Evaluation-driven Text Public-Speaking Style Transfer

Huashan Sun, Yixiao Wu, Yizhe Yang, Yinghao Li, Jiawei Li, Yuhao Ye, Yang Gao


Abstract
Language style is necessary for AI systems to accurately understand and generate diverse human language. However, previous text style transfer primarily focused on sentence-level data-driven approaches, limiting exploration of potential problems in large language models (LLMs) and the ability to meet complex application needs. To overcome these limitations, we introduce a novel task called Public-Speaking Style Transfer (PSST), which aims to simulate humans to transform passage-level, official texts into a public-speaking style. Grounded in the analysis of real-world data from a linguistic perspective, we decompose public-speaking style into key sub-styles to pose challenges and quantify the style modeling capability of LLMs. For such intricate text style transfer, we further propose a fine-grained evaluation framework to analyze the characteristics and identify the problems of stylized texts. Comprehensive experiments suggest that current LLMs struggle to generate public speaking texts that align with human preferences, primarily due to excessive stylization and loss of semantic information. We will release our data, code, and model upon acceptance.
Anthology ID:
2024.findings-emnlp.495
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2024
Month:
November
Year:
2024
Address:
Miami, Florida, USA
Editors:
Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
8438–8471
Language:
URL:
https://aclanthology.org/2024.findings-emnlp.495
DOI:
Bibkey:
Cite (ACL):
Huashan Sun, Yixiao Wu, Yizhe Yang, Yinghao Li, Jiawei Li, Yuhao Ye, and Yang Gao. 2024. PSST: A Benchmark for Evaluation-driven Text Public-Speaking Style Transfer. In Findings of the Association for Computational Linguistics: EMNLP 2024, pages 8438–8471, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):
PSST: A Benchmark for Evaluation-driven Text Public-Speaking Style Transfer (Sun et al., Findings 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.findings-emnlp.495.pdf