SIERA: An Evaluation Metric for Text Simplification using the Ranking Model and Data Augmentation by Edit Operations

Hikaru Yamanaka, Takenobu Tokunaga


Abstract
Automatic evaluation metrics are indispensable for text simplification (TS) research. The past TS research adopts three evaluation aspects: fluency, meaning preservation and simplicity. However, there is little consensus on a metric to measure simplicity, a unique aspect of TS compared with other text generation tasks. In addition, many of the existing metrics require reference simplified texts for evaluation. Thus, the cost of collecting reference texts is also an issue. This study proposes a new automatic evaluation metric, SIERA, for sentence simplification. SIERA employs a ranking model for the order relation of simplicity, which is trained by pairs of the original and simplified sentences. It does not require reference sentences for either training or evaluation. The sentence pairs for training are further augmented by the proposed method that utlizes edit operations to generate intermediate sentences with the simplicity between the original and simplified sentences. Using three evaluation datasets for text simplification, we compare SIERA with other metrics by calculating the correlations between metric values and human ratings. The results showed SIERA’s superiority over other metrics with a reservation that the quality of evaluation sentences is consistent with that of the training data.
Anthology ID:
2024.readi-1.5
Volume:
Proceedings of the 3rd Workshop on Tools and Resources for People with REAding DIfficulties (READI) @ LREC-COLING 2024
Month:
May
Year:
2024
Address:
Torino, Italia
Editors:
Rodrigo Wilkens, Rémi Cardon, Amalia Todirascu, Núria Gala
Venues:
READI | WS
SIG:
Publisher:
ELRA and ICCL
Note:
Pages:
47–58
Language:
URL:
https://aclanthology.org/2024.readi-1.5
DOI:
Bibkey:
Cite (ACL):
Hikaru Yamanaka and Takenobu Tokunaga. 2024. SIERA: An Evaluation Metric for Text Simplification using the Ranking Model and Data Augmentation by Edit Operations. In Proceedings of the 3rd Workshop on Tools and Resources for People with REAding DIfficulties (READI) @ LREC-COLING 2024, pages 47–58, Torino, Italia. ELRA and ICCL.
Cite (Informal):
SIERA: An Evaluation Metric for Text Simplification using the Ranking Model and Data Augmentation by Edit Operations (Yamanaka & Tokunaga, READI-WS 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.readi-1.5.pdf