A Lexicon-Based Approach for Detecting Hedges in Informal Text

Jumayel Islam, Lu Xiao, Robert E. Mercer


Abstract
Hedging is a commonly used strategy in conversational management to show the speaker’s lack of commitment to what they communicate, which may signal problems between the speakers. Our project examines the role of hedging words and phrases in identifying tension between an interviewer and interviewee during a survivor interview. While hedge detection has been studied in the natural language processing literature, all existing work has focused on structured texts and formal communications. Our project therefore investigated a corpus of eight unstructured conversational interviews about the Rwanda Genocide and identified hedging patterns in the interviewees’ responses. Our work produced three manually constructed lists: hedge words, booster words, and hedging phrases. Leveraging these lexicons, we developed a rule-based algorithm that detects sentence-level hedges in informal conversations such as survivor interviews. Our work also produced a dataset of 3000 sentences annotated as Hedge or Non-hedge by three researchers. Experiments on this annotated dataset verify the efficacy of our proposed algorithm. Our work contributes to the further development of tools that identify hedges in informal conversations and discussions.
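To make the idea of lexicon-driven, sentence-level hedge detection concrete, here is a minimal Python sketch. It is not the authors' algorithm: the abstract does not spell out the rules or list the lexicon entries, so the tiny word lists, the function names `tokenize` and `is_hedge`, and the three rules (phrase match, word match, negated booster) are all assumptions made for illustration.

```python
import re

# Hypothetical toy lexicons for illustration only; the paper's actual
# manually constructed lists are not reproduced here.
HEDGE_WORDS = {"maybe", "perhaps", "probably", "possibly", "guess"}
HEDGE_PHRASES = ("i think", "sort of", "kind of", "as far as i know")
BOOSTER_WORDS = {"sure", "certain", "definitely", "clearly"}
NEGATIONS = {"not", "never", "no"}


def tokenize(sentence):
    """Lowercase the sentence and split it into word tokens.

    Contractions ending in n't are expanded so that the negation
    becomes its own 'not' token (e.g. "wasn't" -> "was", "not").
    """
    text = sentence.lower().replace("\u2019", "'")
    text = re.sub(r"n't\b", " not", text)
    return re.findall(r"[a-z']+", text)


def is_hedge(sentence):
    """Return True if the sentence is labelled Hedge under the assumed rules."""
    tokens = tokenize(sentence)
    padded = " " + " ".join(tokens) + " "

    # Assumed rule 1: any multi-word hedging phrase marks the sentence.
    if any(f" {phrase} " in padded for phrase in HEDGE_PHRASES):
        return True

    # Assumed rule 2: any single hedge word marks the sentence.
    if any(token in HEDGE_WORDS for token in tokens):
        return True

    # Assumed rule 3: a booster directly preceded by a negation
    # ("not sure") is treated as a hedge; a bare booster is not.
    for i, token in enumerate(tokens):
        if token in BOOSTER_WORDS and i > 0 and tokens[i - 1] in NEGATIONS:
            return True

    return False


if __name__ == "__main__":
    for s in ["I think we left in April.",
              "I'm not sure about the date.",
              "I am sure we left in April."]:
        print(s, "->", "Hedge" if is_hedge(s) else "Non-hedge")
```

A plain lexicon lookup like this over-triggers on ambiguous entries (for example, "kind of" in "what kind of food"), so a practical system would likely need additional context rules that this sketch omits.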
Anthology ID:
2020.lrec-1.380
Volume:
Proceedings of the Twelfth Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association
Pages:
3109–3113
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.380
Cite (ACL):
Jumayel Islam, Lu Xiao, and Robert E. Mercer. 2020. A Lexicon-Based Approach for Detecting Hedges in Informal Text. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 3109–3113, Marseille, France. European Language Resources Association.
Cite (Informal):
A Lexicon-Based Approach for Detecting Hedges in Informal Text (Islam et al., LREC 2020)
PDF:
https://aclanthology.org/2020.lrec-1.380.pdf