Crossing the Line: Where do Demographic Variables Fit into Humor Detection?

J. A. Meaney


Abstract
Recent humor classification shared tasks have struggled with two issues: either the data comprises a highly constrained genre of humor which does not broadly represent humor, or the data is so indiscriminate that the inter-annotator agreement on its humor content is drastically low. These tasks typically average over all annotators’ judgments, in spite of the fact that humor is a highly subjective phenomenon. We argue that demographic factors influence whether a text is perceived as humorous or not. We propose the addition of demographic information about the humor annotators in order to bin ratings more sensibly. We also suggest the addition of an ‘offensive’ label to distinguish between different generations, in terms of humor. This would allow for more nuanced shared tasks and could lead to better performance on downstream tasks, such as content moderation.
Anthology ID:
2020.acl-srw.24
Volume:
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop
Month:
July
Year:
2020
Address:
Online
Editors:
Shruti Rijhwani, Jiangming Liu, Yizhong Wang, Rotem Dror
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
176–181
Language:
URL:
https://aclanthology.org/2020.acl-srw.24
DOI:
10.18653/v1/2020.acl-srw.24
Bibkey:
Cite (ACL):
J. A. Meaney. 2020. Crossing the Line: Where do Demographic Variables Fit into Humor Detection?. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, pages 176–181, Online. Association for Computational Linguistics.
Cite (Informal):
Crossing the Line: Where do Demographic Variables Fit into Humor Detection? (Meaney, ACL 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.acl-srw.24.pdf
Video:
 http://slideslive.com/38928664