Deep Learning of Audio and Language Features for Humor Prediction

Dario Bertero, Pascale Fung


Abstract
We propose a comparison between various supervised machine learning methods to predict and detect humor in dialogues. We retrieve our humorous dialogues from a very popular TV sitcom: “The Big Bang Theory”. We build a corpus where punchlines are annotated using the canned laughter embedded in the audio track. Our comparative study involves a linear-chain Conditional Random Field over a Recurrent Neural Network and a Convolutional Neural Network. Using a combination of word-level and audio frame-level features, the CNN outperforms the other methods, obtaining the best F-score of 68.5% over 66.5% by CRF and 52.9% by RNN. Our work is a starting point to developing more effective machine learning and neural network models on the humor prediction task, as well as developing machines capable in understanding humor in general.
Anthology ID:
L16-1079
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
496–501
Language:
URL:
https://aclanthology.org/L16-1079
DOI:
Bibkey:
Cite (ACL):
Dario Bertero and Pascale Fung. 2016. Deep Learning of Audio and Language Features for Humor Prediction. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 496–501, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Deep Learning of Audio and Language Features for Humor Prediction (Bertero & Fung, LREC 2016)
Copy Citation:
PDF:
https://aclanthology.org/L16-1079.pdf