“Vreselijk mooi!” (terribly beautiful): A Subjectivity Lexicon for Dutch Adjectives.

Tom De Smedt, Walter Daelemans


Abstract
We present a new open source subjectivity lexicon for Dutch adjectives. The lexicon is a dictionary of 1,100 adjectives that occur frequently in online product reviews, manually annotated with polarity strength, subjectivity and intensity, for each word sense. We discuss two machine learning methods (using distributional extraction and synset relations) to automatically expand the lexicon to 5,500 words. We evaluate the lexicon by comparing it to the user-given star rating of online product reviews. We show promising results in both in-domain and cross-domain evaluation. The lexicon is publicly available as part of the PATTERN software package (http://www.clips.ua.ac.be/pages/pattern).
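To make the abstract's description concrete, the following is a minimal sketch of how a lexicon annotated with polarity, subjectivity and intensity could be used to score a review and compare the result with its star rating. It is not the PATTERN implementation. The entries in `LEXICON`, their scores, the intensifier rule, and the "4 stars or more counts as positive" threshold are all hypothetical assumptions chosen for illustration.

```python
from typing import Dict, List, NamedTuple, Tuple


class Entry(NamedTuple):
    polarity: float      # -1.0 (negative) to +1.0 (positive)
    subjectivity: float  # 0.0 (objective) to 1.0 (subjective)
    intensity: float     # multiplier applied to a following adjective


# Hypothetical toy entries; these scores are invented, not taken from the lexicon.
LEXICON: Dict[str, Entry] = {
    "mooi": Entry(0.6, 0.9, 1.0),
    "slecht": Entry(-0.7, 0.9, 1.0),
    "saai": Entry(-0.5, 0.8, 1.0),
    "vreselijk": Entry(-0.9, 1.0, 1.5),
}


def score(text: str) -> Tuple[float, float]:
    """Return (average polarity, average subjectivity) of the known adjectives."""
    words = [w.strip('.,!?"“”').lower() for w in text.split()]
    polarities: List[float] = []
    subjectivities: List[float] = []
    boost = 1.0
    for i, word in enumerate(words):
        entry = LEXICON.get(word)
        if entry is None:
            boost = 1.0
            continue
        following = words[i + 1] if i + 1 < len(words) else None
        if following in LEXICON:
            # Assumed rule: a known word directly before another known word
            # acts as an intensifier and contributes only its intensity.
            boost = entry.intensity
            continue
        # Clamp the boosted polarity to the [-1, 1] range.
        polarities.append(max(-1.0, min(1.0, entry.polarity * boost)))
        subjectivities.append(entry.subjectivity)
        boost = 1.0
    if not polarities:
        return 0.0, 0.0
    return (sum(polarities) / len(polarities),
            sum(subjectivities) / len(subjectivities))


def accuracy(reviews: List[Tuple[str, int]], min_positive_stars: int = 4) -> float:
    """Fraction of reviews whose predicted polarity sign matches the star rating."""
    correct = 0
    for text, stars in reviews:
        predicted_positive = score(text)[0] > 0
        actually_positive = stars >= min_positive_stars
        correct += predicted_positive == actually_positive
    return correct / len(reviews)


# Invented example reviews with star ratings, for illustration only.
reviews = [("Vreselijk mooi!", 5), ("Een saai boek.", 2), ("Slecht geschreven.", 1)]
print(score("Vreselijk mooi!"))
print(accuracy(reviews))
```

Under these assumptions, "vreselijk" in "Vreselijk mooi!" strengthens the positive polarity of "mooi" instead of contributing its own negative polarity, which is the effect the paper's title alludes to.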
Anthology ID:
L12-1145
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
3568–3572
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/312_Paper.pdf
Cite (ACL):
Tom De Smedt and Walter Daelemans. 2012. “Vreselijk mooi!” (terribly beautiful): A Subjectivity Lexicon for Dutch Adjectives. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 3568–3572, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
“Vreselijk mooi!” (terribly beautiful): A Subjectivity Lexicon for Dutch Adjectives. (De Smedt & Daelemans, LREC 2012)