Hybrid Models for Aspects Extraction without Labelled Dataset

Wai-Howe Khong, Lay-Ki Soon, Hui-Ngo Goh


Abstract
One of the important tasks in opinion mining is extracting the aspects of an opinion target. Aspects are the features or characteristics of the target under review, and they can be categorised as explicit or implicit. Extracting aspects from opinions is essential to ensure that accurate information about specific attributes of an opinion target is retrieved. For instance, a review of a professional camera may give positive feedback on its functionalities but negative feedback on its overly high price. Most existing solutions focus on explicit aspects. However, sentences in reviews normally do not state the aspects explicitly. In this research, two hybrid models, TDM-DC and TDM-TED, are proposed to identify and extract both explicit and implicit aspects. The proposed models combine topic modelling with a dictionary-based approach. The models are unsupervised, as they do not require any labelled dataset. The experimental results show that TDM-DC achieves an F1-measure of 58.70%, outperforming both the baseline topic model and the dictionary-based approach. Compared with other existing unsupervised techniques, the proposed models achieve an F1-measure that is approximately 3% higher. Although supervised techniques perform slightly better, the proposed models are domain-independent and hence more versatile.
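The F1-measure reported above is the harmonic mean of precision and recall. As a minimal sketch of how such a score can be computed for aspect extraction, the snippet below compares a set of extracted aspects with a set of annotated aspects. The function name `f1_measure` and the example aspect lists are hypothetical, and exact set matching is an assumption; the abstract does not describe the evaluation protocol the authors used.

```python
def f1_measure(predicted, gold):
    """Return (precision, recall, F1) using exact set matching of aspect terms.

    Exact matching is an assumption made for this sketch, not the paper's protocol.
    """
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    # Guard against empty inputs to avoid division by zero.
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical data: aspects extracted by a model vs. hand-annotated aspects.
predicted_aspects = ["price", "lens", "battery"]
gold_aspects = ["price", "functionality", "battery", "size"]
precision, recall, f1 = f1_measure(predicted_aspects, gold_aspects)
print(f"precision={precision:.4f} recall={recall:.4f} f1={f1:.4f}")
```

Because the harmonic mean is pulled toward the lower of the two values, a model cannot reach a high F1 by maximising only precision or only recall.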
Anthology ID:
D19-6611
Volume:
Proceedings of the Second Workshop on Fact Extraction and VERification (FEVER)
Month:
November
Year:
2019
Address:
Hong Kong, China
Editors:
James Thorne, Andreas Vlachos, Oana Cocarascu, Christos Christodoulopoulos, Arpit Mittal
Venue:
WS
Publisher:
Association for Computational Linguistics
Pages:
63–68
URL:
https://aclanthology.org/D19-6611
DOI:
10.18653/v1/D19-6611
Cite (ACL):
Wai-Howe Khong, Lay-Ki Soon, and Hui-Ngo Goh. 2019. Hybrid Models for Aspects Extraction without Labelled Dataset. In Proceedings of the Second Workshop on Fact Extraction and VERification (FEVER), pages 63–68, Hong Kong, China. Association for Computational Linguistics.
Cite (Informal):
Hybrid Models for Aspects Extraction without Labelled Dataset (Khong et al., 2019)
PDF:
https://aclanthology.org/D19-6611.pdf