Please use this identifier to cite or link to this item: http://hdl.handle.net/1893/24030
Appears in Collections: Computing Science and Mathematics Journal Articles
Peer Review Status: Refereed
Title: Unsupervised Commonsense Knowledge Enrichment for Domain-Specific Sentiment Analysis
Author(s): Ofek, Nir
Poria, Soujanya
Rokach, Lior
Cambria, Erik
Hussain, Amir
Shabtai, Asaf
Contact Email: ahu@cs.stir.ac.uk
Keywords: Sentiment analysis
Sentiment lexicon
SenticNet
Sentic patterns
Issue Date: Jun-2016
Date Deposited: 15-Aug-2016
Citation: Ofek N, Poria S, Rokach L, Cambria E, Hussain A & Shabtai A (2016) Unsupervised Commonsense Knowledge Enrichment for Domain-Specific Sentiment Analysis. Cognitive Computation, 8 (3), pp. 467-477. http://link.springer.com/article/10.1007/s12559-015-9375-3; https://doi.org/10.1007/s12559-015-9375-3
Abstract: Sentiment analysis in natural language text is a challenging task involving a deep understanding of both syntax and semantics. Leveraging the polarity of multiword expressions, or concepts, rather than single words can mitigate the difficulty of such a task, as these expressions carry more contextual information than isolated words. Such contextual information is the key to understanding both the syntactic and semantic structure of natural language text and hence is useful in tasks such as sentiment analysis. In this work, we propose a new method to enrich SenticNet (a publicly available knowledge base for concept-level sentiment analysis) with domain-level concepts composed of aspects and sentiment word pairs, along with a measure of their polarity. We process a set of unlabeled texts and, by considering the statistical co-occurrence information, generate a directed acyclic graph (DAG) of concepts. The polarity score of known concepts is propagated and used to compute polarity scores of new concepts. By designing and implementing our exhaustive algorithm, we are able to use a seed set containing only two sentiment words (good and bad). In our evaluation conducted on a dataset of hotel reviews, SenticNet was enriched by a factor of three (from 30,000 to nearly 90,000 concepts). The experiments demonstrate the merit of the concepts discovered by our method in improving sentence-level and aspect-level sentiment analysis tasks. Results of the two-factor ANOVA statistical test showed a confidence level of 95%, verifying that the improvements are statistically significant.
URL: http://link.springer.com/article/10.1007/s12559-015-9375-3
DOI Link: 10.1007/s12559-015-9375-3
Rights: The publisher does not allow this work to be made publicly available in this Repository. Please use the Request a Copy feature at the foot of the Repository record to request a copy directly from the author. You can only request a copy if you wish to use this work for your own research or private study.
Licence URL(s): http://www.rioxx.net/licenses/under-embargo-all-rights-reserved

Files in This Item:
File: CogComp-paper-PDF-June2016.pdf
Description: Fulltext - Published Version
Size: 567.37 kB
Format: Adobe PDF
Under Embargo until 2999-12-13

Note: If any of the files in this item are currently embargoed, you can request a copy directly from the author by clicking the padlock icon above. However, this facility is dependent on the depositor still being contactable at their original email address.

This item is protected by original copyright

Items in the Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

The metadata of the records in the Repository are available under the CC0 public domain dedication: No Rights Reserved https://creativecommons.org/publicdomain/zero/1.0/

If you believe that any material held in STORRE infringes copyright, please contact library@stir.ac.uk providing details and we will remove the Work from public display in STORRE and investigate your claim.