http://hdl.handle.net/1893/24030
Appears in Collections: | Computing Science and Mathematics Journal Articles |
Peer Review Status: | Refereed |
Title: | Unsupervised Commonsense Knowledge Enrichment for Domain-Specific Sentiment Analysis |
Author(s): | Ofek, Nir Poria, Soujanya Rokach, Lior Cambria, Erik Hussain, Amir Shabtai, Asaf |
Contact Email: | ahu@cs.stir.ac.uk |
Keywords: | Sentiment analysis Sentiment lexicon SenticNet Sentic patterns |
Issue Date: | Jun-2016 |
Date Deposited: | 15-Aug-2016 |
Citation: | Ofek N, Poria S, Rokach L, Cambria E, Hussain A & Shabtai A (2016) Unsupervised Commonsense Knowledge Enrichment for Domain-Specific Sentiment Analysis. Cognitive Computation, 8 (3), pp. 467-477. http://link.springer.com/article/10.1007/s12559-015-9375-3; https://doi.org/10.1007/s12559-015-9375-3 |
Abstract: | Sentiment analysis in natural language text is a challenging task involving a deep understanding of both syntax and semantics. Leveraging the polarity of multiword expressions—or concepts—rather than single words can mitigate the difficulty of such a task as these expressions carry more contextual information than isolated words. Such contextual information is the key to understanding both the syntactic and semantic structure of natural language text and hence is useful in tasks such as sentiment analysis. In this work, we propose a new method to enrich SenticNet (a publicly available knowledge base for concept-level sentiment analysis) with domain-level concepts composed of aspects and sentiment word pairs, along with a measure of their polarity. We process a set of unlabeled texts and, by considering the statistical co-occurrence information, generate a direct acyclic graph (DAG) of concepts. The polarity score of known concepts is propagated and used to compute polarity scores of new concepts. By designing and implementing our exhaustive algorithm, we are able to use a seed set containing only two sentiment words (goodandbad). In our evaluation conducted on a dataset of hotel reviews, SenticNet was enriched by a factor of three (from 30,000 to nearly 90,000 concepts). The experiments demonstrate the merit of the concepts discovered by our method at improving sentence-level and aspect-level sentiment analysis tasks. Results of the two-factor ANOVA statistical test showed a confidence level of 95%, verifying that the improvements are statistically significant. |
URL: | http://link.springer.com/article/10.1007/s12559-015-9375-3 |
DOI Link: | 10.1007/s12559-015-9375-3 |
Rights: | The publisher does not allow this work to be made publicly available in this Repository. Please use the Request a Copy feature at the foot of the Repository record to request a copy directly from the author. You can only request a copy if you wish to use this work for your own research or private study. |
Licence URL(s): | http://www.rioxx.net/licenses/under-embargo-all-rights-reserved |
File | Description | Size | Format | |
---|---|---|---|---|
CogComp-paper-PDF-June2016.pdf | Fulltext - Published Version | 567.37 kB | Adobe PDF | Under Embargo until 2999-12-13 Request a copy |
Note: If any of the files in this item are currently embargoed, you can request a copy directly from the author by clicking the padlock icon above. However, this facility is dependent on the depositor still being contactable at their original email address.
This item is protected by original copyright |
Items in the Repository are protected by copyright, with all rights reserved, unless otherwise indicated.
The metadata of the records in the Repository are available under the CC0 public domain dedication: No Rights Reserved https://creativecommons.org/publicdomain/zero/1.0/
If you believe that any material held in STORRE infringes copyright, please contact library@stir.ac.uk providing details and we will remove the Work from public display in STORRE and investigate your claim.