Fuzzy semantic classifier for determining strength levels of customer product reviews

Opinion Mining (OM) is one of the new paradigms of information retrieval and computational linguistics. This paradigm is not only concerned with document topic but also the opinion which is expressed. The most challenging area in OM is finding the orientation of customer feeling in reviews such as...

Full description

Saved in:
Bibliographic Details
Main Author: Nadali, Samaneh
Format: Thesis
Language:English
Published: 2012
Subjects:
Online Access:http://psasir.upm.edu.my/id/eprint/32229/1/FSKTM%202012%2011R.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Opinion Mining (OM) is one of the new paradigms of information retrieval and computational linguistics. This paradigm is not only concerned with document topic but also the opinion which is expressed. The most challenging area in OM is finding the orientation of customer feeling in reviews such as blogs, product reviews and so on. Opinion about products is nowadays available from blogs and review sites. So, extracting opinion from these reviews help the user as well merchants to track the most suitable choice of product. There are various tasks in OM. Classification of customer reviews into positive, negative and neutral classes (also known as semantic classification) is one of the tasks that help product manufacturers or businesses to easily identify orientation of their product services. Previous studies focused on the automatic identification of opinion i.e. classifying reviews into positive, negative and neutral only. However, for some applications like flame detection or information analysis, recognizing opinion only might not be sufficient. Thus, identifying strength of opinion is considered as one of the propounded problems from the early days. In this thesis, we extended the holistic lexicon-based approach to opinion mining presented in (Ding et al., 2008), in which the researcher did not focus on finding the strength levels of opinion of each product reviews. To address the mentioned problem, a Fuzzy Semantic Classifier (FSC) is proposed to identify semantic orientation of customer product reviews at a granularity levels such as very strong, strong, moderate, weak, and very weak for each positive and negative class by combining opinion words (i.e. adverb, adjective, verb, and noun). We used fuzzy logic as it is not only using non-numerical values but also it introduces the notion of linguistic variables to overcome the uncertainty of natural language. The proposed classifier (FSC) has been tested on eight benchmark datasets introduced by (Ding et al., 2008). The results of the study showed that a Fuzzy Semantic Classifier (FSC) gave various strength of levels classification in customer product reviews which leads to multi understandability of customer opinions. The percentage of similarity between FSC and human classifications is 74%. This means that the FSC is able to classify various strength levels to very strong, strong, moderate, weak and very weak for each positive and negative class similar to human.