Sentiment analysis using negative selection algorithm for Twitter’s messages / Nazirah Che Alhadi

Micro-blogs as a new textual domain offer a unique proposition for sentiment analysis. Their short document length suggests any sentiment they contain is compact and explicit. It can pose difficulties for standard machine learning document representations because of the short length coupled with the...

Full description

Saved in:
Bibliographic Details
Main Author: Che Alhadi, Nazirah
Format: Thesis
Language:English
Published: 2012
Subjects:
Online Access:https://ir.uitm.edu.my/id/eprint/35377/1/35377.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Micro-blogs as a new textual domain offer a unique proposition for sentiment analysis. Their short document length suggests any sentiment they contain is compact and explicit. It can pose difficulties for standard machine learning document representations because of the short length coupled with their noisy nature. The aim of this project is to classify Twitter’s messages into sentiment categories based on the important keywords. This project methodology consists of five phases which are preliminary study, data collection and preparation, model development, model evaluation and documentation. This project is designed using negative selection algorithm to automatically classify the Twitter’s messages into its sentiment’s category based on important keyword recognition. In order to develop this model classification and prototype, 480 Twitter’s messages were used as training data and 120 Twitter’s messages for testing data to determine the accuracy of the classification model. The accuracy of this model is about 60 percent. Second experiment was carried out by reducing the data to 240 for training data and 60 data for testing. The accuracy for second experiment is improved to 63.33 percent.