Sentiment analysis using negative selection algorithm for Twitter’s messages / Nazirah Che Alhadi

Micro-blogs as a new textual domain offer a unique proposition for sentiment analysis. Their short document length suggests any sentiment they contain is compact and explicit. It can pose difficulties for standard machine learning document representations because of the short length coupled with the...

全面介绍

Saved in:
书目详细资料
主要作者: Che Alhadi, Nazirah
格式: Thesis
语言:English
出版: 2012
主题:
在线阅读:https://ir.uitm.edu.my/id/eprint/35377/1/35377.pdf
标签: 添加标签
没有标签, 成为第一个标记此记录!
实物特征
总结:Micro-blogs as a new textual domain offer a unique proposition for sentiment analysis. Their short document length suggests any sentiment they contain is compact and explicit. It can pose difficulties for standard machine learning document representations because of the short length coupled with their noisy nature. The aim of this project is to classify Twitter’s messages into sentiment categories based on the important keywords. This project methodology consists of five phases which are preliminary study, data collection and preparation, model development, model evaluation and documentation. This project is designed using negative selection algorithm to automatically classify the Twitter’s messages into its sentiment’s category based on important keyword recognition. In order to develop this model classification and prototype, 480 Twitter’s messages were used as training data and 120 Twitter’s messages for testing data to determine the accuracy of the classification model. The accuracy of this model is about 60 percent. Second experiment was carried out by reducing the data to 240 for training data and 60 data for testing. The accuracy for second experiment is improved to 63.33 percent.