Sentiment analysis using negative selection algorithm for Twitter’s messages / Nazirah Che Alhadi

Micro-blogs as a new textual domain offer a unique proposition for sentiment analysis. Their short document length suggests any sentiment they contain is compact and explicit. It can pose difficulties for standard machine learning document representations because of the short length coupled with the...

全面介紹

Saved in:
書目詳細資料
主要作者: Che Alhadi, Nazirah
格式: Thesis
語言:English
出版: 2012
主題:
在線閱讀:https://ir.uitm.edu.my/id/eprint/35377/1/35377.pdf
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
實物特徵
總結:Micro-blogs as a new textual domain offer a unique proposition for sentiment analysis. Their short document length suggests any sentiment they contain is compact and explicit. It can pose difficulties for standard machine learning document representations because of the short length coupled with their noisy nature. The aim of this project is to classify Twitter’s messages into sentiment categories based on the important keywords. This project methodology consists of five phases which are preliminary study, data collection and preparation, model development, model evaluation and documentation. This project is designed using negative selection algorithm to automatically classify the Twitter’s messages into its sentiment’s category based on important keyword recognition. In order to develop this model classification and prototype, 480 Twitter’s messages were used as training data and 120 Twitter’s messages for testing data to determine the accuracy of the classification model. The accuracy of this model is about 60 percent. Second experiment was carried out by reducing the data to 240 for training data and 60 data for testing. The accuracy for second experiment is improved to 63.33 percent.