Stock market classification model using sentiment analysis based on hybrid naive bayes classifiers
Sentiment analysis has become one of the most common method to classify stock market behaviour. Moreover, sentiment analysis has gained a lot of importance in the last decade especially due to the availability of data from social media such as Twitter. However, the accuracy of stock market classific...
Saved in:
Main Author: | |
---|---|
Format: | Thesis |
Language: | eng eng |
Published: |
2019
|
Subjects: | |
Online Access: | https://etd.uum.edu.my/8123/1/s900600_01.pdf https://etd.uum.edu.my/8123/2/s900600_02.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my-uum-etd.8123 |
---|---|
record_format |
uketd_dc |
spelling |
my-uum-etd.81232022-04-04T03:45:32Z Stock market classification model using sentiment analysis based on hybrid naive bayes classifiers 2019 A. Jabbar Alkubaisi, Ghaith Abdulsattar Kamaruddin, Siti Sakira Husni, Husniza Awang Had Salleh Graduate School of Arts & Sciences HG Finance Sentiment analysis has become one of the most common method to classify stock market behaviour. Moreover, sentiment analysis has gained a lot of importance in the last decade especially due to the availability of data from social media such as Twitter. However, the accuracy of stock market classification models is still low, and this has negatively affected the stock market indicators. Furthermore, there are many factors that have a direct effect on the classification models’ accuracies which were not addressed by previous research. One of the factors is the exclusion of spatial-temporal features. Another important factor is the automatic labelling technique which leads to low classification accuracy due to the absence of specific lexicon. The appropriateness of the classifiers to the data features and domain is also another factor, which affect the classification accuracy. In this research, a model for stock market classification based on sentiment analysis is constructed. It is designed to enhance the classification accuracy by the incorporation of tweet timestamp and location features, stock market domain expert labelling technique and the construction of a hybrid Naïve Bayes classifiers to classify the stock market sentiments. The methodology for this research consists of six phases. The first phase is data collection, and the second phase represents the most important phase, which is labelling, in which polarity of data is specified as negative, positive or neutral values. The third phase involves data pre-processing, which is conducted to get only relevant features. The fourth phase is classification in which suitable patterns of the stock market are identified by hybridizing different Naïve Bayes classifiers. The fifth phase is performance and evaluation, and the final phase is recognition for the stock market behaviour. The model produced a significant result in classifying stock market behaviour with accuracy more than 89%. The model is beneficial for investors and researchers. For investors, it enables them to formulate their plans based on accurate indicators whereby it reduces the risk in decision making. For researchers, it draws their attention to the importance of feature engineering, labelling technique, and the classifiers hybridization in enhancing the classification accuracy. 2019 Thesis https://etd.uum.edu.my/8123/ https://etd.uum.edu.my/8123/1/s900600_01.pdf text eng public https://etd.uum.edu.my/8123/2/s900600_02.pdf text eng public phd doctoral |
institution |
Universiti Utara Malaysia |
collection |
UUM ETD |
language |
eng eng |
advisor |
Kamaruddin, Siti Sakira Husni, Husniza |
topic |
HG Finance |
spellingShingle |
HG Finance A. Jabbar Alkubaisi, Ghaith Abdulsattar Stock market classification model using sentiment analysis based on hybrid naive bayes classifiers |
description |
Sentiment analysis has become one of the most common method to classify stock market behaviour. Moreover, sentiment analysis has gained a lot of importance in the last decade especially due to the availability of data from social media such as Twitter. However, the accuracy of stock market classification models is still low, and this has negatively affected the stock market indicators. Furthermore, there are many factors that have a direct effect on the classification models’ accuracies which were not addressed by previous research. One of the factors is the exclusion of spatial-temporal features. Another important factor is the automatic labelling technique which leads to low classification accuracy due to the absence of specific lexicon. The appropriateness of the classifiers to the data features and domain is also another factor, which affect the classification accuracy. In this research, a model for stock market classification based on sentiment analysis is constructed. It is designed to enhance the classification accuracy by the incorporation of tweet timestamp and location features, stock market domain expert labelling technique and the construction of a hybrid Naïve Bayes classifiers to classify the stock market sentiments. The methodology for this research consists of six phases. The first phase is data collection, and the second phase represents the most important phase, which is labelling, in which polarity of data is specified as negative, positive or neutral values. The third phase involves data pre-processing, which is conducted to get only relevant features. The fourth phase is classification in which suitable patterns of the stock market are identified by hybridizing different Naïve Bayes classifiers. The fifth phase is performance and evaluation, and the final phase is recognition for the stock market behaviour. The model produced a significant result in classifying stock market behaviour with accuracy more than 89%. The model is beneficial for investors and researchers. For investors, it enables them to formulate their plans based on accurate indicators whereby it reduces the risk in decision making. For researchers, it draws their attention to the importance of feature engineering, labelling technique, and the classifiers hybridization in enhancing the classification accuracy. |
format |
Thesis |
qualification_name |
Doctor of Philosophy (PhD.) |
qualification_level |
Doctorate |
author |
A. Jabbar Alkubaisi, Ghaith Abdulsattar |
author_facet |
A. Jabbar Alkubaisi, Ghaith Abdulsattar |
author_sort |
A. Jabbar Alkubaisi, Ghaith Abdulsattar |
title |
Stock market classification model using sentiment analysis based on hybrid naive bayes classifiers |
title_short |
Stock market classification model using sentiment analysis based on hybrid naive bayes classifiers |
title_full |
Stock market classification model using sentiment analysis based on hybrid naive bayes classifiers |
title_fullStr |
Stock market classification model using sentiment analysis based on hybrid naive bayes classifiers |
title_full_unstemmed |
Stock market classification model using sentiment analysis based on hybrid naive bayes classifiers |
title_sort |
stock market classification model using sentiment analysis based on hybrid naive bayes classifiers |
granting_department |
Awang Had Salleh Graduate School of Arts & Sciences |
publishDate |
2019 |
url |
https://etd.uum.edu.my/8123/1/s900600_01.pdf https://etd.uum.edu.my/8123/2/s900600_02.pdf |
_version_ |
1747828330989617152 |