Spam filtering using bayesian technique based on independent feature selection

Bayesian technique is one of the classification techniques which can be applied to a certain problem domain such as classification task. Therefore, this technique had been chosen to conduct a classification task with emails dataset where the emails are comprised of spam and non spam emails. Bayesian...

Full description

Saved in:
Bibliographic Details
Main Author: Mohamad, Masurah
Format: Thesis
Language:English
Published: 2006
Subjects:
Online Access:http://eprints.utm.my/id/eprint/4066/1/MasurahMohamadMFSKSM2006.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
id my-utm-ep.4066
record_format uketd_dc
spelling my-utm-ep.40662018-01-15T04:21:33Z Spam filtering using bayesian technique based on independent feature selection 2006-04 Mohamad, Masurah QA75 Electronic computers. Computer science Bayesian technique is one of the classification techniques which can be applied to a certain problem domain such as classification task. Therefore, this technique had been chosen to conduct a classification task with emails dataset where the emails are comprised of spam and non spam emails. Bayesian technique has been applied to observe whether it can produce a good result in spam emails classification or not. Beside, this project also applied Rough set as a comparison technique to classify the spam emails. The classification task is done based on the independent feature selection where only one most occurrence term for each email is chosen as an input to the Bayesian probability. Some of the measurement evaluation had been used to evaluate the classification performance. The measurements are precision, recall, sensitivity, specificity, accuracy and error rate. After the measurements process, these two technique were compared to identify which one of these two techniques is best in classifies spam emails based on the experimental results. The results show that Bayesian technique is good than Rough set technique in classifies spam emails. However the results also indicate that Rough set also suitable for spam filtering problem. Finally, some suggestions were being discussed so that this project can be improved in future work to get a better result compared to the current result which had been retrieved in this project. 2006-04 Thesis http://eprints.utm.my/id/eprint/4066/ http://eprints.utm.my/id/eprint/4066/1/MasurahMohamadMFSKSM2006.pdf application/pdf en public masters Universiti Teknologi Malaysia, Faculty of Computer Science and Information System Faculty of Computer Science and Information System
institution Universiti Teknologi Malaysia
collection UTM Institutional Repository
language English
topic QA75 Electronic computers
Computer science
spellingShingle QA75 Electronic computers
Computer science
Mohamad, Masurah
Spam filtering using bayesian technique based on independent feature selection
description Bayesian technique is one of the classification techniques which can be applied to a certain problem domain such as classification task. Therefore, this technique had been chosen to conduct a classification task with emails dataset where the emails are comprised of spam and non spam emails. Bayesian technique has been applied to observe whether it can produce a good result in spam emails classification or not. Beside, this project also applied Rough set as a comparison technique to classify the spam emails. The classification task is done based on the independent feature selection where only one most occurrence term for each email is chosen as an input to the Bayesian probability. Some of the measurement evaluation had been used to evaluate the classification performance. The measurements are precision, recall, sensitivity, specificity, accuracy and error rate. After the measurements process, these two technique were compared to identify which one of these two techniques is best in classifies spam emails based on the experimental results. The results show that Bayesian technique is good than Rough set technique in classifies spam emails. However the results also indicate that Rough set also suitable for spam filtering problem. Finally, some suggestions were being discussed so that this project can be improved in future work to get a better result compared to the current result which had been retrieved in this project.
format Thesis
qualification_level Master's degree
author Mohamad, Masurah
author_facet Mohamad, Masurah
author_sort Mohamad, Masurah
title Spam filtering using bayesian technique based on independent feature selection
title_short Spam filtering using bayesian technique based on independent feature selection
title_full Spam filtering using bayesian technique based on independent feature selection
title_fullStr Spam filtering using bayesian technique based on independent feature selection
title_full_unstemmed Spam filtering using bayesian technique based on independent feature selection
title_sort spam filtering using bayesian technique based on independent feature selection
granting_institution Universiti Teknologi Malaysia, Faculty of Computer Science and Information System
granting_department Faculty of Computer Science and Information System
publishDate 2006
url http://eprints.utm.my/id/eprint/4066/1/MasurahMohamadMFSKSM2006.pdf
_version_ 1747814492963602432