Spam filtering using bayesian technique based on independent feature selection

Bayesian technique is one of the classification techniques which can be applied to a certain problem domain such as classification task. Therefore, this technique had been chosen to conduct a classification task with emails dataset where the emails are comprised of spam and non spam emails. Bayesian...

全面介紹

Saved in:
書目詳細資料
主要作者: Mohamad, Masurah
格式: Thesis
語言:English
出版: 2006
主題:
在線閱讀:http://eprints.utm.my/id/eprint/4066/1/MasurahMohamadMFSKSM2006.pdf
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
實物特徵
總結:Bayesian technique is one of the classification techniques which can be applied to a certain problem domain such as classification task. Therefore, this technique had been chosen to conduct a classification task with emails dataset where the emails are comprised of spam and non spam emails. Bayesian technique has been applied to observe whether it can produce a good result in spam emails classification or not. Beside, this project also applied Rough set as a comparison technique to classify the spam emails. The classification task is done based on the independent feature selection where only one most occurrence term for each email is chosen as an input to the Bayesian probability. Some of the measurement evaluation had been used to evaluate the classification performance. The measurements are precision, recall, sensitivity, specificity, accuracy and error rate. After the measurements process, these two technique were compared to identify which one of these two techniques is best in classifies spam emails based on the experimental results. The results show that Bayesian technique is good than Rough set technique in classifies spam emails. However the results also indicate that Rough set also suitable for spam filtering problem. Finally, some suggestions were being discussed so that this project can be improved in future work to get a better result compared to the current result which had been retrieved in this project.