Spam image filtering algorithm / Muhammad Hazim Shahimi

Email has become the most important medium of transferring message on the internet. The email users are increasing over the years because it is easy to use and low cost. However, this situation has attracted spammer to advertise their product by sending spam messages to anyone who uses email. As a c...

Full description

Saved in:
Bibliographic Details
Main Author: Shahimi, Muhammad Hazim
Format: Thesis
Language:English
Published: 2015
Subjects:
Online Access:https://ir.uitm.edu.my/id/eprint/57789/1/57789.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Email has become the most important medium of transferring message on the internet. The email users are increasing over the years because it is easy to use and low cost. However, this situation has attracted spammer to advertise their product by sending spam messages to anyone who uses email. As a consequence, the number of spam email has increase unexpectedly. The spam has become a serious problem for the email users. They are flood with a lot of spam in their email. This project is done to help the internet user from being flood by the spam especially spam images. The algorithm in this project will help the existing current spam filtering to enhance the method in filtering spam images. This project applied Optical Character Recognition (OCR) and Bayesian probability to filter the spam images. The evaluation task is done by using formula of Precision, Recall, Error Rate, and Accuracy. Based on the result, the testing dataset has achieved 82% of accuracy. It has shown that the proposed algorithm is good in classifying the images. There are some improvements that can be suggested for future works such as use more data, apply the better OCR technique, use another image feature extraction, and use another classifier.