Detrecting malicious PDF document using support vector machine supervised learning algorithm

Malicious PDF files remain a real threat, in cyber world. In practice, it can affect badly masses of computer users, even after several high-profile security incidents. In spite of a series of a security patches issued by Adobe and other vendors, many users still have vulnerable client software inst...

Full description

Saved in:
Bibliographic Details
Main Author: Dabiranzohouri, Miranda
Format: Thesis
Published: 2014
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
id my-utm-ep.41612
record_format uketd_dc
spelling my-utm-ep.416122017-08-17T01:51:40Z Detrecting malicious PDF document using support vector machine supervised learning algorithm 2014 Dabiranzohouri, Miranda Q Science Malicious PDF files remain a real threat, in cyber world. In practice, it can affect badly masses of computer users, even after several high-profile security incidents. In spite of a series of a security patches issued by Adobe and other vendors, many users still have vulnerable client software installed on their computers. The expressiveness of the PDF format, furthermore, enables attackers to evade detection with little effort. Apart from traditional antivirus products, which are always a step behind attackers, few methods are known that can be deployed for protection of end-user systems. This thesis proposes a machine learning based method for detecting of malicious PDF documents which, instead of analyzing JavaScript or any other content, makes use of essential differences in the structural properties of malicious and benign PDF files. Support Vector Machine is used in order to testify and recognize the benign and malicious PDF file. The collected dataset consists of 2190 instance which 404 of them are malicious and 1786 instance are benign. The experimental results shows that SVM gives better result in limited number of feature compared to MLP and BayesNet method. 2014 Thesis http://eprints.utm.my/id/eprint/41612/ masters Universiti Teknologi Malaysia, Faculty of Computing Faculty of Computing
institution Universiti Teknologi Malaysia
collection UTM Institutional Repository
topic Q Science
spellingShingle Q Science
Dabiranzohouri, Miranda
Detrecting malicious PDF document using support vector machine supervised learning algorithm
description Malicious PDF files remain a real threat, in cyber world. In practice, it can affect badly masses of computer users, even after several high-profile security incidents. In spite of a series of a security patches issued by Adobe and other vendors, many users still have vulnerable client software installed on their computers. The expressiveness of the PDF format, furthermore, enables attackers to evade detection with little effort. Apart from traditional antivirus products, which are always a step behind attackers, few methods are known that can be deployed for protection of end-user systems. This thesis proposes a machine learning based method for detecting of malicious PDF documents which, instead of analyzing JavaScript or any other content, makes use of essential differences in the structural properties of malicious and benign PDF files. Support Vector Machine is used in order to testify and recognize the benign and malicious PDF file. The collected dataset consists of 2190 instance which 404 of them are malicious and 1786 instance are benign. The experimental results shows that SVM gives better result in limited number of feature compared to MLP and BayesNet method.
format Thesis
qualification_level Master's degree
author Dabiranzohouri, Miranda
author_facet Dabiranzohouri, Miranda
author_sort Dabiranzohouri, Miranda
title Detrecting malicious PDF document using support vector machine supervised learning algorithm
title_short Detrecting malicious PDF document using support vector machine supervised learning algorithm
title_full Detrecting malicious PDF document using support vector machine supervised learning algorithm
title_fullStr Detrecting malicious PDF document using support vector machine supervised learning algorithm
title_full_unstemmed Detrecting malicious PDF document using support vector machine supervised learning algorithm
title_sort detrecting malicious pdf document using support vector machine supervised learning algorithm
granting_institution Universiti Teknologi Malaysia, Faculty of Computing
granting_department Faculty of Computing
publishDate 2014
_version_ 1747816582430588928