A hybrid heuristic-statistical peer-to-peer traffic classifier

Peer-to-Peer (P2P) traffic classification is still an open research problem due to the challenges to provide an optimum classifier. In this work, a novel hybrid heuristic and statistical approach to classify P2P traffic is proposed. Heuristics approach provides high accuracy. However, it involves ma...

Full description

Saved in:
Bibliographic Details
Main Author: Hassan Hamid, Mussab Mustafa
Format: Thesis
Published: 2010
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
id my-utm-ep.28468
record_format uketd_dc
spelling my-utm-ep.284682017-08-14T01:44:10Z A hybrid heuristic-statistical peer-to-peer traffic classifier 2010 Hassan Hamid, Mussab Mustafa Unspecified Peer-to-Peer (P2P) traffic classification is still an open research problem due to the challenges to provide an optimum classifier. In this work, a novel hybrid heuristic and statistical approach to classify P2P traffic is proposed. Heuristics approach provides high accuracy. However, it involves many correlation between packets and flows within certain time which make it inapplicable for online classification. On the other hand, statistical classification can classify traffic in an online manner but it needs periodical manual retraining. In the proposed solution, heuristic and statistical classification are combined to overcome their weaknesses. The system involves two modules: offline learning and online statistical classification. In the first module, heuristics are used to classify traces flows into three classes, two which are used for training the online statistical classifier. In the online module, machine learning (ML) algorithms are used to classify traffic on the fly. This work presents an enhancement for existing heuristic classification technique by adding a new class. Using 22 traffic traces downloaded from different shared resources and captured from Universiti Teknologi Malaysia (UTM) campus network between March and June 2010, the proposed system is evaluated. In offline phase (heuristics), the result shows that adding the third class improves the accuracy from 93% to 98%. This module could provide quality examples to be used to train the online statistical classifier. For the online statistical classifier, 64 ML algorithms are investigated. Deep analyses on ML algorithms shows that Decision Tree algorithms provide the best result on both accuracy and processing time. Using examples generated by the heuristic classifiers, the overall statistical classification accuracy is 99% based on analysis on downloaded and captured UTM traces. 2010 Thesis http://eprints.utm.my/id/eprint/28468/ masters Universiti Teknologi Malaysia, Faculty of Electrical Engineering Faculty of Electrical Engineering
institution Universiti Teknologi Malaysia
collection UTM Institutional Repository
topic Unspecified
spellingShingle Unspecified
Hassan Hamid, Mussab Mustafa
A hybrid heuristic-statistical peer-to-peer traffic classifier
description Peer-to-Peer (P2P) traffic classification is still an open research problem due to the challenges to provide an optimum classifier. In this work, a novel hybrid heuristic and statistical approach to classify P2P traffic is proposed. Heuristics approach provides high accuracy. However, it involves many correlation between packets and flows within certain time which make it inapplicable for online classification. On the other hand, statistical classification can classify traffic in an online manner but it needs periodical manual retraining. In the proposed solution, heuristic and statistical classification are combined to overcome their weaknesses. The system involves two modules: offline learning and online statistical classification. In the first module, heuristics are used to classify traces flows into three classes, two which are used for training the online statistical classifier. In the online module, machine learning (ML) algorithms are used to classify traffic on the fly. This work presents an enhancement for existing heuristic classification technique by adding a new class. Using 22 traffic traces downloaded from different shared resources and captured from Universiti Teknologi Malaysia (UTM) campus network between March and June 2010, the proposed system is evaluated. In offline phase (heuristics), the result shows that adding the third class improves the accuracy from 93% to 98%. This module could provide quality examples to be used to train the online statistical classifier. For the online statistical classifier, 64 ML algorithms are investigated. Deep analyses on ML algorithms shows that Decision Tree algorithms provide the best result on both accuracy and processing time. Using examples generated by the heuristic classifiers, the overall statistical classification accuracy is 99% based on analysis on downloaded and captured UTM traces.
format Thesis
qualification_level Master's degree
author Hassan Hamid, Mussab Mustafa
author_facet Hassan Hamid, Mussab Mustafa
author_sort Hassan Hamid, Mussab Mustafa
title A hybrid heuristic-statistical peer-to-peer traffic classifier
title_short A hybrid heuristic-statistical peer-to-peer traffic classifier
title_full A hybrid heuristic-statistical peer-to-peer traffic classifier
title_fullStr A hybrid heuristic-statistical peer-to-peer traffic classifier
title_full_unstemmed A hybrid heuristic-statistical peer-to-peer traffic classifier
title_sort hybrid heuristic-statistical peer-to-peer traffic classifier
granting_institution Universiti Teknologi Malaysia, Faculty of Electrical Engineering
granting_department Faculty of Electrical Engineering
publishDate 2010
_version_ 1747815660725993472