Classification of imbalanced datasets using naive bayes

Imbalanced data set had tendency to effect classifier performance in machine learning due to the greater influence given by majority data that overlooked the minority ones. But in classifying data, more important class is given by the minority data. In order to solve this problem, original Naïve Bay...

Full description

Saved in:
Bibliographic Details
Main Author: Mohd. Sobran, Nur Maisarah
Format: Thesis
Language:English
Published: 2011
Subjects:
Online Access:http://eprints.utm.my/id/eprint/31941/5/NurMaisarahMohdSobranMFKE2011.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Imbalanced data set had tendency to effect classifier performance in machine learning due to the greater influence given by majority data that overlooked the minority ones. But in classifying data, more important class is given by the minority data. In order to solve this problem, original Naïve Bayes was purposed as classifier for imbalanced data set. Our main interest is to investigate the performance of original Naïve Bayes classifier in imbalanced datasets. From the four UCI imbalanced datasets that been used, the purposed techniques show that, Naïve Bayes doing well in Herbaman’s datasets and satisfying results in other datasets.