Integration Of Unsupervised Clustering Algorithm And Supervised Classifier For Pattern Recognition
In a real world, pattern recognition problems in diversified forms are ubiquitous and are critical in most human decision making tasks. In pattern recognition system, achieving high accuracy in pattern classification is crucial. There are two general paradigms for pattern recognition classification...
Saved in:
Main Author: | |
---|---|
Format: | Thesis |
Language: | English |
Published: |
2017
|
Subjects: | |
Online Access: | http://eprints.usm.my/39416/1/Leong_Shi_Xiang_24_Pages.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my-usm-ep.39416 |
---|---|
record_format |
uketd_dc |
spelling |
my-usm-ep.394162019-04-12T05:25:07Z Integration Of Unsupervised Clustering Algorithm And Supervised Classifier For Pattern Recognition 2017 Leong, Shi Xiang TK1-9971 Electrical engineering. Electronics. Nuclear engineering In a real world, pattern recognition problems in diversified forms are ubiquitous and are critical in most human decision making tasks. In pattern recognition system, achieving high accuracy in pattern classification is crucial. There are two general paradigms for pattern recognition classification which are supervised and unsupervised learning. The problems in applying unsupervised learning/clustering is that this method requires teacher during the classification process and it has to learn independently which may lead to poor classification. Whereas for supervised learning method, it requires teacher or prior data (i.e. large, prohibitive and labelled training data) during classification process which in real life, the cost of obtaining sufficient labelled training data is high. In addition, the labelling is time consuming and done manually. To solve the problems mentioned, integration of unsupervised clustering algorithm and the supervised classifier is proposed. The objective of this research is to study the performance/capability of the integration between both unsupervised and supervised learning. In order to achieve the objective, this research is separated into two phases. Phase 1 is mainly to evaluate the performance of clustering algorithm (K-Means and FCM). Phase 2 is to study the performance of proposed integration system which using the data clustered to be used as train data for Naïve Bayes classifier. By adopting the proposed integration system, the limitation of the unsupervised clustering method can be overcome and for supervised learning, the labelling time can be reduced and more training examples are labelled which can be used to train for supervised classifier. As the result, the pattern classification accuracy is also xii increase. For examples, after applying the proposed integration system, the classification accuracy of Fisher’s Iris, Wine and Bacteria18Class has been increased from 88.67% to 96.00%, from 78.33% to 83.45% and from 93.33% to 94.67% respectively as compared to only used unsupervised clustering algorithm. The result has shown that the proposed integration system could be applied to increase the performance of the classification. However, further study is needed in the feature extraction and clustering algorithms part as the performance of the pattern classification is still depending on the data input. 2017 Thesis http://eprints.usm.my/39416/ http://eprints.usm.my/39416/1/Leong_Shi_Xiang_24_Pages.pdf application/pdf en public masters Universiti Sains Malaysia Pusat Pengajian Kejuruteraan Elektrik dan Elektronik |
institution |
Universiti Sains Malaysia |
collection |
USM Institutional Repository |
language |
English |
topic |
TK1-9971 Electrical engineering Electronics Nuclear engineering |
spellingShingle |
TK1-9971 Electrical engineering Electronics Nuclear engineering Leong, Shi Xiang Integration Of Unsupervised Clustering Algorithm And Supervised Classifier For Pattern Recognition |
description |
In a real world, pattern recognition problems in diversified forms are ubiquitous and are critical in most human decision making tasks. In pattern recognition system, achieving high accuracy in pattern classification is crucial. There are two general paradigms for pattern recognition classification which are supervised and unsupervised learning. The problems in applying unsupervised learning/clustering is that this method requires teacher during the classification process and it has to learn independently which may lead to poor classification. Whereas for supervised learning method, it requires teacher or prior data (i.e. large, prohibitive and labelled training data) during classification process which in real life, the cost of obtaining sufficient labelled training data is high. In addition, the labelling is time consuming and done manually. To solve the problems mentioned, integration of unsupervised clustering algorithm and the supervised classifier is proposed. The objective of this research is to study the performance/capability of the integration between both unsupervised and supervised learning.
In order to achieve the objective, this research is separated into two phases. Phase 1 is mainly to evaluate the performance of clustering algorithm (K-Means and FCM). Phase 2 is to study the performance of proposed integration system which using the data clustered to be used as train data for Naïve Bayes classifier. By adopting the proposed integration system, the limitation of the unsupervised clustering method can be overcome and for supervised learning, the labelling time can be reduced and more training examples are labelled which can be used to train for supervised classifier. As the result, the pattern classification accuracy is also
xii
increase. For examples, after applying the proposed integration system, the classification accuracy of Fisher’s Iris, Wine and Bacteria18Class has been increased from 88.67% to 96.00%, from 78.33% to 83.45% and from 93.33% to 94.67% respectively as compared to only used unsupervised clustering algorithm. The result has shown that the proposed integration system could be applied to increase the performance of the classification. However, further study is needed in the feature extraction and clustering algorithms part as the performance of the pattern classification is still depending on the data input. |
format |
Thesis |
qualification_level |
Master's degree |
author |
Leong, Shi Xiang |
author_facet |
Leong, Shi Xiang |
author_sort |
Leong, Shi Xiang |
title |
Integration Of Unsupervised Clustering Algorithm And Supervised Classifier For Pattern Recognition |
title_short |
Integration Of Unsupervised Clustering Algorithm And Supervised Classifier For Pattern Recognition |
title_full |
Integration Of Unsupervised Clustering Algorithm And Supervised Classifier For Pattern Recognition |
title_fullStr |
Integration Of Unsupervised Clustering Algorithm And Supervised Classifier For Pattern Recognition |
title_full_unstemmed |
Integration Of Unsupervised Clustering Algorithm And Supervised Classifier For Pattern Recognition |
title_sort |
integration of unsupervised clustering algorithm and supervised classifier for pattern recognition |
granting_institution |
Universiti Sains Malaysia |
granting_department |
Pusat Pengajian Kejuruteraan Elektrik dan Elektronik |
publishDate |
2017 |
url |
http://eprints.usm.my/39416/1/Leong_Shi_Xiang_24_Pages.pdf |
_version_ |
1747820751057059840 |