A hybrid genetic algorithm and support vector machine classifier for feature selection and classification of gene expression

Advancement in gene expression technology offers the ability to measure the expression levels of thousand of genes in parallel. Gene expression microarray data is expected to significantly aid in the development of efficient cancer diagnosis and classification platforms. Key issues that need to be a...

Full description

Saved in:
Bibliographic Details
Main Author: Tan Ah Chik @ Mohamad, Mohd. Saberi
Format: Thesis
Language:English
Published: 2005
Subjects:
Online Access:http://eprints.utm.my/id/eprint/34718/1/MohdSaberiBinTanAhChik%40MohamadMFC2005.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
id my-utm-ep.34718
record_format uketd_dc
spelling my-utm-ep.347182017-10-11T06:44:24Z A hybrid genetic algorithm and support vector machine classifier for feature selection and classification of gene expression 2005 Tan Ah Chik @ Mohamad, Mohd. Saberi Unspecified Advancement in gene expression technology offers the ability to measure the expression levels of thousand of genes in parallel. Gene expression microarray data is expected to significantly aid in the development of efficient cancer diagnosis and classification platforms. Key issues that need to be addressed under such circumstances are the efficient selection of a small subset of genes that might profoundly contribute to disease identification from the thousand of genes measured on microarrays that are inherently noisy. This research deals with finding a small subset of informative genes from gene expression data which maximizes the classification accuracy. This research proposed a hybrid between Genetic Algorithm and Support Vector Machine classifier for selecting an optimal small subset of informative genes and classifying the optimal subset. Two benchmark data sets were used to evaluate the usefulness of the approach for small and high dimension data. Although, the experimental results showed that the hybrid method performed better than some of the best previous methods on small dimensional data, its performance deteriorated significantly on the higher dimensional data. An improved version of the hybrid method was designed by introducing a new algorithm for features selection based on improved chromosome representation to replace the original algorithm on the hybrid method which appeared to perform poorly on high dimensional data. The results of the gene expression microarray classification demonstrated that the proposed method performed better than the original and the previous methods. The informative genes from the experiment results proved to be biologically plausible when compared with the biological results produced from biologist and computer scientist researches. 2005 Thesis http://eprints.utm.my/id/eprint/34718/ http://eprints.utm.my/id/eprint/34718/1/MohdSaberiBinTanAhChik%40MohamadMFC2005.pdf application/pdf en public http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:61261?queryType=vitalDismax&query=+A+hybrid+genetic+algorithm+and+support+vector+machine+classifier+for+feature+selection+and+classification+of+gene+expression&public=true masters Universiti Teknologi Malaysia, Faculty of Computer Science and Information System Faculty of Computer Science and Information System
institution Universiti Teknologi Malaysia
collection UTM Institutional Repository
language English
topic Unspecified
spellingShingle Unspecified
Tan Ah Chik @ Mohamad, Mohd. Saberi
A hybrid genetic algorithm and support vector machine classifier for feature selection and classification of gene expression
description Advancement in gene expression technology offers the ability to measure the expression levels of thousand of genes in parallel. Gene expression microarray data is expected to significantly aid in the development of efficient cancer diagnosis and classification platforms. Key issues that need to be addressed under such circumstances are the efficient selection of a small subset of genes that might profoundly contribute to disease identification from the thousand of genes measured on microarrays that are inherently noisy. This research deals with finding a small subset of informative genes from gene expression data which maximizes the classification accuracy. This research proposed a hybrid between Genetic Algorithm and Support Vector Machine classifier for selecting an optimal small subset of informative genes and classifying the optimal subset. Two benchmark data sets were used to evaluate the usefulness of the approach for small and high dimension data. Although, the experimental results showed that the hybrid method performed better than some of the best previous methods on small dimensional data, its performance deteriorated significantly on the higher dimensional data. An improved version of the hybrid method was designed by introducing a new algorithm for features selection based on improved chromosome representation to replace the original algorithm on the hybrid method which appeared to perform poorly on high dimensional data. The results of the gene expression microarray classification demonstrated that the proposed method performed better than the original and the previous methods. The informative genes from the experiment results proved to be biologically plausible when compared with the biological results produced from biologist and computer scientist researches.
format Thesis
qualification_level Master's degree
author Tan Ah Chik @ Mohamad, Mohd. Saberi
author_facet Tan Ah Chik @ Mohamad, Mohd. Saberi
author_sort Tan Ah Chik @ Mohamad, Mohd. Saberi
title A hybrid genetic algorithm and support vector machine classifier for feature selection and classification of gene expression
title_short A hybrid genetic algorithm and support vector machine classifier for feature selection and classification of gene expression
title_full A hybrid genetic algorithm and support vector machine classifier for feature selection and classification of gene expression
title_fullStr A hybrid genetic algorithm and support vector machine classifier for feature selection and classification of gene expression
title_full_unstemmed A hybrid genetic algorithm and support vector machine classifier for feature selection and classification of gene expression
title_sort hybrid genetic algorithm and support vector machine classifier for feature selection and classification of gene expression
granting_institution Universiti Teknologi Malaysia, Faculty of Computer Science and Information System
granting_department Faculty of Computer Science and Information System
publishDate 2005
url http://eprints.utm.my/id/eprint/34718/1/MohdSaberiBinTanAhChik%40MohamadMFC2005.pdf
_version_ 1747816239553576960