Speaker identification based on hybrid feature extraction techniques /

Speech contains many features that can be used to determine gender and speaker identity; it is a natural form of communication between humans. One of the most exciting areas of signal processing is speech processing. Speech contains many features or characteristics that can discriminate the identity...

Full description

Saved in:
Bibliographic Details
Main Author: Abualadas, Feras Eid Dheif Allah (Author)
Format: Thesis
Language:English
Published: Kuala Lumpur : Kulliyyah of Information and Communication Technology, International Islamic University Malaysia, 2020
Subjects:
Online Access:http://studentrepo.iium.edu.my/handle/123456789/10087
Tags: Add Tag
No Tags, Be the first to tag this record!
LEADER 057090000a22004570004500
008 201030s2020 my a f m 000 0 eng d
040 |a UIAM  |b eng  |e rda 
041 |a eng 
043 |a a-my--- 
050 0 0 |a TK7882.S65 
100 1 |a Abualadas, Feras Eid Dheif Allah,  |e author 
245 1 0 |a Speaker identification based on hybrid feature extraction techniques /  |c by Feras Eid Dheif Allah Abualadas 
264 1 |a Kuala Lumpur :  |b Kulliyyah of Information and Communication Technology, International Islamic University Malaysia,   |c 2020 
300 |a xviii, 125 leaves :  |b colour illustrations ;  |c 30cm. 
336 |2 rdacontent  |a text 
337 |2 rdamedia  |a unmediated 
337 |2 rdamedia  |a computer 
338 |2 rdacarrier  |a volume 
338 |2 rdacarrier  |a computer disc 
338 |2 rdacarrier  |a online resource 
347 |2 rdaft  |a text file  |b PDF 
500 |a Abstracts in English and Arabic. 
500 |a "A thesis submitted in fulfilment of the requirement for the degree of Doctor of Philosophy in Computer Science." --On title page. 
502 |a Thesis (Ph.D)--International Islamic University Malaysia, 2020. 
504 |a Includes bibliographical references (leaves 118-125). 
520 |a Speech contains many features that can be used to determine gender and speaker identity; it is a natural form of communication between humans. One of the most exciting areas of signal processing is speech processing. Speech contains many features or characteristics that can discriminate the identity of a person. The human voice is considered one of the important biometric characteristic that can be used for person identification. The proposed speaker identification system (SIS) consists of four phases, namely, pre-processing phase (involves sample resizing to 40000 samples and normalization to ensure that the sound volume will modifying as a standard level), feature extraction phase (involves extracting a set of fundamental voice features that can represent or identify the entire signal of speech), feature selection phase (involves selecting the best features that describe the speaker, where dealing with hundreds number of features leads to increase the workload of recognition) and recognition phase (involves Backpropagation (BP) neural network in this research). In this work the effects of appropriate extracted voice features from various levels of discrete wavelet transformation (DWT) and the concatenation of DWT and curvelet transformation (DWT+Curvelet hereinafter) are studied. The effects of reducing the number of features via Principal component analysis (PCA) on speaker identification is also investigated, and the (BP) neural network was introduced as a classifier. The classifier is trained with a different set of features extracted from three different levels of DWT; these features are extracted one level at a time. The recognition capabilities of the classifier for all levels are compared to determine the best level. This research explores any positive or negative effects of DWT+Curvelet on the classification capability of the proposed system. in addition, this work investigates the effects of reducing the number of features via PCA with DWT and DWT+Curvelet In this research, different three datasets were used for speaker identification system, where these dataset used for train and testing the Feed-Forward Backpropagation (BP). In this approach it is clear that introducing PCA with BP networks improved the accuracy and is an effective method for speaker identification system, where it keeps the effective information and reduces the redundancy of characteristic parameters Four experiments are performed as follows using the three datasets: Experiment 1: only DWT features that extracted from each level of discrete wavelet transformation independently are used to train and test the Neural Network; Experiment 2: the features extracted from each level of (DWT+Curvelet) used to train and test the Neural Network; Experiment 3: With DWT features after utilized principal component analysis used to train and test the neural network; Experiment 4: With (DWT+Curvelet) features after utilized principal component analysis used to train and test the Neural Network. Practical results showed that, the accuracy is improved in level 1 and 2 with database 1 and increased by approximately 5% and 4%, respectively; whereas the accuracy was improved in all levels 1, 2 and 3 with Database 2 and 3 and increased by approximately 11%, 4% and 2% for database 2 and 9%, 11%, 5% for database 3 respectively, when applying (DWT+Curvelet). The system was trained and tested using (Cross-validation). 
596 |a 1 
650 0 |a Speech processing systems 
650 0 |a System identification 
650 0 |a Wavelets (Mathematics) 
655 7 |a Theses, IIUM local 
690 |a Dissertations, Academic  |x Kulliyyah of Information and Communication Technology  |z IIUM 
700 1 |a Akram M. Zeki,  |e degree supervisor 
700 1 |a Muzhir Shaban Al-Ani,  |e degree supervisor 
700 1 |a Az-Eddine Messikh ,  |e degree supervisor 
710 2 |a International Islamic University Malaysia.  |b Kulliyyah of Information and Communication Technology 
856 4 |u http://studentrepo.iium.edu.my/handle/123456789/10087 
900 |a sz-asbh 
999 |c 440070  |d 473269 
952 |0 0  |1 0  |2 lcc  |4 0  |6 T T K7882 S65 A00165S 02020  |7 3  |8 IIUMTHESIS  |9 762446  |a IIUM  |b IIUM  |c MULTIMEDIA  |d 2022-08-04  |g 0.00  |o t TK 7882 S65 A165S 2020  |p 11100418277  |r 1900-01-02  |t 1  |v 0.00  |y THESIS 
952 |0 0  |1 0  |2 lcc  |4 0  |6 TS C D F TK 07882 S00065 A00165S 02020  |7 3  |8 THESISSOFTCOPY  |9 859332  |a IIUM  |b IIUM  |c MULTIMEDIA  |d 2022-08-04  |g 0.00  |o ts cdf TK 7882 S65 A165S 2020  |p 11100418278  |r 1900-01-02  |t 1  |v 0.00  |y THESISDIG