Speaker identification based on hybrid feature extraction techniques /

Speech contains many features that can be used to determine gender and speaker identity; it is a natural form of communication between humans. One of the most exciting areas of signal processing is speech processing. Speech contains many features or characteristics that can discriminate the identity...

全面介紹

Saved in:

書目詳細資料
主要作者:	Abualadas, Feras Eid Dheif Allah (Author)
格式:	Thesis
語言:	English
出版:	Kuala Lumpur : Kulliyyah of Information and Communication Technology, International Islamic University Malaysia, 2020
主題:	Speech processing systems System identification Wavelets (Mathematics) Theses, IIUM local
在線閱讀:	http://studentrepo.iium.edu.my/handle/123456789/10087
標簽:	添加標簽沒有標簽, 成為第一個標記此記錄!


LEADER	057090000a22004570004500
008	201030s2020 my a f m 000 0 eng d
040			\|a UIAM \|b eng \|e rda
041			\|a eng
043			\|a a-my---
050	0	0	\|a TK7882.S65
100	1		\|a Abualadas, Feras Eid Dheif Allah, \|e author
245	1	0	\|a Speaker identification based on hybrid feature extraction techniques / \|c by Feras Eid Dheif Allah Abualadas
264		1	\|a Kuala Lumpur : \|b Kulliyyah of Information and Communication Technology, International Islamic University Malaysia, \|c 2020
300			\|a xviii, 125 leaves : \|b colour illustrations ; \|c 30cm.
336			\|2 rdacontent \|a text
337			\|2 rdamedia \|a unmediated
337			\|2 rdamedia \|a computer
338			\|2 rdacarrier \|a volume
338			\|2 rdacarrier \|a computer disc
338			\|2 rdacarrier \|a online resource
347			\|2 rdaft \|a text file \|b PDF
500			\|a Abstracts in English and Arabic.
500			\|a "A thesis submitted in fulfilment of the requirement for the degree of Doctor of Philosophy in Computer Science." --On title page.
502			\|a Thesis (Ph.D)--International Islamic University Malaysia, 2020.
504			\|a Includes bibliographical references (leaves 118-125).
520			\|a Speech contains many features that can be used to determine gender and speaker identity; it is a natural form of communication between humans. One of the most exciting areas of signal processing is speech processing. Speech contains many features or characteristics that can discriminate the identity of a person. The human voice is considered one of the important biometric characteristic that can be used for person identification. The proposed speaker identification system (SIS) consists of four phases, namely, pre-processing phase (involves sample resizing to 40000 samples and normalization to ensure that the sound volume will modifying as a standard level), feature extraction phase (involves extracting a set of fundamental voice features that can represent or identify the entire signal of speech), feature selection phase (involves selecting the best features that describe the speaker, where dealing with hundreds number of features leads to increase the workload of recognition) and recognition phase (involves Backpropagation (BP) neural network in this research). In this work the effects of appropriate extracted voice features from various levels of discrete wavelet transformation (DWT) and the concatenation of DWT and curvelet transformation (DWT+Curvelet hereinafter) are studied. The effects of reducing the number of features via Principal component analysis (PCA) on speaker identification is also investigated, and the (BP) neural network was introduced as a classifier. The classifier is trained with a different set of features extracted from three different levels of DWT; these features are extracted one level at a time. The recognition capabilities of the classifier for all levels are compared to determine the best level. This research explores any positive or negative effects of DWT+Curvelet on the classification capability of the proposed system. in addition, this work investigates the effects of reducing the number of features via PCA with DWT and DWT+Curvelet In this research, different three datasets were used for speaker identification system, where these dataset used for train and testing the Feed-Forward Backpropagation (BP). In this approach it is clear that introducing PCA with BP networks improved the accuracy and is an effective method for speaker identification system, where it keeps the effective information and reduces the redundancy of characteristic parameters Four experiments are performed as follows using the three datasets: Experiment 1: only DWT features that extracted from each level of discrete wavelet transformation independently are used to train and test the Neural Network; Experiment 2: the features extracted from each level of (DWT+Curvelet) used to train and test the Neural Network; Experiment 3: With DWT features after utilized principal component analysis used to train and test the neural network; Experiment 4: With (DWT+Curvelet) features after utilized principal component analysis used to train and test the Neural Network. Practical results showed that, the accuracy is improved in level 1 and 2 with database 1 and increased by approximately 5% and 4%, respectively; whereas the accuracy was improved in all levels 1, 2 and 3 with Database 2 and 3 and increased by approximately 11%, 4% and 2% for database 2 and 9%, 11%, 5% for database 3 respectively, when applying (DWT+Curvelet). The system was trained and tested using (Cross-validation).
596			\|a 1
650		0	\|a Speech processing systems
650		0	\|a System identification
650		0	\|a Wavelets (Mathematics)
655	7		\|a Theses, IIUM local
690			\|a Dissertations, Academic \|x Kulliyyah of Information and Communication Technology \|z IIUM
700	1		\|a Akram M. Zeki, \|e degree supervisor
700	1		\|a Muzhir Shaban Al-Ani, \|e degree supervisor
700	1		\|a Az-Eddine Messikh , \|e degree supervisor
710	2		\|a International Islamic University Malaysia. \|b Kulliyyah of Information and Communication Technology
856	4		\|u http://studentrepo.iium.edu.my/handle/123456789/10087
900			\|a sz-asbh
999			\|c 440070 \|d 473269
952			\|0 0 \|1 0 \|2 lcc \|4 0 \|6 T T K7882 S65 A00165S 02020 \|7 3 \|8 IIUMTHESIS \|9 762446 \|a IIUM \|b IIUM \|c MULTIMEDIA \|d 2022-08-04 \|g 0.00 \|o t TK 7882 S65 A165S 2020 \|p 11100418277 \|r 1900-01-02 \|t 1 \|v 0.00 \|y THESIS
952			\|0 0 \|1 0 \|2 lcc \|4 0 \|6 TS C D F TK 07882 S00065 A00165S 02020 \|7 3 \|8 THESISSOFTCOPY \|9 859332 \|a IIUM \|b IIUM \|c MULTIMEDIA \|d 2022-08-04 \|g 0.00 \|o ts cdf TK 7882 S65 A165S 2020 \|p 11100418278 \|r 1900-01-02 \|t 1 \|v 0.00 \|y THESISDIG

Speaker identification based on hybrid feature extraction techniques /

相似書籍