Malay statistical parametric speech synthesis with intelligibility improvement using artificial intelligence
Speech synthesis is important nowadays and could be a great aid in various applications. So it is important to build a simple, reliable, light-weight, ease of use speech synthesizer. However, conventional speech synthesizers require tedious human efforts to prepare high quality recorded database, an...
Saved in:
Main Author: | |
---|---|
Format: | Thesis |
Language: | English |
Published: |
2015
|
Subjects: | |
Online Access: | http://eprints.utm.my/id/eprint/77693/1/LauCheeYongPFBME2015.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my-utm-ep.77693 |
---|---|
record_format |
uketd_dc |
spelling |
my-utm-ep.776932018-06-29T21:29:23Z Malay statistical parametric speech synthesis with intelligibility improvement using artificial intelligence 2015-01 Lau, Chee Yong QH301 Biology Speech synthesis is important nowadays and could be a great aid in various applications. So it is important to build a simple, reliable, light-weight, ease of use speech synthesizer. However, conventional speech synthesizers require tedious human efforts to prepare high quality recorded database, and the intelligibility of synthetic speech may decrease due to the appearance of polyphone (character with more than 1 pronunciation) because the speech synthesizer may not contain the definition of the polyphones. Moreover, the ready speech synthesizers in market are mostly built in Unit Selection method, which is large in database size and relying on Malay linguist knowledge. In this study, statistical parametric speech synthesis method has been adopted using lab speech and free speech data harvested online. The intelligibility improvement has been achieved using Active Learning and Feedforward Neural Network with Back-Propagation. The amount of training data used remained the same throughout this study. The result was evaluated using perception test. The listening test showed that the intelligibility of synthetic speech has been improved about 20%- 30% using the artificial intelligence technique. Volunteers were invited to take part in Active Learning experiment. The result showed no controversy between the result done by volunteers and the correct answer. In conclusion, a light-weight Malay speech synthesizer has been created without relying on Malay linguist knowledge. Using free source as training data can ease the human effort in preparing training database and using artificial intelligence technique can improve the intelligibility of synthetic speech under the same amount of training data used. 2015-01 Thesis http://eprints.utm.my/id/eprint/77693/ http://eprints.utm.my/id/eprint/77693/1/LauCheeYongPFBME2015.pdf application/pdf en public http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:96409 phd doctoral Universiti Teknologi Malaysia, Faculty of Biosciences and Medical Engineering Faculty of Biosciences and Medical Engineering |
institution |
Universiti Teknologi Malaysia |
collection |
UTM Institutional Repository |
language |
English |
topic |
QH301 Biology |
spellingShingle |
QH301 Biology Lau, Chee Yong Malay statistical parametric speech synthesis with intelligibility improvement using artificial intelligence |
description |
Speech synthesis is important nowadays and could be a great aid in various applications. So it is important to build a simple, reliable, light-weight, ease of use speech synthesizer. However, conventional speech synthesizers require tedious human efforts to prepare high quality recorded database, and the intelligibility of synthetic speech may decrease due to the appearance of polyphone (character with more than 1 pronunciation) because the speech synthesizer may not contain the definition of the polyphones. Moreover, the ready speech synthesizers in market are mostly built in Unit Selection method, which is large in database size and relying on Malay linguist knowledge. In this study, statistical parametric speech synthesis method has been adopted using lab speech and free speech data harvested online. The intelligibility improvement has been achieved using Active Learning and Feedforward Neural Network with Back-Propagation. The amount of training data used remained the same throughout this study. The result was evaluated using perception test. The listening test showed that the intelligibility of synthetic speech has been improved about 20%- 30% using the artificial intelligence technique. Volunteers were invited to take part in Active Learning experiment. The result showed no controversy between the result done by volunteers and the correct answer. In conclusion, a light-weight Malay speech synthesizer has been created without relying on Malay linguist knowledge. Using free source as training data can ease the human effort in preparing training database and using artificial intelligence technique can improve the intelligibility of synthetic speech under the same amount of training data used. |
format |
Thesis |
qualification_name |
Doctor of Philosophy (PhD.) |
qualification_level |
Doctorate |
author |
Lau, Chee Yong |
author_facet |
Lau, Chee Yong |
author_sort |
Lau, Chee Yong |
title |
Malay statistical parametric speech synthesis with intelligibility improvement using artificial intelligence |
title_short |
Malay statistical parametric speech synthesis with intelligibility improvement using artificial intelligence |
title_full |
Malay statistical parametric speech synthesis with intelligibility improvement using artificial intelligence |
title_fullStr |
Malay statistical parametric speech synthesis with intelligibility improvement using artificial intelligence |
title_full_unstemmed |
Malay statistical parametric speech synthesis with intelligibility improvement using artificial intelligence |
title_sort |
malay statistical parametric speech synthesis with intelligibility improvement using artificial intelligence |
granting_institution |
Universiti Teknologi Malaysia, Faculty of Biosciences and Medical Engineering |
granting_department |
Faculty of Biosciences and Medical Engineering |
publishDate |
2015 |
url |
http://eprints.utm.my/id/eprint/77693/1/LauCheeYongPFBME2015.pdf |
_version_ |
1747817809181671424 |