Grammar guided automatic speech system for Malay language / Radde Idzwan Abdul Razak
Speech recognition is difficult because of the analog nature of the speech itself. In speech recognition, a system is trained to receive human speech, recognize each word in the speech and transform it into text. There are many existing Automatic Speech Recognition (ASR) system that able to recogniz...
Saved in:
Main Author: | |
---|---|
Format: | Thesis |
Language: | English |
Published: |
2009
|
Subjects: | |
Online Access: | https://ir.uitm.edu.my/id/eprint/98213/1/98213.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my-uitm-ir.98213 |
---|---|
record_format |
uketd_dc |
spelling |
my-uitm-ir.982132024-08-21T23:31:39Z Grammar guided automatic speech system for Malay language / Radde Idzwan Abdul Razak 2009 Abdul Razak, Radde Idzwan Programming languages (Electronic computers) Speech recognition is difficult because of the analog nature of the speech itself. In speech recognition, a system is trained to receive human speech, recognize each word in the speech and transform it into text. There are many existing Automatic Speech Recognition (ASR) system that able to recognize English language speech with high accuracy. Unfortunately, an ASR system normally focuses on only one language. Today, there are no or lack of Malay language recognizable ASR system. Thus, an ASR system that is capable of recognizing Malay words would be beneficial especially in Malaysia. In order for the ASR system to transform speech signal into a specific word, the system requires a database which store list of words with each word pronunciation. This database is called a dictionary. A dictionary is use by an ASR system as references to find words match to the received speech signal. In order to be able to recognize spoken Malay words, a dictionary must contain a list of Malay words and each word must be associated with its pronunciation. Thus, this research is focusing on building Malay dictionary for an ASR system. The dictionary built in this research is meant to be used for ASR system based on Sphinx-4 framework. Each word is tested for recognition accuracy on the ASR system prototype. Trial and error method was used to produce word’s phonemes that are most accurate. Well defined words in the dictionary do not guarantee high recognition accuracy. Word Error Rate (WER) typically increases when the size of dictionary is increase. Grammar implementation would reduce the number of words to be combined to construct a sentence. Thus, it increases the chances of accurate recognition with large dictionary for continuous speech. The finding of this research shows that Sphinx-4 is a good framework to be use with Malay language ASR. The outcome of this research proved that grammar able to increase recognition accuracy of Malay language ASR. 2009 Thesis https://ir.uitm.edu.my/id/eprint/98213/ https://ir.uitm.edu.my/id/eprint/98213/1/98213.pdf text en public masters Universiti Teknologi MARA (UiTM) Faculty of Computer and Mathematical Sciences Abd Rahman, Nurazzah |
institution |
Universiti Teknologi MARA |
collection |
UiTM Institutional Repository |
language |
English |
advisor |
Abd Rahman, Nurazzah |
topic |
Programming languages (Electronic computers) |
spellingShingle |
Programming languages (Electronic computers) Abdul Razak, Radde Idzwan Grammar guided automatic speech system for Malay language / Radde Idzwan Abdul Razak |
description |
Speech recognition is difficult because of the analog nature of the speech itself. In speech recognition, a system is trained to receive human speech, recognize each word in the speech and transform it into text. There are many existing Automatic Speech Recognition (ASR) system that able to recognize English language speech with high accuracy. Unfortunately, an ASR system normally focuses on only one language. Today, there are no or lack of Malay language recognizable ASR system. Thus, an ASR system that is capable of recognizing Malay words would be beneficial especially in Malaysia. In order for the ASR system to transform speech signal into a specific word, the system requires a database which store list of words with each word pronunciation. This database is called a dictionary. A dictionary is use by an ASR system as references to find words match to the received speech signal. In order to be able to recognize spoken Malay words, a dictionary must contain a list of Malay words and each word must be associated with its pronunciation. Thus, this research is focusing on building Malay dictionary for an ASR system. The dictionary built in this research is meant to be used for ASR system based on Sphinx-4 framework. Each word is tested for recognition accuracy on the ASR system prototype. Trial and error method was used to produce word’s phonemes that are most accurate. Well defined words in the dictionary do not guarantee high recognition accuracy. Word Error Rate (WER) typically increases when the size of dictionary is increase. Grammar implementation would reduce the number of words to be combined to construct a sentence. Thus, it increases the chances of accurate recognition with large dictionary for continuous speech. The finding of this research shows that Sphinx-4 is a good framework to be use with Malay language ASR. The outcome of this research proved that grammar able to increase recognition accuracy of Malay language ASR. |
format |
Thesis |
qualification_level |
Master's degree |
author |
Abdul Razak, Radde Idzwan |
author_facet |
Abdul Razak, Radde Idzwan |
author_sort |
Abdul Razak, Radde Idzwan |
title |
Grammar guided automatic speech system for Malay language / Radde Idzwan Abdul Razak |
title_short |
Grammar guided automatic speech system for Malay language / Radde Idzwan Abdul Razak |
title_full |
Grammar guided automatic speech system for Malay language / Radde Idzwan Abdul Razak |
title_fullStr |
Grammar guided automatic speech system for Malay language / Radde Idzwan Abdul Razak |
title_full_unstemmed |
Grammar guided automatic speech system for Malay language / Radde Idzwan Abdul Razak |
title_sort |
grammar guided automatic speech system for malay language / radde idzwan abdul razak |
granting_institution |
Universiti Teknologi MARA (UiTM) |
granting_department |
Faculty of Computer and Mathematical Sciences |
publishDate |
2009 |
url |
https://ir.uitm.edu.my/id/eprint/98213/1/98213.pdf |
_version_ |
1811768896929136640 |