Grammar guided automatic speech system for Malay language / Radde Idzwan Abdul Razak

Speech recognition is difficult because of the analog nature of speech itself. In speech recognition, a system is trained to receive human speech, recognize each word in the speech, and transform it into text. There are many existing Automatic Speech Recognition (ASR) systems that are able to recogniz...

Bibliographic Details
Main Author: Abdul Razak, Radde Idzwan
Format: Thesis
Language: English
Published: 2009
Subjects: Programming languages (Electronic computers)
Online Access: https://ir.uitm.edu.my/id/eprint/98213/1/98213.pdf
id my-uitm-ir.98213
record_format uketd_dc
institution Universiti Teknologi MARA
collection UiTM Institutional Repository
language English
advisor Abd Rahman, Nurazzah
topic Programming languages (Electronic computers)
description Speech recognition is difficult because of the analog nature of speech itself. In speech recognition, a system is trained to receive human speech, recognize each word in the speech, and transform it into text. Many existing Automatic Speech Recognition (ASR) systems can recognize English speech with high accuracy. Unfortunately, an ASR system normally focuses on only one language. Today, there are few or no ASR systems that recognize the Malay language. Thus, an ASR system capable of recognizing Malay words would be beneficial, especially in Malaysia. For an ASR system to transform a speech signal into a specific word, it requires a database that stores a list of words together with each word's pronunciation. This database is called a dictionary. An ASR system uses the dictionary as a reference to find the words that match the received speech signal. To recognize spoken Malay words, a dictionary must contain a list of Malay words, each associated with its pronunciation. Thus, this research focuses on building a Malay dictionary for an ASR system. The dictionary built in this research is meant to be used with ASR systems based on the Sphinx-4 framework. Each word was tested for recognition accuracy on the ASR system prototype. A trial-and-error method was used to find the phonemes that represent each word most accurately. Well-defined words in the dictionary do not guarantee high recognition accuracy. Word Error Rate (WER) typically increases as the size of the dictionary increases. Implementing a grammar reduces the number of words that can be combined to construct a sentence. Thus, it increases the chances of accurate recognition with a large dictionary for continuous speech. The findings of this research show that Sphinx-4 is a good framework to use for Malay language ASR. The outcome of this research proved that a grammar is able to increase the recognition accuracy of Malay language ASR.
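The abstract reports accuracy in terms of Word Error Rate (WER) but does not state how it was computed. The sketch below assumes the standard definition: the word-level edit distance (substitutions, deletions, and insertions) between the reference transcript and the recognizer output, divided by the number of reference words. The function name and the Malay sentences in the usage note are illustrative assumptions, not taken from the thesis.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = (substitutions + deletions + insertions) / reference length.

    Assumes whitespace-separated words; not the thesis's own scoring code.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the reference words consumed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing from output
                curr[j - 1] + 1,     # insertion: extra word in output
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)
```

For example, with the hypothetical reference "saya suka makan nasi" and recognizer output "saya suka makan", one word is deleted out of four, so the function returns 0.25. A grammar that restricts which word sequences are allowed would be expected to lower this figure by ruling out implausible hypotheses.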
format Thesis
qualification_level Master's degree
author Abdul Razak, Radde Idzwan
title Grammar guided automatic speech system for Malay language / Radde Idzwan Abdul Razak
granting_institution Universiti Teknologi MARA (UiTM)
granting_department Faculty of Computer and Mathematical Sciences
publishDate 2009
url https://ir.uitm.edu.my/id/eprint/98213/1/98213.pdf
_version_ 1811768896929136640