Enhancement of rules-application-order (RAO) stemming algorithm based on the first character of Malay word / Edatul Muliana Ghazalli
Stemming is important thing to improve retrieval effectiveness. Stemming is used to reduce the size of indexing file for relevancy of document retrieval. Stemming is technique to truncate the word into the root word that will reduce vocabulary size and improve recall. The Malay affixes consist of...
Saved in:
Main Author: | |
---|---|
Format: | Thesis |
Language: | English |
Published: |
2005
|
Online Access: | https://ir.uitm.edu.my/id/eprint/1429/1/TB_EDATUL%20MULIANA%20GHAZALLI%20CS%2005_5%20P01.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my-uitm-ir.1429 |
---|---|
record_format |
uketd_dc |
spelling |
my-uitm-ir.14292019-04-05T07:12:54Z Enhancement of rules-application-order (RAO) stemming algorithm based on the first character of Malay word / Edatul Muliana Ghazalli 2005 Ghazalli, Edatul Muliana Stemming is important thing to improve retrieval effectiveness. Stemming is used to reduce the size of indexing file for relevancy of document retrieval. Stemming is technique to truncate the word into the root word that will reduce vocabulary size and improve recall. The Malay affixes consist of four different types such as prefix, prefix-suffix, suffix and infix. An effective and powerfiil of Malay stemmer is it just not to move the suffixes rules only but it must remove all four types of affixes. Without removing all the affixes, the stem caimot be effectively used to index of Malay documents. So in order to get the best order of morphological rule for effective and powerfiil stemmer the researcher has to find out the best order of morphological rule to stem Malay words based on first character for each alphabet. This project involves the use of two combinations simultaneously. The words that could not stem correctly by the first combination of best order which is primary will shift to alternative combination of best order of morphological rule. The resuhs of experiment B, which is enhance project is better than experiment A, which is Rules-Application-Order (RAO) by Fatimah (1995) because that algorithm has successfully stemmed all word begin with alphabet "A" until "Z" that extracted fi-om Quran documents. 2005 Thesis https://ir.uitm.edu.my/id/eprint/1429/ https://ir.uitm.edu.my/id/eprint/1429/1/TB_EDATUL%20MULIANA%20GHAZALLI%20CS%2005_5%20P01.pdf text en public degree Universiti Teknologi MARA Faculty of Computer and Mathematical Sciences |
institution |
Universiti Teknologi MARA |
collection |
UiTM Institutional Repository |
language |
English |
description |
Stemming is important thing to improve retrieval effectiveness. Stemming is
used to reduce the size of indexing file for relevancy of document retrieval.
Stemming is technique to truncate the word into the root word that will reduce
vocabulary size and improve recall. The Malay affixes consist of four different
types such as prefix, prefix-suffix, suffix and infix. An effective and powerfiil of
Malay stemmer is it just not to move the suffixes rules only but it must remove
all four types of affixes. Without removing all the affixes, the stem caimot be
effectively used to index of Malay documents. So in order to get the best order of
morphological rule for effective and powerfiil stemmer the researcher has to find
out the best order of morphological rule to stem Malay words based on first
character for each alphabet. This project involves the use of two combinations
simultaneously. The words that could not stem correctly by the first combination
of best order which is primary will shift to alternative combination of best order
of morphological rule. The resuhs of experiment B, which is enhance project is
better than experiment A, which is Rules-Application-Order (RAO) by Fatimah
(1995) because that algorithm has successfully stemmed all word begin with
alphabet "A" until "Z" that extracted fi-om Quran documents. |
format |
Thesis |
qualification_level |
Bachelor degree |
author |
Ghazalli, Edatul Muliana |
spellingShingle |
Ghazalli, Edatul Muliana Enhancement of rules-application-order (RAO) stemming algorithm based on the first character of Malay word / Edatul Muliana Ghazalli |
author_facet |
Ghazalli, Edatul Muliana |
author_sort |
Ghazalli, Edatul Muliana |
title |
Enhancement of rules-application-order (RAO) stemming algorithm based on the first character of Malay word / Edatul Muliana Ghazalli |
title_short |
Enhancement of rules-application-order (RAO) stemming algorithm based on the first character of Malay word / Edatul Muliana Ghazalli |
title_full |
Enhancement of rules-application-order (RAO) stemming algorithm based on the first character of Malay word / Edatul Muliana Ghazalli |
title_fullStr |
Enhancement of rules-application-order (RAO) stemming algorithm based on the first character of Malay word / Edatul Muliana Ghazalli |
title_full_unstemmed |
Enhancement of rules-application-order (RAO) stemming algorithm based on the first character of Malay word / Edatul Muliana Ghazalli |
title_sort |
enhancement of rules-application-order (rao) stemming algorithm based on the first character of malay word / edatul muliana ghazalli |
granting_institution |
Universiti Teknologi MARA |
granting_department |
Faculty of Computer and Mathematical Sciences |
publishDate |
2005 |
url |
https://ir.uitm.edu.my/id/eprint/1429/1/TB_EDATUL%20MULIANA%20GHAZALLI%20CS%2005_5%20P01.pdf |
_version_ |
1783733013510619136 |