Study of stemming algorithm for Malay words which begin with alphabets 'M' / Mohd Zawawi Mohd Yunus
This research concerns a study of a stemming algorithm for Malay words that begin with the letter 'M'. It applies a Malay stemming approach called Rules-Application-Order (RAO). The performance of this Malay stemming algorithm is tested using a test collection of 1066 words that start...
Main Author: | Mohd Yunus, Mohd Zawawi |
---|---|
Format: | Thesis |
Language: | English |
Published: | 2000 |
Subjects: | Analysis |
Online Access: | https://ir.uitm.edu.my/id/eprint/98081/1/98081.pdf |
id |
my-uitm-ir.98081 |
---|---|
record_format |
uketd_dc |
institution |
Universiti Teknologi MARA |
collection |
UiTM Institutional Repository |
language |
English |
advisor |
Abu Bakar, Zainab |
topic |
Analysis |
description |
This research concerns a study of a stemming algorithm for Malay words that begin with the letter 'M'. It applies a Malay stemming approach called Rules-Application-Order (RAO). The performance of this Malay stemming algorithm is tested using a test collection of 1066 words starting with the letter 'M' that were extracted from 6236 Malay Quran documents. The study also uses 24 different combinations of Malay affixes, consisting of prefix, prefix-suffix, suffix and infix. The results are obtained from experiments that use the four rules and their combinations. The types of errors found in the stemming algorithm are overstemming, understemming, spelling exceptions and unstemmed words. These problems are addressed through five experiments: analysing the existing algorithm, making corrections in the file, adding rules, correcting the stemming algorithm, and using two combination rules. The results of the experiments show that the algorithm successfully stemmed all Malay words beginning with the letter 'M' that were extracted from the Quran documents. |
format |
Thesis |
qualification_level |
Bachelor degree |
author |
Mohd Yunus, Mohd Zawawi |
title |
Study of stemming algorithm for Malay words which begin with alphabets 'M' / Mohd Zawawi Mohd Yunus |
granting_institution |
Universiti Teknologi MARA (UiTM) |
granting_department |
Faculty of Information Technology and Quantitative Sciences |
publishDate |
2000 |
url |
https://ir.uitm.edu.my/id/eprint/98081/1/98081.pdf |
_version_ |
1811768886112026624 |