Retrieval performance of Qur'anic texts (Pimpinan Ar-Rahman) in Jawi and Rumi Malay

Bibliographic Details
Main Author: Fauziah binti Abdul Wahid (Author)
Format: Thesis
Language: English
Published: Kuala Lumpur: Kulliyyah of Information and Communication Technology, International Islamic University Malaysia, 2014
Online Access: http://studentrepo.iium.edu.my/handle/123456789/5526
Description
Summary: Advances in technology have sped up the search process, making more information available to users in a short amount of time. This extends to semantically enriched data for cross-language information retrieval (CLIR), where the material may differ in language or in writing system. This research investigated the retrieval performance of the Pimpinan Ar-Rahman translation of Quranic texts in Jawi Malay (the old version of the Malay language) and Rumi Malay (the new Roman version of the Malay language), taking into account the different versions of the language and their scripts. Queries were formulated from FAQs written by Muslim scholars in authenticated printed sources produced in Malaysia: 50 FAQs were selected from the Utusan Melayu newspaper for the Jawi Malay queries and 50 FAQs from the Berita Harian newspaper for the Rumi Malay queries. The researcher derived keywords from each FAQ to serve as the query representing it. Queries took four forms: Jawi Malay Original, Rumi Malay Transliterated, Rumi Malay Original and Jawi Malay Transliterated. Matching was conducted by submitting the queries to the Indri search engine. The matching process produced lists of Ayat, which then underwent relevance judgement by three human judges, who determined the relevance of each Ayat to its FAQ topic and verified the accuracy of retrieval. Data analysis used three measures (precision, recall and overlap) computed from the Indri retrieval tests. The results indicated that Indri is capable of handling multi-script Quranic texts: recall was high for both Jawi (96%) and Rumi (91%), precision was high for both Jawi (91%) and Rumi (91%), and overlap was high for Rumi (83%) but moderate for Jawi (72%).
To conclude, this research is among the studies of Quranic text retrieval in multi-script Malay that have outlined a state-of-the-art information retrieval system model for multi-script parallel corpora, using the Indri search engine with Roman and non-Roman scripts of the Malay language.
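As a rough illustration of the three measures named in the summary, the sketch below computes precision, recall and overlap for one query. It is not taken from the thesis. The function names and the Ayat identifiers are hypothetical, and the overlap formula (shared results divided by the union of two result lists, for example those of an original query and its transliterated counterpart) is an assumption, since the record does not state how overlap was defined.

```python
def precision(retrieved, relevant):
    """Fraction of retrieved Ayat that the judges marked relevant."""
    retrieved, relevant = set(retrieved), set(relevant)
    if not retrieved:
        return 0.0
    return len(retrieved & relevant) / len(retrieved)


def recall(retrieved, relevant):
    """Fraction of relevant Ayat that the search engine retrieved."""
    retrieved, relevant = set(retrieved), set(relevant)
    if not relevant:
        return 0.0
    return len(retrieved & relevant) / len(relevant)


def overlap(results_a, results_b):
    """Assumed definition: shared results over the union of both lists."""
    a, b = set(results_a), set(results_b)
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


# Hypothetical data for a single FAQ topic (not from the thesis).
retrieved_original = ["2:183", "2:184", "2:185", "4:43"]
retrieved_transliterated = ["2:183", "2:184", "5:6"]
judged_relevant = ["2:183", "2:184", "2:185", "2:187"]

print(precision(retrieved_original, judged_relevant))          # 0.75
print(recall(retrieved_original, judged_relevant))             # 0.75
print(overlap(retrieved_original, retrieved_transliterated))   # 0.4
```

Averaging such per-query values over all queries in one script would give summary percentages of the kind reported above, under the same assumptions.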
Item Description: Abstract in English and Arabic.
"A thesis submitted in fulfilment of the requirement for the degree of Doctor of Philosophy in Library and Information Science."--On title page.
Physical Description: xviii, 306 leaves : illustrations ; 30 cm.
Bibliography: Includes bibliographical references (leaves 151-157).