Cross language information retrieval (Malay-Arabic) for hadith document using stemming and exact matching technique / Farhana Hasan

Classical Information Retrieval (IR) is the sifting out of the documents most relevant to a user's information requirement expressed as a "query", from a large electronic store of documents. A search engine performs IR by retrieving relevant web pages from the internet. Cross Language...

Full description

Saved in:
Bibliographic Details
Main Author: Hasan, Farhana
Format: Thesis
Language:English
Published: 2010
Online Access:https://ir.uitm.edu.my/id/eprint/87107/1/87107.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Classical Information Retrieval (IR) is the sifting out of the documents most relevant to a user's information requirement expressed as a "query", from a large electronic store of documents. A search engine performs IR by retrieving relevant web pages from the internet. Cross Language Information Retrieval (CLIR) allows the user to state their query in one language, and retrieve documents in another. Some CLIR systems use language resources such as bilingual dictionaries to translate the user's original query. Generally, Hadith directory provide facility to search Hadith, but the main problem is translation between Malay to Arabic Hadith document is rarely found and it use Arabic as lingual franca. Thus mean, only people who have master on Arabic or at least have basic Arabic can use that system. As effect from this situation, it will create language barrier for the non-Arabic because only a few people especially Malay people can use this facility. Therefore, Cross Language Information Retrieval (CLIR) is use to overcome this problem. The objectives of this project are to develop a Cross Language Information Retrieval CLIR (Malay-Arabic) search engine for Hadith (Sahih Bukhari & Sahih Muslim) text documents using stemming and exact match and to create a digitized dictionary (Malay-Arabic) with a limited scope. In investigate the retrieval effectiveness by using Recall and Precision formula, there are five experiments are conducted based on the queries on that language (Roslan, 2008).