Web based directory for Malay Hadith Retrieval System using inverted indexed files technique / Siti Fariza Mohd Yusak

Information retrieval is to find all documents relevant for a user query in a collection of documents. Information retrieval (IR) is fast becoming the dominant form of information access over taking traditional database style searching. An inverted file typically includes the vocabulary, a vector co...

Full description

Saved in:
Bibliographic Details
Main Author: Mohd Yusak, Siti Fariza
Format: Thesis
Language:English
Published: 2010
Online Access:https://ir.uitm.edu.my/id/eprint/64143/1/64143.PDF
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Information retrieval is to find all documents relevant for a user query in a collection of documents. Information retrieval (IR) is fast becoming the dominant form of information access over taking traditional database style searching. An inverted file typically includes the vocabulary, a vector containing all distinct words in the text collection and a list of all document numbers in which each distinct word occurs. The lists of document numbers constitute the largest part of an index (N.Ziviani, E.Silva deMoura, CNavarro, R.A. Baeza-Yates, 2000). Inverted files require less space and provide greater functionality. Web directories are taxonomies for the classification of Web documents. This kind of IR systems presents a specific type of search where the document collection is restricted to one area of the category graph. Generally, existing Mutiara Hadith provide a search engine to search hadith using stemming and thesaurus but the main problem is it will searching by non-specific hadith. Thus mean, it will retrieve all word from the word entry. In order to improve the retrieval of relevant document, web-based directory which search hadith by main topic will be created. The objective of this project is to develop web-based directory hadith by topics for Hadith Shahih Bukhari and Hadith Shahih Muslim text document by using inberted indexed files technique. It is also to evaluate the retrieval effectiveness by using Recall and Precision formula. Evaluation from this result shows that retrieval effectiveness is well again usability in the query and inverted index using web-based directory.