Keyword indexing for text documents using signature files / Abdul Hakim A. Gafa

Information retrieval is the first step in developing retrieval systems for text document in collections. Signature File is popular and effective in searching and retrieving processes (Zobel and Moffat, 2006) other than Inverted Files. This project explores the potential and limitation of prototype...

Full description

Saved in:
Bibliographic Details
Main Author: A. Gafa, Abdul Hakim
Format: Thesis
Language:English
Published: 2008
Subjects:
Online Access:https://ir.uitm.edu.my/id/eprint/98182/1/98182.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
id my-uitm-ir.98182
record_format uketd_dc
spelling my-uitm-ir.981822024-07-29T09:39:02Z Keyword indexing for text documents using signature files / Abdul Hakim A. Gafa 2008 A. Gafa, Abdul Hakim Information organization Information retrieval is the first step in developing retrieval systems for text document in collections. Signature File is popular and effective in searching and retrieving processes (Zobel and Moffat, 2006) other than Inverted Files. This project explores the potential and limitation of prototype text search engines using Signature Files on Malaysian Text Documents. Malaysian Text Documents is an official text report of proceedings and debates in parliament which is documented in Malay Language and maintained by House of Parliament. These document are categorizes into House of Commons and House of Lords. Currently, searching and retrieving information from text document in Malay Language are done manually. These process are tedious, very time consuming and inefficient. Text search engine prototype using signature file can speed up the process of searching and retrieving information from Malaysian text documents. The main of this project is to compare the effectiveness of searching Text documents between using Signature files algorithm and Inverted files algorithm. In order to achieve the main objective, the Signature Files algorithm for indexing methods needs to be understood and implemented. A text search engine prototype for Malay Text Document will developed as a tools to evaluate the effectiveness of searching Text Documents using Signature Files and Inverted Files. 2008 Thesis https://ir.uitm.edu.my/id/eprint/98182/ https://ir.uitm.edu.my/id/eprint/98182/1/98182.pdf text en public degree Universiti Teknologi MARA (UiTM) Faculty of Computer and Mathematical Sciences Sheikh Aljunid, Syed Ahmad
institution Universiti Teknologi MARA
collection UiTM Institutional Repository
language English
advisor Sheikh Aljunid, Syed Ahmad
topic Information organization
spellingShingle Information organization
A. Gafa, Abdul Hakim
Keyword indexing for text documents using signature files / Abdul Hakim A. Gafa
description Information retrieval is the first step in developing retrieval systems for text document in collections. Signature File is popular and effective in searching and retrieving processes (Zobel and Moffat, 2006) other than Inverted Files. This project explores the potential and limitation of prototype text search engines using Signature Files on Malaysian Text Documents. Malaysian Text Documents is an official text report of proceedings and debates in parliament which is documented in Malay Language and maintained by House of Parliament. These document are categorizes into House of Commons and House of Lords. Currently, searching and retrieving information from text document in Malay Language are done manually. These process are tedious, very time consuming and inefficient. Text search engine prototype using signature file can speed up the process of searching and retrieving information from Malaysian text documents. The main of this project is to compare the effectiveness of searching Text documents between using Signature files algorithm and Inverted files algorithm. In order to achieve the main objective, the Signature Files algorithm for indexing methods needs to be understood and implemented. A text search engine prototype for Malay Text Document will developed as a tools to evaluate the effectiveness of searching Text Documents using Signature Files and Inverted Files.
format Thesis
qualification_level Bachelor degree
author A. Gafa, Abdul Hakim
author_facet A. Gafa, Abdul Hakim
author_sort A. Gafa, Abdul Hakim
title Keyword indexing for text documents using signature files / Abdul Hakim A. Gafa
title_short Keyword indexing for text documents using signature files / Abdul Hakim A. Gafa
title_full Keyword indexing for text documents using signature files / Abdul Hakim A. Gafa
title_fullStr Keyword indexing for text documents using signature files / Abdul Hakim A. Gafa
title_full_unstemmed Keyword indexing for text documents using signature files / Abdul Hakim A. Gafa
title_sort keyword indexing for text documents using signature files / abdul hakim a. gafa
granting_institution Universiti Teknologi MARA (UiTM)
granting_department Faculty of Computer and Mathematical Sciences
publishDate 2008
url https://ir.uitm.edu.my/id/eprint/98182/1/98182.pdf
_version_ 1811768893693231104