Search engine for books / Amierul Izzuddin Azman

Search engine is web-based software that searches for and identifies things in a collection that match the user's keywords or characters, and is mostly used to locate specific websites on the Internet. The basic idea of information retrieval (IR), as is well-known in the computer industry, is t...

全面介绍

Saved in:

书目详细资料
主要作者:	Azman, Amierul Izzuddin
格式:	Thesis
语言:	English
出版:	2022
主题:	QA Mathematics
在线阅读:	https://ir.uitm.edu.my/id/eprint/59359/1/59359.pdf
标签:	添加标签没有标签, 成为第一个标记此记录!

id	my-uitm-ir.59359
record_format	uketd_dc
spelling	my-uitm-ir.593592022-07-21T15:39:27Z Search engine for books / Amierul Izzuddin Azman 2022-01 Azman, Amierul Izzuddin QA Mathematics Electronic Computers. Computer Science Web-based user interfaces. User interfaces (Computer systems) Search engine is web-based software that searches for and identifies things in a collection that match the user's keywords or characters, and is mostly used to locate specific websites on the Internet. The basic idea of information retrieval (IR), as is well-known in the computer industry, is to search a given amount of data and retrieve those records that fulfil a set of criteria. For this project, it will be more on developing a search system that focus on the domain of books. Although many search engines available for book searching, most of them still have room for improvement. The reality to find books using search engine are quite challenging because, the users need to know the title of the book in order to retrieved relevant result from the search engine. Therefore, this project aims to propose a search engine that might produce better result by manipulating the indexing structure of the search engine. Software Development Life Cycle also known as SDLC was used as the methodology in this project development. This project applies the vector space model for the matching process, because the chosen software library for this project which is Apache Lucene is using the vector space model as its foundation. In addition, the Bag-of-Words approach was used as the basis for the indexing module. The indexing process indexed the information of the books such as the book title, the author name, publisher, year published, pages count and synopsis of the book. The search engine's assessment criteria include recall and precision. In IR, recall and precision have long been used as standard evaluation criteria. Keenly, the results of this project prove that index files that contain more domain information can increase the relevancy of a search engine 2022-01 Thesis https://ir.uitm.edu.my/id/eprint/59359/ https://ir.uitm.edu.my/id/eprint/59359/1/59359.pdf text en public degree Universiti Teknologi MARA, Perak Faculty of Computer and Mathematical Sciences Azizan, Azilawati
institution	Universiti Teknologi MARA
collection	UiTM Institutional Repository
language	English
advisor	Azizan, Azilawati
topic	QA Mathematics QA Mathematics QA Mathematics
spellingShingle	QA Mathematics QA Mathematics QA Mathematics Azman, Amierul Izzuddin Search engine for books / Amierul Izzuddin Azman
description	Search engine is web-based software that searches for and identifies things in a collection that match the user's keywords or characters, and is mostly used to locate specific websites on the Internet. The basic idea of information retrieval (IR), as is well-known in the computer industry, is to search a given amount of data and retrieve those records that fulfil a set of criteria. For this project, it will be more on developing a search system that focus on the domain of books. Although many search engines available for book searching, most of them still have room for improvement. The reality to find books using search engine are quite challenging because, the users need to know the title of the book in order to retrieved relevant result from the search engine. Therefore, this project aims to propose a search engine that might produce better result by manipulating the indexing structure of the search engine. Software Development Life Cycle also known as SDLC was used as the methodology in this project development. This project applies the vector space model for the matching process, because the chosen software library for this project which is Apache Lucene is using the vector space model as its foundation. In addition, the Bag-of-Words approach was used as the basis for the indexing module. The indexing process indexed the information of the books such as the book title, the author name, publisher, year published, pages count and synopsis of the book. The search engine's assessment criteria include recall and precision. In IR, recall and precision have long been used as standard evaluation criteria. Keenly, the results of this project prove that index files that contain more domain information can increase the relevancy of a search engine
format	Thesis
qualification_level	Bachelor degree
author	Azman, Amierul Izzuddin
author_facet	Azman, Amierul Izzuddin
author_sort	Azman, Amierul Izzuddin
title	Search engine for books / Amierul Izzuddin Azman
title_short	Search engine for books / Amierul Izzuddin Azman
title_full	Search engine for books / Amierul Izzuddin Azman
title_fullStr	Search engine for books / Amierul Izzuddin Azman
title_full_unstemmed	Search engine for books / Amierul Izzuddin Azman
title_sort	search engine for books / amierul izzuddin azman
granting_institution	Universiti Teknologi MARA, Perak
granting_department	Faculty of Computer and Mathematical Sciences
publishDate	2022
url	https://ir.uitm.edu.my/id/eprint/59359/1/59359.pdf
_version_	1783735029585674240

Search engine for books / Amierul Izzuddin Azman

相似书籍