Hybrid concept-based lattice mining model using formal concept analysis (FCA) and adjacency matrix for Al-Qur'an text retrieval

Introduction: In Information Retrieval (IR), searching process involves a query that is matched to relevant documents using various techniques. Information retrieval regarding AI-Qur'an involves the retrieval of verses relating to specific concepts of interests but the contributions on the quer...

Full description

Saved in:
Bibliographic Details
Main Author: Hasni binti Hassan (Author)
Other Authors: x
Format: Thesis Book
Language:English
Subjects:
x
x
Tags: Add Tag
No Tags, Be the first to tag this record!
LEADER 04833cam a2200397 7i4500
001 0000100005
005 20210331090000.0
007 axx
008 210310s2021 my eng
020 |a x  
040 |a x 
050 0 0 |a QA76 
090 0 0 |a QA76   |b .H37 2020 
100 0 |a Hasni binti Hassan   |e author  
245 0 0 |a Hybrid concept-based lattice mining model using formal concept analysis (FCA) and adjacency matrix for Al-Qur'an text retrieval   |c Hasni binti Hassan. 
246 0 |a x. 
264 0 |c 2020. 
300 |a xvii, 294 leaves;   |c 31 cm. 
336 |a text  |2 rdacontent 
337 |a unmediated  |2 rdamedia 
338 |a volume  |2 rdacarrier 
347 |a x 
500 |a x 
502 |a Thesis (Degree of Doctor of Philosophy) - Universiti Sultan Zainal Abidin, 2020 
504 |a Includes bibliographical references (leaves 233-253) 
505 0 |a 1. Introduction -- 2. Review of literature -- 3. Methodology -- 4. Results and discussions -- 5. Conclusions and recommendations 
520 |a Introduction: In Information Retrieval (IR), searching process involves a query that is matched to relevant documents using various techniques. Information retrieval regarding AI-Qur'an involves the retrieval of verses relating to specific concepts of interests but the contributions on the query matching are relatively low due to the nature of the Qur'an itself. The process of extracting information from AI-Qur'an text is complicated where the challenges come in many forms such as same concepts that might be mentioned in different verses, a verse that may be alluded to many themes, a concept mentioned using different words, and a term that may refer to different things and might have different name(s). However, semantic query matching for AI-Qur'an text can be improved by emphasizing the processes of text extraction and similarity analysis. Therefore, this study aims to contribute to the process of semantic query matching focusing on the domain of pilgrimage by proposing a model called Concept­Based Lattice Mining (CBLM). Methodology: The research methodology involves four main stages that include key terms extraction, preparation of two datasets, Formal Concept Analysis (FCA) and concept-based lattice mining process, and finally measuring lattice similarity between FCA concept lattices. Prior to proposing the similarity algorithm, a comparison to a base model was conducted and it was found that the similarity formula gives similar answer to this research but it only measure first level similarity between graphs. However, this research proposes it further step in the algorithm to refine the degree of similarity within a dataset up to the second level. Dataset under study were 53 verses related to Hajj and Umrah from the AI-Qur'an (taken from AI-Hilali English extended Qur'an translation) and related hadiths. The reference dataset was obtained based on questions and answers related to Hajj and Umrah from the website of' Jabatan Agama dan Kemajuan Islam Malaysia' (JAKIM). Categorization of the datasets and results were validated by domain experts and implementation of the CBLM model in both datasets was evaluated by comparing accuracy and Kappa values. Results: After several experiments conducted, results showed that the accuracy obtained was from 70% to 83%, in line with the improvement of Kappa values. Overall, the performance of the dataset of JAKIM is consistent with the judgment by the domain experts; exhibiting its validity to be used as the reference dataset in testing the proposed technique of the CBLM model. Similar justification could be employed with the dataset of AI-Qur'an and Hadiths where superior performance in terms of average precision, F-Measure, and accuracy were observed; indicating its potential use in conjunction with the CBLM model. Since to date, there is no published standard on the range of acceptable percentage of accuracy for non­standard datasets as in the case of this study, the accuracy obtained supported by improved Kappa's statistic is deemed satisfactory for this study. Conclusion: Overall, this research not only contributed to keyword extraction of Qur'anic text by proposing a hybrid text extraction model but also highlighted the importance ofFCA theory in the determination of the underlying concepts in Qur'anic text. It also indicates that the CBLM model contributes as a useful technique for similarity analysis using Formal Concept Analysis and graph theory.  
600 0 0 |a x 
610 2 0 |a Universiti Sultan Zainal Abidin --   |x Dissertations  
650 0 |a Dissertations, Academic  
650 0 |a Computational intelligence  
650 0 |a Data mining  
651 0 |a x  
700 0 |a x  
710 2 |a Universiti Sultan Zainal Abidin  
999 |a 1000182126  |b Thesis  |c Reference  |e Tembila Bibliographic & Index Unit