Adaptive model for semantic question answering disambiguation over linked data

Semantic Question Answering (SQA) accepts natural language question (NL) from users and presents the exact answer retrieved from the linked data. It requires three disambiguations which are NL question disambiguation, linked data environment disambiguation and multi-types of word disambiguation....

Full description

Saved in:
Bibliographic Details
Main Author: Sofian, Hazrina
Format: Thesis
Language:English
Published: 2018
Subjects:
Online Access:http://psasir.upm.edu.my/id/eprint/76956/1/FSKTM%202018%2066%20-%20IR.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
id my-upm-ir.76956
record_format uketd_dc
spelling my-upm-ir.769562020-02-11T01:52:31Z Adaptive model for semantic question answering disambiguation over linked data 2018-04 Sofian, Hazrina Semantic Question Answering (SQA) accepts natural language question (NL) from users and presents the exact answer retrieved from the linked data. It requires three disambiguations which are NL question disambiguation, linked data environment disambiguation and multi-types of word disambiguation. Firstly, the NL disambiguation involves the disambiguation of three meta-mapping aspects which are the variation of question pattern, question complexity and linguistic terminologies of NL questions posed by users. Secondly, the linked data disambiguation involves the disambiguation of another four meta-mapping aspects which are the variation of datatype, resource heterogeneity, knowledge-based (KB) concept terminology and the variation of structure in the linked data. Thirdly, the word disambiguation involves the disambiguation between the linguistic terminology and the KB concept terminology. These three disambiguations are needed to be addressed simultaneously because through empirical study that had been carried out, this research has found that the Simple Protocol and RDF Query Language (SPARQL) components are determined by these seven meta-mapping aspects. Most existing researches modify the question, manually; select only certain patterns of NL questions or select only simple questions from the dataset. Moreover, certain processes are semi-automated as some SQAs rely heavily on pre-determined lexicon knowledge for word disambiguation or manually annotate mapping for the SPARQL query constructions. However, the manual or semi-automated process is unable to cater for new question patterns posed by users or to adapt the contents in the linked data that is ever-changing and incrementally growing. These motivate this research to firstly design the Adaptive-based Natural Language Disambiguation (ANLD) model which is integrated with the Linguistic-based SPARQL Translation Model (LBSTM), selective (Part of Speech Tagging) POS tag extraction technique, composition of syntactic representation technique and model matching technique to disambiguate NL questions. Next, this research designs the Adaptive-based Linked Data Structure Disambiguation (ALID) model that is executed if the output of the ANLD model is not able to retrieve answer from the linked data. ALID uses component-based approach and feedback loop approach to disambiguate linked data environment and to disambiguate the word ambiguity. Precision, recall and f-measure are used as performance metrics to evaluate the accuracy of the SPARQL queries which are the outputs of this research. The accuracy is evaluated by comparing the constructed SPARQL queries with the golden standard results provided by the dataset. These results illustrate that the adaptive models are able to perform the three SQA disambiguation abilities simultaneously without manual modification. These achievements empower autonomous processing of translating NL questions to the SPARQL queries that involves users with unpredictable style of question writings against the linked data that is incrementally growing in terms of size and complexity. Semantic Web Speech processing systems Semantic networks (Information theory) 2018-04 Thesis http://psasir.upm.edu.my/id/eprint/76956/ http://psasir.upm.edu.my/id/eprint/76956/1/FSKTM%202018%2066%20-%20IR.pdf text en public doctoral Universiti Putra Malaysia Semantic Web Speech processing systems Semantic networks (Information theory)
institution Universiti Putra Malaysia
collection PSAS Institutional Repository
language English
topic Semantic Web
Speech processing systems
Semantic networks (Information theory)
spellingShingle Semantic Web
Speech processing systems
Semantic networks (Information theory)
Sofian, Hazrina
Adaptive model for semantic question answering disambiguation over linked data
description Semantic Question Answering (SQA) accepts natural language question (NL) from users and presents the exact answer retrieved from the linked data. It requires three disambiguations which are NL question disambiguation, linked data environment disambiguation and multi-types of word disambiguation. Firstly, the NL disambiguation involves the disambiguation of three meta-mapping aspects which are the variation of question pattern, question complexity and linguistic terminologies of NL questions posed by users. Secondly, the linked data disambiguation involves the disambiguation of another four meta-mapping aspects which are the variation of datatype, resource heterogeneity, knowledge-based (KB) concept terminology and the variation of structure in the linked data. Thirdly, the word disambiguation involves the disambiguation between the linguistic terminology and the KB concept terminology. These three disambiguations are needed to be addressed simultaneously because through empirical study that had been carried out, this research has found that the Simple Protocol and RDF Query Language (SPARQL) components are determined by these seven meta-mapping aspects. Most existing researches modify the question, manually; select only certain patterns of NL questions or select only simple questions from the dataset. Moreover, certain processes are semi-automated as some SQAs rely heavily on pre-determined lexicon knowledge for word disambiguation or manually annotate mapping for the SPARQL query constructions. However, the manual or semi-automated process is unable to cater for new question patterns posed by users or to adapt the contents in the linked data that is ever-changing and incrementally growing. These motivate this research to firstly design the Adaptive-based Natural Language Disambiguation (ANLD) model which is integrated with the Linguistic-based SPARQL Translation Model (LBSTM), selective (Part of Speech Tagging) POS tag extraction technique, composition of syntactic representation technique and model matching technique to disambiguate NL questions. Next, this research designs the Adaptive-based Linked Data Structure Disambiguation (ALID) model that is executed if the output of the ANLD model is not able to retrieve answer from the linked data. ALID uses component-based approach and feedback loop approach to disambiguate linked data environment and to disambiguate the word ambiguity. Precision, recall and f-measure are used as performance metrics to evaluate the accuracy of the SPARQL queries which are the outputs of this research. The accuracy is evaluated by comparing the constructed SPARQL queries with the golden standard results provided by the dataset. These results illustrate that the adaptive models are able to perform the three SQA disambiguation abilities simultaneously without manual modification. These achievements empower autonomous processing of translating NL questions to the SPARQL queries that involves users with unpredictable style of question writings against the linked data that is incrementally growing in terms of size and complexity.
format Thesis
qualification_level Doctorate
author Sofian, Hazrina
author_facet Sofian, Hazrina
author_sort Sofian, Hazrina
title Adaptive model for semantic question answering disambiguation over linked data
title_short Adaptive model for semantic question answering disambiguation over linked data
title_full Adaptive model for semantic question answering disambiguation over linked data
title_fullStr Adaptive model for semantic question answering disambiguation over linked data
title_full_unstemmed Adaptive model for semantic question answering disambiguation over linked data
title_sort adaptive model for semantic question answering disambiguation over linked data
granting_institution Universiti Putra Malaysia
publishDate 2018
url http://psasir.upm.edu.my/id/eprint/76956/1/FSKTM%202018%2066%20-%20IR.pdf
_version_ 1747813195149475840