Adaptive model for semantic question answering disambiguation over linked data
Semantic Question Answering (SQA) accepts natural language question (NL) from users and presents the exact answer retrieved from the linked data. It requires three disambiguations which are NL question disambiguation, linked data environment disambiguation and multi-types of word disambiguation....
Saved in:
Main Author: | |
---|---|
Format: | Thesis |
Language: | English |
Published: |
2018
|
Subjects: | |
Online Access: | http://psasir.upm.edu.my/id/eprint/76956/1/FSKTM%202018%2066%20-%20IR.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my-upm-ir.76956 |
---|---|
record_format |
uketd_dc |
spelling |
my-upm-ir.769562020-02-11T01:52:31Z Adaptive model for semantic question answering disambiguation over linked data 2018-04 Sofian, Hazrina Semantic Question Answering (SQA) accepts natural language question (NL) from users and presents the exact answer retrieved from the linked data. It requires three disambiguations which are NL question disambiguation, linked data environment disambiguation and multi-types of word disambiguation. Firstly, the NL disambiguation involves the disambiguation of three meta-mapping aspects which are the variation of question pattern, question complexity and linguistic terminologies of NL questions posed by users. Secondly, the linked data disambiguation involves the disambiguation of another four meta-mapping aspects which are the variation of datatype, resource heterogeneity, knowledge-based (KB) concept terminology and the variation of structure in the linked data. Thirdly, the word disambiguation involves the disambiguation between the linguistic terminology and the KB concept terminology. These three disambiguations are needed to be addressed simultaneously because through empirical study that had been carried out, this research has found that the Simple Protocol and RDF Query Language (SPARQL) components are determined by these seven meta-mapping aspects. Most existing researches modify the question, manually; select only certain patterns of NL questions or select only simple questions from the dataset. Moreover, certain processes are semi-automated as some SQAs rely heavily on pre-determined lexicon knowledge for word disambiguation or manually annotate mapping for the SPARQL query constructions. However, the manual or semi-automated process is unable to cater for new question patterns posed by users or to adapt the contents in the linked data that is ever-changing and incrementally growing. These motivate this research to firstly design the Adaptive-based Natural Language Disambiguation (ANLD) model which is integrated with the Linguistic-based SPARQL Translation Model (LBSTM), selective (Part of Speech Tagging) POS tag extraction technique, composition of syntactic representation technique and model matching technique to disambiguate NL questions. Next, this research designs the Adaptive-based Linked Data Structure Disambiguation (ALID) model that is executed if the output of the ANLD model is not able to retrieve answer from the linked data. ALID uses component-based approach and feedback loop approach to disambiguate linked data environment and to disambiguate the word ambiguity. Precision, recall and f-measure are used as performance metrics to evaluate the accuracy of the SPARQL queries which are the outputs of this research. The accuracy is evaluated by comparing the constructed SPARQL queries with the golden standard results provided by the dataset. These results illustrate that the adaptive models are able to perform the three SQA disambiguation abilities simultaneously without manual modification. These achievements empower autonomous processing of translating NL questions to the SPARQL queries that involves users with unpredictable style of question writings against the linked data that is incrementally growing in terms of size and complexity. Semantic Web Speech processing systems Semantic networks (Information theory) 2018-04 Thesis http://psasir.upm.edu.my/id/eprint/76956/ http://psasir.upm.edu.my/id/eprint/76956/1/FSKTM%202018%2066%20-%20IR.pdf text en public doctoral Universiti Putra Malaysia Semantic Web Speech processing systems Semantic networks (Information theory) |
institution |
Universiti Putra Malaysia |
collection |
PSAS Institutional Repository |
language |
English |
topic |
Semantic Web Speech processing systems Semantic networks (Information theory) |
spellingShingle |
Semantic Web Speech processing systems Semantic networks (Information theory) Sofian, Hazrina Adaptive model for semantic question answering disambiguation over linked data |
description |
Semantic Question Answering (SQA) accepts natural language question (NL) from
users and presents the exact answer retrieved from the linked data. It requires three
disambiguations which are NL question disambiguation, linked data environment
disambiguation and multi-types of word disambiguation. Firstly, the NL
disambiguation involves the disambiguation of three meta-mapping aspects which
are the variation of question pattern, question complexity and linguistic
terminologies of NL questions posed by users. Secondly, the linked data
disambiguation involves the disambiguation of another four meta-mapping aspects
which are the variation of datatype, resource heterogeneity, knowledge-based (KB)
concept terminology and the variation of structure in the linked data. Thirdly, the
word disambiguation involves the disambiguation between the linguistic
terminology and the KB concept terminology. These three disambiguations are
needed to be addressed simultaneously because through empirical study that had
been carried out, this research has found that the Simple Protocol and RDF Query
Language (SPARQL) components are determined by these seven meta-mapping
aspects.
Most existing researches modify the question, manually; select only certain patterns
of NL questions or select only simple questions from the dataset. Moreover, certain
processes are semi-automated as some SQAs rely heavily on pre-determined lexicon
knowledge for word disambiguation or manually annotate mapping for the SPARQL
query constructions. However, the manual or semi-automated process is unable to
cater for new question patterns posed by users or to adapt the contents in the linked
data that is ever-changing and incrementally growing.
These motivate this research to firstly design the Adaptive-based Natural Language
Disambiguation (ANLD) model which is integrated with the Linguistic-based
SPARQL Translation Model (LBSTM), selective (Part of Speech Tagging) POS tag
extraction technique, composition of syntactic representation technique and model
matching technique to disambiguate NL questions. Next, this research designs the
Adaptive-based Linked Data Structure Disambiguation (ALID) model that is
executed if the output of the ANLD model is not able to retrieve answer from the
linked data. ALID uses component-based approach and feedback loop approach to
disambiguate linked data environment and to disambiguate the word ambiguity.
Precision, recall and f-measure are used as performance metrics to evaluate the
accuracy of the SPARQL queries which are the outputs of this research. The
accuracy is evaluated by comparing the constructed SPARQL queries with the
golden standard results provided by the dataset. These results illustrate that the
adaptive models are able to perform the three SQA disambiguation abilities
simultaneously without manual modification. These achievements empower
autonomous processing of translating NL questions to the SPARQL queries that
involves users with unpredictable style of question writings against the linked data
that is incrementally growing in terms of size and complexity. |
format |
Thesis |
qualification_level |
Doctorate |
author |
Sofian, Hazrina |
author_facet |
Sofian, Hazrina |
author_sort |
Sofian, Hazrina |
title |
Adaptive model for semantic question answering disambiguation over linked data |
title_short |
Adaptive model for semantic question answering disambiguation over linked data |
title_full |
Adaptive model for semantic question answering disambiguation over linked data |
title_fullStr |
Adaptive model for semantic question answering disambiguation over linked data |
title_full_unstemmed |
Adaptive model for semantic question answering disambiguation over linked data |
title_sort |
adaptive model for semantic question answering disambiguation over linked data |
granting_institution |
Universiti Putra Malaysia |
publishDate |
2018 |
url |
http://psasir.upm.edu.my/id/eprint/76956/1/FSKTM%202018%2066%20-%20IR.pdf |
_version_ |
1747813195149475840 |