DATA INTEGRATION MODEL FOR FASTER DATA EXTRACTION AND RETRIEVAL FROM SEMI-STRUCTURED DATA FORMAT

Collections of data is crucial across a wide variety of field because of increasing data rapidly year by year. These collections are important for many organizations to make a correct decision using business intelligent applications. A business intelligent application must have capability to coll...

Full description

Saved in:
Bibliographic Details
Main Author: MOHD KAMIR YUSOF
Format: Thesis
Language:English
Online Access:http://umt-ir.umt.edu.my:8080/jspui/bitstream/123456789/15638/1/Abstract.pdf
http://umt-ir.umt.edu.my:8080/jspui/bitstream/123456789/15638/2/Full%20Thesis%20-MOHD%20KAMIR%20YUSOF.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
id my-umt-ir.-15638
record_format uketd_dc
spelling my-umt-ir.-156382022-01-17T07:56:28Z DATA INTEGRATION MODEL FOR FASTER DATA EXTRACTION AND RETRIEVAL FROM SEMI-STRUCTURED DATA FORMAT MOHD KAMIR YUSOF Collections of data is crucial across a wide variety of field because of increasing data rapidly year by year. These collections are important for many organizations to make a correct decision using business intelligent applications. A business intelligent application must have capability to collect and integrate all data from different data sources. One of the challenges in development of business intelligent application is data integration. This challenge is happened because of data structure are different. This research is looking for suitable data integration model in order to allows data integration from different data sources. Native XML (NXD) is one the model has been used in data integration. In this model, elements and attributes for each data are extract and store into Relational Database Management System (RDBMS). Meanwhile, the value for each element and attribute are stored in XML format. Based on the experiments has been done by previous researchers, NXD can produce a better performance during data insertion response time and query processing response time using SigmodRecord and DBLP datasets. However, the efficiency of NXD still has room for improvement. In the initial experiment, list of special character can be removed to improve the performance of NXD has been idenfitied in the SigmodRecord and DBLP datasets. UNIVERSITI MALAYSIA TERENGGANU 2021-05 Thesis en http://umt-ir.umt.edu.my:8080/handle/123456789/15638 http://umt-ir.umt.edu.my:8080/jspui/bitstream/123456789/15638/3/license.txt 8a4605be74aa9ea9d79846c1fba20a33 http://umt-ir.umt.edu.my:8080/jspui/bitstream/123456789/15638/1/Abstract.pdf f4b6c8d94e44c89623af9d8c0af15433 http://umt-ir.umt.edu.my:8080/jspui/bitstream/123456789/15638/2/Full%20Thesis%20-MOHD%20KAMIR%20YUSOF.pdf d6642fcd33f8e60ff94edd800619b03c
institution Universiti Malaysia Terengganu
collection UMT Repository System
language English
description Collections of data is crucial across a wide variety of field because of increasing data rapidly year by year. These collections are important for many organizations to make a correct decision using business intelligent applications. A business intelligent application must have capability to collect and integrate all data from different data sources. One of the challenges in development of business intelligent application is data integration. This challenge is happened because of data structure are different. This research is looking for suitable data integration model in order to allows data integration from different data sources. Native XML (NXD) is one the model has been used in data integration. In this model, elements and attributes for each data are extract and store into Relational Database Management System (RDBMS). Meanwhile, the value for each element and attribute are stored in XML format. Based on the experiments has been done by previous researchers, NXD can produce a better performance during data insertion response time and query processing response time using SigmodRecord and DBLP datasets. However, the efficiency of NXD still has room for improvement. In the initial experiment, list of special character can be removed to improve the performance of NXD has been idenfitied in the SigmodRecord and DBLP datasets.
format Thesis
author MOHD KAMIR YUSOF
spellingShingle MOHD KAMIR YUSOF
DATA INTEGRATION MODEL FOR FASTER DATA EXTRACTION AND RETRIEVAL FROM SEMI-STRUCTURED DATA FORMAT
author_facet MOHD KAMIR YUSOF
author_sort MOHD KAMIR YUSOF
title DATA INTEGRATION MODEL FOR FASTER DATA EXTRACTION AND RETRIEVAL FROM SEMI-STRUCTURED DATA FORMAT
title_short DATA INTEGRATION MODEL FOR FASTER DATA EXTRACTION AND RETRIEVAL FROM SEMI-STRUCTURED DATA FORMAT
title_full DATA INTEGRATION MODEL FOR FASTER DATA EXTRACTION AND RETRIEVAL FROM SEMI-STRUCTURED DATA FORMAT
title_fullStr DATA INTEGRATION MODEL FOR FASTER DATA EXTRACTION AND RETRIEVAL FROM SEMI-STRUCTURED DATA FORMAT
title_full_unstemmed DATA INTEGRATION MODEL FOR FASTER DATA EXTRACTION AND RETRIEVAL FROM SEMI-STRUCTURED DATA FORMAT
title_sort data integration model for faster data extraction and retrieval from semi-structured data format
granting_institution UNIVERSITI MALAYSIA TERENGGANU
url http://umt-ir.umt.edu.my:8080/jspui/bitstream/123456789/15638/1/Abstract.pdf
http://umt-ir.umt.edu.my:8080/jspui/bitstream/123456789/15638/2/Full%20Thesis%20-MOHD%20KAMIR%20YUSOF.pdf
_version_ 1747835827522633728