Run Run Shaw Library, City University of Hong Kong

Please use this identifier to cite or link to this item: http://dspace.cityu.edu.hk/handle/2031/7514
Full metadata record
DC Field | Value | Language
dc.contributor.author | Chan, Kam Lam | en_US
dc.date.accessioned | 2015-03-31T01:48:57Z |
dc.date.accessioned | 2017-09-19T08:50:56Z |
dc.date.accessioned | 2019-02-12T06:53:08Z | -
dc.date.available | 2015-03-31T01:48:57Z |
dc.date.available | 2017-09-19T08:50:56Z |
dc.date.available | 2019-02-12T06:53:08Z | -
dc.date.issued | 2014 | en_US
dc.identifier.other | 2014csckl070 | en_US
dc.identifier.uri | http://144.214.8.231/handle/2031/7514 | -
dc.description.abstract | Extensible Markup Language (XML) plays a vital role in data exchange on the Internet. It can hold a large amount of information and simplifies data storage and sharing in plain-text format. However, multiple XML schema declarations and large volumes of structured data may affect usability. Consolidating multiple XML documents from multiple XML databases into a report takes a long time, because a customized program must be developed to parse the documents, extract the data, and integrate the data into a single document. In addition, integration can raise problems such as rearranged data semantics and redundant data in matching elements. A solution is therefore needed that accepts a global query to retrieve multiple XML documents and reduces the effort of program maintenance. This project develops a prototype that accepts a global query, based on an integrated schema, to retrieve multiple XML documents stored in different XML databases. The approach has three steps: schema integration, query decomposition, and data integration. The prototype handles three cases of schema integration: zero, one, and multiple matching elements. Schema integration is automated and merges two XML schemas into one integrated schema. The prototype then accepts global XQuery input as Path and FLWOR expressions. A global query may be decomposed into two subqueries that access different XML documents. Data integration is also automated and creates an integrated document by integrating data according to the query input. To preserve data semantics and eliminate redundant data, the project designs three techniques: artificial root creation, reversed subtree, and the key/keyRef technique. | en_US
dc.rights | This work is protected by copyright. Reproduction or distribution of the work in any format is prohibited without written permission of the copyright owner. | en_US
dc.rights | Access is restricted to CityU users. | en_US
dc.title | Redundancy-free information retrieval on multiple XML documents with data semantics preservation | en_US
dc.contributor.department | Department of Computer Science | en_US
dc.description.supervisor | Supervisor: Dr. Fong, Shi Piu Joseph; First Reader: Dr. Ngo, Chong Wah; Second Reader: Dr. Chan, Edward | en_US
Appears in Collections: Computer Science - Undergraduate Final Year Projects
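
The three steps named in the abstract (schema integration, query decomposition, data integration) can be illustrated with a minimal Python sketch. This is not the project's prototype, which is not reproduced in this record: the sample documents, the `MAPPING` table, the `run_global_query` function, and the use of an `id` attribute as the deduplication key are all assumptions made for illustration. The sketch only shows the general idea of splitting one query on an integrated element into one subquery per source, collecting the results under an artificial root, and dropping entries that repeat a key.

```python
import xml.etree.ElementTree as ET

# Two hypothetical source documents standing in for two XML databases.
DOC_A = (
    "<customers>"
    "<customer id='c1'><name>Ann</name></customer>"
    "<customer id='c2'><name>Bob</name></customer>"
    "</customers>"
)
DOC_B = (
    "<clients>"
    "<client id='c2'><name>Bob</name></client>"
    "<client id='c3'><name>Cy</name></client>"
    "</clients>"
)

# Hypothetical result of schema integration: one integrated element name
# mapped to the matching path in each source document.
MAPPING = {"person": [(DOC_A, "./customer"), (DOC_B, "./client")]}


def run_global_query(element):
    """Run one subquery per source and merge the results without duplicates."""
    root = ET.Element("integrated")  # artificial root for the merged results
    seen = set()
    for xml_text, path in MAPPING[element]:
        source = ET.fromstring(xml_text)
        for node in source.findall(path):  # subquery against a single source
            key = node.get("id")  # assumed key attribute
            if key in seen:  # redundant copy of a matching element
                continue
            seen.add(key)
            merged = ET.SubElement(root, element, id=key)
            merged.extend(list(node))  # carry the child elements over
    return root


result = run_global_query("person")
ids = [p.get("id") for p in result.findall("person")]
print(ids)
```

In this sketch the first source to supply a key wins; a real integration would need a rule for reconciling conflicting values, which the abstract does not describe.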

Files in This Item:
File | Size | Format
fulltext.html | 146 B | HTML


Items in Digital CityU Collections are protected by copyright, with all rights reserved, unless otherwise indicated.
