Skip navigation
Run Run Shaw Library City University of Hong KongRun Run Shaw Library

Please use this identifier to cite or link to this item: http://dspace.cityu.edu.hk/handle/2031/9482
Full metadata record
DC FieldValueLanguage
dc.contributor.authorWong, Tsz Fungen_US
dc.date.accessioned2021-11-17T04:07:25Z-
dc.date.available2021-11-17T04:07:25Z-
dc.date.issued2021en_US
dc.identifier.other2021eewtf295en_US
dc.identifier.urihttp://dspace.cityu.edu.hk/handle/2031/9482-
dc.description.abstractPeople used to store documents in operating systems and cloud storages; hardly can they retrieve document instantly. They are slow and only search for exact wording without ranking results, and they search for the existence of the keyword but not the level of importance. This study investigates a Document Retrieval System (DRS), which has an efficient document retrieval algorithm. DRS is a web application implemented in Python that store documents and supports multiple users. Specifically, it investigates the concepts of information retrieval, text-mining and natural language processing in producing a search engine. To build up the search engine, a numerical statistic, Term Frequency-Inverse Document Frequency, is used to reflect each word's importance to a document in the entire system. Search results are computed by comparing the similarity of each document and search keys in a dimensionality-reduced TF-IDF matrix. To maintain easy-access and easy-to-use features, DRS provides 31 Application-Programming-Interfaces to perform tasks. To evaluate DRS, standard data collections were imported and tested. DRS was able to work on 10000+ documents. Moreover, users were invited to evaluate DRS.en_US
dc.rightsThis work is protected by copyright. Reproduction or distribution of the work in any format is prohibited without written permission of the copyright owner.en_US
dc.rightsAccess is restricted to CityU users.en_US
dc.titleDocuments retrieval systemen_US
dc.contributor.departmentDepartment of Electrical Engineeringen_US
dc.description.supervisorSupervisor: Prof. Chow, Tommy W S; Assessor: Dr. Chan, K Len_US
Appears in Collections:Electrical Engineering - Undergraduate Final Year Projects 

Files in This Item:
File SizeFormat 
fulltext.html148 BHTMLView/Open
Show simple item record


Items in Digital CityU Collections are protected by copyright, with all rights reserved, unless otherwise indicated.

Send feedback to Library Systems
Privacy Policy | Copyright | Disclaimer