Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/3428
Title: Search engine selection with hierarchical categorization
Authors: Shiu, Koon-hang
Subjects: Hong Kong Polytechnic University -- Dissertations
Web search engines
Search engines
Categories (Mathematics)
Issue Date: 2003
Publisher: The Hong Kong Polytechnic University
Abstract: Search engines are very successful in locating resources on the Internet. However they cover only small and different parts of the vast Internet. While using metasearching to simultaneously engage multiple general search engines can increase the coverage, users encounter difficulties in finding target information in the large set of results returned. In addition, a large amount of valuable resources are only available behind specialty search engines or databases. The problem of covering the Internet has been replaced by the problem of selecting suitable search engines for a given query. This thesis focuses on developing an algorithm to construct a hierarchical category of specialty search engines automatically to assist in solving the coverage problem and the search engine selection problem. We believe that metasearching specialty search engines can effectively discover large amount of hidden Web resources. In particular, we proposed to categorize specialty search engines automatically by sending probe queries to them, fetch and analyze the returned result documents. By determining the relevancies between the returned documents and search categories, specialty search engines can be associated with nodes in a hierarchical search engine category directory for metasearching. By utilizing the category, search engines that have a high possibility of relevant information and resources can be easily selected by a metasearch engine. In this thesis, we present the concepts, designs, implementation details, and validation of the proposed categorization algorithm and a metasearch engine prototype, which seeks to demonstrate such a search engine category can be beneficial in finding essential Web resources.
Description: vi, 84 leaves : ill. ; 30 cm.
PolyU Library Call No.: [THS] LG51 .H577M COMP 2003 Shiu
Rights: All rights reserved.
Type: Thesis
URI: http://hdl.handle.net/10397/3428
Appears in Collections:COMP Theses
PolyU Electronic Theses

Files in This Item:
File Description SizeFormat 
b17330580_link.htmFor PolyU Users 162 BHTMLView/Open
b17330580_ir.pdfFor All Users (Non-printable) 2.88 MBAdobe PDFView/Open


All items in the PolyU Institutional Repository are protected by copyright, with all rights reserved, unless otherwise indicated. No item in the PolyU IR may be reproduced for commercial or resale purposes.