PolyU Institutional Repository >
Electronic and Information Engineering >
EIE Theses >
Please use this identifier to cite or link to this item:
|Title: ||Shape analysis for image retrieval|
|Authors: ||Choi, Wai-pak|
|Subjects: ||Hong Kong Polytechnic University -- Dissertations|
Image processing -- Digital techniques
Optical pattern recognition
|Issue Date: ||2003 |
|Publisher: ||The Hong Kong Polytechnic University|
|Abstract: ||Content-based image retrieval (CBIR) system is designed to help retrieve relevant images in an image database based on their image contents. This system will allow queries on large image databases based on example images, user-constructed sketches and drawings, and other graphical information. Different image features, or descriptions, may have different significance and effectiveness in the interpretation and representation of images in different applications. The Moving Picture Experts Group (MPEG) of the International Standards Organization (ISO) initiated the MPEG-7 standard, which provides standardized core technologies that allow for the description of audiovisual data content in multimedia environments. The most challenging technical issues for a CBIR system are the effectiveness and efficiency of feature extraction and recognition algorithms for content-based image retrieval. The objectives of this thesis are to investigate and develop efficient techniques for shape feature extraction, and to construct a content-based image retrieval system. An introduction to the general concept of image retrieval will be given in this thesis, and the recent development of the MPEG-7 standard will be described. Existing content-based image retrieval systems, and the feature extraction and recognition techniques based on color, texture, shape and motion will be reviewed. Furthermore, more efficient and effective features will be proposed so that a reliable and practical retrieval system becomes possible. Shape descriptors, which are high level descriptions, will be emphasized in this research work.|
In this research, the content-based image retrieval system developed consists of three major parts: boundary extraction, feature extraction and recognition. The first part is based on an active contour model for representing image contours. We have proposed an efficient active contour model which can represent highly irregular boundaries. The contour points can be used to form other shape descriptors such as chain code, curvature scale-space representation, skeleton, etc. After extracting the boundaries, the second part is skeletonization which is an important process that can provide a compact shape representation. We have proposed a fast, efficient and accurate skeletonization method for the extraction of a well-connected Euclidean skeleton based on the boundary information. The skeleton feature can be used as a shape descriptor, which can represent the shape more compactly, and consists of spatial and structural information. In the third part, we have proposed a robust and efficient histogram representation scheme for shape retrieval, which is based on the normalized maximal disks used to represent the shape of an object. The maximal disks are extracted by means of the fast skeletonization technique with a pruning algorithm. The logarithm of the radii of the normalized maximal disks is used to construct a histogram to represent the shape. The proposed representation scheme outperforms the other methods under affine transformation, different distortions and noise levels. Hence, these three major parts are integrated to form a complete system for content-based image retrieval. We have also devised a contour/region-based matching algorithm has been used for retrieving relevant images containing similar shapes from a database. In the algorithm, Hausdorff distance is used to measure the similarity of two point sets. We have devised a robust line-feature-based approach for model-based recognition based on this distance measure. The proposed algorithm can achieve a good performance level in matching, even in a noisy environment or with the existence of occlusion, and can be used as a similarity measure for image retrieval.
|Degree: ||Ph.D., Dept. of Electronic and Information Engineering, The Hong Kong Polytechnic University, 2003.|
|Description: ||xvii, 152 leaves : ill. ; 30 cm.|
PolyU Library Call No.: [THS] LG51 .H577P EIE 2003 Choi
|Rights: ||All rights reserved.|
|Appears in Collections:||EIE Theses|
PolyU Electronic Theses
All items in the PolyU Institutional Repository are protected by copyright, with all rights reserved, unless otherwise indicated.
No item in the PolyU IR may be reproduced for commercial or resale purposes.