Abstract
In this paper, various prominent local and global descriptors are evaluated against each other for analyzing their performance on shape-based image retrieval. Local descriptors include Fourier descriptors, Weber’s local descriptor, local binary patterns, and local ternary patterns. The prominent global descriptors include moment invariants, generic Fourier descriptor (GFD), angular radial transform (ART), wavelet moments (WM), and Zernike moment descriptor (ZMD). In addition, a novel local descriptor is proposed based on the histograms of circular arcs and linear edges, which are detected by means of Hough transform. The proposed local descriptor provides features, which are invariant to geometric transformations and are robust to noise as compared to some existing prominent local descriptors. We also propose an improvement in the performance of global descriptors GFD, ART, WM, and ZMD by taking advantage of the phase information in the comparison process along with their magnitude. Subsequently, the local and global descriptors with the best image-retrieval performances are combined to design an effective retrieval system, which further enhances the retrieval performance substantially. All descriptors are analyzed in terms of six principles set by MPEG-7. Detailed experiments are performed on standard benchmark image databases along with their rotation-invariance and noise test. The results of experiments reveal that the proposed fusion of local and global descriptors outperforms other major descriptors.
Similar content being viewed by others
References
Mejdoub, M., et al.: Embedded lattices tree: an efficient indexing scheme for content based retrieval on image databases. J. Vis. Commun. Image R. 20(2), 145–156 (2009)
Ren, F., Bracewell, D.B.: Advanced information retrieval. Electron. Notes Theor. Comput. Sci. 225(2), 303–317 (2009)
Faloutsos, C., et al.: Efficient and effective querying by image content. J. Intell. Inf. Syst. 3, 231–262 (1994)
Pentland, R.P., Scalroff, S.: Photobooks: tools for content-based manipulation of image databases. In: SPIE Conf. Storage Retrieval Image Video Databases, vol. II, pp. 33–47 (1994)
Mether, M., Kankanhall, M.S., Lee, W.F.: Content-based image retrieval using a composite color-shape approach. Inf. Process. Manag. 34(1), 109–120 (1998)
Smeulders, A.W.M., et al.: Content-based image retrieval at the end of the early years. IEEE Trans. Pattern Anal. Mach. Intell. 22, 1349–1379 (2000)
Hu, M.K.: Visual pattern recognition by moment invariants. IRE Trans. Inf. Theory 8, 179–187 (1962)
Zhang, D., Lu, G.: Shape-based image retrieval using generic Fourier descriptor. Signal Process. Image Commun. 17, 825–848 (2002)
Bober, M.: MPEG-7 visual shape descriptors. IEEE Trans. Circuits Syst. Video Technol. 11(6), 716–719 (2001)
Shen, D., Ip, H.H.S.: Discriminative wavelet shape descriptors for recognition of 2-D patterns”. Pattern Recognit. 32, 151–165 (1999)
Kim, W.-Y., Kim, Y.-S.: A region based shape descriptor using Zernike moments. Signal Process. Image Commun. 16, 95–102 (2000)
Zhang, D., Lu, G.: A comparative study of curvature scale space and Fourier descriptors for shape based image retrieval. J. Vis. Commun. Image R. 14, 41–60 (2003)
Chen, J., Shan, S., He, C., Zhao, G., Pietikainen, M., Chen, X., Gao, W.: WLD: a robust image local descriptor. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1705–1720 (2010)
Ojala, T., Pietikainen, M.: Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans. Pattern Anal. Machine Intell. 24(7), 971–986 (2002)
Moore, S., Bowden, R.: Local binary patterns for multi-view facial expression recognition. Comput. Vis. Image Underst. 115, 541–558 (2011)
Tan, X., Triggs, B.: Enhanced local texture features set for face recognition under different lighting conditions. IEEE Trans. Image Process. 19(6), 1635–1650 (2010)
Zhang, D., Lu, G.: Evaluation of MPEG-7 shape descriptors against other shape descriptors. Multimed. Syst. 9, 15–30 (2003)
Amanatiadis, A., Kaburlasos, V.G., Gasteratos, A., Papadakis, S.E.: Evaluation of shape descriptors for shape based image retrieval. IET Image Process. 5(5), 493–499 (2011)
Kim, H., Kim, J.: Region-based shape descriptor invariant to rotation, scale and translation. Signal Process. Image Commun. 16, 87–93 (2000)
Chen, Z., Sun, S.-K.: A Zernike moment phase-based descriptor for local image representation and matching. IEEE Trans. Image Proc. 19(1), 205–219 (2010)
Oppenheim, A.V., Lim, J.S.: The importance of phase in signals. Proc. IEEE 69(5), 529–550 (1981)
Li, S., Lee, M.-C., Pun, C.-M.: Complex Zernike moments features for shape based image retrieval. IEEE Trans. Syst. Man Cybern. Part A Syst. Hum. 39(1), 227–237 (2009)
Revaud, J., Lavoue, G., Baskurt, A.: Improving Zernike moments comparison for optimal similarity and rotation angle retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 31(4), 627–636 (2009)
Chen, Z., Sun, S.-K.: A Zernike moment phase-based descriptor for local image representation and matching. IEEE Trans. Image Process. 19(1), 205–219 (2010)
Belongie, S., Malik, J., Puzicha, J.: Shape matching and object recognition using shape contexts. IEEE Trans. Pattern Anal. Mach. Intell. 24(4), 509–522 (2002)
Tu, Z., Yuille, A.: Shape matching and recognition-using generative models and informative features. Proc. Eur. Conf. Comput. Vis. III, 195–209 (2004)
Sebastian, T.B., Klein, P.N., Kimia, B.B.: Recognition of shapes by editing their shock graphs. IEEE Trans. Pattern Anal. Mach. Intell. 26(5), 550–571 (2004)
Ling, H.B., Jacobs, D.W.: Shape classification using the inner-distance. IEEE Trans. Pattern Anal. Mach. Intell. 29(2), 286–299 (2007)
Kauppinen, H., Seppanen, T., Pietikainen, M.: An experimental comparison of autoregressive and Fourier-based descriptors in 2D shape classification. IEEE Trans. PAMI 17(2), 201–207 (1995)
Lo, R.-C., Tsai, W.-H.: Gray-scale Hough transform for thick line detection in gray-scale images. Pattern Recognit. 28(5), 647–661 (1995)
Duda, R.O., Hart, P.E.: Pattern classification and scene analysis, pp. 335–337. Wiley, New York (1973)
Gonzalez, R.C., Woods, R.E.: Digital Image Processing. Prentice Hall, Upper Saddle River, NJ (2008)
Canny, J.: A computational approach for edge detection. IEEE Trans. Pattern Anal. Mach. Intell. 8(6), 679–698 (1986)
Ballard, D.H.: Generalizing the Hough transform to detect arbitrary shapes. Pattern Recognit. 13(2), 111–122 (1981)
Zhang, D.S., Lu, G.: Generic Fourier descriptor for shape-based image retrieval. Proc. IEEE Int. Conf. Multimed. Expo (ICME2002), pp. 425–428 (2002)
Unser, M., Aldroubi, A., Eden, M.: On the asymptotic convergence of B-spline wavelets to Gabor functions. IEEE Trans. Inf. Theory 38, 864–872 (1992)
Zernike, F.: Beugungstheorie des Schneidenverfahrens und seiner verbesserten Form, der Phasenkontrastmethode. Physica 1, 689–701 (1934)
Teague, M.R.: Image analysis via the general theory of moments. J. Opt. Soc. Am. 70, 920–930 (1980)
Swain, M.J., Ballard, D.H.: Color indexing. Int. J. Comput. Vis. 7(1), 11–32 (1991)
Tamura, H., Mori, S., Yamawaki, T.: Textural features corresponding to visual perception. IEEE Trans. Syst. Man Cybernet. 8(6), 460–473 (1978)
Singh, C., Walia, E.: Algorithms for fast computation of Zernike moments and their numerical stability. Image Vis. Comput. 29(4), 251–259 (2011)
Acknowledgments
We are thankful to anonymous reviewers for their useful comments and suggestions to raise the standard of the paper. We are also thankful to the University Grant Commission (UGC), New Delhi, India, for providing research fellowship to Pooja for carrying out the research work leading to PhD degree.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by T. Plagemann.
Rights and permissions
About this article
Cite this article
Singh, C., Sharma, P. Performance analysis of various local and global shape descriptors for image retrieval. Multimedia Systems 19, 339–357 (2013). https://doi.org/10.1007/s00530-012-0288-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00530-012-0288-7