Abstract
Searching interested images based on visual properties or contents of images is a challenging problem and it has received much attention from researchers in the last 20 years. The gap between low-level visual features and high-level semantic understanding of images, which is also known as the semantic gap problem, is the bottleneck to further improvement of the performance of a content-based image retrieval system. In order to solve this semantic gap problem, one of the most popular approaches in recent years is to change the focus from the global content description of images into the local content description by regions (region-based image retrieval) or even the objects in images (object-based image retrieval). Although much research in region-based image retrieval has already been done, there are still three main problems need to be tackled properly: (a) local region-based features, (b) similarity measures, and (c) relevance feedback based on regions. In this paper, we review some recent development in region-based image retrieval with respect to the above three problems and propose some future directions for region-based image retrieval research towards the end of this paper.
Similar content being viewed by others
References
Goodrum, A. (2000). Image information retrieval: An overview of current research. Information Science, 3(2), 63–67.
Smeulders, A., Worring, M., Santini, S., Gupta, A., & Jain, R. (2000). Content-based image retrieval at the end of the early years. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22, 1349–1380.
Datta, R., Li, J., & Wang, J. (2005). Content-based image retrieval: Approaches and trends of the new age. In Proceedings of the ACM SIGMM international workshop on multimedia information retrieval (pp. 253–262).
Datta, R., Joshi, D., Li, J., & Wang, J. (2008). Image retrieval: Ideas, influence, and trends of the new age. ACM Computing Surveys, 40(2), 3–63.
Gevers, T., & Smeulders, A. (1997). PicToSeek: A content-based image search engine for the WWW. In Proceedings of international conference on visual information systems (pp. 93–100).
Sclaroff, S., Taycher, L., & Cascia, M. (1997). ImageRover: A content-based image browser for the world wide web. In Proceedings of IEEE international workshop on content-based access of image and video libraries (pp. 2–9).
Inoue, M. (2004). On the need for annotation-based image retrieval. In Proceedings of workshop on information retrieval in context.
Larish, J. (1995). Kodak’s still picture exchange for print and film use. Advanced Imaging, 10(4), 38–39.
Martucci, M. (1995). Digit still marketing at presslink. Advanced Imaging, 10(4), 34–36.
Zhou, X.-S., & Huang, T. (2002). Unifying keywords and visual contents in image retrieval. IEEE Multimedia, 9(2), 23–33.
Niblack, C., Barber, R., Equitz, W., Flickner, M., Glasman, E., Petkovic, D., et al. (1993). The QBIC project: Querying images by content using color, texture, and shape. In Proceedings of SPIE (pp. 173–187).
Smith, J. (1997). Integrated spatial and feature image systems: Retrieval, analysis, and compression. PhD Thesis, Columbia University.
Ogle, V., & Stonebraker, M. (1995). Chabot: Retrieval from a relational database of images. IEEE Computer, 28(9), 40–48.
Smith, J. R., & Chang, S.-F. (1996). VisualSEEk: A fully automated content-based image query system. In Proceedings of ACM international conference on multimedia (pp. 87–98).
Huang, J., & Kumar, R. (1997). Image indexing using color correlograms. In Proceedings of IEEE conference on computer vision and pattern recognition (pp. 762–768).
Wu, K., & Yap, K.-H. (2006). Fuzzy SVM for content-based image retrieval—a pseudo-label support vector machine framework. IEEE Computational Intelligence Magazine, 1, 10–16.
Ma, W.-Y., & Manjunath, B. (1997). NETRA: A toolbox for navigating large image databases. In Proceedings of IEEE international conference on image processing (pp. 568–571).
Carson, C., Thomas, C., Belongie, S., Hellerstein, J., & Malik, J. (1999). Blobworld: A system for region-based image indexing and retrieval. Lecture Notes in Computer Science, 1614, 660.
Wang, J.-Z., Li, J., & Wiederhold, G. (2001). SIMPLicity: Semantics sensitive integrated matching for picture libraries. IEEE Transactions on Pattern Analysis and Machine Intelligence, 23, 947–963.
Rui, Y., Huang, T., Ortega, M., & Mehrotra, S. (1998). Relevance feedback: A power tool for interactive content-based image retrieval. IEEE Transactions on Circuits and Systems for Video Technology, 6, 644–655.
Frederix, G., Caenen, G., & Pauwels, E. (2000). Panormic, adaptive and reconfigurable interface for similarity search. In Proceeding of IEEE international conference on image processing (Vol. 3, pp. 222–225).
Hiroike, A., Musha, Y., Sugimoto, A., & Mori, Y. (1999). Visualization of information spaces to retrieve and browse image data. In Proceeding of visual: Information and information system (Vol. 1, pp. 152–162).
Wu, K., & Yap, K.-H. (2007). Content-based image retrieval using fuzzy perceptual feedback. Multimedia Tools and Applications, 43, 235–251.
Yap, K.-H., & Wu, K. (2005). A soft relevance framework in content-based image retrieval systems. IEEE Transaction on Circuits and Systems for Video Technology, 15, 1557–1568.
Wang, L., Chan, K.-L., & Xue, P. (2005). A criterion for optimizing kernel parameters in KBDA for image retrieval. IEEE Transaction on Systems, Man, and Cybernetics. Part B. Cybernetics, 35, 556–562.
Zhang, Y.-H., Wang, L., Hartley, R., & Li, H.-D. (2007). Where is the weet-bix. In Proceeding of the Asian conference on computer vision (pp. 800–810).
Tsai, C.-F., McGarry, K., & Tait, J. (2003). Image classification using hybrid neural network. In Proceedings of the ACM SIGIR conference on research and development in information retrieval (pp. 431–432).
Fan, J.-P., Gao, Y.-L., Luo, H.-Z., & Xu, G.-Y. (2004). Automatic image annotation by using concept-sensitive salient objects for image content representation. In Proceedings of the ACM SIGIR conference on research and development in information retrieval (pp. 361–368).
Comanicu, D., & Meer, P. (2002). Mean shift: a robust approach toward feature space analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24, 603–619.
Marques, O., Mayron, L., Borba, G., & Gamba, H. (2006). Using visual attention to extract regions of interest in the context of image retrieval. In Proceedings of ACM annual southeast regional conference (pp. 638–643).
Wang, X.-Y., Hu, F.-L., & Yang, H.-Y. (2006). A novel regions-of-interest based image retrieval using multiple features. In Proceedings of the multi-media modelling international conference (Vol. 1, pp. 377–380).
Zhang, M.-X., Lu, Z.-G., & Shen, J.-Y. (2007). Image retrieval with simple invariant features based hierarchical uniform segmentation. In Proceedings of the international conference on computational intelligence and security (Vol. 1, pp. 461–465).
Deng, Y., Manjunath, B., & Shin, H. (1999). Color image segmentation. In Proceedings of IEEE conference on computer vision and pattern recognition (Vol. 2, pp. 446–451).
Almeida, J., Rocha, A., Torres, R., & Goldenstein, S. (2008). Making colors worth more than a thousand words. In Proceedings of the ACM symposium on applied computing (pp. 1180–1186).
MPEG-7 Overview (version 10). (2004). Online Source – http://www.chiariglione.org/mpeg/standards/mpeg-7/mpeg-7.htm#E12E25.
Manjunath, B., Salembier, P., & Sikora, T. (2001). Introduction to MPEG-7: Multimedia content description interface. New York: Wiley.
Chen, Y., & Wang, J. (2002). A region-based fuzzy feature matching approach to content-based image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24, 1252–1267.
Duda, R., Hart, P., & Stork, D. (2001). Pattern classification second edition. New York: Wiley.
Sun, Y.-Q., & Ozawa, S. (2004). A hierarchical approach for region-based image retrieval. In Proceedings of the IEEE conference on computer vision and pattern recognition (Vol. 2, pp. 996–1001).
Jing, F., Li, M., Zhang, H., & Zhang, B. (2002). An efficient and effective region-based image retrieval framework. IEEE Transactions on Image Processing, 13, 699–709.
Jing, F., Zhang, B., Lin, F.-Z., Ma, W.-Y., & Zhang, H.-J. (2001). A novel region-based image retrieval method using relevance feedback. In Proceedings of ACM workshops on multimedia: Multimedia information retrieval (pp. 28–31).
Jing, F., Li, M.-J., Zhang, H.-J., & Zhang, B. (2004). An efficient and effective region-based image retrieval framework. IEEE Transactions on Image Processing, 13(5), 699–709.
Xiong, W., Qiu, B., Tian, Q., Xu, C.-S., Ong, S., Foong, K., et al. (2005). MultiPRE: A novel framework with multiple parallel retrieval engines for content-based image retrieval. In Proceedings of the ACM international conference on multimedia (pp. 1023–1032).
Rahmani, R., Goldman, S., Zhang, H., Krettek, J., & Fritts, J. (2005). Localized content based image retrieval. In Proceedings of the ACM SIGMM international workshop on multimedia information retrieval (pp. 227–236).
Hastie, T., Tibshirani, R., & Friedman, J. (2001). The elements of statistical learning: Data mining, inference, and prediction. New York: Springer.
Zhang, R.-F., & Zhang, Z.-F. (2004). Hidden semantic concept discovery in region based image retrieval. In Proceedings of the IEEE international conference on systems, man and cybernetics (Vol. 1, pp. 1117–1124).
He, X. (2004). Incremental semi-supervised subspace learning for image retrieval. In Proceedings of the ACM conference on multimedia (Vol. 1, pp. 2–8).
Lin, Y., Liu, T., & Chen, H. (2005). Semantic manifold learning for image retrieval. In Proceedings of the ACM conference on multimedia (Vol. 1, pp. 249–258).
Yu, J., & Tian, Q. (2006). Learning image manifolds by semantic subspace projection. In Proceedings of the ACM conference on multimedia (Vol. 1, pp. 297–306).
Cai, D., He, X., & Han, J. (2007). Spectral regression: A unified subspace learning framework for content-based image retrieval. In Proceedings of the ACM conference on multimedia (Vol. 3, pp. 403–412).
Greenspan, H., Dvir, G., & Rubner, Y. (2000). Region correspondence for image matching via EMD flow. In Workshop on content-based access of image and video libraries (pp. 27–31).
Li, J., Wang, J., & Wiederhold, G. (2000). IRM: Integrated region matching for image retrieval. In Proceedings of the ACM international conference on multimedia (pp. 147–156).
Wang, J., & Du, Y.-P. (2001). Scalable integrated region-based image retrieval using IRM and statistical clustering. In Proceedings of the ACM/IEEE-CS joint conference on digital libraries (pp. 268–277).
Wang, J. (2000). Region-based retrieval of biomedical images. In Proceedings of the ACM international conference on multimedia (pp. 511–512).
Chen, Y.-X., & Wang, J. (2003). Looking beyond region boundaries: A robust image similarity measure using fuzzified region features. In Proceedings of the IEEE international conference on fuzzy systems (Vol. 2, pp. 1165–1170).
Weber, R., & Mlivoncic, M. (2003). Efficient region-based image retrieval. In Proceedings of the international conference on information and knowledge management (pp. 69–76).
Sun, Y.-Q., & Ozawa, S. (2003). Semantic-meaningful content-based image retrieval in wavelet domain. In Proceedings of ACM SIGMM international workshop on multimedia information retrieval (pp. 122–129).
Hsieh, J., & Grimson, W. (2003). Spatial template extraction for image retrieval by region matching. IEEE Transactions on Image Processing, 12(11), 1404–1415.
Lv, Q., Charikar, M., & Li, K. (2004). Image similarity search with compact data structures. In Proceedings of the ACM international conference on information and knowledge management (pp. 208–217).
Vasconcelos, N. (2000). Bayesian models for visual information retrieval. Cambridge: Massachusetts Institute of Technology.
Guo, G., Jain, A., Ma, W.-Y., Zhang, H.-J. (2002). Learning similarity measure for natural image retrieval with relevance feedback. IEEE Transactions on Neural Network, 13, 811–820.
Hertz, T., Bar-Hillel, A., & Weinshall, D. (2004). Learning distance functions for image retrieval. In Proceedings of IEEE conference on computer vision and pattern recognition (Vol. 2, pp. 570–577).
Si, L., Jin, R., Hoi, S., & Lyu, M (2006). Collaborative image retrieval via regularized metric learning. Multimedia Systems, 12, 34–44.
Globerson, A., & Roweis, S. (2005). Metric learning by collapsing classes. In Proceedings of advances in neural information processing systems.
Weinberger, K., Blitzer, J., & Saul, L. (2006). Distance metric learning for large margin nearest neighbor classification. In Proceedings of advances in neural information processing systems (pp. 1473–1480).
Chen, Y., Wang, J., & Krovetz, R. (2005). Clue: Cluster-based retrieval of images by unsupervised learning. IEEE Transactions on Image Processing, 14, 1187–1201.
Chen, Y., Wang, J., & Krovetz, R. (2003). An unsupervised learning approach to content-based image retrieval. In Proceedings of IEEE international symposium of signal processing and applications (Vol. 1, pp. 197–200).
Chapelle, O., Scholkopf, B., & Zien, A. (2006). Semi-supervised learning. Cambridge: MIT.
Wagsta, K., & Cardie, C. (2000). Clustering with instance-level constraints. In Proceedings of the IEEE international conference on machine learning (Vol. 1, pp. 1103–1110).
Wagsta, K., Cardie, C., Rogers, S., & Schroell, S. (2001). Constrained k-means clustering with background knowledge. In Proceedings of the IEEE international conference on machine learning (Vol. 1, pp. 577–584).
Klein, D., & Kamvar, S. (2002). From instance-level constraints to space-level constraints: Making the most of prior knowledge in data clustering. In Proceedings of the IEEE international conference on machine learning (Vol. 1, pp. 307–314).
Xing, E., Ng, A., Jordan, M., & Russell, S. (2003). Distance metric learning, with application to clustering with side-information. In Proceedings of the advances in neural information processing systems (Vol. 1, pp. 505–512).
Bar-Hillel, A., Hertz, T., Shental, N., & Weinshall, D. (2003). Learning distance functions using equivalence relations. In Proceedings of IEEE international conference on machine learning (Vol. 1, pp. 11–18).
Shental, N., Bar-Hillel, A., Hertz, T., & Weinshall, D. (2004). Computing Gaussian mixture models with EM using side-information. In Proceedings of the advances in neural information processing systems.
Chang, H., & Yeung, D.-Y. (2005). Stepwise metric adaptation based on semi-supervised learning for boosting image retrieval performance. In Proceedings of the British machine vision conference.
Heisterkamp, D., Peng, J., & Dai, H.-K. (2001). Adaptive quasiconformal kernel metric for image retrieval. In Proceedings of IEEE conference on computer vision and pattern recognition (Vol. 2, pp. 388–393).
Chang, H., & Yeung, D. (2006). Locally linear metric adaption with application to semi-supervised clustering and image retrieval. Pattern Recognition, 39, 1253–1264.
Frome, A., Singer, Y., & Malik, J. (2006). Image retrieval and classification using local distance functions. In Proceedings of advances in neural information processing systems.
Ulusoy, I., & Bishop, C. (2005). Generative versus discriminative methods for object recognition. In Proceedings of IEEE conference on computer vision and pattern recognition.
Tao, D., Tang, X., Li, X., & Wu, X. (2006). Asymmetric bagging and random subspace for support vector machines- based relevance feedback in image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 28(7), 1088–1099.
Wang, L., Chan, K.-L., & Zhang, Z.-H. (2003). Bootstrapping SVM active learning by incorporating unlabelled images for image retrieval. In Proceedings of IEEE conference on computer vision and pattern recognition.
Tieu, K., & Viola, P. (2004). Boosting image retrieval. International Journal of Computer Vision, 56(1-2), 17–36.
Jiang, W., Er, G., Dai, Q., & Gu, J. (2006). Similarity-based online feature selection in content-based image retrieval. IEEE Transactions on Image Processing, 15(3), 702–712.
Tong, S., & Chang, E. (2001). Support vector machine active learning for image retrieval. In Proceedings of ACM international conference on multimedia (pp. 107–118).
Westerveld, T., & De Vries, A. (2005). Experimental evaluation of a generative probabilistic image retrieval model on ’easy’ data. In Proceedings of multimedia information retrieval workshop.
Jiang, W., Chan, K.-L., Li, M.-J., & Zhang, H.-J. (2005). Mapping low-level features to high-level semantic concepts in region-based image retrieval. In Proceedings of IEEE conference on computer vision and pattern recognition (Vol. 2, pp. 244–249).
Crisianini, N., & Shawe-Taylor, J. (2000). An introduction to support vector machines and other kernel-based learning methods. Cambridge University Press.
Zhang, L., Lin, F., & Zhang, B. (2001). Support vector machine learning for image retrieval. In Proceedings of IEEE international conference on image processing (pp. 721–724).
Chen, Y.-Q., Zhou, X., & Huang, T. (2001). One-class SVM for learning in image retrieval. In Proceedings of IEEE international conference on image processing.
Wang, L., Gao, Y., Chan, K.-L., Xue, P., & Yau, W.-Y. (2005). Retrieval with knowledge-driven kernel design: an approach to improving SVM-based CBIR with relevance. In Proceedings of IEEE international conference on computer vision.
Viola, P., & Jones, M. (2001). Rapid object detection using a boosted cascade of simple features. In Proceedings of IEEE conference on computer vision and pattern recognition (pp. 511–518).
Amores, J., Sebe, N., Radeva, P., Gevers, T., & Smeulders, A. (2004). Boosting contextual information in content-based image retrieval. In Proceedings of ACM SIGMM international workshop on multimedia information retrieval (pp. 31–38).
Schapire, R. (2003). The boosting approach to machine learning: An overview. New York: Springer.
Nguyen, G., & Worring, M. (2003). Query definition using interactive saliency. In Proceedings of the ACM SIGMM international workshop on multimedia information retrieval (pp. 150–156).
Silva, S., Barcelos, C., & Batista, M. (2006). An image retrieval system adaptable to user’s interests by the use of relevance feedback via genetic algorithm. In Proceedings of the Brazilian symposium on multimedia and web (pp. 45–52).
Guan, J., & Qiu, G.-P. (2007). Learning user intention in relevance feedback using optimization. In Proceedings of the ACM SIGMM international workshop on multimedia information retrieval (pp. 41–50).
Savova, V., Tenenbaum, J., Kaelbling, L., & Yuille, A. (2007). The grammar of vision: Probabilistic grammar-based models for visual scene understanding and object categorization. In Proceedings of the annual conference on neural information processing systems.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Huang, W., Gao, Y. & Chan, K.L. A Review of Region-Based Image Retrieval. J Sign Process Syst Sign Image Video Technol 59, 143–161 (2010). https://doi.org/10.1007/s11265-008-0294-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11265-008-0294-3