Abstract
Traditionally, Content-Based Image Retrieval (CBIR) searches a large image collection for images that match a user-submitted query image or a sketch drawn by the user. However, there is often limited support for retrieving images from large databases that are semantically similar in a way that matches the user’s perception. In this paper, we address this semantic gap problem in CBIR by performing clustering-based retrieval. In the proposed approach, we first perform continuous probabilistic semi-supervised clustering to group similar images into macro clusters. The macro clusters so formed ensure class-wise similarity rather than semantic similarity. To retrieve semantically matching images from these macro clusters, the CBIR method adopts a cluster-within-cluster approach. The key idea is that the macro clusters formed during the initial phase are further divided into micro clusters using a decision tree approach. For retrieval, the macro cluster matching the user’s query is found first. Next, to ensure semantic similarity, the query image is classified into the matching micro cluster. The proposed method is evaluated experimentally first on the Wang database, which contains complex and diverse images with varying fine details. The experiments are then repeated on the Ponce group database and the Corel 5K database. The experimental results demonstrate the effectiveness of the proposed approach.
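The two-step lookup described in the abstract (macro cluster first, then micro cluster) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that macro labels are already available, uses nearest-centroid matching as a stand-in for the probabilistic semi-supervised clustering, and uses a single threshold split as a stand-in for the decision tree. The names `build_index` and `retrieve` and the toy feature vectors are hypothetical.

```python
from math import dist
from statistics import mean


def centroid(vectors):
    """Component-wise mean of a list of equal-length feature vectors."""
    return [mean(column) for column in zip(*vectors)]


def build_index(features, macro_labels):
    """Group images by macro label, then split each group into two micro clusters.

    Assumption: one threshold on the widest-spread feature stands in for
    the decision tree that forms micro clusters in the paper.
    """
    index = {}
    for label in set(macro_labels):
        ids = [i for i, l in enumerate(macro_labels) if l == label]
        vectors = [features[i] for i in ids]
        # Choose the feature with the largest range inside this macro cluster.
        spreads = [max(column) - min(column) for column in zip(*vectors)]
        axis = spreads.index(max(spreads))
        threshold = mean(v[axis] for v in vectors)
        micro = {False: [], True: []}
        for i in ids:
            micro[features[i][axis] > threshold].append(i)
        index[label] = {
            "centroid": centroid(vectors),
            "axis": axis,
            "threshold": threshold,
            "micro": micro,
        }
    return index


def retrieve(index, query):
    """Step 1: pick the nearest macro cluster. Step 2: pick its micro cluster."""
    label = min(index, key=lambda l: dist(index[l]["centroid"], query))
    node = index[label]
    return label, node["micro"][query[node["axis"]] > node["threshold"]]


# Toy 2-D feature vectors (hypothetical data, for illustration only).
features = [
    [0.1, 0.1], [0.2, 0.9], [0.1, 0.2], [0.2, 0.8],  # macro cluster "a"
    [0.9, 0.1], [0.8, 0.2], [0.9, 0.9], [0.8, 0.8],  # macro cluster "b"
]
macro_labels = ["a", "a", "a", "a", "b", "b", "b", "b"]

index = build_index(features, macro_labels)
label, matches = retrieve(index, [0.15, 0.85])
print(label, matches)
```

Because the query is compared only against macro cluster centroids and then routed by one threshold test, the lookup touches far fewer images than a full scan. A real system would replace both stand-ins with the clustering model and decision tree the paper describes.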
Cite this article
S, N.C., Gangodkar, D. A novel image retrieval technique based on semi supervised clustering. Multimed Tools Appl 80, 35741–35769 (2021). https://doi.org/10.1007/s11042-021-11542-3
DOI: https://doi.org/10.1007/s11042-021-11542-3