Skip to main content
Log in

A novel image retrieval technique based on semi supervised clustering

  • 1166: Advances of machine learning in data analytics and visual information processing
  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Traditionally Content-Based Image Retrieval (CBIR) problems investigate the occurrence of images matching to a user-submitted query image or a sketch drawn by the user within a large image collection. However, there is often limited support for retrieving semantically similar images from large databases, matching the user’s perception. In this paper, we try to address this semantic gap problem in CBIR by performing a clustering-based retrieval. In the proposed approach we first perform a continuous probabilistic semi-supervised clustering to group similar images to form macro clusters. Macro clusters so formed, ensures class-wise similarity instead of semantic similarity. To retrieve the semantically matching images from these macro clusters formed, the CBIR method is adopted using a cluster within-cluster approach. The key idea is that the macro clusters formed during the initial phase of classification are further classified into micro clusters based on the decision tree approach. For retrieval, as the first step, the macro cluster matching to the user’s query is found. In the next step, to ensure semantic similarity the image is classified to the matching micro cluster. The proposed method is experimentally evaluated first on Wang database which contains complex and diverse images with varying fine details. Further, the experiments are repeated on the Ponce group database and Corel 5K database. The experimental results obtained demonstrate the effectiveness of the proposed approach.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21
Fig. 22
Fig. 23
Fig. 24
Fig. 25
Fig. 26
Fig. 27
Fig. 28
Fig. 29
Fig. 30

Similar content being viewed by others

References

  1. Afifi AJ, Ashour WM (2012) Content-based image retrieval using invariant color and texture features, in Proc IEEE Int Conf Dig Image Comp Tech Appl (DICTA), Fremantle, Australia, pp. 1–6

  2. Ajorloo H, Lakdashti A (2011) A feature relevance estimation method for content-based image retrieval. Int J Inf Technol Decis Mak 10(05):933–961

    Article  Google Scholar 

  3. Aslandogan YA, Yu CT (1999) Techniques and systems for image and video retrieval. IEEE Trans Knowl Data Eng 11(1):56–63

    Article  Google Scholar 

  4. Carson C, Belongie S, Greenspan H, Malik J (2002) Blobworld: Image segmentation using expectation-maximization and its application to image querying. IEEE Trans Pattern Anal Machine Intell 24(8):1026–1038

    Article  Google Scholar 

  5. S.N. Chandran, D. Gangodkar, and A. Mittal, A semi-supervised probabilistic model for clustering large databases of complex images, Multimedia Tools and Applications, vol. 76, no. 21, 2017, pp. 21937–21959

    Article  Google Scholar 

  6. Chen Y, Wang JZ, Krovetz R (2003) Content-based image retrieval by clustering, in Proc. 5th ACM SIGMM Int. workshop on Multimedia Information Retrieval, Berkeley, CA, USA, pp. 193–200

  7. Corel 5K Dataset [Online] Available: www.ci.gxnu.edu.cn/cbir/Dataset.aspx

  8. Cox IJ, Miller ML, Minka TP, Papathomas TV, Yianilos PN (2000) The Bayesian image retrieval system, PicHunter: theory, implementation, and psychophysical experiments. IEEE Trans Image Process 9(1):20–37

    Article  Google Scholar 

  9. De Marsicoi M, Cinque L, Levialdi S (1997) Indexing pictorial documents by their content: a survey of current techniques. Image Vis Comp 15(2):119-141

  10. Fei-Fei L, Fergus R, Perona P (2007) Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories. Comput Vis Image Underst 106(1):59–70

    Article  Google Scholar 

  11. Flickner M, Sawhney H, Niblack W, Ashley J, Huang Q, Dom B, Gorkani M, Hafner J, Lee D, Petkovic D, Steele D (1995) Query by image and video content: The QBIC system. Computer 28(9):23–32

    Article  Google Scholar 

  12. Goldberger J, Greenspan H, Gordon S (2002) Unsupervised image clustering using the information bottleneck method. Patt Recogn, pp. 158–165

  13. Guo JM, Wu MF (2009) Improved block truncation coding based on the void-and-cluster dithering approach. IEEE Trans Image Process 18(1):211–213

    Article  MathSciNet  Google Scholar 

  14. Huang T, Mehrotra S, Ramchandran K (1996) Multimedia analysis and retrieval system (MARS) project, in Proc. 33rd Annual Clinic on Library Applications of Data Processing- Digital Image Access and Retrieva, University of Illinois, Urbana-Champaign, pp. 100-117

  15. Jain AK, Dubes RC (1988) Algorithms for clustering data Prentice-Hall, Inc

  16. Jing F, Li M, Zhang L, Zhang HJ, Zhang B (2003) Learning in region-based image retrieval. Image and Video Retrieval, pp. 199–204

  17. Karakos D, Khudanpur S, Eisner J, Priebe CE (2005) Unsupervised classification via decision trees: An information-theoretic perspective, in Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, (ICASSP’05). Pennsylvania, USA , v-1081

  18. Kundu MK, Chowdhury M, Buló SR (2015) A graph-based relevance feedback mechanism in content-based image retrieval. Knowl-Based Syst 73:254–264

  19. Latif A, Rasheed A, Sajid U, Ahmed J, Ali N, Ratyal NI, Zafar B, Dar SH, Sajid M, Khalil T (2019) Content-based image retrieval and feature extraction: a comprehensive review, Mathematical Problems in Engineering

  20. Liu B, Xia Y, Yu PS (2000) Clustering through decision tree construction, in Proc. Ninth Int. Conf. on Information and knowledge management (CIKM) McLean, VA, USA, pp. 20–29

  21. Liu Y, Zhang D, Lu G (2008) Region-based image retrieval with high-level semantics using decision tree learning. Pattern Recogn 41(8):2554–2570

    Article  Google Scholar 

  22. Ma WY, Manjunath BS (1997) Netra: A toolbox for navigating large image databases, in Proc. IEEE Int Conf Image Proc, Santa Barbara, California 1:568-571

  23. Montazer GA, Giveki D (2015) Content based image retrieval system using clustered scale invariant feature transforms. Optik-International Journal for Light and Electron Optics 126(18):1695–1699

    Article  Google Scholar 

  24. Park SS, Seo KK, Jang DS (2007) Fuzzy art-based image clustering method for content-based image retrieval. Int J Inf Technol Decis Mak 6(02):213–233

    Article  Google Scholar 

  25. Pass G, Zabih R (1996) Histogram refinement for content-based image retrieval, in Proc. 3rd IEEE Workshop on Applications of Computer Vision (WACV’96), Sarasota, Florida, USA

  26. Pentland AP, Picard RW, Scarloff S (1996) Photobook: Content-based manipulation for image databases. Int J Comput Vis 18(3):233–254

    Article  Google Scholar 

  27. Ponce group birds and butterflies database [Online] Available: http://www-cvr.ai.uiuc.edu/ponce-grp/data/

  28. Rao MB, Rao BP, Govardhan A (2011) CTDCIRS: content based image retrieval system based on dominant color and texture features. Int J Comput Appl 18(6):40–46

    Google Scholar 

  29. Rokach L, Maimon LO (2005) Clustering methods, Data mining and knowledge discovery handbook Springer, pp. 321-352

  30. Rui Y, Huang TS, Ortega M, Mehrotra S (1998) Relevance feedback: a power tool for interactive content-based image retrieval, IEEE Trans Circ Syst Vid Tech 8(5):644-655

  31. Sezavar A, Farsi H, Mohamadzadeh S (2019) Content-based image retrieval by combining convolutional neural networks and sparse representation. Multimed Tools Appl 78:20895–20912

    Article  Google Scholar 

  32. Sheikholeslami G, Chang W, Zhang A (2002) SemQuery: semantic clustering and querying on heterogeneous features for visual data, IEEE Trans Knowl Data Eng 14(5):988–1002

  33. Shrivastava N, Tyagi V (2014) Content based image retrieval based on relative locations of multiple regions of interest using selective regions matching. Inf Sci 259:212–224

    Article  Google Scholar 

  34. Smeulders AW, Worring M, Santini S, Gupta A, Jain R (2000) Content-based image retrieval at the end of the early year. IEEE Trans Pattern Anal Mach Intell 22(12):1349–1380

    Article  Google Scholar 

  35. Smith JR, Chang SF (1996) VisualSEEK: A fully automated content-based image query system, in Proc. fourth ACM Int. Conf. on Multimedia, Boston, Massachusetts, USA

  36. Tai XY, Wang LD, Chen Q, Fuji R, Kenji KK (2009) A New Method Of Medical Image Retrieval Based On Color–Texture Correlogram and Gti Mode. Int J Info Tech Dec Making 8(2):239–248

  37. Town C, Sinclair D (2000) Content based image retrieval using semantic visual categories, Society of Manufacturing Engineers

  38. Wan J, Wang D, Hoi SCH, Wu P, Zhu J, Zhang Y, Li J (2014) Deep learning for content-based image retrieval: A comprehensive study, in Proc. 22nd ACM Int. Conf. on Multimedia, Orlando, Florida, USA, pp. 157–166

  39. Wang XY, Liang LL, Li WY, Li DM, Yang HY (2016) A new SVM-based relevance feedback image retrieval using probabilistic feature and weighted kernel function. J Vis Commun Image Represent 38:256–275

    Article  Google Scholar 

  40. Wang JZ, Li J, Wiederhold G (2001) SIMPLIcity: Semantics-sensitive integrated matching for picture libraries, IEEE Trans. Pattern Anal Mach Intell 23(9):947–963

    Article  Google Scholar 

  41. Wangdatabase [Online] Available: http://wang.ist.psu.edu/docs/related.html

  42. Yikun Y, Shengjie J, Jinrong H, Bisheng X, Jiabo L, Ru X (2020) Image retrieval via learning content-based deep quality model towards big data, Future Generation Computer Systems

  43. Younus ZS, Mohamad D, Saba T, Alkawaz MH, Rehman A, Al-Rodhaan M, Al-Dhelaan A (2015) Content-based image retrieval using PSO and K-means clustering algorithm. Arab J Geosci 8(8):6211–6224

    Article  Google Scholar 

  44. Zhu S, Zou L, Fang B (2014) Content based image retrieval via a transductive model. J Intell Inf Syst 42(1):95–109

    Article  Google Scholar 

  45. Zhang B, Gao Y, Liu J (2010) Local derivative pattern versus local binary pattern: face recognition with high-order local pattern descriptor. IEEE Trans Image Processing 19(2):533–544

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nisha Chandran S.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

S, N.C., Gangodkar, D. A novel image retrieval technique based on semi supervised clustering. Multimed Tools Appl 80, 35741–35769 (2021). https://doi.org/10.1007/s11042-021-11542-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-021-11542-3

Keywords

Navigation