Skip to main content

A Fusion of Bag of Word Model and Hierarchical K-Means++ in Image Retrieval

  • Conference paper
  • First Online:
Integrated Uncertainty in Knowledge Modelling and Decision Making (IUKM 2016)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9978))

Abstract

This paper proposes an Image Retrieval method using the Bag of Word model combining with Hierarchical K-Means++ algorithm to store and arrange databases, called BoW-HKM++ approach. A Hierarchical tree is proposed to improve the processing speed in retrieval data. In this research: (1) we first build a bag of word model based on SIFT feature extraction on images; (2) then show its improvement by using inverted index technical for whole feature databases; (3) the Hierarchical K-Means++ algorithm is also proposed in the bag of word model to optimize retrieval system. Experiment results were evaluated with k-fold cross-validation method on 2 datasets of James Z. Wang Research Group. Finally, the obtained results will be compared with other method and discussed to prove the effectiveness of proposed framework in Image Retrieval problems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)

    Article  Google Scholar 

  2. Nister, D., Stewenius, H.: Scalable recognition with a vocabulary tree. In: Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2006, vol. 2, pp. 2161–2168. IEEE Computer Society, Washington, DC, USA (2006)

    Google Scholar 

  3. Lopes, A.P.B., Avila, S.E.F., Peixoto, A.N.A., Oliveira, R.S., Araújo, A., de A.A.: A bag-of-features approach based on Hue-SIFT descriptor for nude detection. In: 17th European Signal Processing Conference (EUSIPCO 2009), pp. 1552–1556 (2009)

    Google Scholar 

  4. Korytkowski, M., Scherer, R., Staszewski, P., Woldan, P.: Bag-of-features image indexing and classification in Microsoft SQL server relational database. In: 2015 IEEE 2nd International Conference on Cybernetics (CYBCONF), pp. 478–482 (2015)

    Google Scholar 

  5. Law, M.T., Thome, N., Cord, M.: Bag-of-words image representation: key ideas and further insight. In: Ionescu, B., Benois-Pineau, J., Piatrik, T., Quénot, G. (eds.) Fusion in Computer Vision. Advances in Computer Vision and Pattern Recognition, pp. 29–52. Springer International Publishing, Switzerland (2014)

    Chapter  Google Scholar 

  6. Alkhawlani, M., Elmogy, M., Elbakry, H.: Content-based image retrieval using local features descriptors and bag-of-visual words. Int. J. Adv. Comput. Sci. Appl. (IJACSA) 6(9) (2015)

    Google Scholar 

  7. Liu, J.: Image retrieval based on bag-of-words model. arXiv preprint arXiv:1304.5168

  8. Arthur, D., Vassilvitskii, S.: k-means++: the advantages of careful seeding. In: SODA 2007 Proceedings of the Eighteenth Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 1027–1035 (2007)

    Google Scholar 

  9. Ledwich, L., Williams, S.: Reduced SIFT features for image retrieval and indoor localisation. In: Australian Conference on Robotics and Automation (2004)

    Google Scholar 

  10. Almeida, J., Torres, R.S., Goldenstein, S.: SIFT applied to CBIR. Revista de Sistemas de Informacao da FSMA n. 4, pp. 41–48 (2009)

    Google Scholar 

  11. Philbin, J.: Scalable object retrieval in very large image collections, D. Phil thesis (2010)

    Google Scholar 

  12. Nguyen, Q.T., Bui, T.D.: Speech classification using SIFT features on pectrogram images. Vietnam J. Comput. Sci. (2016). doi:10.1007/s40595-016-0071-3

    Google Scholar 

  13. Bahmani, B., Moseley, B., Vattani, A., Kumar, R., Vassilvitskii, S.: Scalable K-Means++. Proc. VLDB Endowment 5(7), 622–633 (2012)

    Article  Google Scholar 

  14. Sargent, D., Chen, C.-I., Tsai, C.-M., Wang, Y.-F., Koppel, D.: Feature detector and descriptor for medical images. In: SPIE Proceedings: Medical Imaging, vol. 7259, 27 March 2009. doi: 10.1117/12.811210

  15. Ashraf, R., Bashir, K., Irtaza, A., Mahmood, M.T.: Content based image retrieval using embedded neural networks with bandletized regions. Entropy 17, 3552–3580 (2015)

    Article  Google Scholar 

  16. http://wang.ist.psu.edu/docs/related/

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to My Kieu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Kieu, M., Lai, K.D., Tran, T.D., Le, T.H. (2016). A Fusion of Bag of Word Model and Hierarchical K-Means++ in Image Retrieval. In: Huynh, VN., Inuiguchi, M., Le, B., Le, B., Denoeux, T. (eds) Integrated Uncertainty in Knowledge Modelling and Decision Making. IUKM 2016. Lecture Notes in Computer Science(), vol 9978. Springer, Cham. https://doi.org/10.1007/978-3-319-49046-5_34

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-49046-5_34

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-49045-8

  • Online ISBN: 978-3-319-49046-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics