Abstract
Image retrieval is a recognition technique in the field of computer vision. In most cases, high-quality retrieval is often supported by adequate learning instances. However, in the process of learning instance selection, some useless, repeated, invalid, and even mistaken learning instances are often selected. Low-quality instances not only add to the computing burden but also decrease the retrieval quality. In this study, we propose a learning instance optimization method. Initially, we classify the images into scene and object images by using the K-means clustering model. We use different methods to handle these two groups of images. For scene images, we use the Euclidean distance of the GIST descriptor to select the optimized learning instances. For object images, we use the improved spatial pyramid matching and optimal instance distance methods to select the optimized learning instances. Finally, we implement experiments using one large image database to check the effectiveness of our proposed algorithm. Results show that our method can not only improve retrieval quality but also decrease the number of learning instances.
Similar content being viewed by others
References
Barrow HG, Tenenbaum JM, Bolles RC, Wolf HC (1977) Parametric correspondence and chamfer matching: two new techniques for image matching. tech. rep., DTIC Document
Bart E, Porteous I, Perona P, Welling M (2008) Unsupervised learning of visual taxonomies. In: Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on IEEE, pp 1–8
Bosch A, Zisserman A, Munoz X (2007) Representing shape with a spatial pyramid kernel. In Proceedings of the 6th ACM International Conference on Image and Video Retrieval, ACM, pp 401–408
Boser BE, Guyon IM, Vapnik VN (1992) A training algorithm for optimal margin classifiers. In: Proceedings of the Fifth Annual Workshop on Computational Learning Theory, ACM, pp 144–152
Carson C, Thomas M, Belongie S, Hellerstein JM, Malik J (1999) Blobworld: a system for region-based image indexing and retrieval. In: International Conference on Advances in Visual Information Systems. Springer, pp 509–517
Chong W, Blei D, Li F-F (2009) Simultaneous image classification and annotation. In Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on IEEE, pp 1903–1910
Comaniciu D, Ramesh V, Meer P (2000) Real-time tracking of non-rigid objects using mean shift. In: Computer Vision and Pattern Recognition, 2000. Proceedings. IEEE Conference on, vol 2, IEEE, pp 142–149
Csurka G, Dance C, Fan L, Willamowski J, Bray C (2004) Visual categorization with bags of keypoints. In: Workshop on statistical learning in computer vision, ECCV, vol 1. Prague, pp 1–2
Deng J, Dong W, Socher R, Li L-J, Li K, Fei-Fei L (2009) Imagenet: a large-scale hierarchical image database. In: Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on, IEEE, pp 248–255
Fei-Fei L, Fergus R, Perona P (2006) One-shot learning of object categories. IEEE Trans Pattern Anal Mach Intell 28(4):594–611
Felzenszwalb P, McAllester D, Ramanan D (2008) A discriminatively trained, multiscale, deformable part model. In Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on IEEE, pp 1–8
Fergus R, Bernal H, Weiss Y, Torralba A (2010) Semantic label sharing for learning with many categories. In: European Conference on Computer Vision. Springer, pp 762–775
Fergus R, Weiss Y, Torralba A (2009) Semi-supervised learning in gigantic image collections. In: Advances in neural information processing systems, pp 522–530
Griffin G, Holub A, Perona P (2007) Caltech-256 object category dataset
Hays J, Efros AA (2007) Scene completion using millions of photographs. In ACM Transactions on Graphics (TOG), vol 26, ACM, p 4
He K, Zhang X, Ren S, Sun J (2015) Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. In: Proceedings of the IEEE International Conference on Computer Vision, pp 1026–1034
Ioffe S, Szegedy C (2015) Batch normalization: accelerating deep network training by reducing internal covariate shift. arXiv preprint arXiv:1502.03167
Jaakkola TS, Haussler D (1998) Exploiting generative models in discriminative classifiers. Adv Neural Inf Proces Syst 11(11):487–493
Jain AK (2010) Data clustering: 50 years beyond k-means. Pattern Recogn Lett 31(8):651–666
Kanimozhi T, Latha K (2015) An integrated approach to region based image retrieval using firefly algorithm and support vector machine. Neurocomputing 151:1099–1111
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp 1097–1105
Kumar N, Berg AC, Belhumeur PN, Nayar SK (2009) Attribute and simile classifiers for face verification. In 2009 I.E. 12th International Conference on Computer Vision, IEEE, pp 365–372
Lampert CH, Nickisch H, Harmeling S (2009) Learning to detect unseen object classes by between-class attribute transfer. In: Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on, IEEE, pp 951–958
Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: 2006 I.E. Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’06), vol 2. IEEE, pp 2169–2178
Lin Y, Lv F, Zhu S, Yang M, Cour T, Yu K, Cao L, Huang T (2011) Large-scale image classification: fast feature extraction and svm training. In: Computer Vision and Pattern Recognition (CVPR), 2011 I.E. Conference on, IEEE, pp 1689–1696
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110
Maji S, Berg AC (2009) Max-margin additive classifiers for detection. In: 2009 I.E. 12th International Conference on Computer Vision, IEEE, pp 40–47
Master S (2014) Large scale object detection. Czech Technical University
Palatucci M, Pomerleau D, Hinton GE, Mitchell TM (2009) Zero-shot learning with semantic output codes. In: Advances in neural information processing systems, pp 1410–1418
Parikh D, Grauman K (2011) Relative attributes. In: 2011 International Conference on Computer Vision, IEEE, pp 503–510
Perronnin F, Sánchez J, Mensink T (2010) Improving the fisher kernel for large-scale image classification. In: European Conference on Computer Vision. Springer, pp 143–156
Rubner Y, Tomasi C, Guibas LJ (2000) The earth mover’s distance as a metric for image retrieval. Int J Comput Vis 40(2):99–121
Russell BC, Torralba A, Murphy KP, Freeman WT (2008) Labelme: a database and web-based tool for image annotation. Int J Comput Vis 77(1–3):157–173
Swain MJ, Ballard DH (1992) Indexing via color histograms. In: Active Perception and Robot Vision. Springer, pp 261–273
Toyama K, Blake A (2001) Probabilistic tracking in a metric space. In: Computer Vision, 2001. ICCV 2001. Proceedings. Eighth IEEE International Conference on, vol 2, IEEE, pp 50–57
Ulges A, Schulze C, Keysers D, Breuel T (2008) Identifying relevant frames in weakly labeled videos for training concept detectors. In: Proceedings of the 2008 international conference on Content-based image and video retrieval, ACM, pp 9–16
Van de Sande KEA, Gevers T, Snoek CGM (2008) Evaluation of color descriptors for object and scene recognition. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition: CVPR 2008, pp 1–8
Vedaldi A, Zisserman A (2011) Image classification practical. http://www.robots.ox.ac.uk/vgg/share/practical-image-classification.htm
Wu H, Miao Z, Chen J, Yang J, Gao X (2015) Recognition improvement through the optimisation of learning instances. IET Comput Vis 9(3):419–427
Wu H, Miao Z, Wang Y, Lin M (2015) Optimized recognition with few instances based on semantic distance. Vis Comput 31(4):367–375
Wu R, Yan S, Shan Y, Dang Q, Sun G (2015) Deep image: scaling up image recognition. arXiv preprint arXiv:1501.02876, 7(8)
Ye Z, Chen X, Li Z (2010) Video based mobile location search with large set of sift points in cloud. In: Proceedings of the 2010 ACM multimedia workshop on Mobile cloud media computing, ACM, pp 25–30
Zha Z-J, Hua X-S, Mei T, Wang J, Qi G-J, Wang Z (2008) Joint multi-label multi-instance learning for image classification. In: Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on IEEE, pp 1–8
Zhang X, Liu W, Dundar M, Badve S, Zhang S (2015) Towards large-scale histopathological image analysis: hashing-based image retrieval. IEEE Trans Med Imaging 34(2):496–506
Wu H, Miao Z, Wang Y, Chen J, Ma C, Zhou T (2015) Image completion with multi-image based on entropy reduction. Neurocomputing 159:157–171
Acknowledgements
This research is sponsored by National Natural Science Foundation of China (Nos.61601033,61371185, 61401029, 61571049), China Postdoctoral Science Foundation(212400201, 2016M591109), the Fundamental Research Funds for the Central Universities (Nos. 2014KJJCB32, 2013NT57, 2012LYB46), Research Funds (15ZR003) and by SRF for ROCS, SEM, NSFC61273274, 61370127 and 61201158,NSFB4123104, FRFCU 2014JBZ004, Z131110001913143.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Li, Y., Bie, R., Zhang, C. et al. Optimized learning instance-based image retrieval. Multimed Tools Appl 76, 16749–16766 (2017). https://doi.org/10.1007/s11042-016-3950-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-016-3950-9