DOI: 10.1145/3318299.3318322
research-article

VLAD Encoding Based on LLC for Image Classification

Published: 22 February 2019

ABSTRACT

The Vector of Locally Aggregated Descriptors (VLAD) method, developed from bag-of-words (BOW) and Fisher Vector, has achieved great success in image classification and retrieval. However, traditional VLAD assigns each local descriptor only to the closest visual word in the codebook, a hard voting process that leads to a large quantization error. In this paper, we propose an approach that fuses VLAD with locality-constrained linear coding (LLC). Unlike the original method, it considers several nearest-neighbor centers when assigning each local descriptor, and it uses the reconstruction coefficients of LLC to obtain the weights of these centers. Because the reconstruction coefficients represent local descriptors well, we also combine them with the VLAD coding. Experiments on the 15 Scenes, UIUC Sports Event and Corel 10 datasets demonstrate that the proposed method has outstanding performance in terms of classification accuracy. The approach also adds little computational cost when encoding features.
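The contrast between hard assignment and LLC-weighted assignment can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract gives no formulas, so the approximated LLC solver (following the k-nearest-neighbor variant of Wang et al., 2010), the function names `llc_weights`, `vlad_hard` and `vlad_llc`, the regularization constant `reg`, the default `k` and the final L2 normalization are all assumptions made for illustration. The additional step of combining the coefficients themselves with the VLAD code is not shown.

```python
import numpy as np

def llc_weights(x, centers, k=5, reg=1e-4):
    """Approximated LLC: reconstruct x from its k nearest centers.

    Returns the indices of the k nearest centers and weights that sum to 1.
    (Assumed solver; the abstract does not specify one.)
    """
    d2 = np.sum((centers - x) ** 2, axis=1)
    idx = np.argsort(d2)[:k]                 # k nearest visual words
    z = centers[idx] - x                     # neighbors shifted so x is the origin
    C = z @ z.T                              # local covariance, shape (k, k)
    C = C + reg * max(np.trace(C), 1e-12) * np.eye(k)  # keep C invertible
    w = np.linalg.solve(C, np.ones(k))
    return idx, w / w.sum()                  # enforce the sum-to-one constraint

def _l2_normalize(v):
    n = np.linalg.norm(v)
    return v / n if n > 0 else v

def vlad_hard(X, centers):
    """Traditional VLAD: each descriptor votes only for its closest center."""
    v = np.zeros_like(centers, dtype=float)
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    for x, j in zip(X, d2.argmin(axis=1)):
        v[j] += x - centers[j]               # accumulate the residual
    return _l2_normalize(v.ravel())

def vlad_llc(X, centers, k=5):
    """VLAD with residuals spread over k centers, weighted by LLC coefficients."""
    v = np.zeros_like(centers, dtype=float)
    for x in X:
        idx, w = llc_weights(x, centers, k)
        v[idx] += w[:, None] * (x - centers[idx])  # weighted residuals
    return _l2_normalize(v.ravel())

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(50, 8))             # 50 toy local descriptors
    centers = rng.normal(size=(6, 8))        # toy codebook of 6 visual words
    print(vlad_llc(X, centers, k=3).shape)
```

With `k=1` the single weight is 1, so `vlad_llc` reduces to `vlad_hard` in this sketch, which makes the soft version easy to sanity-check against the traditional encoding.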


Published in

ICMLC '19: Proceedings of the 2019 11th International Conference on Machine Learning and Computing
February 2019
563 pages
ISBN: 9781450366007
DOI: 10.1145/3318299

      Copyright © 2019 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States


      Qualifiers

      • research-article
      • Research
      • Refereed limited
