Skip to main content
Log in

Discriminative Histogram Intersection Metric Learning and Its Applications

  • Regular Paper
  • Published:
Journal of Computer Science and Technology Aims and scope Submit manuscript

Abstract

In this paper, a novel method called discriminative histogram intersection metric learning (DHIML) is proposed for pair matching and classification. Specifically, we introduce a discrimination term for learning a metric from binary information such as same/not-same or similar/dissimilar, and then combine it with the classification error for the discrimination in classifier construction. Compared with conventional approaches, the proposed method has several advantages. 1) The histogram intersection strategy is adopted into metric learning to deal with the widely used histogram features effectively. 2) By introducing discriminative term and classification error term into metric learning, a more discriminative distance metric and a classifier can be learned together. 3) The objective function is robust to outliers and noises for both features and labels in the training. The performance of the proposed method is tested on four applications: face verification, face-track identification, face-track clustering, and image classification. Evaluations on the challenging restricted protocol of Labeled Faces in the Wild (LFW) benchmark, a dataset with more than 7 000 face-tracks, and Caltech-101 dataset validate the robustness and discriminability of the proposed metric learning, compared with the recent state-of-the-art approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  1. Zeng S, Xiong Y. Weighted average integration of sparse representation and collaborative representation for robust face recognition. Computional Visual Media, 2016, 2(4): 357-365.

    Article  Google Scholar 

  2. Zhu Z, Liang D, Zhang S et al. Traffic-sign detection and classification in the wild. In Proc. IEEE Conference on Computer Vision and Pattern Recognition, June 2016, pp.2110-2118.

  3. Weinberger K, Blitzer J, Sau L. Distance metric learning for large margin nearest neighbor classification. Journal of Machine Learning Research, 2009, 10(2): 207-244.

    Google Scholar 

  4. Davis J V, Kulis B, Jain P et al. Information-theoretic metric leaning. In Proc. the 24th International Conference on Machine Learning, June 2007, pp.209-216.

  5. Guillaumin M, Verbeek J, Schmid C. Is that you? Metric learning approaches for face identification. In Proc. the 12th International Conference on Computer Vision, Sept. 2009, pp.498-505.

  6. Nguyen H V, Bai L. Cosine similarity metric learning for face verification. In Proc. Asian Conference on Computer Vision, November 2010, pp.709-720.

  7. Zhang H, Xing Z, Chen L, Gao Y. Efficient metric all-k-nearest-neighbor search on datasets without any index. Journal of Computer Science and Technology, 2016, 31(6): 1194-1211.

    Article  MathSciNet  Google Scholar 

  8. Lowe D G. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 2004, 60(2): 91-110.

    Article  Google Scholar 

  9. Ojala T, Pietikainen M, Maenpaa T. Multiresolution grayscale and rotation invariant texture classification with local binary patterns. IEEE Trans. Pattern Analysis and Machine Intelligence, 2002, 24(7): 971-987.

  10. Rockfellar R. Convex Analysis. Princeton University Press, 2015.

  11. Wu J X, Rehg J M. Beyond the Euclidean distance: Creating effective visual codebooks using the histogram intersection kernel. In Proc. the 12th International Conference on Computer Vision, September 2009, pp.630-637.

  12. Huang K, Ying Y, Campbell C. GSML: A unified framework for sparse metric learning. In Proc. the 9th International Conference on Data Mining, December 2009, pp.189-198.

  13. Liu W, Pokharel P P, Principe J C. Correntropy: Properties and applications in non Gaussian signal processing. IEEE Transactions on Signal Processing, 2007, 55(11): 5286-5298.

  14. Yuan X T, Hu B G. Robust feature extraction via information theoretic learning. In Proc. the 26th Annual International Conference on Machine Learning, June 2009, pp.1193-1200.

  15. Huang G, Ramesh M, Berg T, Learned-Miller E. Labeled faces in the wild: A database for studying face recognition in unconstrained environments. Technical Report, University of Massachusetts, Amherst, October 2007, pp.7-49.

  16. Ying Y, Li P. Distance metric learning with eigenvalue optimization. Journal of Machine Learning Research, 2012, 13(1): 1-26.

    MathSciNet  MATH  Google Scholar 

  17. Guo H, Jiang Z, Davis L S. Discriminative dictionary learning with pariwise constraints. In Proc. Asian Conference on Computer Vision, November 2012, pp.328-342.

  18. Wolf L, Hassner T, Taigman Y. Similarity scores based on background samples. In Proc. Asian Conference on Computer Vision, September 2009, pp.88-97

  19. Taigman Y, Wolf L, Hassner T. Multiple one-shots for utilizing class label information. In Proc. British Machine Vision Conference, September 2009, pp.1-12.

  20. Daugman J G. Complete discrete2-D Gabor transforms by neural networks for image analysis and compression. IEEE Trans. Acoustics, Speech, and Signal Processing, 1988, 36(7): 1169-1179.

  21. Seo H J, Milanfar P. Face verification using the LARK representation. IEEE Trans. Information Forensics and Security, 2011, 6(4): 1275-1286.

    Article  Google Scholar 

  22. Wang F, Leonidas J. Supervised earth mover’s distance learning and its computer vision applications. In Proc. European Conference on Computer Vision, October 2012, pp.442-455.

  23. Hao P, Kamata S. Efficient large-scale video retrieval via discriminative signatures. IEICE Transactions on Information and Systems, 2013, 96(8): 1800-1810.

    Article  Google Scholar 

  24. Liu F F, Fergus R, Perona P. Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories. Computer Vision and Image Understanding, 2007, 106(1): 59-70.

    Article  Google Scholar 

  25. Jiang Z, Lin Z, Davis L. Learning a discriminative dictionary for sparse coding via label consistent K-SVD. In Proc. IEEE Conference on Computer Vision and Pattern Recognition, June 2011 pp.1697-1704.

  26. Xiong C, Johnson D J, Corso J J. Efficient max-margin metric learning. In Proc. ECDM, Jan. 2012.

  27. Le T, Cuturi M. Unsupervised Riemannian metric learning for histograms using aitchison transformations. In Proc. International Conference on Machine Learning, July 2015, pp.2002-2011.

  28. Cuturi M, Avis D. Ground metriclearning. Journal of Machine Learning Research, 2014, 15(1): 533-564.

    MathSciNet  MATH  Google Scholar 

  29. Lazebnik S, Schmid C, Ponce J. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In Proc. IEEE Conference on Computer Vision and Pattern Recognition, June 2006, pp.2169-2178.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Peng-Yi Hao.

Electronic supplementary material

Below is the link to the electronic supplementary material.

ESM 1

(PDF 174 kb)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hao, PY., Xia, Y., Li, XX. et al. Discriminative Histogram Intersection Metric Learning and Its Applications. J. Comput. Sci. Technol. 32, 507–519 (2017). https://doi.org/10.1007/s11390-017-1740-0

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11390-017-1740-0

Keywords

Navigation