Abstract
Distance metric learning is an important topic in visual classification tasks. Learning an appropriate distance measure can greatly improve the performance of image recognitions. In this paper, we model an image set as a regularized convex hull and propose a distance metric learning method for image classification based on regularized convex hulls. In particular, a regularized point-to-convex hull distance metric (RPCHD) and a regularized convex hull-to-convex hull distance metric (RCHCHD) are introduced to measure the distance between the query image object and the existing image sets for image classification tasks. The coefficients in distance metric are solved by an approximate optimization strategy. Two SVM-like distance metric learning models are constructed and transformed into the standard support vector machines to learn distance metric matrix in RPCHD and RCHCHD. Positive and negative sample pairs are proposed to represent distance metric matrix. Experiments on three image databases show that our proposed RPCHD and RCHCHD can effectively improve the performance of image classification.
Similar content being viewed by others
References
Cevikalp H, Triggs B (2010) Face recognition based on image sets. IEEE Conf Comput Vis Pattern Recogn. https://doi.org/10.1109/CVPR.2010.5539965
Chang CC, Lin CJ (2011) LIBSVM: a library for support vector machines. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm
Chien JT, Wu CC (2002) Discriminant waveletfaces and nearest feature classifiers for face recognition. IEEE Trans Pattern Anal Mach Intell 24(12):1644–1649. https://doi.org/10.1109/TPAMI.2002.1114855
Davis JV, Kulis B, Jain P, Sra S, Dhillon IS (2007) Information-theoretic metric learning. In: Paper presented at the Proceedings of the 24th international conference on Machine learning 209–216. https://doi.org/10.1145/1273496.1273523
Demsar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30. https://doi.org/10.1007/s10846-005-9016-2
Georghiades AS, Belhumeur PN, Kriegman DJ (2001) From few to many: illumination cone models for face recognition under variable lighting and pose. IEEE Trans Pattern Anal Mach Intell 23(6):643–660. https://doi.org/10.1109/34.927464
Globerson A, Roweis ST (2006) Metric learning by collapsing classes. Adv Neural Inform Process Syst. https://doi.org/10.5555/2976248.2976305
Goldberger J, Hinton GE, Roweis ST, Salakhutdinov R (2004) Neighbourhood components analysis. In: NIPS'04: Proceedings of the 17th International Conference on Neural Information Processing Systems 513–520. https://doi.org/10.5555/2976040.2976105
Ham J, Lee D (2008) Grassmann discriminant analysis: a unifying view on subspace-based learning. In: ICML'08: Proceedings of the 25th international conference on Machine learning 376–383. https://doi.org/10.1145/1390156.1390204
Harandi M, Sanderson C, Shirazi S, Lovell B (2011) Graph embedding discriminant analysis on Grassmannian manifolds for improved image set matching. IEEE Conf Comput Vis Pattern Recogn. https://doi.org/10.1109/CVPR.2011.5995564
Hayat M, Bennamoun M, An S (2015) Deep Reconstruction models for image set classification. IEEE Trans Pattern Anal Mach Intell 37(4):713–727. https://doi.org/10.1109/TPAMI.2014.2353635
Hu Y, Mian AS, Owens R (2011a) Sparse approximated nearest points for image set classification. IEEE Comput Soc Conf Comput Vis Pattern Recogn. https://doi.org/10.1109/CVPR.2011.5995500
Hu Y, Mian A, Owens R (2011b) Face recognition using sparse approximated nearest points between image sets. IEEE Trans Pattern Anal Mach Intell. https://doi.org/10.1109/TPAMI.2011.283
Hu J, Lu J, Tan Y-P (2014) Discriminative deep metric learning for face verification in the wild. In: CVPR'14: Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition 1875–1882. https://doi.org/10.1109/CVPR.2014.242
Huang Z, Wang R, Shan S, Chen X (2015) Projection metric learning on grassmann manifold with application to video based face recognition. IEEE Conf Comput Vis Pattern Recogn 2015:140–149. https://doi.org/10.1109/CVPR.2015.7298609
Johnson D, Xiong C, Corso J (2014) Semi-supervised nonlinear distance metric learning via forests of max-margin cluster hierarchies. IEEE Trans Knowl Data Eng. https://doi.org/10.1109/TKDE.2015.2507130
Kim TK, Kittler J, Cipolla R (2007) Discriminative learning and recognition of image set classes using canonical correlations. IEEE Trans Pattern Anal Mach Intell 29:1005. https://doi.org/10.1109/TPAMI.2007.1037
Kim M, Kumar S, Pavlovic V, Rowley H (2008) Face tracking and recognition with visual constraints in real-world videos. IEEE Conf Comput Vis Pattern Recogn 2008:1–8. https://doi.org/10.1109/CVPR.2008.4587572
Köstinger M, Hirzer M, Wohlhart P, Roth PM, Bischof H (2012) Large scale metric learning from equivalence constraints. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence 2288–2295. https://doi.org/10.1109/CVPR.2012.6247939.
Liang J, Hu Q, Zhu P, Wang W (2018) Efficient multi-modal geometric mean metric learning. Pattern Recogn 75:188–198. https://doi.org/10.1016/j.patcog.2017.02.032
Liong VE, Lu J, Tan Y-P, Zhou J (2017) Deep coupled metric learning for cross-modal matching. Trans Multi 19(6):1234–1244. https://doi.org/10.1109/tmm.2016.2646180
Lu J, Wang G, Deng W, Moulin P, Zhou J (2015) Multi-manifold deep metric learning for image set classification. IEEE Conf Comput Vis Pattern Recogn 2015:1137–1145. https://doi.org/10.1109/CVPR.2015.7298717
Mignon A, Jurie F (2012) PCCA: a new approach for distance learning from sparse pairwise constraints. IEEE Conf Comput Vis Pattern Recogn 2012:2666–2672. https://doi.org/10.1109/CVPR.2012.6247987
Nilsback ME, Zisserman A (2006) A visual vocabulary for flower classification a visual vocabulary for flower classification. IEEE Comput Soc Conf Comput Vis Pattern Recogn 2006:1447–1454. https://doi.org/10.1109/CVPR.2006.42
Noh Y, Zhang B, Lee DD (2018) Generative local metric learning for nearest neighbor classification. IEEE Trans Pattern Anal Mach Intell 40(1):106–118. https://doi.org/10.1109/TPAMI.2017.2666151
Okun OG (2006) K-local hyperplane distance nearest-neighbor algorithm and protein fold recognition. Pattern Recogn Image Anal 16(1):19–22. https://doi.org/10.1134/S1054661806010068
Peng J, Zhang L, Li L (2016) Regularized set-to-set distance metric learning for hyperspectral image classification. Pattern Recogn Lett 83:143–151. https://doi.org/10.1016/j.patrec.2016.03.018
Schroff F, Kalenichenko D, Philbin J (2015) FaceNet: a unified embedding for face recognition and clustering. IEEE Conf Comput Vis Pattern Recogn 2015:815–823. https://doi.org/10.1109/CVPR.2015.7298682
Tan M, Wang B, Wu Z, Wang J, Pan G (2016) Weakly supervised metric learning for traffic sign recognition in a LIDAR-equipped vehicle. IEEE Trans Intell Transp Syst 17:1–13. https://doi.org/10.1109/TITS.2015.2506182
Wang R, Chen X (2009) Manifold discriminant analysis. IEEE Conf Comput Vis Pattern Recogn 2009:429–436. https://doi.org/10.1109/CVPR.2009.5206850
Wang R, Shan S, Chen X, Dai Q, Gao W (2012) Manifold-manifold distance and its application to face recognition with image sets. IEEE Trans Image Process 21(10):4466–4479. https://doi.org/10.1109/TIP.2012.2206039
Wang W, Wang R, Huang Z, Shan S, Chen X (2017) Discriminant analysis on riemannian manifold of gaussian distributions for face recognition with image sets. IEEE Trans Image Process 27(1):151–163. https://doi.org/10.1109/TIP.2017.2746993
Weinberger K, Blitzer J, Saul L (2006) Distance metric learning for large margin nearest neighbor classification. J Mach Learn Res. https://doi.org/10.1007/s10845-008-0108-2
Xing EP, Ng AY, Jordan MI, Russell SJ (2002) Distance metric learning with application to clustering with side-information. InL NIPS'02: Proceedings of the 15th International Conference on Neural Information Processing Systems 521–528. https://doi.org/10.5555/2968618.2968683
Xu J, Liu J, Yin J, Sun C (2016) A multi-label feature extraction algorithm via maximizing feature variance and feature-label dependence simultaneously. Knowl Based Syst 98:172–184. https://doi.org/10.1016/j.knosys.2016.01.032
Yamaguchi O, Fukui K, Maeda K-I (1998) Face recognition using temporal image sequence. In: Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition 318–323. https://doi.org/10.1109/AFGR.1998.670968
Yang M, Zhu P, Van Gool L, Zhang L (2013) Face recognition based on regularized nearest points between image sets. In: 2013 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition 1–7. https://doi.org/10.1109/FG.2013.6553727
Zhu P, Zhang L, Zuo W, Zhang D (2013) From point to set: extend the learning of distance metrics. IEEE Int Conf Comput Vis 2013:2664–2671. https://doi.org/10.1109/ICCV.2013.331
Zhu P, Hu Q, Zuo W, Yang M (2014) Multi-granularity distance metric learning via neighborhood granule margin maximization. Inf Sci 282:321–331. https://doi.org/10.1016/j.ins.2014.06.017
Zhu P, Qi R, Hu Q, Wang Q, Zhang C, Yang L (2018) Beyond similar and dissimilar relations: a kernel regression formulation for metric learning. In: IJCAI'18: Proceedings of the 27th International Joint Conference on Artificial Intelligence 3242–3248. https://doi.org/10.5555/3304889.3305111
Acknowledgements
This work was supported by the National Natural Science Foundation of China under Grants 61976027 and 61572082, and the Natural Science Foundation of Liaoning Province (20170540012, 20170540004).
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Rosana Sueli da Motta Jafelice.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Zhang, X., Wang, C. & Fan, X. Convex hull-based distance metric learning for image classification. Comp. Appl. Math. 40, 113 (2021). https://doi.org/10.1007/s40314-021-01482-x
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s40314-021-01482-x