skip to main content
10.1145/2733373.2806290acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
short-paper

Multi-cue Augmented Face Clustering

Published: 13 October 2015 Publication History

Abstract

Face clustering is an important but challenging task since facial images always have huge variation due to change in facial expressions, head poses and partial occlusions, etc. Moreover, face clustering is actually an unsupervised problem which makes it more difficult to reach an accurate result. Fortunately, there are some cues that can be used to improve clustering performance. In this paper, two types of cues are employed. The first one is pairwise constraints: must-link and cannot-link constraints, which can be extracted from the temporal and spatial knowledge of data. The other is that each face is associated with a series of attributes (i.e, gender) which can contribute discrimination among faces. To take advantage of the above cues, we propose a new algorithm, Multi-cue Augmented Face Clustering (McAFC), which effectively incorporates the cues via graph-guided sparse subspace clustering technique. Specially, facial images from the same individual are encouraged to be connected while faces from different persons are restrained to be connected. Experiments on three face datasets from real-world videos show the improvements of our algorithm over the state-of-the-art methods.

References

[1]
S. P. Boyd and L. Vandenberghe. Convex optimization. Cambridge university press, 2004.
[2]
X. Cao, C. Zhang, H. Fu, S. Liu, and H. Zhang. Diversity-induced multi-view subspace clustering. In CVPR, pages 586--594, 2015.
[3]
X. Chen, Q. Lin, S. Kim, J. G. Carbonell, E. P. Xing, et al. Smoothing proximal gradient method for general structured sparse regression. The Annals of Applied Statistics, 6(2):719--752, 2012.
[4]
W.-S. Chu, F. De la Torre, and J. F. Cohn. Selective transfer machine for personalized facial action unit detection. In CVPR, pages 3515--3522, 2013.
[5]
R. G. Cinbis, J. Verbeek, and C. Schmid. Unsupervised metric learning for face identification in tv video. In ICCV, pages 1559--1566, 2011.
[6]
T. Cour, B. Sapp, A. Nagle, and B. Taskar. Talking pictures: Temporal grouping and dialog-supervised person recognition. In CVPR, pages 1014--1021, 2010.
[7]
E. Elhamifar and R. Vidal. Sparse subspace clustering: Algorithm, theory, and applications. IEEE Transactions on PAMI, 35(11):2765--2781, 2013.
[8]
A. Fitzgibbon and A. Zisserman. On affine invariant clustering and automatic cast listing in movies. In ECCV, pages 304--320. 2002.
[9]
A. W. Fitzgibbon and A. Zisserman. Joint manifold distance: a new approach to appearance based clustering. In CVPR, pages 19--26, 2003.
[10]
Y. Hu, A. S. Mian, and R. Owens. Sparse approximated nearest points for image set classification. In CVPR, pages 121--128, 2011.
[11]
S. J. Kim, K. Koh, S. Lustig, M. Byod, and D. Gorinevsky. An interior-point method for large-scale l1-regularized logistic regression. JMLR, 8(8):1519--1555, 2007.
[12]
J. MacQueen et al. Some methods for classification and analysis of multivariate observations. In Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, volume 1, pages 281--297. Oakland, USA., 1967.
[13]
A. Y. Ng, M. I. Jordan, Y. Weiss, et al. On spectral clustering: Analysis and an algorithm. NIPS, 2:849--856, 2002.
[14]
W. M. Rand. Objective criteria for the evaluation of clustering methods. Journal of the American Statistical association, 66(336):846--850, 1971.
[15]
L. Wolf, T. Hassner, and I. Maoz. Face recognition in unconstrained videos with matched background similarity. In CVPR, pages 529--534, 2011.
[16]
B. Y. Wu, Y. F. Zhang, B. G. Hu, and Q. Ji. Constrained clustering and its application to face clustering in videos. In CVPR, pages 3507--3514, 2013.
[17]
S. Xiao, M. Tan, and D. Xu. Weighted block-sparse low rank representation for face clustering in videos. In ECCV, pages 123--138. 2014.
[18]
Y. F. Zhang, C. S. Xu, H. Lu, and Y. Huang. Character identification in feature-length films using global face-name matching. IEEE Transactions on Multimedia, 11(7):1276--1288, 2009.
[19]
C. Zhou, C. Zhang, X. Li, G. Shi, and X. Cao. Video face clustering via constrained sparse representation. In ICME, pages 1--6, 2014.

Cited By

View all
  • (2024)VideoClusterNet: Self-supervised and Adaptive Face Clustering for VideosComputer Vision – ECCV 202410.1007/978-3-031-73404-5_22(377-396)Online publication date: 30-Oct-2024
  • (2023)Diversity Multi-View Clustering With Subspace and NMF-Based Manifold LearningIEEE Access10.1109/ACCESS.2023.326483711(37041-37051)Online publication date: 2023
  • (2021)Scalable Multi-view Subspace Clustering with Unified AnchorsProceedings of the 29th ACM International Conference on Multimedia10.1145/3474085.3475516(3528-3536)Online publication date: 17-Oct-2021
  • Show More Cited By

Index Terms

  1. Multi-cue Augmented Face Clustering

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '15: Proceedings of the 23rd ACM international conference on Multimedia
    October 2015
    1402 pages
    ISBN:9781450334594
    DOI:10.1145/2733373
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 13 October 2015

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. face clustering
    2. graph-guided
    3. sparse representation

    Qualifiers

    • Short-paper

    Funding Sources

    • Chinese Academy of Sciences
    • National Natural Science Foundation of China

    Conference

    MM '15
    Sponsor:
    MM '15: ACM Multimedia Conference
    October 26 - 30, 2015
    Brisbane, Australia

    Acceptance Rates

    MM '15 Paper Acceptance Rate 56 of 252 submissions, 22%;
    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)6
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 25 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)VideoClusterNet: Self-supervised and Adaptive Face Clustering for VideosComputer Vision – ECCV 202410.1007/978-3-031-73404-5_22(377-396)Online publication date: 30-Oct-2024
    • (2023)Diversity Multi-View Clustering With Subspace and NMF-Based Manifold LearningIEEE Access10.1109/ACCESS.2023.326483711(37041-37051)Online publication date: 2023
    • (2021)Scalable Multi-view Subspace Clustering with Unified AnchorsProceedings of the 29th ACM International Conference on Multimedia10.1145/3474085.3475516(3528-3536)Online publication date: 17-Oct-2021
    • (2020)Video Face Clustering With Self-Supervised Representation LearningIEEE Transactions on Biometrics, Behavior, and Identity Science10.1109/TBIOM.2019.29472642:2(145-157)Online publication date: Apr-2020
    • (2020)Clustering based Contrastive Learning for Improving Face Representations2020 15th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2020)10.1109/FG47880.2020.00011(109-116)Online publication date: Nov-2020
    • (2019)Self-supervised Face-Grouping on GraphsProceedings of the 27th ACM International Conference on Multimedia10.1145/3343031.3351071(247-256)Online publication date: 15-Oct-2019
    • (2019)Self-Supervised Learning of Face Representations for Video Face Clustering2019 14th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2019)10.1109/FG.2019.8756609(1-8)Online publication date: May-2019
    • (2019)Tracking Persons-of-Interest via Unsupervised Representation AdaptationInternational Journal of Computer Vision10.1007/s11263-019-01212-1Online publication date: 3-Sep-2019
    • (2016)Joint Face Representation Adaptation and Clustering in VideosComputer Vision – ECCV 201610.1007/978-3-319-46487-9_15(236-251)Online publication date: 17-Sep-2016
    • (2016)Multi-view Subspace Clustering via a Global Low-Rank Affinity MatrixIntelligent Data Engineering and Automated Learning – IDEAL 201610.1007/978-3-319-46257-8_35(321-331)Online publication date: 13-Sep-2016
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media