skip to main content
10.1145/3664647.3681030acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Learning Dual Enhanced Representation for Contrastive Multi-view Clustering

Published: 28 October 2024 Publication History

Abstract

Contrastive multi-view clustering is widely recognized for its effectiveness in mining feature representation across views via contrastive learning (CL), gaining significant attention in recent years. Most existing methods mainly focus on the feature-level or/and cluster-level CL, but there are still two shortcomings. Firstly, feature-level CL is limited by the influence of anomalies and large noise data, resulting in insufficient mining of discriminative feature representation. Secondly, cluster-level CL lacks the guidance of global information and is always restricted by the local diversity information. We in this paper Learn dUal enhanCed rEpresentation for Contrastive Multi-view Clustering (LUCE-CMC) to effectively addresses the above challenges, and it mainly contains two parts, i.e., enhanced feature-level CL (En-FeaCL) and enhanced cluster-level CL (En-CluCL). Specifically, we first adopt a shared encoder to learn shared feature representations between multiple views and then obtain cluster-relevant information that is beneficial to the clustering results. Moreover, we design a reconstitution approach to force the model to concentrate on learning features that are critical to reconstructing the input data, reducing the impact of noisy data and maximizing the sufficient discriminative information of different views in helping the En-FeaCL part. Finally, instead of contrasting the view-specific clustering result like most existing methods do, we in the En-CluCL part make the information at the cluster-level more richer by contrasting the cluster assignment from each view and the cluster assignment obtained from the shared fused features. The end-to-end training methods of the proposed model are mutually reinforcing and beneficial. Extensive experiments conducted on multi-view datasets show that the proposed LUCE-CMC outperforms established baselines to a considerable extent. The source code is released at https://github.com/ShizheHu.

References

[1]
Xiao Cai, Feiping Nie, and Heng Huang. 2013. Multi-View K-Means Clustering on Big Data. In Proceedings of the 23rd International Joint Conference on Artificial Intelligence. 2598--2604.
[2]
Rui Chen, Yongqiang Tang, Yuan Xie, Wenlong Feng, and Wensheng Zhang. 2023. Semisupervised Progressive Representation Learning for Deep Multiview Clustering. IEEE Transactions on Neural Networks and Learning Systems (2023).
[3]
Ting Chen, Simon Kornblith, Mohammad Norouzi, and Geoffrey Hinton. 2020. A simple framework for contrastive learning of visual representations. In Proceedings of the International Conference on Machine Learning. 1597--1607.
[4]
Xinyue Chen, Jie Xu, Yazhou Ren, Xiaorong Pu, Ce Zhu, Xiaofeng Zhu, Zhifeng Hao, and Lifang He. 2023. Federated Deep Multi-View Clustering with Global Self-Supervision. In Proceedings of the 31st ACM International Conference on Multimedia. 3498--3506.
[5]
Li Fei-Fei, Rob Fergus, and Pietro Perona. 2004. Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories. In IEEE Conference on Computer Vision and Pattern Recognition Workshops. 178.
[6]
Azhar A Hamdi, Mohammed S Sayed, Mohamed M Fouad, and Mohiy M Hadhoud. 2018. Unsupervised patterned fabric defect detection using texture filtering and K-means clustering. In 2018 international conference on innovative trends in computer engineering (ITCE). 130--144.
[7]
Chenxi Hu, Tao Wu, Shuaiqi Liu, Chunsheng Liu, Tao Ma, and Fang Yang. 2024. Joint unsupervised contrastive learning and robust GMM for text clustering. Information Processing & Management, Vol. 61, 1 (2024), 103529.
[8]
Shizhe Hu, Ruilin Geng, Zhaoxu Cheng, Chaoyang Zhang, Guoliang Zou, Zhengzheng Lou, and Yangdong Ye. 2022. A parameter-free multi-view information bottleneck clustering method by cross-view weighting. In Proceedings of the 30th ACM International Conference on Multimedia. 3792--3800.
[9]
Shizhe Hu, Zhengzheng Lou, Xiaoqiang Yan, and Yangdong Ye. 2024. A Survey on Information Bottleneck. IEEE Transactions on Pattern Analysis and Machine Intelligence (2024).
[10]
Shizhe Hu, Zhengzheng Lou, and Yangdong Ye. 2021. View-wise versus cluster-wise weight: Which is better for multi-view clustering? IEEE Transactions on Image Processing, Vol. 31 (2021), 58--71.
[11]
Shizhe Hu, Chengkun Zhang, Guoliang Zou, Zhengzheng Lou, and Yangdong Ye. 2024. Deep Multiview Clustering by Pseudo-Label Guided Contrastive Learning and Dual Correlation Learning. IEEE Transactions on Neural Networks and Learning Systems (2024).
[12]
Shizhe Hu, Guoliang Zou, Chaoyang Zhang, Zhengzheng Lou, Ruilin Geng, and Yangdong Ye. 2023. Joint contrastive triple-learning for deep multi-view clustering. Information Processing & Management, Vol. 60, 3 (2023), 103284.
[13]
Zhenyu Huang, Joey Tianyi Zhou, Xi Peng, Changqing Zhang, Hongyuan Zhu, and Jiancheng Lv. 2019. Multi-view Spectral Clustering Network. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence. 2563--2569.
[14]
Yu-Gang Jiang, Guangnan Ye, Shih-Fu Chang, Daniel P. W. Ellis, and Alexander C. Loui. 2011. Consumer video understanding: a benchmark database and an evaluation of human and machine performance. In Proceedings of the 1st International Conference on Multimedia Retrieval. 29.
[15]
Abhishek Kumar, Piyush Rai, and Hal Daumé III. 2011. Co-regularized Multi-view Spectral Clustering. In Advances in Neural Information Processing Systems. 1413--1421.
[16]
Shuang Li and Jonathan Corney. 2023. Multi-view expressive graph neural networks for 3D CAD model classification. Computers in Industry, Vol. 151 (2023), 103993.
[17]
Quan Liu, Yingying Chen, Jinqiao Wang, and Sijiong Zhang. 2018. Multi-view pedestrian captioning with an attention topic CNN model. Computers in Industry, Vol. 97 (2018), 47--53.
[18]
Yiding Lu, Yijie Lin, Mouxing Yang, Dezhong Peng, Peng Hu, and Xi Peng. 2024. Decoupled Contrastive Multi-View Clustering with High-Order Random Walks. In Thirty-Eighth AAAI Conference on Artificial Intelligence, AAAI. 14193--14201.
[19]
Feiping Nie, Jing Li, and Xuelong Li. 2017. Self-weighted Multiview Clustering with Multiple Graphs. In Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence. 2564--2570.
[20]
Zhenqiu Shu, Bin Li, Cunli Mao, Shengxiang Gao, and Zhengtao Yu. 2024. Structure-guided feature and cluster contrastive learning for multi-view clustering. Neurocomputing (2024), 127555.
[21]
Mengjing Sun, Pei Zhang, Siwei Wang, Sihang Zhou, Wenxuan Tu, Xinwang Liu, En Zhu, and Changjian Wang. 2021. Scalable Multi-view Subspace Clustering with Unified Anchors. In ACM Multimedia Conference. 3528--3536.
[22]
Daniel J. Trosten, Sigurd Løkse, Robert Jenssen, and Michael Kampffmeyer. 2021. Reconsidering Representation Alignment for Multi-View Clustering. In IEEE Conference on Computer Vision and Pattern Recognition. 1255--1265.
[23]
Haiyue Wang, Quan Wang, Qiguang Miao, and Xiaoke Ma. 2024. Joint learning of data recovering and graph contrastive denoising for incomplete multi-view clustering. Information Fusion, Vol. 104 (2024), 102155.
[24]
Jing Wang, Songhe Feng, Gengyu Lyu, and Zhibin Gu. 2023. Triple-Granularity Contrastive Learning for Deep Multi-View Subspace Clustering. In Proceedings of the 31st ACM International Conference on Multimedia. 2994--3002.
[25]
Shiping Wang, Zhaoliang Chen, Shide Du, and Zhouchen Lin. 2021. Learning deep sparse regularizers with applications to multi-view clustering and semi-supervised classification. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 44, 9 (2021), 5042--5055.
[26]
Yiming Wang, Dongxia Chang, Zhiqiang Fu, Jie Wen, and Yao Zhao. 2022. Graph contrastive partial multi-view clustering. IEEE Transactions on Multimedia (2022).
[27]
Jie Xu, Yazhou Ren, Guofeng Li, Lili Pan, Ce Zhu, and Zenglin Xu. 2021. Deep embedded multi-view clustering with collaborative training. Information Sciences, Vol. 573 (2021), 279--290.
[28]
Jie Xu, Huayi Tang, Yazhou Ren, Liang Peng, Xiaofeng Zhu, and Lifang He. 2022. Multi-level Feature Learning for Contrastive Multi-view Clustering. In IEEE Conference on Computer Vision and Pattern Recognition.
[29]
Yuankun Xu, Dong Huang, Chang-Dong Wang, and Jian-Huang Lai. 2024. Deep image clustering with contrastive learning and multi-scale graph convolutional networks. Pattern Recognition, Vol. 146 (2024), 110065.
[30]
Zhe Xue, Junping Du, Hai Zhou, Zhongchao Guan, Yunfei Long, Yu Zang, and Meiyu Liang. 2022. Robust diversified graph contrastive network for incomplete multi-view clustering. In Proceedings of the 30th ACM International Conference on Multimedia. 3936--3944.
[31]
Xiaoqiang Yan, Yiqiao Mao, Mingyuan Li, Yangdong Ye, and Hui Yu. 2023. Multitask image clustering via deep information bottleneck. IEEE Transactions on Cybernetics (2023).
[32]
Xiaoqiang Yan, Yiqiao Mao, Yangdong Ye, and Hui Yu. 2023. Cross-Modal Clustering With Deep Correlated Information Bottleneck Method. IEEE Transactions on Neural Networks and Learning Systems (2023).
[33]
Xihong Yang, Jin Jiaqi, Siwei Wang, Ke Liang, Yue Liu, Yi Wen, Suyuan Liu, Sihang Zhou, Xinwang Liu, and En Zhu. 2023. Dealmvc: Dual contrastive calibration for multi-view clustering. In Proceedings of the 31st ACM International Conference on Multimedia. 337--346.
[34]
Ming Yin, Weitian Huang, and Junbin Gao. 2020. Shared Generative Latent Representation Learning for Multi-View Clustering. In The Thirty-Fourth AAAI Conference on Artificial Intelligence. 6688--6695.
[35]
Runwu Zhou and Yi-Dong Shen. 2020. End-to-End Adversarial-Attention Network for Multi-Modal Clustering. In IEEE/CVF Conference on Computer Vision and Pattern Recognition. 14607--14616.
[36]
Sihang Zhou, Xinwang Liu, Jiyuan Liu, Xifeng Guo, Yawei Zhao, En Zhu, Yongping Zhai, Jianping Yin, and Wen Gao. 2020. Multi-View Spectral Clustering with Optimal Neighborhood Laplacian Matrix. In The Thirty-Fourth AAAI Conference on Artificial Intelligence. 6965--6972.

Index Terms

  1. Learning Dual Enhanced Representation for Contrastive Multi-view Clustering

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '24: Proceedings of the 32nd ACM International Conference on Multimedia
    October 2024
    11719 pages
    ISBN:9798400706868
    DOI:10.1145/3664647
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 28 October 2024

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. contrastive learning
    2. deep multi-view clustering
    3. representation learning

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    MM '24
    Sponsor:
    MM '24: The 32nd ACM International Conference on Multimedia
    October 28 - November 1, 2024
    Melbourne VIC, Australia

    Acceptance Rates

    MM '24 Paper Acceptance Rate 1,150 of 4,385 submissions, 26%;
    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 209
      Total Downloads
    • Downloads (Last 12 months)209
    • Downloads (Last 6 weeks)79
    Reflects downloads up to 02 Mar 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media