Abstract
To improve the performance of classifying nodes on unlabeled or scarcely labeled networks, the task of node classification across networks has been proposed for transferring knowledge from similar networks with rich labels. Because the data distribution shifts across networks, domain adaptive network embedding has been proposed to overcome this challenge by learning network-invariant and discriminative node embeddings, applying domain adaptation techniques to network embedding to reduce domain discrepancy. However, existing works rarely discuss category-level domain discrepancy, which is crucial to better adaptation and classification. In this paper, we propose category-level domain adaptive network embedding. The key idea is to simultaneously minimize intra-class domain discrepancy and maximize inter-class domain discrepancy between the source and target networks. To further enhance classification performance on the target network, we reduce embedding variation inside each class and enlarge it between different classes. A graph attention network is adopted for learning network embeddings. In addition, a novel pseudo-labeling strategy for the target network is developed to better compute category-level information. Theoretical analysis guarantees the effectiveness of our model. Furthermore, extensive experiments on real-world datasets show that our model achieves state-of-the-art performance, in particular outperforming existing domain adaptive network embedding models by up to 32%.
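The key idea stated above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a linear-kernel discrepancy (the squared distance between class-mean embeddings), and the function names `class_means` and `category_level_discrepancy` are hypothetical. Target labels are assumed to be pseudo-labels obtained by some separate procedure.

```python
import numpy as np


def class_means(emb, labels, num_classes):
    """Mean embedding per class; a class with no nodes gets a NaN row."""
    means = np.full((num_classes, emb.shape[1]), np.nan)
    for c in range(num_classes):
        mask = labels == c
        if mask.any():
            means[c] = emb[mask].mean(axis=0)
    return means


def category_level_discrepancy(src_emb, src_labels, tgt_emb, tgt_pseudo, num_classes):
    """Return (intra, inter) class-level discrepancy between two networks.

    Assumption: discrepancy between source class c and target class k is the
    squared Euclidean distance between their mean embeddings, which equals the
    (biased) squared MMD under a linear kernel.
    """
    mu_s = class_means(src_emb, src_labels, num_classes)
    mu_t = class_means(tgt_emb, tgt_pseudo, num_classes)
    # dist[c, k] = ||mu_s[c] - mu_t[k]||^2 via broadcasting
    diff = mu_s[:, None, :] - mu_t[None, :, :]
    dist = (diff ** 2).sum(axis=-1)
    same = np.eye(num_classes, dtype=bool)
    intra = np.nanmean(dist[same])    # same class across networks: to be minimized
    inter = np.nanmean(dist[~same])   # different classes across networks: to be maximized
    return intra, inter


# Toy example with two classes and 2-dimensional embeddings.
src_emb = np.array([[0.0, 0.0], [0.0, 2.0], [10.0, 10.0], [10.0, 12.0]])
src_labels = np.array([0, 0, 1, 1])
tgt_emb = np.array([[1.0, 1.0], [1.0, 3.0], [11.0, 11.0], [11.0, 13.0]])
tgt_pseudo = np.array([0, 0, 1, 1])

intra, inter = category_level_discrepancy(src_emb, src_labels, tgt_emb, tgt_pseudo, 2)
print(intra, inter)
```

Under these assumptions, one possible training signal would be a term such as `intra - inter` added to the classification loss, so that same-class embeddings from the two networks are pulled together while different-class embeddings are pushed apart.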
Acknowledgements
This work was funded by the National Natural Science Foundation of China under grant number U21B2046. It was also supported by a fellowship of the China Postdoctoral Science Foundation (2022M713206).
Author information
Contributions
BS: Conceptualization, methodology, software, validation, investigation, writing-original draft, writing-review and editing. YW: Validation, formal analysis, resources, writing-review and editing. JS: Methodology, visualization, data curation, investigation, writing-review and editing. HS: Supervision, project administration, funding acquisition. YL: Supervision, project administration, funding acquisition. XC: Supervision, project administration, funding acquisition. All authors reviewed the manuscript.
Ethics declarations
Conflict of interest
The authors declare no conflict of interest.
About this article
Cite this article
Shi, B., Wang, Y., Shao, J. et al. Node classification across networks via category-level domain adaptive network embedding. Knowl Inf Syst 65, 5479–5502 (2023). https://doi.org/10.1007/s10115-023-01942-2
DOI: https://doi.org/10.1007/s10115-023-01942-2