Abstract
Network representation learning that fuses multi-dimensional node classification information aims to combine this information effectively with network structure information, thereby improving the quality of the learned representations. However, existing methods treat multi-dimensional classification information only as prior features that assist the representation learning of network structure information. They lack a mechanism for coping with missing data and are not robust when information is incomplete. To address these issues, we propose IMCIN, a representation learning model based on stochastic perturbation and a homophily constraint. First, a random perturbation strategy transforms the data to improve the model's adaptability to incomplete information. Second, while learning the fused representation vectors, an attribute similarity preservation method based on the homophily principle further mines the useful semantic information that remains in the incomplete data. Experiments show that our method handles incomplete information effectively and improves performance on node classification and link prediction tasks.
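The two ideas named in the abstract can be illustrated with a minimal sketch. This is not the IMCIN implementation: the abstract does not specify the perturbation scheme or the form of the homophily constraint, so the sketch assumes (hypothetically) that perturbation means randomly zeroing attribute entries, and that the constraint penalizes the gap between the cosine similarity of embeddings and the cosine similarity of attributes. The function names `perturb_attributes` and `homophily_loss` and the parameter `drop_rate` are illustrative only.

```python
import numpy as np


def perturb_attributes(X, drop_rate, rng):
    """Randomly zero out attribute entries to imitate incomplete information.

    Assumption: each entry is dropped independently with probability drop_rate.
    """
    keep_mask = rng.random(X.shape) >= drop_rate
    return X * keep_mask


def cosine_similarity_matrix(M):
    """Pairwise cosine similarity between the rows of M (zero rows stay zero)."""
    norms = np.linalg.norm(M, axis=1, keepdims=True)
    unit = M / np.maximum(norms, 1e-12)
    return unit @ unit.T


def homophily_loss(Z, X):
    """Mean squared gap between embedding similarity and attribute similarity.

    Assumption: nodes with similar attributes should receive similar embeddings.
    """
    diff = cosine_similarity_matrix(Z) - cosine_similarity_matrix(X)
    return float(np.mean(diff ** 2))


rng = np.random.default_rng(0)

# Toy attribute matrix: 3 nodes, 4 binary classification attributes.
X = np.array([[1, 0, 1, 0],
              [1, 0, 1, 1],
              [0, 1, 0, 1]], dtype=float)

X_noisy = perturb_attributes(X, drop_rate=0.3, rng=rng)

# Random 2-dimensional embeddings stand in for a trained model's output.
Z = rng.normal(size=(3, 2))
loss = homophily_loss(Z, X_noisy)
```

In a training loop, a term of this kind would typically be added to a structure-preserving objective, with a fresh perturbation drawn in each epoch so the model does not come to rely on any single attribute being present.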
Acknowledgements
This work was supported by the Natural Sciences Foundation of Zhejiang Province under Grant No. LY22F020003, the National Natural Science Foundation of China under Grant No. 62002226, and the China Postdoctoral Science Foundation under Grant No. 2022M711715.
Author information
Contributions
QL wrote the main manuscript text. All authors reviewed the manuscript.
Ethics declarations
Conflict of interest
The authors declare no competing interests.
About this article
Cite this article
Li, Q., Jiang, M. A representation learning model based on stochastic perturbation and homophily constraint. Knowl Inf Syst 65, 5353–5373 (2023). https://doi.org/10.1007/s10115-023-01941-3
DOI: https://doi.org/10.1007/s10115-023-01941-3