Abstract
Network representation learning can transform nodes into a low-dimensional vector representation, which has been widely used in recent years. As a heterogeneous information network, the academic network contains more information than the homogeneous information network, which attracts many scholars to do research. Most of the existing algorithms cannot make full use of the attribute or text information of nodes, which contains more information than structural features. To solve this problem, we propose a network representation learning algorithm based on research interest and meta-path. Firstly, the author’s research interest are extracted from their published papers, and then the random walk based on meta-path is used. The author node is transformed into several research interest nodes by mapping function, and the skip-gram model is used to train and get the vector representation of the nodes. Experiments show that our model outperforms traditional learning models in several heterogeneous network analysis tasks, such as node classification and similarity search. The obtained research interest representation can help solve the cold start problem of authors in the academic network.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Zhang, D., Yin, J., Zhu, X., et al.: Network representation learning: a survey. IEEE Trans. Big Data, 1 (2018)
Cui, P., Wang, X., Pei, J., et al.: A survey on network embedding. IEEE Trans. Knowl. Data Eng. PP(99), 1 (2017)
Wang, X., Cui, P., Wang, J., et al.: Community preserving network embedding. In: Thirty-First AAAI Conference on Artificial Intelligence (2017)
Zhang, F., Yuan, N.J., Lian, D., et al.: Collaborative knowledge base embedding for recommender systems. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 353–362. ACM (2016)
Liu, C., Bai, B., Skogerbø, G., et al.: NONCODE: an integrated knowledge database of non-coding RNAs. Nucleic Acids Res. 33(Database issue), D112–D115 (2005)
Bobadilla, J., Ortega, F., Hernando, A., et al.: A collaborative filtering approach to mitigate the new user cold start problem. Knowl. Based Syst. 26, 225–238 (2012)
Mikolov, T., Chen, K., Corrado, G., et al.: Efficient estimation of word representations in vector space (2013). arXiv preprint: arXiv:1301.3781
Le, Q., Mikolov, T.: Distributed representations of sentences and documents. In: International Conference on Machine Learning, pp. 1188–1196 (2014)
Perozzi, B., Al-Rfou, R., Skiena, S.: DeepWalk: online learning of social representations. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 701–710. ACM (2014)
Grover, A., Leskovec, J.: node2vec: Scalable feature learning for networks. In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2016)
Tang, J., Qu, M., Wang, M., et al.: LINE: large-scale information network embedding (2015)
Sun, Y., Han, J., Yan, X., et al.: PathSim: meta path-based top-k similarity search in heterogeneous information networks. Proc. VLDB Endow. 4(11), 992–1003 (2011)
Dong, Y., Chawla, N.V., Swami, A.: metapath2vec: Scalable representation learning for heterogeneous networks. In: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 135–144. ACM (2017)
Tu, C., Liu, H., Liu, Z., et al.: Cane: context-aware network embedding for relation modeling. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. Long Papers, vol. 1, pp. 1722–1731 (2017)
Sun, X., Guo, J., Ding, X., et al.: A general framework for content-enhanced network representation learning (2016). arXiv preprint arXiv:1610.02906
Tang, J., Zhang, J., Yao, L., et al.: ArnetMiner: extraction and mining of academic social networks. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 990–998. ACM (2008)
Roussopoulos, N., Kelley, S., Vincent, F.: Nearest neighbor queries. ACM SIGMOD Rec. 24(2), 71–79 (1995)
Jolliffe, I.T., Cadima, J.: Principal component analysis: a review and recent developments. Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 374(2065), 20150202 (2016)
Acknowledgments
The work in this paper is supported by the National Key Research and Development Plan (2018YFB1004700, 2016YFB0800403).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Zhang, W., Liang, Y., Dong, X. (2019). Representation Learning in Academic Network Based on Research Interest and Meta-path. In: Douligeris, C., Karagiannis, D., Apostolou, D. (eds) Knowledge Science, Engineering and Management. KSEM 2019. Lecture Notes in Computer Science(), vol 11776. Springer, Cham. https://doi.org/10.1007/978-3-030-29563-9_34
Download citation
DOI: https://doi.org/10.1007/978-3-030-29563-9_34
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-29562-2
Online ISBN: 978-3-030-29563-9
eBook Packages: Computer ScienceComputer Science (R0)