Abstract
Academic collaborations improve research efficiency and spur scientific innovation. However, scholarly big data has hindered scholars from finding suitable collaborators. Although some studies have involved the prediction problem of academic collaborations, they neglect the rich dynamic information of the heterogeneous academic network. In this paper, we propose a prediction model for academic collaborations, which considers both the dynamic structure and content information. We first formally define the dynamic academic network and the collaboration prediction problem. Then, a scholar representation model is designed by capturing both the dynamic structure and content features, together with the macro-impact of overall academic trends. Finally, we build the prediction model based on the representation result of scholars. Extensive experiments for predicting new collaborations are conducted on the DBLP dataset. The experimental results on the accuracy, F1, and AUC metrics demonstrate that our method outperforms the baseline methods and can predict academic collaborations efficiently.
Similar content being viewed by others
References
Benchettara N, Kanawati R, Rouveirol C (2010) A supervised machine learning link prediction approach for academic collaboration recommendation. In: Proceedings of the Fourth ACM Conference on Recommender Systems, ACM, RecSys ’10, p 253–256, https://doi.org/10.1145/1864708.1864760, event-place: Barcelona, Spain
Bian R, Koh YS, Dobbie G, Divoli A (2019) Network embedding and change modeling in dynamic heterogeneous networks. In: Proceedings of the 42Nd International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM, SIGIR’19, p 861–864, https://doi.org/10.1145/3331184.3331273, event-place: Paris, France
Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 3:993–1022
Chen HH, Gou L, Zhang X, Giles CL (2011) Collabseer: A search engine for collaboration discovery. In: Proceedings of the 11th Annual International ACM/IEEE Joint Conference on Digital Libraries, ACM, JCDL ’11, p 231–240, https://doi.org/10.1145/1998076.1998121, event-place: Ottawa, Ontario, Canada
Cho K, van Merriënboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y (2014) Learning phrase representations using rnn encoder–decoder for statistical machine translation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, p 1724–1734, https://doi.org/10.3115/v1/D14-1179
Cohen S, Ebel L (2013) Recommending collaborators using keywords. In: Proceedings of the 22Nd International Conference on World Wide Web, ACM, WWW ’13 Companion, p 959–962, https://doi.org/10.1145/2487788.2488091, event-place: Rio de Janeiro, Brazil
Dong Y, Chawla NV, Swami A (2017a) Metapath2vec: Scalable representation learning for heterogeneous networks. In: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, KDD ’17, p 135–144, https://doi.org/10.1145/3097983.3098036, event-place: Halifax, NS, Canada
Dong Y, Ma H, Shen Z, Wang K (2017b) A century of science: Globalization of scientific collaborations, citations, and innovations. In: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, KDD ’17, p 1437–1446, https://doi.org/10.1145/3097983.3098016, event-place: Halifax, NS, Canada
Grover A, Leskovec J (2016) Node2vec: Scalable feature learning for networks. In: Proceedings of the 22Nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, KDD ’16, p 855–864, https://doi.org/10.1145/2939672.2939754, event-place: San Francisco, California, USA
Han S, He D, Jiang J, Yue Z (2013) Supporting exploratory people search: A study of factor transparency and user control. In: Proceedings of the 22Nd ACM International Conference on Information & Knowledge Management, ACM, CIKM ’13, p 449–458, https://doi.org/10.1145/2505515.2505684, event-place: San Francisco, California, USA
Kong X, Jiang H, Bekele TM, Wang W, Xu Z (2017) Random walk-based beneficial collaborators recommendation exploiting dynamic research interests and academic influence. In: Proceedings of the 26th International Conference on World Wide Web Companion, International World Wide Web Conferences Steering Committee, WWW ’17 Companion, p 1371–1377, https://doi.org/10.1145/3041021.3051154, event-place: Perth, Australia
Kong X, Shi Y, Yu S, Liu J, Xia F (2019) Academic social networks: modeling, analysis, mining and applications. J Netw Comput Appl 132:86–103. https://doi.org/10.1016/j.jnca.2019.01.029
Leahey E (2016) From sole investigator to team scientist: trends in the practice and study of research collaboration. Annu Rev Sociol 42(1):81–100. https://doi.org/10.1146/annurev-soc-081715-074219
Lee S, Bozeman B (2005) The impact of research collaboration on scientific productivity. Soc Stud Sci 35(5):673–702. https://doi.org/10.1177/0306312705052359
Li J, Xia F, Wang W, Chen Z, Asabere NY, Jiang H (2014) Acrec: A co-authorship based random walk model for academic collaboration recommendation. In: Proceedings of the 23rd International Conference on World Wide Web, ACM, WWW ’14 Companion, p 1209–1214, https://doi.org/10.1145/2567948.2579034, event-place: Seoul, Korea
Liang W, Zhou X, Huang S, Hu C, Xu X, Jin Q (2018) Modeling of cross-disciplinary collaboration for potential field discovery and recommendation based on scholarly big data. Future Gener Comput Syst 87:591–600. https://doi.org/10.1016/j.future.2017.12.038
Liu Z, Xie X, Chen L (2018) Context-aware academic collaborator recommendation. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, ACM, KDD ’18, p 1870–1879, https://doi.org/10.1145/3219819.3220050, event-place: London, United Kingdom
Mahdavi S, Khoshraftar S, An A (2018) dynnode2vec: Scalable dynamic network embedding. In: Proceedings of the 2018 IEEE International Conference on Big Data, IEEE, p 3762–3765, https://doi.org/10.1109/BigData.2018.8621910
Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Proceedings of the 26th International Conference on Neural Information Processing Systems, p 3111–3119
Nguyen GH, Lee JB, Rossi RA, Ahmed NK, Koh E, Kim S (2018) Dynamic network embeddings: From random walks to temporal random walks. In: Proceedings of the 2018 IEEE International Conference on Big Data, IEEE, p 1085–1092, https://doi.org/10.1109/BigData.2018.8622109
Perozzi B, Al-Rfou R, Skiena S (2014) Deepwalk: Online learning of social representations. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, KDD ’14, p 701–710, https://doi.org/10.1145/2623330.2623732, event-place: New York, New York, USA
Pham P, Do P (2019) W-metapath2vec: the topic-driven meta-path-based model for large-scaled content-based heterogeneous information network representation learning. Expert Syst Appl 123:328–344. https://doi.org/10.1016/j.eswa.2019.01.015
Schuster M, Paliwal K (1997) Bidirectional recurrent neural networks. IEEE Trans Signal Process 45(11):2673–2681. https://doi.org/10.1109/78.650093
Sinha A, Shen Z, Song Y, Ma H, Eide D, Hsu BJP, Wang K (2015) An overview of microsoft academic service (mas) and applications. In: Proceedings of the 24th International Conference on World Wide Web, ACM, WWW ’15 Companion, p 243–246, https://doi.org/10.1145/2740908.2742839
Sun Y, Barber R, Gupta M, Aggarwal CC, Han J (2011a) Co-author relationship prediction in heterogeneous bibliographic networks. In: Proceedings of the 2011 International Conference on Advances in Social Networks Analysis and Mining, IEEE, p 121–128, https://doi.org/10.1109/ASONAM.2011.112
Sun Y, Han J, Yan X, Yu PS, Wu T (2011b) Pathsim: meta path-based top-k similarity search in heterogeneous information networks. PVLDB 4(11):992–1003
Tang J, Zhang J, Yao L, Li J, Zhang L, Su Z (2008) Arnetminer: Extraction and mining of academic social networks. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, KDD ’08, p 990–998, https://doi.org/10.1145/1401890.1402008
Tang J, Wu S, Sun J, Su H (2012) Cross-domain collaboration recommendation. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, KDD ’12, p 1285–1293, https://doi.org/10.1145/2339530.2339730, event-place: Beijing, China
Tong H, Faloutsos C, Pan JY (2008) Random walk with restart: fast solutions and applications. Knowl Inf Syst 14(3):327–346. https://doi.org/10.1007/s10115-007-0094-2
Wang W, Bai X, Xia F, Bekele TM, Su X, Tolba A (2017) From triadic closure to conference closure: the role of academic conferences in promoting scientific collaborations. Scientometrics 113(1):177–193. https://doi.org/10.1007/s11192-017-2468-x
Wang W, Liu J, Yang Z, Kong X, Xia F (2019) Sustainable collaborator recommendation based on conference closure. IEEE Trans Comput Soc Syst 6(2):311–322. https://doi.org/10.1109/TCSS.2019.2898198
Wuchty S, Jones BF, Uzzi B (2007) The increasing dominance of teams in production of knowledge. Science 316(5827):1036–1039. https://doi.org/10.1126/science.1136099
Yang C, Liu Z, Zhao D, Sun M, Chang EY (2015) Network representation learning with rich text information. In: Proceedings of the 24th International Conference on Artificial Intelligence, AAAI Press, IJCAI’15, p 2111–2117, event-place: Buenos Aires, Argentina
Zhang C, Bu Y, Ding Y, Xu J (2018) Understanding scientific collaboration: homophily, transitivity, and preferential attachment. J Assoc Inf Sci Technol 69(1):72–86. https://doi.org/10.1002/asi.23916
Zhang C, Swami A, Chawla NV (2019) Shne: Representation learning for semantic-associated heterogeneous networks. In: Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, ACM, WSDM ’19, p 690–698, https://doi.org/10.1145/3289600.3291001, event-place: Melbourne VIC, Australia
Zhang Y, Zhang C, Liu X (2017) Dynamic scholarly collaborator recommendation via competitive multi-agent reinforcement learning. In: Proceedings of the Eleventh ACM Conference on Recommender Systems, ACM, RecSys ’17, p 331–335, https://doi.org/10.1145/3109859.3109914, event-place: Como, Italy
Zhou X, Ding L, Li Z, Wan R (2017) Collaborator recommendation in heterogeneous bibliographic networks using random walks. Inf Retr J 20(4):317–337. https://doi.org/10.1007/s10791-017-9300-3
Acknowledgements
The authors would like to thank the editor and the anonymous reviewers for their constructive comments. This work was supported by the National Nature Science Foundation of China [grant number 61671157] and the Major Project of Philosophy and Social Science Research, Ministry of Education of China [grant number 19JZD010].
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Zhao, W., Pu, S. Collaboration prediction in heterogeneous academic network with dynamic structure and topic. Knowl Inf Syst 63, 2053–2074 (2021). https://doi.org/10.1007/s10115-021-01580-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-021-01580-6