Skip to main content
Log in

Collaboration prediction in heterogeneous academic network with dynamic structure and topic

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

Academic collaborations improve research efficiency and spur scientific innovation. However, scholarly big data has hindered scholars from finding suitable collaborators. Although some studies have involved the prediction problem of academic collaborations, they neglect the rich dynamic information of the heterogeneous academic network. In this paper, we propose a prediction model for academic collaborations, which considers both the dynamic structure and content information. We first formally define the dynamic academic network and the collaboration prediction problem. Then, a scholar representation model is designed by capturing both the dynamic structure and content features, together with the macro-impact of overall academic trends. Finally, we build the prediction model based on the representation result of scholars. Extensive experiments for predicting new collaborations are conducted on the DBLP dataset. The experimental results on the accuracy, F1, and AUC metrics demonstrate that our method outperforms the baseline methods and can predict academic collaborations efficiently.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Notes

  1. https://academic.microsoft.com.

  2. https://jcr.clarivate.com/JCRJournalHomeAction.action.

  3. https://academic.microsoft.com/conferences.

  4. https://dblp.uni-trier.de/xml/.

  5. https://pytorch.org/.

References

  1. Benchettara N, Kanawati R, Rouveirol C (2010) A supervised machine learning link prediction approach for academic collaboration recommendation. In: Proceedings of the Fourth ACM Conference on Recommender Systems, ACM, RecSys ’10, p 253–256, https://doi.org/10.1145/1864708.1864760, event-place: Barcelona, Spain

  2. Bian R, Koh YS, Dobbie G, Divoli A (2019) Network embedding and change modeling in dynamic heterogeneous networks. In: Proceedings of the 42Nd International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM, SIGIR’19, p 861–864, https://doi.org/10.1145/3331184.3331273, event-place: Paris, France

  3. Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 3:993–1022

    MATH  Google Scholar 

  4. Chen HH, Gou L, Zhang X, Giles CL (2011) Collabseer: A search engine for collaboration discovery. In: Proceedings of the 11th Annual International ACM/IEEE Joint Conference on Digital Libraries, ACM, JCDL ’11, p 231–240, https://doi.org/10.1145/1998076.1998121, event-place: Ottawa, Ontario, Canada

  5. Cho K, van Merriënboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y (2014) Learning phrase representations using rnn encoder–decoder for statistical machine translation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, p 1724–1734, https://doi.org/10.3115/v1/D14-1179

  6. Cohen S, Ebel L (2013) Recommending collaborators using keywords. In: Proceedings of the 22Nd International Conference on World Wide Web, ACM, WWW ’13 Companion, p 959–962, https://doi.org/10.1145/2487788.2488091, event-place: Rio de Janeiro, Brazil

  7. Dong Y, Chawla NV, Swami A (2017a) Metapath2vec: Scalable representation learning for heterogeneous networks. In: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, KDD ’17, p 135–144, https://doi.org/10.1145/3097983.3098036, event-place: Halifax, NS, Canada

  8. Dong Y, Ma H, Shen Z, Wang K (2017b) A century of science: Globalization of scientific collaborations, citations, and innovations. In: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, KDD ’17, p 1437–1446, https://doi.org/10.1145/3097983.3098016, event-place: Halifax, NS, Canada

  9. Grover A, Leskovec J (2016) Node2vec: Scalable feature learning for networks. In: Proceedings of the 22Nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, KDD ’16, p 855–864, https://doi.org/10.1145/2939672.2939754, event-place: San Francisco, California, USA

  10. Han S, He D, Jiang J, Yue Z (2013) Supporting exploratory people search: A study of factor transparency and user control. In: Proceedings of the 22Nd ACM International Conference on Information & Knowledge Management, ACM, CIKM ’13, p 449–458, https://doi.org/10.1145/2505515.2505684, event-place: San Francisco, California, USA

  11. Kong X, Jiang H, Bekele TM, Wang W, Xu Z (2017) Random walk-based beneficial collaborators recommendation exploiting dynamic research interests and academic influence. In: Proceedings of the 26th International Conference on World Wide Web Companion, International World Wide Web Conferences Steering Committee, WWW ’17 Companion, p 1371–1377, https://doi.org/10.1145/3041021.3051154, event-place: Perth, Australia

  12. Kong X, Shi Y, Yu S, Liu J, Xia F (2019) Academic social networks: modeling, analysis, mining and applications. J Netw Comput Appl 132:86–103. https://doi.org/10.1016/j.jnca.2019.01.029

    Article  Google Scholar 

  13. Leahey E (2016) From sole investigator to team scientist: trends in the practice and study of research collaboration. Annu Rev Sociol 42(1):81–100. https://doi.org/10.1146/annurev-soc-081715-074219

    Article  Google Scholar 

  14. Lee S, Bozeman B (2005) The impact of research collaboration on scientific productivity. Soc Stud Sci 35(5):673–702. https://doi.org/10.1177/0306312705052359

    Article  Google Scholar 

  15. Li J, Xia F, Wang W, Chen Z, Asabere NY, Jiang H (2014) Acrec: A co-authorship based random walk model for academic collaboration recommendation. In: Proceedings of the 23rd International Conference on World Wide Web, ACM, WWW ’14 Companion, p 1209–1214, https://doi.org/10.1145/2567948.2579034, event-place: Seoul, Korea

  16. Liang W, Zhou X, Huang S, Hu C, Xu X, Jin Q (2018) Modeling of cross-disciplinary collaboration for potential field discovery and recommendation based on scholarly big data. Future Gener Comput Syst 87:591–600. https://doi.org/10.1016/j.future.2017.12.038

    Article  Google Scholar 

  17. Liu Z, Xie X, Chen L (2018) Context-aware academic collaborator recommendation. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, ACM, KDD ’18, p 1870–1879, https://doi.org/10.1145/3219819.3220050, event-place: London, United Kingdom

  18. Mahdavi S, Khoshraftar S, An A (2018) dynnode2vec: Scalable dynamic network embedding. In: Proceedings of the 2018 IEEE International Conference on Big Data, IEEE, p 3762–3765, https://doi.org/10.1109/BigData.2018.8621910

  19. Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Proceedings of the 26th International Conference on Neural Information Processing Systems, p 3111–3119

  20. Nguyen GH, Lee JB, Rossi RA, Ahmed NK, Koh E, Kim S (2018) Dynamic network embeddings: From random walks to temporal random walks. In: Proceedings of the 2018 IEEE International Conference on Big Data, IEEE, p 1085–1092, https://doi.org/10.1109/BigData.2018.8622109

  21. Perozzi B, Al-Rfou R, Skiena S (2014) Deepwalk: Online learning of social representations. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, KDD ’14, p 701–710, https://doi.org/10.1145/2623330.2623732, event-place: New York, New York, USA

  22. Pham P, Do P (2019) W-metapath2vec: the topic-driven meta-path-based model for large-scaled content-based heterogeneous information network representation learning. Expert Syst Appl 123:328–344. https://doi.org/10.1016/j.eswa.2019.01.015

    Article  Google Scholar 

  23. Schuster M, Paliwal K (1997) Bidirectional recurrent neural networks. IEEE Trans Signal Process 45(11):2673–2681. https://doi.org/10.1109/78.650093

    Article  Google Scholar 

  24. Sinha A, Shen Z, Song Y, Ma H, Eide D, Hsu BJP, Wang K (2015) An overview of microsoft academic service (mas) and applications. In: Proceedings of the 24th International Conference on World Wide Web, ACM, WWW ’15 Companion, p 243–246, https://doi.org/10.1145/2740908.2742839

  25. Sun Y, Barber R, Gupta M, Aggarwal CC, Han J (2011a) Co-author relationship prediction in heterogeneous bibliographic networks. In: Proceedings of the 2011 International Conference on Advances in Social Networks Analysis and Mining, IEEE, p 121–128, https://doi.org/10.1109/ASONAM.2011.112

  26. Sun Y, Han J, Yan X, Yu PS, Wu T (2011b) Pathsim: meta path-based top-k similarity search in heterogeneous information networks. PVLDB 4(11):992–1003

    Google Scholar 

  27. Tang J, Zhang J, Yao L, Li J, Zhang L, Su Z (2008) Arnetminer: Extraction and mining of academic social networks. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, KDD ’08, p 990–998, https://doi.org/10.1145/1401890.1402008

  28. Tang J, Wu S, Sun J, Su H (2012) Cross-domain collaboration recommendation. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, KDD ’12, p 1285–1293, https://doi.org/10.1145/2339530.2339730, event-place: Beijing, China

  29. Tong H, Faloutsos C, Pan JY (2008) Random walk with restart: fast solutions and applications. Knowl Inf Syst 14(3):327–346. https://doi.org/10.1007/s10115-007-0094-2

    Article  MATH  Google Scholar 

  30. Wang W, Bai X, Xia F, Bekele TM, Su X, Tolba A (2017) From triadic closure to conference closure: the role of academic conferences in promoting scientific collaborations. Scientometrics 113(1):177–193. https://doi.org/10.1007/s11192-017-2468-x

    Article  Google Scholar 

  31. Wang W, Liu J, Yang Z, Kong X, Xia F (2019) Sustainable collaborator recommendation based on conference closure. IEEE Trans Comput Soc Syst 6(2):311–322. https://doi.org/10.1109/TCSS.2019.2898198

    Article  Google Scholar 

  32. Wuchty S, Jones BF, Uzzi B (2007) The increasing dominance of teams in production of knowledge. Science 316(5827):1036–1039. https://doi.org/10.1126/science.1136099

    Article  Google Scholar 

  33. Yang C, Liu Z, Zhao D, Sun M, Chang EY (2015) Network representation learning with rich text information. In: Proceedings of the 24th International Conference on Artificial Intelligence, AAAI Press, IJCAI’15, p 2111–2117, event-place: Buenos Aires, Argentina

  34. Zhang C, Bu Y, Ding Y, Xu J (2018) Understanding scientific collaboration: homophily, transitivity, and preferential attachment. J Assoc Inf Sci Technol 69(1):72–86. https://doi.org/10.1002/asi.23916

    Article  Google Scholar 

  35. Zhang C, Swami A, Chawla NV (2019) Shne: Representation learning for semantic-associated heterogeneous networks. In: Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, ACM, WSDM ’19, p 690–698, https://doi.org/10.1145/3289600.3291001, event-place: Melbourne VIC, Australia

  36. Zhang Y, Zhang C, Liu X (2017) Dynamic scholarly collaborator recommendation via competitive multi-agent reinforcement learning. In: Proceedings of the Eleventh ACM Conference on Recommender Systems, ACM, RecSys ’17, p 331–335, https://doi.org/10.1145/3109859.3109914, event-place: Como, Italy

  37. Zhou X, Ding L, Li Z, Wan R (2017) Collaborator recommendation in heterogeneous bibliographic networks using random walks. Inf Retr J 20(4):317–337. https://doi.org/10.1007/s10791-017-9300-3

    Article  Google Scholar 

Download references

Acknowledgements

The authors would like to thank the editor and the anonymous reviewers for their constructive comments. This work was supported by the National Nature Science Foundation of China [grant number 61671157] and the Major Project of Philosophy and Social Science Research, Ministry of Education of China [grant number 19JZD010].

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Weidong Zhao.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhao, W., Pu, S. Collaboration prediction in heterogeneous academic network with dynamic structure and topic. Knowl Inf Syst 63, 2053–2074 (2021). https://doi.org/10.1007/s10115-021-01580-6

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-021-01580-6

Keywords

Navigation