Abstract
In this work, we describe a Linked Data portal, LD Connect, which operates on all bibliographic data produced by IOS Press over the past thirty-five years, including more than a hundred thousand papers, authors, affiliations, keywords, and so forth. However, LD Connect is more than just an RDF-based metadata set of bibliographic records. For example, all affiliations are georeferenced, and co-reference resolution has been performed on organizations and contributors including both authors and editors. The resulting knowledge graph serves as a public dataset, web portal, and query endpoint, and it acts as a data backbone for IOS Press and various bibliographic analytics. In addition to the metadata, LD Connect is also the first portal of its kind that publicly shares document embeddings computed from the full text of all papers and knowledge graph embeddings based on the graph structure, thereby enabling semantic search and automated IOS Press scientometrics. These scientometrics run directly on top of the graph and combine it with the learned embeddings to automatically generate data visualizations, such as author and paper similarity over all journals. By making the involved ontologies, embeddings, and scientometrics all publicly available, we aim to share LD Connect services with not only the Semantic Web community but also the broader public to facilitate research and applications based on this large-scale academic knowledge graph. Particularly, the presented scientometric system generalizes beyond IOS Press data and can be deployed on top of other bibliographic datasets as well.
Z. Liu and M. ShiāBoth authors contributed equally to this work.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
- 11.
- 12.
- 13.
References
Bordes, A., Usunier, N., Garcia-Duran, A., Weston, J., Yakhnenko, O.: Translating embeddings for modeling multi-relational data. Adv. Neural Inf. Process. Syst. 26, 2787ā2795 (2013)
Cai, L., Yan, B., Mai, G., Janowicz, K., Zhu, R.: TransGCN: coupling transformation assumptions with graph convolutional networks for link prediction. In: Proceedings of the 10th International Conference on Knowledge Capture, pp. 131ā138 (2019)
Cohan, A., Feldman, S., Beltagy, I., Downey, D., Weld, D.S.: Specter: document-level representation learning using citation-informed transformers (2020). arXiv preprint arXiv:2004.07180
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: Bert: pre-training of deep bidirectional transformers for language understanding. In: NAACL-HLT (1) (2019)
Fan, M., Zhou, Q., Chang, E., Zheng, F.: Transition-based knowledge graph embedding with relational mapping properties. In: Proceedings of the 28th Pacific Asia Conference on Language, Information and Computing, pp. 328ā337 (2014)
Gao, S., Hu, Y., Janowicz, K., McKenzie, G.: A spatiotemporal scientometrics framework for exploring the citation impact of publications and scientists. In: Proceedings of the 21st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, pp. 204ā213 (2013)
Hirsch, J.E.: An index to quantify an individualās scientific research output. Proc. Natl. Acad. Sci. 102(46), 16569ā16572 (2005)
Hood, W.W., Wilson, C.S.: The literature of bibliometrics, scientometrics, and informetrics. Scientometrics 52(2), 291ā314 (2001). https://doi.org/10.1023/A:1017919924342
Hu, Y., Janowicz, K., McKenzie, G., Sengupta, K., Hitzler, P.: A linked-data-driven and semantically-enabled journal portal for scientometrics. In: International Semantic Web Conference, pp. 114ā129. Springer (2013)
Hu, Y., McKenzie, G., Yang, J.A., Gao, S., Abdalla, A., Janowicz, K.: A linked-data-driven web portal for learning analytics: data enrichment, interactive visualization, and knowledge discovery. In: LAK Workshops (2014)
Santha kumar, R., Kaliyaperumal, K.: A scientometric analysis of mobile technology publications. Scientometrics 105(2), 921ā939 (2015). https://doi.org/10.1007/s11192-015-1710-7
Le, Q., Mikolov, T.: Distributed representations of sentences and documents. In: International Conference on Machine Learning, pp. 1188ā1196 (2014)
Mai, G., Janowicz, K., Yan, B.: Combining text embedding and knowledge graph embedding techniques for academic search engines. In: Semdeep/NLIWoD@ ISWC, pp. 77ā88 (2018)
Mai, G., Janowicz, K., Yan, B., Zhu, R., Cai, L., Lao, N.: Multi-scale representation learning for spatial feature distributions using grid cells. In: International Conference on Learning Representations (2020)
McKenzie, G., Janowicz, K., Hu, Y., Sengupta, K., Hitzler, P.: Linked scientometrics: designing interactive scientometrics with linked data and semantic web reasoning (2013)
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. Adv. Neural Inf. Process. Syst. 26, 3111ā3119 (2013)
Peters, M.E., et al.: Deep contextualized word representations. In: Proceedings of NAACL-HLT, pp. 2227ā2237 (2018)
Regalia, B., Janowicz, K., Mai, G.: Phuzzy. link: a SPARQL-powered client-sided extensible semantic web browser. In: VOILA@ ISWC, pp. 34ā44 (2017)
Schlichtkrull, M., et al.: Modeling relational data with graph convolutional networks. In: Gangemi, A., et al. (eds.) ESWC 2018. LNCS, vol. 10843, pp. 593ā607. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-93417-4_38
Wang, Z., Zhang, J., Feng, J., Chen, Z.: Knowledge graph embedding by translating on hyperplanes. In: Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, pp. 1112ā1119. AAAI Press (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
Ā© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Liu, Z. et al. (2022). LD Connect: A Linked Data Portal for IOS Press Scientometrics. In: Groth, P., et al. The Semantic Web. ESWC 2022. Lecture Notes in Computer Science, vol 13261. Springer, Cham. https://doi.org/10.1007/978-3-031-06981-9_19
Download citation
DOI: https://doi.org/10.1007/978-3-031-06981-9_19
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-06980-2
Online ISBN: 978-3-031-06981-9
eBook Packages: Computer ScienceComputer Science (R0)