Abstract
Many digital libraries, such as PubMed, Scopus, appeared with the growth of the Internet: thus, many scientific articles became available in the digital form. We got an opportunity to query articles metadata, gather statistics, build co-authorship graphs, etc. This includes estimating the authors/institutions activity, revealing their interactions and other properties.
In this work we present the analysis of the characteristics of institutions interactions in the miRNA science field using the data from PubMed digital library. To tackle the problem of the institution name writing variability, we proposed the k-mer/n-gram boolean feature vector sorting algorithm -KOFER. We identified the leaders of the field - China, USA -, characterized the interactions and described the country level features of co-authorship. We observed that the USA were leading in the publication activity until China took the lead 4 years ago. However, the USA are the main co-authorship driver in this field.
The work of I.T. was supported by the Federal Agency of Scientific Organizations (project #0324-2019-0040).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ncbi.nlm.nih.gov. Home - PubMed - NCBI (2019). https://www.ncbi.nlm.nih.gov/pubmed/. Accessed 17 Jan 2019
Titov, I., Blinov, A., Research of the structure and evolution of scientific co-authorship based on the analysis of Novosibirsk institutes publications in the biology and medicine science field. Vavilov J. Genet. Sel. 18 (2014)
Zhang, S., Wu, J., Zheng, D., Meng, Y., Yu, H.: An adaptive method for organization name disambiguation with feature reinforcing. In: 26th Pacific Asia Conference on Language, Information and Computation, pp. 237–245 (2012)
Polat, N.: Experiments on company name disambiguation with supervised classification techniques. In: 2013 International Conference on Electronics, Computer and Computation (ICECCO) (2013). https://doi.org/10.1109/ICECCO.2013.6718248
Bell Hirschberg, J., Rosenberg, A.: V-Measure: a conditional entropy-based external cluster evaluation. In: EMNLP, Prague (2007)
Rajaraman, A., Ulman, J.: Data mining. In: I.stanford.edu (2011). http://i.stanford.edu/ullman/mmds/ch1.pdf. Accessed 20 Jan 2019
Natural Language Toolkit - NLTK 3.4 documentation. In: Nltk.org. https://www.nltk.org/. Accessed 20 Jan 2019
Language-check. In: PyPI. https://pypi.org/project/language-check/. Accessed 20 Jan 2019
Scikit-learn: machine learning in Python - scikit-learn 0.20.2 documentation. In: Scikit-learn.org. https://scikit-learn.org/stable/. Accessed 20 Jan 2019
Ester, M., Kriegel, H., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. AAAI Press (1996)
Dice, L.R.: Measures of the Amount of Ecologic Association Between Species (1945)
Fortunato, S., et al.: Science of science. Science (2018). https://doi.org/10.1126/science.aao0185
Cohen, W.W., Ravikumar, P.D., Fienberg, S.E.: A comparison of string distance metrics for name-matching tasks. IIWeb (2003)
Kamber, H.J.: Data Mining: Concepts and Technique. Morgan Kauffman, Burlington (2005)
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. 31, 264–323 (1999)
Ferreira, A.A., Gonçalves, M.A., Laender, A.H.F.: A brief survey of automatic methods for author name. SIGMOD Rec. 41, 15–26 (2012)
Usoltcev, E.: meMo-Minsk - Overview. In: GitHub (2016). https://github.com/meMo-Minsk. Accessed 25 Jan 2019
Gephi - The Open Graph Viz Platform. In: Gephi.org. https://gephi.org/. Accessed 25 Jan 2019
Matplotlib: Python plotting - Matplotlib 3.0.2 documentation. In: Matplotlib.org. https://matplotlib.org/. Accessed 25 Jan 2019
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Firsov, A., Titov, I. (2019). Inter-country Competition and Collaboration in the miRNA Science Field. In: Bjørner, N., Virbitskaite, I., Voronkov, A. (eds) Perspectives of System Informatics. PSI 2019. Lecture Notes in Computer Science(), vol 11964. Springer, Cham. https://doi.org/10.1007/978-3-030-37487-7_3
Download citation
DOI: https://doi.org/10.1007/978-3-030-37487-7_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-37486-0
Online ISBN: 978-3-030-37487-7
eBook Packages: Computer ScienceComputer Science (R0)