Abstract
This study intends to describe the development and results of a software designed to analyze millions of articles in the area of Transportation Engineering. This tool intends to support Transportation Planning activities by providing additional information about trends, references and technologies. In order to develop this software, techniques from scientometrics, bibliometrics and informetrics were employed with the support of tools from Computer Science, such as Artificial Intelligence, Data Mining and Natural Language Processing. The result of this study is a structured database that allows browsing the change of interest in different topics along the years in areas related to Transportation Engineering. When analyzing a given area, the database is capable of identifying which authors published works in that area, allowing the identification of specialists and related papers. In addition, the software responsible for creating this database is capable of performing the same analysis in academic corpora of other areas of study.
Similar content being viewed by others
References
Aggarwal, N., Kumar, A., Khatter, H., & Kha, H. (2012). Analysis of the effect of data mining techniques on database. Advances in Engineering Software, 47(1), 164–169.
Aizawa, A. (2003). An information-theoretic perspective of tf–idf measures. Information Processing and Management, 39(1), 45–65.
Bastian M., Heymann S., & Jacomy M. (2009). Gephi: An open source software for exploring and manipulating networks. In International AAAI conference on weblogs and social media.
Björneborn, L., & Ingwersen, P. (2004). Toward a basic framework for webometrics. Journal of the American Society for Information Science and Technology, 55(14), 1216–1227.
Brin, S., & Page, L. (2001). Dynamic data mining: Exploring large rule space by sampling. Technical Report, Stanford: InfoLab.
Furlan, B., Batanović, V., & Nikolić, B. (2013). Semantic similarity of short texts in languages with a deficient natural language processing support. Decision Support Systems, 55(3), 710–719.
Garfield, E. (2000). Use of journal citation reports and journal performance indicators in measuring short and long-term journal impact. http://www.researchgate.net/publication/12262737_Use_of_Journal_Citation_Reports_and_Journal_Performance_Indicators_in_measuring_short_and_long_term_journal_impact. Accessed 14 Aug 2014.
Hea, W., Zhab, S., & Lia, L. (2013). Social media competitive analysis and text mining: A case study in the pizza industry. International Journal of Information Management, 33(3), 464–472.
Hong, T. P., Lin, C. W., Yang, K. T., & Wang, S. L. (2013). Using TF-IDF to hide sensitive itemsets. LLC, 38(4), 502–510.
Jurish, B., & Würzner, K. M. (2013). Word and sentence Tokenization with hidden Markov models. JLCL, 28(2), 61–83.
Leskovec, J., Rajaraman, A., & Ullman, J. D. (2014). Mining of massive datasets. Stanford: Cambridge University Press.
Leydesdorff, L. (2001). The challenge of scientometrics. Boca Raton: Universal Publishers.
Leydesdorff, L., & Milojević, S. (2013). Scientometrics school of informatics and computing. Indiana University. http://arxiv.org/ftp/arxiv/papers/1208/1208.4566.pdf. Accessed 01 July 2011.
Markscheffel, B. (2011). An ontology based visualization approach for the joined interpretation of bibliometrics and webometrics data. http://dl.acm.org/citation.cfm?id=2077520. Accessed 12 Aug 2015.
Princeton University. (2015). Wordnet. https://wordnet.princeton.edu/. Accessed 10 April 2015.
Sage, A. P. (1990). Concise encyclopedia of information processing in systems and organizations. New York: Pergamon.
Silva, J. A. D., & Bianchi, M. D. L. P. (2001). Cientometria: A métrica da ciência. 10.1590/S0103-863X2001000200002. Accessed 25 April 2014.
Spinak, E. (1996). Diccionario enciclopédico de Bibliometria, cienciometría e informetría. Caracas: Unesco.
Tague-Sutcliffe, J. (1992). An introduction to informetrics. http://dl.acm.org/citation.cfm?id=160642. Accessed 20 April 2014.
Van Noorden, R. (2014). Global scientific output doubles every nine years. Nature.com - New Blog. Accessed 07 May 2014.
Yue, X., Di, G., Yu, Y., Wang, W., & Shi, H. (2012). Analysis of the combination of natural language processing and search engine technology. In International workshop on information and electronics engineering (IWIEE).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
de Stefano, E., de Sequeira Santos, M.P. & Balassiano, R. Development of a software for metric studies of transportation engineering journals. Scientometrics 109, 1579–1591 (2016). https://doi.org/10.1007/s11192-016-2152-6
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11192-016-2152-6