A novel emerging topic detection method: A knowledge ecology perspective

https://doi.org/10.1016/j.ipm.2021.102843Get rights and content

Abstract

Emerging topic detection has attracted considerable attention in recent times. While various detection approaches have been proposed in this field, designing a method for accurately detecting emerging topics remains challenging. This paper introduces the perspective of knowledge ecology to the detection of emerging topics and utilizes author-keywords to represent research topics. More precisely, we first improve the novelty metric and recalculate emergence capabilities based on the “ecostate” and “ecorole” attributes of ecological niches. Then, we take the perspective that keywords are analogous to living bodies and map them to the knowledge ecosystem to construct an emerging topics detection method based on ecological niches (ETDEN). Finally, we conduct in-depth comparative experiments to verify the effectiveness and feasibility of ETDEN using data extracted from scientific literature in the ACM Digital Library database. The results demonstrate that the improved novelty indicator helps to differentiate the novelty values of keywords in the same interval. More importantly, ETDEN performs significantly better performance on three terms: the emergence time point and the growth rate of pre-and post-emergence.

Introduction

The knowledge environment is constantly evolving and developing; new knowledge constantly emerges while old knowledge becomes obsolete. The evolution of knowledge discriminates between old and new knowledge by selecting those knowledge units that fit the environmental constraints in which they live (Firestone, 2008). Like organisms, knowledge units experience a full life cycle, which generally includes creation, mobilization, diffusion, and integration. (Stary, 2014). Furthermore, this process has been also subtly summarized as a four-stage S-shaped curve: birth, growth, maturity, and senility (Braun et al., 2000; Van den Oord & Van Witteloostuijn, 2018). Therefore, the evolution of knowledge units in a knowledge ecosystem follows certain universal laws that provide strong theoretical support for detecting emerging research topics.

The study of emerging topic detection has tended to primarily focus on the design of bibliometric indicators that reveal the nature of emergence (Xu et al., 2019). More specifically, many bibliometric indicators for identifying emerging topics from different aspects have been proposed, such as novelty (Tu & Seng, 2012; Wang, 2018; Porter et al., 2019), growth (Ohniwa & Hibino, 2010; Coccia, 2012; Dang et al., 2016), community (Yu et al., 2016; Yoon et al., 2018; Yoo et al., 2019), and uncertainty (Yu et al., 2016). It has been observed that the novelty and growth indicators often appear in the scientific literature on emerging topic detection. However, the indicators proposed in each article above are different and arbitrary, which may result in the lack of well-established linkages between the concept of an emerging topic and these operationalization indicators (Xu et al., 2019). Thus, to accurately detect emerging research topics, an alternative idea should be used to re-examine extant studies. Scientific exploration was conceptualized as a search in a high-dimensional abstract landscape of problems. There exists a clear analogy with the spatial model of evolutionary biology (Börner & Scharnhorst, 2009). Several computational models and basic theories from evolutionary biology can be employed in the study of topic evolution. Researchers were inspired to explore the evolutionary laws of research topics from an ecological perspective (Van den Oord & Van Witteloostuijn, 2018). These studies are instructive in terms of exploring a novel method of emerging topic detection. Namely, the ecological niche theory can be used during the process of detecting emerging topics from a knowledge ecology perspective.

The primary objective of this study is to propose a novel method of emerging topics detection from the perspective of knowledge ecology. More specifically, we first introduce the ecological niche theory to extend the idea of emerging topic detection, which provides two dimensions for classifying bibliometric indicators based on the “ecostate” and “ecorole” attributes. Second, we improve the novelty indicator using a weighting modified method to differentiate the novelty values of research topics in the same interval; we also calculate the slope value of growth at the detection point (niche value) to represent the emergence capability in terms of the “ecorole” attribute of ecological niches. Third, keywords are analogized with living bodies and mapped to the knowledge ecosystem to construct a niche baseline for detecting emerging topics. Finally, we conduct in-depth comparative experiments to verify the effectiveness and feasibility of our approach using data extracted from scientific literature in the ACM Digital Library database.

The rest of this paper is organized as follows: Section 2 reviews ecological niche theory and knowledge ecology, previous studies on bibliometric indicators of emerging topics, and keyword-based emerging topic detection. Section 3 describes the material and methods. Section 4 primarily discusses the study's results. Finally, we conclude with an overview of our findings and implications in Section 5.

Section snippets

Ecological niche theory and knowledge ecology

The ecosystem has typically been thought of as an explicit analogy to academics since it involves actors and their dynamic activities. Currently, academic big data has revived the idea of an “ecology of science” due to the discovery of potential laws that have exhibited sufficient accuracy. Several computational models from evolutionary biology can be adopted in research regarding evolution. Ecological niche theory is an analytical framework for explaining and describing how individuals adapt

Material and methods

According to ecological niche theory, after emerging biological units have broken through a specific natural environment boundary and reached the minimum habitat threshold (Grubb, 1977), they have acquired life energy and can survive steadily in their ecosystem. Likewise, the knowledge units’ emergence capabilities also reflect a stable state in the knowledge ecosystem in addition to unified concept connotations and competitive ability. In this regard, the crux of emergence detection is to

Results and Discussion

When reviewing the literature, an increasing number of bibliometric indicators have been proposed to detect emerging topics. However, there is no consensus among these bibliometric indicators, and there is a lack of specific dimensions for classifying them. Thus, it is essential for the basic theory or approach to establish solid linkages between the concept of emergence and the proposed operationalization indicators. In contrast with the past, academic big data revived the idea of an “ecology

Conclusion

This paper proposed a novel emerging topic detection method based on ecological niches from the perspective of knowledge ecology. Specifically, the research method for emerging topic detection was extended to provide two dimensions for classifying bibliometric indicators based on the “ecostate” and “ecorole” twofold attributes of ecological niches. In addition, the novelty indicator was improved by a weighting modified method that differentiates the novelty values of research topics in the same

CRediT authorship contribution statement

Jinqing Yang: Conceptualization, Methodology, Writing – original draft, Writing – review & editing. Wei Lu: Conceptualization, Methodology, Formal analysis, Supervision. Jiming Hu: Conceptualization, Writing – original draft, Writing – review & editing. Shengzhi Huang: Data curation, Formal analysis.

Acknowledgments

This work was partially supported by Major Projects of National Social Science Foundation of China (no. 17ZDA292), National Natural Science Foundation of China (no. 71874125) and The Young Top-notch Talent Cultivation Program of Hubei Province. We are also grateful to editors and anonymous reviewers for their helpful and valuable comments on our work.

Reference (61)

  • J. Savoy

    Authorship attribution based on a probabilistic topic model

    Information Processing & Management

    (2013)
  • H. Seol et al.

    Demand forecasting for new media services with consideration of competitive relationships using the competitive Bass model and the theory of the niche

    Technol. Forecast. Soc. Chang.

    (2012)
  • H. Small et al.

    Identifying emerging topics in science and technology

    Research policy

    (2014)
  • A. Smith et al.

    Innovation studies and sustainability transitions: the allure of the multi-level perspective and its challenges

    Res. Policy

    (2010)
  • R. Srinivasan

    Sources, characteristics and effects of emerging technologies: Research opportunities in innovation

    Industrial Marketing Management

    (2008)
  • Y.N. Tu et al.

    Indices of novelty for emerging topic detection

    Information processing & management

    (2012)
  • I. Vulić et al.

    Probabilistic topic modeling in multilingual settings: An overview of its methodology and applications

    Information Processing & Management

    (2015)
  • R. Wang et al.

    Atm: Adversarial-neural topic model

    Information Processing & Management

    (2019)
  • C. Wu et al.

    Learning entity-centric document representations using an entity facet topic model

    Information Processing & Management

    (2020)
  • S. Xu et al.

    Emerging topics detection with multiple machine learning models

    Journal of Informetrics

    (2019)
  • H. Yu et al.

    Assessment on the research trend of low-carbon energy technology investment: A bibliometric analysis

    Appl. Energy

    (2016)
  • J. Adams

    The rise of research networks

    Nature

    (2012)
  • B.K. Bao et al.

    Cross-platform emerging topic detection and elaboration from multimedia streams

    ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)

    (2015)
  • J.A.C. Baum et al.

    Organizational niches and the dynamics of organizational founding

    Organ. Sci.

    (1994)
  • K.W. Boyack et al.

    Mapping the backbone of science

    Scientometrics

    (2005)
  • R.R. Braam et al.

    Mapping of science by combined co-citation and word analysis. I. Structural aspects

    J. Am. Soc. Inf. Sci.

    (1991)
  • T. Braun et al.

    Growth and trends of fullerene research as reflected in its journal literature

    Chem. Rev.

    (2000)
  • J.Y. Choi et al.

    A study on diffusion pattern of technology convergence: Patent analysis for Korea

    Sustainability

    (2015)
  • M. Coccia

    Evolutionary growth of knowledge in path-breaking targeted therapies for lung cancer: radical innovations and structure of the new technological paradigm

    International Journal of Behavioural and Healthcare Research

    (2012)
  • M. Coccia

    General properties of the evolution of research fields: a scientometric study of human microbiome, evolutionary robotics, and astrobiology

    Scientometrics

    (2018)
  • Cited by (8)

    • Measuring risk in science

      2023, Journal of Informetrics
    • Revisiting the exploration-exploitation behavior of scholars' research topic selection: Evidence from a large-scale bibliographic database

      2022, Information Processing and Management
      Citation Excerpt :

      After analyzing the career paths of scientists from the computer science field, Bu et al. (2018) found that the collaborators of high-impact scientists prefer to study diverse topics. Moreover, both funding agencies and individual researchers are interested in identifying and following emerging research topics (Liang et al., 2021; Lu et al., 2021; Tu and Seng, 2012; Xu et al., 2022; Yang et al., 2022). Keshavarz and Shekari (2020) recently collected and analyzed interview data from 391 postgraduate students.

    • Efficient Topic Detection Using an Adaptive Neural Network Architecture

      2024, Lecture Notes in Business Information Processing
    • Generating keyphrases for readers: A controllable keyphrase generation framework

      2023, Journal of the Association for Information Science and Technology
    View all citing articles on Scopus
    View full text