skip to main content
10.1145/3632410.3632481acmotherconferencesArticle/Chapter ViewAbstractPublication PagescomadConference Proceedingsconference-collections
extended-abstract

CiteDEK: A hybrid EMD-KNN-DTW model for classification of paper citation trajectories

Authors Info & Claims
Published:04 January 2024Publication History

ABSTRACT

Classifying citation trajectories of scientific publications is crucial. However, they diffuse anomalously due to non-linear, non-stationary, and long-ranged correlations. Previous studies define hard thresholds, arbitrary parameters, and subjective rules to classify based on their rise and fall patterns. It leads to substantial variance and, thus, ambiguous classification. This paper proposes CiteDEK, a hybrid EMD-kNN-DTW classification model framework. It predicts the nature of 5,039 trajectories, each 30 years in length, using only raw time series. We get a classification accuracy of ≈ 76%, and Cohen’s kappa-statistic is 0.63, which is significant.

References

  1. Joyita Chakraborty, Dinesh K Pradhan, and Subrata Nandi. 2023. A multiple k-means cluster ensemble framework for clustering citation trajectories. arXiv preprint arXiv:2309.04949 (2023).Google ScholarGoogle Scholar
  2. Tanmoy Chakraborty, Suhansanu Kumar, Pawan Goyal, Niloy Ganguly, and Animesh Mukherjee. 2015. On the categorization of scientific citation profiles in computer science. Commun. ACM 58, 9 (2015), 82–90.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Giovanni Colavizza and Massimo Franceschet. 2016. Clustering citation histories in the Physical Review. Journal of Informetrics 10, 4 (2016), 1037–1051.Google ScholarGoogle ScholarCross RefCross Ref
  4. Zhenyu Gou, Fan Meng, Zaida Chinchilla-Rodríguez, and Yi Bu. 2022. Encoding the citation life-cycle: the operationalization of a literature-aging conceptual model. Scientometrics 127, 8 (2022), 5027–5052.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Dinesh K Pradhan, Joyita Chakraborty, and Subrata Nandi. 2019. Applications of machine learning in analysis of citation network. In Proceedings of the ACM India joint international conference on data science and management of data. 330–333.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Andrew J Quinn, Vitor Lopes-dos Santos, David Dupret, Anna Christina Nobre, and Mark W Woolrich. 2021. EMD: Empirical mode decomposition and Hilbert-Huang spectral analyses in Python. Journal of open source software 6, 59 (2021).Google ScholarGoogle ScholarCross RefCross Ref
  7. Fred Y Ye and Lutz Bornmann. 2018. “Smart girls” versus “sleeping beauties” in the sciences: The identification of instant and delayed recognition by using the citation angle. Journal of the Association for Information Science and Technology 69, 3 (2018), 359–367.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Maryam Zamani, Erez Aghion, Peter Pollner, Tamas Vicsek, and Holger Kantz. 2021. Anomalous diffusion in the citation time series of scientific publications. Journal of Physics: Complexity 2, 3 (2021), 035024.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. CiteDEK: A hybrid EMD-KNN-DTW model for classification of paper citation trajectories
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            CODS-COMAD '24: Proceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD)
            January 2024
            627 pages

            Copyright © 2024 Owner/Author

            Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 4 January 2024

            Check for updates

            Qualifiers

            • extended-abstract
            • Research
            • Refereed limited
          • Article Metrics

            • Downloads (Last 12 months)21
            • Downloads (Last 6 weeks)6

            Other Metrics

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format .

          View HTML Format