Text Mining Approach for Identifying Research Trends

Author:
Snezhana Sulova

University of Economics - Varna, Bulgaria

CompSysTech '21: Proceedings of the 22nd International Conference on Computer Systems and Technologies, June 2021, Pages 93–98, https://doi.org/10.1145/3472410.3472433

Published: 07 October 2021

ABSTRACT

As the volume of unstructured data grows, automatic text processing, document categorization, and topic discovery are attracting increasing interest. To improve the grouping and processing of research publications, we propose a method based on natural language processing. The method uses text mining techniques to identify key tendencies in documents: it clusters publications by their content and then identifies the topics of each resulting cluster. This analysis helps identify key tendencies and discover emerging areas of research. The approach was tested on publications retrieved from the Scopus research literature database on the topic “the application of digital technologies in the logistics business”. The experiments were carried out with the RapidMiner Studio software.
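The abstract's pipeline (cluster publications by content, then name the topic of each cluster) can be illustrated with a minimal sketch. This is not the paper's RapidMiner Studio process: it is a plain-Python stand-in that assumes TF-IDF weighting, cosine similarity, and a K-Means-style loop with fixed seeds. The four document titles, the stopword list, and all function names are hypothetical and exist only for illustration.

```python
import math
from collections import Counter

# Hypothetical toy corpus; the paper's actual Scopus records are not reproduced here.
docs = [
    "blockchain traceability in supply chain logistics",
    "blockchain smart contracts for supply chain",
    "internet of things sensors in warehouse monitoring",
    "warehouse automation with internet of things sensors",
]
STOPWORDS = {"the", "of", "in", "for", "and", "a", "to", "with", "on"}  # assumed list

def tokenize(text):
    """Lowercase, split on whitespace, drop stopwords and non-alphabetic tokens."""
    return [w for w in text.lower().split() if w.isalpha() and w not in STOPWORDS]

def tfidf_vectors(texts):
    """Return one L2-normalised sparse TF-IDF vector (a dict) per document."""
    tokenized = [tokenize(t) for t in texts]
    n = len(texts)
    df = Counter(term for toks in tokenized for term in set(toks))
    vectors = []
    for toks in tokenized:
        tf = Counter(toks)
        # Smoothed inverse document frequency.
        vec = {t: c * (math.log((1 + n) / (1 + df[t])) + 1.0) for t, c in tf.items()}
        norm = math.sqrt(sum(v * v for v in vec.values())) or 1.0
        vectors.append({t: v / norm for t, v in vec.items()})
    return vectors

def cosine(a, b):
    """Dot product of two unit-length sparse vectors."""
    if len(a) > len(b):
        a, b = b, a
    return sum(v * b.get(t, 0.0) for t, v in a.items())

def mean_vector(vecs):
    """Normalised sum of the member vectors, used as the cluster centroid."""
    total = Counter()
    for v in vecs:
        for t, w in v.items():
            total[t] += w
    norm = math.sqrt(sum(w * w for w in total.values())) or 1.0
    return {t: w / norm for t, w in total.items()}

def kmeans(vectors, seeds, iterations=20):
    """K-Means-style clustering on cosine similarity, seeded with given documents."""
    centroids = [dict(vectors[i]) for i in seeds]
    labels = [-1] * len(vectors)
    for _ in range(iterations):
        new_labels = [
            max(range(len(centroids)), key=lambda c: cosine(v, centroids[c]))
            for v in vectors
        ]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for c in range(len(centroids)):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:
                centroids[c] = mean_vector(members)
    return labels, centroids

def top_terms(centroid, k=3):
    """Highest-weighted centroid terms serve as a rough topic label."""
    ranked = sorted(centroid.items(), key=lambda kv: (-kv[1], kv[0]))
    return [t for t, _ in ranked[:k]]

vectors = tfidf_vectors(docs)
labels, centroids = kmeans(vectors, seeds=(0, 2))
for c, centroid in enumerate(centroids):
    print(c, top_terms(centroid))
```

Fixed seeds keep the sketch deterministic. A real run would normally use random or k-means++ initialisation and choose the number of clusters with a validity measure.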

Supplemental Material

p93-sulova-supplement.pptx (1.6 MB): presentation slides

Published in
CompSysTech '21: Proceedings of the 22nd International Conference on Computer Systems and Technologies
June 2021
230 pages
ISBN: 9781450389822
DOI: 10.1145/3472410
Editors: Tzvetomir Vassilev, Roumen Trifonov
Copyright © 2021 ACM
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
Clustering
K-Means
Research trends
Text mining
Qualifiers
- research-article
- Research
- Refereed limited
Acceptance Rates
Overall Acceptance Rate: 241 of 492 submissions, 49%