Text Mining in Cybersecurity: A Systematic Literature Review

Authors: Luciano Ignaczak, Guilherme Goldschmidt, Cristiano André Da Costa, Rodrigo Da Rosa Righi

ACM Computing Surveys (CSUR), Volume 54, Issue 7

Article No.: 140, Pages 1 - 36

https://doi.org/10.1145/3462477

Published: 18 July 2021

Abstract

The growth in data volume has changed cybersecurity activities, demanding a higher level of automation. In this new cybersecurity landscape, text mining has emerged as a way to improve the efficiency of activities involving unstructured data. This article presents a Systematic Literature Review (SLR) of the application of text mining in the cybersecurity domain. Using a systematic protocol, we identified 2,196 studies, of which 83 were summarized. As a contribution, we propose a taxonomy of the cybersecurity activities supported by text mining. We also detail the strategies evaluated in the application of text mining tasks and the use of neural networks to support activities involving unstructured data. The work also discusses text classification performance with a view to its application in real-world solutions. Finally, the SLR highlights open gaps for future research, such as the analysis of non-English content and greater use of neural networks.

Supplementary Material

a140-ignaczak-supp.pdf (ignaczak.zip)

Supplemental movie, appendix, image, and software files for Text Mining in Cybersecurity: A Systematic Literature Review



[154]

John Slankas, Xusheng Xiao, Laurie Williams, and Tao Xie. 2014. Relation extraction for inferring access control rules from natural language artifacts. In Proceedings of the 30th Annual Computer Security Applications Conference. ACM International Conference Proceeding Series, 366–375.

[155]

Robert Slonje and Peter K. Smith. 2008. Cyberbullying: Another main type of bullying? Scandinavian Journal of Psychology 49, 2 (2008), 147–154.

[156]

Melissa Smoker and Evita March. 2017. Predicting perpetration of intimate partner cyberstalking: Gender and the dark tetrad. Computers in Human Behavior 72 (2017), 390–396.

[157]

Marina Sokolova and Guy Lapalme. 2009. A systematic analysis of performance measures for classification tasks. Information Processing & Management 45, 4 (2009), 427–437.

[158]

Bhavna Soman. 2019. From unstructured data to actionable intelligence: Using machine learning for threat intelligence. Retrieved from https://www.microsoft.com/security/blog/2019/08/08/from-unstructured-data-to-actionable-intelligence-using-machine-learning-for-threat-intelligence/.

[159]

Georgios Spanos and Lefteris Angelis. 2016. The impact of information security events to the stock market: A systematic literature review. Computers & Security 58 (2016), 216–229.

[160]

Georgios Spanos and Lefteris Angelis. 2018. A multi-target approach to estimate software vulnerability characteristics and severity scores. Journal of Systems and Software 146 (2018), 152–166.

[161]

Dima Suleiman and Ghazi Al-Naymat. 2017. SMS spam detection using H2O framework. Procedia Computer Science 113 (2017), 154–161.

[162]

Nan Sun, Jun Zhang, Paul Rimba, Shang Gao, Leo Yu Zhang, and Yang Xiang. 2019. Data-driven cybersecurity incident prediction: A survey. IEEE Communications Surveys & Tutorials 21, 2 (Second quarter 2019), 1744–1772.

[163]

Ahmet Ali Süzen. 2020. A risk-assessment of cyber attacks and defense strategies in industry 4.0 ecosystem. International Journal of Computer Network & Information Security 12, 1 (2020).

[164]

Romilla Syed. 2018. Enterprise reputation threats on social media: A case of data breach framing. The Journal of Strategic Information Systems 28, 3 (2018), 257–274.

[165]

Romilla Syed, Maryam Rahafrooz, and Jeffrey M. Keisler. 2018. What it takes to get retweeted: An analysis of software vulnerability messages. Computers in Human Behavior 80 (2018), 207–215.

[166]

Symantec. 2019. Internet Security Threat Report. Retrieved from https://www.symantec.com/content/dam/symantec/docs/reports/istr-24-2019-en.pdf.

[167]

Chee-Wooi Ten, Govindarasu Manimaran, and Chen-Ching Liu. 2010. Cybersecurity for critical infrastructures: Attack and defense modeling. IEEE Transactions on Systems, Man, and Cybernetics-Part A: Systems and Humans 40, 4 (2010), 853–865.

[168]

Kutub Thakur, Juan Shan, and Al-Sakib Khan Pathan. 2018. Innovations of phishing defense: The mechanism, measurement and defense strategies. International Journal of Communication Networks and Information Security 10, 1 (2018), 19–27.

[169]

Dirk Thorleuchter and Dirk Van den Poel. 2012. Improved multilevel security with latent semantic indexing. Expert Systems with Applications 39, 18 (Dec. 2012), 13462–13471.

[170]

Monica Todros. 2018. Artificial Intelligence in Black and White. Retrieved from https://www.recordedfuture.com/artificial-intelligence-information-security/.

[171]

Andeep S. Toor, Harry Wechsler, Michele Nappi, and Kim-Kwang Raymond Choo. 2018. Visual question authentication protocol (VQAP). Computers & Security 76 (2018), 285–294.

[172]

Wiem Tounsi and Helmi Rais. 2018. A survey on technical threat intelligence in the age of sophisticated cyber attacks. Computers & Security 72 (2018), 212–233.

[173]

Alper Kursat Uysal and Serkan Gunal. 2014. The impact of preprocessing on text classification. Information Processing & Management 50, 1 (2014), 104–112.

[174]

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems. 5998–6008.

[175]

Ike Vayansky and Sathish A. P. Kumar. 2020. A review of topic modeling methods. Information Systems 94 (2020), 101582.

[176]

Maria Vergelis, Tatyana Shcherbakova, and Tatyana Sidorina. 2019. Spam and phishing in Q2 2019. Retrieved November 30, 2019 from https://securelist.com/spam-and-phishing-in-q2-2019/92379/.

[177]

Sokratis Vidros, Constantinos Kolias, Georgios Kambourakis, and Leman Akoglu. 2017. Automatic detection of online recruitment frauds: Characteristics, methods, and a public dataset. Future Internet 9, 1 (March 2017), 6.

[178]

G. Vinodhini and R. M. Chandrasekaran. 2016. A comparative performance evaluation of neural network based approach for sentiment classification of online reviews. Journal of King Saud University-Computer and Information Sciences 28, 1 (2016), 2–12.

[179]

Rossouw Von Solms and Johan Van Niekerk. 2013. From information security to cyber security. Computers & Security 38 (2013), 97–102.

[180]

Soroush Vosoughi, Deb Roy, and Sinan Aral. 2018. The spread of true and false news online. Science 359, 6380 (2018), 1146–1151.

[181]

Tara Walker. 2017. Hello Amazon Macie: Automatically Discover, Classify, and Secure Content at Scale. Retrieved from https://aws.amazon.com/blogs/aws/launch-amazon-macie-securing-your-s3-buckets/.

[182]

Peng Wang, Bo Xu, Jiaming Xu, Guanhua Tian, Cheng-Lin Liu, and Hongwei Hao. 2016. Semantic expansion using word embedding clustering and convolutional neural network for improving short text classification. Neurocomputing 174 (2016), 806–814.

[183]

Qihua Wang and Hongxia Jin. 2011. Data leakage mitigation for discretionary access control in collaboration clouds. In Proceedings of the 16th ACM Symposium on Access Control Models and Technologies. Association for Computing Machinery, 103–112. https://doi.org/10.1145/1998441.1998457

[184]

Tawei Wang, Karthik N. Kannan, and Jackie Rees Ulmer. 2013. The association between the disclosure and the realization of information security risk factors. Information Systems Research 24, 2 (June 2013), 201–218.

[185]

Zuoguang Wang, Hongsong Zhu, and Limin Sun. 2021. Social engineering in cybersecurity: Effect mechanisms, human vulnerabilities and attack methods. IEEE Access 9 (2021), 11895–11910.

[186]

Tao Wen, Yuqing Zhang, Ying Dong, and Gang Yang. 2015. A novel automatic severity vulnerability assessment framework. Journal of Communications 10, 5 (2015), 320–329. https://doi.org/10.12720/jcm.10.5.320-329

[187]

Tao Wen, Yuqing Zhang, Qianru Wu, and Gang Yang. 2015. ASVC: An automatic security vulnerability categorization framework based on novel features of vulnerability data. Journal of Communications 10, 2 (2015), 107–116.

[188]

Mark A. Williams, Sumi Dey, Roberto Camacho Barranco, Sheikh Motahar Naim, M. Shahriar Hossain, and Monika Akbar. 2019. Analyzing evolving trends of vulnerabilities in national vulnerability database. In Proceedings of the 2018 IEEE International Conference on Big Data (Big Data’18). Institute of Electrical and Electronics Engineers Inc., 3011–3020.

[189]

Alex S. Wilner. 2018. Cybersecurity and its discontents: Artificial intelligence, the Internet of Things, and digital misinformation. International Journal 73, 2 (2018), 308–316.

[190]

Lingyun Xiang, Xingming Sun, Gang Luo, and Bin Xia. 2012. Linguistic steganalysis using the features derived from synonym frequency. Multimedia Tools and Applications 71, 3 (Dec. 2012), 1893–1911.

[191]

Jiaming Xu, Bo Xu, Peng Wang, Suncong Zheng, Guanhua Tian, and Jun Zhao. 2017. Self-taught convolutional neural networks for short text clustering. Neural Networks 88 (2017), 22–31.

[192]

Yueshen Xu, Lei Li, Honghao Gao, Lei Hei, Rui Li, and Yihao Wang. 2020. Sentiment classification with adversarial learning and attention mechanism. Computational Intelligence (2020).

[193]

Jianwu Yang and Xiaoou Chen. 2002. A semi-structured document model for text mining. Journal of Computer Science and Technology 17, 5 (2002), 603–610.

[194]

Feng Yu, Qiang Liu, Shu Wu, Liang Wang, and Tieniu Tan. 2019. Attention-based convolutional approach for misinformation identification from massive and noisy microblog posts. Computers & Security 83 (2019), 106–121.

[195]

Munwar Ali Zardari and Low Tang Jung. 2016. Data security rules/regulations based classification of file data using TsF-kNN algorithm. Cluster Computing 19, 1 (Feb. 2016), 349–368.

[196]

ChengXiang Zhai and Sean Massung. 2016. Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining. Morgan & Claypool.

[197]

Rui Zhao and Kezhi Mao. 2017. Fuzzy bag-of-words model for document representation. IEEE Transactions on Fuzzy Systems 26, 2 (2017), 794–804.

[198]

Zhixuan Zhou, Huankang Guan, Meghana Moorthy Bhat, and Justin Hsu. 2019. Fake news detection via NLP is vulnerable to adversarial attacks. Retrieved April 2021 from https://arxiv.org/abs/1901.09657.

[199]

Tiantian Zhu, Hongyu Gao, Yi Yang, Kai Bu, Yan Chen, Doug Downey, Kathy Lee, and Alok N. Choudhary. 2016. Beating the artificial chaos: Fighting OSN spam using its own templates. IEEE/ACM Transactions on Networking 24, 6 (Dec. 2016), 3856–3869.

[200]

Elżbieta Zielińska, Wojciech Mazurczyk, and Krzysztof Szczypiorski. 2014. Trends in steganography. Communications of the ACM 57, 3 (2014), 86–95.

[201]

Raed Abu Zitar and Adel Hamdan. 2011. Genetic optimized artificial immune system in spam detection: A review and a model. Artificial Intelligence Review 40, 3 (Nov. 2011), 305–377.

Published In

ACM Computing Surveys Volume 54, Issue 7

September 2022

778 pages

ISSN:0360-0300

EISSN:1557-7341

DOI:10.1145/3476825

Editor:
Albert Zomaya
University of Sydney, Australia

Copyright © 2021 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 July 2021

Accepted: 01 April 2021

Revised: 01 August 2020

Received: 01 March 2020

Published in CSUR Volume 54, Issue 7
