Abstract
Recently, negative sequential patterns (NSP) (like missing medical treatments) mining is important in data mining research since it includes negative correlations between item sets, which are overlooked by positive sequential pattern mining (PSP) (for instance, utilization of medical service). Yet, discovering the NSP is very complex than finding PSP because of the important problem complexity occurred by high computational cost, non-occurring elements, as well as huge search space in evaluating NSC, and most of the NSP based existing works are inefficient. Therefore, this paper intends to propose a fast NSP mining algorithm for the disease prediction model. This model includes Data normalization, Data separation based on labels, and Pattern recognition phases. In the midst of data separation, the maximum occurring data is optimally selected using a new algorithm that hybridizes the FireFly (FF) algorithm and Grey Wolf Optimization (GWO). This proposed Firefly induced Grey Wolf optimization (F-GWO) algorithm automatically selects the maximum occurring information as per the PSP support. The proposed model is compared over other conventional methods with varied measures. Especially, the computation cost of our model is 46.87%, 6.27%, 9.37%, 2.76%, and 66.62% better than the existing GA, ABC, PSO, FF, and GWO models respectively.
Similar content being viewed by others
Abbreviations
- NSP:
-
Negative Sequential Patterns.
- PSP:
-
Positive Sequential Patterns.
- NSC:
-
Negative Sequential Candidates.
- FF:
-
FireFly.
- GWO:
-
Grey Wolf Algorithm.
- F-GWO:
-
Firefly Induced Gray Wolf optimization.
- FDR:
-
False Discovery Rate.
- FPR:
-
False positive rate.
- FNR:
-
False negative rate.
- NPV:
-
Negative Predictive Value.
- MCC:
-
Mathews correlation coefficient.
- NOB:
-
Non-Occurring Behaviors.
- ITS:
-
Intelligent Transport Systems.
- ST-NSP:
-
Set Theory-based NSP mining.
- HUNSP:
-
High Utility Negative Sequential Patterns.
- HUNSC:
-
High Utility Negative Sequential Candidates.
- GA:
-
Genetic Algorithm.
- e-NSPFI:
-
Efficient-Negative Sequential Pattern from both Frequent and Infrequent Positive Sequential Patterns.
- F-NSP:
-
Fast NSP Mining Algorithm.
- HUSP:
-
High Utility Sequential Patterns.
- ABC:
-
Artificial Bee colony.
- PSO:
-
Particle Swarm Optimization.
References
Alkan OK, Karagoz P (2015) CRoM and HuspExt: improving efficiency of high utility sequential pattern extraction. IEEE Trans Knowl Data Eng 27(10):2645–2657
Bansal M, Kumar M, Kumar M (2021) 2D object recognition: a comparative analysis of SIFT, SURF and ORB feature descriptors. Multimed Tools Appl 80(12):18839–18857
Bojja GR, Ofori M, Liu J, Loknath Sai Ambati (2020) Early public outlook on the coronavirus disease (COVID-19): A social media study
Buddhakulsomsiri J, Zakarian A (2009) Sequential pattern mining algorithm for automotive warranty data. Comput Ind Eng 57(1):137–147
Cai G, Hio C, Bermingham L, Lee K, Lee I (2014) Sequential pattern mining of geo-tagged photos with an arbitrary regions-of-interest detection method. Expert Syst Appl 41(7):3514–3526
Cao L, Dong X, Zheng Z (2016) E-NSP: efficient negative sequential pattern mining. Artif Intell 235:156–182
Cao L, Dong X, Zheng Z (2016) E-NSP: efficient negative sequential pattern mining. Artif Intell 235:156–182
Cao L, Dong X, Zheng Z (2016) E-NSP: efficient negative sequential pattern mining. Artif Intell 235:156–182
Chen Y, Peng W, Lee S (2015) Mining temporal patterns in time interval-based data. IEEE Trans Knowl Data Eng 27(12):3318–3331
Cheng Y, Lin Y, Chiang K, Tseng VS (2017) Mining sequential risk patterns from large-scale clinical databases for early assessment of chronic diseases: a case study on chronic obstructive pulmonary disease. IEEE Journal of Biomedical and Health Informatics 21(2):303–311
Chhabra P, Garg NK, Kumar M (2020) Content-based image retrieval system using ORB and SIFT features. Neural Comput & Applic 32(7):2725–2733
Chithra S, Kumari RM (2018) Economic emission dispatch in renewable energy systems using FireFly algorithm. Journal of Computational Mechanics, Power System and Control 1(1):18–25
Chung-Ching Y, Chen Y-L (2005) Mining sequential patterns from multidimensional sequence data. IEEE Trans Knowl Data Eng 17(1):136–140
Cui Y, Yang L, Zhao Z, Tang T, Yin M (2013) Sequential grouping heuristic for the two-dimensional cutting stock problem with pattern reduction. Int J Prod Econ 144(2):432–439
Dong X, Zheng Z, Cao L, Zhao Y, Zhang C, Li J, Wei W, Ou Y (2011) e-NSP: efficient negative sequential pattern mining based on identified positive patterns without database rescanning. In Proceedings of the 20th ACM international conference on Information and knowledge management, pp. 825–830
Dong X, Gong Y, Cao L (2018) F-NSP: a fast negative sequential patterns mining method with self-adaptive data storage. Pattern Recogn 84:13–27
Fournier-Viger P, Lin JC-W, Kiran RU, Koh YS, Thomas R (2017) A survey of sequential pattern mining. Data Science and Pattern Recognition 1(1):54–77
Gandomi AH, Yang X-S, Talatahari S, Alavi AH (2013) Firefly algorithm with chaos. Commun Nonlinear Sci Numer Simulat 18:89–98
Gong YS, Liu C, Dong X (2015) Research on typical algorithms in negative sequential pattern mining. Open Automation and Control Systems Journal 7:934–941
Gong Y, Xu T, Dong X et al (2017) E-NSPFI: efficient mining negative sequential pattern from both 690 frequent and infrequent positive sequential patterns. Int J Pattern Recognit Artif Intell 31(2)
Hsueh S-C, Lin M-Y, Chen C-L (2008) Mining negative sequential patterns for e-commerce recommendations. In 2008 IEEE Asia-Pacific Services Computing Conference, pp. 1213–1218. IEEE
Huang J, Tseng C, Ou J, Chen M (2008) A general model for sequential pattern mining with a progressive database. IEEE Trans Knowl Data Eng 20(9):1153–1167
Huynh B, Vo B, Snasel V (2017) An efficient parallel method for mining frequent closed sequential patterns. IEEE Access 5:17392–17402
Kaneiwaa K, Kudo Y (2011) A sequential pattern mining algorithm using rough set theory. Int J Approx Reason 52(6):881–893
Karaboga D, Basturk B (2008) On the performance of artificial bee colony (ABC) algorithm. Appl Soft Comput 8(1):687–697
Khare VK, Rastogi V (2013) Mining positive and negative sequential pattern in incremental transaction databases. International Journal of Computer Applications (0975–8887) 71(1):18–22
Kim C, Lim J-H, Ng RT, Shim K (2007) SQUIRE: sequential pattern mining with quantities. J Syst Softw 80(10):1726–1745
Kumar M, Kumar M (2021) XGBoost: 2D-Object Recognition Using Shape Descriptors and Extreme Gradient Boosting Classifier. In Computational Methods and Data Engineering, pp. 207–222. Springer, Singapore
Kumar M, Chhabra P, Garg NK (2018) An efficient content based image retrieval system using BayesNet and K-NN. Multimed Tools Appl 77(16):21557–21570
Kumar A, Kaur A, Kumar M (2019) Face detection techniques: a review. Artif Intell Rev 52(2):927–948
Lee NCJ, Kelly JR, Park HS, An Y, Judson BL, Burtness BA, Husain ZA (2018) Patterns of failure in high-metastatic node number human papillomavirus-positive oropharyngeal carcinoma. Oral Oncol 85:35–39
McCall J (2005) Genetic algorithms for modelling and optimisation. J Comput Appl Math 184(1):205–222
Min F, Zhang Z-H, Zhai W-J, Shen R-P (2018) Frequent pattern discovery with tri-partition alphabets", Inform,ation science, available online
Mirjalili S (2014) Seyed Mohammad Mirjalili and Andrew Lewis, " Grey wolf optimizer". Adv Eng Softw 69:46–61
Murugan TS, Sarkar A (2018) Optimal cluster head selection by hybridisation of firefly and grey wolf optimisation. Int J Wirel Mob Comput 14(3):296–305
Pedersen MEH, Chipperfield AJ (2010) Simplifying particle swarm optimization. Appl Soft Comput 10(2):618–628
Pei J et al (2004) Mining sequential patterns by pattern-growth: the PrefixSpan approach. IEEE Trans Knowl Data Eng 16(11):1424–1440
Rastogi V, Khare VK (2012) Apriori based: mining positive and negative frequent sequential patterns. International Journal of Latest Trends in Engineering and Technology (IJLTET) 1(3):24–33
Roy RG, Ghoshal D (2020) Grey wolf optimization-based second order sliding mode control for inchworm robot. Robotica 38(9):1539–1557
Singh S, Ahuja U, Kumar M, Kumar K, Sachdeva M (2021) Face mask detection using YOLOv3 and faster R-CNN models: COVID-19 environment. Multimed Tools Appl 80(13):19753–19768
Vinolin V (2019) Breast Cancer detection by optimal classification using GWO algorithm. Multimedia Research 2(2):10–18
Wagh MB, Gomathi N (2019) Improved GWO-CS algorithm-based optimal routing strategy in VANET. Journal of Networking and Communication Systems 2(1):34–42
Xu T, Li T, Dong X (2018) Efficient high utility negative sequential patterns Mining in Smart Campus. IEEE Access 6:23839–23847
Xu T, Li T, Dong X (2018) Efficient high utility negative sequential patterns Mining in Smart Campus. IEEE Access 6:23839–23847
Zhang J, Wang Y, Zhang C, Shi Y (2016) Mining contiguous sequential generators in biological sequences. IEEE/ACM Transactions on Computational Biology and Bioinformatics 13(5):855–867
Zheng Z, Zhao Y, Zuo Z, Cao L (2009) Negative-GSP: An efficient method for mining negative sequential patterns. In Conferences in Research and Practice in Information Technology Series
Zheng Z, Zhao Y, Zuo Z, Cao L (2010) An Efficient GA-Based Algorithm for Mining Negative Sequential Patterns, Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 262–273
Zhu J, Wang K, Wu Y, Hu Z, Wang H (2016) Mining user-aware rare sequential topic patterns in document streams. IEEE Trans Knowl Data Eng 28(7):1790–1804
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Yadav, P. Hybridized optimization oriented fast negative sequential patterns mining. Multimed Tools Appl 81, 5279–5303 (2022). https://doi.org/10.1007/s11042-021-11773-4
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-021-11773-4