Skip to main content
Log in

Hybridized optimization oriented fast negative sequential patterns mining

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Recently, negative sequential patterns (NSP) (like missing medical treatments) mining is important in data mining research since it includes negative correlations between item sets, which are overlooked by positive sequential pattern mining (PSP) (for instance, utilization of medical service). Yet, discovering the NSP is very complex than finding PSP because of the important problem complexity occurred by high computational cost, non-occurring elements, as well as huge search space in evaluating NSC, and most of the NSP based existing works are inefficient. Therefore, this paper intends to propose a fast NSP mining algorithm for the disease prediction model. This model includes Data normalization, Data separation based on labels, and Pattern recognition phases. In the midst of data separation, the maximum occurring data is optimally selected using a new algorithm that hybridizes the FireFly (FF) algorithm and Grey Wolf Optimization (GWO). This proposed Firefly induced Grey Wolf optimization (F-GWO) algorithm automatically selects the maximum occurring information as per the PSP support. The proposed model is compared over other conventional methods with varied measures. Especially, the computation cost of our model is 46.87%, 6.27%, 9.37%, 2.76%, and 66.62% better than the existing GA, ABC, PSO, FF, and GWO models respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Abbreviations

NSP:

Negative Sequential Patterns.

PSP:

Positive Sequential Patterns.

NSC:

Negative Sequential Candidates.

FF:

FireFly.

GWO:

Grey Wolf Algorithm.

F-GWO:

Firefly Induced Gray Wolf optimization.

FDR:

False Discovery Rate.

FPR:

False positive rate.

FNR:

False negative rate.

NPV:

Negative Predictive Value.

MCC:

Mathews correlation coefficient.

NOB:

Non-Occurring Behaviors.

ITS:

Intelligent Transport Systems.

ST-NSP:

Set Theory-based NSP mining.

HUNSP:

High Utility Negative Sequential Patterns.

HUNSC:

High Utility Negative Sequential Candidates.

GA:

Genetic Algorithm.

e-NSPFI:

Efficient-Negative Sequential Pattern from both Frequent and Infrequent Positive Sequential Patterns.

F-NSP:

Fast NSP Mining Algorithm.

HUSP:

High Utility Sequential Patterns.

ABC:

Artificial Bee colony.

PSO:

Particle Swarm Optimization.

References

  1. Alkan OK, Karagoz P (2015) CRoM and HuspExt: improving efficiency of high utility sequential pattern extraction. IEEE Trans Knowl Data Eng 27(10):2645–2657

    Article  Google Scholar 

  2. Bansal M, Kumar M, Kumar M (2021) 2D object recognition: a comparative analysis of SIFT, SURF and ORB feature descriptors. Multimed Tools Appl 80(12):18839–18857

    Article  Google Scholar 

  3. Bojja GR, Ofori M, Liu J, Loknath Sai Ambati (2020) Early public outlook on the coronavirus disease (COVID-19): A social media study

  4. Buddhakulsomsiri J, Zakarian A (2009) Sequential pattern mining algorithm for automotive warranty data. Comput Ind Eng 57(1):137–147

    Article  Google Scholar 

  5. Cai G, Hio C, Bermingham L, Lee K, Lee I (2014) Sequential pattern mining of geo-tagged photos with an arbitrary regions-of-interest detection method. Expert Syst Appl 41(7):3514–3526

    Article  Google Scholar 

  6. Cao L, Dong X, Zheng Z (2016) E-NSP: efficient negative sequential pattern mining. Artif Intell 235:156–182

    Article  MathSciNet  Google Scholar 

  7. Cao L, Dong X, Zheng Z (2016) E-NSP: efficient negative sequential pattern mining. Artif Intell 235:156–182

    Article  MathSciNet  Google Scholar 

  8. Cao L, Dong X, Zheng Z (2016) E-NSP: efficient negative sequential pattern mining. Artif Intell 235:156–182

    Article  MathSciNet  Google Scholar 

  9. Chen Y, Peng W, Lee S (2015) Mining temporal patterns in time interval-based data. IEEE Trans Knowl Data Eng 27(12):3318–3331

    Article  Google Scholar 

  10. Cheng Y, Lin Y, Chiang K, Tseng VS (2017) Mining sequential risk patterns from large-scale clinical databases for early assessment of chronic diseases: a case study on chronic obstructive pulmonary disease. IEEE Journal of Biomedical and Health Informatics 21(2):303–311

    Google Scholar 

  11. Chhabra P, Garg NK, Kumar M (2020) Content-based image retrieval system using ORB and SIFT features. Neural Comput & Applic 32(7):2725–2733

    Article  Google Scholar 

  12. Chithra S, Kumari RM (2018) Economic emission dispatch in renewable energy systems using FireFly algorithm. Journal of Computational Mechanics, Power System and Control 1(1):18–25

    Google Scholar 

  13. Chung-Ching Y, Chen Y-L (2005) Mining sequential patterns from multidimensional sequence data. IEEE Trans Knowl Data Eng 17(1):136–140

    Article  Google Scholar 

  14. Cui Y, Yang L, Zhao Z, Tang T, Yin M (2013) Sequential grouping heuristic for the two-dimensional cutting stock problem with pattern reduction. Int J Prod Econ 144(2):432–439

    Article  Google Scholar 

  15. Dong X, Zheng Z, Cao L, Zhao Y, Zhang C, Li J, Wei W, Ou Y (2011) e-NSP: efficient negative sequential pattern mining based on identified positive patterns without database rescanning. In Proceedings of the 20th ACM international conference on Information and knowledge management, pp. 825–830

  16. Dong X, Gong Y, Cao L (2018) F-NSP: a fast negative sequential patterns mining method with self-adaptive data storage. Pattern Recogn 84:13–27

    Article  Google Scholar 

  17. Fournier-Viger P, Lin JC-W, Kiran RU, Koh YS, Thomas R (2017) A survey of sequential pattern mining. Data Science and Pattern Recognition 1(1):54–77

    Google Scholar 

  18. Gandomi AH, Yang X-S, Talatahari S, Alavi AH (2013) Firefly algorithm with chaos. Commun Nonlinear Sci Numer Simulat 18:89–98

    Article  MathSciNet  Google Scholar 

  19. Gong YS, Liu C, Dong X (2015) Research on typical algorithms in negative sequential pattern mining. Open Automation and Control Systems Journal 7:934–941

    Article  Google Scholar 

  20. Gong Y, Xu T, Dong X et al (2017) E-NSPFI: efficient mining negative sequential pattern from both 690 frequent and infrequent positive sequential patterns. Int J Pattern Recognit Artif Intell 31(2)

  21. Hsueh S-C, Lin M-Y, Chen C-L (2008) Mining negative sequential patterns for e-commerce recommendations. In 2008 IEEE Asia-Pacific Services Computing Conference, pp. 1213–1218. IEEE

  22. Huang J, Tseng C, Ou J, Chen M (2008) A general model for sequential pattern mining with a progressive database. IEEE Trans Knowl Data Eng 20(9):1153–1167

    Article  Google Scholar 

  23. Huynh B, Vo B, Snasel V (2017) An efficient parallel method for mining frequent closed sequential patterns. IEEE Access 5:17392–17402

    Article  Google Scholar 

  24. Kaneiwaa K, Kudo Y (2011) A sequential pattern mining algorithm using rough set theory. Int J Approx Reason 52(6):881–893

    Article  Google Scholar 

  25. Karaboga D, Basturk B (2008) On the performance of artificial bee colony (ABC) algorithm. Appl Soft Comput 8(1):687–697

    Article  Google Scholar 

  26. Khare VK, Rastogi V (2013) Mining positive and negative sequential pattern in incremental transaction databases. International Journal of Computer Applications (0975–8887) 71(1):18–22

    Article  Google Scholar 

  27. Kim C, Lim J-H, Ng RT, Shim K (2007) SQUIRE: sequential pattern mining with quantities. J Syst Softw 80(10):1726–1745

    Article  Google Scholar 

  28. Kumar M, Kumar M (2021) XGBoost: 2D-Object Recognition Using Shape Descriptors and Extreme Gradient Boosting Classifier. In Computational Methods and Data Engineering, pp. 207–222. Springer, Singapore

  29. Kumar M, Chhabra P, Garg NK (2018) An efficient content based image retrieval system using BayesNet and K-NN. Multimed Tools Appl 77(16):21557–21570

    Article  Google Scholar 

  30. Kumar A, Kaur A, Kumar M (2019) Face detection techniques: a review. Artif Intell Rev 52(2):927–948

    Article  Google Scholar 

  31. Lee NCJ, Kelly JR, Park HS, An Y, Judson BL, Burtness BA, Husain ZA (2018) Patterns of failure in high-metastatic node number human papillomavirus-positive oropharyngeal carcinoma. Oral Oncol 85:35–39

    Article  Google Scholar 

  32. McCall J (2005) Genetic algorithms for modelling and optimisation. J Comput Appl Math 184(1):205–222

    Article  MathSciNet  Google Scholar 

  33. Min F, Zhang Z-H, Zhai W-J, Shen R-P (2018) Frequent pattern discovery with tri-partition alphabets", Inform,ation science, available online

  34. Mirjalili S (2014) Seyed Mohammad Mirjalili and Andrew Lewis, " Grey wolf optimizer". Adv Eng Softw 69:46–61

    Article  Google Scholar 

  35. Murugan TS, Sarkar A (2018) Optimal cluster head selection by hybridisation of firefly and grey wolf optimisation. Int J Wirel Mob Comput 14(3):296–305

    Article  Google Scholar 

  36. Pedersen MEH, Chipperfield AJ (2010) Simplifying particle swarm optimization. Appl Soft Comput 10(2):618–628

    Article  Google Scholar 

  37. Pei J et al (2004) Mining sequential patterns by pattern-growth: the PrefixSpan approach. IEEE Trans Knowl Data Eng 16(11):1424–1440

    Article  Google Scholar 

  38. Rastogi V, Khare VK (2012) Apriori based: mining positive and negative frequent sequential patterns. International Journal of Latest Trends in Engineering and Technology (IJLTET) 1(3):24–33

    Google Scholar 

  39. Roy RG, Ghoshal D (2020) Grey wolf optimization-based second order sliding mode control for inchworm robot. Robotica 38(9):1539–1557

    Article  Google Scholar 

  40. Singh S, Ahuja U, Kumar M, Kumar K, Sachdeva M (2021) Face mask detection using YOLOv3 and faster R-CNN models: COVID-19 environment. Multimed Tools Appl 80(13):19753–19768

    Article  Google Scholar 

  41. Vinolin V (2019) Breast Cancer detection by optimal classification using GWO algorithm. Multimedia Research 2(2):10–18

    Google Scholar 

  42. Wagh MB, Gomathi N (2019) Improved GWO-CS algorithm-based optimal routing strategy in VANET. Journal of Networking and Communication Systems 2(1):34–42

    Google Scholar 

  43. Xu T, Li T, Dong X (2018) Efficient high utility negative sequential patterns Mining in Smart Campus. IEEE Access 6:23839–23847

    Article  Google Scholar 

  44. Xu T, Li T, Dong X (2018) Efficient high utility negative sequential patterns Mining in Smart Campus. IEEE Access 6:23839–23847

    Article  Google Scholar 

  45. Zhang J, Wang Y, Zhang C, Shi Y (2016) Mining contiguous sequential generators in biological sequences. IEEE/ACM Transactions on Computational Biology and Bioinformatics 13(5):855–867

    Article  Google Scholar 

  46. Zheng Z, Zhao Y, Zuo Z, Cao L (2009) Negative-GSP: An efficient method for mining negative sequential patterns. In Conferences in Research and Practice in Information Technology Series

  47. Zheng Z, Zhao Y, Zuo Z, Cao L (2010) An Efficient GA-Based Algorithm for Mining Negative Sequential Patterns, Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 262–273

  48. Zhu J, Wang K, Wu Y, Hu Z, Wang H (2016) Mining user-aware rare sequential topic patterns in document streams. IEEE Trans Knowl Data Eng 28(7):1790–1804

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Poonam Yadav.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yadav, P. Hybridized optimization oriented fast negative sequential patterns mining. Multimed Tools Appl 81, 5279–5303 (2022). https://doi.org/10.1007/s11042-021-11773-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-021-11773-4

Keywords

Navigation