skip to main content
research-article

VM-NSP: Vertical Negative Sequential Pattern Mining with Loose Negative Element Constraints

Published:17 February 2021Publication History
Skip Abstract Section

Abstract

Negative sequential patterns (NSPs) capture more informative and actionable knowledge than classic positive sequential patterns (PSPs) due to the involvement of both occurring and nonoccurring behaviors and events, which can contribute to many relevant applications. However, NSP mining is nontrivial, as it involves fundamental challenges requiring distinct theoretical foundations and is not directly addressable by PSP mining. In the very limited research reported on NSP mining, a negative element constraint (NEC) is incorporated to only consider the NSPs composed of specific forms of elements (containing either positive or negative items), which results in many valuable NSPs being missed. Here, we loosen the NEC (called loose negative element constraint (LNEC)) to include partial negative elements containing both positive and negative items, which enables the discovery of more flexible patterns but incorporates significant new learning challenges, such as representing and mining complete NSPs. Accordingly, we formalize the LNEC-based NSP mining problem and propose a novel vertical NSP mining framework, VM-NSP, to efficiently mine the complete set of NSPs by a vertical representation (VR) of each sequence. An efficient bitmap-based vertical NSP mining algorithm, bM-NSP, introduces a bitmap hash table--based VR and a prefix-based negative sequential candidate generation strategy to optimize the discovery performance. VM-NSP and its implementation bM-NSP form the first VR-based approach for complete NSP mining with LNEC. Theoretical analyses and experiments confirm the performance superiority of bM-NSP on synthetic and real-life datasets w.r.t. diverse data factors, which substantially expands existing NSP mining methods toward flexible NSP discovery.

References

  1. Fahad Anwar, Ilias Petrounias, Tim Morris, and Vassilis Kodogiannis. 2010. Discovery of events with negative behavior against given sequential patterns. In Proceedings of the 2010 5th IEEE International Conferenceon Intelligent Systems. IEEE, Los Alamitos, CA, 373--378.Google ScholarGoogle ScholarCross RefCross Ref
  2. Sujeevan Aseervatham, Aomar Osmani, and Emmanuel Viennet. 2006. bitSPADE: A lattice-based sequential pattern mining algorithm using bitmap representation. In Proceedings of the 6th International Conference on Data Mining. IEEE, Los Alamitos, CA, 792--797.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Jay Ayres, Jason Flannick, Johannes Gehrke, and Tomi Yiu. 2002. Sequential pattern mining using a bitmap representation. In Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, New York, NY, 429--435.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Philippe Besnard and Thomas Guyet. 2020. Semantics of negative sequential patterns. In Proceedings of the 24th European Conference on Artificial Intelligence (ECAI’20), Vol. 325. 1009--1015.Google ScholarGoogle Scholar
  5. Data Science Lab. 2020. Behavior Analytics, Computing and Informatics. Retrieved January 16, 2021 from https://datasciences.org/behavior-informatics/.Google ScholarGoogle Scholar
  6. Julius Borcea, Petter Brändén, and Thomas Liggett. 2009. Negative dependence and the geometry of polynomials. Journal of the American Mathematical Society 22, 2 (2009), 521--567.Google ScholarGoogle ScholarCross RefCross Ref
  7. Longbing Cao. 2013. Combined mining: Analyzing object and pattern relations for discovering and constructing complex yet actionable patterns. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 3, 2 (2013), 140--155.Google ScholarGoogle ScholarCross RefCross Ref
  8. Longbing Cao. 2018. Data Science Thinking: The Next Scientific, Technological and Economic Revolution. Springer International Publishing.Google ScholarGoogle Scholar
  9. Longbing Cao. 2020. Health and medical behavior informatics. In Biomedical Information Technology (2nd ed.). Elsevier, 735--761.Google ScholarGoogle Scholar
  10. Longbing Cao, Xiangjun Dong, and Zhigang Zheng. 2016. e-NSP: Efficient negative sequential pattern mining. Artificial Intelligence 235 (2016), 156--182.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Longbing Cao, Philip S. Yu, and Vipin Kumar. 2015. Nonoccurring behavior analytics: A new area. IEEE Intelligent Systems 30, 6 (2015), 4--11.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Longbing Cao, Yanchang Zhao, and Chengqi Zhang. 2008. Mining impact-targeted activity patterns in imbalanced data. IEEE Transactions on Knowledge and Data Engineering 20, 8 (2008), 1053--1066.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Ding-Ying Chiu, Yi-Hung Wu, and Arbee L. P. Chen. 2004. An efficient algorithm for mining frequent sequences by a new strategy without support counting. In Proceedings of the 2004 20th International Conference on Data Engineering. IEEE, Los Alamitos, CA, 375--386.Google ScholarGoogle Scholar
  14. Data Science Lab. 2020. Coupling and interaction learning. Retrieved January 16, 2021 from https://datasciences.org/coupling-learning/.Google ScholarGoogle Scholar
  15. Xiangjun Dong, Yongshun Gong, and Longbing Cao. 2018. F-NSP+: A fast negative sequential patterns mining method with self-adaptive data storage. Pattern Recognition 84 (2018), 13--27.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Xiangjun Dong, Yongshun Gong, and Longbing Cao. 2020. e-RNSP: An efficient method for mining repetition negative sequential patterns. IEEE Transactions on Cybernetics 50, 5 (2020), 2084--2096.Google ScholarGoogle ScholarCross RefCross Ref
  17. Xiangjun Dong, Yongshun Gong, and Lulin Zhao. 2014. Comparisons of typical algorithms in negative sequential pattern mining. In Proceedings of the 2014 IEEE Workshop on Electronics, Computer, and Applications. IEEE, Los Alamitos, CA, 387--390.Google ScholarGoogle ScholarCross RefCross Ref
  18. Xiangjun Dong, Ping Qiu, Jinhu Lü, Longbing Cao, and Tiantian Xu. 2019. Mining top-k useful negative sequential patterns via learning. IEEE Transactions on Neural Networks and Learning Systems 30, 9 (2019), 2764--2778.Google ScholarGoogle ScholarCross RefCross Ref
  19. Philippe Fournier-Viger, Antonio Gomariz, Manuel Campos, and Rincy Thomas. 2014. Fast vertical mining of sequential patterns using co-occurrence information. In Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining. 40--52.Google ScholarGoogle ScholarCross RefCross Ref
  20. Yongshun Gong, Chuanlu Liu, and Xiangjun Dong. 2015. Research on typical algorithms in negative sequential pattern mining. Open Automation and Control Systems Journal 7 (2015), 934--941.Google ScholarGoogle ScholarCross RefCross Ref
  21. Yongshun Gong, Tiantian Xu, Xiangjun Dong, and Guohua Lv. 2017. e-NSPFI: Efficient mining negative sequential pattern from both frequent and infrequent positive sequential patterns. International Journal of Pattern Recognition and Artificial Intelligence 31, 2 (2017), 1750002.Google ScholarGoogle ScholarCross RefCross Ref
  22. Thomas Guyet and René Quiniou. 2020. NegPSpan: Efficient extraction of negative sequential patterns with embedding constraints. Data Mining and Knowledge Discovery 34, 2 (2020), 563--609.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Marwan Hassani, Daniel Töws, Alfredo Cuzzocrea, and Thomas Seidl. 2019. BFSPMiner: An effective and efficient batch-free algorithm for mining sequential patterns over data streams. International Journal of Data Science and Analytics 8, 3 (2019), 223--239.Google ScholarGoogle ScholarCross RefCross Ref
  24. Sue-Chen Hsueh, Ming-Yen Lin, and Chien-Liang Chen. 2008. Mining negative sequential patterns for e-commerce recommendations. In Proceedings of the 2008 IEEE Asia-Pacific Services Computing Conference (APSCC’08). IEEE, Los Alamitos, CA, 1213--1218.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Jen-Wei Huang, Yong-Bin Wu, and Bijay Prasad Jaysawal. 2020. On mining progressive positive and negative sequential patterns simultaneously. Journal of Information Science and Engineering 36, 1 (2020), 145--169.Google ScholarGoogle Scholar
  26. B. Ozisikyilmaz, J. Pisharath, Y. Liu, R. Narayanan, W.-K. Liao, G. Memik, and A. Choudhary. [n.d.]. NU-MineBench Version 2.0 Data Set and Technical Report. Retrieved January 16, 2021 from http://cucis.ece.northwestern.edu/projects/DMS/MineBenchDownload.html.Google ScholarGoogle Scholar
  27. S. Kamepalli and R. Kurra. 2014. Frequent negative sequential patterns: A survey. International Journal of Computer Engineering and Technology 5, 3 (2014), 15--121.Google ScholarGoogle Scholar
  28. Przemysław Kazienko. 2008. Mining sequential patterns with negative conclusions. In Proceedings of the International Conference on Data Warehousing and Knowledge Discovery. 423--432.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Vinay Kumar Khare and Vedant Rastogi. 2013. Mining positive and negative sequential pattern in incremental transaction databases. International Journal of Computer Applications 71, 1 (2013), 1--5.Google ScholarGoogle Scholar
  30. R. Uday Kiran, Yutaka Watanobe, Bhaskar Chowdary, Koji Zettsu, Masashi Toyoda, and Masaru Kitsuregawa. 2020. Discovering maximal periodic-frequent patterns in very large temporal databases. In Proceedings of the IEEE International Conference on Data Science and Advanced Analytics (DSAA’20). 1--10.Google ScholarGoogle ScholarCross RefCross Ref
  31. Yuefeng Li, Abdulmohsen Algarni, and Ning Zhong. 2010. Mining positive and negative patterns for relevance feature discovery. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, New York, NY, 753--762.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Jefrey Lijffijt, Eirini Spyropoulou, Bo Kang, and Tijl De Bie. 2016. P-N-RMiner: A generic framework for mining interesting structured relational patterns. International Journal of Data Science and Analytics 1, 1 (2016), 61--76.Google ScholarGoogle ScholarCross RefCross Ref
  33. Nancy P Lin, Hung-Jen Chen, Wei-Hua Hao, Hao-En Chueh, and Chung-I. Chang. 2007. Mining negative fuzzy sequential patterns. In Proceedings of the 7th WSEAS International Conference on Simulation, Modelling, and Optimization. 52--57.Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Nancy P. Lin, Wei-Hua Hao, Hung-Jen Chen, Chung-I. Chang, and Hao-En Chueh. 2007. An algorithm for mining strong negative fuzzy sequential patterns. International Journal of Computers 1, 3 (2007), 167--172.Google ScholarGoogle Scholar
  35. Chuanlu Liu, Xiangjun Dong, Caoyuan Li, and Yan Li. 2015. SAPNSP: Select actionable positive and negative sequential patterns based on a contribution metric. In Proceedings of the 2015 12th International Conference on Fuzzy Systems and Knowledge Discovery. IEEE, Los Alamitos, CA, 811--815.Google ScholarGoogle Scholar
  36. MBS Online. 2020. MBS Codes. Retrieved January 16, 2021 from http://www9.health.gov.au/mbs.Google ScholarGoogle Scholar
  37. Carl H. Mooney and John F. Roddick. 2013. Sequential pattern mining—Approaches and algorithms. ACM Computing Surveys 45, 2 (2013), 19.Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Data Science Lab. 2020. Negative Sequence Analysis. Retrieved January 16, 2021 from https://datasciences.org/negative-sequence-analysis/.Google ScholarGoogle Scholar
  39. Weimin Ouyang and Qinhua Huang. 2009. Mining positive and negative fuzzy multiple level sequential patterns in large transaction databases. In Proceedings of the WRI Global Congress on Intelligent Systems (GCIS’09), Vol. 1. IEEE, Los Alamitos, CA, 500--504.Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Weimin Ouyang, Qinhua Huang, and Shuanghu Luo. 2008. Mining positive and negative fuzzy sequential patterns in large transaction databases. In Proceedings of the 5th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD’08), Vol. 5. IEEE, Los Alamitos, CA, 18--23.Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Jian Pei, Jiawei Han, and Wei Wang. 2007. Constraint-based sequential pattern mining: The pattern-growth methods. Journal of Intelligent Information Systems 28, 2 (2007), 133--160.Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Robin Pemantle. 2000. Towards a theory of negative dependence. Journal of Mathematical Physics 41, 3 (2000), 1371--1390.Google ScholarGoogle ScholarCross RefCross Ref
  43. Data Science Lab. 2020. Pattern Relation Analysis. Retrieved January 16, 2021 from https://datasciences.org/pattern-relation-analysis/.Google ScholarGoogle Scholar
  44. Akhil Ralla, P. Krishna Reddy, and Anirban Mondal. 2019. An incremental technique for mining coverage patterns in large databases. In Proceedings of the 2019 IEEE International Conference on Data Science and Advanced Analytics (DSAA’19). 211--220.Google ScholarGoogle ScholarCross RefCross Ref
  45. SPMF. 2020. SPMF BMS2. Retrieved January 16, 2021 from http://www.philippe-fournier-viger.com/spmf/datasets/BMS2.txt.Google ScholarGoogle Scholar
  46. SPMF. 2020. SPMF Dataset. Retrieved January 16, 2021 from http://www.philippe-fournier-viger.com/spmf/datasets/BMS1_spmf.Google ScholarGoogle Scholar
  47. SPMF. 2020. SPMF FIFA. Retrieved January 16, 2021 from http://www.philippe-fournier-viger.com/spmf/datasets/FIFA.txt.Google ScholarGoogle Scholar
  48. Eirini Spyropoulou and Tijl De Bie. 2014. Mining approximate multi-relational patterns. In Proceedings of the 2014 IEEE International Conference on Data Science and Advanced Analytics (DSAA’14). 477--483.Google ScholarGoogle ScholarCross RefCross Ref
  49. Ramakrishnan Srikant and Rakesh Agrawal. 1996. Mining sequential patterns: Generalizations and performance improvements. In Proceedings of the International Conference on Extending Database Technology. 1--17.Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. Shoujin Wang, Longbing Cao, and Yan Wang. 2019. A survey on session-based recommender systems. arXiv:1902.04864Google ScholarGoogle Scholar
  51. Wei Wang. 2020. Nonoccurring Sequential Behavior Analytics. Ph.D. Dissertation. University of Technology, Sydney.Google ScholarGoogle Scholar
  52. Wei Wang and Longbing Cao. 2019. Negative sequence analysis: A review. ACM Computing Surveys 52, 2 (2019), Article 32, 39 pages.Google ScholarGoogle ScholarDigital LibraryDigital Library
  53. Tiantian Xu, Xiangjun Dong, Jianliang Xu, and Yongshun Gong. 2017. E-msNSP: Efficient negative sequential patterns mining based on multiple minimum supports. International Journal of Pattern Recognition and Artificial Intelligence 31, 02 (2017), 1750003.Google ScholarGoogle ScholarCross RefCross Ref
  54. Mohammed J Zaki. 2001. SPADE: An efficient algorithm for mining frequent sequences. Machine Learning 42, 1–2 (2001), 31--60.Google ScholarGoogle ScholarCross RefCross Ref
  55. Huaifeng Zhang, Yanchang Zhao, Longbing Cao, and Chengqi Zhang. 2008. Combined association rule mining. In Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD’08). 1069--1074.Google ScholarGoogle ScholarCross RefCross Ref
  56. Yanchang Zhao, Huaifeng Zhang, Longbing Cao, Chengqi Zhang, and Hans Bohlscheid. 2009. Mining both positive and negative impact-oriented sequential rules from transactional data. In Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining. 656--663.Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. Yanchang Zhao, Huaifeng Zhang, Shanshan Wu, Jian Pei, Longbing Cao, Chengqi Zhang, and Hans Bohlscheid. 2009. Debt detection in social security by sequence classification using both positive and negative patterns. In Proceedings of the Joint European Conference on Machine Learning and Knowledge Discovery in Databases. 648--663.Google ScholarGoogle ScholarCross RefCross Ref
  58. Zhigang Zheng. 2012. Negative Sequential Pattern Mining. Ph.D. Dissertation. University of Technology, Sydney.Google ScholarGoogle Scholar
  59. Zhigang Zheng, Yanchang Zhao, Ziye Zuo, and Longbing Cao. 2009. Negative-GSP: An efficient method for mining negative sequential patterns. In Proceedings of the 8th Australasian Data Mining Conference—Volume 101. 63--67.Google ScholarGoogle ScholarDigital LibraryDigital Library
  60. Zhigang Zheng, Yanchang Zhao, Ziye Zuo, and Longbing Cao. 2010. An efficient GA-based algorithm for mining negative sequential patterns. In Advances in Knowledge Discovery and Data Mining. Springer, 262--273.Google ScholarGoogle Scholar

Index Terms

  1. VM-NSP: Vertical Negative Sequential Pattern Mining with Loose Negative Element Constraints

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM Transactions on Information Systems
        ACM Transactions on Information Systems  Volume 39, Issue 2
        April 2021
        391 pages
        ISSN:1046-8188
        EISSN:1558-2868
        DOI:10.1145/3444752
        Issue’s Table of Contents

        Copyright © 2021 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 17 February 2021
        • Accepted: 1 December 2020
        • Revised: 1 November 2020
        • Received: 1 February 2020
        Published in tois Volume 39, Issue 2

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format .

      View HTML Format