skip to main content
10.1145/3511808.3557641acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
short-paper

MetaRule: A Meta-path Guided Ensemble Rule Set Learning for Explainable Fraud Detection

Published:17 October 2022Publication History

ABSTRACT

Machine learning methods for fraud detection have achieved impressive prediction performance, but often sacrifice critical interpretability in many applications. In this work, we propose to learn interpretable models for fraud detection as a simple rule set. More specifically, we design a novel neural rule learning method by building a condition graph with an expectation to capture the high-order feature interactions. Each path in this condition graph can be regarded as a single rule. Inspired by the key idea of meta learning, we combine the neural rules with rules extracted from the tree-based models in order to provide generalizable rule candidates. Finally, we propose a flexible rule set learning framework by designing a greedy optimization method towards maximizing the recall number of fraud samples with a predefined criterion as the cost. We conduct comprehensive experiments on large-scale industrial datasets. Interestingly, we find that the neural rules and rules extracted from tree-based models can be complementary to each other to improve the prediction performance.

References

  1. Leo Breiman. 2001. Random forests. Machine learning, Vol. 45, 1 (2001), 5--32.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Tianqi Chen and Carlos Guestrin. 2016. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. 785--794.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Peter Clark and Tim Niblett. 1989. The CN2 induction algorithm. Machine learning, Vol. 3, 4 (1989), 261--283.Google ScholarGoogle Scholar
  4. William W Cohen. 1995. Fast effective rule induction. In Machine learning proceedings 1995. Elsevier, 115--123.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Sanjeeb Dash, Oktay Gunluk, and Dennis Wei. 2018. Boolean decision rules via column generation. Advances in neural information processing systems, Vol. 31 (2018).Google ScholarGoogle Scholar
  6. Waleed Hilal, S Andrew Gadsden, and John Yawney. 2021. A Review of Anomaly Detection Techniques and Applications in Financial Fraud. Expert Systems with Applications (2021), 116429.Google ScholarGoogle Scholar
  7. Stephen H. Bach Himabindu Lakkaraju and Jure Leskovec. 2016. Interpretable decision sets: A joint framework for description and prediction. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1675--1684.Google ScholarGoogle Scholar
  8. Guolin Ke, Qi Meng, Thomas Finley, Taifeng Wang, Wei Chen, Weidong Ma, Qiwei Ye, and Tie-Yan Liu. 2017. Lightgbm: A highly efficient gradient boosting decision tree. Advances in neural information processing systems, Vol. 30 (2017).Google ScholarGoogle Scholar
  9. Himabindu Lakkaraju, Stephen H Bach, and Jure Leskovec. 2016. Interpretable decision sets: A joint framework for description and prediction. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. 1675--1684.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Wenmin Li, Jiawei Han, and Jian Pei. 2001. CMAR: Accurate and efficient classification based on multiple class-association rules. In Proceedings 2001 IEEE international conference on data mining. IEEE, 369--376.Google ScholarGoogle Scholar
  11. Litao Qiao. 2020. Learning Accurate and Interpretable Decision Rule Sets from Neural Networks. In 35th AAAI Conference on Artificial Intelligence.Google ScholarGoogle Scholar
  12. J Ross Quinlan and R Mike Cameron-Jones. 1993. FOIL: A midterm report. In European conference on machine learning. 1--20.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Juan Ramos. 2003. Using tf-idf to determine word relevance in document queries. In Proceedings of the first instructional conference on machine learning, Vol. 242. 29--48.Google ScholarGoogle Scholar
  14. Juan Ramos et al. 2018. Graph attention networks. In Proceedings of the 6th International Conference on Learning Representations.Google ScholarGoogle Scholar
  15. Cynthia Rudin. 2019. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nature Machine Intelligence, Vol. 1, 5 (2019), 206--215.Google ScholarGoogle ScholarCross RefCross Ref
  16. Cynthia Rudin, Chaofan Chen, Zhi Chen, Haiyang Huang, Lesia Semenova, and Chudi Zhong. 2022. Interpretable machine learning: Fundamental principles and 10 grand challenges. Statistics Surveys, Vol. 16 (2022), 1--85.Google ScholarGoogle ScholarCross RefCross Ref
  17. Yizhou Sun, Jiawei Han, Xifeng Yan, Philip S Yu, and Tianyi Wu. 2011. Pathsim: Meta path-based top-k similarity search in heterogeneous information networks. Proceedings of the VLDB Endowment, Vol. 4, 11 (2011), 992--1003.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Finale Doshi-Velez Yimin Liu Erica Klampfl Tong Wang, Cynthia Rudin and Perry MacNeille. 2017. A bayesian framework for learning rule sets for interpretable classification. The Journal of Machine Learning Research, Vol. 18, 1 (2017), 2357--2393.Google ScholarGoogle Scholar
  19. Tong Wang, Cynthia Rudin, Finale Doshi-Velez, Yimin Liu, Erica Klampfl, and Perry MacNeille. 2017. A bayesian framework for learning rule sets for interpretable classification. The Journal of Machine Learning Research, Vol. 18, 1 (2017), 2357--2393.Google ScholarGoogle ScholarCross RefCross Ref
  20. Zhuo Wang, Wei Zhang, Ning Liu, and Jianyong Wang. 2021. Scalable Rule-Based Representation Learning for Interpretable Classification. In Advances in Neural Information Processing Systems.Google ScholarGoogle Scholar
  21. Fan Yang, Kai He, Linxiao Yang, Hongxia Du, Jingbang Yang, Bo Yang, and Liang Sun. 2021. Learning Interpretable Decision Rule Sets: A Submodular Optimization Approach. Advances in Neural Information Processing Systems, Vol. 34 (2021).Google ScholarGoogle Scholar
  22. Xiaoxin Yin and Jiawei Han. 2003. CPAR: Classification based on predictive association rules. In Proceedings of the 2003 SIAM international conference on data mining. SIAM, 331--335.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. MetaRule: A Meta-path Guided Ensemble Rule Set Learning for Explainable Fraud Detection

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge Management
      October 2022
      5274 pages
      ISBN:9781450392365
      DOI:10.1145/3511808
      • General Chairs:
      • Mohammad Al Hasan,
      • Li Xiong

      Copyright © 2022 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 17 October 2022

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • short-paper

      Acceptance Rates

      CIKM '22 Paper Acceptance Rate621of2,257submissions,28%Overall Acceptance Rate1,861of8,427submissions,22%

      Upcoming Conference

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader