research-article

Open access

Designing Shapelets for Interpretable Data-Agnostic Classification

Authors:

Riccardo Guidotti,

Anna MonrealeAuthors Info & Claims

AIES '21: Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society

Pages 532 - 542

https://doi.org/10.1145/3461702.3462553

Published: 30 July 2021 Publication History

Abstract

Time series shapelets are discriminatory subsequences which are representative of a class, and their similarity to a time series can be used for successfully tackling the time series classification problem. The literature shows that Artificial Intelligence (AI) systems adopting classification models based on time series shapelets can be interpretable, more accurate, and significantly fast. Thus, in order to design a data-agnostic and interpretable classification approach, in this paper we first extend the notion of shapelets to different types of data, i.e., images, tabular and textual data. Then, based on this extended notion of shapelets we propose an interpretable data-agnostic classification method. Since the shapelets discovery can be time consuming, especially for data types more complex than time series, we exploit a notion of prototypes for finding candidate shapelets, and reducing both the time required to find a solution and the variance of shapelets. A wide experimentation on datasets of different types shows that the data-agnostic prototype-based shapelets returned by the proposed method empower an interpretable classification which is also fast, accurate, and stable. In addition, we show and we prove that shapelets can be at the basis of explainable AI methods.

References

[1]

Agrawal, R.; Srikant, R.; et al. 1994. Fast algorithms for mining association rules. In Proc. 20th int. conf. very large data bases, VLDB, volume 1215, 487--499.

Digital Library

[2]

Bailey, T. L.; Elkan, C.; et al. 1994. Fitting a mixture model by expectation maximization to discover motifs in bipolymers.

[3]

Bengio, Y.; Ducharme, R.; Vincent, P.; and Jauvin, C. 2003. A neural probabilistic language model. Journal of machine learning research 3(Feb): 1137--1155.

Digital Library

[4]

Bertsimas, D.; and Dunn, J. 2017. Optimal classification trees. Machine Learning 106(7): 1039--1082.

Digital Library

[5]

Craven, M.; and Shavlik, J. W. 1996. Extracting tree-structured representations of trained networks. In Advances in neural information processing systems, 24--30.

[6]

Demvs ar, J. 2006. Statistical comparisons of classifiers over multiple data sets. Journal of Machine learning research 7(Jan): 1--30.

[7]

Deng, H.; Chen, W.; Ma, A. J.; Shen, Q.; Yuen, P. C.; and Feng, G. 2018. Robust Shapelets Learning: Transform-Invariant Prototypes. In et al., J. L., ed., Pattern Recognition and Computer Vision - First Chinese Conference, PRCV 2018, Proc., Part III, volume 11258 of LNCS, 491--502. Springer.

[8]

Doshi-Velez, F.; and Kim, B. 2017. Towards a rigorous science of interpretable machine learning. arXiv preprint arXiv:1702.08608 .

[9]

Fawaz, H. I.; et al. 2019. Deep learning for time series classification: a review. DAMI 33(4): 917--963.

[10]

Gordon, D.; Hendler, D.; and Rokach, L. 2012. Fast Randomized Model Generation for Shapelet-Based Time Series Classification. CoRR abs/1209.5038. prefixhttp://arxiv.org/abs/1209.5038.

[11]

Grabocka, J.; Schilling, N.; Wistuba, M.; and Schmidt-Thieme, L. 2014. Learning time-series shapelets. In SIGKDD, 392--401. ACM.

[12]

Grabocka, J.; Wistuba, M.; and Schmidt-Thieme, L. 2015. Scalable Discovery of Time-Series Shapelets. CoRR abs/1503.03238. prefixhttp://arxiv.org/abs/1503.03238.

[13]

Guidotti, R.; Monreale, A.; Giannotti, F.; Pedreschi, D.; Ruggieri, S.; and Turini, F. 2019 a. Factual and Counterfactual Explanations for Black Box Decision Making. IEEE Intelligent Systems .

[14]

Guidotti, R.; Monreale, A.; Ruggieri, S.; et al. 2019 b. A survey of methods for explaining black box models. CSUR 51(5): 93.

Digital Library

[15]

Guidotti, R.; and Ruggieri, S. 2019. On The Stability of Interpretable Models. In IJCNN, 1--8. IEEE.

[16]

Hills, J.; Lines, J.; Baranauskas, E.; Mapp, J.; and Bagnall, A. 2014. Classification of time series by shapelet transformation. DAMI 28(4): 851--881.

[17]

Ji, C.; Zhao, C.; Liu, S.; Yang, C.; Pan, L.; Wu, L.; and Meng, X. 2019. A fast shapelet selection algorithm for time series classification. Comput. Networks 148: 231--240.

[18]

Karlsson, I.; Papapetrou, P.; and Boströ m, H. 2016. Generalized random shapelet forests. Data Min. Knowl. Discov. 30(5): 1053--1085.

Digital Library

[19]

Lin, J.; Keogh, E.; Wei, L.; and Lonardi, S. 2007. Experiencing SAX: a novel symbolic representation of time series. Data Mining and knowledge discovery 15(2): 107--144.

[20]

Lines, J.; Davis, L. M.; Hills, J.; and Bagnall, A. 2012. A shapelet transform for time series classification. In SIGKDD, 289--297.

[21]

Manning, C. D.; Raghavan, P.; and Schütze, H. 2008. Introduction to information retrieval. Cambridge university press.

[22]

Melis, D. A.; and Jaakkola, T. 2018. Towards robust interpretability with self-explaining neural networks. In NIPS, 7786--7795.

[23]

Miller, T. 2019. Explanation in artificial intelligence: Insights from the social sciences. Artificial Intelligence 267: 1--38.

[24]

Mueen, A.; Keogh, E.; and Young, N. 2011. Logical-shapelets: an expressive primitive for time series classification. In SIGKDD, 1154--1162. ACM.

[25]

Muir, B. M. 1987. Trust between humans and machines, and the design of decision aids. International journal of man-machine studies 27(5--6): 527--539.

[26]

Murthy, S. K.; Kasif, S.; and Salzberg, S. 1994. A system for induction of oblique decision trees. Journal of artificial intelligence research 2: 1--32.

[27]

Pasquale, F. 2015. The black box society. Harvard University Press.

[28]

Quinlan, J. R. 1993. C4. 5: Programs for Machine Learning. Elsevier.

Digital Library

[29]

Rakthanmanon, T.; and Keogh, E. 2013. Fast shapelets: A scalable algorithm for discovering time series shapelets. In ICDM, 668--676. SIAM.

[30]

Refregier, A. 2003. Shapelets: I. A Method for Image Analysis. Monthly Notices of the Royal Astronomical Society 338.

[31]

Renard, X.; Rifqi, M.; Erray, W.; and Detyniecki, M. 2015. Random-shapelet: An algorithm for fast shapelet discovery. In 2015 IEEE International Conference on Data Science and Advanced Analytics, DSAA 2015, Campus des Cordeliers, Paris, France, October 19--21, 2015, 1--10. IEEE.

[32]

Ribeiro, M. T.; et al. 2016. Why should i trust you?: Explaining the predictions of any classifier. In SIGKDD, 1135--1144. ACM.

[33]

Ruth M.J., B. 2019. Counterfactuals in Explainable Artificial Intelligence (XAI): Evidence from Human Reasoning. IJCAI 6276--6282.

[34]

Sabzmeydani, P.; and Mori, G. 2007. Detecting Pedestrians by Learning Shapelet Features. In 2007 IEEE Conference on Computer Vision and Pattern Recognition, 1--8.

[35]

Tan, P.-N.; Steinbach, M.; and Kumar, V. 2016. Introduction to data mining. Pearson Education India.

[36]

Wang, Y.; Emonet, R.; Fromont, É .; Malinowski, S.; Menager, E.; Mosser, L.; and Tavenard, R. 2019. Learning Interpretable Shapelets for Time Series Classification through Adversarial Regularization. CoRR abs/1906.00917. prefixhttp://arxiv.org/abs/1906.00917.

[37]

Wistuba, M.; Grabocka, J.; and Schmidt-Thieme, L. 2015. Ultra-Fast Shapelets for Time Series Classification. CoRR abs/1503.05018. prefixhttp://arxiv.org/abs/1503.05018.

[38]

Yao, W.; and Deng, Z. 2012. A robust pedestrian detection approach based on shapelet feature and Haar detector ensembles. Tsinghua Science and Technology 17(1): 40--50.

[39]

Ye, L.; and Keogh, E. 2009. Time series shapelets: a new primitive for data mining. In SIGKDD, 947--956. ACM.

[40]

Ye, L.; et al. 2011. Time series shapelets: a novel technique that allows accurate, interpretable and fast classification. DAMI 22(1--2): 149--182.

Cited By

Tang JKang QZhou MYin HYao S(2024)MemeNet: Toward a Reliable Local Projection for Image Recognition via Semantic FeaturizationIEEE Transactions on Image Processing10.1109/TIP.2024.335933133(1670-1682)Online publication date: 1-Jan-2024
https://dl.acm.org/doi/10.1109/TIP.2024.3359331
Guidotti R(2022)Exploiting auto-encoders for explaining black-box classifiersIntelligenza Artificiale10.3233/IA-22013916:1(115-129)Online publication date: 8-Jul-2022
https://doi.org/10.3233/IA-220139
Theissler ASpinnato FSchlegel UGuidotti R(2022)Explainable AI for Time Series Classification: A Review, Taxonomy and Research DirectionsIEEE Access10.1109/ACCESS.2022.320776510(100700-100724)Online publication date: 2022
https://doi.org/10.1109/ACCESS.2022.3207765

Index Terms

Designing Shapelets for Interpretable Data-Agnostic Classification
1. Computing methodologies
  1. Artificial intelligence
    1. Knowledge representation and reasoning
  2. Machine learning
2. Information systems
  1. Information systems applications
    1. Data mining
      1. Nearest-neighbor search
    2. Decision support systems

Recommendations

Logical-shapelets: an expressive primitive for time series classification
KDD '11: Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining

Time series shapelets are small, local patterns in a time series that are highly predictive of a class and are thus very useful features for building classifiers and for certain visualization and summarization tasks. While shapelets were introduced only ...
TERM: Tree Ensemble Models for Interpretable Rule Mining
Web Information Systems Engineering – WISE 2024
Abstract
Ensemble learning, particularly tree-based ensemble techniques, is acknowledged as the advanced approach to solving a wide range of challenging issues because of its exceptional performance in multiple machine learning applications. Nonetheless, ...
Comprehensible Artificial Intelligence on Knowledge Graphs: A survey
Abstract
Artificial Intelligence applications gradually move outside the safe walls of research labs and invade our daily lives. This is also true for Machine Learning methods on Knowledge Graphs, which has led to a steady increase in their application ...
Graphical abstract

Display Omitted
Highlights
- A history of Comprehensible Artificial Intelligence (CAI) on Knowledge Graphs (KGs).
- A novel taxonomy for CAI on KGs and several lines of research within it.
- Potential for future research on CAI on KGs.

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

AIES '21: Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society

July 2021

1077 pages

ISBN:9781450384735

DOI:10.1145/3461702

Program Chairs:
Marion Fourcade
University of California Berkeley, USA
,
Benjamin Kuipers
University of Michigan, USA
,
Seth Lazar
Australian National University, Australia
,
Deirdre Mulligan
University of California Berkeley, USA

Copyright © 2021 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 30 July 2021

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

University of Pisa
European Commission

Conference

AIES '21

Sponsor:

SIGAI

AIES '21: AAAI/ACM Conference on AI, Ethics, and Society

May 19 - 21, 2021

Virtual Event, USA

Acceptance Rates

Overall Acceptance Rate 61 of 162 submissions, 38%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

3
Total Citations
View Citations
709
Total Downloads

Downloads (Last 12 months)174
Downloads (Last 6 weeks)18

Reflects downloads up to 16 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Tang JKang QZhou MYin HYao S(2024)MemeNet: Toward a Reliable Local Projection for Image Recognition via Semantic FeaturizationIEEE Transactions on Image Processing10.1109/TIP.2024.335933133(1670-1682)Online publication date: 1-Jan-2024
https://dl.acm.org/doi/10.1109/TIP.2024.3359331
Guidotti R(2022)Exploiting auto-encoders for explaining black-box classifiersIntelligenza Artificiale10.3233/IA-22013916:1(115-129)Online publication date: 8-Jul-2022
https://doi.org/10.3233/IA-220139
Theissler ASpinnato FSchlegel UGuidotti R(2022)Explainable AI for Time Series Classification: A Review, Taxonomy and Research DirectionsIEEE Access10.1109/ACCESS.2022.320776510(100700-100724)Online publication date: 2022
https://doi.org/10.1109/ACCESS.2022.3207765

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Figures

Tables

Media

View Table of Conten