skip to main content
10.1145/3578245.3585036acmconferencesArticle/Chapter ViewAbstractPublication PagesicpeConference Proceedingsconference-collections
research-article

Towards Novel Statistical Methods for Anomaly Detection in Industrial Processes

Published:15 April 2023Publication History

ABSTRACT

This paper presents a novel methodology based on first principles of statistics and statistical learning for anomaly detection in industrial processes and IoT environments. We present a 5-level analytical pipeline that cleans, smooths, and eliminates redundancies from the data, and identifies outliers as well as the features that contribute most to these anomalies. We show how smoothing can make our methodology less sensitive to short-lived anomalies that might be, e.g., due to sensor noise. We validate the methodology on a dataset freely available in the literature. Our results show that we can identify all anomalies in the considered dataset, with the ability of controlling the amount of false positives. This work is the result of a research project co-funded by the Tuscany Region and a company leader in the paper and nonwovens sector. Although the methodology was developed for this domain, we consider here a dataset from a different industrial sector. This shows that our methodology can be generalized to other contexts with similar constraints on limited resources, interpretability, time, and budget.

References

  1. Alsmeyer, G. Chebyshev's Inequality. Springer Berlin Heidelberg, Berlin, Heidelberg, 2011, pp. 239--240.Google ScholarGoogle Scholar
  2. Blázqez-García, A., Conde, A., Mori, U., and Lozano, J. A. A review on outlier/anomaly detection in time series data. ACM Computing Surveys (CSUR) 54, 3 (2021), 1--33.Google ScholarGoogle Scholar
  3. Cabana, E., Lillo, R. E., and Laniado, H. Multivariate outlier detection based on a robust mahalanobis distance with shrinkage estimators. Statistical Papers 62, 4 (nov 2019), 1583--1609.Google ScholarGoogle ScholarCross RefCross Ref
  4. Campbell, N. A. Robust procedures in multivariate analysis i: Robust covariance estimation. Journal of the Royal Statistical Society. Series C (Applied Statistics) 29, 3 (1980), 231--237.Google ScholarGoogle Scholar
  5. Choi, K., Yi, J., Park, C., and Yoon, S. Deep learning for anomaly detection in time-series data: review, analysis, and guidelines. IEEE Access (2021).Google ScholarGoogle Scholar
  6. Craney, T. A., and Surles, J. G. Model-dependent variance inflation factor cutoff values. Quality Engineering 14, 3 (2002), 391--403.Google ScholarGoogle ScholarCross RefCross Ref
  7. Garthwaite, P., and Koch, I. Evaluating the contributions of individual variables to a quadratic form. Australian & New Zealand Journal of Statistics 58 (03 2016).Google ScholarGoogle ScholarCross RefCross Ref
  8. Hubert, M., and Van der Veeken, S. Outlier detection for skewed data. Journal of Chemometrics 22, 3--4 (2008), 235--246.Google ScholarGoogle ScholarCross RefCross Ref
  9. James, G., Witten, D., Hastie, T., and Tibshirani, R. An introduction to statistical learning: with applications in r.Google ScholarGoogle Scholar
  10. Kamoi, R., and Kobayashi, K. Why is the mahalanobis distance effective for anomaly detection?, 2020.Google ScholarGoogle Scholar
  11. Mahalanobis, P. C. On the generalized distance in statistics. National Institute of Science of India.Google ScholarGoogle Scholar
  12. Maronna, R. A., Martin, R. D., Yohai, V. J., and Salibián-Barrera, M. Robust statistics: theory and methods (with R). John Wiley & Sons, 2019.Google ScholarGoogle Scholar
  13. Rousseeuw, P. J., and Leroy, A. M. Robust regression and outlier detection. John Wiley & sons, 2005.Google ScholarGoogle Scholar
  14. Rousseeuw, P. J., and van Zomeren, B. C. Unmasking multivariate outliers and leverage points. Journal of the American Statistical Association 85, 411 (1990), 633--639.Google ScholarGoogle Scholar
  15. Su, Y., Zhao, Y., Niu, C., Liu, R., Sun, W., and Pei, D. Robust anomaly detection for multivariate time series through stochastic recurrent neural network. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & amp; Data Mining (New York, NY, USA, 2019), KDD '19, Association for Computing Machinery, p. 2828--2837.Google ScholarGoogle Scholar
  16. Tao, L., Liu, H., Zhang, J., Su, X., Li, S., Hao, J., Lu, C., Suo, M., and Wang, C. Associated Fault Diagnosis of Power Supply Systems Based on Graph Matching: A Knowledge and Data Fusion Approach. Mathematics 10, 22 (November 2022), 1--28.Google ScholarGoogle Scholar
  17. Tiku, M. L., Islam, M. Q., and Qumsiyeh, S. B. Mahalanobis distance under non-normality. Statistics 44, 3 (2010), 275--290.Google ScholarGoogle ScholarCross RefCross Ref
  18. Todeschini, R., Ballabio, D., Consonni, V., Sahigara, F., and Filzmoser, P. Locally centred mahalanobis distance: A new distance measure with salient features towards outlier detection. Analytica Chimica Acta 787 (2013), 1--9.Google ScholarGoogle ScholarCross RefCross Ref
  19. Wölfel, M., and Ekenel, H. K. Feature weighted mahalanobis distance: Improved robustness for gaussian classifiers. In 2005 13th European Signal Processing Conference (2005), pp. 1--4.Google ScholarGoogle Scholar

Index Terms

  1. Towards Novel Statistical Methods for Anomaly Detection in Industrial Processes

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      ICPE '23 Companion: Companion of the 2023 ACM/SPEC International Conference on Performance Engineering
      April 2023
      421 pages
      ISBN:9798400700729
      DOI:10.1145/3578245

      Copyright © 2023 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 15 April 2023

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate252of851submissions,30%

      Upcoming Conference

    • Article Metrics

      • Downloads (Last 12 months)60
      • Downloads (Last 6 weeks)5

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader