skip to main content
10.1145/3409073.3409089acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmltConference Proceedingsconference-collections
research-article

An LSTM-Based Method for Detection and Classification of Sensor Anomalies

Authors Info & Claims
Published:29 July 2020Publication History

ABSTRACT

Most existing machine learning (ML) based solutions for anomaly detection in sensory data rely on carefully hand-crafted features. This approach has a fundamental limitation since it is often application-specific and requires considerable human effort from domain experts. Deep learning models have been demonstrated to have the ability to abstract relevant high-level features from raw data. Long short-term memory (LSTM) recurrent neural networks have proven effective in complex time-series prediction problems. In this paper, we propose an LSTM-based method for anomaly detection in sensory data. We systematically investigate its effectiveness on raw time-series of real medical sensors measurements and show that it achieves the same level of performance as traditional ML models operating on carefully designed feature vectors. The proposed method achieved micro, macro, and weighted precision, recall, and F1-score of over 0.99.

References

  1. Mosenia, A., Sur-Kolay, S., Raghunathan, A., & Jha, N. K. (2017). Wearable medical sensor-based system design: A survey. IEEE Transactions on Multi-Scale Computing Systems, 3(2), 124--138. J. Clerk Maxwell, A Treatise on Electricity and Magnetism, 3rd ed., vol. 2. Oxford: Clarendon, 1892, pp. 68--73.Google ScholarGoogle ScholarCross RefCross Ref
  2. Luo, R., Misra, M., & Himmelblau, D. M. (1999). Sensor fault detection via multiscale analysis and dynamic PCA. Industrial & Engineering Chemistry Research, 38(4), 1489--1495.Google ScholarGoogle ScholarCross RefCross Ref
  3. Geng, Z., Tang, F., Ding, Y., Li, S., & Wang, X. (2017). Noninvasive continuous glucose monitoring using a multisensor-based glucometer and time series analysis. Scientific reports, 7(1), 12650.Google ScholarGoogle Scholar
  4. Van Der Meulen, M. (2004). On the use of smart sensors, common cause failure and the need for diversity. 6th International Symposium Programmable Electronic Systems in Safety Related Applications.Google ScholarGoogle Scholar
  5. Salehinejad, H., Baarbe, J., Sankar, S., Barfett, J., Colak, E., & Valaee, S. (2017). Recent Advances in Recurrent Neural Networks. arXiv preprint arXiv:1801.01078.Google ScholarGoogle Scholar
  6. Karim, F., Majumdar, S., Darabi, H., & Chen, S. (2018). LSTM fully convolutional networks for time series classification. IEEE Access, 6, 1662--1669.Google ScholarGoogle ScholarCross RefCross Ref
  7. Chandola, V., Banerjee, A., & Kumar, V. (2009). Anomaly detection: A survey. ACM computing surveys (CSUR), 41(3), 15.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Patcha, A., & Park, J. M. (2007). An overview of anomaly detection techniques: Existing solutions and latest technological trends. Computer networks, 51(12), 3448--3470. Elsevier.Google ScholarGoogle Scholar
  9. Yao, Z. G., Cheng, L., & Wang, Q. L. (2012). Sensor fault detection, diagnosis and validation-a survey. Applied Mechanics and Materials, 229, 1265--1271. Trans Tech Publications.Google ScholarGoogle ScholarCross RefCross Ref
  10. Pires, I. M., Garcia, N. M., Pombo, N., Florez-Revuelta, F., & Rodriguez, N. D. (2016). Validation Techniques for Sensor Data in Mobile Health Applications. Journal of Sensors, 2016.Google ScholarGoogle ScholarCross RefCross Ref
  11. Salahat, E., & Qasaimeh, M. (2017). Recent advances in features extraction and description algorithms: A comprehensive survey. 2017 IEEE International Conference on Industrial Technology (ICIT), 1059--1063.Google ScholarGoogle ScholarCross RefCross Ref
  12. Fehst, V., La, H. C., Nghiem, T. D., Mayer, B. E., Englert, P., & Fiebig, K. H. (2018). Automatic vs. manual feature engineering for anomaly detection of drinking-water quality. Proceedings of the Genetic and Evolutionary Computation Conference Companion, 5--6. ACM.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Juvenile Diabetes Research Foundation Continuous Glucose Monitoring Study Group (2008). JDRF randomized clinical trial to assess the efficacy of real-time continuous glucose monitoring in the management of type 1 diabetes: research design and methods. Diabetes Technol Ther. 10(4), 310--321.Google ScholarGoogle ScholarCross RefCross Ref
  14. Reitermanova, Z. (2010). Data splitting. WDS, 10, (31--36).Google ScholarGoogle Scholar
  15. Goutte, C., & Gaussier, E. (2005). A probabilistic interpretation of precision, recall and F-score, with implication for evaluation. European Conference on Information Retrieval, 345--359. Springer, Berlin, Heidelberg.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Probst, P., & Boulesteix, A. L. (2017). To tune or not to tune the number of trees in random forest?. arXiv preprint arXiv:1705.05654.Google ScholarGoogle Scholar
  17. Genuer, R. (2012). Variance reduction in purely random forests. Journal of Nonparametric Statistics, 24(3), 543--562.Google ScholarGoogle ScholarCross RefCross Ref
  18. Raileanu, L. E., & Stoffel, K. (2004). Theoretical comparison between the gini index and information gain criteria. Annals of Mathematics and Artificial Intelligence, 41(1), 77--93.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Rennie, J. D., Shih, L., Teevan, J., & Karger, D. R. (2003). Tackling the poor assumptions of naive Bayes text classifiers. Proceedings of the 20th international conference on machine learning (ICML-03).Google ScholarGoogle Scholar
  20. Ba, J., & Caruana, R. (2014). Do deep nets really need to be deep?. Advances in neural information processing systems, 2654--2662.Google ScholarGoogle Scholar
  21. Reimers, N., & Gurevych, I. (2017). Optimal hyperparameters for deep lstm-networks for sequence labeling tasks. arXiv preprint arXiv:1707.06799.Google ScholarGoogle Scholar
  22. Lipton, Z. C., Kale, D. C., Elkan, C., & Wetzel, R. (2015). Learning to diagnose with LSTM recurrent neural networks. arXiv preprint arXiv:1511.03677.Google ScholarGoogle Scholar
  23. You, Y., Demmel, J., Keutzer, K., Hsieh, C., Ying, C., & Hseu, J. (2018). Large-Batch Training for LSTM and Beyond. arXiv preprint arXiv: 1901.08256.Google ScholarGoogle Scholar
  24. Graves, A., Mohamed, A. R., & Hinton, G. (2013). Speech recognition with deep recurrent neural networks. IEEE international conference on acoustics, speech and signal processing (ICASSP), 6645--6649.Google ScholarGoogle ScholarCross RefCross Ref
  25. Hermans, M., & Schrauwen, B. (2013). Training and analysing deep recurrent neural networks. Advances in neural information processing systems, 190--198.Google ScholarGoogle Scholar
  26. Neishi, M., Sakuma, J., Tohda, S., Ishiwatari, S., Yoshinaga, N., & Toyoda, M. (2017). A bag of useful tricks for practical neural machine translation: Embedding layer initialization and large batch size. Proceedings of the 4th Workshop on Asian Translation (WAT2017), 99--109.Google ScholarGoogle Scholar
  27. Graves, A. (2012). Supervised sequence labelling. In Supervised sequence labelling with recurrent neural networks (pp. 5-13). Springer, Berlin, Heidelberg.Google ScholarGoogle ScholarCross RefCross Ref
  28. Moniz, J. R. A., & Krueger, D. (2018). Nested LSTMs. arXiv preprint arXiv:1801.10308.Google ScholarGoogle Scholar

Index Terms

  1. An LSTM-Based Method for Detection and Classification of Sensor Anomalies

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      ICMLT '20: Proceedings of the 2020 5th International Conference on Machine Learning Technologies
      June 2020
      147 pages
      ISBN:9781450377645
      DOI:10.1145/3409073

      Copyright © 2020 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 29 July 2020

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader