Skip to main content

Active Anomaly Detection Technology Based on Ensemble Learning

  • Conference paper
  • First Online:
Data Science (ICPCSEE 2022)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1628))

  • 619 Accesses

Abstract

Anomaly detection is an important problem in various research and application fields. Researchers design reliable schemes to provide solutions for effectively detecting anomaly points. Most of the existing anomaly detection schemes are unsupervised methods, such as anomaly detection methods based on density, distance and clustering. In total, unsupervised anomaly detection methods have many limitations. For example, they cannot be well combined with prior knowledge in some anomaly detection tasks. For some nonlinear anomaly detection tasks, the modeling is complex and faces dimensional disasters, which are greatly affected by noise. Sometimes it is difficult to find abnormal events that users are interested in, and users need to customize model parameters before detection. With the wide application of deep learning technology, it has a good modeling ability to solve linear and nonlinear data relationships, but the application of deep learning technology in the field of anomaly detection has many challenges. If we regard exceptions as a supervised problem, exceptions are a few, and we usually face the problem of too few labels. To obtain a model that performs well in the anomaly detection task, it requires a high initial training set. Therefore, to solve the above problems, this paper proposes a supervised learning method with manual participation. We introduce the integrated learning model and train a supervised anomaly detection model with strong stability and high accuracy through active learning technology. In addition, this paper adopts certain strategies to maximize the accuracy of anomaly detection and minimize the cost of manual labeling. In the experimental link, we will show that our method is better than some traditional anomaly detection algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Breunig, M., Kriegel, H., Ng, R.T., Sander, J.: LOF: Identifying density-based local outliers. ACM SIGMOD Rec. 29(2), 93–104 (2000)

    Google Scholar 

  2. Tang, J., Chen, Z., Fu, A., Cheung, D.: Enhancing effectiveness of outlier detections for low density patterns. In: Advances in Knowledge Discovery and Data Mining, pp. 535–548. Springer, Berlin, Germany (2002)

    Google Scholar 

  3. Schubert, E., Zimek, A., Kriegel, H.-P.: Local outlier detection reconsidered: A generalized view on locality with applications to spatial, video, and network outlier detection. Data Mining Knowl. Discovery 28(1), 190–237 (2014)

    Article  MathSciNet  Google Scholar 

  4. Ren, D., Wang, B., Perrizo, W.: RDF: a density-based outlier detection method using vertical data representation. In: Proc. Int. Conf. DataMining, pp. 503–506 (Nov 2004)

    Google Scholar 

  5. Bai, M., Wang, X., Xin, J., Wang, G.: An Efficient algorithm for distributed density-based outlier detection on big data. Neurocomputing 181, 19–28 (2016). Mar.

    Article  Google Scholar 

  6. Tang, B., He, H.: A local density-based approach for outlier detection. Neurocomputing 241, 171–180 (2017). Jun.

    Article  Google Scholar 

  7. Kriegel, H., Kröger, P., Zimek, A.: Outlier detection techniques. In: Proc. Tutorial KDD, pp. 1–10 (2009)

    Google Scholar 

  8. Eskin, E.: ‘Anomaly detection over noisy data using learned probability distributions. In: Proc. 17th Int. Conf. Mach. Learn. (ICML), pp. 255–262 (Jul. 2000)

    Google Scholar 

  9. Yang, X., Latecki, L.J., Pokrajac, D.: Outlier detection with globally optimal exemplar-based GMM. In: Proc. SIAM Int. Conf. on Mining(SDM), pp. 145–154 (Apr. 2009)

    Google Scholar 

  10. Zhang, L., Lin, J., Karim, R.: Adaptive kernel density-based anomaly detection for nonlinear systems. Knowl. -Based Syst. 139, 50–63 (2018). Jan.

    Article  Google Scholar 

  11. Qin, X., Cao, L., Rundensteiner, E.A., Madden, S.: Scalable kernel density estimation-based local outlier detection over large data streams. In: Proc. EDBT, pp. 421–432 (2019)

    Google Scholar 

  12. Dang, T.T., Ngan, H.Y.T., Liu, W.: Distance-based k-nearest neighbors outlier detection method in large-scale traffic data. In: Proc. IEEE

    Google Scholar 

  13. Luan, T., Min, Y.M., Shahabi, C.: Real-time distance-based outlier detection in data streams. Proceed

    Google Scholar 

Download references

Acknowledgements

The project is supported by the State Grid Research Project “Study on Intelligent Analysis Technology of Abnormal Power Data Quality based on Rule Mining” (5700-202119176A-0–0-00).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shuya Lei .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Liu, W., Lei, S., Peng, L., Feng, J., Pan, S., Gao, M. (2022). Active Anomaly Detection Technology Based on Ensemble Learning. In: Wang, Y., Zhu, G., Han, Q., Wang, H., Song, X., Lu, Z. (eds) Data Science. ICPCSEE 2022. Communications in Computer and Information Science, vol 1628. Springer, Singapore. https://doi.org/10.1007/978-981-19-5194-7_5

Download citation

  • DOI: https://doi.org/10.1007/978-981-19-5194-7_5

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-19-5193-0

  • Online ISBN: 978-981-19-5194-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics