Abstract
Anomaly detection of sensor data is a fundamental and active problem, it is involved in many applications especially the Wireless Sensor Network (WSN) where we detect anomaly by the group data. But for anomaly detection of a single sensor, many methods which consider spatial connection of data are not efficient. Point anomaly and pattern anomaly are two types of anomalies. In this work, we analyze the sensor data and find that pattern anomaly plays a dominant role for a single sensor. Moreover, the sudden change of data is found to be the special form of pattern anomaly. As the data collected by a single sensor is ordered by time, we convert the problem to pattern anomaly detection of time series. In this work, a two-phased algorithm OCSVM_KNN based on One Class Support Vector Machines (OCSVM) and K-Nearest Neighbors (KNN) is proposed. And, in the KNN classifier we introduce Complexity-invariant Distance (CID) as the distance measure method between two time-series sequences. Experimental results on real data sets show that our proposed approach outperforms other existing approaches in terms of anomaly detection rate, false alarm rate and misclassification anomaly rate.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Chandola, V., Banerjee, A., Kumar, V.: Anomaly detection: A survey. ACM Computing Surveys (CSUR) 41(3), 15 (2009)
Gupta, M., Gao, J., Aggarwal, C., Han, J.: Outlier detection for temporal data. Synthesis Lectures on Data Mining and Knowledge Discovery 5(1), 1–129 (2014)
Janakiram, D., Adi Mallikarjuna Reddy, V., Phani Kumar, A.: Outlier detection in wireless sensor networks using bayesian belief networks. In: First International Conference on Communication System Software and Middleware, 2006. Comsware 2006, pp. 1–6. IEEE (2006)
Hastie, T., Tibshirani, R.: Discriminant adaptive nearest neighbor classification. IEEE Transactions on Pattern Analysis and Machine Intelligence 18(6), 607–616 (1996)
Wei, Y.: Multi-dimensional time warping based on complexity invariance and its application in sports evaluation. In: 2014 11th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD), pp. 677–680. IEEE (2014)
Angiulli, F., Fassetti, F.: Detecting distance-based outliers in streams of data. In: Proceedings of the Sixteenth ACM Conference on Conference on Information and Knowledge Management, pp. 811–820. ACM (2007)
Koupaie, H.M., Ibrahim, S., Hosseinkhani, J.: Outlier detection in stream data by clustering method. International Journal of Advanced Computer Science and Information Technology 2(3), 25–34 (2013)
Zhang, Y., Meratnia, N., Havinga, P.: Outlier detection techniques for wireless sensor networks: A survey. Communications Surveys & Tutorials, IEEE 12(2), 159–170 (2010)
Hill, D.J., Minsker, B.S.: Anomaly detection in streaming environmental sensor data: A data-driven modeling approach. Environmental Modelling & Software 25(9), 1014–1022 (2010)
Budalakoti, S., Srivastava, A.N., Akella, R., Turkov, E.: Anomaly detection in large sets of high-dimensional symbol sequences (2006)
Chandola, V., Mithal, V., Kumar, V.: Comparative evaluation of anomaly detection techniques for sequence data. In: ICDM. pp. 743–748 (2008)
Rebbapragada, U., Protopapas, P., Brodley, C.E., Alcock, C.: Finding anomalous periodic time series. Machine Learning 74(3), 281–313 (2009)
González, F.A., Dasgupta, D.: Anomaly detection using real-valued negative selection. Genetic Programming and Evolvable Machines 4(4), 383–403 (2003)
Yang, Z., Meratnia, N., Havinga, P.: An online outlier detection technique for wireless sensor networks using unsupervised quarter-sphere support vector machine. In: International Conference on Intelligent Sensors, Sensor Networks and Information Processing, 2008. ISSNIP 2008, pp. 151–156. IEEE (2008)
Ma, J., Perkins, S.: Time-series novelty detection using one-class support vector machines. In: Proceedings of the International Joint Conference on Neural Networks, 2003, vol. 3, pp. 1741–1745. IEEE (2003)
Schölkopf, B., Platt, J.C., Shawe-Taylor, J., Smola, A.J., Williamson, R.C.: Estimating the support of a high-dimensional distribution. Neural computation 13(7), 1443–1471 (2001)
Schölkopf, B., Williamson, R.C., Smola, A.J., Shawe-Taylor, J., Platt, J.C.: Support vector method for novelty detection. In: NIPS. vol. 12, pp. 582–588 (1999)
BUILDING A “HAND WRITTEN DIGIT IDENTIFIER” AND SOME MACHINE LEARNING BASICS. https://aidaiict.wordpress.com/2013/10/24/building-a-hand-written-digit-identifier-and-some-machine-learning-basics/
Batista, G.E., Keogh, E.J., Tataw, O.M., de Souza, V.M.: Cid: an efficient complexity-invariant distance for time series. Data Mining and Knowledge Discovery 28(3), 634–669 (2014)
Chen, Y., Hu, B., Keogh, E., Batista, G.E.: Dtw-d: time series semi-supervised learning from a single example. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery And Data Mining, pp. 383–391. ACM (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Su, J., Long, Y., Qiu, X., Li, S., Liu, D. (2015). Anomaly Detection of Single Sensors Using OCSVM_KNN. In: Wang, Y., Xiong, H., Argamon, S., Li, X., Li, J. (eds) Big Data Computing and Communications. BigCom 2015. Lecture Notes in Computer Science(), vol 9196. Springer, Cham. https://doi.org/10.1007/978-3-319-22047-5_18
Download citation
DOI: https://doi.org/10.1007/978-3-319-22047-5_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-22046-8
Online ISBN: 978-3-319-22047-5
eBook Packages: Computer ScienceComputer Science (R0)