Abstract
The problem of anomaly detection in marine Argo data is studied. Based on the common and widely used DBSCAN (Density-Based Spatial Clustering of Applications with Noise) algorithm for anomaly detection in Argo data, as in other fields, there is a major problem in the application of DBSCAN algorithm, which is how to choose the appropriate parameter pairs. To solve this problem, this paper proposes an improved version of DBSCAN, namely CCMD-DBSCAN (DBSCAN based on the classification characteristics of marine data), which solves the above problem by studying the characteristics and laws of Argo data and is successfully applied to anomaly detection of marine data. The experimental results show that the new algorithm can not only determine the appropriate parameter pairs but also has a good anomaly detection effect.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Liu, Z., et al.: China Argo project: progress in China Argo ocean observations and data applications. Acta Oceanol. Sin. 36(06), 1ā11 (2017). https://doi.org/10.1007/s13131-017-1035-x
Rettig, L., Khayati, M., CudrĆ©-Mauroux, P., PiĆ³rkowski, M.: Online anomaly detection over big data streams. In: 2015 IEEE International Conference on Big Data (Big Data), Santa Clara, CA, USA, pp. 1113ā1122 (2015). https://doi.org/10.1109/BigData.2015.7363865
Grubbs, F.E.: Procedures for detecting outlying observations in samples. Technometrics 11(1), 1ā21 (1969)
Lewis T.: Outliers in Statistical Data, 3rd edn. (1994)
Ding, J., Wang, L., Shen, D., et al.: An anomaly detection system on big data. Nat. Sci. J. Hainan Univ. (2015)
Jiang, H., Yao, W.U., Lyu, K., et al.: Ocean data anomaly detection algorithm based on improved k-medoids. In: 2019 Eleventh International Conference on Advanced Computational Intelligence (ICACI) (2019)
Bhaskar, T., Shesu, R.V., Boyer, T.P., et al.: Quality control of oceanographic in situ data from Argo floats using climatological convex hulls. MethodsX 4(2), 469ā479 (2017)
He, Z.-S., Liu, Z.T., Zhuang, Y.B.: Data-partitioning-based parellel DBSCAN algorithm. J. Chin. Comput. Syst. (2006)
Zhu, Q., Tang, X., Liu, Z., et al.: Revised DBSCAN clustering algorithm based on dual grid. In: 2020 Chinese Control and Decision Conference (CCDC) (2020)
Chen, Y., Zhou, L., Bouguila, N., et al.: BLOCK-DBSCAN: fast clustering for large scale data. Pattern Recogn. 109, 107624 (2020)
Kryszkiewicz, M., Lasek, P.: TI-DBSCAN: clustering with DBSCAN by means of the triangle inequality. In: Szczuka, M., Kryszkiewicz, M., Ramanna, S., Jensen, R., Hu, Q. (eds.) RSCTC 2010. LNCS (LNAI), vol. 6086, pp. 60ā69. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-13529-3_8
Gao, X., Gui, Z.P., Long, X., et al.: KDSG-DBSCAN: a high performance DBSCAN algorithm based on K-D tree and spark GraphX. Geogr. Geo-Inf. Sci. 33(6), 1ā7 (2017)
Ohadi, N., Kamandi, A., Shabankhah, M., Fatemi, S.M., Hosseini, S.M., Mahmoudi, A.: SW-DBSCAN: a grid-based DBSCAN algorithm for large datasets. In: 2020 6th International Conference on Web Research (ICWR), Tehran, Iran, pp. 139ā145 (2020)
Zhu, Q., Tang, X., Liu, Z.: Revised DBSCAN clustering algorithm based on dual grid. In: Chinese Control And Decision Conference (CCDC), Hefei, China, pp. 3461ā3466 (2020)
Jolliffe, I.T., Cadima, J.: Principal component analysis: a review and recent developments. Philos. Trans. A Math. Phys. Eng. 374(2065), 20150202 (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
Ā© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Jiang, Y., Kang, C., Shen, Y., Huang, T., Zhai, G. (2022). Research on Argo Data Anomaly Detection Based on Improved DBSCAN Algorithm. In: Ma, H., Wang, X., Cheng, L., Cui, L., Liu, L., Zeng, A. (eds) Wireless Sensor Networks. CWSN 2022. Communications in Computer and Information Science, vol 1715. Springer, Singapore. https://doi.org/10.1007/978-981-19-8350-4_4
Download citation
DOI: https://doi.org/10.1007/978-981-19-8350-4_4
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-8349-8
Online ISBN: 978-981-19-8350-4
eBook Packages: Computer ScienceComputer Science (R0)