Abstract
Outlier mining is an important branch of data mining and has attracted much attention recently. The density-based method LOF is widely used in application. However, selecting MinPts is non-trivial, and LOF is very sensitive to its parameters MinPts. In this paper, we propose a new outlier detection method based on Voronoi diagram, which we called Voronoi based Outlier Detection (VOD). The proposed method measures the outlier factor automatically by Voronoi neighborhoods without parameter, which provides highly-accurate outlier detection and reduces the time complexity from O(n 2) to O(nlogn).
Supported by the Science and Technology Key Projects of Shandong Province under Grant No.2007GG3WZ10010; Doctoral Scientific Research Foundation of Shandong University of Finance under Grant No.06BSJJ09.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Hawkins, D.: Identification of Outliers. Chapman and Hall, London (1980)
Barnett, V., Lewis, T.: Outliers in Statistical Data. John Wiley, England (1994)
Johnson, T., Kwok, I., Ng, R.: Fast Computation of 2-Dimensional Depth Contours. In: Proceedings of the KDD, pp. 224–228 (1998)
Jain, A.K., Murty, M.N., Flynn, P.J.: Data Clustering: A Review. ACM Comp. Surveys 31(3), 264–323 (1999)
Knorr, E.M., Ng, R.T.: Algorithms for Mining Distance-Based Outliers in Large Datasets. In: Proceedings of the VLDB, pp. 392–403 (1998)
Breunig, M.M., Kriegel, H.-P., Ng, R., Sander, J.: LOF: Identifying Density-Based Local Outliers. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, Dallas, Texas, USA, pp. 93–104 (2000)
Papadimitirou, S., Kitagawa, H., Gibbons, P.B., Faloutsos, C.: LOCI: Fast Outlier Detection Using the Local Correlation Integral. In: Proceedings of the 19th International Conference On Data Engineering, Bangalore, India, pp. 315–326 (2003)
Agyemang, M., Ezeife, C.I.: LSC-Mine: Algorithm for Mining Local Outliers. In: Proceedings of the 15th Information Resource Management Association (IRMA) International Conference, New Orleans, pp. 5–8 (2004)
Tang, J., Chen, Z., Fu, A., David, W.C.: Enhancing Effectiveness of Outlier Detections for Low Density Patterns. In: The 6th Pacific-Asia Conf. on Knowledge Discovery and Data Mining (PAKDD), Taipei, pp. 535–548 (2002)
Jin, W., Tung, A.K.H., Han, J., Wang, W.: Ranking Outliers Using Symmetric Neighborhood Relationship. In: Proceedings of 10th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, Singapore, pp. 577–593 (2006)
Sanjay, C., Pei, S.: SLOM: A New Measure for Local Spatial Outliers. Knowledge and Information Systems 9(4), 412–429 (2006)
Yaling, P., Osmar, R.Z., Yong, G.: An Efficient Reference-Based Approach to Outlier Detection in Large Datasets. In: Proceedings of the Sixth International Conference on Data Mining (ICDM), Washington, DC, USA, pp. 478–487 (2006)
Matthew, G., Raymond, K.W.: An Efficient Histogram Method for Outlier Detection. Advances in Databases: Concepts, Systems and Applications, 176–187 (2007)
Fan, H., Zaïane, O.R., Foss, A., Wu, J.: A Nonparametric Outlier Detection for Effectively Discovering Top-n Outliers from Engineering Data. In: Proceedings of the 10th Pacific-Asia Conference on Knowledge Discovery and Data Mining(PAKDD), Singapore, pp. 557–566 (2006)
Yu, J.X., Qian, W., Lu, H., Zhou, A.: Finding Centric Local Outliers in Categorical/Numerical Spaces. Knowledge and Information Systems 9(3), 309–338 (2006)
Latecki, L.J., Lazarevic, A., Pokrajac, D.: Outlier Detection with Kernel Density Functions. In: Perner, P. (ed.) MLDM 2007. LNCS (LNAI), vol. 4571, pp. 61–75. Springer, Heidelberg (2007)
Preparata, F.P., Shamos, M.I.: Computational Geometry-An Introduction. Springer, Heidelberg (1985)
Fink, E., Pratt, K.B.: Indexing of Time Series by Major Minima and Maxima. In: Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, pp. 2332–2335 (2003)
Sariel, H.-P.: A Replacement for Voronoi Diagrams of Near Linear Size. In: Proceedings of the 42nd IEEE Symposium on Foundations of Computer Science, Las Vegas, Nevada, USA, pp. 94–103 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Qu, J. (2008). Outlier Detection Based on Voronoi Diagram. In: Tang, C., Ling, C.X., Zhou, X., Cercone, N.J., Li, X. (eds) Advanced Data Mining and Applications. ADMA 2008. Lecture Notes in Computer Science(), vol 5139. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88192-6_51
Download citation
DOI: https://doi.org/10.1007/978-3-540-88192-6_51
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88191-9
Online ISBN: 978-3-540-88192-6
eBook Packages: Computer ScienceComputer Science (R0)