Abstract
We propose a measure, spatial local outlier measure (SLOM), which captures the local behaviour of datum in their spatial neighbourhood. With the help of SLOM, we are able to discern local spatial outliers that are usually missed by global techniques, like “three standard deviations away from the mean”. Furthermore, the measure takes into account the local stability around a data point and suppresses the reporting of outliers in highly unstable areas, where data are too heterogeneous and the notion of outliers is not meaningful. We prove several properties of SLOM and report experiments on synthetic and real data sets that show that our approach is novel and scalable to large datasets.
Similar content being viewed by others
References
Aggarwal CC, Yu PS (2001) Outlier detection for high dimensional data. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data. Santa Barbara, California, USA
Angiulli F, Pizzuti C. (2002) Fast outlier detection in high dimensional spaces. In: Proceedings of the 6th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD)
Bay SD, Schwabacher M (2003) Mining distance-based outliers in near linear time with randomisation and a simple pruning rule. In: Proceedings of 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
Breunig MM, Kriegel HP, Ng RT, Sander J (2000) LOF: Identifying density-based local outliers. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, pp. 93–104. Dallas, Texas, USA
Hawkins D (1980) Identification of outliers. Chapman and Hall, London
Knorr EM, Ng RT (1998) Algorithms for mining distance-based outliers in large datasets. In: Proceedings of 24th International Conference on Very Large Data Bases, pp. 392–403. New York City
Lu CT, Chen DC, Kou YF (2003a) Algorithms for spatial outlier detection. In: Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM 2003), pp. 597–600. Melbourne, Florida
Lu CT, Chen DC, Kou YF (2003b) Detecting spatial outliers with multiple attributes. In: Proceedings of 15th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2003), pp 122–128. Sacramento, California
McPhadden M (2002) El Nino and La Nina: Causes and global consequences. Encyclopedia of Global Environmental Change, pp. 353–370
Papadimitriou S, Kitagawa H, Gibbons PB, Faloutsos C (2003) LOCI: Fast outlier detection using the local correlation integral. In: Proceedings of the 19th International Conference on Data Engineering, pp. 315–328. Bangalore, India
Ramaswamy S, Rastogi R, Shim K (2000) Efficient algorithms for mining outliers from large datasets. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, pp. 427–438. Dallas, Texas
Shekhar S, Chawla S (2003) Spatial databases: A tour. Prentice Hall
Shekhar S, Lu CT, Zhang PS (2001) Detecting graph-based spatial outliers: Algorithms and applications (a summary of results). In: Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 371–376, San Francisco
Shekhar S, Lu CT, Zhang PS (2003) A unified approach to detecting spatial outliers. GeoInformatica, 7(2), 139–166
Wilcox R (2003) Applying contemporary statistical techniques. Elsevier Science
Author information
Authors and Affiliations
Additional information
Sanjay Chawla is a Senior Lecturer in the School of Information Technologies at the University of Sydney. His research interests span the area of data mining and spatial database management. He is a co-author of the textbook “Spatial Databases: A Tour”, which is published by Prentice Hall. His research work has appeared in leading publications, including IEEE Transaction on Knowledge and Data Engineering and GeoInformatica. He received his Ph.D. in Mathematics from the University of Tennessee, USA.
Pei Sun is currently a Ph.D. student in the School of Information Technology, Sydney University, Australia. His research interests include data mining and spatial database. He received his M.E. degree from the University of New South Wales, Sydney, Australia, in 2002 and a B.E. degree from Beijing Forestry University, China, in 1990.
Rights and permissions
About this article
Cite this article
Chawla, S., Sun, P. SLOM: a new measure for local spatial outliers. Knowl Inf Syst 9, 412–429 (2006). https://doi.org/10.1007/s10115-005-0200-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-005-0200-2