Mining Top-n Local Outliers in Constrained Spatial Networks

Zhang, Chongsheng; Wu, Zhongbo; Qu, Bo; Chen, Hong

doi:10.1007/978-3-540-88192-6_77

Chongsheng Zhang^6,7,
Zhongbo Wu^6,7,
Bo Qu^6,7 &
…
Hong Chen^6,7

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5139))

Included in the following conference series:

International Conference on Advanced Data Mining and Applications

2482 Accesses

Abstract

Outlier mining, also called outlier detection, is a challenging research issue in data mining with important applications as intrusion detection, fraud detection and medical analysis. From the perspective of data, previous work on outlier mining have involved in various types of data such as spatial data, time series data, trajectory data, and sensor data. However, few of them have considered a constrained spatial networks data in which each object must reside or move along a certain edge. In fact, in such special constrained spatial network data environments, previous outlier definitions and the according mining algorithms could work neither properly nor efficiently. In this paper we introduce a new definition of density-based local outlier in constrained spatial networks that considers for each object the outlier-ness with respect to its k nearest neighbors. Moreover , to detect outliers efficiently, we propose a fast cluster-and-bound algorithm that first cluster on each individual edge, then estimate the outlying degree of each cluster and prune those that could not contain top-n outliers, therefore constraining the computation of outliers to only very limited objects. Experiments on synthetic data sets demonstrate the scalability, effectiveness and efficiency of our methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Barnett, V., Lewis, T.: Outliers in Statistical Data. John Wiley & Sons, Chichester (1994)
MATH Google Scholar
Johnson, T., Kwok, I., Ng, R.T.: Fast Computation of 2-Dimensional Depth Contours. In: Proc. SIGKDD, pp. 224–228 (1998)
Google Scholar
Knorr, E.M., Ng, R.T.: Algorithms for MiningDistance-Based Outliers in Large Datasets. In: Proc. of VLDB, pp. 392–403 (1998)
Google Scholar
Knorr, E.M., Ng, R.T.: Finding intensional knowledge of distance-based outliers. In: Proc. of VLDB, pp. 211–222 (1999)
Google Scholar
Ester, M., Kriegel, P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases. In: Proc. KDD, Portland, Oregon, pp. 226–231 (1996)
Google Scholar
Guha, S., Rastogi, R., Shim, K.: Cure: An efficient clustering algorithm for large databases. In: Proc. SIGMOD, Seattle, WA, pp. 73–84 (1998)
Google Scholar
Ng, R., Han, J.: Efficient and effective clustering method for spatial data mining. In: Proc. VLDB, Santiago, Chile, pp. 144–155 (1994)
Google Scholar
Breunig, M., Kriegel, H., Ng, R.T., Sander, J.: LOF: Identifying Density-Based Local Outliers. In: Proc. of ACM SIGMOD 2000, TX (2000)
Google Scholar
Jin, W., Tung, A.K.H., Han, J.: Mining top-n local outliers in large databases. In: Proc. ACM SIGKDD, San Francisco, New York, pp. 293–298 (2001)
Google Scholar
Papadimitirou, S., Kitagawa, H., Gibbons, P.B., Faloutsos, C.: LOCI: Fast outlier detection using the local correlation integral. In: Proc. ICDE, Bangalore, India, pp. 315–326 (2003)
Google Scholar
Lee, J., Han, J., Li, X.: Trajectory Outlier Detection: A Partition-and-Detect Framework. In: Proc. of IEEE ICDE, Cancun, Mexico (April 2008)
Google Scholar
Li, X., Han, J.: Mining Approximate Top-K Subspace Anomalies in Multi-Dimensional Time-Series Data. In: Proc. VLDB 2007, Vienna, Austria (September 2007)
Google Scholar
Yoon, K.-A., Kwon, O.-S., Bae, D.-H.: An Approach to Outlier Detection of Software Measurement Data using the K-means Clustering Method. In: Proc. ESEM 2007, pp. 443–445 (2007)
Google Scholar
Zhu, Y.Y., Shasha, D.: Efficient elastic burst detection in data streams. In: Proc. SIGKDD 2003, New York, pp. 336–345 (2003)
Google Scholar
Zhang, X., Shasha, D.: Better Burst Detection. In: Proc. ICDE 2006, Atlanta, GA, USA, p. 146 (2006)
Google Scholar
Elio Masciari, E.E.: A Framework for Outlier Mining in RFID data. In: Proc. of IDEAS 2007, September 2007, pp. 263–267 (2007)
Google Scholar
Brinkhoff, T.: A Framework for Generating Network-Based Moving Objects. GeoInformatica 6(2), 153–180 (2002)
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

School of Information, Renmin University of China, Beijing, 100872, China
Chongsheng Zhang, Zhongbo Wu, Bo Qu & Hong Chen
MOE Key Lab of Data Engineering and Knowledge Engineering, Beijing, 100872, China
Chongsheng Zhang, Zhongbo Wu, Bo Qu & Hong Chen

Authors

Chongsheng Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Zhongbo Wu
View author publications
You can also search for this author in PubMed Google Scholar
Bo Qu
View author publications
You can also search for this author in PubMed Google Scholar
Hong Chen
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computer Science, Sichuan University, 610065, Chengdu, China
Changjie Tang
Department of Computer Science, The University of Western Ontario, Canada
Charles X. Ling
School of ITEE, The University of Queensland, Australia
Xiaofang Zhou
Faculty of Science & Engineering, York University, 355 Lumbers Building, M3J 1P3, Toronto, Ontario, Canada
Nick J. Cercone
School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, 4072, Queensland, Australia
Xue Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, C., Wu, Z., Qu, B., Chen, H. (2008). Mining Top-n Local Outliers in Constrained Spatial Networks. In: Tang, C., Ling, C.X., Zhou, X., Cercone, N.J., Li, X. (eds) Advanced Data Mining and Applications. ADMA 2008. Lecture Notes in Computer Science(), vol 5139. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88192-6_77

Download citation

DOI: https://doi.org/10.1007/978-3-540-88192-6_77
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88191-9
Online ISBN: 978-3-540-88192-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics