Abstract
Outlier mining, also called outlier detection, is a challenging research issue in data mining with important applications as intrusion detection, fraud detection and medical analysis. From the perspective of data, previous work on outlier mining have involved in various types of data such as spatial data, time series data, trajectory data, and sensor data. However, few of them have considered a constrained spatial networks data in which each object must reside or move along a certain edge. In fact, in such special constrained spatial network data environments, previous outlier definitions and the according mining algorithms could work neither properly nor efficiently. In this paper we introduce a new definition of density-based local outlier in constrained spatial networks that considers for each object the outlier-ness with respect to its k nearest neighbors. Moreover , to detect outliers efficiently, we propose a fast cluster-and-bound algorithm that first cluster on each individual edge, then estimate the outlying degree of each cluster and prune those that could not contain top-n outliers, therefore constraining the computation of outliers to only very limited objects. Experiments on synthetic data sets demonstrate the scalability, effectiveness and efficiency of our methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Barnett, V., Lewis, T.: Outliers in Statistical Data. John Wiley & Sons, Chichester (1994)
Johnson, T., Kwok, I., Ng, R.T.: Fast Computation of 2-Dimensional Depth Contours. In: Proc. SIGKDD, pp. 224–228 (1998)
Knorr, E.M., Ng, R.T.: Algorithms for MiningDistance-Based Outliers in Large Datasets. In: Proc. of VLDB, pp. 392–403 (1998)
Knorr, E.M., Ng, R.T.: Finding intensional knowledge of distance-based outliers. In: Proc. of VLDB, pp. 211–222 (1999)
Ester, M., Kriegel, P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases. In: Proc. KDD, Portland, Oregon, pp. 226–231 (1996)
Guha, S., Rastogi, R., Shim, K.: Cure: An efficient clustering algorithm for large databases. In: Proc. SIGMOD, Seattle, WA, pp. 73–84 (1998)
Ng, R., Han, J.: Efficient and effective clustering method for spatial data mining. In: Proc. VLDB, Santiago, Chile, pp. 144–155 (1994)
Breunig, M., Kriegel, H., Ng, R.T., Sander, J.: LOF: Identifying Density-Based Local Outliers. In: Proc. of ACM SIGMOD 2000, TX (2000)
Jin, W., Tung, A.K.H., Han, J.: Mining top-n local outliers in large databases. In: Proc. ACM SIGKDD, San Francisco, New York, pp. 293–298 (2001)
Papadimitirou, S., Kitagawa, H., Gibbons, P.B., Faloutsos, C.: LOCI: Fast outlier detection using the local correlation integral. In: Proc. ICDE, Bangalore, India, pp. 315–326 (2003)
Lee, J., Han, J., Li, X.: Trajectory Outlier Detection: A Partition-and-Detect Framework. In: Proc. of IEEE ICDE, Cancun, Mexico (April 2008)
Li, X., Han, J.: Mining Approximate Top-K Subspace Anomalies in Multi-Dimensional Time-Series Data. In: Proc. VLDB 2007, Vienna, Austria (September 2007)
Yoon, K.-A., Kwon, O.-S., Bae, D.-H.: An Approach to Outlier Detection of Software Measurement Data using the K-means Clustering Method. In: Proc. ESEM 2007, pp. 443–445 (2007)
Zhu, Y.Y., Shasha, D.: Efficient elastic burst detection in data streams. In: Proc. SIGKDD 2003, New York, pp. 336–345 (2003)
Zhang, X., Shasha, D.: Better Burst Detection. In: Proc. ICDE 2006, Atlanta, GA, USA, p. 146 (2006)
Elio Masciari, E.E.: A Framework for Outlier Mining in RFID data. In: Proc. of IDEAS 2007, September 2007, pp. 263–267 (2007)
Brinkhoff, T.: A Framework for Generating Network-Based Moving Objects. GeoInformatica 6(2), 153–180 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, C., Wu, Z., Qu, B., Chen, H. (2008). Mining Top-n Local Outliers in Constrained Spatial Networks. In: Tang, C., Ling, C.X., Zhou, X., Cercone, N.J., Li, X. (eds) Advanced Data Mining and Applications. ADMA 2008. Lecture Notes in Computer Science(), vol 5139. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88192-6_77
Download citation
DOI: https://doi.org/10.1007/978-3-540-88192-6_77
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88191-9
Online ISBN: 978-3-540-88192-6
eBook Packages: Computer ScienceComputer Science (R0)