Skip to main content

Mining Top-n Local Outliers in Constrained Spatial Networks

  • Conference paper
Advanced Data Mining and Applications (ADMA 2008)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5139))

Included in the following conference series:

  • 2482 Accesses

Abstract

Outlier mining, also called outlier detection, is a challenging research issue in data mining with important applications as intrusion detection, fraud detection and medical analysis. From the perspective of data, previous work on outlier mining have involved in various types of data such as spatial data, time series data, trajectory data, and sensor data. However, few of them have considered a constrained spatial networks data in which each object must reside or move along a certain edge. In fact, in such special constrained spatial network data environments, previous outlier definitions and the according mining algorithms could work neither properly nor efficiently. In this paper we introduce a new definition of density-based local outlier in constrained spatial networks that considers for each object the outlier-ness with respect to its k nearest neighbors. Moreover , to detect outliers efficiently, we propose a fast cluster-and-bound algorithm that first cluster on each individual edge, then estimate the outlying degree of each cluster and prune those that could not contain top-n outliers, therefore constraining the computation of outliers to only very limited objects. Experiments on synthetic data sets demonstrate the scalability, effectiveness and efficiency of our methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Barnett, V., Lewis, T.: Outliers in Statistical Data. John Wiley & Sons, Chichester (1994)

    MATH  Google Scholar 

  2. Johnson, T., Kwok, I., Ng, R.T.: Fast Computation of 2-Dimensional Depth Contours. In: Proc. SIGKDD, pp. 224–228 (1998)

    Google Scholar 

  3. Knorr, E.M., Ng, R.T.: Algorithms for MiningDistance-Based Outliers in Large Datasets. In: Proc. of VLDB, pp. 392–403 (1998)

    Google Scholar 

  4. Knorr, E.M., Ng, R.T.: Finding intensional knowledge of distance-based outliers. In: Proc. of VLDB, pp. 211–222 (1999)

    Google Scholar 

  5. Ester, M., Kriegel, P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases. In: Proc. KDD, Portland, Oregon, pp. 226–231 (1996)

    Google Scholar 

  6. Guha, S., Rastogi, R., Shim, K.: Cure: An efficient clustering algorithm for large databases. In: Proc. SIGMOD, Seattle, WA, pp. 73–84 (1998)

    Google Scholar 

  7. Ng, R., Han, J.: Efficient and effective clustering method for spatial data mining. In: Proc. VLDB, Santiago, Chile, pp. 144–155 (1994)

    Google Scholar 

  8. Breunig, M., Kriegel, H., Ng, R.T., Sander, J.: LOF: Identifying Density-Based Local Outliers. In: Proc. of ACM SIGMOD 2000, TX (2000)

    Google Scholar 

  9. Jin, W., Tung, A.K.H., Han, J.: Mining top-n local outliers in large databases. In: Proc. ACM SIGKDD, San Francisco, New York, pp. 293–298 (2001)

    Google Scholar 

  10. Papadimitirou, S., Kitagawa, H., Gibbons, P.B., Faloutsos, C.: LOCI: Fast outlier detection using the local correlation integral. In: Proc. ICDE, Bangalore, India, pp. 315–326 (2003)

    Google Scholar 

  11. Lee, J., Han, J., Li, X.: Trajectory Outlier Detection: A Partition-and-Detect Framework. In: Proc. of IEEE ICDE, Cancun, Mexico (April 2008)

    Google Scholar 

  12. Li, X., Han, J.: Mining Approximate Top-K Subspace Anomalies in Multi-Dimensional Time-Series Data. In: Proc. VLDB 2007, Vienna, Austria (September 2007)

    Google Scholar 

  13. Yoon, K.-A., Kwon, O.-S., Bae, D.-H.: An Approach to Outlier Detection of Software Measurement Data using the K-means Clustering Method. In: Proc. ESEM 2007, pp. 443–445 (2007)

    Google Scholar 

  14. Zhu, Y.Y., Shasha, D.: Efficient elastic burst detection in data streams. In: Proc. SIGKDD 2003, New York, pp. 336–345 (2003)

    Google Scholar 

  15. Zhang, X., Shasha, D.: Better Burst Detection. In: Proc. ICDE 2006, Atlanta, GA, USA, p. 146 (2006)

    Google Scholar 

  16. Elio Masciari, E.E.: A Framework for Outlier Mining in RFID data. In: Proc. of IDEAS 2007, September 2007, pp. 263–267 (2007)

    Google Scholar 

  17. Brinkhoff, T.: A Framework for Generating Network-Based Moving Objects. GeoInformatica 6(2), 153–180 (2002)

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhang, C., Wu, Z., Qu, B., Chen, H. (2008). Mining Top-n Local Outliers in Constrained Spatial Networks. In: Tang, C., Ling, C.X., Zhou, X., Cercone, N.J., Li, X. (eds) Advanced Data Mining and Applications. ADMA 2008. Lecture Notes in Computer Science(), vol 5139. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88192-6_77

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-88192-6_77

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-88191-9

  • Online ISBN: 978-3-540-88192-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics