Skip to main content

A Scalable Algorithm for Detecting Community Outliers in Social Networks

  • Conference paper
Book cover Web-Age Information Management (WAIM 2012)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7418))

Included in the following conference series:

Abstract

Outlier detection is an important problem that has been researched and applied in a myriad of domains ranging from fraudulent transactions to intrusion detection. Most existing methods have been specially developed for detecting global and (or) local outliers by using either content information or structure information. Unfortunately, these conventional algorithms have been facing with unprecedented challenges in social networks, where data and link information are tightly integrated.

In this paper, a novel measurement named Community Outlying Factor is put forward for community outlier, besides its descriptive definition. A scalable community outliers detection algorithm (SCODA), which fully considers both content and structure information of social networks, is proposed. Furthermore, SCODA takes effective measures to minimize the number of input parameters down to only one, the number of outliers. Experimental results demonstrate that the time complexity of SCODA is linear to the number of nodes, which means that our algorithm can easily deal with very large data sets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Toshniwal, D., Yadav, S.: Adaptive Outlier Detection in Streaming Time Series. In: Proceedings of International Conference on Asia Agriculture and Animal, ICAAA 2011 (2011)

    Google Scholar 

  2. Grubbs, F.E.: Procedures for detecting outlying observations in samples. Technometrics 11 (1969)

    Google Scholar 

  3. Aggarwal, C.C., Yu, P.S.: Outlier detection for high dimensional data. In: Proceedings of the ACM SIGMOD International Conference on Management of Data (2001)

    Google Scholar 

  4. Orair, G., Teixeira, C., Wang, Y., Meira, W., Parthasarathy, S.: Distance-Based Outlier Detection: Consolidation and Renewed Bearing. In: Proceedings of International Conference on Very Large Data Bases, VLDB (2010)

    Google Scholar 

  5. Hodge, V.J., Austin, J.: A Survey of Outlier Detection Methodologies. Artificial Intelligence Review 22 (2004)

    Google Scholar 

  6. Coscia, M., Giannotti, F., Pedreschi, D.: A Classification for Community Discovery Methods in Complex Networks. Statistical Analysis and Data Mining 4 (2011)

    Google Scholar 

  7. Parthasarathy, S., Ruan, Y., Satuluri, V.: Community Discovery in Social Networks: Applications, Methods and Emerging Trends. Social Network Data Analytics, 79–113 (2011)

    Google Scholar 

  8. Aggarwal, C.C.: An Introduction to Social Network Data Analytics. Social Network Data Analytics (2011)

    Google Scholar 

  9. Gao, J., Liang, F., Fan, W., Wang, C., Sun, Y., Han, J.: On Community Outliers and their Efficient Detection in Information Networks. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2010)

    Google Scholar 

  10. Aggarwal, C.C., Zhao, Y., Yu, P.S.: Outlier Detection in Graph Streams. In: Proceedings of the International Conference on Data Engineering, ICDE (2011)

    Google Scholar 

  11. Zhang, J.: Towards Outlier Detection for High-dementional Data Streams using Projected Outlier Analysis Strategy. PhD thesis, Dalhousie University (2009)

    Google Scholar 

  12. Breunig, M.M., Kriegel, H.P., Ng, R.T., Sander, J.: Lof: Identifying density-based local outliers. In: Proceedings of the ACM SIGMOD International Conference on Management of Data (2000)

    Google Scholar 

  13. Papadimitriou, S., Kitagawa, H., Gibbons, P., Faloutsos, C.: LOCI: Fast outlier detection using the local correlation integral. In: Proceedings of the International Conference on Data Engineering, ICDE (2003)

    Google Scholar 

  14. Orair, G.H., Teixeira, C.H.C., Meira Jr., W., Wang, Y., Parthasarathy, S.: Distance-Based Outlier Detection: Consolidation and Renewed Bearing. In: Proceedings of the International Conference on Very Large Data Bases, VLDB (2010)

    Google Scholar 

  15. Moser, F., Ge, R., Ester, M.: Joint cluster analysis of attribute and relationship data withouta-priori specification of the number of clusters. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD (2007)

    Google Scholar 

  16. Yang, T., Jin, R., Chi, Y., Zhu, S.: Combining link and content for community detection: a discriminative approach. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD (2009)

    Google Scholar 

  17. Li, X., Li, Z., Han, J., Lee, J.-G.: Temporal Outlier Detection in Vehicle Traffic Data. In: Proceedings of the IEEE International Conference on Data Engineering, ICDE (2009)

    Google Scholar 

  18. Chakrabarti, D.: AutoPart: Parameter-Free Graph Partitioning and Outlier Detection. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) PKDD 2004. LNCS (LNAI), vol. 3202, pp. 112–124. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  19. Fortunato, S.: Community detection in graphs. Physics Reports 486 (2009)

    Google Scholar 

  20. He, Z., Xu, X., Deng, S.: Scalable Algorithms for Clustering Large Datasets with Mixed Type Attributes. International Journal of Intelligent Systems 20 (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ji, T., Gao, J., Yang, D. (2012). A Scalable Algorithm for Detecting Community Outliers in Social Networks. In: Gao, H., Lim, L., Wang, W., Li, C., Chen, L. (eds) Web-Age Information Management. WAIM 2012. Lecture Notes in Computer Science, vol 7418. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32281-5_42

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-32281-5_42

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-32280-8

  • Online ISBN: 978-3-642-32281-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics