Modeling DNS Activities Based on Probabilistic Latent Semantic Analysis

  • Conference paper
Advanced Data Mining and Applications (ADMA 2010)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 6441)

Abstract

Traditional Web usage mining techniques aim to discover usage patterns from Web data at the page level, while little work has addressed higher levels of aggregation. In this paper, we propose a novel approach to characterizing Internet users’ preferences and interests at the domain name level. We summarize users’ domain name access behavior as co-occurrences of users and the domain names they target, and introduce an aspect model that classifies users and domain names into groups according to these co-occurrences. Each group is then characterized by extracting the properties of its characteristic users and domain names. Experimental results on real-world data sets show that the approach is effective: it identifies some meaningful groups. The approach could therefore be used to detect unusual behavior on the Internet at the domain name level, reducing the effort of searching the joint space of users and domain names.
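
To make the idea of grouping users and domain names by their co-occurrences concrete, the following is a minimal sketch of a symmetric aspect model, P(u, d) = sum over z of P(z) P(u|z) P(d|z), fitted with EM. It is not the authors' implementation: the function names plsa and user_group, the parameter names, and the toy count matrix are all assumptions made for illustration, and details such as initialization and stopping criteria in the paper may differ.

```python
import random


def plsa(counts, n_groups, n_iter=200, seed=0):
    """Fit P(u, d) = sum_z P(z) P(u|z) P(d|z) by EM (illustrative sketch).

    counts[u][d] is a hypothetical number of queries by user u for domain d.
    Returns (p_z, p_u_z, p_d_z) with p_u_z[z][u] = P(u|z), p_d_z[z][d] = P(d|z).
    """
    n_users, n_domains = len(counts), len(counts[0])
    if sum(sum(row) for row in counts) == 0:
        raise ValueError("counts must contain at least one observation")
    rng = random.Random(seed)

    def random_dist(n):
        # Random positive weights, normalized to sum to 1.
        w = [rng.random() + 0.1 for _ in range(n)]
        s = sum(w)
        return [x / s for x in w]

    p_z = [1.0 / n_groups] * n_groups
    p_u_z = [random_dist(n_users) for _ in range(n_groups)]
    p_d_z = [random_dist(n_domains) for _ in range(n_groups)]

    for _ in range(n_iter):
        acc_z = [0.0] * n_groups
        acc_u = [[0.0] * n_users for _ in range(n_groups)]
        acc_d = [[0.0] * n_domains for _ in range(n_groups)]
        for u in range(n_users):
            for d in range(n_domains):
                n = counts[u][d]
                if n == 0:
                    continue
                # E-step: posterior P(z | u, d) for this observed pair.
                joint = [p_z[z] * p_u_z[z][u] * p_d_z[z][d]
                         for z in range(n_groups)]
                total = sum(joint)
                if total == 0.0:
                    continue
                for z in range(n_groups):
                    r = n * joint[z] / total  # expected count for group z
                    acc_z[z] += r
                    acc_u[z][u] += r
                    acc_d[z][d] += r
        # M-step: re-estimate parameters from the expected counts.
        grand = sum(acc_z)
        for z in range(n_groups):
            if acc_z[z] > 0.0:
                p_u_z[z] = [x / acc_z[z] for x in acc_u[z]]
                p_d_z[z] = [x / acc_z[z] for x in acc_d[z]]
            p_z[z] = acc_z[z] / grand
    return p_z, p_u_z, p_d_z


def user_group(p_z, p_u_z, u):
    """Hard-assign user u to the group z maximizing P(z) P(u|z)."""
    scores = [p_z[z] * p_u_z[z][u] for z in range(len(p_z))]
    return scores.index(max(scores))


# Made-up toy data: users 0-1 query domains 0-1, users 2-3 query domains 2-3.
counts = [
    [5, 4, 0, 0],
    [3, 6, 0, 0],
    [0, 0, 4, 5],
    [0, 0, 6, 2],
]
p_z, p_u_z, p_d_z = plsa(counts, n_groups=2)
groups = [user_group(p_z, p_u_z, u) for u in range(len(counts))]
```

On data with this block structure, users who query the same domains should end up in the same group. The characteristic domain names of a group can be read off in the same way by ranking P(d|z) within each row of p_d_z.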

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Yuchi, X., Lee, X., Jin, J., Yan, B. (2010). Modeling DNS Activities Based on Probabilistic Latent Semantic Analysis. In: Cao, L., Zhong, J., Feng, Y. (eds) Advanced Data Mining and Applications. ADMA 2010. Lecture Notes in Computer Science, vol 6441. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17313-4_29

  • DOI: https://doi.org/10.1007/978-3-642-17313-4_29

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-17312-7

  • Online ISBN: 978-3-642-17313-4

  • eBook Packages: Computer Science, Computer Science (R0)
