Skip to main content

Feature Matrices: A Model for Efficient and Anonymous Web Usage Mining

  • Conference paper
  • First Online:
Electronic Commerce and Web Technologies (EC-Web 2001)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2115))

Included in the following conference series:

Abstract

Recent growth of startup companies in the area of Web Usage Mining is a strong indication of the effectiveness of this data in understanding user behaviors. However, the approach taken by industry towards Web Usage Mining is off-line and hence intrusive, static, and cannot differentiate between various roles a single user might play. Towards this end, several researchers studied probabilistic and distance-based models to summarize the collected data and maintain only the important features for analysis. The proposed models are either not flexible to trade-off accuracy for performance per application requirements, or not adaptable in real-time due to high complexity of updating the model. In this paper, we propose a new model, the FM model, which is flexible, tunable, adaptable, and can be used for both anonymous and online analysis. Also, we introduce a novel similarity measure for accurate comparison among FM models of navigation paths or cluster of paths. We conducted several experiments to evaluate and verify the FM model.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Cadez I., Heckerman D., Meek C, Smyth P., and White S.: Visualization of Navigation Patterns on Web-Site Using Model Based Clustering. Technical Report MSR-TR-00-18, Microsoft Research, Microsoft Corporation, Redmond, WA,(2000)

    Google Scholar 

  2. Huberman B., Pirolli P., Pitkow J., and Lukos R.: Strong Regularities in World Wide Web Surfing. Science, 280, p.p 95–97 (1997)

    Article  Google Scholar 

  3. Mobasher B., Cooley R., and Srivastava J.: Automatic Personalization Based on Web Usage Mining. Special Section of the Communications of ACM on “Personalization Technologies with Data Mining”, 43(8):142–151 (2000)

    Google Scholar 

  4. Mulvenna M.D., Anand S.S., Büchner A.G.: Personalization on the Net using Web mining: Introduction. CACM 43(8): 122–125 (2000)

    Google Scholar 

  5. Perkowitz M., Etzioni O.: Toward Adaptive Web-Sites: Conceptual Framework and Case Study. Artificial Intelligence 118, p.p 245–275 (2000)

    Article  MATH  Google Scholar 

  6. Shahabi C., Zarkesh A.M., Adibi J., and Shah V.: Knowledge Discovery from Users Web Page Navigation. IEEE RIDE97 Workshop, April (1997)

    Google Scholar 

  7. Spiliopoulou M.: Web usage mining for site evaluation: Making a site better fit its users. Special Section of the Communications of ACM on “Personalization Technologies with Data Mining”, 43(8):127–134 (2000)

    Google Scholar 

  8. Srivastava J., Cooley r., Deshpande M., Tan M.N.: Web Usage Mining: Discovery and Applications of Usage Patterns from Web Data. SIGKDD Explorations, Vol. 1, Issue 2 (2000)

    Google Scholar 

  9. Yan T.W., Jacobsen M., Garcia-Molina H., Dayal U.: From User Access Patterns to Dynamic Hypertext Linking. Fifth International World Wide Web Conference, Paris, France, (1996)

    Google Scholar 

  10. Zarkesh A., Adibi J., Shahabi C., Sadri R., and Shah V.: Analysis and Design of Server Informative WWW-Sites. ACM CIKM’ 97 (1997)

    Google Scholar 

  11. Shahabi C., Banaei-Kashani F., Faruque J., Faisal A.: Feature Matrices: A Model for Efficient and Anonymous Mining of Web Navigations. Technical Report USC-CS-00-736, Computer Science Department, University of Southern California.

    Google Scholar 

Hypertext References

  1. HREF1: http://www.personify.com

  2. HREF2: http://www.websidestory.com

  3. HREF3: http://www.webtrends.com

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Shahabi, C., Banaei-Kashani, F., Faruque, J., Faisal, A. (2001). Feature Matrices: A Model for Efficient and Anonymous Web Usage Mining. In: Bauknecht, K., Madria, S.K., Pernul, G. (eds) Electronic Commerce and Web Technologies. EC-Web 2001. Lecture Notes in Computer Science, vol 2115. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44700-8_27

Download citation

  • DOI: https://doi.org/10.1007/3-540-44700-8_27

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-42517-5

  • Online ISBN: 978-3-540-44700-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics