Predicting Web Information Content

Zhu, Tingshao; Greiner, Russ; Häubl, Gerald; Price, Bob

doi:10.1007/11577935_13

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3169)

Abstract

This paper introduces a novel method for predicting the current information need of a web user from the content of the pages the user has visited and the actions the user has applied to these pages. This inference is based on a parameterized model of how the sequence of actions chosen by the user indicates the degree to which page content satisfies the user’s information need. We show that the model parameters can be estimated using standard methods from a labelled corpus. Data from lab experiments demonstrate that the prediction model can effectively identify the information needs of new users, browsing previously unseen pages. The paper concludes with an overview of our “complete-web” recommendation system, WebIC, which uses the prediction model to recommend useful pages to the user, from anywhere on the Web.

Author information

Authors and Affiliations

Department of Computing Science, University of Alberta, T6G 2E1, Canada
Tingshao Zhu, Russ Greiner & Bob Price
School of Business, University of Alberta, T6G 2R6, Canada
Gerald Häubl

Editor information

Editors and Affiliations

Center for Web Intelligence School of Computing, DePaul University, Chicago, Illinois, USA
Bamshad Mobasher
Department of Computer Science, University of Warwick, Coventry, United Kingdom
Sarabjot Singh Anand

About this paper

Cite this paper

Zhu, T., Greiner, R., Häubl, G., Price, B. (2005). Predicting Web Information Content. In: Mobasher, B., Anand, S.S. (eds) Intelligent Techniques for Web Personalization. ITWP 2003. Lecture Notes in Computer Science, vol 3169. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11577935_13

DOI: https://doi.org/10.1007/11577935_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29846-5
Online ISBN: 978-3-540-31655-8
eBook Packages: Computer Science, Computer Science (R0)
