skip to main content
10.1145/1835449.1835513acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Understanding web browsing behaviors through Weibull analysis of dwell time

Published: 19 July 2010 Publication History

Abstract

Dwell time on Web pages has been extensively used for various information retrieval tasks. However, some basic yet important questions have not been sufficiently addressed, eg, what distribution is appropriate to model the distribution of dwell times on a Web page, and furthermore, what the distribution tells us about the underlying browsing behaviors. In this paper, we draw an analogy between abandoning a page during Web browsing and a system failure in reliability analysis, and propose to model the dwell time using the Weibull distribution. Using this distribution provides better goodness-of-fit to real world data, and it uncovers some interesting patterns of user browsing behaviors not previously reported. For example, our analysis reveals that Web browsing in general exhibits a significant "negative aging" phenomenon, which means that some initial screening has to be passed before a page is examined in detail, giving rise to the browsing behavior that we call "screen-and-glean." In addition, we demonstrate that dwell time distributions can be reasonably predicted purely based on low-level page features, which broadens the possible applications of this study to situations where log data may be unavailable.

References

[1]
R. Abernethy. The New Weibull Handbook. fifth edition, 2006.
[2]
E. Agichtein, E. Brill, and S. Dumais. Improving web search ranking by incorporating user behavior information. In SIGIR, pages 19--26, 2006.
[3]
E. Agichtein, E. Brill, S. Dumais, and R. Ragno. Learning user interaction models for predicting web search result preferences. In SIGIR, pages 3--10, 2006.
[4]
J. Attenberg, S. Pandey, and T. Suel. Modeling and predicting user behavior in sponsored search. In KDD, pages 1067--1076, 2009.
[5]
G. Buscher, L. van Elst, and A. Dengel. Segment-level display time as implicit feedback: a comparison to eye tracking. In SIGIR, pages 67--74, 2009.
[6]
M. Claypool, P. Le, M. Wased, and D. Brown. Implicit interest indicators. In IUI, pages 33--40, 2001.
[7]
A. C. Cohen. Maximum likelihood estimation in the weibull distribution based on complete and on censored samples. Technometrics, 7(4):579--588, 1965.
[8]
W. J. Conover. Practical Nonparametric Statistics. Wiley, third edition, 1998.
[9]
D. Downey, S. Dumais, D. Liebling, and E. Horvitz. Understanding the relationship between searchers' queries and information goals. In CIKM, pages 449--458, 2008.
[10]
S. Fox, K. Karnawat, M. Mydland, S. Dumais, and T. White. Evaluating implicit measures to improve web search. ACM Trans. Inf. Syst., 23(2):147--168, 2005.
[11]
J. H. Friedman. Greedy function approximation: A gradient boosting machine. Annals of Statistics, 29:579--588, 1999.
[12]
T. Joachims, L. Granka, B. Pan, H. Hembrooke, and G. Gay. Accurately interpreting clickthrough data as implicit feedback. In SIGIR, pages 154--161, 2005.
[13]
T. Joachims, L. Granka, B. Pan, H. Hembrooke, F. Radlinski, and G. Gay. Evaluating the accuracy of implicit feedback from clicks and query reformulations in web search. ACM Transaction on Information System, 25(2):7, 2007.
[14]
D. Kelly and N. J. Belkin. Reading time, scrolling and interaction: exploring implicit sources of user preferences for relevance feedback. In SIGIR, pages 408--409, 2001.
[15]
D. Kelly and N. J. Belkin. Display time as implicit feedback: understanding task effects. In SIGIR'04, pages 377--384, 2004.
[16]
D. Kelly and C. Cool. The effects of topic familiarity on information search behavior. In JCDL, pages 74--75, 2002.
[17]
D. Kelly and J. Teevan. Implicit feedback for inferring user preference: a bibliography. SIGIR Forum, 37(2):18--28, 2003.
[18]
E. Lehman. Shapes, moments and estimators of the weibull distribution. IEEE Transactions on Reliability, 12:32--38, 1963.
[19]
Y. Liu, B. Gao, T.-Y. Liu, Y. Zhang, Z. Ma, S. He, and H. Li. BrowseRank: letting web users vote for page importance. In SIGIR, pages 451--458, 2008.
[20]
G. Marchionini and B. Shneiderman. Finding facts vs. browsing knowledge in hypertext systems. Computer, 21(1):70--80, 1988.
[21]
M. Morita and Y. Shinoda. Information filtering based on user behavior analysis and best match text retrieval. In SIGIR, pages 272--281, 1994.
[22]
D. Nichols. Implicit ratings and filtering. In Proceedings of the 5th DELOS Workshop on Filtering and Collaborative Filtering, pages 31--36, 1997.
[23]
H. Rinne. The Weibull Distribution: A Handbook. Chapman & Hall, first edition, 2008.
[24]
J. Teevan, C. Alvarado, M. S. Ackerman, and D. R. Karger. The perfect search engine is not enough: a study of orienteering behavior in directed search. In CHI, pages 415--422, 2004.
[25]
R. W. White and S. M. Drucker. Investigating behavioral variability in web search. In WWW, pages 21--30, 2007.
[26]
R. W. White and S. T. Dumais. Characterizing and predicting search engine switching behavior. In CIKM, pages 87--96, 2009.

Cited By

View all
  • (2024)A Study on the Security of Web Application using Enhanced Secure CookieThe Journal of Korean Institute of Information Technology10.14801/jkiit.2024.22.2.19522:2(195-204)Online publication date: 28-Feb-2024
  • (2024)Unbundle-Rewrite-Rebundle: Runtime Detection and Rewriting of Privacy-Harming Code in JavaScript BundlesProceedings of the 2024 on ACM SIGSAC Conference on Computer and Communications Security10.1145/3658644.3690262(2192-2206)Online publication date: 2-Dec-2024
  • (2024)Bridging the Analytics Gap: Optimizing Content Performance using Actionable Knowledge DiscoveryProceedings of the 35th ACM Conference on Hypertext and Social Media10.1145/3648188.3675121(185-192)Online publication date: 10-Sep-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '10: Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
July 2010
944 pages
ISBN:9781450301534
DOI:10.1145/1835449
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 July 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Weibull analysis
  2. dwell time
  3. user behaviors
  4. web browsing

Qualifiers

  • Research-article

Conference

SIGIR '10
Sponsor:

Acceptance Rates

SIGIR '10 Paper Acceptance Rate 87 of 520 submissions, 17%;
Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)117
  • Downloads (Last 6 weeks)16
Reflects downloads up to 20 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)A Study on the Security of Web Application using Enhanced Secure CookieThe Journal of Korean Institute of Information Technology10.14801/jkiit.2024.22.2.19522:2(195-204)Online publication date: 28-Feb-2024
  • (2024)Unbundle-Rewrite-Rebundle: Runtime Detection and Rewriting of Privacy-Harming Code in JavaScript BundlesProceedings of the 2024 on ACM SIGSAC Conference on Computer and Communications Security10.1145/3658644.3690262(2192-2206)Online publication date: 2-Dec-2024
  • (2024)Bridging the Analytics Gap: Optimizing Content Performance using Actionable Knowledge DiscoveryProceedings of the 35th ACM Conference on Hypertext and Social Media10.1145/3648188.3675121(185-192)Online publication date: 10-Sep-2024
  • (2024)RefreshChannels: Exploiting Dynamic Refresh Rate Switching for Mobile Device AttacksProceedings of the 22nd Annual International Conference on Mobile Systems, Applications and Services10.1145/3643832.3661864(359-371)Online publication date: 3-Jun-2024
  • (2024)Inverse Learning with Extremely Sparse Feedback for RecommendationProceedings of the 17th ACM International Conference on Web Search and Data Mining10.1145/3616855.3635797(396-404)Online publication date: 4-Mar-2024
  • (2023)Reliability engineering opportunities in Industry 4.0Engineering Today10.5937/engtoday2300001B2:1(7-22)Online publication date: 2023
  • (2023)Pool-partyProceedings of the 32nd USENIX Conference on Security Symposium10.5555/3620237.3620634(7091-7105)Online publication date: 9-Aug-2023
  • (2023)A Data Quality Measurement Framework Using Distribution-Based Modeling and Simulation in Real-Time Telemedicine SystemsApplied Sciences10.3390/app1313754813:13(7548)Online publication date: 26-Jun-2023
  • (2023)NSDIF: Leveraging Non-Sampling Learning and Denoising Implicit Feedback for RecommendationProceedings of the 2023 5th International Conference on Internet of Things, Automation and Artificial Intelligence10.1145/3653081.3653129(289-294)Online publication date: 24-Nov-2023
  • (2023)SLED: Structure Learning based Denoising for RecommendationACM Transactions on Information Systems10.1145/361138542:2(1-31)Online publication date: 8-Nov-2023
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media