Abstract
Numerous studies have demonstrated the effectiveness of personali-zation using quality criteria both from machine learning / data mining and from user studies. However, a site requires more than a high-performance personalization algorithm: it needs to convince its users to input the data needed by the algorithm. Today’s Web users are becoming increasingly privacy-conscious and less willing to disclose personal data. How can the advantages of personalization (and hence, of disclosure) be communicated effectively, and how can the success of such strategies be measured in terms of improved personalization quality? In this paper, we argue for a tighter integration of the HCI and computational issues involved in these questions. We first outline the problems for personalization that arise from the combination of users’ privacy concerns and sites’ current policies of dealing with privacy issues. We then describe the results of an experiment that investigated the effects of changes to a site’s interface on users’ willingness to disclose data for personalization. This is followed by an overview of studies of the sensitivity of mining algorithms to changes in the availability of these types of data. Based on this, we outline a research agenda for future evaluation studies and user agent design.
This research was supported by the Deutsche Forschungsgemeinschaft, Berlin-Brandenburg Graduate School in Distributed Information Systems (DFG grant no. GRK 316/2).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Abrams, M.: Making Notices Work for Real People. In: 25th International Conference of Data Protection & Privacy Commissioners, Sydney, Australia (2003)
Ackerman, M.S., Cranor, L., Reagle, J.: Privacy in E-Commerce: Examining User Scenarios and Privacy Preferences. In: Proceedings of the 1st ACM E-Commerce, pp. 1–8. Denver Co. (1999)
Aggarwal, C.C., Wolf, J.L., Wu, K.-L., Yu, P.S.H.: Horting Hatches an Egg: A New Graph-Theoretic Approach to Collaborative Filtering. In: Proceedings of the Fifth ACM SIGKDD International Conference on KnowledgeDiscovery and Data Mining, San Diego, CA, USA, pp. 201–212 (1999)
Baldi, P., Frasconi, P., Smyth, P.: Modeling the Internet and the Web. In: Probabilistic Methods and Algorithms, John Wiley & Sons, Chichester (2003)
Berendt, B., Günther, O., Spiekermann, S.: Privacy in E-Commerce: Stated Preferences Vs. Actual Behavior. Communication of the ACM (2004) (forthcoming)
Berendt, B., Mobasher, B., Nakagawa, M., Spiliopoulou, M.: The Impact of Site Structure and User Environment on Session Reconstruction in Web Usage Analysis. In: Zaïane, O.R., Srivastava, J., Spiliopoulou, M., Masand, B. (eds.) WebKDD 2003. LNCS (LNAI), vol. 2703, pp. 115–129. Springer, Heidelberg (2003)
Berendt, B., Mobasher, B., Spiliopoulou, M., Wiltshire, J.: Measuring the Accuracy of Sessionizers for Web Usage Analysis. In: Proceedings of the Workshop on Web Mining at SIAM Data Mining Conference 2001, Chicago, IL, pp. 7–14 (2001)
Breese, J., Heckerman, D., Kadie, C.: Empirical Analysis of Predictive Algorithms for Collaborative Filtering. In: Proc. of the Fourteenth Annual Conference on Uncertainty in Artificial Intelligence (UAI 1998), pp. 43–52. Morgan Kaufmann, San Francisco (1998)
Cheung, K.-W., Tian, L.F.: Learning User Similarity and Rating Style for Collaborative Recommendation. In: Advances in Information Retrieval, 25th European Conference on IR Research, Pisa, Italy, pp. 395–410 (2003)
Consumer WebWatch: A Matter of Trust: What Users Want from Web Sites. conducted by Princeton Survey Research Associates (2002), http://www.consumerwebwatch.org/news/1_TOC.htm
Cooley, R., Mobasher, B., Srivastava, J.: Data Preparation for Mining World Wide Web Browsing Patterns. Journal of Knowledge and Information Systems 1(1), 5–32 (1999)
Cranor, L., Langheinrich, M., Marchiori, M., Presler-Marshall, M., Reagle, J.: The Platform for Privacy Preferences 1.0 (P3P1.0) Specification. W3C Recommendation April 16, 2002 (2002)
Cranor, L.F., Arjula, M., Guduru, P.: Use of a P3P User Agent by Early Adopters. In: ACM Workshop on Privacy in the Electronic Society, Washington, DC, USA, pp. 1–10 (2002)
Culnan, M.J., Milne, G.R.: The Culnan-Milne Survey on Consumers & Online Privacy Notices: Summary of Responses. In: Interagency Public Workshop: Get Noticed: Effective Financial Privacy Notices, Washington, D.C (2001)
CyberDialogue: UCO Software to Address Retailers’ $6.2 Billion Privacy Problem (October 2001), http://www.cyberdialogue.com/news/releases/2001/11-07-uco-retail.pdf
Department for Trade and Industry: Informing Consumers About E-Commerce. Conducted by MORI, London: DTI, London (2001), http://www.mori.com/polls/2001/pdf/dti-e-commerce.pdf
Earp, J.B., Baumer, D.C.: Innovative Web Use to Learn About Consumer Behavior and Online Privacy. Communications of the ACM 46(4), 81–83 (2003)
Fox, S., Rainie, L., Horrigan, J., Lenhart, A., Spooner, T., Carter, C.: Trust and Privacy Online: Why Americans Want to Rewrite the Rules. The Pew Internet & American Life Project, Washington, DC (2000), http://www.pewinternet.org/pdfs/PIP_Trust_Privacy_Report.pdf
GartnerG2: Privacy and Security: The Hidden Growth Strategy. Press Release (August 2001), http://www4.gartner.com/5_about/press_releases/2001/pr20010807d.html
Gigerenzer, G., Hoffrage, U.: How to Improve Bayesian Reasoning without Instruction: Frequency Formats. Psychological Review 102, 684–704 (1995)
Hann, I.-H., Hui, K.-L., Lee, T.S., Png, I.P.L.: Online Information Privacy: Measuring the Cost-Benefit Trade-Off. In: Twenty-Third International Conference on Information Systems, Barcelona, Spain (2002)
Harris Interactive: A Survey of Consumer Privacy Attitudes and Behaviors. Rochester, NY (2000), http://www.bbbonline.org/UnderstandingPrivacy/library/harrissummary.pdf
Heckerman, D., Chickering, D.M., Meek, C., Rounthwaite, R., Kadie, C.: Dependency Networks for Inference, Collaborative Filtering, and Data Visualization. Journal of Machine Learning Research 1, 49–75 (2000)
Herlocker, J.L., Konstan, J.A., Terveen, L.G., Riedl, J.T.: Evaluating Collaborative Filtering Recommender Systems. ACM Transactions on Information Systems 22(1), 5–53 (2004)
Hui, K.-L.: Privacy, Information Presentation, and Question Sequence. Working Paper (2004), http://www.comp.nus.edu.sg/~lung/sequence.pdf
Kobsa, A.: Ten Year Anniversary Issue. User Modeling and User-Adapted Interaction, vol. 11. Kluwer Academic Publishers, Dordrecht, Netherlands (2001)
Kobsa, A., Koenemann, J., Pohl, W.: Personalized Hypermedia Presentation Techniques for Improving Customer Relationships. The Knowledge Engineering Review 16(2), 111–155 (2001)
Kohavi, R.: Mining E-Commerce Data: The Good, the Bad, and the Ugly. In: Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, pp. 8–13 (2001)
Leathern, R.: Online Privacy. Payments and Transactions, vol. 1. Jupiter Research, New York (2002)
Mathewson, N., Dingledine, R.: Practical Traffic Analysis: Extending and Resisting Statistical Disclosure. In: The Proceedings of the 2004 Workshop on Privacy Enabling Technologies, Springer, Toronto, Canada (to appear, 2004)
Mobasher, B., Dai, H., Luo, T., Nakagawa, M.: Discovery and Evaluation of Aggregate Usage Profiles for Web Personalization. Data Mining and Knowledge Discovery 6 (2002)
Moe, W., Fader, P.: Which Visits Lead to Purchases? Dynamic Conversion Behavior at E-Commerce Sites. The Wharton School, Working Paper (2000), http://fourps.wharton.upenn.edu/ideas/pdf/00-023.pdf
Moe, W., Fader, P.: Capturing Evolving Visit Behavior in Clickstream Data. The Wharton School, Working Paper (2000), http://www-marketing.wharton.upenn.edu/ideas/pdf/00-003.pdf
Nielsen//Netratings: Cookies Messen Internetnutzung Ungenau Und Leiten Werbe-treibende Fehl. press release (2004), http://www.nielsennetratings.com/pr/pr_040929_germany.pdf
Padmanabhan, B., Zheng, Z., Kimbrough, S.O.: Personalization from Incomplete Data: What You Don’t Know Can Hurt. In: Proc. ACM-SIGKDD 2001, pp. 154–163 (2001)
Patrick, A.S., Kenny, S.: From Privacy Legislation to Interface Design: Implementing Information Privacy in Human-Computer Interfaces. In: Dingledine, R. (ed.) PET 2003. LNCS, vol. 2760, Springer, Heidelberg (2003)
Personalization Consortium: Personalization & Privacy Survey. Personalization Consortium (2000), http://www.personalization.org/SurveyResults.pdf
Privacy Commissioner of New Zealand: Privacy Concerns Loom Large. Conducted by UMR, Issue No. 42, Auckland: PC of New Zealand, Auckland, New Zealand (2001), http://www.privacy.org.nz/privword/42pr.html
Provost, F.J., Fawcett, T., Kohavi, R.: The Case against Accuracy Estimation for Comparing Induction Algorithms. In: Fifteenth International Conference on Machine Learning, pp. 445–453 (1998)
Roy Morgan Research: Privacy and the Community. Prepared for the Office of the Federal Privacy Commissioner, Sydney (2001), http://www.privacy.gov.au/publications/rcommunity.html
Sarwar, B., Karypis, G., Konstan, J., Riedl, J.: Analysis of Recommendation Algorithms for E-Commerce. In: ACM Conference on Electronic Commerce, San Diego, CA, pp. 158–167 (2000)
Spiekermann, S., Grossklags, J., Berendt, B.: E-Privacy in 2nd Generation E-Commerce: Privacy Preferences Versus Actual Behavior. In: EC 2001: Third ACM Conference on Electronic Commerce, Tampa, FL, pp. 38–47 (2001)
Spiliopoulou, M., Mobasher, B., Berendt, B., Nakagawa, M.: A Framework for the Evaluation of Session Reconstruction Heuristics in Web-Usage Analysis. INFORMS Journal on Computing 15, 171–190 (2003)
Sweeney, L.: Computational Disclosure Control: A Primer on Data Privacy Protection. MIT, Cambridge (2001), http://www.swiss.ai.mit.edu/classes/6.805/articles/privacy/sweeney-thesis-draft.pdf
Teltzrow, M., Kobsa, A.: Impacts of User Privacy Preferences on Personalized Systems: A Comparative Study. In: Karat, C.-M., Blom, J., Karat, J. (eds.) Designing Personalized User Experiences for Ecommerce, pp. 315–332. Kluwer, Dordrecht, Netherlands (2004)
Teltzrow, M., Preibusch, S., Berendt, B.: SIMT - a Privacy-Preserving Web Metrics Tool. In: IEEE Conference on Electronic Commerce (CEC 2004), San Diego (2004)
Teo, H.-H., Wan, W., Li, L.: Volunteering Personal Information on the Internet: Effects of Reputation, Privacy Initiatives, and Reward on Online Consumer Behavior. In: 37th Hawaii International Conference on System Sciences, Big Island, Hawaii, USA (2004)
Westin, A.: Testimony before U.S. House of Representatives, Committee on Energy and Commerce, Subcommittee on Commerce, Trade, and Consumer Protection, Hearing on Opinion Surveys: What Consumers Have to Say About Information Privacy (2001), http://energycommerce.house.gov/107/Hearings/05082001hearing209/hearing.htm
Zheng, Z., Padmanabhan, B., Kimbrough, S.O.: On the Existence and Significance of Data Preprocessing Biases in Web-Usage Mining. INFORMS Journal on Computing 15, 148–170 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Berendt, B., Teltzrow, M. (2005). Addressing Users’ Privacy Concerns for Improving Personalization Quality: Towards an Integration of User Studies and Algorithm Evaluation. In: Mobasher, B., Anand, S.S. (eds) Intelligent Techniques for Web Personalization. ITWP 2003. Lecture Notes in Computer Science(), vol 3169. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11577935_4
Download citation
DOI: https://doi.org/10.1007/11577935_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29846-5
Online ISBN: 978-3-540-31655-8
eBook Packages: Computer ScienceComputer Science (R0)