Persona Prototypes for Improving the Qualitative Evaluation of Recommendation Systems

Authors:
Joanna Misztal-Radecka, AGH University of Science and Technology & Ringier Axel Springer Polska, Cracow, Poland
Bipin Indurkhya, Jagiellonian University, Cracow, Poland

UMAP '20 Adjunct: Adjunct Publication of the 28th ACM Conference on User Modeling, Adaptation and Personalization, July 2020, Pages 206–212. https://doi.org/10.1145/3386392.3399297

Published: 13 July 2020

ABSTRACT

Most existing research on recommendation systems aims to optimize accuracy metrics on given datasets, which leads to an algorithm-driven design of the resulting solutions. When the dataset characteristics are poorly understood and the represented individuals are insufficiently diverse, such approaches amplify hidden data biases and existing disparities. In this research, we address this problem by proposing a Persona Prototyping approach that selects a set of the most representative users to help in understanding the complex distribution of user interests and in performing a proper qualitative evaluation of recommendation algorithms. A hierarchical density-based clustering technique is applied to distinguish diverse user groups and select their prototypes. Each selected representative is presented in an easily understandable form: a textual user story describing the prototype's behaviors, inspired by the concept of a persona from interaction design. We evaluated the diversity and representativeness of the selected individuals; the results show that the proposed method is capable of identifying diverse interest archetypes and can be used to improve the qualitative analysis of recommendations and to test how well they respond to the diversity of user needs.
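The prototype-selection step described in the abstract can be illustrated with a minimal sketch. It assumes that a density-based clustering step has already assigned each user a cluster label (with -1 marking noise), and it picks the medoid of each cluster as that cluster's prototype. The medoid criterion, the function names, and the toy interest vectors are assumptions made for illustration; the abstract does not state how the authors select prototypes within a cluster.

```python
from collections import defaultdict
from math import sqrt


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def select_prototypes(vectors, labels, noise_label=-1):
    """Return {cluster_label: index of the cluster medoid}.

    Assumption: the medoid (the member with the smallest total distance
    to all other members) serves as the cluster prototype. Points
    labelled as noise are ignored.
    """
    clusters = defaultdict(list)
    for idx, label in enumerate(labels):
        if label != noise_label:
            clusters[label].append(idx)

    prototypes = {}
    for label, members in clusters.items():
        prototypes[label] = min(
            members,
            key=lambda i: sum(euclidean(vectors[i], vectors[j]) for j in members),
        )
    return prototypes


# Hypothetical user-interest vectors (e.g. share of reads per topic).
vectors = [
    (0.9, 0.1), (0.8, 0.2), (1.0, 0.0),  # users mostly interested in topic A
    (0.1, 0.9), (0.2, 0.8), (0.0, 1.0),  # users mostly interested in topic B
    (0.5, 0.5),                          # outlier treated as noise
]
# Hypothetical labels, as a density-based clusterer might produce them.
labels = [0, 0, 0, 1, 1, 1, -1]

prototypes = select_prototypes(vectors, labels)
print(prototypes)  # {0: 0, 1: 3}
```

Each returned index identifies one real user whose behavior could then be turned into a textual user story; because the prototype is an actual data point rather than an averaged centroid, the resulting story describes behavior that really occurred in the data.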

Supplemental Material

3386392.3399297.mp4 (mp4, 27.5 MB)



Published in
UMAP '20 Adjunct: Adjunct Publication of the 28th ACM Conference on User Modeling, Adaptation and Personalization
July 2020
395 pages
ISBN: 9781450379502
DOI: 10.1145/3386392
Editors:
Tsvi Kuflik, University of Haifa, Israel
Ilaria Torre, University of Genoa, Italy
Robin Burke, University of Colorado, Boulder, USA
Cristina Gena, University of Turin, Italy
Copyright © 2020 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
hierarchical clustering
interaction design
model interpretability
prototype selection
recommendation explanations
recommender systems
unsupervised learning
user modeling
Qualifiers
- research-article
Acceptance Rates
Overall Acceptance Rate: 162 of 633 submissions, 26%