Abstract
Recommender systems have become an important tool to cope with the information overload problem by acquiring data about user behavior. After tracing the user’s behavior, through actions or rates, computational recommender systems use information- filtering techniques to recommend items. In order to recommend new items, one of the three major approaches is generally adopted: content-based filtering, collaborative filtering, or hybrid filtering. This paper presents three information-filtering methods, each of them based on one of these approaches. In our methods, the user profile is built up through symbolic data structures and the user and item correlations are computed through dissimilarity functions adapted from the symbolic data analysis (SDA) domain. The use of SDA tools has improved the performance of recommender systems, particularly concerning the find good items task measured by the half-life utility metric, when there is not much information about the user.
Similar content being viewed by others
Explore related subjects
Discover the latest articles and news from researchers in related subjects, suggested using machine learning.References
Arroyo J, Maté C (2009) Forecasting histogram time series with k-nearest neighbours methods. Int J Forecast 25(1): 192–207
Baeza-Yates R, Berthier RN (1999) Modern information retrieval. Addison-Wesley, Boston
Bezerra BLD, De Carvalho FAT (2004) A symbolic approach for content-based information filtering. Inf Process Lett 92(1): 45–52
Billard L, Diday E (2003) From the statistics of data to the statistics of knowledge: symbolic data analysis. J Am Stat Assoc 98(462): 470–487
Billard L, Diday E (2006) Symbolic data analysis. Conceptual statistics and data mining. Wiley, Chichester
Blanco-Fernandez Y, Pazos-Arias JJ, Lopez-Nores M, Gil-Solla A, Ramos-Cabrer M (2006) AVATAR: an improved solution for personalized tv based on semantic inference. IEEE Trans Consum Electron 52(1): 223–231
Bock HH, Diday E (2000) Analysis of symbolic data: exploratory methods for extracting statistical information from complex data. Springer-Verlag, Heidelberg
Breese JS, Heckerman D, Kadie C (1998) Empirical analysis of predictive algorithms for collaborative filtering. In: Cooper GF, Moral S (eds) Proceedings of the 14th conference on uncertainty in artificial intelligence (UAI-98). Morgan Kaufmann, San Francisco, pp 43–52
Burke R (2002) Hybrid recommender systems: survey and experiments. User Model User-Adapted Interact 12(4): 331–370
Cazes P, Chouakria A, Diday E, Schektman S (1997) Extension de l’analyse en composantes principales à des donnes de type intervalle. Revue de Statistique Aplique XLV(3): 5–24
Chavent M (1998) A monothetic clustering method. Pattern Recognit Lett 19(11): 989–996
Chavent M, De Carvalho FAT, Lechevallier Y, Verde R (2006) New clustering methods for interval data. Comput Stat 21(2): 211–230
Ciampi A, Diday E, Lebbe J, Perinel E, Vignes R (2000) Growing a tree classifier with imprecise data. Pattern Recognit Lett 21(9): 787–803
Claypool M, Brown D, Phong L, Waseda M (2001) Inferring user interests. IEEE Internet Comput 5(6): 32–39
Costello E, Doody J, McGinty L, Smyth B (2006) iCARE: intelligent customer assistance for recommending eyewear. In: Proceedings of the 11th international conference on intelligent user interfaces (IUI ’06), Sydney, Australia, January 2006, pp 282–284
Daoud M, Lechani LT, Boughanem M (2009) Towards a graph-based user profile modeling for a session-based personalized search. Knowl Inf Syst 21(3): 365–368
De Carvalho FAT (1995) Histograms in symbolic data analysis. Ann Oper Res 55(2): 229–322
De Carvalho FAT (2007) Fuzzy c-means clustering methods for symbolic interval data. Pattern Recognit Lett 28(4): 423–437
De Carvalho FAT, Brito P, Bock HH (2006) Dynamic clustering methods for interval data based on L2 distance. Comput Stat 21(2): 231–250
De Carvalho FAT, De Souza RMCR, Chavent M, Lechevallier Y (2006) Adaptive Hausdorff distances and dynamic clustering of symbolic data. Pattern Recognit Lett 27(3): 167–179
De Carvalho FAT, Lechevallier Y (2009) Partitional clustering algorithms for symbolic interval data based on single adaptive distances. Pattern Recognit 42(7): 1223–1236
De Carvalho FAT, Lechevallier Y (2009) Dynamic clustering of interval-valued data based on adaptive quadratic distances. IEEE Trans Syst Man Cybernet A: Syst Humans 39(6): 1295–1306
De Carvalho FAT, Csernel M, Lechevallier Y (2009) Clustering constrained symbolic data. Pattern Recognit Lett 30(11): 1037–1045
De Souza RMCR, De Carvalho FAT (2004) Clustering of interval data based on city-block distances. Pattern Recognit Lett 25(3): 353–365
Diday E, Noirhomme-Fraiture M (2008) Symbolic data analysis and the sodas software. Wiley, Chichester
El-Sonbaty Y, Ismail MA (1998) On-line hierarchical clustering. Pattern Recognit Lett 19(14): 1285–1291
El-Sonbaty Y, Ismail MA (1998) Fuzzy clustering for symbolic data. IEEE Trans Fuzzy Syst 6(2): 195–204
Errico JH, Sezan I (2006) Presence based collaborative recommender for networked audiovisual displays. In: Proceedings of the 11th international conference on intelligent user interfaces (IUI ’06), Sydney, Australia, January 2006, pp 297–299
Formica A, Pourabbas E (2008) Content-based similarity of geographic classes organized as partition hierarchies. Knowl Inf Syst 20(2): 221–241
Gori M, Pucci A (2006) A random walk based scoring algorithm with application to recommender systems for large scale E-commerce. In: Proceedings of the 12th ACM SIGKDD international conference on knowledge discovery and data mining, Philadelphia, USA, August 2006, pp 127–146
Gowda KC, Diday E (1991) Symbolic clustering using a new dissimilarity measure. Pattern Recognit 24(6): 567–578
Gowda KC, Ravi TR (1995) Divisive clustering of symbolic objects using the concepts of both similarity and dissimilarity. Pattern Recognit 28(8): 1277–1282
Groenen PJF, Winsberg S, Rodrigues O, Diday E (2000) I-Scal: multidimensional scaling of interval dissimilarities. Comput Stat Data Anal 51(1): 360–378
Guo Y, Deng G (2006) An improved personalized collaborative filtering algorithm in E-commerce recommender system. In: Proceedings of the international conference on service systems and service management, Troyes, France, October 2006, pp 1582–1586
Guru DS, Kiranagi BB, Nagabhushan P (2004) Multivalued type proximity measure and concept of mutual similarity value useful for clustering symbolic patterns. Pattern Recognit Lett 25(10): 1203–1213
Herlocker JL, Konstan JA, Terveen LG, Riedl JT (2004) Evaluating collaborative filtering recommender systems. ACM Trans Inf Syst 22(1): 5–53
Huang Z, Zeng D, Chen H (2007) Collaborative-filtering recommendation algorithms for E-commerce. IEEE Intell Syst 22(5): 68–78
Ichino M, Yaguchi H (1994) Generalized Minkowsky metrics for mixed feature type data analysis. IEEE Trans Syst Man Cybernet 24(4): 698–708
Jiao Y, Cao G (2007) A collaborative tagging system for personalized recommendation in B2C electronic commerce. In: Proceedings of the international conference on wireless communications, networking and mobile computing, Shanghai, China, September 2007, pp 3609–3612
Lauro NC, Palumbo F (1997) Principal component analysis of interval data: a symbolic data analysis approach. Comput Stat 15(1): 73–87
Tamine-Lechani L, Boughanem M, Daoud M (2009) Evaluation of contextual information retrieval effectiveness: overview of issues and research. Knowl Inf Syst. doi:10.1007/s10115-009-0231-1
Ludwig B, Mandl S, Mammen SV (2006) What on tonight: user-centered and situation-aware proposals for TV programmes. In: Proceedings of the 11th international conference on intelligent user interfaces (IUI ’06), Sidney, Australia, January 2006, pp 258–260
Leung CW, Chan SC, Chung F (2006) A collaborative filtering framework based on fuzzy association rules and multiple-level similarity. Knowl Inf Syst 10(3): 357–381
Lima Neto EA, De Carvalho FAT (2008) Centre and range method for fitting a linear regression model to symbolic interval data. Comput Stat Data Anal 52(3): 1500–1515
Lima Neto EA, De Carvalho FAT (2010) Constrained linear regression models for symbolic interval-valued variables. Comput Stat Data Anal 54(2): 333–347
Maia ALS, De Carvalho FAT, Ludermir TB (2008) Forecasting models for interval-valued time series. Neurocomputing 71(16–18): 3344–3352
Mali K, Mitra S (2003) Clustering and its validation in a symbolic framework. Pattern Recognit Lett 24(14): 2367–2376
Palumbo F, Verde R (2000) Non-symmetrical factorial discriminant analysis for symbolic objects. Appl Stoch Models Bus Ind 15(4): 419–427
Prudencio RBC, Ludermir TB, De Carvalho FAT (2004) A modal symbolic classifier for selecting time series models. Pattern Recognit Lett 25(4): 911–921
Ralph P, Parsons J (2006) A framework for automatic online personalization. In: Proceedings of the 39th Hawaii international conference on system sciences, Kaloa, Kauai, Hawaii, pp 137b–137b
Ricci F, Werthner H (2006) Introduction to the special issue: recommender systems. Int J Electron Commerce 11(2): 5–9
Salter J, Antonopoulos N (2006) Cinema Screen recommender agent: combining collaborative and content-based filtering. IEEE Intell Syst 21(1): 35–41
Schafer JB, Konstan JA, Riedl J (2001) E-commerce recommendation applications. Data Min Knowl Discov 5(1–2): 115–153
Schweizer B (1984) Distributions are the numbers of the future. In: Nola A, Ventre D (eds) Proceedings of the mathematics of fuzzy systems meeting. University of Naples, Naples, pp 137–149
Shyu ML, Haruechaiyasak C, Chen SC (2006) Mining user access patterns with traversal constraint for predicting web page requests. Knowl Inf Syst 10(4): 515–528
Tamine-Lechani L, Boughanem M, Zemirli Wn (2008) Personalized document ranking: exploiting evidence from multiple user interests for profiling and retrieval. J Digit Inform Manag 6(5): 354–365
Tan AH, Ong HL, Pan H, Ng J, Li QX (2004) Towards personalised web intelligence. Knowl Inf Syst 6(5): 595–616
Vucetic S, Obradovic Z (2005) Collaborative filtering using a regression-based approach. Knowl Inf Syst 7(1): 1–22
Webster A, Vassileva J (2007) Push-Poll recommender system: supporting word of mouth. In: Conati C, McCoy KF, Paliouras G (eds) Proceedings of the 11th international conference UM 2007, user modeling 2007, Corfu, Greece, June 2007. Lecture notes in artificial intelligence, vol. 4511. Springer, Berlin, pp 278–287
Webster A, Vassileva J (2007) The keepup recommender system. In: Proceedings of the 2007 ACM conference on recommender systems, Minneapolis, USA, October 2007, pp 173–176
Wei K, Huang J, Fu S (2007) A survey of E-commerce recommender systems. In: Proceedings of the 2007 international conference on service systems and service management, Chengdu, China, June 2007, pp 1–5
Weng LT, Xu Y, Li Y, Nayak R (2006) A fair peer selection algorithm for an E-commerce oriented distributed recommender system. In: Li Y et al (eds) Advances in intelligent IT: active media technology 2006. IOS Press, Amsterdam
Yu K, Xu X, Ester M, Kriege HP (2003) Feature weighting and instance selection for collaborative filtering: an information-theoretic approach. Knowl Inf Syst 5(2): 201–224
Zanker M, Jessenitschnig M, Jannach D, Gordea S (2007) Recommendation strategies in a commercial context. IEEE Intell Syst 22(3): 69–73
Zhanga Y, Jiao J (2007) An associative classification-based recommendation system for personalization in B2C e-commerce applications. Expert Syst Appl 33(2): 357–367
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Leite Dantas Bezerra, B., Tenorio de Carvalho, F.d. Symbolic data analysis tools for recommendation systems. Knowl Inf Syst 26, 385–418 (2011). https://doi.org/10.1007/s10115-009-0282-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-009-0282-3
Keywords
Profiles
- Francisco de Assis Tenorio de Carvalho View author profile