Skip to main content
Log in

Symbolic data analysis tools for recommendation systems

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

Recommender systems have become an important tool to cope with the information overload problem by acquiring data about user behavior. After tracing the user’s behavior, through actions or rates, computational recommender systems use information- filtering techniques to recommend items. In order to recommend new items, one of the three major approaches is generally adopted: content-based filtering, collaborative filtering, or hybrid filtering. This paper presents three information-filtering methods, each of them based on one of these approaches. In our methods, the user profile is built up through symbolic data structures and the user and item correlations are computed through dissimilarity functions adapted from the symbolic data analysis (SDA) domain. The use of SDA tools has improved the performance of recommender systems, particularly concerning the find good items task measured by the half-life utility metric, when there is not much information about the user.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Arroyo J, Maté C (2009) Forecasting histogram time series with k-nearest neighbours methods. Int J Forecast 25(1): 192–207

    Article  Google Scholar 

  2. Baeza-Yates R, Berthier RN (1999) Modern information retrieval. Addison-Wesley, Boston

    Google Scholar 

  3. Bezerra BLD, De Carvalho FAT (2004) A symbolic approach for content-based information filtering. Inf Process Lett 92(1): 45–52

    Article  MATH  MathSciNet  Google Scholar 

  4. Billard L, Diday E (2003) From the statistics of data to the statistics of knowledge: symbolic data analysis. J Am Stat Assoc 98(462): 470–487

    Article  MathSciNet  Google Scholar 

  5. Billard L, Diday E (2006) Symbolic data analysis. Conceptual statistics and data mining. Wiley, Chichester

    Book  MATH  Google Scholar 

  6. Blanco-Fernandez Y, Pazos-Arias JJ, Lopez-Nores M, Gil-Solla A, Ramos-Cabrer M (2006) AVATAR: an improved solution for personalized tv based on semantic inference. IEEE Trans Consum Electron 52(1): 223–231

    Article  Google Scholar 

  7. Bock HH, Diday E (2000) Analysis of symbolic data: exploratory methods for extracting statistical information from complex data. Springer-Verlag, Heidelberg

    Google Scholar 

  8. Breese JS, Heckerman D, Kadie C (1998) Empirical analysis of predictive algorithms for collaborative filtering. In: Cooper GF, Moral S (eds) Proceedings of the 14th conference on uncertainty in artificial intelligence (UAI-98). Morgan Kaufmann, San Francisco, pp 43–52

    Google Scholar 

  9. Burke R (2002) Hybrid recommender systems: survey and experiments. User Model User-Adapted Interact 12(4): 331–370

    Article  MATH  Google Scholar 

  10. Cazes P, Chouakria A, Diday E, Schektman S (1997) Extension de l’analyse en composantes principales à des donnes de type intervalle. Revue de Statistique Aplique XLV(3): 5–24

    Google Scholar 

  11. Chavent M (1998) A monothetic clustering method. Pattern Recognit Lett 19(11): 989–996

    Article  MATH  Google Scholar 

  12. Chavent M, De Carvalho FAT, Lechevallier Y, Verde R (2006) New clustering methods for interval data. Comput Stat 21(2): 211–230

    Article  MATH  MathSciNet  Google Scholar 

  13. Ciampi A, Diday E, Lebbe J, Perinel E, Vignes R (2000) Growing a tree classifier with imprecise data. Pattern Recognit Lett 21(9): 787–803

    Article  Google Scholar 

  14. Claypool M, Brown D, Phong L, Waseda M (2001) Inferring user interests. IEEE Internet Comput 5(6): 32–39

    Article  Google Scholar 

  15. Costello E, Doody J, McGinty L, Smyth B (2006) iCARE: intelligent customer assistance for recommending eyewear. In: Proceedings of the 11th international conference on intelligent user interfaces (IUI ’06), Sydney, Australia, January 2006, pp 282–284

  16. Daoud M, Lechani LT, Boughanem M (2009) Towards a graph-based user profile modeling for a session-based personalized search. Knowl Inf Syst 21(3): 365–368

    Article  Google Scholar 

  17. De Carvalho FAT (1995) Histograms in symbolic data analysis. Ann Oper Res 55(2): 229–322

    Google Scholar 

  18. De Carvalho FAT (2007) Fuzzy c-means clustering methods for symbolic interval data. Pattern Recognit Lett 28(4): 423–437

    Article  Google Scholar 

  19. De Carvalho FAT, Brito P, Bock HH (2006) Dynamic clustering methods for interval data based on L2 distance. Comput Stat 21(2): 231–250

    Article  MATH  MathSciNet  Google Scholar 

  20. De Carvalho FAT, De Souza RMCR, Chavent M, Lechevallier Y (2006) Adaptive Hausdorff distances and dynamic clustering of symbolic data. Pattern Recognit Lett 27(3): 167–179

    Article  Google Scholar 

  21. De Carvalho FAT, Lechevallier Y (2009) Partitional clustering algorithms for symbolic interval data based on single adaptive distances. Pattern Recognit 42(7): 1223–1236

    Article  MATH  Google Scholar 

  22. De Carvalho FAT, Lechevallier Y (2009) Dynamic clustering of interval-valued data based on adaptive quadratic distances. IEEE Trans Syst Man Cybernet A: Syst Humans 39(6): 1295–1306

    Article  Google Scholar 

  23. De Carvalho FAT, Csernel M, Lechevallier Y (2009) Clustering constrained symbolic data. Pattern Recognit Lett 30(11): 1037–1045

    Article  Google Scholar 

  24. De Souza RMCR, De Carvalho FAT (2004) Clustering of interval data based on city-block distances. Pattern Recognit Lett 25(3): 353–365

    Article  Google Scholar 

  25. Diday E, Noirhomme-Fraiture M (2008) Symbolic data analysis and the sodas software. Wiley, Chichester

    MATH  Google Scholar 

  26. El-Sonbaty Y, Ismail MA (1998) On-line hierarchical clustering. Pattern Recognit Lett 19(14): 1285–1291

    Article  MATH  Google Scholar 

  27. El-Sonbaty Y, Ismail MA (1998) Fuzzy clustering for symbolic data. IEEE Trans Fuzzy Syst 6(2): 195–204

    Article  Google Scholar 

  28. Errico JH, Sezan I (2006) Presence based collaborative recommender for networked audiovisual displays. In: Proceedings of the 11th international conference on intelligent user interfaces (IUI ’06), Sydney, Australia, January 2006, pp 297–299

  29. Formica A, Pourabbas E (2008) Content-based similarity of geographic classes organized as partition hierarchies. Knowl Inf Syst 20(2): 221–241

    Article  Google Scholar 

  30. Gori M, Pucci A (2006) A random walk based scoring algorithm with application to recommender systems for large scale E-commerce. In: Proceedings of the 12th ACM SIGKDD international conference on knowledge discovery and data mining, Philadelphia, USA, August 2006, pp 127–146

  31. Gowda KC, Diday E (1991) Symbolic clustering using a new dissimilarity measure. Pattern Recognit 24(6): 567–578

    Article  Google Scholar 

  32. Gowda KC, Ravi TR (1995) Divisive clustering of symbolic objects using the concepts of both similarity and dissimilarity. Pattern Recognit 28(8): 1277–1282

    Article  Google Scholar 

  33. Groenen PJF, Winsberg S, Rodrigues O, Diday E (2000) I-Scal: multidimensional scaling of interval dissimilarities. Comput Stat Data Anal 51(1): 360–378

    Article  Google Scholar 

  34. Guo Y, Deng G (2006) An improved personalized collaborative filtering algorithm in E-commerce recommender system. In: Proceedings of the international conference on service systems and service management, Troyes, France, October 2006, pp 1582–1586

  35. Guru DS, Kiranagi BB, Nagabhushan P (2004) Multivalued type proximity measure and concept of mutual similarity value useful for clustering symbolic patterns. Pattern Recognit Lett 25(10): 1203–1213

    Article  Google Scholar 

  36. Herlocker JL, Konstan JA, Terveen LG, Riedl JT (2004) Evaluating collaborative filtering recommender systems. ACM Trans Inf Syst 22(1): 5–53

    Article  Google Scholar 

  37. Huang Z, Zeng D, Chen H (2007) Collaborative-filtering recommendation algorithms for E-commerce. IEEE Intell Syst 22(5): 68–78

    Article  Google Scholar 

  38. Ichino M, Yaguchi H (1994) Generalized Minkowsky metrics for mixed feature type data analysis. IEEE Trans Syst Man Cybernet 24(4): 698–708

    Article  MathSciNet  Google Scholar 

  39. Jiao Y, Cao G (2007) A collaborative tagging system for personalized recommendation in B2C electronic commerce. In: Proceedings of the international conference on wireless communications, networking and mobile computing, Shanghai, China, September 2007, pp 3609–3612

  40. Lauro NC, Palumbo F (1997) Principal component analysis of interval data: a symbolic data analysis approach. Comput Stat 15(1): 73–87

    Article  Google Scholar 

  41. Tamine-Lechani L, Boughanem M, Daoud M (2009) Evaluation of contextual information retrieval effectiveness: overview of issues and research. Knowl Inf Syst. doi:10.1007/s10115-009-0231-1

  42. Ludwig B, Mandl S, Mammen SV (2006) What on tonight: user-centered and situation-aware proposals for TV programmes. In: Proceedings of the 11th international conference on intelligent user interfaces (IUI ’06), Sidney, Australia, January 2006, pp 258–260

  43. Leung CW, Chan SC, Chung F (2006) A collaborative filtering framework based on fuzzy association rules and multiple-level similarity. Knowl Inf Syst 10(3): 357–381

    Article  Google Scholar 

  44. Lima Neto EA, De Carvalho FAT (2008) Centre and range method for fitting a linear regression model to symbolic interval data. Comput Stat Data Anal 52(3): 1500–1515

    Article  MATH  MathSciNet  Google Scholar 

  45. Lima Neto EA, De Carvalho FAT (2010) Constrained linear regression models for symbolic interval-valued variables. Comput Stat Data Anal 54(2): 333–347

    Article  MATH  Google Scholar 

  46. Maia ALS, De Carvalho FAT, Ludermir TB (2008) Forecasting models for interval-valued time series. Neurocomputing 71(16–18): 3344–3352

    Article  Google Scholar 

  47. Mali K, Mitra S (2003) Clustering and its validation in a symbolic framework. Pattern Recognit Lett 24(14): 2367–2376

    Article  MATH  Google Scholar 

  48. Palumbo F, Verde R (2000) Non-symmetrical factorial discriminant analysis for symbolic objects. Appl Stoch Models Bus Ind 15(4): 419–427

    Article  Google Scholar 

  49. Prudencio RBC, Ludermir TB, De Carvalho FAT (2004) A modal symbolic classifier for selecting time series models. Pattern Recognit Lett 25(4): 911–921

    Article  Google Scholar 

  50. Ralph P, Parsons J (2006) A framework for automatic online personalization. In: Proceedings of the 39th Hawaii international conference on system sciences, Kaloa, Kauai, Hawaii, pp 137b–137b

  51. Ricci F, Werthner H (2006) Introduction to the special issue: recommender systems. Int J Electron Commerce 11(2): 5–9

    Article  Google Scholar 

  52. Salter J, Antonopoulos N (2006) Cinema Screen recommender agent: combining collaborative and content-based filtering. IEEE Intell Syst 21(1): 35–41

    Article  Google Scholar 

  53. Schafer JB, Konstan JA, Riedl J (2001) E-commerce recommendation applications. Data Min Knowl Discov 5(1–2): 115–153

    Article  MATH  Google Scholar 

  54. Schweizer B (1984) Distributions are the numbers of the future. In: Nola A, Ventre D (eds) Proceedings of the mathematics of fuzzy systems meeting. University of Naples, Naples, pp 137–149

    Google Scholar 

  55. Shyu ML, Haruechaiyasak C, Chen SC (2006) Mining user access patterns with traversal constraint for predicting web page requests. Knowl Inf Syst 10(4): 515–528

    Article  Google Scholar 

  56. Tamine-Lechani L, Boughanem M, Zemirli Wn (2008) Personalized document ranking: exploiting evidence from multiple user interests for profiling and retrieval. J Digit Inform Manag 6(5): 354–365

    Google Scholar 

  57. Tan AH, Ong HL, Pan H, Ng J, Li QX (2004) Towards personalised web intelligence. Knowl Inf Syst 6(5): 595–616

    Article  Google Scholar 

  58. Vucetic S, Obradovic Z (2005) Collaborative filtering using a regression-based approach. Knowl Inf Syst 7(1): 1–22

    Article  Google Scholar 

  59. Webster A, Vassileva J (2007) Push-Poll recommender system: supporting word of mouth. In: Conati C, McCoy KF, Paliouras G (eds) Proceedings of the 11th international conference UM 2007, user modeling 2007, Corfu, Greece, June 2007. Lecture notes in artificial intelligence, vol. 4511. Springer, Berlin, pp 278–287

  60. Webster A, Vassileva J (2007) The keepup recommender system. In: Proceedings of the 2007 ACM conference on recommender systems, Minneapolis, USA, October 2007, pp 173–176

  61. Wei K, Huang J, Fu S (2007) A survey of E-commerce recommender systems. In: Proceedings of the 2007 international conference on service systems and service management, Chengdu, China, June 2007, pp 1–5

  62. Weng LT, Xu Y, Li Y, Nayak R (2006) A fair peer selection algorithm for an E-commerce oriented distributed recommender system. In: Li Y et al (eds) Advances in intelligent IT: active media technology 2006. IOS Press, Amsterdam

    Google Scholar 

  63. Yu K, Xu X, Ester M, Kriege HP (2003) Feature weighting and instance selection for collaborative filtering: an information-theoretic approach. Knowl Inf Syst 5(2): 201–224

    Article  Google Scholar 

  64. Zanker M, Jessenitschnig M, Jannach D, Gordea S (2007) Recommendation strategies in a commercial context. IEEE Intell Syst 22(3): 69–73

    Article  Google Scholar 

  65. Zhanga Y, Jiao J (2007) An associative classification-based recommendation system for personalization in B2C e-commerce applications. Expert Syst Appl 33(2): 357–367

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Francisco de Assis Tenorio de Carvalho.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Leite Dantas Bezerra, B., Tenorio de Carvalho, F.d. Symbolic data analysis tools for recommendation systems. Knowl Inf Syst 26, 385–418 (2011). https://doi.org/10.1007/s10115-009-0282-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-009-0282-3

Keywords

Navigation