skip to main content
research-article

SESAME: Mining User Digital Footprints for Fine-Grained Preference-Aware Social Media Search

Published: 17 December 2014 Publication History

Abstract

With the recent popularity of social network services, a significant volume of heterogeneous social media data is generated by users, in the form of texts, photos, videos and collections of points of interest, etc. Such social media data provides users with rich resources for exploring content, such as looking for an interesting video or a favorite point of interest. However, the rapid growth of social media causes difficulties for users to efficiently retrieve their desired media items. Fortunately, users' digital footprints on social networks such as comments massively reflect individual's fine-grained preference on media items, that is, preference on different aspects of the media content, which can then be used for personalized social media search. In this article, we propose SESAME, a fine-grained preference-aware social media search framework leveraging user digital footprints on social networks. First, we collect users' direct feedback on media content from their social networks. Second, we extract users' sentiment about the media content and the associated keywords from their comments to characterize their fine-grained preference. Third, we propose a parallel multituple based ranking tensor factorization algorithm to perform the personalized media item ranking by incorporating two unique features, viz., integrating an enhanced bootstrap sampling method by considering user activeness and adopting stochastic gradient descent parallelization techniques. We experimentally evaluate the SESAME framework using two datasets collected from Foursquare and YouTube, respectively. The results show that SESAME can subtly capture user preference on social media items and consistently outperform baseline approaches by achieving better personalized ranking quality.

References

[1]
Peter Anick. 2003. Using terminological feedback for web search refinement: A log-based study. In Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'03). ACM, 88--95.
[2]
Stefano Baccianella, Andrea Esuli, and Fabrizio Sebastiani. 2010. SentiWordNet 3.0: An enhanced lexical resource for sentiment analysis and opinion mining. In Proceedings of the Annual Conference on Language Resources and Evaluation (LREC'10). 2200--2204.
[3]
Léon Bottou. 2010. Large-scale machine learning with stochastic gradient descent. In Proceedings in Computational Statistics. Springer, 177--186.
[4]
Mohamed Reda Bouadjenek, Hakim Hacid, and Mokrane Bouzeghoub. 2013. Sopra: A new social personalized ranking function for improving web search. In Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'13). ACM, 861--864.
[5]
Yi Cai and Qing Li. 2010. Personalized search by tag-based user profile and resource profile in collaborative tagging systems. In Proceedings of the International Conference on Information and Knowledge Management (CIKM'10). ACM, 969--978.
[6]
Paul-Alexandru Chirita, Claudiu S Firan, and Wolfgang Nejdl. 2007. Personalized query expansion for the web. In Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'07). ACM, 7--14.
[7]
Lieven De Lathauwer, Bart De Moor, and Joos Vandewalle. 2000. A multilinear singular value decomposition. SIAM J. Matrix Anal. Appl. 21, 4, 1253--1278.
[8]
Li Ding, Tim Finin, Anupam Joshi, Rong Pan, R Scott Cost, Yun Peng, Pavan Reddivari, Vishal Doshi, and Joel Sachs. 2004. Swoogle: A search and metadata engine for the semantic web. In Proceedings of the International Conference on Information and Knowledge Management (CIKM'04). ACM, 652--659.
[9]
Magdalini Eirinaki and Michalis Vazirgiannis. 2003. Web mining for web personalization. ACM Trans. Internet Technol. 3, 1, 1--27.
[10]
Rainer Gemulla, Erik Nijkamp, Peter J. Haas, and Yannis Sismanis. 2011. Large-scale matrix factorization with distributed stochastic gradient descent. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'11). ACM, 69--77.
[11]
Frank Allan Hansen, Niels Olof Bouvin, Bent G. Christensen, Kaj Grønbæk, Torben Bach Pedersen, and Jevgenij Gagach. 2004. Integrating the web and the world: Contextual trails on the move. In Proceedings of the ACM Hypertext Conference (HT'04). ACM, 98--107.
[12]
Jonathan L. Herlocker, Joseph A. Konstan, Loren G. Terveen, and John T. Riedl. 2004. Evaluating collaborative filtering recommender systems. ACM Trans. Inf. Syst. 22, 1, 5--53.
[13]
Anh Huy Phan and Andrzej Cichocki. 2011. PARAFAC algorithms for large-scale problems. Neurocomputing 74, 11, 1970--1984.
[14]
Mayu Iwata, Takahiro Hara, Kentaro Shimatani, Tomohiro Mashita, Kiyoshi Kiyokawa, Shojiro Nishio, and Haruo Takemura. 2011. A location-based content search system considering situations of mobile users. Procedia Computer Science 5, 426--433.
[15]
Heung-Nam Kim, Majdi Rawashdeh, Abdullah Alghamdi, and Abdulmotaleb El Saddik. 2012. Folksonomy-based personalized search and ranking in social media services. Inf. Syst. 37, 1, 61--76.
[16]
Yehuda Koren. 2010. Collaborative filtering with temporal dynamics. Commun. ACM 53, 4, 89--97.
[17]
Dariusz Król, Michal Szymanski, and Bogdan Trawinski. 2006. The recommendation mechanism in an internet information system with time impact coefficient. Int. J. Comp. Sci. Appl. 3, 3, 65--80.
[18]
Nicholas D. Lane, Dimitrios Lymberopoulos, Feng Zhao, and Andrew T. Campbell. 2010. Hapori: context-based local search for mobile phones using community behavioral modeling and similarity. In Proceedings of the ACM International Conference on Ubiquitous Computing (UbiComp'10). ACM, 109--118.
[19]
Fangtao Li, Sinno Jialin Pan, Ou Jin, Qiang Yang, and Xiaoyan Zhu. 2012. Cross-domain co-extraction of sentiment and topic lexicons. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL'12). Association for Computational Linguistics, 410--419.
[20]
Edward Loper and Steven Bird. 2002. NLTK: The natural language toolkit. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL'02) Workshop on Effective Tools and Methodologies for Teaching Natural Language Processing And Computational Linguistics. Association for Computational Linguistics, 63--70.
[21]
Takuya Maekawa, Yutaka Yanagisawa, Yasushi Sakurai, Yasue Kishino, Koji Kamei, and Takeshi Okadome. 2012. Context-aware web search in ubiquitous sensor environments. ACM Trans. Internet Technol. 11, 3, 12.
[22]
Wilfred Ng, Lin Deng, and Dik Lun Lee. 2007. Mining user preference using spy voting for search engine personalization. ACM Trans. Internet Technol. 7, 4, 19.
[23]
Anastasios Noulas, Salvatore Scellato, Cecilia Mascolo, and Massimiliano Pontil. 2011. An empirical study of geographic user activity patterns in foursquare. In Proceedings of the International Conference on Weblogs and Social Media (ICWSM'11). 70--573.
[24]
Derek O'Callaghan, Martin Harrigan, Joe Carthy, and &Pacute;adraig Cunningham. 2012. Network analysis of recurring YouTube spam campaigns. In Proceedings of the International Conference on Weblogs and Social Media (ICWSM'12).
[25]
Steffen Rendle, Leandro Balby Marinho, Alexandros Nanopoulos, and Lars Schmidt-Thieme. 2009a. Learning optimal ranking with tensor factorization for tag recommendation. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'09). ACM, 727--736.
[26]
Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt-Thieme. 2009b. BPR: Bayesian personalized ranking from implicit feedback. In Proceedings of the Conference on Uncertainty in Artificial Intelligence (UAI'09). AUAI Press, 452--461.
[27]
Steffen Rendle and Lars Schmidt-Thieme. 2010. Pairwise interaction tensor factorization for personalized tag recommendation. In Proceedings of the International Conference on Web Search and Data Mining (WSDM'10). ACM, 81--90.
[28]
Jitao Sang, Changsheng Xu, and Dongyuan Lu. 2012. Learn to personalized image search from the photo sharing websites. IEEE Trans. Multimedia 14, 4, 963--974.
[29]
Pravin Shankar, Yun-Wu Huang, Paul Castro, Badri Nath, and Liviu Iftode. 2012. Crowds replace experts: Building better location-based services using mobile social network interactions. In Proceedings of the IEEE International Conference on Pervasive Computing and Communication (PerCom'12). IEEE, 20--29.
[30]
Shabnam Shariaty and Samaneh Moghaddam. 2011. Fine-grained opinion mining using conditional random fields. In Proceedings of the International Conference on Data Mining Workshops (ICDMW'11). IEEE, 109--114.
[31]
Yue Shi, Alexandros Karatzoglou, Linas Baltrunas, Martha Larson, Alan Hanjalic, and Nuria Oliver. 2012. TFMAP: Optimizing MAP for top-n context-aware recommendation. In Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'12). ACM, 155--164.
[32]
Nakatani Shuyo. 2010. Language Detection Library for Java. (2010). http://code.google.com/p/language-detection/.
[33]
Lucia Specia and Enrico Motta. 2007. Integrating folksonomies with the semantic web. In Proceedings of the European Semantic Web Conference (ESWC'07). Springer, 624--639.
[34]
Kazunari Sugiyama, Kenji Hatano, and Masatoshi Yoshikawa. 2004. Adaptive web search based on user profile constructed without any effort from users. In Proceedings of the International World Wide Web Conference (WWW'04). ACM, 675--684.
[35]
Jian-Tao Sun, Hua-Jun Zeng, Huan Liu, Yuchang Lu, and Zheng Chen. 2005. CubeSVD: A novel approach to personalized Web search. In Proceedings of the International World Wide Web Conference (WWW'05). ACM, 382--390.
[36]
Ledyard R. Tucker. 1966. Some mathematical notes on three-mode factor analysis. Psychometrika 31, 3, 279--311.
[37]
Shengliang Xu, Shenghua Bao, Ben Fei, Zhong Su, and Yong Yu. 2008. Exploring folksonomy for personalized search. In Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'08). ACM, 155--162.
[38]
Dingqi Yang, Daqing Zhang, Zhiyong Yu, and Zhu Wang. 2013a. A sentiment-enhanced personalized location recommendation system. In Proceedings of the ACM Hypertext Conference (HT'13). 119--128.
[39]
Dingqi Yang, Daqing Zhang, Zhiyong Yu, and Zhiwen Yu. 2013b. Fine-grained preference-aware location search leveraging crowdsourced digital footprints from LBSNs. In Proceedings of the ACM International Conference on Ubiquitous Computing (UbiComp'13). ACM, 479--488.
[40]
D. Yang, D. Zhang, V. W. Zheng, and Z. Yu. 2014. Modeling user activity preference by leveraging user spatial temporal characteristics in LBSNs. IEEE Trans. Syst. Man Cybern: Syst. org/10.1109/TSMC.2014.2327053
[41]
Shuang-Hong Yang, Bo Long, Alex Smola, Narayanan Sadagopan, Zhaohui Zheng, and Hongyuan Zha. 2011. Like like alike: joint friendship and interest propagation in social networks. In Proceedings of the International World Wide Web Conference (WWW'11). ACM, 537--546.
[42]
Daqing Zhang, Bin Guo, and Zhiwen Yu. 2011. The emergence of social and community intelligence. Computer 44, 7, 21--28.
[43]
Qiang Zhang, Michael W. Berry, Brian T. Lamb, and Tabitha Samuel. 2009. A parallel nonnegative tensor factorization algorithm for mining global climate data. In Proceedings of the International Conference on Computational Science (ICCS'09). Springer, 405--415.
[44]
Dong Zhou, Séamus Lawless, and Vincent Wade. 2012. Web search personalization using social data. In Theory and Practice of Digital Libraries, Springer, 298--310.
[45]
Yong Zhuang, Wei-Sheng Chin, Yu-Chin Juan, and Chih-Jen Lin. 2013. A fast parallel SGD for matrix factorization in shared memory systems. In Proceedings of the ACM Conference on Recommender Systems (RecSys'13). ACM, 249--256.
[46]
Martin Zinkevich, Markus Weimer, Lihong Li, and Alex J Smola. 2010. Parallelized stochastic gradient descent. In Proceedings of the Conference on Advances in Neural Information Processing Systems (NIPS'10). 2595--2603.

Cited By

View all
  • (2023)A Proposed Service Quality Measurement Model using Sentiment Analysis and Text Mining: The Case of Water and Sewerage ServicesDuygu Analizi ve Metin Madenciliği ile Hizmet Kalitesi Ölçüm Modeli Önerisi: Su ve Kanalizasyon Hizmetleri ÖrneğiOPUS Toplum Araştırmaları Dergisi10.26466/opusjsr.127202020:54(473-487)Online publication date: 31-Jul-2023
  • (2023)Generative Models for Synthetic Urban Mobility Data: A Systematic Literature ReviewACM Computing Surveys10.1145/361022456:4(1-37)Online publication date: 10-Nov-2023
  • (2023)Efficient Point-of-Interest Recommendation Services With Heterogenous Hypergraph EmbeddingIEEE Transactions on Services Computing10.1109/TSC.2022.318703816:2(1132-1143)Online publication date: 1-Mar-2023
  • Show More Cited By

Index Terms

  1. SESAME: Mining User Digital Footprints for Fine-Grained Preference-Aware Social Media Search

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Transactions on Internet Technology
      ACM Transactions on Internet Technology  Volume 14, Issue 4
      Special Issue on Foundations of Social Computing
      December 2014
      143 pages
      ISSN:1533-5399
      EISSN:1557-6051
      DOI:10.1145/2699996
      • Editor:
      • Munindar P. Singh
      Issue’s Table of Contents
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 17 December 2014
      Accepted: 01 June 2014
      Revised: 01 March 2014
      Received: 01 October 2013
      Published in TOIT Volume 14, Issue 4

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Personalized search
      2. fine-grained user preference
      3. parallelization
      4. sentiment analysis
      5. social media
      6. tensor factorization

      Qualifiers

      • Research-article
      • Research
      • Refereed

      Funding Sources

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)11
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 28 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)A Proposed Service Quality Measurement Model using Sentiment Analysis and Text Mining: The Case of Water and Sewerage ServicesDuygu Analizi ve Metin Madenciliği ile Hizmet Kalitesi Ölçüm Modeli Önerisi: Su ve Kanalizasyon Hizmetleri ÖrneğiOPUS Toplum Araştırmaları Dergisi10.26466/opusjsr.127202020:54(473-487)Online publication date: 31-Jul-2023
      • (2023)Generative Models for Synthetic Urban Mobility Data: A Systematic Literature ReviewACM Computing Surveys10.1145/361022456:4(1-37)Online publication date: 10-Nov-2023
      • (2023)Efficient Point-of-Interest Recommendation Services With Heterogenous Hypergraph EmbeddingIEEE Transactions on Services Computing10.1109/TSC.2022.318703816:2(1132-1143)Online publication date: 1-Mar-2023
      • (2022)ABLE: Meta-Path Prediction in Heterogeneous Information NetworksACM Transactions on Knowledge Discovery from Data10.1145/349455816:4(1-21)Online publication date: 8-Jan-2022
      • (2022)Analyzing the structure of tourism destination network based on digital footprints: taking Guilin, China as a caseData Technologies and Applications10.1108/DTA-09-2021-024057:1(56-83)Online publication date: 23-May-2022
      • (2022)Analysis of consensus sorting via the cycle metricJournal of Combinatorial Optimization10.1007/s10878-020-00553-944:3(1831-1847)Online publication date: 1-Oct-2022
      • (2022)Mitigating sensitive data exposure with adversarial learning for fairness recommendation systemsNeural Computing and Applications10.1007/s00521-022-07373-434:20(18097-18111)Online publication date: 1-Oct-2022
      • (2020)Identifying the Types of Digital Footprint Data Used to Predict Psychographic and Human BehaviourDigital Libraries at Times of Massive Societal Transition10.1007/978-3-030-64452-9_26(287-296)Online publication date: 26-Nov-2020
      • (2018)A supervised aspect level sentiment model to predict overall sentiment on tweeter documentsInternational Journal of Metadata, Semantics and Ontologies10.5555/3302773.330277713:1(33-41)Online publication date: 1-Jan-2018
      • (2018)Fine-grained Emotion Role Detection Based on Retweet InformationACM Transactions on Internet Technology10.1145/319182019:1(1-23)Online publication date: 16-Oct-2018
      • Show More Cited By

      View Options

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media