Skip to main content

A Generalized Framework for Quantifying Trust of Social Media Text Documents

  • Conference paper
  • First Online:
Machine Learning and Data Mining in Pattern Recognition (MLDM 2016)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9729))

Abstract

Social media has become a very popular place for users seeking knowledge about a wide variety of topics. While it contains many helpful documents, it also contains many useless and malicious documents or spams. For a casual observer it is very hard to identify high quality or trustworthy documents. As the volume of such data increases, the task for identifying the trustworthy documents becomes more and more difficult. A huge number of research works have focused on quantifying trust in certain specific social network domains. Some have quantified trust based on social graph. In this work, we use such social graph named Reduced node Social Graph with Relationships (RSGR) and we develop a three-step syntax and semantic based trust mining framework. Here we generalize the concept of trust mining for all structured as well as unstructured unsupervised text documents from all social network domains. We calculate trust based on metadata, trust based on relationships with other documents and finally we propagate the trust calculated so far along various relationship edges to calculate the final trust. Finally we show that our method calculates the trust of social media text documents with more than 80% accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Golbeck, J., Parsia, B., Hendler, J.: Trust networks on the semantic web. In: Klusch, M., Omicini, A., Ossowski, S., Laamanen, H. (eds.) CIA 2003. LNCS (LNAI), vol. 2782, pp. 238–249. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  2. Guha, R., Kumar, R., Raghavan, P., Tomkins, A.: Propagation of trust and distrust. In: Proceedings of the 13th International Conference on World Wide Web, NY, USA, 2004, pp. 403–412

    Google Scholar 

  3. Bizer, C., Oldakowski, R.: Using context- and content-based trust policies on the semantic web. In: Proceedings of the 13th International Conference on World Wide Web, NY, USA, pp. 228–229 (2004)

    Google Scholar 

  4. Agichtein, E., Castillo, C., Donato, D., Gionis, A., Mishne, G.: Finding high-quality content in social media. In: Proceedings of the International Conference on Web Search and Web Mining, pp. 183–194 (2008)

    Google Scholar 

  5. Blumenstock, J.: Size matters, word count as a measure of quality on Wikipedia. In: Proceedings of the 17th International Conference on World Wide Web, pp. 1095–1096. ACM, New York (2008)

    Google Scholar 

  6. Zolfaghar, K., Aghaie, A.: Mining trust and distrust relationships in social web applications. In: Proceedings of the 2010 IEEE International Conference on Intelligent Computer Communication and Processing, Cluj-Napoca, Romania, August 26-28, 2010, pp. 73–80 (2010)

    Google Scholar 

  7. McGuinness, D., Zeng, H., Da Silva, P., Ding, L., Narayanan, D., Bhaowal, M: Investigations into trust for collaborative information repositories: a Wikipedia case study. In: Proceedings of the Workshop on Models of Trust for the Web 2006, pp. 3–131 (2006)

    Google Scholar 

  8. Nguyen, V.A., Lim, E.P., Jiang, J., Sun, A.: To trust or not to trust? Predicting online trusts using trust antecedent framework. In: Proceedings of the 9th IEEE International Conference on Data Mining, Miami, Florida, USA, pp. 896–901 (2009)

    Google Scholar 

  9. Chua, F.C.T., Lim, E.P.: Trust network inference for online rating data using generative models. In: Proceedings of the 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, July 25–28, 2010, pp. 889–898 (2010)

    Google Scholar 

  10. Matsuo, Y., Yamamoto, H.: Community gravity: measuring bidirectional effects by trust and rating on online social networks. In: Proceedings of the 18th International Conference on World Wide Web, Madrid, Spain, April 20–24, 2009, pp. 751–760 (2009)

    Google Scholar 

  11. Zhang, Y., Yu, T.: Mining trust relationships from online social networks. Journal of Computer Science and Technology 27(3), 492–505 (2012)

    Article  Google Scholar 

  12. Wu, Z., Palmer, M.: Verb semantics and lexical selection. In: Proceedings of the 32nd Annual Meeting of the Associations for Computational Linguistics, pp. 133–138 (1994)

    Google Scholar 

  13. Moturu, S.T., Liu, H.: Evaluating the trustworthiness of Wikipedia articles through quality and credibility. In: Proceedings of the 5th International Symposium on Wikis and Open Collaboration, NY (2009)

    Google Scholar 

  14. Sharma, T., Toshniwal, D.: A generalized relationship mining method for social media text data. In: Perner, P. (ed.) MLDM 2014. LNCS, vol. 8556, pp. 376–392. Springer, Heidelberg (2014)

    Google Scholar 

  15. Deza, M.M., Deza, E.: Encyclopedia of Distances. Springer, Heidelberg (2009)

    Book  MATH  Google Scholar 

  16. Porter, M.F.: An algorithm for suffix stripping. Program: Electronic Library and Information Systems 14(3), 130–137 (1980)

    Article  Google Scholar 

  17. Zhang, J., Votava, P., Lee, T.J., Adhikarla, S., Kulkumjon, I.C., Schlau, M., Natesan, D., Nemani, R.: A technique of analyzing trust relationships to facilitate scientific service discovery and recommendation. In: 2013 IEEE International Conference on Services Computing (SCC), pp. 57–64. IEEE (2013)

    Google Scholar 

  18. Biryukov, M.: Co-author network analysis in DBLP: classifying personal names. In: Thi, H.A.L., Bouvry, P., Dinh, T.P. (eds.) Modelling, Computation and Optimization in Information Systems and Management Sciences. CCIS, vol. 14, pp. 399–408. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  19. Huang, T.-H., Huang, M.L.: Analysis and visualization of co-authorship networks for understanding academic collaboration and knowledge domain of individual researchers. In: 2006 International Conference on Computer Graphics, Imaging and Visualisation, pp. 18–23. IEEE (2006)

    Google Scholar 

  20. Biryukov, M., Dong, C.: Analysis of computer science communities based on DBLP. In: Lalmas, M., Jose, J., Rauber, A., Sebastiani, F., Frommholz, I. (eds.) ECDL 2010. LNCS, vol. 6273, pp. 228–235. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  21. Reitz, F., Hoffmann, O.: An analysis of the evolving coverage of computer science sub-fields in the DBLP digital library. In: Lalmas, M., Jose, J., Rauber, A., Sebastiani, F., Frommholz, I. (eds.) ECDL 2010. LNCS, vol. 6273, pp. 216–227. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  22. Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing, vol. 10, pp. 79–86. Association for Computational Linguistics (2002)

    Google Scholar 

  23. Zeng, Y., Yao, Y., Zhong, N.: DBLP-SSE: a DBLP search support engine. In: IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technologies, WI-IAT 2009, vol. 1, pp. 626–630. IET (2009)

    Google Scholar 

  24. Sun, Y., Han, J., Zhao, P., Yin, Z., Cheng, H., Wu, T.: Rankclus: integrating clustering with ranking for heterogeneous information network analysis. In: Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, pp. 565–576. ACM (2009)

    Google Scholar 

  25. Taboada, M., Brooke, J., Tofiloski, M., Voll, K., Stede, M.: Lexicon-based methods for sentiment analysis. Computational Linguistics 37(2), 267–307 (2011)

    Article  Google Scholar 

  26. Lawrence, S.: Free online availability substantially increases a paper’s impact. Nature 411(6837), 521–521 (2001)

    Article  Google Scholar 

  27. Zhang, X., Cui, L., Wang, Y.: Commtrust: computing multi-dimensional trust by mining e-commerce feedback comments. IEEE Transactions on Knowledge and Data Engineering 26(7), 1631–1643 (2014)

    Article  Google Scholar 

  28. Emayakumaari, T., Ananthi, G.: Mining E-commerce feedback comments for trust evaluation. In: 2015 IEEE International Conference on Engineering and Technology (ICETECH), pp. 1–5. IEEE (2015)

    Google Scholar 

  29. Yean, C.J., Yee, T.C., Tan, I.K.T.: Relative trust management model for Twitter: an analytic hierarchy process approach. In: International Conference on Frontiers of Communications, Networks and Applications (ICFCNA 2014-Malaysia), pp. 1–6. IET (2014)

    Google Scholar 

  30. https://developers.facebook.com/docs/reference/apis/

  31. https://dev.twitter.com/docs/streaming-apis

  32. https://www.quantcast.com/top-sites

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tuhin Sharma .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Sharma, T., Toshniwal, D. (2016). A Generalized Framework for Quantifying Trust of Social Media Text Documents. In: Perner, P. (eds) Machine Learning and Data Mining in Pattern Recognition. MLDM 2016. Lecture Notes in Computer Science(), vol 9729. Springer, Cham. https://doi.org/10.1007/978-3-319-41920-6_53

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-41920-6_53

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-41919-0

  • Online ISBN: 978-3-319-41920-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics