Abstract
Social media has become a very popular place for users seeking knowledge about a wide variety of topics. While it contains many helpful documents, it also contains many useless and malicious documents or spams. For a casual observer it is very hard to identify high quality or trustworthy documents. As the volume of such data increases, the task for identifying the trustworthy documents becomes more and more difficult. A huge number of research works have focused on quantifying trust in certain specific social network domains. Some have quantified trust based on social graph. In this work, we use such social graph named Reduced node Social Graph with Relationships (RSGR) and we develop a three-step syntax and semantic based trust mining framework. Here we generalize the concept of trust mining for all structured as well as unstructured unsupervised text documents from all social network domains. We calculate trust based on metadata, trust based on relationships with other documents and finally we propagate the trust calculated so far along various relationship edges to calculate the final trust. Finally we show that our method calculates the trust of social media text documents with more than 80% accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Golbeck, J., Parsia, B., Hendler, J.: Trust networks on the semantic web. In: Klusch, M., Omicini, A., Ossowski, S., Laamanen, H. (eds.) CIA 2003. LNCS (LNAI), vol. 2782, pp. 238–249. Springer, Heidelberg (2003)
Guha, R., Kumar, R., Raghavan, P., Tomkins, A.: Propagation of trust and distrust. In: Proceedings of the 13th International Conference on World Wide Web, NY, USA, 2004, pp. 403–412
Bizer, C., Oldakowski, R.: Using context- and content-based trust policies on the semantic web. In: Proceedings of the 13th International Conference on World Wide Web, NY, USA, pp. 228–229 (2004)
Agichtein, E., Castillo, C., Donato, D., Gionis, A., Mishne, G.: Finding high-quality content in social media. In: Proceedings of the International Conference on Web Search and Web Mining, pp. 183–194 (2008)
Blumenstock, J.: Size matters, word count as a measure of quality on Wikipedia. In: Proceedings of the 17th International Conference on World Wide Web, pp. 1095–1096. ACM, New York (2008)
Zolfaghar, K., Aghaie, A.: Mining trust and distrust relationships in social web applications. In: Proceedings of the 2010 IEEE International Conference on Intelligent Computer Communication and Processing, Cluj-Napoca, Romania, August 26-28, 2010, pp. 73–80 (2010)
McGuinness, D., Zeng, H., Da Silva, P., Ding, L., Narayanan, D., Bhaowal, M: Investigations into trust for collaborative information repositories: a Wikipedia case study. In: Proceedings of the Workshop on Models of Trust for the Web 2006, pp. 3–131 (2006)
Nguyen, V.A., Lim, E.P., Jiang, J., Sun, A.: To trust or not to trust? Predicting online trusts using trust antecedent framework. In: Proceedings of the 9th IEEE International Conference on Data Mining, Miami, Florida, USA, pp. 896–901 (2009)
Chua, F.C.T., Lim, E.P.: Trust network inference for online rating data using generative models. In: Proceedings of the 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, July 25–28, 2010, pp. 889–898 (2010)
Matsuo, Y., Yamamoto, H.: Community gravity: measuring bidirectional effects by trust and rating on online social networks. In: Proceedings of the 18th International Conference on World Wide Web, Madrid, Spain, April 20–24, 2009, pp. 751–760 (2009)
Zhang, Y., Yu, T.: Mining trust relationships from online social networks. Journal of Computer Science and Technology 27(3), 492–505 (2012)
Wu, Z., Palmer, M.: Verb semantics and lexical selection. In: Proceedings of the 32nd Annual Meeting of the Associations for Computational Linguistics, pp. 133–138 (1994)
Moturu, S.T., Liu, H.: Evaluating the trustworthiness of Wikipedia articles through quality and credibility. In: Proceedings of the 5th International Symposium on Wikis and Open Collaboration, NY (2009)
Sharma, T., Toshniwal, D.: A generalized relationship mining method for social media text data. In: Perner, P. (ed.) MLDM 2014. LNCS, vol. 8556, pp. 376–392. Springer, Heidelberg (2014)
Deza, M.M., Deza, E.: Encyclopedia of Distances. Springer, Heidelberg (2009)
Porter, M.F.: An algorithm for suffix stripping. Program: Electronic Library and Information Systems 14(3), 130–137 (1980)
Zhang, J., Votava, P., Lee, T.J., Adhikarla, S., Kulkumjon, I.C., Schlau, M., Natesan, D., Nemani, R.: A technique of analyzing trust relationships to facilitate scientific service discovery and recommendation. In: 2013 IEEE International Conference on Services Computing (SCC), pp. 57–64. IEEE (2013)
Biryukov, M.: Co-author network analysis in DBLP: classifying personal names. In: Thi, H.A.L., Bouvry, P., Dinh, T.P. (eds.) Modelling, Computation and Optimization in Information Systems and Management Sciences. CCIS, vol. 14, pp. 399–408. Springer, Heidelberg (2008)
Huang, T.-H., Huang, M.L.: Analysis and visualization of co-authorship networks for understanding academic collaboration and knowledge domain of individual researchers. In: 2006 International Conference on Computer Graphics, Imaging and Visualisation, pp. 18–23. IEEE (2006)
Biryukov, M., Dong, C.: Analysis of computer science communities based on DBLP. In: Lalmas, M., Jose, J., Rauber, A., Sebastiani, F., Frommholz, I. (eds.) ECDL 2010. LNCS, vol. 6273, pp. 228–235. Springer, Heidelberg (2010)
Reitz, F., Hoffmann, O.: An analysis of the evolving coverage of computer science sub-fields in the DBLP digital library. In: Lalmas, M., Jose, J., Rauber, A., Sebastiani, F., Frommholz, I. (eds.) ECDL 2010. LNCS, vol. 6273, pp. 216–227. Springer, Heidelberg (2010)
Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing, vol. 10, pp. 79–86. Association for Computational Linguistics (2002)
Zeng, Y., Yao, Y., Zhong, N.: DBLP-SSE: a DBLP search support engine. In: IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technologies, WI-IAT 2009, vol. 1, pp. 626–630. IET (2009)
Sun, Y., Han, J., Zhao, P., Yin, Z., Cheng, H., Wu, T.: Rankclus: integrating clustering with ranking for heterogeneous information network analysis. In: Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, pp. 565–576. ACM (2009)
Taboada, M., Brooke, J., Tofiloski, M., Voll, K., Stede, M.: Lexicon-based methods for sentiment analysis. Computational Linguistics 37(2), 267–307 (2011)
Lawrence, S.: Free online availability substantially increases a paper’s impact. Nature 411(6837), 521–521 (2001)
Zhang, X., Cui, L., Wang, Y.: Commtrust: computing multi-dimensional trust by mining e-commerce feedback comments. IEEE Transactions on Knowledge and Data Engineering 26(7), 1631–1643 (2014)
Emayakumaari, T., Ananthi, G.: Mining E-commerce feedback comments for trust evaluation. In: 2015 IEEE International Conference on Engineering and Technology (ICETECH), pp. 1–5. IEEE (2015)
Yean, C.J., Yee, T.C., Tan, I.K.T.: Relative trust management model for Twitter: an analytic hierarchy process approach. In: International Conference on Frontiers of Communications, Networks and Applications (ICFCNA 2014-Malaysia), pp. 1–6. IET (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Sharma, T., Toshniwal, D. (2016). A Generalized Framework for Quantifying Trust of Social Media Text Documents. In: Perner, P. (eds) Machine Learning and Data Mining in Pattern Recognition. MLDM 2016. Lecture Notes in Computer Science(), vol 9729. Springer, Cham. https://doi.org/10.1007/978-3-319-41920-6_53
Download citation
DOI: https://doi.org/10.1007/978-3-319-41920-6_53
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-41919-0
Online ISBN: 978-3-319-41920-6
eBook Packages: Computer ScienceComputer Science (R0)