Abstract
Social media, and in particular microblogging sites, allow users to post multiple kinds of content for different purposes. Content may be purely conversational, or news-related, or event-related. To find information relevant to users in this heterogeneous mass of content, it would be important to consider the task for which search is carried out, and the most suitable relevance dimensions. In the last years, despite the social search problem has been increasingly investigated, this aspect has not been sufficiently analyzed. For this reason, in this paper, we focus on different search tasks in the microblog search context, and we identify some related relevance dimensions. We also report some experiments we have made to verify the impact of the identified relevance dimensions on the system effectiveness, with respect to the considered search tasks.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
- 4.
In this case, being a related tweet means that the tweet discusses the crisis event.
- 5.
Also for informativeness, logistic regression has been performed by employing the model implemented by the scikit-learn library [36], using the default parameters.
- 6.
- 7.
- 8.
References
Alhadi, A.C., Gottron, T., Kunegis, J., Naveed, N.: LiveTweet: microblog retrieval based on interestingness and an adaptation of the vector space model. In: TREC (2011)
Borlund, P.: The concept of relevance in IR. J. Am. Soc. Inform. Sci. Technol. 54(10), 913–925 (2003)
Buckley, C., Voorhees, E.M.: Retrieval evaluation with incomplete information. In: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 25–32 (2004)
Choi, J., Croft, W.B., Kim, J.Y.: Quality models for microblog retrieval. In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, pp. 1834–1838. ACM (2012)
Cooper, W.S.: On selecting a measure of retrieval effectiveness. J. Am. Soc. Inf. Sci. 24(2), 87–100 (1973)
da Costa Pereira, C., Dragoni, M., Pasi, G.: Multidimensional relevance: prioritized aggregation in a personalized information retrieval setting. Inf. Process. Manag. 48(2), 340–357 (2012)
Craswell, N.: Bpref. In: Liu, L., Ozsu, M.T. (eds.) Encyclopedia of Database Systems, pp. 266–267. Springer, Boston (2009). https://doi.org/10.1007/978-0-387-39940-9_489
De Grandis, M., Pasi, G., Viviani, M.: Fake news detection in microblogging through quantifier-guided aggregation. In: Torra, V., Narukawa, Y., Pasi, G., Viviani, M. (eds.) MDAI 2019. LNCS (LNAI), vol. 11676, pp. 64–76. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-26773-5_6
Duan, Y., Jiang, L., Qin, T., Zhou, M., Shum, H.Y.: An empirical study on learning to rank of tweets. In: Proceedings of the 23rd International Conference on Computational Linguistics, pp. 295–303. Association for Computational Linguistics (2010)
Efron, M.: Information search and retrieval in microblogs. J. Am. Soc. Inform. Sci. Technol. 62(6), 996–1008 (2011)
Fogg, B., Tseng, H.: The elements of computer credibility. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 80–87. ACM (1999)
Ghosh, S., Ghosh, K., Ganguly, D., Chakraborty, T., Jones, G.J., Moens, M.F.: ECIR 2017 workshop on exploitation of social media for emergency relief and preparedness (SMERP 2017). In: ACM SIGIR Forum, vol. 51, pp. 36–41. ACM (2017)
Giachanou, A., Harvey, M., Crestani, F.: Topic-specific stylistic variations for opinion retrieval on Twitter. In: Ferro, N., et al. (eds.) ECIR 2016. LNCS, vol. 9626, pp. 466–478. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-30671-1_34
Gouws, S., Metzler, D., Cai, C., Hovy, E.: Contextual bearing on linguistic variation in social media. In: Proceedings of the Workshop on Languages in Social Media, pp. 20–29. Association for Computational Linguistics (2011)
Gupta, A., Kumaraguru, P., Castillo, C., Meier, P.: TweetCred: real-time credibility assessment of content on Twitter. In: Aiello, L.M., McFarland, D. (eds.) SocInfo 2014. LNCS, vol. 8851, pp. 228–243. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-13734-6_16
Hosmer Jr., D.W., Lemeshow, S., Sturdivant, R.X.: Applied Logistic Regression, vol. 398. Wiley, Hoboken (2013)
Huang, H., et al.: Tweet ranking based on heterogeneous networks. In: Proceedings of COLING 2012, pp. 1239–1256 (2012)
Imran, M., Elbassuoni, S., Castillo, C., Diaz, F., Meier, P.: Practical extraction of disaster-relevant information from social media. In: Proceedings of the 22nd International Conference on World Wide Web, pp. 1021–1024. ACM (2013)
Jiang, J., He, D., Kelly, D., Allan, J.: Understanding ephemeral state of relevance. In: Proceedings of the 2017 Conference on Conference Human Information Interaction and Retrieval, pp. 137–146. ACM (2017)
Liu, T.Y.: Learning to rank for information retrieval. Found. Trends Inf. Retrieval 3(3), 225–331 (2009)
Livraga, G., Viviani, M.: Data confidentiality and information credibility in on-line ecosystems. In: Proceedings of the 11th International Conference on Management of Digital EcoSystems, pp. 191–198 (2019)
Luo, Z., Osborne, M., Wang, T.: An effective approach to tweets opinion retrieval. World Wide Web 18(3), 545–566 (2015)
Mahata, D., Talburt, J.R., Singh, V.K.: From chirps to whistles: discovering event-specific informative content from Twitter. In: Proceedings of the ACM Web Science Conference, p. 17. ACM (2015)
Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, Cambridge (2008)
Massoudi, K., Tsagkias, M., de Rijke, M., Weerkamp, W.: Incorporating query expansion and quality indicators in searching microblog posts. In: Clough, P., et al. (eds.) ECIR 2011. LNCS, vol. 6611, pp. 362–367. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-20161-5_36
Mitra, T., Gilbert, E.: Credbank: A large-scale social media corpus with associated credibility annotations. In: Ninth International AAAI Conference on Web and Social Media (2015)
Mizzaro, S.: How many relevances in information retrieval? Interact. Comput. 10(3), 303–320 (1998)
Nagmoti, R., Teredesai, A., De Cock, M.: Ranking approaches for microblog search. In: Proceedings of the 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, vol. 1, pp. 153–157. IEEE Computer Society (2010)
Naveed, N., Gottron, T., Kunegis, J., Alhadi, A.C.: Bad news travel fast: a content-based analysis of interestingness on Twitter. In: Proceedings of the 3rd International Web Science Conference, pp. 1–7 (2011)
Naveed, N., Gottron, T., Kunegis, J., Alhadi, A.C.: Searching microblogs: coping with sparsity and document quality. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management, pp. 183–188. ACM (2011)
Nielsen, F.: A new ANEW: evaluation of a word list for sentiment analysis in microblogs. In: Proceedings of the ESWC2011 Workshop on Making Sense of Microposts: Big things come in small packages 718 in CEUR Workshop Proceedings, Heraklion (2011)
Olteanu, A., Vieweg, S., Castillo, C.: What to expect when the unexpected happens: social media communications across crises. In: Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing, pp. 994–1009. ACM (2015)
Page, L., Brin, S., Motwani, R., Winograd, T.: The pagerank citation ranking: bringing order to the web. Technical report, Stanford InfoLab (1999)
Pasi, G., De Grandis, M., Viviani, M.: Decision making over multiple criteria to assess news credibility in microblogging sites. In: Proceedings of IEEE World Congress on Computational Intelligence (WCCI) 2020. IEEE (2020)
Pasi, G., Viviani, M.: Application of aggregation operators to assess the credibility of user-generated content in social media. In: Medina, J., et al. (eds.) IPMU 2018. CCIS, vol. 853, pp. 342–353. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-91473-2_30
Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)
Porter, M.: The Porter stemming algorithm, 2005 (2008). http://www.tartarus.org/martin/PorterStemmer/index.html
Surdeanu, M., Ciaramita, M., Zaragoza, H.: Learning to rank answers to non-factoid questions from web collections. Comput. Linguist. 37(2), 351–383 (2011)
Tang, R., Solomon, P.: Toward an understanding of the dynamics of relevance judgment: an analysis of one person’s search behavior. Inf. Process. Manag. 34(2–3), 237–256 (1998)
Tao, K., Abel, F., Hauff, C., Houben, G.-J.: Twinder: a search engine for Twitter streams. In: Brambilla, M., Tokuda, T., Tolksdorf, R. (eds.) ICWE 2012. LNCS, vol. 7387, pp. 153–168. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-31753-8_11
Tao, K., Hauff, C., Abel, F., Houben, G.J.: Information retrieval for Twitter data, pp. 195–206. Digital Formations, Peter Lang (2013)
Teevan, J., Ramage, D., Morris, M.R.: # twittersearch: a comparison of microblog search and web search. In: Proceedings of the Fourth ACM International Conference on Web Search and Data Mining, pp. 35–44. ACM (2011)
Vakkari, P.: Task-based information searching. Ann. Rev. Inf. Sci. Technol. 37(1), 413–464 (2003)
Verma, M., Yilmaz, E., Craswell, N.: On obtaining effort based judgements for information retrieval. In: Proceedings of the Ninth ACM International Conference on Web Search and Data Mining, pp. 277–286. ACM (2016)
Viviani, M., Pasi, G.: A multi-criteria decision making approach for the assessment of information credibility in social media. In: Petrosino, A., Loia, V., Pedrycz, W. (eds.) WILF 2016. LNCS (LNAI), vol. 10147, pp. 197–207. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-52962-2_17
Viviani, M., Pasi, G.: Credibility in social media: opinions, news, and health information—a survey. Wiley Interdisc. Rev. Data Mining Knowl. Discov. 7(5), e1209 (2017)
Vosecky, J., Leung, K.W.-T., Ng, W.: Searching for quality microblog posts: filtering and ranking based on content analysis and implicit links. In: Lee, S., Peng, Z., Zhou, X., Moon, Y.-S., Unland, R., Yoo, J. (eds.) DASFAA 2012. LNCS, vol. 7238, pp. 397–413. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-29038-1_29
Webberley, W.M., Allen, S.M., Whitaker, R.M.: Retweeting beyond expectation: inferring interestingness in Twitter. Comput. Commun. 73, 229–235 (2016)
Weerkamp, W., De Rijke, M.: Credibility improves topical blog post retrieval. In: Proceedings of ACL 2008: HLT, pp. 923–931 (2008)
Yager, R.R.: On ordered weighted averaging aggregation operators in multicriteria decision making. IEEE Trans. Syst. Man Cybern. 18(1), 183–190 (1988)
Yang, P., Fang, H., Lin, J.: Anserini: enabling the use of lucene for information retrieval research. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1253–1256. ACM (2017)
Zhai, C., Lafferty, J.: A study of smoothing methods for language models applied to information retrieval. ACM Trans. Inf. Syst. (TOIS) 22(2), 179–214 (2004)
Zubiaga, A.: A longitudinal assessment of the persistence of Twitter datasets. J. Assoc. Inf. Sci. Technol. 69(8), 974–984 (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Putri, D.G.P., Viviani, M., Pasi, G. (2020). Social Search and Task-Related Relevance Dimensions in Microblogging Sites. In: Aref, S., et al. Social Informatics. SocInfo 2020. Lecture Notes in Computer Science(), vol 12467. Springer, Cham. https://doi.org/10.1007/978-3-030-60975-7_22
Download citation
DOI: https://doi.org/10.1007/978-3-030-60975-7_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-60974-0
Online ISBN: 978-3-030-60975-7
eBook Packages: Computer ScienceComputer Science (R0)