Social Search and Task-Related Relevance Dimensions in Microblogging Sites

Putri, Divi Galih Prasetyo; Viviani, Marco; Pasi, Gabriella

doi:10.1007/978-3-030-60975-7_22

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 12467)

Included in the following conference series: International Conference on Social Informatics

Abstract

Social media, and microblogging sites in particular, allow users to post many kinds of content for different purposes. Content may be purely conversational, news-related, or event-related. To find information relevant to users in this heterogeneous mass of content, it is important to consider the task for which the search is carried out and the relevance dimensions most suitable for that task. In recent years, although the social search problem has been increasingly investigated, this aspect has not been sufficiently analyzed. For this reason, in this paper we focus on different search tasks in the microblog search context and identify some related relevance dimensions. We also report experiments carried out to verify the impact of the identified relevance dimensions on system effectiveness with respect to the considered search tasks.

Notes

1. https://www.statista.com/statistics/282087/number-of-monthly-active-twitter-users/.
2. https://www.journalism.org/2017/09/07/news-use-across-social-media-platforms-2017/.
3. https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html.
4. In this case, being a related tweet means that the tweet discusses the crisis event.
5. Also for informativeness, logistic regression has been performed by employing the model implemented by the scikit-learn library [36], using the default parameters.
6. https://radimrehurek.com/gensim/models/ldamodel.html.
7. https://www.computing.dcu.ie/~dganguly/smerp2017/.
8. https://lucene.apache.org/.

References

1. Alhadi, A.C., Gottron, T., Kunegis, J., Naveed, N.: LiveTweet: microblog retrieval based on interestingness and an adaptation of the vector space model. In: TREC (2011)
2. Borlund, P.: The concept of relevance in IR. J. Am. Soc. Inform. Sci. Technol. 54(10), 913–925 (2003)
3. Buckley, C., Voorhees, E.M.: Retrieval evaluation with incomplete information. In: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 25–32 (2004)
4. Choi, J., Croft, W.B., Kim, J.Y.: Quality models for microblog retrieval. In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, pp. 1834–1838. ACM (2012)
5. Cooper, W.S.: On selecting a measure of retrieval effectiveness. J. Am. Soc. Inf. Sci. 24(2), 87–100 (1973)
6. da Costa Pereira, C., Dragoni, M., Pasi, G.: Multidimensional relevance: prioritized aggregation in a personalized information retrieval setting. Inf. Process. Manag. 48(2), 340–357 (2012)
7. Craswell, N.: Bpref. In: Liu, L., Ozsu, M.T. (eds.) Encyclopedia of Database Systems, pp. 266–267. Springer, Boston (2009). https://doi.org/10.1007/978-0-387-39940-9_489
8. De Grandis, M., Pasi, G., Viviani, M.: Fake news detection in microblogging through quantifier-guided aggregation. In: Torra, V., Narukawa, Y., Pasi, G., Viviani, M. (eds.) MDAI 2019. LNCS (LNAI), vol. 11676, pp. 64–76. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-26773-5_6
9. Duan, Y., Jiang, L., Qin, T., Zhou, M., Shum, H.Y.: An empirical study on learning to rank of tweets. In: Proceedings of the 23rd International Conference on Computational Linguistics, pp. 295–303. Association for Computational Linguistics (2010)
10. Efron, M.: Information search and retrieval in microblogs. J. Am. Soc. Inform. Sci. Technol. 62(6), 996–1008 (2011)
11. Fogg, B., Tseng, H.: The elements of computer credibility. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 80–87. ACM (1999)
12. Ghosh, S., Ghosh, K., Ganguly, D., Chakraborty, T., Jones, G.J., Moens, M.F.: ECIR 2017 workshop on exploitation of social media for emergency relief and preparedness (SMERP 2017). In: ACM SIGIR Forum, vol. 51, pp. 36–41. ACM (2017)
13. Giachanou, A., Harvey, M., Crestani, F.: Topic-specific stylistic variations for opinion retrieval on Twitter. In: Ferro, N., et al. (eds.) ECIR 2016. LNCS, vol. 9626, pp. 466–478. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-30671-1_34
14. Gouws, S., Metzler, D., Cai, C., Hovy, E.: Contextual bearing on linguistic variation in social media. In: Proceedings of the Workshop on Languages in Social Media, pp. 20–29. Association for Computational Linguistics (2011)
15. Gupta, A., Kumaraguru, P., Castillo, C., Meier, P.: TweetCred: real-time credibility assessment of content on Twitter. In: Aiello, L.M., McFarland, D. (eds.) SocInfo 2014. LNCS, vol. 8851, pp. 228–243. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-13734-6_16
16. Hosmer Jr., D.W., Lemeshow, S., Sturdivant, R.X.: Applied Logistic Regression, vol. 398. Wiley, Hoboken (2013)
17. Huang, H., et al.: Tweet ranking based on heterogeneous networks. In: Proceedings of COLING 2012, pp. 1239–1256 (2012)
18. Imran, M., Elbassuoni, S., Castillo, C., Diaz, F., Meier, P.: Practical extraction of disaster-relevant information from social media. In: Proceedings of the 22nd International Conference on World Wide Web, pp. 1021–1024. ACM (2013)
19. Jiang, J., He, D., Kelly, D., Allan, J.: Understanding ephemeral state of relevance. In: Proceedings of the 2017 Conference on Conference Human Information Interaction and Retrieval, pp. 137–146. ACM (2017)
20. Liu, T.Y.: Learning to rank for information retrieval. Found. Trends Inf. Retrieval 3(3), 225–331 (2009)
21. Livraga, G., Viviani, M.: Data confidentiality and information credibility in on-line ecosystems. In: Proceedings of the 11th International Conference on Management of Digital EcoSystems, pp. 191–198 (2019)
22. Luo, Z., Osborne, M., Wang, T.: An effective approach to tweets opinion retrieval. World Wide Web 18(3), 545–566 (2015)
23. Mahata, D., Talburt, J.R., Singh, V.K.: From chirps to whistles: discovering event-specific informative content from Twitter. In: Proceedings of the ACM Web Science Conference, p. 17. ACM (2015)
24. Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, Cambridge (2008)
25. Massoudi, K., Tsagkias, M., de Rijke, M., Weerkamp, W.: Incorporating query expansion and quality indicators in searching microblog posts. In: Clough, P., et al. (eds.) ECIR 2011. LNCS, vol. 6611, pp. 362–367. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-20161-5_36
26. Mitra, T., Gilbert, E.: Credbank: a large-scale social media corpus with associated credibility annotations. In: Ninth International AAAI Conference on Web and Social Media (2015)
27. Mizzaro, S.: How many relevances in information retrieval? Interact. Comput. 10(3), 303–320 (1998)
28. Nagmoti, R., Teredesai, A., De Cock, M.: Ranking approaches for microblog search. In: Proceedings of the 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, vol. 1, pp. 153–157. IEEE Computer Society (2010)
29. Naveed, N., Gottron, T., Kunegis, J., Alhadi, A.C.: Bad news travel fast: a content-based analysis of interestingness on Twitter. In: Proceedings of the 3rd International Web Science Conference, pp. 1–7 (2011)
30. Naveed, N., Gottron, T., Kunegis, J., Alhadi, A.C.: Searching microblogs: coping with sparsity and document quality. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management, pp. 183–188. ACM (2011)
31. Nielsen, F.: A new ANEW: evaluation of a word list for sentiment analysis in microblogs. In: Proceedings of the ESWC2011 Workshop on Making Sense of Microposts: Big Things Come in Small Packages. CEUR Workshop Proceedings, vol. 718, Heraklion (2011)
32. Olteanu, A., Vieweg, S., Castillo, C.: What to expect when the unexpected happens: social media communications across crises. In: Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing, pp. 994–1009. ACM (2015)
33. Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank citation ranking: bringing order to the web. Technical report, Stanford InfoLab (1999)
34. Pasi, G., De Grandis, M., Viviani, M.: Decision making over multiple criteria to assess news credibility in microblogging sites. In: Proceedings of IEEE World Congress on Computational Intelligence (WCCI) 2020. IEEE (2020)
35. Pasi, G., Viviani, M.: Application of aggregation operators to assess the credibility of user-generated content in social media. In: Medina, J., et al. (eds.) IPMU 2018. CCIS, vol. 853, pp. 342–353. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-91473-2_30
36. Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)
37. Porter, M.: The Porter stemming algorithm, 2005 (2008). http://www.tartarus.org/martin/PorterStemmer/index.html
38. Surdeanu, M., Ciaramita, M., Zaragoza, H.: Learning to rank answers to non-factoid questions from web collections. Comput. Linguist. 37(2), 351–383 (2011)
39. Tang, R., Solomon, P.: Toward an understanding of the dynamics of relevance judgment: an analysis of one person’s search behavior. Inf. Process. Manag. 34(2–3), 237–256 (1998)
40. Tao, K., Abel, F., Hauff, C., Houben, G.-J.: Twinder: a search engine for Twitter streams. In: Brambilla, M., Tokuda, T., Tolksdorf, R. (eds.) ICWE 2012. LNCS, vol. 7387, pp. 153–168. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-31753-8_11
41. Tao, K., Hauff, C., Abel, F., Houben, G.J.: Information retrieval for Twitter data, pp. 195–206. Digital Formations, Peter Lang (2013)
42. Teevan, J., Ramage, D., Morris, M.R.: #TwitterSearch: a comparison of microblog search and web search. In: Proceedings of the Fourth ACM International Conference on Web Search and Data Mining, pp. 35–44. ACM (2011)
43. Vakkari, P.: Task-based information searching. Ann. Rev. Inf. Sci. Technol. 37(1), 413–464 (2003)
44. Verma, M., Yilmaz, E., Craswell, N.: On obtaining effort based judgements for information retrieval. In: Proceedings of the Ninth ACM International Conference on Web Search and Data Mining, pp. 277–286. ACM (2016)
45. Viviani, M., Pasi, G.: A multi-criteria decision making approach for the assessment of information credibility in social media. In: Petrosino, A., Loia, V., Pedrycz, W. (eds.) WILF 2016. LNCS (LNAI), vol. 10147, pp. 197–207. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-52962-2_17
46. Viviani, M., Pasi, G.: Credibility in social media: opinions, news, and health information—a survey. Wiley Interdisc. Rev. Data Mining Knowl. Discov. 7(5), e1209 (2017)
47. Vosecky, J., Leung, K.W.-T., Ng, W.: Searching for quality microblog posts: filtering and ranking based on content analysis and implicit links. In: Lee, S., Peng, Z., Zhou, X., Moon, Y.-S., Unland, R., Yoo, J. (eds.) DASFAA 2012. LNCS, vol. 7238, pp. 397–413. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-29038-1_29
48. Webberley, W.M., Allen, S.M., Whitaker, R.M.: Retweeting beyond expectation: inferring interestingness in Twitter. Comput. Commun. 73, 229–235 (2016)
49. Weerkamp, W., De Rijke, M.: Credibility improves topical blog post retrieval. In: Proceedings of ACL 2008: HLT, pp. 923–931 (2008)
50. Yager, R.R.: On ordered weighted averaging aggregation operators in multicriteria decision making. IEEE Trans. Syst. Man Cybern. 18(1), 183–190 (1988)
51. Yang, P., Fang, H., Lin, J.: Anserini: enabling the use of Lucene for information retrieval research. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1253–1256. ACM (2017)
52. Zhai, C., Lafferty, J.: A study of smoothing methods for language models applied to information retrieval. ACM Trans. Inf. Syst. (TOIS) 22(2), 179–214 (2004)
53. Zubiaga, A.: A longitudinal assessment of the persistence of Twitter datasets. J. Assoc. Inf. Sci. Technol. 69(8), 974–984 (2018)

Author information

Authors and Affiliations

Department of Informatics, Systems, and Communication (DISCo), Information and Knowledge Representation, Retrieval, and Reasoning (IKR3) Lab, University of Milano-Bicocca, Edificio U14, Viale Sarca 336, 20126, Milan, Italy
Divi Galih Prasetyo Putri, Marco Viviani & Gabriella Pasi

Corresponding author

Correspondence to Marco Viviani.

Editor information

Editors and Affiliations

Samin Aref, Max Planck Institute for Demographic Research, Rostock, Germany
Kalina Bontcheva, University of Sheffield, Sheffield, UK
Marco Braghieri, King’s College London, London, UK
Frank Dignum, Umeå University, Umeå, Sweden
Fosca Giannotti, ISTI-CNR, Pisa, Italy
Francesco Grisolia, University of Pisa, Pisa, Italy
Dino Pedreschi, University of Pisa, Pisa, Italy

About this paper

Cite this paper

Putri, D.G.P., Viviani, M., Pasi, G. (2020). Social Search and Task-Related Relevance Dimensions in Microblogging Sites. In: Aref, S., et al. Social Informatics. SocInfo 2020. Lecture Notes in Computer Science, vol 12467. Springer, Cham. https://doi.org/10.1007/978-3-030-60975-7_22

DOI: https://doi.org/10.1007/978-3-030-60975-7_22
Published: 07 October 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-60974-0
Online ISBN: 978-3-030-60975-7
eBook Packages: Computer Science, Computer Science (R0)