Abstract
This paper presents one approach used for the participation of the task 3 (TimeLine illustration based on Microblogs) for the CLEF Cultural Microblog Contextualization track in 2016. This task deals with the retrieval of tweets related to cultural events (music festivals). The idea is mainly to be able to get tweets that describe what happened during the shows of one festival. For the content-based aspects of the retrieval, we used the classical BM25 model [12]. Our concern was to study the impact of duplicate removal and several ways to re-ranks tweets. The obtained recall/precision evaluation results are biased by the limited number of runs considered in the pooling set for manual assessment, but the evaluation of results according to several informativeness measures show that adequate filtering increases such measure. We also describe the lessons learned from the first edition of this task and present how this impacts 2017’s edition of the task.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
- 2.
One feature of Twitter is to allow users to “forward” (with or without alteration), or retweet, received tweets.
References
Agrawal, R., Gollapudi, S., Halverson, A., Ieong, S.: Diversifying search results. In: Proceedings of the Second ACM International Conference on Web Search and Data Mining, WSDM 2009, pp. 5–14. ACM, New York (2009)
Amer, N.O., Mulhem, P., Géry, M.: Personalized parsimonious language models for user modeling in social bookmaking systems. In: Proceedings of the Advances in Information Retrieval - 39th European Conference on IR Research, ECIR 2017, Aberdeen, UK, 8–13 April, 2017, pp. 582–588 (2017)
Bellot, P., Moriceau, V., Mothe, J., SanJuan, E., Tannier, X.: INEX tweet contextualization task: evaluation, results and lesson learned. Inf. Process. Manag. 52(5), 801–819 (2016)
Ben Jabeur, L., Damak, F., Tamine, L., Cabanac, G., Pinel-Sauvagnat, K., Boughanem, M.: IRIT at TREC Microblog Track 2013. In: Text REtrieval Conference - TREC 2013, Gaithersburg, United States, November 2013
Cai, J., Zha, Z.-J., Zhou, W., Tian, Q.: Attribute-assisted reranking for web image retrieval. In: Proceedings of the 20th ACM International Conference on Multimedia, MM 2012, pp. 873–876. ACM, New York (2012)
Efron, M., Lin, J., He, J., de Vries, A.: Temporal feedback for tweet search with non-parametric density estimation. In: Proceedings of the 37th International ACM SIGIR Conference on Research & #38; Development in Information Retrieval, SIGIR 2014, pp. 33–42. ACM, New York (2014)
Kamps, J., Pehcevski, J., Kazai, G., Lalmas, M., Robertson, S.: Inex 2007 evaluation measures. In: Focused Access to XML Documents: Sixth Workshop of the Initiative for the Evaluation of XML Retrieval (INEX 2007) (2008)
Kocher, M., Savoy, J.: Distance measures in author profiling. Inform. Process. Manag. 53(5), 1103–1119 (2017)
Kuoman, C., Tollari, S., Detyniecki, M.: Using tree of concepts and hierarchical reordering for diversity in image retrieval. In: 2013 11th International Workshop on Content-Based Multimedia Indexing (CBMI), pp. 251–256, June 2013
Leskovec, J., Backstrom, L., Kleinberg, J.: Meme-tracking and the dynamics of the news cycle. In: Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2009, pp. 497–506. ACM, New York (2009)
Ounis, I., Amati, G., Plachouras, V., He, B., Macdonald, C., Lioma, C.: Terrier: a high performance and scalable information retrieval platform. In SIGIR 2006 Workshop on Open Source Information Retrieval (OSIR 2006) (2006)
Robertson, S.E., Walker, S., Jones, S., Hancock-Beaulieu, M.M., Gatford, M.: Okapi at trec3. In: Overview of the Third Text Retrieval Conference (TREC-3), pp. 109–126. NIST, Gaithersburg, January 1995
Tao, K., Abel, F., Hauff, C., Houben, G.-J., Gadiraju, U.: Groundhog day: near-duplicate detection on Twitter. In: Proceedings of the 22nd International Conference on World Wide Web, pp. 1273–1284. International World Wide Web Conferences Steering Committee (2013)
Tian, X., Yang, L., Wang, J., Yang, Y., Wu, X., Hua, X.-S.: Bayesian video search reranking. In: Proceedings of the 16th ACM International Conference on Multimedia, MM 2008, pp. 131–140. ACM, New York (2008)
Vosecky, J., Leung, K.W.-T., Ng, W.: Collaborative personalized twitter search with topic-language models. In: Proceedings of the 37th International ACM SIGIR Conference on Research & #38; Development in Information Retrieval, SIGIR 2014, pp. 53–62. ACM, New York (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Mulhem, P., Goeuriot, L., Dogra, N., Ould Amer, N. (2017). TimeLine Illustration Based on Microblogs: When Diversification Meets Metadata Re-ranking. In: Jones, G., et al. Experimental IR Meets Multilinguality, Multimodality, and Interaction. CLEF 2017. Lecture Notes in Computer Science(), vol 10456. Springer, Cham. https://doi.org/10.1007/978-3-319-65813-1_22
Download citation
DOI: https://doi.org/10.1007/978-3-319-65813-1_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-65812-4
Online ISBN: 978-3-319-65813-1
eBook Packages: Computer ScienceComputer Science (R0)