Abstract
In the paper, we present an exploration of using social annotations provided by the Web 2.0 sites (such as Del.icio.us) in helping web search. More specifically, we consider using the social annotations as an additional resource to strengthen existing smoothing methods for the language model for IR. The social annotations can benefit the smoothing of language model in two aspects: 1) the annotations themselves can serve as the summaries of the web pages given by the users; 2) the annotations can be seen as the links of the web pages sharing the same annotations. We propose three smoothing methods, addressing the two aspects and their combination, respectively. We call the new language model of using the proposed smoothing methods ’Language Annotation Model (LAM). Preliminary experimental results show that LAM significantly outperforms the traditional language models.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ponte, J.M., Croft, W.B.: A Language Modeling Approach to Information Retrieval. In: Research and Development in Information Retrieval, pp. 275–281 (1998)
Zhai, C., Lafferty, J.: A Study of Smoothing Methods for Language Models Applied to Information Retrieval. ACM Transactions on Information Systems 22, 179–214 (2004)
Song, F., Croft, W.B.: A general language model for information retrieval. In: Proc. of CIKM’99, pp. 316–321 (1999)
Rosenfeld, R.: Two decades of statistical language modeling: Where do we go from here. Proc. of the IEEE 88(8) (2000)
Srikanth, M., Srihari, R.K.: Exploiting syntactic structure of queries in a language modeling approach to IR. In: Proc. of CIKM’03, pp. 476–483 (2003)
Bao, S., et al.: LSM: Language Sense Model for Information Retrieval. In: Yu, J.X., Kitsuregawa, M., Leong, H.-V. (eds.) WAIM 2006. LNCS, vol. 4016, pp. 97–108. Springer, Heidelberg (2006)
Kurland, O., Lee, L.: Corpus structure, language models, and ad hoc information. In: Proc. of SIGIR’04, pp. 194–201 (2004)
Xu, J., Croft, W.: Cluster-based retrieval using language models. In: Proc. of SIGIR’04, pp. 186–193 (2004)
Golder, S.A., Huberman, B.A.: The Structure of Collaborative Tagging Systems (2005), http://www.hpl.hp.com/research/idl/papers/tags/
Mika, P.: Ontologies are us: A unified model of social networks and semantics. In: Gil, Y., et al. (eds.) ISWC 2005. LNCS, vol. 3729, Springer, Heidelberg (2005)
Wu, X., Zhang, L., Yu, Y.: Exploring social annotations for the semantic web. In: Proc. of WWW’06, pp. 417–426. ACM Press, New York (2006)
Dmitriev, P.A., et al.: Using Annotations in Enterprise Search. In: Proc. of WWW’06, pp. 811–817. ACM Press, New York (2006)
Robertson, S.E., et al.: Okapi at TREC. In: TREC’92, pp. 21–30 (1992)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Xu, S., Bao, S., Yu, Y., Cao, Y. (2007). Using Social Annotations to Smooth the Language Model for IR. In: Zhou, ZH., Li, H., Yang, Q. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2007. Lecture Notes in Computer Science(), vol 4426. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71701-0_114
Download citation
DOI: https://doi.org/10.1007/978-3-540-71701-0_114
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71700-3
Online ISBN: 978-3-540-71701-0
eBook Packages: Computer ScienceComputer Science (R0)