Skip to main content

Using Social Annotations to Smooth the Language Model for IR

  • Conference paper
Advances in Knowledge Discovery and Data Mining (PAKDD 2007)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4426))

Included in the following conference series:

Abstract

In the paper, we present an exploration of using social annotations provided by the Web 2.0 sites (such as Del.icio.us) in helping web search. More specifically, we consider using the social annotations as an additional resource to strengthen existing smoothing methods for the language model for IR. The social annotations can benefit the smoothing of language model in two aspects: 1) the annotations themselves can serve as the summaries of the web pages given by the users; 2) the annotations can be seen as the links of the web pages sharing the same annotations. We propose three smoothing methods, addressing the two aspects and their combination, respectively. We call the new language model of using the proposed smoothing methods ’Language Annotation Model (LAM). Preliminary experimental results show that LAM significantly outperforms the traditional language models.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ponte, J.M., Croft, W.B.: A Language Modeling Approach to Information Retrieval. In: Research and Development in Information Retrieval, pp. 275–281 (1998)

    Google Scholar 

  2. Zhai, C., Lafferty, J.: A Study of Smoothing Methods for Language Models Applied to Information Retrieval. ACM Transactions on Information Systems 22, 179–214 (2004)

    Article  Google Scholar 

  3. Song, F., Croft, W.B.: A general language model for information retrieval. In: Proc. of CIKM’99, pp. 316–321 (1999)

    Google Scholar 

  4. Rosenfeld, R.: Two decades of statistical language modeling: Where do we go from here. Proc. of the IEEE 88(8) (2000)

    Google Scholar 

  5. Srikanth, M., Srihari, R.K.: Exploiting syntactic structure of queries in a language modeling approach to IR. In: Proc. of CIKM’03, pp. 476–483 (2003)

    Google Scholar 

  6. Bao, S., et al.: LSM: Language Sense Model for Information Retrieval. In: Yu, J.X., Kitsuregawa, M., Leong, H.-V. (eds.) WAIM 2006. LNCS, vol. 4016, pp. 97–108. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  7. Kurland, O., Lee, L.: Corpus structure, language models, and ad hoc information. In: Proc. of SIGIR’04, pp. 194–201 (2004)

    Google Scholar 

  8. Xu, J., Croft, W.: Cluster-based retrieval using language models. In: Proc. of SIGIR’04, pp. 186–193 (2004)

    Google Scholar 

  9. Golder, S.A., Huberman, B.A.: The Structure of Collaborative Tagging Systems (2005), http://www.hpl.hp.com/research/idl/papers/tags/

  10. Mika, P.: Ontologies are us: A unified model of social networks and semantics. In: Gil, Y., et al. (eds.) ISWC 2005. LNCS, vol. 3729, Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  11. Wu, X., Zhang, L., Yu, Y.: Exploring social annotations for the semantic web. In: Proc. of WWW’06, pp. 417–426. ACM Press, New York (2006)

    Google Scholar 

  12. Dmitriev, P.A., et al.: Using Annotations in Enterprise Search. In: Proc. of WWW’06, pp. 811–817. ACM Press, New York (2006)

    Google Scholar 

  13. Robertson, S.E., et al.: Okapi at TREC. In: TREC’92, pp. 21–30 (1992)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Zhi-Hua Zhou Hang Li Qiang Yang

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer Berlin Heidelberg

About this paper

Cite this paper

Xu, S., Bao, S., Yu, Y., Cao, Y. (2007). Using Social Annotations to Smooth the Language Model for IR. In: Zhou, ZH., Li, H., Yang, Q. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2007. Lecture Notes in Computer Science(), vol 4426. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71701-0_114

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-71701-0_114

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-71700-3

  • Online ISBN: 978-3-540-71701-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics