Skip to main content

Web Query Expansion by WordNet

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3588))

Abstract

In this paper, we address a novel method of Web query expansion by using WordNet and TSN. WordNet is an online lexical dictionary which describes word relationships in three dimensions of Hypernym, Hyponym and Synonym. And their impacts to expansions are different. We provide quantitative descriptions of the query expansion impact along each dimension. However, WordNet may bring many noises for the expansion due to its collection independent characteristic. Furthermore, it may not catch current state of words and their relationships because of the explosive increase of the Web. To overcome those problems, collection-based TSN (Term Semantic Network) is created with respect to word co-occurrence in the collection. We use TSN both as a filter and a supplement for WordNet. We also provide a quantitatively study as what is the best way for the expansion with TSN. In our system, we combine the query expansions along each semantic dimension as our overall solution. Our experiments reveal that the combined expansion can provide a satisfied result for the Web query performance. The methodologies in this paper have been already employed in our Web image search engine system.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Li, Y., Bandar, Z.A., McLean, D.: An Approach for Measuring Semantic Similarity between Words Using Multiple Information Sources, Knowledge and Data Engineering. IEEE Transactions on 15(4), 871–882 (2003)

    Google Scholar 

  2. Miller, G.A., Beckwith, R., Felbaum, C., Gross, D., Miller, K.: Introduction to WordNet: An On-line Lexical Database (1993) (Revised Version)

    Google Scholar 

  3. Jin, Q., Zhao, J., Xu, B.: Query Expansion Based on Term Similarity Tree Model. In: Natural Language Processing and Knowledge Engineering. International Conference, October 26-29, pp. 400–406 (2003)

    Google Scholar 

  4. Gong, Z., U, L.H., Cheang, C.W.: An Implementation of Web Image Search Engines. In: Chen, Z., Chen, H., Miao, Q., Fu, Y., Fox, E., Lim, E.-p. (eds.) ICADL 2004. LNCS, vol. 3334, pp. 355–367. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  5. Lin, S., Chen, M.C., Ho, J., Huang, Y.: ACIRD: Intelligent Internet Document Organization and Retrieval. IEEE Transactions on Knowledge and Data Engineering 14(3), 599–614 (2002)

    Article  Google Scholar 

  6. Qiu, Y., Frei, H.P.: Concept based query expansion. In: Proceedings of ACM SIGIR International Conference on Research and Development in Information Retrieval, pp. 160–170. ACM Press, New York (1993)

    Google Scholar 

  7. Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. Addison-Wesley, Reading (1999)

    Google Scholar 

  8. Jing, Y.F., Croft, W.B.: An Association Thesaurus for Information Retrieval. In: RIAO 1994 Conference Proceedings, New York, October 1994, pp. 146–160 (1994)

    Google Scholar 

  9. Crouch, C.J., Yang, B.: Experiments in automatic statistical thesaurus construction. In: Proceeding of ACM SIGIR International Conference on Research and Development in Information Retrieval, pp. 77–88 (1993)

    Google Scholar 

  10. Rocchio, J.Y.: Relevance Feedback in Information Retrieval. In: The SMART Retrieval System, pp. 313–323. Prentice Hall, Englewood Cliffs (1971)

    Google Scholar 

  11. Buckley, C., Singhal, A., Mitra, M., Salton, G.: New Retrieval Approaches Using SMART: TREC 4. In: Harman, D. (ed.) Proceedings of the TREC 4 Conference. National Institute of Standards and Technology Special Publication (1996)

    Google Scholar 

  12. Agrawaland, R., Srikant, R.: Fast Algorithms for Mining Association Rules. In: Proc. 20th Int’l Conf. Very Large Data Bases (VLDB) (September 1994)

    Google Scholar 

  13. Sriari, R.K., Zhang, Z., Rao, A.: Intelligent indexing and semantic retrieval of multimodal documents. Information Retrieval 2(2), 1–37 (2000)

    Google Scholar 

  14. Yang, J., Wenyin, L., Zhang, H., Zhuang, Y.: Thesaurus-aided approach for image browsing and retrieval. In: Thesaurus-aided approach for image browsing and retrieval, Multimedia and Expo, 2001. ICME 2001. IEEE International Conference, Augest 22-25, pp. 1135–1138 (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Gong, Z., Cheang, C.W., Leong Hou, U. (2005). Web Query Expansion by WordNet. In: Andersen, K.V., Debenham, J., Wagner, R. (eds) Database and Expert Systems Applications. DEXA 2005. Lecture Notes in Computer Science, vol 3588. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11546924_17

Download citation

  • DOI: https://doi.org/10.1007/11546924_17

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-28566-3

  • Online ISBN: 978-3-540-31729-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics