Skip to main content

Domain-Specific Semantic Retrieval of Institutional Repository Based on Query Extension

  • Conference paper
  • First Online:
Trustworthy Computing and Services (ISCTCS 2014)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 520))

Included in the following conference series:

  • 948 Accesses

Abstract

Researchers have found that most institutional repositories are still using the retrieval technology based on keywords, but because the information resource of which they contain are abundant and highly specialized, such retrieval techniques often can not satisfy the users. This paper designs and implements a Domain-Specific Semantic Retrieval of Institutional Repository, using semantic dictionary WordNet to perform word sense disambiguation and Extension, and the results obtained by filtering the domain dictionary, and take advantage of the open source Lucene search engine tools to complete the document retrieval. Experimental results show that there is improvement in terms of coverage and precision.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bobay, J.: Institutional repositories: why go there? Indiana Libr. 27(1), 7–9 (2014)

    Google Scholar 

  2. Silverstein, C., Marais, H., Henzinger, M., et al.: Analysis of a very large web search engine query log. ACM SIGIR Forum 33(1), 6–12 (1999)

    Article  Google Scholar 

  3. Yu, H., et al.: Research in search engine user behavior based on log analysis. J. Chin. Inf. Process. 21(1), 109–114 (2007)

    Google Scholar 

  4. Furnas, G.W., Landauer, T.K., Gomez, L.M., et al.: The vocabulary problem in human-system communication. Commun. ACM 30(11), 964–971 (1987)

    Article  Google Scholar 

  5. Wang, F., Lin, L., Yang, S., et al. A semantic query extension-based patent retrieval approach. In: 2013 10th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD), pp. 572–577. IEEE (2013)

    Google Scholar 

  6. Zapatrin, R.: Quantum emulation of query extension in information retrieval (2014). arXiv:1411.3843

  7. Lyons, J.: Linguistic Semantics: An Introduction. Cambridge University Press, Cambridge (1995)

    Book  Google Scholar 

  8. Jiang, J.J., Conrath, D.W.: Semantic similarity based on corpus statistics and lexical taxonomy. arXiv preprint cmp-lg/9709008 (1997)

  9. Balakrishnan, V., Lloyd-Yemoh, E.: Stemming and lemmatization: a comparison of retrieval performances. Lect. Notes Softw. Eng. 2(3) (2014)

    Google Scholar 

  10. Pal, D., Mitra, M., Datta, K.: Improving query expansion using WordNet. J. Assoc. Inf. Sci. Technol. (2014)

    Google Scholar 

  11. Kolte, S.G., Bhirud, S.G.: Word sense disambiguation using wordnet domains. In: First International Conference on Emerging Trends in Engineering and Technology, 2008, ICETET 2008, pp. 1187–1191. IEEE (2008)

    Google Scholar 

  12. Dantchev, M.: WORDNET 2.1 Overview EECS 595/SI 661&761/LING 541 Natural Language Processing Fall (2006)

    Google Scholar 

Download references

Acknowledgements

This work was supported by the Major Research Plan of the National Natural Science Foundation of China [91124002] and National Culture Support Foundation Project of China [2013BAH43F01].

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pengchong Li .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wu, X., Li, P., Xu, J., Xie, X. (2015). Domain-Specific Semantic Retrieval of Institutional Repository Based on Query Extension. In: Yueming, L., Xu, W., Xi, Z. (eds) Trustworthy Computing and Services. ISCTCS 2014. Communications in Computer and Information Science, vol 520. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-47401-3_51

Download citation

  • DOI: https://doi.org/10.1007/978-3-662-47401-3_51

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-47400-6

  • Online ISBN: 978-3-662-47401-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics