Information Retrieval Services Based on Lucene Architecture

  • Conference paper

Part of the book series: Communications in Computer and Information Science (CCIS, volume 307)

Abstract

Lucene full-text retrieval technology is widely used in the field of information retrieval; Lucene is an excellent open-source full-text indexing toolkit written in Java. This paper first briefly describes Lucene's inverted index mechanism, then analyses the Lucene architecture and its index file structure as the basis for introducing the two modules of Lucene in detail. Finally, it points out the shortcomings of the segmentation module and where that module can be improved. Through its API, we can embed Lucene into a variety of applications to develop our own search engine or to customize a personalized information retrieval service.
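
The inverted index idea mentioned in the abstract can be illustrated with a minimal sketch in plain Java. This is not Lucene's implementation or API: the class `InvertedIndexSketch` and its methods `tokenize`, `add`, and `search` are hypothetical names chosen for illustration, and the tokenizer is a naive stand-in for a real segmentation module (it only lowercases and splits on non-alphanumeric characters). The sketch maps each term to the sorted set of document IDs that contain it and answers a multi-term query by intersecting those sets.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.SortedSet;
import java.util.TreeMap;
import java.util.TreeSet;

// Hypothetical illustration class; it does not use or mirror the Lucene API.
class InvertedIndexSketch {
    // term -> sorted IDs of the documents that contain the term (the "postings")
    private final Map<String, SortedSet<Integer>> postings = new TreeMap<>();
    private final List<String> documents = new ArrayList<>();

    // Naive stand-in for segmentation: lowercase, then split on any run of
    // characters that are neither letters nor decimal digits.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{Nd}]+")) {
            if (!t.isEmpty()) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    // Stores the document and records its ID under every term it contains.
    int add(String text) {
        int docId = documents.size();
        documents.add(text);
        for (String term : tokenize(text)) {
            postings.computeIfAbsent(term, k -> new TreeSet<>()).add(docId);
        }
        return docId;
    }

    // AND query: returns the IDs of documents that contain every query term.
    SortedSet<Integer> search(String query) {
        SortedSet<Integer> result = null;
        for (String term : tokenize(query)) {
            SortedSet<Integer> docs = postings.getOrDefault(term, new TreeSet<Integer>());
            if (result == null) {
                result = new TreeSet<>(docs); // copy so the index is not mutated
            } else {
                result.retainAll(docs);       // intersect with the next posting list
            }
        }
        return result == null ? new TreeSet<Integer>() : result;
    }

    public static void main(String[] args) {
        InvertedIndexSketch index = new InvertedIndexSketch();
        index.add("Lucene is a full-text indexing toolkit");
        index.add("An inverted index maps terms to documents");
        index.add("Lucene builds an inverted index");
        System.out.println(index.search("lucene index"));
    }
}
```

Looking up a term is a single map access regardless of how many documents are stored, which is the property that makes inverted indexes suitable for full-text retrieval. The quality of the results depends heavily on the tokenizer, which is why the segmentation step is a natural place for improvement.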

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Li, H., Li, W., Wang, G., Peng, X. (2012). Information Retrieval Services Based on Lucene Architecture. In: Liu, C., Wang, L., Yang, A. (eds) Information Computing and Applications. ICICA 2012. Communications in Computer and Information Science, vol 307. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34038-3_88

  • DOI: https://doi.org/10.1007/978-3-642-34038-3_88

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-34037-6

  • Online ISBN: 978-3-642-34038-3

  • eBook Packages: Computer Science, Computer Science (R0)
