Skip to main content

Two Phase Indexes Based Passage Retrieval in Biomedical Texts

  • Conference paper
  • 1370 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4688))

Abstract

The biomedical literature is growing at a double-exponential pace. Passage-level retrieval is more effective to provide the information section than document-level retrieval. This paper presents a method of two phase indexes based passage retrieval. First two phase indexes: paragraph index and sentence-level half-overlapped windows index are built. Then, BM25 model is used to retrieval on the two phase indexes. At last, the passage and paragraph retrieval results are combined as the result of the passage retrieval. The experiment result shows that the performance is improved 5% with two phase indexes than only with the paragraph index.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Hersh, W., Cohen, A.M., Roberts, P., Rekapalli, H.K.: TREC 2006 Genomics Track Overview. In: Proceedings of the 15th Text REtrieval Conference (2006)

    Google Scholar 

  2. Jiang, J., Zhai, C.: Extraction of coherent relevant passages using hidden Markov models. ACM Transactions on Information Systems 24(3), 295–319 (2006)

    Article  Google Scholar 

  3. Ofoghi, B., Yearwood, J., Jan, R.G.: A semantic approach to boost passage retrieval effectiveness for question answering. In: Proceedings of the 29th Australasian Computer Science Conference, pp. 95–101 (2006)

    Google Scholar 

  4. Hearst, M.: TextTiling: Segmenting Text into Multi-Paragraph Subtopic Passages. Computational Linguistics 23(1), 33–64 (1997)

    Google Scholar 

  5. Callan, J.P.: Passage-level evidence in document retrieval. In: SIGIR. Proceedings of the 17th annual international ACM-SIGIR conference on research and developments in information retrieval, Dublin, Ireland, New York, pp. 302–310. ACM Press, New York (1994)

    Google Scholar 

  6. Liu, X., Croft, W.B.: Passage: Retrieval Based on Language Models. In: Proceedings of the 11th International Conference on Information and Knowledge Management, pp. 375–382 (2002)

    Google Scholar 

  7. Kaszkiel, M., Zobel, J.: Effective ranking with arbitrary passages. Journal of the American Society For Information Science and Technology 52(4), 344–364 (2001)

    Article  Google Scholar 

  8. Ponte, J.M., Bruce Croft, W.: A language modeling approach to information retrieval. In: SIGIR. Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Melbourne, Australia, pp. 275–281. ACM Press, New York (1998)

    Google Scholar 

  9. Jiang, J., Zhai, C.: UIUC in HARD 2004–Passage retrieval using HMMs. In: Proceedings of the 13th Text REtrieval Conference2004 (2004)

    Google Scholar 

  10. Huang, X., Huang, Y., Wen, M., et al.: York University at TREC 2004: HARD and Genomics Tracks. In: Proceedings of the 13th Text REtrieval Conference (2004)

    Google Scholar 

  11. Lin, K.H.-Y., Hou, W.-J., Chen, H.-H.: Retrieval of Biomedical Documents by Prioritizing Key Phrases. In: Proceedings of the 14th Text REtrieval Conference, Gaithersburg, Maryland (2005)

    Google Scholar 

  12. Zhai, C., Lu, X., Ling, X., He, X., Velivelli, A., Wang, X., Fang, H., Shakery, A.: UIUC/MUSC at TREC 2005 Genomics Track. In: Proceedings of the 14th Text REtrieval Conference (2006)

    Google Scholar 

  13. Strohman, T., Metzler, D., Turtle, H., Croft, W.B.: Indri: A language model based search engine for complex queries. In: Proceedings of the International Conference on Intelligence Analysis (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Kang Li Minrui Fei George William Irwin Shiwei Ma

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Chen, R., Lin, H., Yang, Z. (2007). Two Phase Indexes Based Passage Retrieval in Biomedical Texts. In: Li, K., Fei, M., Irwin, G.W., Ma, S. (eds) Bio-Inspired Computational Intelligence and Applications. LSMS 2007. Lecture Notes in Computer Science, vol 4688. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74769-7_79

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-74769-7_79

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-74768-0

  • Online ISBN: 978-3-540-74769-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics