Skip to main content

Omnibase: Uniform Access to Heterogeneous Data for Question Answering

  • Conference paper
  • First Online:
Natural Language Processing and Information Systems (NLDB 2002)

Abstract

Although the World Wide Web contains a tremendous amount of information, the lack of uniform structure makes finding the right knowledge difficult. A solution is to turn the Web into a “virtual database” and to access it through natural language.We built Omnibase, a system that integrates heterogeneous data sources using an object- property-value model. With the help of Omnibase, our Start natural language system can now access numerous heterogeneous data sources on the Web in a uniform manner, and answers millions of user questions with high precision.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. B. Adelberg. NoDoSE-a tool for semi-automatically extracting structured and semistructured data from text documents. SIGMOD Record, 27:283–294, 1998.

    Article  Google Scholar 

  2. Ion Androutsopoulos, G. Ritchie, and P. Thanisch. Natural language interfaces to databases-an introduction. Natural Language Engineering, 1(1):29–81, 1995.

    Article  Google Scholar 

  3. P. Atzeni, G. Mecca, and P. Merialdo. Semistructured and structured data in the Web: Going back and forth. In Workshop on Management of Semistructured Data at PODS/SIGMOD’97, 1997.

    Google Scholar 

  4. T. Berners-Lee. Weaving the Web. Harper, New York, 1999.

    Google Scholar 

  5. M. Craven, D. DiPasquo, D. Freitag, A. McCallum, T. Mitchell, K. Nigam, and S. Slattery. Automatically deriving structured knowledge bases from on-line dictionaries. Technical Report CMU-CS-98-122, Carnegie Mellon University, 1998.

    Google Scholar 

  6. D. Florescu, A. Levy, and A. Mendelzon. Database techniques for the World-Wide Web: A survey. SIGMOD Record, 27(3):59–74, 1998.

    Article  Google Scholar 

  7. J. Hammer, H. Garcia-Molina, J. Cho, R. Aranha, and A. Crespo. Extracting semistructured information from the Web. In Workshop on Management of Semistructured Data at PODS/SIGMOD’97, 1997.

    Google Scholar 

  8. C. Hsu and C. Chang. Finite-state transducers for semi-structured text mining. In IJCAI-99 Workshop on Text Mining, 1999.

    Google Scholar 

  9. B. Katz. Using English for indexing and retrieving. In RIAO’ 88, 1988.

    Google Scholar 

  10. B. Katz. Annotating the World Wide Web using natural language. In RIAO’ 97, 1997.

    Google Scholar 

  11. T. Kirk, A. Levy, Y. Sagiv, and D. Srivastava. The Information Manifold. Technical report, AT& T Bell Laboratories, 1995.

    Google Scholar 

  12. C. Knoblock, S. Minton, J. Ambite, N. Ashish, I. Muslea, A. Philpot, and S. Tejada. The Ariadne approach to Web-based information integration. International Journal on Cooperative Information Systems, 10(1/2):145–169, 1999.

    Google Scholar 

  13. N. Kushmerick, D. Weld, and R. Doorenbos. Wrapper induction for information extraction. In IJCAI-97, 1997.

    Google Scholar 

  14. J. Lin. The Web as a resource for question answering: Perspectives and challenges. In LREC2002, 2002.

    Google Scholar 

  15. J. McHugh, S. Abiteboul, R. Goldman, D. Quass, and J. Widom. Lore: A database management system for semistructured data. Technical report, Stanford University Database Group, February 1997.

    Google Scholar 

  16. I. Muslea, S. Minton, and C. Knoblock. A hierarchical approach to wrapper induction. In 3rd International Conference on Autonomous Agents, 1999.

    Google Scholar 

  17. A. Sahuguet and F. Azavant. WysiWyg Web Wrapper Factory. In WWW8, 1999.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Katz, B. et al. (2002). Omnibase: Uniform Access to Heterogeneous Data for Question Answering. In: Andersson, B., Bergholtz, M., Johannesson, P. (eds) Natural Language Processing and Information Systems. NLDB 2002. Lecture Notes in Computer Science, vol 2553. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36271-1_23

Download citation

  • DOI: https://doi.org/10.1007/3-540-36271-1_23

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-00307-6

  • Online ISBN: 978-3-540-36271-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics