Abstract
Although the World Wide Web contains a tremendous amount of information, the lack of uniform structure makes finding the right knowledge difficult. A solution is to turn the Web into a “virtual database” and to access it through natural language.We built Omnibase, a system that integrates heterogeneous data sources using an object- property-value model. With the help of Omnibase, our Start natural language system can now access numerous heterogeneous data sources on the Web in a uniform manner, and answers millions of user questions with high precision.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
B. Adelberg. NoDoSE-a tool for semi-automatically extracting structured and semistructured data from text documents. SIGMOD Record, 27:283–294, 1998.
Ion Androutsopoulos, G. Ritchie, and P. Thanisch. Natural language interfaces to databases-an introduction. Natural Language Engineering, 1(1):29–81, 1995.
P. Atzeni, G. Mecca, and P. Merialdo. Semistructured and structured data in the Web: Going back and forth. In Workshop on Management of Semistructured Data at PODS/SIGMOD’97, 1997.
T. Berners-Lee. Weaving the Web. Harper, New York, 1999.
M. Craven, D. DiPasquo, D. Freitag, A. McCallum, T. Mitchell, K. Nigam, and S. Slattery. Automatically deriving structured knowledge bases from on-line dictionaries. Technical Report CMU-CS-98-122, Carnegie Mellon University, 1998.
D. Florescu, A. Levy, and A. Mendelzon. Database techniques for the World-Wide Web: A survey. SIGMOD Record, 27(3):59–74, 1998.
J. Hammer, H. Garcia-Molina, J. Cho, R. Aranha, and A. Crespo. Extracting semistructured information from the Web. In Workshop on Management of Semistructured Data at PODS/SIGMOD’97, 1997.
C. Hsu and C. Chang. Finite-state transducers for semi-structured text mining. In IJCAI-99 Workshop on Text Mining, 1999.
B. Katz. Using English for indexing and retrieving. In RIAO’ 88, 1988.
B. Katz. Annotating the World Wide Web using natural language. In RIAO’ 97, 1997.
T. Kirk, A. Levy, Y. Sagiv, and D. Srivastava. The Information Manifold. Technical report, AT& T Bell Laboratories, 1995.
C. Knoblock, S. Minton, J. Ambite, N. Ashish, I. Muslea, A. Philpot, and S. Tejada. The Ariadne approach to Web-based information integration. International Journal on Cooperative Information Systems, 10(1/2):145–169, 1999.
N. Kushmerick, D. Weld, and R. Doorenbos. Wrapper induction for information extraction. In IJCAI-97, 1997.
J. Lin. The Web as a resource for question answering: Perspectives and challenges. In LREC2002, 2002.
J. McHugh, S. Abiteboul, R. Goldman, D. Quass, and J. Widom. Lore: A database management system for semistructured data. Technical report, Stanford University Database Group, February 1997.
I. Muslea, S. Minton, and C. Knoblock. A hierarchical approach to wrapper induction. In 3rd International Conference on Autonomous Agents, 1999.
A. Sahuguet and F. Azavant. WysiWyg Web Wrapper Factory. In WWW8, 1999.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Katz, B. et al. (2002). Omnibase: Uniform Access to Heterogeneous Data for Question Answering. In: Andersson, B., Bergholtz, M., Johannesson, P. (eds) Natural Language Processing and Information Systems. NLDB 2002. Lecture Notes in Computer Science, vol 2553. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36271-1_23
Download citation
DOI: https://doi.org/10.1007/3-540-36271-1_23
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00307-6
Online ISBN: 978-3-540-36271-5
eBook Packages: Springer Book Archive