Omnibase: Uniform Access to Heterogeneous Data for Question Answering

Katz, Boris; Felshin, Sue; Yuret, Deniz; Ibrahim, Ali; Lin, Jimmy; Marton, Gregory; McFarland, Alton Jerome; Temelkuran, Baris

doi:10.1007/3-540-36271-1_23

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2553)

Included in the following conference series:

International Conference on Application of Natural Language to Information Systems

Abstract

Although the World Wide Web contains a tremendous amount of information, its lack of uniform structure makes finding the right knowledge difficult. One solution is to turn the Web into a “virtual database” and to access it through natural language. We built Omnibase, a system that integrates heterogeneous data sources using an object-property-value model. With the help of Omnibase, our START natural language system can now access numerous heterogeneous data sources on the Web in a uniform manner, and it answers millions of user questions with high precision.
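The object-property-value idea can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `get` function, the wrapper names, and the two in-memory sources are hypothetical and are not taken from the chapter. The point is only that differently structured sources can sit behind a single lookup keyed by source, object, and property.

```python
from typing import Callable, Dict

# Hypothetical stand-ins for two differently structured sources.
# The layouts and names are illustrative assumptions, not the paper's.
FILM_ROWS = [
    {"title": "Gone with the Wind", "director": "Victor Fleming", "year": 1939},
]
COUNTRY_TEXT = {
    "France": "capital=Paris;continent=Europe",
}


def film_wrapper(obj: str, prop: str) -> str:
    """Look up a property in a row-oriented source."""
    for row in FILM_ROWS:
        if row["title"] == obj:
            return str(row[prop])
    raise KeyError(obj)


def country_wrapper(obj: str, prop: str) -> str:
    """Look up a property in a source that stores delimited text."""
    fields = dict(pair.split("=", 1) for pair in COUNTRY_TEXT[obj].split(";"))
    return fields[prop]


# One wrapper per source; each hides the source's native layout.
WRAPPERS: Dict[str, Callable[[str, str], str]] = {
    "films": film_wrapper,
    "countries": country_wrapper,
}


def get(source: str, obj: str, prop: str) -> str:
    """Uniform object-property-value lookup across all registered sources."""
    return WRAPPERS[source](obj, prop)


if __name__ == "__main__":
    print(get("films", "Gone with the Wind", "director"))
    print(get("countries", "France", "capital"))
```

In this sketch a caller never sees whether a value came from a table row or from parsed text; adding a source means registering one more wrapper.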

Author information

Authors and Affiliations

Artificial Intelligence Laboratory, 200 Technology Square, Cambridge, MA 02139
Boris Katz, Sue Felshin, Deniz Yuret, Ali Ibrahim, Jimmy Lin, Gregory Marton, Alton Jerome McFarland & Baris Temelkuran

Editor information

Editors and Affiliations

Department of Computer and Systems Sciences, Royal Institute of Technology, Forum 100, 16440, Kista, Sweden
Birger Andersson, Maria Bergholtz & Paul Johannesson

Cite this paper

Katz, B. et al. (2002). Omnibase: Uniform Access to Heterogeneous Data for Question Answering. In: Andersson, B., Bergholtz, M., Johannesson, P. (eds) Natural Language Processing and Information Systems. NLDB 2002. Lecture Notes in Computer Science, vol 2553. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36271-1_23

DOI: https://doi.org/10.1007/3-540-36271-1_23
Published: 28 February 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00307-6
Online ISBN: 978-3-540-36271-5
eBook Packages: Springer Book Archive
