Skip to main content

Uniform structured document handling using a constraint-based object approach

  • Document Handling and Information Retrieval
  • Conference paper
  • First Online:
Digital Libraries Research and Technology Advances (ADL 1995)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1082))

Included in the following conference series:

Abstract

Complex multimedia document handling, including the modeling, decomposition, and search across digital documents, is one of the primary services that must be provided by digital library systems. In this paper, we present a general approach for handling structured documents (e.g., SGML documents) by exploiting object-oriented database technology. For this purpose, we propose a constraint-based object model capable of capturing in a uniform manner all SGML constructs typically used to encode the structural organization of complex documents. We present a general strategy for mapping arbitrary document types (e.g., article, journal, and book DTDs) expressed using standard SGML into our model. Most importantly, we demonstrate that our model is designed to handle the integration of diverse document types into one integrated schema, thus avoiding the generating of numerous redundant class definitions for similar document subtypes. The resulting document management system DMS is thus capable of supporting the dynamic addition of new document types, and of uniformly processing queries spanning across multiple document types. In this paper, we also describe the implementation of our approach on the commercial DBMS system Illustra to demonstrate that the ease with which our approach can be realized on current OODB technology — without requiring any specialpurpose constructs. Our DMS system provides support for integrated querying of both structural as well as content-based predicates across arbitrarily complex document types.

This research has been funded in part by the joint NSF/ARPA/NASA Digital Libraries Initiative under CERA IRI-9411287, and by NSF under grants RIA #IRI-9309076 and NYI #IRI-9457609. We also thank Illustra Inc. to provide us with the University Innovation Equipment Award.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Author information

Authors and Affiliations

Authors

Editor information

Nabil R. Adam Bharat K. Bhargava Milton Halem Yelena Yesha

Rights and permissions

Reprints and permissions

Copyright information

© 1996 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Nica, A., Rundensteiner, E.A. (1996). Uniform structured document handling using a constraint-based object approach. In: Adam, N.R., Bhargava, B.K., Halem, M., Yesha, Y. (eds) Digital Libraries Research and Technology Advances. ADL 1995. Lecture Notes in Computer Science, vol 1082. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0024605

Download citation

  • DOI: https://doi.org/10.1007/BFb0024605

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-61410-4

  • Online ISBN: 978-3-540-68527-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics