Skip to main content

A semantic network approach to semi-structured documents repositories

  • Structured Documents
  • Conference paper
  • First Online:
Research and Advanced Technology for Digital Libraries (ECDL 1997)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1324))

Included in the following conference series:

Abstract

Using database technology for the administration of digital libraries offers many advantages in a multi-user and distributed environment. However, conventional DBMS are not particularly suited to manage semi-structured data with heterogeneous, irregular, evolving structures as in the case of SGML documents found in digital libraries. To overcome the difficulties imposed by the rigid schema of conventional systems, several schema-less approaches have been proposed. Using instead unconstrained, extensible schemata offered by object-oriented semantic network systems, we are able both to map document specific structures as database classes, and to model the associated constraint information as integrated schema annotations. In this paper we present the benefits of this approach to create, access and process heterogeneous SGML documents, and in particular to exploit the shared semantics of evolving SGML structures. A respective application is currently being implemented in the context of the AQUARELLE project.

Work partially supported by European TELEMATICS Project AQUARELLE.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Description de l'Architecture Générale du Projet GEODOC. Technical report, Grif S.A., 78053 St Quentin en Yvelines Cedex, December 1993.

    Google Scholar 

  2. The Extensible Markup Language. Internet Draft, 1997.Availiable at http://www.jtauber.com/xml/.

    Google Scholar 

  3. S. Abiteboul. Querying Semi-Structured Data. In Foto Afrati and Phokion Kolaitis, editors, Database Theory-ICDT'97, volume LNCS 1186 of Lecture Notes in Computer Science, pages 1–18, Delphes, Greece, January 1997. Springer Verlag.

    Google Scholar 

  4. S. Abiteboul, D. Quass, J. McHugh, J. Widom, and J. Wiener. The Lorel Query Language for Semi-Structured Data. Journal of Digital Libraries, 1(1):68–88, November 1997.

    Google Scholar 

  5. A. Analyti, P. Constantopoulos, and N. Spyratos. On the Definition of Semantic Networks Semantics. Technical Report ICS/TR-187, Institute Of Computer Science-FORTH, February 1997. Available at http://www.ics.forth.gr/proj/isst /Publications/TechnicalReports.html.

    Google Scholar 

  6. T. Arnold-Moore, M. Fuller, B. Lowe, J. Thom, and R. Wilkinson. The ELF Data Model and SGQL Query Language for Structured Documents. In Proc. of the Australian Database Conference, pages 17–26, Adelaid, Australia, January 1995.

    Google Scholar 

  7. D. Barnard, L. Burnard, and C. M. Sperberg-McQueen. Lessons Learned From Using sgml in the Text Encoding Initiative. Computer & Interface, 18:3–10, 1996.

    Google Scholar 

  8. L. Bielawski and J. Boyle. Electronic Document Management Systems: A User Centered Approach for Creating, Distributing and Managing Online Publications. Prentice Hall, 1997.

    Google Scholar 

  9. G. Blake, M. Consens, P. Kilpelainen, and P. Larson. Text/Relational Database Management Systems: Harmonizing SQL and SGML. In ADBA'94, pages 267–280, 1994.

    Google Scholar 

  10. K. Böhm, K. Aberer, and E. Neuhold. Administering Structured Documents in Digital Libraries. In Digital Libraries-Current Issues, DL'94, Newark, NJ, USA, 1995. LNCS 916, Springer Verlang.

    Google Scholar 

  11. M. W. Bright, A. R. Hurson, and S. H. Pakzad. A Taxonomy and Current Issues in Multidatabase Systems. IEEE Computer, 25(3):50–59, March 1992.

    Google Scholar 

  12. P. Buneman, S. Davidson, G. Hillebrand, and D. Sucie. A Query Language and Optimization Techniques for Unstructured Data. In SIGMOD'96, pages 505–516, Montreal, Quebec, Canada, June 1996.

    Google Scholar 

  13. OCLC Online Computer Library Center. Fred: The SGML Grammar Builder. Available at “http://www.ocle.org:80/fred/”, 1995.

    Google Scholar 

  14. V. Christophides, S. Abiteboul, S. Cluet, and M. Scholl. From Structured Documents to Novel Query Facilities. In SIGMOD'94, pages 313–324, Minneapolis, Minnesota, USA, May 1994.

    Google Scholar 

  15. V. Christophides, S. Cluet, and G. Moerkotte. Evaluating Queries with Generalized Path Expressions. In SIGMOD'96, pages 413–422, Montreal, Quebec, Canada, June 1996.

    Google Scholar 

  16. V. Christophides and A. Rizk. Querying Structured Documents with Hypertext Links using OODBMS. In ECHT'94, pages 186–197, Edinburgh, United Kingdom, September 1994. ACM.

    Google Scholar 

  17. P. Constantopoulos. Cultural Documentation: The CLIO System. Technical Report 115, Institute of Computer Science, FORTH, January 1994.

    Google Scholar 

  18. L. Elasry.SGML-DBOO Stockage et Manipulation de Documents Structurés. Master's thesis, Université SORBONE, September 1992.

    Google Scholar 

  19. Euroclid. Le Parseur SGML d'Euroclid. Internal document, Euroclid, 12, Avenue des Prés 78180 Montigny le Bretonneux, 1991.

    Google Scholar 

  20. P. Francois. Generalized SGML Repositories: Requirements and Modelling. Computer Standards & Interfaces, 18:11–24, 1996.

    Google Scholar 

  21. P. Futtersack and Q.N. Vuong. Modélisation et Stockage de Documents SGML. Collection de notes internes de la Direction des Études et Recherches 95N000039, EDF-DER, Service IPN. Département SID. 1 Av. du Général-de-Gaulle, 92141 Clamart Cedex, 1995.

    Google Scholar 

  22. C. Goldfarb. The SGML Handbook. Clarendon Press, Oxford, 1990.

    Google Scholar 

  23. R. Goldman and J. Widom. DataGuides: Enabling Query Formulation and Optimization in Semi-Structured databases. Stanford Technical Report, 1997.

    Google Scholar 

  24. Institute Of Computer Science (FORTH)-Hellas. SIS-Semantic Index System, version 2.1 edition, May 1997.

    Google Scholar 

  25. ISO. Information Processing-Text and Office Systems-Standard Generalized Markup Language (SGML). ISO 8879, 1986.

    Google Scholar 

  26. ISO/IEC. Information Technology-Hypermedia/Time-based Structuring Language (HyTime). ISO/IEC 10744, 1992.

    Google Scholar 

  27. P. Kilpeläinen and D. Wood. Exceptions in SGML Document Grammars. Submitted for publication, 1995.

    Google Scholar 

  28. R. Light. Getting a handle on Exhibition Catalogues: the Project OHIO DTD. Available at “http://www.cimi.org/cimi”, Consortium for Interchange of Museum Information, 1995.

    Google Scholar 

  29. J. Le Maitre, E. Murisasco, and M. Rolbert. SmlgQL un Langage d'Interrogation de Documents SGML. In BDA'95, pages 431–446, Nancy, France, August 1995.

    Google Scholar 

  30. J. Mylopoulos, A. Borgida, M. Jarke, and M. Koubarakis. Telos: Representing knowledge about Information Systems. ACM Transactions on Information Systems, 8(4), October 1990.

    Google Scholar 

  31. A. Nica and E. A. Rundensteiner. Uniform Structured Document Handling using a Constraint-based Object Approach. In Digital Libraries: Research and Technology Advances, ADL'95 Forum, pages 83–101, McLean, Virginia, USA, May 1996. LNCS 1082, Springer-Verlag.

    Google Scholar 

  32. D. Raggett.HyperText Markup Language Specification Version 3.0. Internet Draft, March 1995. Avaliable at http://www.w3.org/hypertext/WWW/MarkUp/html3/CoverPage.html.

    Google Scholar 

  33. A. Ramfos, N.J. Fiddian, and W.A. Gray. Object-oriented to relational interschema meta-translation. In Workshop on heterogeneous databases, December 1989.

    Google Scholar 

  34. D. Raymond, F. Tompa, and D. Wood. From Data Representation to Data Model: Meta-semantics Issues in the Evolution of SGML. Computer & Interface, 18:25–36, 1996.

    Google Scholar 

  35. A. Rizk, F. Malézieux, and M. Scholl.Analyse des éléments du Système d'Information: Définition SGML de la Struture des Dossiers de l'Inventaire. Convention de recherche n 295b212 0016008011, Euroclid, 1996.

    Google Scholar 

  36. J. F. Roddick. A Survey of Schema Versioning Issues for Database Systems. Information and Software Technology, 37(7):383–393, 1995.

    Article  Google Scholar 

  37. R. Sacks-Davis, W. Wen, A. Kent, and K. Ramamohanarao. Complex Object Support for a Document Database System. In Thirteenth Australian Computer Science Conference, pages 322–333, Victoria, Australia, 1990. Monash University.

    Google Scholar 

  38. A. P. Sheth and J. A. Larson. Federated Database Systems for Managing Distributed Heterogeneous, and Autonomous Databases. ACM Computing Surveys, 22(3):183–236, September 1990.

    Article  Google Scholar 

  39. J. Warmer and S. Egmond. The Implementation of the Amsterdam SGML Parser. Electronic Publishing, 2(2):65–90, July 1989.

    Google Scholar 

  40. D. Wood. Standard Generalized Markup Language: Mathematical and Philosophical Issues. In Computer Science Today: Recent Trends and Developments. LNCS 1000, 1995.

    Google Scholar 

  41. R. Zicari. A Framework for Schema Updates in an Object-Oriented Database system. In IEEE Data Engineering Conference, Kobe, Japan, 1991.

    Google Scholar 

  42. J. Zobel, J. A. Thom, and R. Sacks-Davis. Efficiency of Nesting Relational Document Database Systems. In VLDB'91, pages 91–102, Barcelona, Catalonia, Spain, September 1991.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Carol Peters Costantino Thanos

Rights and permissions

Reprints and permissions

Copyright information

© 1997 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Christophides, V., Dörr, M., Fundulaki, I. (1997). A semantic network approach to semi-structured documents repositories. In: Peters, C., Thanos, C. (eds) Research and Advanced Technology for Digital Libraries. ECDL 1997. Lecture Notes in Computer Science, vol 1324. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0026735

Download citation

  • DOI: https://doi.org/10.1007/BFb0026735

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-63554-3

  • Online ISBN: 978-3-540-69597-4

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics