ABSTRACT
Over the last decades, the amount of digital documents has increased exponentially. Nevertheless, traditional document engineering methods are applied. Even worse, the long-term preservation issues have been neglected in standard document life cycle implementations.Our digital (cultural) heritage is, therefore, highly endangered by the silent obsolescence of data formats, software and hardware. Severe losses of information already happened. It is high time to implement concrete solutions.Fortunately numerous institutions already target these issues. Moreover, with the OAIS reference model1 a rich standardized conceptual framework is available, which already serves as implementation basis.2This paper discusses an extension to the OAIS reference model and illustrates a prototype implementation of a document life cycle that is enriched by functions for long-term preservation.More precisely, this paper aims to provide first solutions to the following three problem areas:
1. Detachment: OAIS defines no functions for the process of detaching digital documents prior to the ingest function. This detachment function is modeled in great detail and implemented for the provision of the so-called OAIS's submission information packages (SIP).
2. DBMS: OAIS defines a very complex functionality. We show how a standard database management system (DBMS) can support a wide variety of required functionalities in an integrated and homogenous way. Among others OAIS's data management, archival storage, and access are supported.
3. Metadata: So far, OAIS does not cover any aspects of the metadata generation. Here, we briefly discuss the (semi-)automatic generation of a metadata set.
In order to evaluate the feasibility of our approach, we built a first prototype. We carried out our experiments in close cooperation with the Bavarian State Library, Munich, which is engaged in numerous international initiatives dealing with the problem of long-term preservation. Our University Library also supported us by delivering a representative test set of digital publications.3We conclude our paper by presenting some lessons learned from our conceptual work and from our real world experiments.
- Oracle9i Application Developer's Guide -- XML Release 1 (9.0.1).Google Scholar
- Cedars Guide to The Distributed Digital Archiving Prototype Technical report, Universities of Leeds, Oxford, and Cambridge, 2002.Google Scholar
- Data Dictionary -- Technical Metadata for Digital Still Imgages -- Draft Technical report, National Information Standards Organization and AIIM International, 2002.Google Scholar
- B. Bergeron. Dark Ages II -- When the Digital Data Die Upper Saddle River: Prentice Hall, 2002. Google ScholarDigital Library
- U. Borghoff, P. R&246;dig, J. Scheffczyk, and L. Schmitz. Langzeitarchivierung Heidelberg: dpunkt.verlag, 2003.Google Scholar
- F. Catalina. XPERANTO: Bridging Relational Technology and XML Technical report, IBM Almadan Research Center, 2002.Google Scholar
- CCSDS. Reference Model for an Open Archival Information System (OAIS) Technical report, Consultative Committee for Space Data Systems (CCSDS), 2002.Google Scholar
- J. Cheng. IBM DB2 XML Extender, 02.2000. Technical report, IBM, 2000.Google Scholar
- DCMI. Recommendations Technical report, Dublin Core Metadata Initiative (DCMI). see http://dublincore.orgGoogle Scholar
- A. Herbst. Anwendungsorientiertes DB-Archivieren: Neue Konzepte zur Archivierung in Datenbanksystemen Berlin, Heidelberg, New York: Springer-Verlag, 1997.Google Scholar
- ISO/IEC. ISO/IEC 9075-9:2001 Information technology -- Database languages -- SQL -- Part 9: Management of External Data (SQL/MED) Technical report, ISO/IEC, 2001.Google Scholar
- R. A. Lorie. Long-Term Archiving of Digital Information Technical report, IBM, 2000.Google Scholar
- R. A. Lorie. A Methodology and System for Preserving Digital Data In Proceedings of the second ACM/IEEE-CS joint conference on Digital Libraries, 2002. Google ScholarDigital Library
- C. Lupovici. Metadata for long-term preservation, NEDLIB - LB 5648 Issue 1.0 Technical report, Bibliothèque nationale de France, 2000.Google Scholar
- R. Michel, A. Arora, K. Crooks, A. Lalla, and D. Shields. Data Links -- Managing Files Using DB2 Technical report, IBM, 2001.Google Scholar
- OCLC/RLG. Preservation Metadata for Digital Objects: A Review of the State of the Art, A White paper by the OCLC/RLG Working Group on Preservation Metadata Technical report, OCLC/RLG, 2001.Google Scholar
- OCLC/RLG. Preservation Metadata and the OAIS Information Model -- A Metadata Framework to support the Preservation of Digital Objects: A Report by the OCLC/RLG Working Group on Preservation Metadata Technical report, OCLC/RLG, 2002.Google Scholar
- P. R&246;dig, E. Pfeiffer, and U. Borghoff. Langzeitarchivierung digitaler Publikationen (English: Longterm-archiving of digital publications). Technical Report 2002-02, Fakultät für Informatik, Univ. der Bundeswehr München, June 2002. see http://ist.unibw-muenchen.de/LZA/lza_techrep.pdf.Google Scholar
- J. Rothenberg. An experiment in using emulation to preserve digital publications. Technical report, RAND-Europe, 2000.Google Scholar
- R. Schaarschmidt. Archivierung in Datenbanksystemen: Konzept und Sprache Stuttgart, Leipzig, Wiesbaden: Teubner, 2001.Google Scholar
- World Wide Web Consortium Extensible Markup Language (XML) 1.0 Second Edition W3C Recommendation Technical report, World Wide Web Consortium, 2000.Google Scholar
- World Wide Web Consortium Open Digital Rights Language (ODRL) Version 1.1 W3C Note Technical report, World Wide Web Consortium, 2002.Google Scholar
Index Terms
- Preservation of digital publications: an OAIS extension and implementation
Recommendations
Panel on digital preservation
JCDL '02: Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital librariesDigital information in any form is at risk. Software and hardware become obsolete, and versions and file formats change, making data inaccessible. Data stored in even the simplest form are in danger due to computer media degradation and obsolescence. On-...
Long-term digital preservation: preserving authenticity and usability of 3-D data
Long-term digital preservation, the process of maintaining digital objects through time to ensure continued access, has become a crucial issue in recent years. Whilst the amount of digitised information is constantly increasing, so too is the pace of ...
Building interoperable digital library services: MARIAN, open archives, and the NDLTD
SIGIR '01: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrievalIn this demonstration, we present interoperable and personalized search services for the Networked Digital Library of Theses and Dissertations (NDLTD). Using standard protocols and software, including those specified by the Open Archives Initiative (OAI)...
Comments