Skip to main content
Log in

Dependency management for digital preservation using semantic web technologies

  • Published:
International Journal on Digital Libraries Aims and scope Submit manuscript

Abstract

The preservation of digital objects is a topic of prominent importance for archives and digital libraries. In this article, we focus on the problem of preserving the intelligibility of digital objects. We formalize the problem in terms of dependencies and specify a number of basic intelligibility-related tasks. In parallel, we introduce a preservation scenario as a means for clarifying the pros and cons of various representation and modeling languages that are used for the problem at hand, which reveals the benefits of adopting Semantic Web (SW) languages as a representation framework. To this end, we propose a minimal core ontology for representing intelligibility-related dependencies along with methodological hints for extending it. Finally, we report empirical and experimental results from applying the proposed approach on real data sets. It is worth mentioning that this approach can be used not only on SW-based repositories or archives, but also on those that are based on conventional approaches and languages (like EAST, DEDSL, XFDU/SAFE).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  1. CASPAR (Cultural, Artistic and Scientific knowledge for Preservation, Access and Retrieval). FP6- 2005-IST-033572. http://www.casparpreserves.eu/

  2. Belguidoum M., Dagnat F.: Dependency Management in Software Component Deployment. Electronic Notes in Theoretical Computer Science 182, 17–32 (2007)

    Article  Google Scholar 

  3. Brickley, D., Guha, R.V.: Resource Description Framework (RDF) Schema Specification: Proposed Recommendation, W3C, March 1999. http://www.w3.org/TR/1999/PR-rdf-schema-19990303

  4. Cheney, J., Lagoze, C., Botticelli, P.: Towards a theory of information preservation. In: Proceedings of the 5th European Conference on Research and Advanced Technology for Digital Libraries, pp. 340–351, ECDL’01, London, UK. Springer-Verlag, Berlin (2001)

  5. Cooper B.F., Garcia-Molina H.: InfoMonitor: unobtrusively archiving a World Wide Web server. Int. J. Digit. Libr. 5(2), 106–119 (2005)

    Article  Google Scholar 

  6. DEDSL Language. http://east.cnes.fr/english/index.html

  7. Day, M.: Integrating metadata schema registries with digital preservation systems to support interoperability: a proposal. In: Proceedingss of DC 2003. Supporting Communities of Discourse and Practice-Metadata Research & Applications, vol. 2, Seattle, Washington (USA), September (2003)

  8. EAST Language. http://east.cnes.fr/english/page_east.html

  9. Ferreira M., Baptista A.A., Ramalho J.C.: An intelligent decision support system for digital preservation. Int. J. Digit. Libr. 6(4), 295–304 (2007)

    Article  Google Scholar 

  10. Franch, X., Maiden, N.A.M.: Modeling component dependencies to inform their selection. In: Proceedings of the 2nd International Conference on COTS-Based Software Systems, ICCBSS’03, Ottawa, Canada, February, 2003. Springer, Berlin (2003)

  11. Hedstrom M.: Digital preservation: a time bomb for digital libraries. Comput. Hum. 31(3), 189–202 (1997)

    Article  Google Scholar 

  12. Hunter, J., Choudhury, S.: A semi-automated digital preservation system based on semantic web services. In: Proceedings of the 4th ACM/IEEE-CS Joint Conference on Digital Libraries, JCDL’04, pp. 269–278, New York, NY, USA. ACM Press, New Yoek (2004)

  13. Hunter J., Choudhury S.: PANIC: an integrated approach to the preservation of composite digital objects using Semantic Web services. Int. J. Digit. Libr. 6(2), 174–183 (2006)

    Article  Google Scholar 

  14. International Organization For Standardization (OAIS): Open Archival Information System—Reference Model. Ref. No ISO 14721:2003 (2003)

  15. Jarrar, M., Meersman, R.: Formal ontology engineering in the DOGMA approach. In: Proceedings of the International Conference on Ontologies, Databases and Applications of Semantics, ODBase’02, pp. 1238–1254. Springer, Berlin (2002)

  16. Karvounarakis, G., Christophides, V., Plexousakis, D.: Querying Semistructured (Meta)data and Schemas on the Web: The case of RDF & RDFS. Technical Report 269. ICS-FORTH, Heraklion (2000). (http://www.ics.forth.gr/proj/isst/RDF/rdfquerying.pdf)

  17. Lee K.H., Slattery O., Lu R., Tang X., McCrary V.: The state of the art and practice in digital preservation. J. Res. Natl. Inst. Stand. Technol. 107(1), 93–106 (2002)

    Google Scholar 

  18. Lin C.H., Hong J.S., Doerr M.: Issues in an inference platform for generating deductive knowledge: a case study in cultural heritage digital libraries using the CIDOC CRM. Int. J. Digit. Libr. 8(2), 115–132 (2008)

    Article  Google Scholar 

  19. Lucas, A.: XFDU packaging contribution to an implementation of the OAIS reference model. In: Proceedings of the International Conference, PV’2007. Ensuring the Long-Term Preservation and Value Adding to Scientific and Technical Data, Edinburgh, November (2005)

  20. Magiridou, M., Sahtouris, S., Christophides, V., Koubarakis, M.: RUL: A declarative update language for RDF. In: Proceedingss of the 4th International Conference on the Semantic Web, ISWC-2005, Galway, Ireland, November (2005)

  21. Maniatis P., Roussopoulos M., Giuli T.J., Rosenthal D.S.H., Baker M.: The LOCKSS peer-to-peer digital preservation system. ACM Trans. Comput. Syst. (TOCS) 23(1), 2–50 (2005)

    Article  Google Scholar 

  22. Marketakis, Y., Tzanakis, M., Tzitzikas, Y.: Prescan: towards automating the preservation of digital objects. In: Proceedings of the International ACM Conference on Management of Emergent Digital Ecosystems, MEDES’09, pp. 404–411, Lyon, France, October (2009)

  23. Nelson M.L., McCown F., Smith J.A., Klein M.: Using the web infrastructure to preserve web pages. Int. J. Digit. Libr. 6(4), 327–349 (2007)

    Article  Google Scholar 

  24. Noy, N.F., Musen, M.A.: PromptDiff: A fixed-point algorithm for comparing ontology versions. In: Proceedings of the 18th National Conference on Artificial Intelligence, AAAI-2002, pp. 744–750, Edmonton, Alberta, (2002)

  25. Papavassiliou, V., Flouris, G., Fundulaki, I., Kotzinos, D., Christophides, V.: On detecting high-level changes in RDF/S KBs. In: Proceedings of the 8th International Semantic Web Conference, ISWC 2009, October 2009

  26. PLANETS: Digital Preservation Research and Technology. HPRN-CT-2002-00308. http://www.planets-project.eu

  27. Rajasekar, A., Moore, R., Berman, F., Schottlaender, B.: Digital preservation lifecycle management for multi-media collections. In: Proceedings of the 8th International Conference on Asian Digital Libraries, ICADL, vol. 3815, pp. 380–384, December (2005)

  28. Rauch, C., Rauber, A.: Preserving digital media: towards a preservation solution evaluation metric. In: Proceedings of the 7th International Conference on Asian Digital Libraries, ICADL’04, pp. 203–212, Shanghai, China, (2004)

  29. Rauch, C., Franz, P., Strodl, S., Rauber, A.: Evaluating preservation strategies for audio and video files. In: Proceedings of the DELOS Digital Repositories Workshop, Heraklion, Crete, Greece, (2005)

  30. Ross S., Hedstrom M.: Preservation research and sustainable digital libraries. Int. J. Digit. Libr. 5(4), 317–324 (2005)

    Article  Google Scholar 

  31. Stenzhorn H., Srinivas K., Samwald M., Ruttenberg A.: Simplifying access to large-scale health care and life sciences datasets. Lect. Notes Comput. Sci. 5021, 864–868 (2008)

    Article  Google Scholar 

  32. Strodl, S., Becker, C., Neumayer, R., Rauber, A.: How to choose a digital preservation strategy: evaluating a preservation planning procedure. In: Proceedings of the 2007 conference on Digital libraries, pp. 29–38. ACM Press, New York (2007)

  33. Sunagawa, E., Kozaki, K., Kitamura, Y., Mizoguchi, R.: An environment for distributed ontology development based on dependency management. In: Proceedings of the 2nd International Semantic Web Conference, ISWC’03, pp. 453–468. Springer, Berlin (2003)

  34. The Technical Registry PRONOM (The National Archives). (http://www.nationalarchives.gov.uk/pronom)

  35. Theodoridou M., Tzitzikas Y., Doerr M., Marketakis Y., Melessanakis V.: Modeling and querying provenance by extending using CIDOC CRM. J Distrib. Parallel Databases 27, 169–210 (2010)

    Article  Google Scholar 

  36. Thibodeau, K.: Overview of technological approaches to digital preservation and challenges in coming years. Council on Library and Information Resources (CLIR). The State of Digital preservation: An International Perspective, April (2002)

  37. Tzitzikas, Y.: Dependency Management for the preservation of digital information. In: Proceedings of the 18th International Conference on Database and Expert Systems Applications, DEXA’07, Regensburg, Germany, September 2007. Springer-Verlag, Berlin (2007)

  38. Tzitzikas, Y.: On preserving the intelligibility of digital objects through dependency management. In: Proceedings of the International Conference PV’2007. Ensuring the Long-Term Preservation and Value Adding to Scientific and Technical Data, Oberpfaffenhofen, Munich, October (2007)

  39. Tzitzikas, Y., Flouris, G.: Mind the (intelligibily) gap. In: Proceedings of the 11th European Conference on Research and Advanced Technology for Digital Libraries, ECDL’07, Budapest, Hungary, September 2007. Springer-Verlag, Berlin (2007)

  40. Tzitzikas Y., Kotzinos D., Theoharis Y.: On ranking RDF schema elements (and its application in visualization). J. Univ. Comput. Sci. 13(12), 1854–1880 (2007)

    Google Scholar 

  41. Vieira, M., Richardson, D.: Analyzing dependencies in large component-based systems. In: Proceedings of the 17th IEEE International Conference on Automated Service Engineering, ASE’02. IEEE Computer Society, Los Alamitos (2002)

  42. Vieira, M., Dias, M., Richardson, D.J.: Describing dependencies in component access points. In: Proceedings of the 23rd International Conference on Software Engineering, ICSE’01, pp. 115–118, Toronto, Canada, (2001)

  43. Walter, M., Trinitis, C., Karl, W.: OpenSESAME: an intuitive dependability modeling environment supporting inter-component dependencies. In: Proceedings of Pacific Rim International Symposium on Dependable Computing, pp. 76–83, Seoul, Korea, (2001)

  44. XFDU development site. (http://sindbad.gsfc.nasa.gov/xfdu)

  45. Zeginis, D., Tzitzikas, Y., Christophides, V.: On the foundations of computing deltas between rdf models. In: Proceedings of the 6th International Semantic Web Conference, ISWC/ASWC’07, pp. 637–651, Busan, Korea, November (2007)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yannis Tzitzikas.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Marketakis, Y., Tzitzikas, Y. Dependency management for digital preservation using semantic web technologies. Int J Digit Libr 10, 159–177 (2009). https://doi.org/10.1007/s00799-010-0058-0

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00799-010-0058-0

Keywords

Navigation