Digital Repositories and Linked Data: Lessons Learned and Challenges

  • Conference paper
Knowledge Graphs and Semantic Web (KGSWC 2019)

Abstract

Universities and libraries use digital repositories to store their bibliographic, scientific, and institutional content, and make the corresponding metadata publicly available on the web and through the OAI-PMH protocol. However, such metadata is not descriptive enough for a document to be easily discoverable. The emergence of Semantic Web technologies has led digital repository providers to publish and enrich their content using Linked Data (LD) technologies, but institutions have adopted different generation approaches and, in certain cases, ad hoc solutions for particular use cases. None of them has compared existing approaches to determine which is the best solution before applying it. To address this question, we performed a benchmark study that compares two commonly used generation approaches, and we describe our experience, lessons learned, and challenges found while publishing a DSpace digital repository as LD. Results show that the most straightforward method for extracting data from a digital repository is the standard OAI-PMH protocol: its execution time is much shorter than that of the database approach, and the additional data cleaning it requires is minimal.
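
To make the OAI-PMH route more concrete, the following is a minimal sketch, not the authors' pipeline, of how Dublin Core records from an OAI-PMH `ListRecords` response could be turned into N-Triples. The sample XML, the base URI `http://example.org/resource/`, and the function names are assumptions made for illustration; a real harvester would also fetch pages over HTTP and follow resumption tokens.

```python
import xml.etree.ElementTree as ET

# Namespaces defined by the OAI-PMH 2.0 and Dublin Core specifications.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "oai_dc": "http://www.openarchives.org/OAI/2.0/oai_dc/",
    "dc": "http://purl.org/dc/elements/1.1/",
}
DC = NS["dc"]

# Hypothetical ListRecords response with a single oai_dc record.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:123456789/42</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>A "sample" thesis</dc:title>
          <dc:creator>Doe, Jane</dc:creator>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""


def escape_literal(text):
    """Escape the characters that must not appear raw in an N-Triples literal."""
    return (text.replace("\\", "\\\\").replace('"', '\\"')
                .replace("\n", "\\n").replace("\r", "\\r"))


def records_to_ntriples(xml_text, base_uri):
    """Return one N-Triples line per non-empty Dublin Core element."""
    root = ET.fromstring(xml_text)
    triples = []
    for record in root.iterfind(".//oai:record", NS):
        identifier = record.findtext("oai:header/oai:identifier", namespaces=NS)
        dc_root = record.find("oai:metadata/oai_dc:dc", NS)
        if identifier is None or dc_root is None:
            continue  # e.g. deleted records carry a header but no metadata
        # Assumed URI scheme: base URI plus the local part of the OAI identifier.
        subject = f"<{base_uri}{identifier.strip().rsplit(':', 1)[-1]}>"
        for element in dc_root:
            value = (element.text or "").strip()
            if not element.tag.startswith("{" + DC + "}") or not value:
                continue
            prop = element.tag.split("}", 1)[1]
            triples.append(f'{subject} <{DC}{prop}> "{escape_literal(value)}" .')
    return triples


if __name__ == "__main__":
    for line in records_to_ntriples(SAMPLE, "http://example.org/resource/"):
        print(line)
```

Every value is emitted as a plain literal here, which illustrates why OAI-PMH output typically still needs enrichment, such as linking creators to URIs, before it is useful as Linked Data.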

Notes

  1. http://www.dspace.org/
  2. http://dublincore.org/
  3. http://www.cedia.org.ec/
  4. http://www.opendoar.org/
  5. http://www.openarchives.org/pmh/
  6. https://github.com/DSpace/DSpace/tree/master/dspace/config/crosswalks/oai/metadataFormats
  7. http://openrefine.org/
  8. http://d2rq.org/
  9. http://wifo5-03.informatik.uni-mannheim.de/bizer/silk/
  10. https://www.w3.org/2001/sw/wiki/R2RML_Parser
  11. http://simile.mit.edu/wiki/OAI-PMH_RDFizer
  12. http://dspace.ucuenca.edu.ec/
  13. http://dspace.ucuenca.edu.ec/oai/request
  14. http://v2.sherpa.ac.uk/id/repository/4186
  15. http://creativecommons.org/licenses/by/4.0/legalcode
  16. http://www.w3.org/ns/odrl/2/
  17. http://purl.oclc.org/NET/ldr/ns#
  18. http://lov.okfn.org/dataset/lov/details/vocabularySpace_Library.html
  19. http://prefix.cc/
  20. https://github.com/santteegt/oai2rdf
  21. http://www.dbpedia-spotlight.org/
  22. https://www.w3.org/TR/annotation-model/
  23. http://190.15.141.102:8891/sparql.tpl
  24. http://190.15.141.66:8899/ucuenca/recurso
  25. https://github.com/epimorphics/elda
  26. Extract, Transform, Load (ETL) process in data warehousing.

Author information

Corresponding author

Correspondence to Santiago Gonzalez-Toral.

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Cite this paper

Gonzalez-Toral, S., Espinoza-Mejia, M., Saquicela, V. (2019). Digital Repositories and Linked Data: Lessons Learned and Challenges. In: Villazón-Terrazas, B., Hidalgo-Delgado, Y. (eds) Knowledge Graphs and Semantic Web. KGSWC 2019. Communications in Computer and Information Science, vol 1029. Springer, Cham. https://doi.org/10.1007/978-3-030-21395-4_4

  • DOI: https://doi.org/10.1007/978-3-030-21395-4_4

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-21394-7

  • Online ISBN: 978-3-030-21395-4

  • eBook Packages: Computer Science, Computer Science (R0)
