Skip to main content

Technical Usability of Wikidata’s Linked Data

Evaluation of Machine Interoperability and Data Interpretability

  • Conference paper
  • First Online:
Business Information Systems Workshops (BIS 2019)

Abstract

Wikidata is an outstanding data source with potential application in many scenarios. Wikidata provides its data openly in RDF. Our study aims to evaluate the usability of Wikidata as a data source for robots operating on the web of data, according to specifications and practices of linked data, the Semantic Web and ontology reasoning. We evaluated from the perspective of two use cases of data crawling robots, which are guided by our general motivation to acquire richer data for Europeana, a data aggregator from the Cultural Heritage domain. The first use case regards general data consumption applications based on RDF, RDF-Schema, OWL, SKOS and linked data. The second case regards applications that explore semantics relying on Schema.org and SKOS. We conclude that a human operator must assist linked data applications to interpret Wikidata’s RDF because of the choices that were taken at Wikidata in the definition of its expression in RDF. The semantics of the RDF output from Wikidata is “locked-in” by the usage of Wikidata’s own ontology, resulting in the need for human intervention. Wikidata is only a few steps away from high quality machine interpretation, however. It contains extensive alignment data to RDF, RDFS, OWL, SKOS and Schema.org, but a machine interpretation of those alignments can only be done if some essential Wikidata alignment properties are known.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    https://europeana.eu.

  2. 2.

    https://github.com/netwerk-digitaal-erfgoed/.

  3. 3.

    https://bbcarchdev.github.io/res/.

  4. 4.

    https://bbcarchdev.github.io/res/collections.

  5. 5.

    The Europeana Network is a community of 1,700 experts with the shared mission to expand and improve access to Europe's digital cultural heritage, in the organization they work for and/or by contributing to shape Europeana’s services.

  6. 6.

    https://www.wikidata.org/wiki/Property:P727.

  7. 7.

    https://github.com/nfreire/Open-Data-Acquisition-Framework/blob/master/opaf-documentation/SpecifyingLodDatasetForEuropeana.md.

  8. 8.

    https://pro.europeana.eu/project/data-quality-committee.

  9. 9.

    https://github.com/nfreire/data-aggregation-lab/blob/master/data-aggregation-casestudies/documentation/wikidata/SchemaOrg-ontology-alignments-listing.md.

References

  1. Niggermann, E., Cousins, J., Sanderhoff, M.: Europeana Business Plan 2018 ‘Democratizing culture’. Europeana Foundation (2018). https://pro.europeana.eu/files/Europeana_Professional/Publications/Europeana_Business_Plan_2018.pdf

  2. Rietveld, L.: Publishing and Consuming Linked Data: Optimizing for the Unknown. Studies on the Semantic Web, vol. 21. IOS Press, Amsterdam (2016)

    Google Scholar 

  3. Radulovic, F., Mihindukulasooriya, N., García-Castro, R., Gomez-Pérez, A.: A comprehensive quality model for linked data. In: Semantic Web, vol. 9, no. 1/2018. IOS Press (2018)

    Google Scholar 

  4. Beek, W., Rietveld, L., Ilievski, F., Schlobach, S.: LOD lab: scalable linked data processing. In: Pan, J., et al. (eds.) Reasoning Web 2016. LNCS, vol. 9885, pp. 124–155. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-49493-7_4

    Chapter  Google Scholar 

  5. Beek, W., Rietveld, L., Schlobach, S., van Harmelen, F.: LOD Laundromat: why the Semantic Web needs centralization (even if we don’t like it). In: IEEE Internet Computing, vol. 20, no. 2. IEEE (2016)

    Google Scholar 

  6. Fernández, J.D., Beek, W., Martínez-Prieto, M.A., Arias, M.: LOD-a-lot. In: d’Amato, C., et al. (eds.) ISWC 2017. LNCS, vol. 10588, pp. 75–83. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-68204-4_7

    Chapter  Google Scholar 

  7. Hogan, A.: Reasoning Techniques for the Web of Data. Studies on the Semantic Web, vol. 19. IOS Press, Amsterdam (2014)

    Google Scholar 

  8. Simou, N., Chortaras, A., Stamou, G., Kollias, S.: Enriching and publishing cultural heritage as linked open data. In: Ioannides, M., Magnenat-Thalmann, N., Papagiannakis, G. (eds.) Mixed Reality and Gamification for Cultural Heritage, pp. 201–223. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-49607-8_7

    Chapter  Google Scholar 

  9. Hyvönen, E.: Publishing and using cultural heritage linked data on the semantic web. In: Ding, Y., Groth, P. (eds.) Synthesis Lectures on the Semantic Web: Theory and Technology (2012). https://doi.org/10.2200/s00452ed1v01y201210wbe003

  10. Jones, E., Seikel, M. (eds.): Linked Data for Cultural Heritage. Facet Publishing, London (2016)

    Google Scholar 

  11. Meijer, E., Valk, S.: A distributed network of heritage information. White paper (2017). https://github.com/netwerk-digitaal-erfgoed/general-documentation/blob/master/Whitepaper%20A%20distributed%20network%20of%20heritage%20information.md

  12. Rietveld, L., Verborgh, R., Beek, W., Vander Sande, M., Schlobach, S.: Linked data-as-a-service: the semantic web redeployed. In: Gandon, F., Sabou, M., Sack, H., d’Amato, C., Cudré-Mauroux, P., Zimmermann, A. (eds.) ESWC 2015. LNCS, vol. 9088, pp. 471–487. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-18818-8_29

    Chapter  Google Scholar 

  13. Freire, N., Manguinhas, H., Isaac, A., Robson, G., Howard, J.B.: Web technologies: a survey of their applicability to metadata aggregation in cultural heritage. In: Chan, L., Loizides, F. (eds.) Expanding Perspectives on Open Science: Communities. Cultures and Diversity in Concepts and Practices. IOS Press, Amsterdam (2018). Inf. Serv. Use J. 37(4)

    Google Scholar 

  14. Freire, N., Robson, G., Howard, J.B., Manguinhas, H., Isaac, A.: Metadata aggregation: assessing the application of IIIF and sitemaps within cultural heritage. In: Kamps, J., Tsakonas, G., Manolopoulos, Y., Iliadis, L., Karydis, I. (eds.) TPDL 2017. LNCS, vol. 10450, pp. 220–232. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-67008-9_18

    Chapter  Google Scholar 

  15. Freire, N., Charles, V., Isaac, A.: Evaluation of Schema.org for aggregation of cultural heritage metadata. In: Gangemi, A., et al. (eds.) ESWC 2018. LNCS, vol. 10843, pp. 225–239. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-93417-4_15

    Chapter  Google Scholar 

  16. Freire, N., Meijers, E., Voorburg, R., Cornelissen, R., Isaac, A., de Valk, S.: Aggregation of linked data: a case study in the cultural heritage domain. In: 2018 IEEE International Conference on Big Data (Big Data). IEEE (2018)

    Google Scholar 

  17. Curé, O., Blin, G. (eds.): RDF Database Systems, pp. 41–80. Morgan Kaufmann (2015). Chapter Three - RDF and the Semantic Web Stack. https://doi.org/10.1016/b978-0-12-799957-9.00003-1

  18. Király, P., Stiller, J., Charles, V., Bailer, W., Freire, N.: Evaluating data quality in Europeana: metrics for multilinguality. In: Garoufallou, E., Sartori, F., Siatri, R., Zervas, M. (eds.) MTSR 2018. CCIS, vol. 846, pp. 199–211. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-14401-2_19

    Chapter  Google Scholar 

  19. Erxleben, F., Günther, M., Krötzsch, M., Mendez, J., Vrandečić, D.: Introducing Wikidata to the linked data web. In: Mika, P., et al. (eds.) ISWC 2014. LNCS, vol. 8796, pp. 50–65. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-11964-9_4

    Chapter  Google Scholar 

Download references

Acknowledgements

This work was partly supported by Portuguese national funds through Fundação para a Ciência e a Tecnologia (FCT) with reference UID/CEC/50021/2019, and by the European Commission under contract number 30-CE-0885387/00-80.e.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nuno Freire .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Freire, N., Isaac, A. (2019). Technical Usability of Wikidata’s Linked Data. In: Abramowicz, W., Corchuelo, R. (eds) Business Information Systems Workshops. BIS 2019. Lecture Notes in Business Information Processing, vol 373. Springer, Cham. https://doi.org/10.1007/978-3-030-36691-9_47

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-36691-9_47

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-36690-2

  • Online ISBN: 978-3-030-36691-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics