Abstract
The novel e-Science’s data-centric paradigm has proved that interlinking publications and research data objects coming from different realms and data sources (e.g. publication repositories, data repositories) makes dissemination, re-use, and validation of research activities more effective. Scholarly Communication Infrastructures (SCIs) are advocated for bridging such data sources by offering an overlay of services for identification, creation, and navigation of relationships among objects of different nature. Since realization and maintenance of such infrastructures is in general very cost-consuming, in this paper we propose a lightweight approach for “preliminary analysis of data source interlinking” to help practitioners at evaluating whether and to what extent realizing them can be effective. We present Data Searchery, a configurable tool delivering a service for relating objects across data sources, be them publications or research data, by identifying relationships between their metadata descriptions in real-time.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Mendeley, http://www.mendeley.com/.
- 2.
ORCID, http://orcid.org/.
- 3.
The Research Data Alliance, http://europe.rd-alliance.org/.
- 4.
The DataCite Initiative, http://datacite.org.
- 5.
NARCIS, http://www.narcis.nl/.
- 6.
The OpenAIRE project, http://www.openaire.eu/en.
- 7.
The DRIVER repository, http://www.driver-repository.eu/.
- 8.
Google Scholar, http://scholar.google.com.
- 9.
Apache Solr, http://lucene.apache.org/solr/.
- 10.
Elasticsearch, http://www.elasticsearch.org/.
- 11.
D-Net Software Toolkit, http://www.d-net.research-infrastructures.eu.
- 12.
WhatIzIt - EBI, http://www.ebi.ac.uk/webservices/whatizit/.
- 13.
PANGAEA - Data Publisher for Earth & Environmental Science, http://www.pangaea.de.
- 14.
Figshare.com, http://figshare.com/.
References
Bourne, P.E., Clark, T.W., Dale, R., de Waard, A., Herman, I., Hovy, E.H., Shotton, D.: Improving the future of research communications and e-scholarship (Dagstuhl perspectives workshop 11331). Dagstuhl Manifestos 1(1), 41–60 (2012)
Hogenaar, A.: What is an enhanced publication? http://www.openaire.eu/en/component/content/article/76-highlights/344-a-short-introduction-to-enhanced-publications
Gray, J.: A transformed scientific method. In: Hey, T., Tansley, S., Tolle, K. (eds.) The Fourth Paradigm: Data-Intensive Scientific Discovery. Microsoft Research, Redmond (2009)
Reilly, S., Schallier, W., Schrimpf, S., Smit, E., Wilkinson, M.: Report on integration of data and publications. ODE Opportunities for Data Exchange
Callaghan, S., Donegan, S.: Making data a first class scientific output: data citation and publication by NERC’s environmental data centres. Int. J. Digit. Curation 7(1), 107–113 (2012)
Chavan, V., Penev, L.: The data paper: a mechanism to incentivize data publishing in biodiversity science. BMC Bioinform. 12(Suppl 15), S2 (2011)
Hoogerwerf, M., Lösch, M., Schirrwagen, J., Callaghan, S., Manghi, P., Iatropoulou, K., Keramida, D., Rettberg, N.: Linking data and publications: towards a cross-disciplinary approach. Int. J. Digit. Curation 8(1), 244–254 (2013)
Wallis, J.C., Rolando, E., Borgman, C.L.: If we share data, will anyone use them? data sharing and reuse in the long tail of science and technology. PLoS ONE 8(7), e67332 (2013)
Castelli, D., Manghi, P., Thanos, C.: A vision towards scientific communication infrastructures - on bridging the realms of research digital libraries and scientific data centers. J. Digit. Libr. 13(3/4), 155–169 (2013)
Manghi, P., Bolikowski, L., Manola, N., Shirrwagen, J., Smith, T.: Openaireplus: the European scholarly communication data infrastructure. D-Lib Mag. 18(9–10) (2012). http://www.bibsonomy.org/bibtex/23435c839e8f925c6ca94a4c2972015b1/dblp
Manghi, P., Manola, N., Horstmann, W., Peters, D.: An infrastructure for managing EC funded research output - the openaire project. Grey J. (TGJ): Int. J. Grey Lit. 6(1), 31–40 (2010)
Attwood, T.K., Kell, D.B., McDermott, P., Marsh, J., Pettifer, S.R., Thorne, D.: Utopia documents: linking scholarly literature with research data. Bioinformatics 26(18), 568–574 (2010)
Bruce, T.R., Hillmann, D.: The Continuum of Metadata Quality: Defining, Expressing, Exploiting. American Library Association, Chicago (2004)
Tani, A., Candela, L., Castelli, D.: Dealing with metadata quality: the legacy of digital library efforts. Inf. Process. Manag. 49(6), 1194–1205 (2013)
Feijen, M., Horstmann, W., Manghi, P., Robinson, M., Russell, R.: DRIVER: Building the Network for Accessing Digital Repositories across Europe. In: Ariadne Magazine, vol. 53, pp. 1–4, Ariadne (2007). http://puma.isti.cnr.it/dfdownload.php?ident=/cnr.isti/2007-A0-047
Manghi, P., Mikulicic, M., Candela, L., Castelli, D., Pagano, P.: Realizing and maintaining aggregative digital library systems: D-net software toolkit and oaister system. D-Lib Mag. 16(3/4) (2010). http://www.bibsonomy.org/bib/bibtex/2d5fb59f6245dc730c4d86882d7bfb18d/dblp
Berners-Lee, T.: Linked data. http://www.w3.org/DesignIssues/LinkedData.html
Wölger, S., Siorpaes, K., Bürger, T., Simperl, E., Thaler, S., Hofer, C.: A survey on data interlinking methods. Technical report, Semantic Technology Institute (STI), University of Insbruck (March 2011)
Nikolaidou, P.T., Shaeles, S.N., Karakos, A.S.: MusicPedia: retrieving and merging-interlinking music metadata. Int. J. Comput. 3(8) (2011)
Rinke Hoekstra, P.G.: Linkitup: Link discovery for research data. In: Proceedings of the AAAI Fall Symposium on Discovery Informatics (2013)
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Mannocci, A., Manghi, P. (2014). Preliminary Analysis of Data Sources Interlinking. In: Bolikowski, Ł., Casarosa, V., Goodale, P., Houssos, N., Manghi, P., Schirrwagen, J. (eds) Theory and Practice of Digital Libraries -- TPDL 2013 Selected Workshops. TPDL 2013. Communications in Computer and Information Science, vol 416. Springer, Cham. https://doi.org/10.1007/978-3-319-08425-1_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-08425-1_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08424-4
Online ISBN: 978-3-319-08425-1
eBook Packages: Computer ScienceComputer Science (R0)