Authors:
Raphael do Vale Amaral Gomes
1
;
Marco A. Casanova
1
;
Giseli Rabello Lopes
1
and
Luiz André P. Paes Leme
2
Affiliations:
1
PUC-Rio, Brazil
;
2
UFF, Brazil
Keyword(s):
Focused Crawler, Tripleset Recommendation, Linked Data.
Related
Ontology
Subjects/Areas/Topics:
Cloud Computing
;
Enterprise Information Systems
;
Semantic Web Technologies
;
Services Science
;
Software Agents and Internet Computing
Abstract:
The Linked Data best practices recommend publishers of triplesets to use well-known ontologies in the triplication process and to link their triplesets with other triplesets. However, despite the fact that extensive lists of open ontologies and triplesets are available, most publishers typically do not adopt those ontologies and link their triplesets only with popular ones, such as DBpedia and Geonames. This paper presents a metadata crawler for Linked Data to assist publishers in the triplification and the linkage processes. The crawler provides publishers with a list of the most suitable ontologies and vocabulary terms for triplification, as well as a list of triplesets that the new tripleset can be most likely linked with. The crawler focuses on specific metadata properties, including subclass of, and returns only metadata, hence the classification “metadata focused crawler”.