Abstract
Datasets on the Web of Data (WoD) are often published without a precise schema which may discourage their reuse. Methods for schema acquisition from linked data have been proposed that mainly exploit the regularities in property and/or value distributions in resources to discover potentially useful classes as homogeneous clusters. Yet the crucial task of interpreting and naming the discovered classes is left to the human analyst. We prone a more holistic approach to schema discovery that, beside clustering, assists the analyst by suggesting plausible names for clusters. In doing that we: (1) rely on concept analysis for class discovery from linked data and (2) exploit known DBpedia types and shared properties to form candidate names. An evaluation of our approach with a dataset from the WoD showed it performs well.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Lehmann, J., Völker, J. (eds.): Perspectives on Ontology Learning, vol. 18. IOS Press, Amsterdam (2014)
Delteil, A., Faron, C., Dieng, R.: Building concept lattices by learning concepts from RDF graphs annotating web documents. In: Priss, U., Corbett, D., Angelova, G. (eds.) ICCS-ConceptStruct 2002. LNCS, vol. 2393, pp. 191–204. Springer, Heidelberg (2002). doi:10.1007/3-540-45483-7_15
Völker, J., Niepert, M.: Statistical schema induction. In: Antoniou, G., Grobelnik, M., Simperl, E., Parsia, B., Plexousakis, D., Leenheer, P., Pan, J. (eds.) ESWC 2011. LNCS, vol. 6643, pp. 124–138. Springer, Heidelberg (2011). doi:10.1007/978-3-642-21034-1_9
Alam, M., Buzmakov, A., Codocedo, V., Napoli, A.: Mining definitions from RDF annotations using formal concept analysis. In: IJCAI 2015, Buenos Aires, Argentina (2015)
Cimiano, P., Hotho, A., Staab, S.: Comparing conceptual, divisive and agglomerative clustering for learning taxonomies from text. ECAI 2004, 435–439 (2004)
Ferré, S.: Conceptual navigation in RDF graphs with SPARQL-like queries. In: Kwuida, L., Sertkaya, B. (eds.) ICFCA 2010. LNCS (LNAI), vol. 5986, pp. 193–208. Springer, Heidelberg (2010). doi:10.1007/978-3-642-11928-6_14
Kirchberg, M., Leonardi, E., Tan, Y.S., Ko, R.K.L., Link, S., Lee, B.S.: Beyond rdf links-exploring the semantic web with the help of formal concepts. In: 9th Annual Semantic Web Challenge Conjunction with ISWC 2011 (2011)
Alam, M., Chekol, M.W., Coulet, A., Napoli, A., Smaïl-Tabbone, M.: Lattice based data access (LBDA): an approach for organizing and accessing linked open data in biology. In: d’Amato, C., Berka, P., Svátek, V., Wecel, K. (eds.) Data Mining on Linked Data Workshop (ECML/PKDD, 2013). DMoLD 2013, vol. 1082. Springer, Heidleberg (2013)
Reynaud, J., Toussaint, Y., Napoli, A.: Contribution to the classification of web of data based on formal concept analysis. In: What can FCA do for Artificial Intelligence (FCA4AI)(ECAI 2016) (2016)
Ferré, S., Ridoux, O., Sigonneau, B.: Arbitrary relations in formal concept analysis and logical information systems. In: Dau, F., Mugnier, M.-L., Stumme, G. (eds.) ICCS-ConceptStruct 2005. LNCS, vol. 3596, pp. 166–180. Springer, Heidelberg (2005). doi:10.1007/11524564_11
Rouane-Hacene, M., Huchard, M., Napoli, A., Valtchev, P.: Relational concept analysis: mining concept lattices from multi-relational data. Ann. Math. Artif. Intell. 67(1), 81–108 (2013). doi:10.1007/s10472-012-9329-3
Rutledge, L., van Ossenbruggen, J., Hardman, L.: Making RDF presentable: integrated global and local semantic web browsing. WWW 2005, pp. 199–206, Chiba, Japan, ACM (2005). doi:10.1145/1060745.1060777
d’Aquin, M., Motta, E.: Extracting relevant questions to an rdf dataset using formal concept analysis. In: Proceedings of the Sixth International Conference on Knowledge Capture, pp. 121–128. ACM (2011)
Chekol, M.W., Napoli, A.: An FCA framework for knowledge discovery in sparql query answers. In: Proceedings of the 2013th International Conference on Posters & Demonstrations Track-Volume 1035, pp. 197–200. CEUR-WS. Org (2013)
Wong, W., Liu, W., Bennamoun, M.: Ontology learning from text: a look back and into the future. ACM Comput. Surv. 44(4), 1–36 (2012)
Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.: DBpedia: a nucleus for a web of open data. In: Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.) ASWC/ISWC -2007. LNCS, vol. 4825, pp. 722–735. Springer, Heidelberg (2007). doi:10.1007/978-3-540-76298-0_52
Ganter, B., Wille, R.: Formal Concept Analysis: Mathematical Foundations. Springer, Heidelberg (1999)
Denecke, K., Erné, M., Wismath, S.L.: Galois Connections and Applications, vol. 565. Springer, Heidelberg (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Mehri, R., Valtchev, P. (2017). Mining Schema Knowledge from Linked Data on the Web. In: Li, G., Ge, Y., Zhang, Z., Jin, Z., Blumenstein, M. (eds) Knowledge Science, Engineering and Management. KSEM 2017. Lecture Notes in Computer Science(), vol 10412. Springer, Cham. https://doi.org/10.1007/978-3-319-63558-3_22
Download citation
DOI: https://doi.org/10.1007/978-3-319-63558-3_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-63557-6
Online ISBN: 978-3-319-63558-3
eBook Packages: Computer ScienceComputer Science (R0)