Skip to main content

Mining Schema Knowledge from Linked Data on the Web

  • Conference paper
  • First Online:
Knowledge Science, Engineering and Management (KSEM 2017)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10412))

  • 1889 Accesses

Abstract

Datasets on the Web of Data (WoD) are often published without a precise schema which may discourage their reuse. Methods for schema acquisition from linked data have been proposed that mainly exploit the regularities in property and/or value distributions in resources to discover potentially useful classes as homogeneous clusters. Yet the crucial task of interpreting and naming the discovered classes is left to the human analyst. We prone a more holistic approach to schema discovery that, beside clustering, assists the analyst by suggesting plausible names for clusters. In doing that we: (1) rely on concept analysis for class discovery from linked data and (2) exploit known DBpedia types and shared properties to form candidate names. An evaluation of our approach with a dataset from the WoD showed it performs well.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.w3.org/TR/REC-rdf-syntax/.

  2. 2.

    https://www.w3.org/TR/rdf-schema/.

  3. 3.

    http://dbpedia.org/.

  4. 4.

    http://en.wikipedia.org/wiki/Help:Infobox.

References

  1. Lehmann, J., Völker, J. (eds.): Perspectives on Ontology Learning, vol. 18. IOS Press, Amsterdam (2014)

    MATH  Google Scholar 

  2. Delteil, A., Faron, C., Dieng, R.: Building concept lattices by learning concepts from RDF graphs annotating web documents. In: Priss, U., Corbett, D., Angelova, G. (eds.) ICCS-ConceptStruct 2002. LNCS, vol. 2393, pp. 191–204. Springer, Heidelberg (2002). doi:10.1007/3-540-45483-7_15

    Chapter  Google Scholar 

  3. Völker, J., Niepert, M.: Statistical schema induction. In: Antoniou, G., Grobelnik, M., Simperl, E., Parsia, B., Plexousakis, D., Leenheer, P., Pan, J. (eds.) ESWC 2011. LNCS, vol. 6643, pp. 124–138. Springer, Heidelberg (2011). doi:10.1007/978-3-642-21034-1_9

    Chapter  Google Scholar 

  4. Alam, M., Buzmakov, A., Codocedo, V., Napoli, A.: Mining definitions from RDF annotations using formal concept analysis. In: IJCAI 2015, Buenos Aires, Argentina (2015)

    Google Scholar 

  5. Cimiano, P., Hotho, A., Staab, S.: Comparing conceptual, divisive and agglomerative clustering for learning taxonomies from text. ECAI 2004, 435–439 (2004)

    Google Scholar 

  6. Ferré, S.: Conceptual navigation in RDF graphs with SPARQL-like queries. In: Kwuida, L., Sertkaya, B. (eds.) ICFCA 2010. LNCS (LNAI), vol. 5986, pp. 193–208. Springer, Heidelberg (2010). doi:10.1007/978-3-642-11928-6_14

    Chapter  Google Scholar 

  7. Kirchberg, M., Leonardi, E., Tan, Y.S., Ko, R.K.L., Link, S., Lee, B.S.: Beyond rdf links-exploring the semantic web with the help of formal concepts. In: 9th Annual Semantic Web Challenge Conjunction with ISWC 2011 (2011)

    Google Scholar 

  8. Alam, M., Chekol, M.W., Coulet, A., Napoli, A., Smaïl-Tabbone, M.: Lattice based data access (LBDA): an approach for organizing and accessing linked open data in biology. In: d’Amato, C., Berka, P., Svátek, V., Wecel, K. (eds.) Data Mining on Linked Data Workshop (ECML/PKDD, 2013). DMoLD 2013, vol. 1082. Springer, Heidleberg (2013)

    Google Scholar 

  9. Reynaud, J., Toussaint, Y., Napoli, A.: Contribution to the classification of web of data based on formal concept analysis. In: What can FCA do for Artificial Intelligence (FCA4AI)(ECAI 2016) (2016)

    Google Scholar 

  10. Ferré, S., Ridoux, O., Sigonneau, B.: Arbitrary relations in formal concept analysis and logical information systems. In: Dau, F., Mugnier, M.-L., Stumme, G. (eds.) ICCS-ConceptStruct 2005. LNCS, vol. 3596, pp. 166–180. Springer, Heidelberg (2005). doi:10.1007/11524564_11

    Chapter  Google Scholar 

  11. Rouane-Hacene, M., Huchard, M., Napoli, A., Valtchev, P.: Relational concept analysis: mining concept lattices from multi-relational data. Ann. Math. Artif. Intell. 67(1), 81–108 (2013). doi:10.1007/s10472-012-9329-3

    Article  MathSciNet  MATH  Google Scholar 

  12. Rutledge, L., van Ossenbruggen, J., Hardman, L.: Making RDF presentable: integrated global and local semantic web browsing. WWW 2005, pp. 199–206, Chiba, Japan, ACM (2005). doi:10.1145/1060745.1060777

  13. d’Aquin, M., Motta, E.: Extracting relevant questions to an rdf dataset using formal concept analysis. In: Proceedings of the Sixth International Conference on Knowledge Capture, pp. 121–128. ACM (2011)

    Google Scholar 

  14. Chekol, M.W., Napoli, A.: An FCA framework for knowledge discovery in sparql query answers. In: Proceedings of the 2013th International Conference on Posters & Demonstrations Track-Volume 1035, pp. 197–200. CEUR-WS. Org (2013)

    Google Scholar 

  15. Wong, W., Liu, W., Bennamoun, M.: Ontology learning from text: a look back and into the future. ACM Comput. Surv. 44(4), 1–36 (2012)

    Article  MATH  Google Scholar 

  16. Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.: DBpedia: a nucleus for a web of open data. In: Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.) ASWC/ISWC -2007. LNCS, vol. 4825, pp. 722–735. Springer, Heidelberg (2007). doi:10.1007/978-3-540-76298-0_52

    Chapter  Google Scholar 

  17. Ganter, B., Wille, R.: Formal Concept Analysis: Mathematical Foundations. Springer, Heidelberg (1999)

    Book  MATH  Google Scholar 

  18. Denecke, K., Erné, M., Wismath, S.L.: Galois Connections and Applications, vol. 565. Springer, Heidelberg (2013)

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Razieh Mehri .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Mehri, R., Valtchev, P. (2017). Mining Schema Knowledge from Linked Data on the Web. In: Li, G., Ge, Y., Zhang, Z., Jin, Z., Blumenstein, M. (eds) Knowledge Science, Engineering and Management. KSEM 2017. Lecture Notes in Computer Science(), vol 10412. Springer, Cham. https://doi.org/10.1007/978-3-319-63558-3_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-63558-3_22

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-63557-6

  • Online ISBN: 978-3-319-63558-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics