Abstract
The Open Knowledge Extraction (OKE) challenge, now in its second edition, aims to provide a reference framework for research on Knowledge Extraction from text for the Semantic Web (SW) by redefining a number of tasks (typically from information and knowledge extraction) to take specific SW requirements into account. The OKE challenge defines two tasks: (1) Entity Recognition, Linking and Typing for Knowledge Base population; (2) Class Induction and entity typing for Vocabulary and Knowledge Base enrichment. Task 1 consists of identifying the entities in a sentence, creating an OWL individual to represent each of them, linking that individual to a reference KB (DBpedia) when possible, and assigning it a type. Task 2 consists of producing rdf:type statements from definition texts: participants are given a dataset of sentences, each defining an entity that is known a priori. The following systems participated in the challenge: WestLab in both Task 1 and Task 2, ADEL in Task 1, and Mannheim in Task 2. In this paper we describe the OKE challenge, the tasks, the datasets used for training and evaluating the systems, the evaluation method, and the results obtained.
Notes
- 8. The prefix dul: stands for the namespace http://www.ontologydesignpatterns.org/ont/dul/DUL.owl#.
- 10. The prefixes nif:, itsrdf:, dul:, and dbpedia: identify the namespaces http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core#, http://www.w3.org/2005/11/its/rdf#, http://www.ontologydesignpatterns.org/ont/dul/DUL.owl#, and http://dbpedia.org/resource/ respectively.
- 11. Prefixes d0: and dul: stand for namespaces http://ontologydesignpatterns.org/ont/wikipedia/d0.owl# and http://www.ontologydesignpatterns.org/ont/dul/DUL.owl# respectively.
- 12. A preview of the job can be found at https://tasks.crowdflower.com/channels/cf_internal/jobs/913913/editor_preview.
- 14. The training dataset is available at https://github.com/anuzzolese/oke-challenge-2016/blob/master/GoldStandard_sampleData/task1/dataset_task_1.ttl. Similarly, the evaluation dataset is available at https://github.com/anuzzolese/oke-challenge-2016/blob/master/evaluation-data/task1/evaluation-dataset-task1.ttl.
- 15. The training dataset is available at https://github.com/anuzzolese/oke-challenge-2016/blob/master/GoldStandard_sampleData/task2/dataset_task_2.ttl. Similarly, the evaluation dataset is available at https://github.com/anuzzolese/oke-challenge-2016/blob/master/evaluation-data/task2/evaluation-dataset-task2.ttl.
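The namespace prefixes listed in the notes can be read as a simple lookup table. The following is a minimal sketch, not part of the challenge tooling, of how a prefixed name might be expanded into a full IRI and how a single rdf:type statement of the kind both tasks ask for could be rendered as an N-Triples line. The helper names `expand` and `type_statement`, and the example entity and class used below, are assumptions introduced purely for illustration; only the namespace IRIs come from the notes.

```python
# Namespace prefixes exactly as given in the notes; nothing else here
# is taken from the challenge material.
PREFIXES = {
    "nif": "http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core#",
    "itsrdf": "http://www.w3.org/2005/11/its/rdf#",
    "dul": "http://www.ontologydesignpatterns.org/ont/dul/DUL.owl#",
    "d0": "http://ontologydesignpatterns.org/ont/wikipedia/d0.owl#",
    "dbpedia": "http://dbpedia.org/resource/",
}

# Standard IRI of the rdf:type property.
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def expand(curie: str) -> str:
    """Expand a prefixed name such as 'dul:Person' into a full IRI.

    Hypothetical helper: raises ValueError when the prefix is not one
    of those listed in PREFIXES.
    """
    prefix, sep, local = curie.partition(":")
    if not sep or prefix not in PREFIXES:
        raise ValueError(f"unknown or missing prefix in {curie!r}")
    return PREFIXES[prefix] + local


def type_statement(entity: str, cls: str) -> str:
    """Render one rdf:type statement as an N-Triples line (hypothetical helper)."""
    return f"<{expand(entity)}> <{RDF_TYPE}> <{expand(cls)}> ."


if __name__ == "__main__":
    # Illustrative entity and class only; not drawn from the datasets.
    print(type_statement("dbpedia:Florence", "dul:Place"))
```

A real submission would more likely build such statements with an RDF library and serialise them alongside the NIF annotations; the plain-string version above is only meant to make the role of the prefixes concrete.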
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Nuzzolese, A.G., Gentile, A.L., Presutti, V., Gangemi, A., Meusel, R., Paulheim, H. (2016). The Second Open Knowledge Extraction Challenge. In: Sack, H., Dietze, S., Tordai, A., Lange, C. (eds) Semantic Web Challenges. SemWebEval 2016. Communications in Computer and Information Science, vol 641. Springer, Cham. https://doi.org/10.1007/978-3-319-46565-4_1
DOI: https://doi.org/10.1007/978-3-319-46565-4_1
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46564-7
Online ISBN: 978-3-319-46565-4