Abstract
The Open Knowledge Extraction (OKE) challenge is aimed at promoting research in the automatic extraction of structured content from textual data and its representation and publication as Linked Data. We designed two extraction tasks: (1) Entity Recognition, Linking and Typing and (2) Class Induction and entity typing. The challenge saw the participations of four systems: CETUS-FOX and FRED participating to both tasks, Adel participating to Task 1 and OAK@Sheffield participating to Task 2. In this paper we describe the OKE challenge, the tasks, the datasets used for training and evaluating the systems, the evaluation method, and obtained results.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
The prefix dul: stands for the namespace http://www.ontologydesignpatterns.org/ont/dul/DUL.owl.
- 9.
- 10.
The prefixes nif:, itsrdf:, dul:, and dbpedia: identify the namespaces http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core, http://www.w3.org/2005/11/its/rdf, http://www.ontologydesignpatterns.org/ont/dul/DUL.owl, and http://dbpedia.org/resource/ respectively.
- 11.
Prefixes d0: and dul: stand for namespaces http://ontologydesignpatterns.org/ont/wikipedia/d0.owl and http://www.ontologydesignpatterns.org/ont/dul/DUL.owl respectively.
- 12.
The training dataset is available at https://github.com/anuzzolese/oke-challenge/blob/master/GoldStandard_sampleData/task1/dataset_task_1.ttl. Similarly, the evaluation dataset is available at https://github.com/anuzzolese/oke-challenge/blob/master/evaluation-data/task1/evaluation-dataset-task1.ttl.
- 13.
The training dataset is available at https://github.com/anuzzolese/oke-challenge/blob/master/GoldStandard_sampleData/task2/dataset_task_2.ttl. Similarly, the evaluation dataset is available at https://github.com/anuzzolese/oke-challenge/blob/master/evaluation-data/task2/evaluation-dataset-task2.ttl.
- 14.
References
Bizer, C., Heath, T., Berners-Lee, T.: Linked data - the story so far. Int. J. Semantic Web Inf. Syst. 5(3), 1–22 (2009)
Bizer, C., Lehmann, J., Kobilarov, G., Auer, S., Becker, C., Cyganiak, R., Hellmann, S.: DBpedia - a crystallization point for the web of data. J. Web Sem. 7(3), 154–165 (2009)
Consoli, S., Reforgiato, D.: Using fred for named entity resolution, linking and typing for knowledge base population. In: Gandon, F., Sabou, M., Sack, H., Cabrio, E., Stankovic, M., Zimmermann, A. (eds.) ESWC 2015 Challenges, CCIS, pp. 40–50. Springer International Publishing, Switzerland (2015)
Doddington, G.R., Mitchell, A., Przybocki, M.A., Ramshaw, L.A., Strassel, S., Weischedel, R.M.: The automatic content extraction (ace) program-tasks, data, and evaluation. In: LREC (2004)
Gangemi, A., Guarino, N., Masolo, C., Oltramari, A., Schneider, L.: Sweetening ontologies with DOLCE. In: Gómez-Pérez, A., Benjamins, V.R. (eds.) EKAW 2002. LNCS (LNAI), vol. 2473, pp. 166–181. Springer, Heidelberg (2002)
Gao, J., Mazumdar, S.: Exploiting linked open data to uncover entity types. In: Gandon, F., Sabou, M., Sack, H., Cabrio, E., Stankovic, M., Zimmermann, A. (eds.) ESWC 2015 Challenges, CCIS, pp. 51–62. Springer International Publishing, Switzerland (2015)
Grishman, R., Sundheim, B.: Message understanding conference-6: a brief history. In: Proceedings of the 16th Conference on Computational Linguistics, COLING 1996, vol. 1, pp. 466–471. Association for Computational Linguistics, Stroudsburg, PA, USA (1996)
Hellmann, S., Lehmann, J., Auer, S., Brümmer, M.: Integrating NLP using linked data. In: Alani, H., Kagal, L., Fokoue, A., Groth, P., Biemann, C., Parreira, J.X., Aroyo, L., Noy, N., Welty, C., Janowicz, K. (eds.) ISWC 2013, Part II. LNCS, vol. 8219, pp. 98–113. Springer, Heidelberg (2013)
Petasis, G., Karkaletsis, V., Paliouras, G., Krithara, A., Zavitsanos, E.: Ontology population and enrichment: state of the art. In: Paliouras, G., Spyropoulos, C.D., Tsatsaronis, G. (eds.) Multimedia Information Extraction. LNCS, vol. 6050, pp. 134–166. Springer, Heidelberg (2011)
Plu, J., Rizzo, G., Troncy, R.: A hybrid approach for entity recognition and linking. In: Gandon, F., Sabou, M., Sack, H., Cabrio, E., Stankovic, M., Zimmermann, A. (eds.) ESWC 2015 Challenges, CCIS, pp. 28–39. Springer International Publishing, Switzerland (2015)
Röder, M., Usbeck, R., Ngonga Ngomo, A.-C.: Cetus - a baseline approach to type extraction. In: Gandon, F., Sabou, M., Sack, H., Cabrio, E., Stankovic, M., Zimmermann, A. (eds.) ESWC 2015 Challenges, CCIS, pp. 16–27. Springer International Publishing, Switzerland (2015)
Tjong Kim Sang, E.F.: Introduction to the conll-2002 shared task: language-independent named entity recognition. In: Proceedings of the 6th Conference on Natural Language Learning, COLING-02, vol. 20, pp. 1–4. Association for Computational Linguistics, Stroudsburg, PA, USA (2002)
Iordache, O.: Introduction. In: Iordache, O. (ed.) Polystochastic Models for Complexity. UCS, vol. 4, pp. 1–16. Springer, Heidelberg (2010)
Usbeck, R., Röder, M., Ngomo, A.N., Baron, C., Both, A., Brümmer, M., Ceccarelli, D., Cornolti, M., Cherix, D., Eickmann, B., Ferragina, P., Lemke, C., Moro, A., Navigli, R., Piccinno, F., Rizzo, G., Sack, H., Speck, R., Troncy, R., Waitelonis, J., Wesemann, L.: GERBIL: general entity annotator benchmarking framework. In Gangemi, A., Leonardi, S., Panconesi, A. (eds.) Proceedings of the 24th International Conference on World Wide Web, WWW 2015, pp. 1133–1143. ACM (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Nuzzolese, A.G., Gentile, A.L., Presutti, V., Gangemi, A., Garigliotti, D., Navigli, R. (2015). Open Knowledge Extraction Challenge. In: Gandon, F., Cabrio, E., Stankovic, M., Zimmermann, A. (eds) Semantic Web Evaluation Challenges. SemWebEval 2015. Communications in Computer and Information Science, vol 548. Springer, Cham. https://doi.org/10.1007/978-3-319-25518-7_1
Download citation
DOI: https://doi.org/10.1007/978-3-319-25518-7_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25517-0
Online ISBN: 978-3-319-25518-7
eBook Packages: Computer ScienceComputer Science (R0)