ABSTRACT
Publically available text-based documents (e.g. news, meeting transcripts) are a very important source of knowledge for organizations and individuals. These documents refer domain entities such as persons, places, professional positions, decisions, actions, etc. Querying these documents (instead of browsing, searching and finding) is a very relevant task for any person in general, and particularly for professionals dealing with intensive knowledge tasks. Querying text-based documents' data, however, is not supported by common technology. For that, such documents' content has to be explicitly and formally captured into knowledge base facts. Making use of automatic NLP processes for capturing such facts is a common approach, but their relatively low precision and recall give rise to data quality problems. Further, facts existing in the documents are often insufficient to answer complex queries and, therefore, it is often necessary to enrich the captured facts with facts from third-party repositories (e.g. public LOD, private IS databases). This paper describes the adopted process to identify what data is currently missing from the knowledge base repository and which is desirable to collect from external repositories. The proposed process aims to foster and is driven by OWL DL inference-based instance (ABox) classification, which is supported by the constraints of the TBox.
- Canito A., Maio, P. and Silva, N. 2013. An Approach for Populating and Enriching Ontology-based Repositories. 12th International Workshop on Web Semantics (WebS) at DEXA (Prague, Czech Republic, Sept 2013). Google ScholarDigital Library
- Brandão, R., Maio, P. and Silva, N. 2012. Enhancing LOD Complex Query Building with Context. (Macau, China, Dec. 2012).Google Scholar
- Elsenbroich, C., Kutz, O. and Sattler, U. 2006. A Case for Abductive Reasoning over Ontologies. (2006).Google Scholar
- Gamma, E., Helm, R., Johnson, R. and Vlissides, J. 1994. Design Patterns: Elements of Reusable Object-Oriented Software. Addison-Wesley Professional. Google ScholarDigital Library
- Kiryakov, A., Popov, B., Terziev, I., Manov, D. and Ognyanoff, D. 2004. Semantic annotation, indexing, and retrieval. Web Semantics: Science, Services and Agents on the World Wide Web. 2, 1 (Dec. 2004), 49--79. Google ScholarDigital Library
- Linked Data - Design Issues: 2006. http://www.w3. org/DesignIssues/LinkedData. html.Google Scholar
- McDowell, L.K. and Cafarella, M. 2008. Ontology-driven, unsupervised instance population. Web Semantics: Science, Services and Agents on the World Wide Web. 6, 3 (Sep. 2008), 218--236. Google ScholarDigital Library
- Motik, B., Sattler, U. and Studer, R. 2005. Query Answering for OWL-DL with rules. Web Semant. 3, 1 (Jul. 2005), 41--60. Google ScholarDigital Library
- OWL 2 Web Ontology Language Structural Specification and Functional-Style Syntax (Second Edition): http://www.w3. org/TR/2012/REC-owl2-syntax-20121211/. Accessed: 2013-02-26.Google Scholar
- Sirin, E., Parsia, B., Grau, B.C., Kalyanpur, A. and Katz, Y. 2007. Pellet: A practical owl-dl reasoner. Web Semantics: science, services and agents on the World Wide Web. 5, 2 (2007), 51--53. Google ScholarDigital Library
- Song, F., Zacharewicz, G. and Chen, D. 2013. An ontology-driven framework towards building enterprise semantic information layer. Advanced Engineering Informatics. 27, 1 (Jan. 2013), 38--50. Google ScholarDigital Library
- Song, F., Zacharewicz, G. and Chen, D. 2013. An ontology-driven framework towards building enterprise semantic information layer. Advanced Engineering Informatics. 27, 1 (Jan. 2013), 38--50. Google ScholarDigital Library
- Stojanovic, L., Maedche, A., Motik, B. and Stojanovic, N. 2002. User-driven Ontology Evolution Management. 13th International Conference on Knowledge Engineering and Knowledge Management (Heidelberg, Oct. 2002), 197--212. Google ScholarDigital Library
- Talburt, J.R. 2011. Entity resolution and information quality. Morgan Kaufmann. Google ScholarDigital Library
- World Search: 2010. http://www.microsoft.com/portugal/mldc/worldsearch/en/.Google Scholar
- Computational Processing of the Portuguese Language -6th International Workshop, PROPOR 2003.Google Scholar
Index Terms
- Introducing inference-driven OWL ABox enrichment
Recommendations
Translating the Foundational Model of Anatomy into OWL
The Foundational Model of Anatomy (FMA) represents the result of manual and disciplined modeling of the structural organization of the human body. It is a tremendous resource in bioinformatics that facilitates sharing of information among applications ...
Transforming XML documents to OWL ontologies: A survey
The aims of XML data conversion to ontologies are the indexing, integration and enrichment of existing ontologies with knowledge acquired from these sources. The contribution of this paper consists in providing a classification of the approaches used ...
Extension of OWL with Dynamic Fuzzy Logic
Advances in Web and Network Technologies, and Information ManagementIn recent years, ontology has played a major role in knowledge representation. Ontology languages are based on description logics. Though they are expressive enough, they cannot express and reason with fuzzy and dynamic knowledge on the Semantic Web. To ...
Comments