Abstract
Most information retrieval research focuses collecting documents that match the same set of concepts. This study considers a more advanced problem, namely how to discover knowledge not contained in a single source from combined historical facts. By using a well-designed core ontology in the cultural domain (CIDOC CRM, ISO21127), this study discusses the requirement for a robust inference platform for real-life knowledge discovery and integration over distributed sources. The methodology and design are justified in detail through functional requirements for an inference service with the capability of inferring new knowledge from combinations of facts distributed over different sources. A number of critical issues for developing such a robust inference platform are identified, namely (1) systematic accumulation of common concepts and inference rules; (2) extending the ontology with metaclasses; (3) accumulation of factual and categorical knowledge; (4) incorporation of fuzzy inference into the inference engine, and (5) improvement of performance and scalability in the inference engine.
Similar content being viewed by others
References
AAT, Art and Architecture Thesaurus Online: http://www.getty.edu/research/conducting_research/vocabularies/aat/.16 Jan 2008
Alani, H., Kim, S., Millard, E.D., Weal, J.M., Hall, W., Lewis, H.P., Shadbolt, R.N.: Automatic ontology-based knowledge extraction and tailored biography generation from the web. Intell. Syst. 18(1), 14–22 (2003)
AllPoster: http://www.allposters.com. Accessed 16 Jan 2008
Analyti, A., Spyratos, N., Constantopoulos, P.: Deriving and retrieving contextual categorical information through instance inheritance. Fundam. Inf. 44(4), 321–351 (2000)
Antoniou, G., Damásio, V.C., Grosof, B., Horrocks, I., Kifer, M., Maluszynski, J., Patel-Schneider, F.P.: Combining rules and ontologies. A survey. REWERSE Deliverables, no. I3-D3 (2005)
ARCO, Augmented Representation of Cultural Objects: http://www.arco-web.org. Accessed 16 Jan 2008
Artcyclopedia: http://www.artcyclopedia.com. Accessed 16 Jan 2008
Ashburner, M., Ball, A.C., Blake, A.J., Botstein, D., Butler, H., Cherry, M.J., Davis, P.A., Dolinski, K., Dwight, S.S., Eppig, T.J., Harris, A.M., Hill, P.D., Issel-Tarver, L., Kasarskis, A., Lewis, S., Matese, C.J., Richardson, E.J., Ringwald, M., Rubin, M.G., Sherlock, G.: Gene ontology: tool for the unification of biology. Nat. Genet. 25(1), 25–29 (2000)
Baader, F., Calvanese, D., McGuinness, D., Nardi, D., Patel-Schneider, P.: The Description Logic Handbook. Cambridge University Press, London (2002)
Bearman, D., Perkins, J.: The standards framework for computer interchange of museum information. Spectra, CIMI Standards Framework 20(2–3), 1–61 (1993)
Bechhofer, S.: Hoolet: a prototype reasoner based on translation to first Order. Department of Computer Science, University of Manchester. http://owl.man.ac.uk/hoolet/. Accessed 16 Jan 2008
Bijan, P., Evren, S.: Pellet: An OWL DL reasoner. In: Proceedings of the International Workshop on Description Logic (2004)
Boeuf, L.P.: Mapping CRM – > FRBR. http://cidoc.ics.forth.gr/docs/mapping_crm_frbr.pdf. Technical Report, FORTH-ICS (2003)
Boeuf, L.P.: FRBR and further. Cat. Classif. Q. 32(4), 15–22 (2001)
Bozsak, E., Ehrig, M., Handschuh, S., Hotho, A., Maedche, A., Motik, B., Oberle, D., Schmitz, C., Staab, S., Stojanovic, L., Stojanovic, N., Studer, R., Stumme, G., Sure, Y., Tane, J., Volz, R., Zacharias, V.: KAON—Towards a large scale Semantic Web. In: Proceedings of the Third International Conference on E-Commerce and Web Technologies, pp. 304–313 (2002)
CCO, Cataloguing Cultural Objects: http://vraweb.org/ccoweb/cco/about.html. Accessed 16 Jan 2008
CIDOC CRM: http://cidoc.ics.forth.gr/. Accessed 16 Jan 2008
Crawford, M.J., Kuipers, J.B.: Algernon–a tractable system for knowledge-representation. SIGART Bull. 2(3), 35–44 (1991)
Crofts, N., Dionissiadou, I., Doerr, M., Reed, P.: CIDOC Conceptual Reference Model–Information groups. ICOM/CIDOC Documentation Standards Group (1998)
Deepak, R., Pace, R., Keith, G.: First–orderized researchCyc: expressivity and efficiency in a common-sense ontology. In: AAAI Workshop on Contexts and Ontologies: Theory, Practice and Applications (2003)
Doerr, M., LeBoeuf, P.: Modelling intellectual processes The FRBR–CRM harmonization. In: Proceedings of ICOM-CIDOC Annual Meeting on Wider Perspective Broader Base, Gothenburg, Sweden (2006)
Doerr, M., Kritsotaki, A.: Documenting events in metadata. In: Proceedings of CIPA-VAST Conference, Nicosia, Cyprus, Archaeolingua, Budapest, pp. 56–59 (2006)
Doerr, M.: Modelling learning subjects as relationships. In: Grieser G., Tanaka Y. (eds.) Proceedings of Dagstuhl Workshop on Intuitive Human Interface for Organizing and Accessing Intellectual Assets. pp. 200–214 (2004)
Doerr, M.: The CIDOC CRM–an ontological approach to semantic interoperability of metadata. Artif. Intell. Mag. 24(3), 75–92 (2003)
Doerr, M.: Mapping of the AMICO data dictionary to the CIDOC CRM. Technical Report, FORTH-ICS/TR-288 (2001)
Doerr, M.: Mapping of the Dublin Core metadata element set to the CIDOC CRM. Technical Report, FORTH-ICS/TR-274 (2000)
Donini, M.F., Lenzerini, M., Nardi, D., Schaerf, A.: AL-log: integrating datalog and description logics. J. Intell. Inf. Syst. 10(3), 227–252 (1998)
Dublin Core Metadata Initiative: http://dublincore.org/. Accessed 16 Jan 2008
Eiter, T., Lukasiewicz, T., Schindlauer, R., Tompits, H.: Combining answer set programming with description logics for the semantic web. In: Proceedings of the Ninth International Conference on Principles of Knowledge Representation and Reasoning, Whistler, British Columbia, Canada, pp. 141–151 (2004)
Eriksson, H.: Using JessTab to integrate Protégé and Jess. IEEE Intell. Syst. 18(2), 43–50 (2003)
Etzioni, O., Cafarella, M., Downey, D., Popescu, A.M., Shaked, T., Soderland, S., Weld, S.D., Yates, A.: Unsupervised named-entity extraction from the web: an experimental study. Artif. Intell. 165(1), 91–134 (2005)
Fauconnier, G., Turner, M.: The Way We Think: Conceptual Blending and the Mind’s Hidden Complexities. Basic Books, New York (2002)
Fensel, D., Horrocks, I., Harmelen van, F., Decker, S., Erdmann, M., Klein, A.C.M.: OIL in a nutshell. In: Proceedings of the 12th European Workshop on Knowledge Acquisition, Modeling and Management, pp. 1–16 (2000)
Flickner, M., Sawhney, H., Niblack, W., Ashley, J., Huang, Q., Dom, B., Gorkani, M., Hafner, J., Lee, D., Petkovic, D., Steele, D., Yanker, P.: Query by image and video content: the QBIC system. IEEE Comput. 28(9), 23–32 (1995)
Forgy, L.C.: Rete: A fast algorithm for the many pattern/many object pattern match problem. Artif. Intell. 19, 17–37 (1982)
Golbreich, C., Dameron, O., Bierlaire, O., Gibaud, B.: What reasoning support for ontology and rules? The brain anatomy case study. In: Proceedings of the Workshop OWL Experiences and Directions, Galway, Ireland (2005)
Golbreich, C.: Combining rule and ontology reasoners for the Semantic Web. In: Proceedings of Rules and Rule Markup Languages for the Semantic Web, Hiroshima, Japan, pp. 155–169 (2004)
Grosof, B., Gandhe, M., Finin, T.: SweetJess: Translating Daml RuleML to Jess. In: International Workshop on Rule Markup Languages for Business Rules on the Semantic Web (2002)
Grosof, B.: Representing E-Business rules for the Semantic Web: situated courteous logic programs in RuleML. In: Proceedings of the 11th Workshop on Information Technologies and Systems (2001)
Gupta, A., Jain, R.: Visual information retrieval. Commun. ACM 40(5), 71–79 (1997)
Haarslev, V., Möller, R.: Racer: A core inference engine for the Semantic Web. In: Proceedings of the Second International Workshop on Evaluation of Ontology-based Tools, Sanibel Island, Florida (2003)
Heflin, J., Hendler, J., Luke, S.: Reading between the lines: Using SHOE to discover implicit knowledge from the web. In: AAAI-98 Workshop on AI and Information Integration, Madison (1998)
Horrocks, I., Patel-Schneider, F.P., Boley, H., Tabet, S., Grosof, B., Dean, M.: SWRL: A Semantic Web rule language combining OWL and RuleML. Acknowledged W3C Member Submission (2004)
Horrocks, I., Sattler, U.: Decidability of SHIQ with complex role inclusion axioms. Artif. Intell. 160(1), 79–104 (2004)
Horrocks, I., Patel-Schneider, F.P., Harmelen van, F.: From SHIQ and RDF to OWL: the making of a web ontology language. J. Web Semant. 1(1), 7–26 (2003)
Horrocks, I.: DAML+OIL: a description logic for the semantic web. IEEE Data Eng. Bull. 25(1), 4–9 (2002)
Knublauch, H., Musen, A.M., Rector, L.A.: Editing description logic ontologies with the protégé OWL plugin. In: Proceedings of the International Workshop on Description Logics, Whistler, BC, Canada (2004)
Kondylakis, H., Doerr, M., Plexousakis, D.: Mapping language for information integration. Technical Report 385, ICS-FORTH (2006)
Koutsomitropoulos, A.D., Fragakis, F.M., Papatheodorou, S.T.: A methodology for conducting knowledge discovery on the semantic web. In: Proceedings of 16th ACM Conference on Hypertext and Hypermedia (2005)
Lagoze, C., Hunter, J.: The ABC ontology and model. In: Proceedings of the International Conference on Dublin Core and Metadata Applications, pp. 160–176 (2001)
Levy, Y.A., Rousset, M.C.: Combining horn rules and description logics in CARIN. Artif. Intell. 104(1–2), 165–209 (1998)
Lindholm, J., Patel, M.: ARCO internal report: a review of metadata use. SFP-Metadata-Review (2002)
Liu, H., Singh, P.: ConceptNet: a practical commonsense reasoning toolkit. BT Technol. J. 22(4), 211–226 (2004)
Louvre Museum: http://www.louvre.fr. Accessed 16 Jan 2008
Martin, P., Eklund, W.P.: Knowledge retrieval and the World Wide Web. Intell. Syst. 15(3), 18–25 (2000)
Matheus, J.C., Baclawski, K., Kokar, M.M., Letkowski, J.J.: Using SWRL and OWL to capture domain knowledge for a situation awareness application applied to a supply logistics scenario. In: Proceedings of International Conference on Rules and Rule Markup Languages for the Semantic Web, (RuleML-2005), Galway, Ireland (2005)
Matuszek, C., Witbrock, M., Kahlert, C.R., Cabral, J., Schneider, D., Shah, P., Lenat, D.: Searching for common sense populating Cyc from the web. In: Proceedings of the 20th National Conference on Artificial Intelligence (2005)
McKenna, G., Patsatzi, E.: SPECTRUM: The UK Museum Documentation Standard, Version 3.0. MDA, Cambridge (2005)
MetaCRM: http://cidoc.ics.forth.gr/docs/meta_crm/meta_crm_draft.doc. Accessed 16 Jan 2008
Miller, A.G.: WordNet: a lexical database for English. Commun. of the ACM 38(11), 39–41 (1995)
Motik, B., Horrocks, I., Rosati, R., Sattler, U.: Can OWL and logic programming live together happily ever after? In: Proceedings of the 5th International Semantic Web Conference (ISWC-06), Athens, Greece (2006)
Motik, B., Sattler, U.: A comparison of reasoning techniques for querying large description logic ABoxes. In: Proceeding of the 13th International Conference on Logic for Programming Artificial Intelligence and Reasoning, (LPAR 2006), Phnom Penh, Cambodia (2006)
Motik, B., Sattler, U., Studer, R.: Query answering for OWL DL with rules. In: Proceedings of the Third International Semantic Web Conference (ISWC 2004), Hiroshima, Japan (2004)
Orchard, R.: Fuzzy reasoning in Jess: the Fuzzy J toolkit and Fuzzy Jess. In: Proceedings of the Third International Conference on Enterprise Information Systems, pp. 533–542 (2001)
Plas, D.J., Verheijen, M., Zwaal, H., Hutschemaekers, M.: Manipulating context information with SWRL. FreeBand/A-MUSE Deliverable D3.12 (2006)
Protégé: http://protege.stanford.edu/. Accessed 16 Jan 2008
Rosati, R.: On the decidability and complexity of integrating ontologies and rules. J. Web Semant. 3(1), 61–73 (2005)
Sandia National Laboratories. Jess: the rule engine for the Java platform: http://herzberg.ca.sandia.gov/jess. Accessed 16 Jan 2008
Sheth, A., Thacker, S., Patel, S.: Complex relationship and knowledge discovery support in the InfoQuilt system. Int. J. Very Large Data Bases 12(1), 2–27 (2002)
Smeulders, A., Worring, M., Santini, S., Gupta, A., Jain, R.: Content-based image retrieval at the end of the early years. IEEE Trans. Pattern Anal. Mach. Intell. 22(12), 1349–1380 (2000)
Smith, B.: Ontology. In: Luciano, F.(eds) The Blackwell Guide to the Philosophy of Computing and Information, pp. 155–166. Blackwell, Oxford (2003)
Stevens, R., Aranguren, E.M., Wolstencroft, K., Sattler, U., Drummond, N., Horridge, M., Rector, A.: Using OWL to model biological knowledge. Int. J. Hum. Comput. Stud. 65(7), 583–594 (2007)
Theodoridou, M., Doerr, M.: Mapping of the Encoded Archival Description DTD Element Set to the CIDOC CRM. Technical Report FORTH-ICS/TR-289 (2001)
Trant, J., Richmond, K., Bearman, D.: Open concepts: museum digital documentation for education through the AMICO Library. Art Libr. J. 27(3), 30–42 (2002)
Tsarkov, D., Horrocks, I.: FaCT++ description logic reasoner: system description. In: Proceedings of the Third International Joint Conference on Automated Reasoning (IJCAR-06) (2006)
VRA, Visual Resources Association Data Standards Committee: http://www.vraweb.org/. Accessed 16 Jan 2008
Wilkinson, K., Sayers, C., Kuno, H., Reynolds, D.: Efficient RDF storage and retrieval in Jena 2. In: Proceedings of First International Workshop on Semantic Web and Databases, Berlin (2003)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Lin, CH., Hong, JS. & Doerr, M. Issues in an inference platform for generating deductive knowledge: a case study in cultural heritage digital libraries using the CIDOC CRM. Int J Digit Libr 8, 115–132 (2008). https://doi.org/10.1007/s00799-008-0034-0
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00799-008-0034-0