Abstract
We present ontology databases with not-gadgets, a method for detecting inconsistencies in an ontology with large numbers of annotated instances by using triggers and exclusion dependencies in a unique way. What makes this work relevant is the use of the database itself, rather than an external reasoner, to detect logical inconsistencies given large numbers of annotated instances. What distinguishes this work is the use of event-driven triggers together with the introduction of explicit negations. We applied this approach toward the serotonin example, an open problem in biomedical informatics which aims to use annotations to help identify inconsistencies in the Gene Ontology. We discovered 75 inconsistencies that have important implications in biology, which include: (1) methods for refining transfer rules used for inferring electronic annotations, and (2) highlighting possible biological differences across species worth investigating.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
The Gene Ontology (GO) project in 2006. Nucleic Acids Research, 34(Database issue) (January 2006)
Abadi, D.J., Marcus, A., Madden, S.R., Hollenbach, K.: SW-Store: A Vertically Partitioned DBMS for Semantic Web Data Management. VLDB Journal 18(2), 385–406 (2009)
Baader, F., Calvanese, D., McGuinness, D.L., Nardi, D., Patel-Schneider, P.F. (eds.): The Description Logic Handbook: Theory, Implementation, and Applications. Cambridge University Press, Cambridge (2003)
Broekstra, J., Kampman, A., van Harmelen, F.: Sesame: A Generic Architecture for Storing and Querying RDF and RDF Schema. In: International Semantic Web Conference, pp. 54–68 (2002)
Bult, C.J., Eppig, J.T., Kadin, J.A., Richardson, J.E., Blake, J.A.a.: The Mouse Genome Database (MGD): mouse biology and model systems. Nucleic Acids Research 36(Database issue) (January 2008)
Calvanese, D., De Giacomo, G., Lembo, D., Lenzerini, M., Rosati, R.: DL-Lite: Tractable Description Logics for Ontologies. In: AAAI 2005: Proceedings of the 20th National Conference on Artificial Intelligence, pp. 602–607 (2005)
Christophides, V., Karvounarakis, G., Plexousakis, D., Scholl, M., Tourtounis, S.: Optimizing taxonomic semantic web queries using labeling schemes. Journal of Web Sematics 1, 207–228 (2004)
Copeland, G.P., Khoshafian, S.N.: A decomposition storage model. In: SIGMOD 1985: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 268–279. ACM, New York (1985)
Curé, O., Squelbut, R.: A Database Trigger Strategy to Maintain Knowledge Bases Developed Via Data Migration. In: Bento, C., Cardoso, A., Dias, G. (eds.) EPIA 2005. LNCS (LNAI), vol. 3808, pp. 206–217. Springer, Heidelberg (2005)
Donini, F.M., Nardi, D., Rosati, R.: Description logics of minimal knowledge and negation as failure. ACM Trans. Comput. Logic 3(2), 177–225 (2002)
Dowek, G., Hardin, T., Kirchner, C.: Theorem Proving Modulo. Journal of Automated Reasoning 31, 2003 (1998)
Franconi, E.: Ontologies and databases: myths and challenges. In: Proceedings of the VLDB Endowment, vol. 1(2), pp. 1518–1519 (2008)
Gene Ontology Consortium. Gene Ontology: tool for the unification of biology. Nature Genetics 25, 25–29 (2000)
Goble, C., Stevens, R.: State of the nation in data integration for bioinformatics. Journal of Biomedical Informatics (February 2008)
Guo, Y., Pan, Z., Heflin, J.: An Evaluation of Knowledge Base Systems for Large OWL Datasets. In: McIlraith, S.A., Plexousakis, D., van Harmelen, F. (eds.) ISWC 2004. LNCS, vol. 3298, pp. 274–288. Springer, Heidelberg (2004)
Guo, Y., Pan, Z., Heflin, J.: LUBM: A benchmark for OWL knowledge base systems. Journal of Web Semantics 3(2-3), 158–182 (2005)
Haarslev, V., Möller, R.: High Performance Reasoning with Very Large Knowledge Bases: A Practical Case Study. In: IJCAI 2001: Proceedings of the International Joint Conferences on Artificial Intelligence, pp. 161–168 (2001)
Hill, D.P., Smith, B., McAndrews-Hill, M.S., Blake, J.A.: Gene Ontology annotations: what they mean and where they come from. BMC Bioinformatics 9(5) (2008)
Horrocks, I., Li, L., Turi, D., Bechhofer, S.: The Instance Store: DL Reasoning with Large Numbers of Individuals. Description Logics (2004)
Lee, J., Goodwin, R.: Ontology Management for Large-Scale E-Commerce Applications. In: DEEC 2005: Proceedings of the International Workshop on Data Engineering Issues in E-Commerce, pp. 7–15. IEEE Computer Society, Washington (2005)
LePendu, P., Dou, D., Frishkoff, G.A., Rong, J.: Ontology Database: a New Method for Semantic Modeling and an Application to Brainwave Data. In: Ludäscher, B., Mamoulis, N. (eds.) SSDBM 2008. LNCS, vol. 5069, pp. 313–330. Springer, Heidelberg (2008)
Motik, B., Horrocks, I., Sattler, U.: Bridging the Gap Between OWL and Relational Databases. In: WWW 2007: Proceedings of the 16th International Conference on World Wide Web, pp. 807–816 (2007)
Neumann, T., Weikum, G.: Scalable Join Processing on Very Large RDF Graphs. In: SIGMOD 2009: Proceedings of the ACM SIGMOD International Conference on Management of Data (to appear, 2009)
Reiter, R.: Deductive Question-Answering on Relational Data Bases. Logic and Data Bases, 149–177 (1977)
Reiter, R.: Towards a Logical Reconstruction of Relational Database Theory. In: Brodie, M.L., Mylopoulos, J., Schmidt, J.W. (eds.) On Conceptual Modelling—Perspectives from Artificial Intelligence, Databases, and Programming Languages. Topics in Information Systems, pp. 191–233. Springer, Heidelberg (1984)
Reiter, R.: What should a database know? In: PODS 1988: Proceedings of the seventh ACM Symposium on Principles of Database Systems, pp. 302–304. ACM, New York (1988)
Rubin, D.L., Musen, M.A., et al.: National Center for Biomedical Ontology: Advancing Biomedicine through Structured Organization of Scientific Knowledge. OMICS: A Journal of Integrative Biology 10(2), 185–198 (2009)
Slaney, J.K.: Relevant Logic and Paraconsistency. Inconsistency Tolerance, 270–293 (2005)
Sprague, J., Westerfield, M., et al.: The Zebrafish Information Network: the zebrafish model organism database provides expanded support for genotypes and phenotypes. Nucleic Acids Research 36, D768–D772 (2007)
Ullman, J.D.: Principles of Database and Knowledge-Base Systems, vol. I. Computer Science Press (1988)
Vasilecas, O., Bugaite, D.: An algorithm for the automatic transformation of ontology axioms into a rule model. In: CompSysTech 2007: Proceedings of the International Conference on Computer Systems and Technologies, pp. 1–6. ACM, New York (2007)
Wache, H., Vögele, T., Visser, U., Stuckenschmidt, H., Schuster, G., Neumann, H., Hübner, S.: Ontology-based integration of information — a survey of existing approaches. In: Stuckenschmidt, H. (ed.) IJCAI 2001: Workshop on Ontologies and Information Sharing, pp. 108–117 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
LePendu, P., Dou, D., Howe, D. (2009). Detecting Inconsistencies in the Gene Ontology Using Ontology Databases with Not-gadgets. In: Meersman, R., Dillon, T., Herrero, P. (eds) On the Move to Meaningful Internet Systems: OTM 2009. OTM 2009. Lecture Notes in Computer Science, vol 5871. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-05151-7_15
Download citation
DOI: https://doi.org/10.1007/978-3-642-05151-7_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-05150-0
Online ISBN: 978-3-642-05151-7
eBook Packages: Computer ScienceComputer Science (R0)