Abstract
We present a method for using aligned ontologies to merge taxonomically organized data sets that have apparently compatible schemas, but potentially different semantics for corresponding domains. We restrict the relationships involved in the alignment to basic set relations and disjunctions of these relations. A merged data set combines the domains of the source data set attributes, conforms to the observations reported in both data sets, and minimizes uncertainty introduced by ontology alignments. We find that even in very simple cases, merging data sets under this scenario is non-trivial. Reducing uncertainty introduced by the ontology alignments in combination with the data set observations often results in many possible merged data sets, which are managed using a possible worlds semantics. The primary contributions of this paper are a framework for representing aligned data sets and algorithms for merging data sets that report the presence and absence of taxonomically organized entities, including an efficient algorithm for a common data set merging scenario.
Work supported by NSF awards IIS-0630033, DBI-0743429, and DBI-0753144.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Cliff, A.D., Haggett, P., Smallman-Raynor, M.: The changing shape of island epidemics: historical trends in icelandic infectious disease waves. J Hist Geogr. 1902–1988 (2009)
Berkley, C., Jones, M., Bojilova, J., Higgins, D.: Metacat: a schema-independent xml database system. In: SSDBM, pp. 171–179 (2001)
Thau, D.: Reasoning about taxonomies and articulations. In: EDBT Workshops, pp. 11–19 (2008)
Brachman, R.: What is-a is and isn’t: An analysis of taxonomic links in semantic networks. IEEE Computer 16, 30–36 (1983)
Randell, D.A., Cui, Z., Cohn, A.: A spatial logic based on regions and connection. In: KR, pp. 165–176 (1992)
Thau, D., Ludäscher, B.: Reasoning about taxonomies in first-order logic. Ecological Informatics 2(3), 195–209 (2007)
Thau, D., Bowers, S., Ludäscher, B.: Merging taxonomies under RCC-5 algebraic articulations. In: Proceedings of the CIKM ONISW Workshop, pp. 47–54 (2008)
Lewis, C., Langford, C.: Symbolic Logic, 2nd edn. Dover, New York (1959)
Abiteboul, S., Kanellakis, P.C., Grahne, G.: On the representation and querying of sets of possible worlds. In: SIGMOD, pp. 34–48 (1987)
Antova, L., Jansen, T., Koch, C., Olteanu, D.: Fast and simple relational processing of uncertain data. In: ICDE, pp. 983–992 (2008)
Bowers, S., Madin, J.S., Schildhauer, M.P.: A conceptual modeling framework for expressing observational data semantics. In: Li, Q., Spaccapietra, S., Yu, E., Olivé, A. (eds.) ER 2008. LNCS, vol. 5231, pp. 41–54. Springer, Heidelberg (2008)
Antova, L., Koch, C., Olteanu, D.: World-set decompositions: Expressiveness and efficient algorithms. In: Schwentick, T., Suciu, D. (eds.) ICDT 2007. LNCS, vol. 4353, pp. 194–208. Springer, Heidelberg (2007)
Bachmair, L., Ganzinger, H., Waldmann, U.: Set constraints are the monadic class. Logic in Computer Science, 75–83 (1993)
Darwiche, A.: New advances in compiling cnf into decomposable negation normal form. In: ECAI, pp. 328–332 (2004)
Korovin, K.: iProver – an instantiation-based theorem prover for first-order logic (system description). In: Armando, A., Baumgartner, P., Dowek, G. (eds.) IJCAR 2008. LNCS (LNAI), vol. 5195, pp. 292–298. Springer, Heidelberg (2008)
Lenzerini, M.: Data integration: A theoretical perspective. In: PODS (2002)
Peet, R.: Taxonomic concept mappings for 9 taxonomies of the genus Ranunculus published from 1948 to 2004. Unpublished data set (2005)
Vasseur, P., Mouaddib, E.M., Pégard, C.: Introduction to multisensor data fusion. In: Zurawski, R. (ed.) The Industrial Information Technology Handbook, pp. 1–10. CRC Press, Boca Raton (2005)
Noy, N.F., Musen, M.A.: PROMPT: Algorithm and tool for automated ontology merging and alignment, pp. 450–455. AAAI, Menlo Park (2000)
McGuinness, D.L., Fikes, R., Rice, J., Wilder, S.: An environment for merging and testing large ontologies. In: ECAI (2000)
Stumme, G., Maedche, A.: Ontology merging for federated ontologies on the semantic web. In: FMII, pp. 413–418 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Thau, D., Bowers, S., Ludäscher, B. (2009). Merging Sets of Taxonomically Organized Data Using Concept Mappings under Uncertainty. In: Meersman, R., Dillon, T., Herrero, P. (eds) On the Move to Meaningful Internet Systems: OTM 2009. OTM 2009. Lecture Notes in Computer Science, vol 5871. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-05151-7_26
Download citation
DOI: https://doi.org/10.1007/978-3-642-05151-7_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-05150-0
Online ISBN: 978-3-642-05151-7
eBook Packages: Computer ScienceComputer Science (R0)