Abstract
Matching life science ontologies to determine ontology mappings has recently become an active field of research. The large size of existing ontologies and the application of complex match strategies for obtaining high quality mappings makes ontology matching a resource- and time-intensive process. To improve performance we investigate different approaches for parallel matching on multiple compute nodes. In particular, we consider inter-matcher and intra-matcher parallelism as well as the parallel execution of element- and structure-level matching. We implemented a distributed infrastructure for parallel ontology matching and evaluate different approaches for parallel matching of large life science ontologies in the field of anatomy and molecular biology.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aumueller, D., Do, H.H., Massmann, S., Rahm, E.: Schema and ontology matching with COMA++. In: Proc. of ACM SIGMOD Intl. Conference on Management of Data, pp. 906–908 (2005)
Bastian, F., Parmentier, G., Roux, J., et al.: Bgee: Integrating and Comparing Heterogeneous Transcriptome Data Among Species. In: Bairoch, A., Cohen-Boulakia, S., Froidevaux, C. (eds.) DILS 2008. LNCS (LNBI), vol. 5109, pp. 124–131. Springer, Heidelberg (2008)
Bernstein, P.A., Melnik, S., Petropoulos, M., Quix, C.: Industrial Strength Schema Matching. ACM SIGMOD Record 33(4), 38–43 (2004)
Bodenreider, O., Burgun, A.: Linking the Gene Ontology to other biological ontologies. In: Proc. of 8th ISMB Meeting on Bio-Ontologies, pp. 17–18 (2005)
Bodenreider, O., Stevens, R.: Bio-ontologies: current trends and future directions. Briefings in Bioinformatics 7(3), 256–274 (2006)
Do, H.H., Rahm, E.: COMA – A System for Flexible Combination of Schema Matching Approaches. In: Proc. of the 28th Intl. Conference on Very Large Databases (VLDB), pp. 610–621 (2002)
Do, H.H., Rahm, E.: Matching large schemas: Approaches and evaluation. Information Systems 32(6), 857–885 (2007)
Ehrig, M., Staab, S.: QOM – Quick Ontology Mapping. In: McIlraith, S.A., Plexousakis, D., van Harmelen, F. (eds.) ISWC 2004. LNCS, vol. 3298, pp. 683–697. Springer, Heidelberg (2004)
Euzenat, J., Shvaiko, P.: Ontology Matching. Springer, Heidelberg (2007)
The Gene Ontology Consortium: The Gene Ontology project in 2008. Nucleic Acids Research 36(Database issue), D440–D444 (2008)
Gross, A., Hartung, M., Kirsten, T., Rahm, E.: Estimating the Quality of Ontology-Based Annotations by Considering Evolutionary Changes. In: Paton, N.W., Missier, P., Hedeler, C. (eds.) DILS 2009. LNCS (LNBI), vol. 5647, pp. 71–87. Springer, Heidelberg (2009)
Hartung, M., Kirsten, T., Rahm, E.: Analyzing the Evolution of Life Science Ontologies and Mappings. In: Bairoch, A., Cohen-Boulakia, S., Froidevaux, C. (eds.) DILS 2008. LNCS (LNBI), vol. 5109, pp. 11–27. Springer, Heidelberg (2008)
Hayamizu, T.F., Mangan, M., Corradi, J.P., Kadin, J.A., Ringwald, M.: The Adult Mouse Anatomical Dictionary: a tool for annotating and integrating data. Genome Biology 6(3), R29 (2005)
Hu, W., Qu, Y., Cheng, G.: Matching large ontologies: A divide-and-conquer approach. Data & Knowledge Engineering 67(1), 140–160 (2008)
Jakoniene, V., Lambrix, P.: Ontology-based integration for bioinformatics. In: Proc. VLDB Workshop on Ontologies-based techniques for Databases and Information Systems (ODBIS), pp. 55–58 (2005)
Kirsten, T., Hartung, M., Gross, A., Rahm, E.: Efficient Management of Biomedical Ontology Versions. In: Meersman, R., Herrero, P., Dillon, T.S. (eds.): On the Move to Meaningful Internet Systems Workshops. Proceedings. LNCS, vol. 4544, pp. 172-187. Springer, Heidelberg (2007)
Kirsten, T., Thor, A., Rahm, E.: Instance-based matching of large life science ontologies. In: Bairoch, A., Cohen-Boulakia, S., Froidevaux, C. (eds.) DILS 2008. LNCS (LNBI), vol. 5109, pp. 11–27. Springer, Heidelberg (2008)
Lambrix, P., Edberg, A.: Evaluation of ontology merging tools in bioinformatics. In: Proc. of the 8th Pacific Symposium on Biocomputing, pp. 589–600 (2003)
Lambrix, P., Tan, H., Jakoniene, V., Strömbäck, L.: Biological Ontologies. In: Semantic Web: Revolutionizing Knowledge Discovery in the Life Sciences, pp. 85–99 (2007)
Ontology Alignment Evaluation Initiative, http://20.ontologymatching.org/
Melnik, S., Garcia-Molina, H., Rahm, E.: Similarity Flooding: A Versatile Graph Matching Algorithm and Its Application to Schema Matching. In: Proc. of the 18th Intl. Conference on Data Engineering (ICDE), pp. 117–128 (2002)
Mork, P., Bernstein, P.A.: Adapting a Generic Match Algorithm to Align Ontologies of Human Anatomy. In: Proc. of the 20th Intl. Conference on Data Engineering (ICDE), pp. 787–790 (2004)
Peukert, E., Berthold, H., Rahm, E.: Rewrite Techniques for performance Optimization of Schema Matching Processes. In: Proc. 13th Intl. Conference on Extending Database Technology (EDBT), pp. 453–464 (2010)
Rahm, E., Bernstein, P.A.: A survey of approaches to automatic schema matching. VLDB Journal 10(4), 334–350 (2001)
Rahm, E., Do, H.H., Massmann, S.: Matching large XML schemas. ACM SIGMOD Record 33(4), 26–31 (2004)
Rance, B., Gibrat, J.F., Froidevaux, C.: An Adaptive Combination of Matchers: Application to the Mapping of Biological Ontologies for Genome Annotation. In: Paton, N.W., Missier, P., Hedeler, C. (eds.) DILS 2009. LNCS (LNBI), vol. 5647, pp. 113–126. Springer, Heidelberg (2009)
Saleem, K., Bellahsene, Z., Hunt, E.: PORSCHE: Performance ORiented SCHEma mediation. Information Systems 33(7-8), 637–657 (2008)
Seddiqui, H., Aono, M.: An efficient and scalable algorithm for segmented alignment of ontologies of arbitrary size. Web Semantics: Science, Services and Agents on the World Wide Web 7(4), 344–356 (2009)
Shvaiko, P., Euzenat, J.: Ten challenges for ontology matching. In: Proc. of on the Move to Meaningful Internet Systems (OTM), pp. 1164–1182 (2008)
Sioutos, N., de Coronado, S., Haber, M.W., et al.: NCI Thesaurus: A semantic model integrating cancer-related clinical and molecular information. Journal of Biomedical Informatics 40(1), 30–43 (2007)
Smith, B., Ashburner, M., Rosse, C., et al.: The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration. Nature Biotechnology 25(11), 1251–1255 (2007)
Thor, A., Hartung, M., Gross, A., Kirsten, T., Rahm, E.: An evolution-based approach for assessing ontology mappings - A case study in the life sciences. In: Proc. Conference of the Business, Technology and Web (BTW), pp. 277–286 (2009)
Zhang, S., Bodenreider, O.: Aligning Representations of Anatomy using Lexical and Structural Methods. In: Proc. of AMIA Annual Symposium, pp. 753–757 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gross, A., Hartung, M., Kirsten, T., Rahm, E. (2010). On Matching Large Life Science Ontologies in Parallel. In: Lambrix, P., Kemp, G. (eds) Data Integration in the Life Sciences. DILS 2010. Lecture Notes in Computer Science(), vol 6254. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15120-0_4
Download citation
DOI: https://doi.org/10.1007/978-3-642-15120-0_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15119-4
Online ISBN: 978-3-642-15120-0
eBook Packages: Computer ScienceComputer Science (R0)