Abstract
Text mining services can be used to extract and categorize entities from textual information on the web. Merging results from multiple services could improve extraction quality. This requires to have an integrated extraction taxonomy and corresponding mappings between individual taxonomies that are used for categorizing extracted information. However, current ontology matching approaches cannot be applied since the available meta data within most taxonomies is weak.
In this article we propose a novel taxonomy alignment process that allows us to automatically identify equal, hierarchical and associative mappings and integrate those mappings in a global taxonomy. We broadly evaluate our matching approach on real world service taxonomies and compare to state-of-the-art approaches.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Grimes, S.: Unstructured data and the 80 percent rule. Clarabridge Bridgepoints (2008). http://breakthroughanalysis.com/2008/08/01/unstructured-data-and-the-80-percent-rule/
Hotho, A., Nürnberger, A., Paaß, G.: A brief survey of text mining. LDV Forum 20(1), 19–62 (2005)
OpenCalais: Calais Homepage. March 2013. http://www.opencalais.com/
AlchemyAPI: AlchemyAPI Homepage. March 2013. http://www.alchemyapi.com/
Seidler, K., Schill, A.: Service-oriented information extraction. In: Proceedings of the Joint EDBT/ICDT Ph.D. Workshop 2011, pp. 25–31 (2011)
Evri: Evri Developer Homepage. June 2012. http://www.evri.com/developer/
FISE: Furtwangen IKS Semantic Engine project page. March 2013. http://wiki.iks-project.eu/index.php/FISE
Euzenat, J., Shvaiko, P.: Ontology Matching. Springer, New York (2007)
Rahm, E., Bernstein, P.A.: A survey of approaches to automatic schema matching. VLDB J. 10, 334–350 (2001)
Pfeifer, K., Peukert, E.: Mapping text mining taxonomies. In: KDIR 2013 Proceedings, Scitepress, Portugal (2013)
Isaac, A., van der Meij, L., Schlobach, S., Wang, S.: An empirical study of instance-based ontology matching. In: Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L.J.B., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.) ASWC 2007 and ISWC 2007. LNCS, vol. 4825, pp. 253–266. Springer, Heidelberg (2007)
Massmann, S., Rahm, E.: Evaluating instance-based matching of web directories. In: WebDB 2008 Proceedings (2008)
Jean-Mary, Y.R., Shironoshita, E.P., Kabuka, M.R.: Ontology matching with semantic verification. Web Semant. 7(3), 235–251 (2009)
Do, H.H., Rahm, E.: COMA - a system for flexible combination of schema matching approach. In: VLDB Proceedings (2002)
Chua, W.W.K., Kim, J.J.: Discovering cross-ontology subsumption relationships by using ontological annotations on biomedical literature. In: ICBO. CEUR Workshop Proceedings, vol. 897 (2012)
Shvaiko, P., Euzenat, J.: A survey of schema-based matching approaches. In: Spaccapietra, S. (ed.) Journal on Data Semantics IV. LNCS, vol. 3730, pp. 146–171. Springer, Heidelberg (2005)
Drumm, C., Schmitt, M., Do, H.H., Rahm, E.: QuickMig: automatic schema matching for data migration projects. In: CIKM’07 Proceedings (2007)
Li, J., Tang, J., Li, Y., Luo, Q.: RiMOM: a dynamic multistrategy ontology alignment framework. TKDE 21(8), 1218–1232 (2009)
Hu, W., Qu, Y.: Falcon-AO: a practical ontology matching system. Web Semant. 6(3), 237–239 (2008)
Suchanek, F.M., Abiteboul, S., Senellart, P.: Paris: probabilistic alignment of relations, instances, and schema. In: Proceedings of the VLDB Endowment, vol. 5(3), pp. 157–168 (2011)
Saleem, K., Bellahsene, Z., Hunt, E.: PORSCHE: performance oriented schema mediation. Inf. Syst. 33, 637–657 (2008)
Raunich, S., Rahm, E.: ATOM: automatic target-driven ontology merging. In: ICDE Proceedings, pp. 1276–1279 (2011)
Castano, S., Antonellis, V.D., Vimercati, S.D.C.D., Melchiori, M.: An xml-based integration scheme for web datasources. Ingénierie des Systèmes d’Information 6(1), 99–122 (2001)
Dragut, E.C., Wu, W., Sistla, A.P., Yu, C.T., Meng, W.: Merging Source Query Interfaces on Web Databases. In: ICDE Proceedings, vol. 46 (2006)
Beneventano, D., Bergamaschi, S., Guerra, F., Vincini, M.: The MOMIS approach to information integration. In: ICEIS 2001 Proceedings, Setubal, Portugal, vol. 1, pp. 194–198 July 2001
Pfeifer, K., Meinecke, J.: Identifying the truth - aggregation of named entity extraction results. In: iiWAS’13 Proceedings, ACM, Austria (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pfeifer, K., Peukert, E. (2015). Integration of Text Mining Taxonomies. In: Fred, A., Dietz, J., Liu, K., Filipe, J. (eds) Knowledge Discovery, Knowledge Engineering and Knowledge Management. IC3K 2013. Communications in Computer and Information Science, vol 454. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-46549-3_3
Download citation
DOI: https://doi.org/10.1007/978-3-662-46549-3_3
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-46548-6
Online ISBN: 978-3-662-46549-3
eBook Packages: Computer ScienceComputer Science (R0)