Extracting Meronymy Relationships from Domain-Specific, Textual Corporate Databases

Ittoo, Ashwin; Bouma, Gosse; Maruster, Laura; Wortmann, Hans

doi:10.1007/978-3-642-13881-2_5

Ashwin Ittoo²⁰,
Gosse Bouma²⁰,
Laura Maruster²⁰ &
…
Hans Wortmann²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6177))

Included in the following conference series:

International Conference on Application of Natural Language to Information Systems

1343 Accesses
4 Citations

Abstract

Various techniques for learning meronymy relationships from opendomain corpora exist. However, extracting meronymy relationships from domain-specific, textual corporate databases has been overlooked, despite numerous application opportunities particularly in domains like product development and/or customer service. These domains also pose new scientific challenges, such as the absence of elaborate knowledge resources, compromising the performance of supervised meronymy-learning algorithms. Furthermore, the domain-specific terminology of corporate texts makes it difficult to select appropriate seeds for minimally-supervised meronymy-learning algorithms. To address these issues, we develop and present a principled approach to extract accurate meronymy relationships from textual databases of product development and/or customer service organizations by leveraging on reliable meronymy lexico-syntactic patterns harvested from an open-domain corpus. Evaluations on real-life corporate databases indicate that our technique extracts precise meronymy relationships that provide valuable operational insights on causes of product failures and customer dissatisfaction. Our results also reveal that the types of some of the domain-specific meronymy relationships, extracted from the corporate data, cannot be conclusively and unambiguously classified under wellknown taxonomies of relationships.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Berland, M., Charniak, E.: Finding parts in very large corpora. In: 37th Annual Meeting of the Association for Computational Linguistics, pp. 57–64. University of Maryland (1999)
Google Scholar
Girju, R., Badulescu, A., Moldovan, D.: Automatic Discovery of Part-Whole Relations. Computational Linguistics 32, 83–135 (2006)
Google Scholar
Girju, R., Badulescu, A., Moldovan, D.: Learning semantic constraints for the automatic discovery of part-whole relations. In: Conference of the NAACL on HLT, pp. 1–8. Association for Computational Linguistics, Morristown (2003)
Google Scholar
Iris, M., Litowitz, B., Evens, M.: Problems with part-whole Relation. In: Evens, M.W. (ed.) Relational Models of the Lexicon: Representing Knowledge in Semantic Networks, pp. 261–288. Cambridge University Press, Cambridge (1988)
Google Scholar
Ittoo, A., Maruster, L., Wortmann, H., Bouma, G.: Textractor: A Framework for Extracting Relevant Domain Concepts from Irregular Corporate Textual Datasets. In: Abramowicz, W., Tolksdorf, R. (eds.) BIS 2010. LNBIP, vol. 47, pp. 71–82. Springer, Heidelberg (2010)
Google Scholar
Jijkoun, V., de Rijke, M., Mur, J.: Information extraction for question answering: improving recall through syntactic patterns. In: 20^th Intl Conference on Computational Linguistics. Association for Computational Linguistics, Morristown, NJ, USA
Google Scholar
Justeson, J., Katz, S.M.: Technical terminology: some linguistic properties and an algorithm for identification in text. Natural Language Engineering 1, 9–27 (1995)
Article Google Scholar
Keet, C.M., Artale, A.: Representing and reasoning over a taxonomy of part-whole relations. Appl. Ontol. 3, 91–110 (2008)
Google Scholar
Medelyan, O., Milne, D., Legg, C., Witten, I.H.: Mining meaning from Wikipedia. International Journal of Human-Computer Studies 67, 716–754 (2009)
Article Google Scholar
Pantel, P., Pennacchiotti, M.: Espresso: leveraging generic patterns for automatically harvesting semantic relations. In: 21^st Intl Conference on Computational Linguistics, Association for Computational Linguistics, Morristown, NJ, USA, pp. 113–220 (2006)
Google Scholar
Shen, D., Kruijff, K.G.-J., Klakow, D.: Exploring syntactic relation patterns for Question-Answering. In: Dale, R., Wong, K.-F., Su, J., Kwong, O.Y. (eds.) IJCNLP 2005. LNCS (LNAI), vol. 3651, pp. 507–518. Springer, Heidelberg (2005)
Chapter Google Scholar
van Hage, W.R., Kolb, H., Schreiber, G.: A method for learning part-whole relations. In: Cruz, I., Decker, S., Allemang, D., Preist, C., Schwabe, D., Mika, P., Uschold, M., Aroyo, L.M. (eds.) ISWC 2006. LNCS, vol. 4273, pp. 723–735. Springer, Heidelberg (2006)
Chapter Google Scholar
Winston, M., Chaffin, R., Hermann, D.: A taxonomy of part-whole relations. Cognitive Science 11, 417–444 (1987)
Article Google Scholar
English Wikipedia (2007-08-02), ISLA, University of Amsterdam, http://ilps.science.uva.nl/WikiXML/
Stanford Natural Language Processing Group, http://nlp.stanford.edu/software/index.shtml

Download references

Author information

Authors and Affiliations

University of Groningen, 9747, AE, Groningen, The Netherlands
Ashwin Ittoo, Gosse Bouma, Laura Maruster & Hans Wortmann

Authors

Ashwin Ittoo
View author publications
You can also search for this author in PubMed Google Scholar
Gosse Bouma
View author publications
You can also search for this author in PubMed Google Scholar
Laura Maruster
View author publications
You can also search for this author in PubMed Google Scholar
Hans Wortmann
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Engineering, Cardiff University, UK
Christina J. Hopfe & Haijiang Li &
Informatics Research Institute, University of Salford, M5 4WT, Greater Manchester, UK
Yacine Rezgui
Centre National des Arts et Métiers,
Elisabeth Métais
School of Computer Science, Cardiff University, UK
Alun Preece

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ittoo, A., Bouma, G., Maruster, L., Wortmann, H. (2010). Extracting Meronymy Relationships from Domain-Specific, Textual Corporate Databases. In: Hopfe, C.J., Rezgui, Y., Métais, E., Preece, A., Li, H. (eds) Natural Language Processing and Information Systems. NLDB 2010. Lecture Notes in Computer Science, vol 6177. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13881-2_5

Download citation

DOI: https://doi.org/10.1007/978-3-642-13881-2_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13880-5
Online ISBN: 978-3-642-13881-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics