Skip to main content

Mining for Lexons: Applying Unsupervised Learning Methods to Create Ontology Bases

  • Conference paper
On The Move to Meaningful Internet Systems 2003: CoopIS, DOA, and ODBASE (OTM 2003)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2888))

Abstract

Ontologies in current computer science parlance are computer based resources that represent agreed domain semantics. This paper first introduces ontologies in general and subsequently, in particular, shortly outlines the DOGMA ontology engineering approach that separates “atomic” conceptual relations from “predicative” domain rules. In the main part of the paper, we describe and experimentally evaluate work in progress on a potential method to automatically derive the atomic conceptual relations mentioned above from a corpus of English medical texts. Preliminary outcomes are presented based on the clustering of nouns and compound nouns according to co-occurrence frequencies in the subject-verb-object syntactic context.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agirre, E., Martinez, D.: Learning class-to-class selectional preferences. In: Proceedings CoNLL 2001 (2001)

    Google Scholar 

  2. Berland, M., Charniak, E.: Finding parts in very large corpora. In: Proc. of ACL 1999 (1999)

    Google Scholar 

  3. Bourigault, D., Jacquemin, C.: Term extraction + term clustering: An integrated platform for computer-aided terminology. In: Proceedings of EACL 1999 (1999)

    Google Scholar 

  4. Buchholz, S.: Memory-Based Grammatical Relation Finding (1999)

    Google Scholar 

  5. Buchholz, S., Veenstra, J., Daelemans, W.: Cascaded grammatical relation assignment. PrintPartners Ipskamp (2002)

    Google Scholar 

  6. Caraballo, S.: Automatic construction of a hypernym-labeled noun hierarchy from text. In: Proceedings of ACL 1999 (1999)

    Google Scholar 

  7. Caraballo, S., Charniak, E.: Determining the specificity of nouns from text. In: Proceedings of SIGDAT 1999 (1999)

    Google Scholar 

  8. Cheeseman, P., Stutz, J.: Bayesian classification (autoclass): Theory and results. In: Advances in Knowledge Discovery and Data Mining, pp. 153–180 (1996)

    Google Scholar 

  9. Daelemans, W., Buchholz, S., Veenstra, J.: Memory-based shallow parsing. In: Proceedings of CoNLL 1999 (1999)

    Google Scholar 

  10. Faure, D., Nédellec, C.: Knowledge acquisition of predicate argument structures from technical texts using machine learning: The system Asium. In: Fensel, D., Studer, R. (eds.) EKAW 1999. LNCS (LNAI), vol. 1621, pp. 329–334. Springer, Heidelberg (1999)

    Chapter  Google Scholar 

  11. Gamallo, P., Agustini, A., Lopes, G.: Selection restrictions acquisition from corpora. In: Brazdil, P.B., Jorge, A.M. (eds.) EPIA 2001. LNCS (LNAI), vol. 2258, p. 30. Springer, Heidelberg (2001)

    Google Scholar 

  12. Gamallo, P., Alexandre Agustini, A., Lopes, G.: Using co-composition for acquiring syntactic and semantic subcategorisation. In: Proceedings of the Workshop SIGLEX 2002, ACL 2002 (2002)

    Google Scholar 

  13. Gamallo, P., Gasperin, C., Agustini, A., Lopes, G.: Syntactic-based methods for measuring word similarity. In: Matoušek, V., Mautner, P., Mouček, R., Tauser, K. (eds.) TSD 2001. LNCS (LNAI), vol. 2166, p. 116. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  14. Genesereth, M., Nilsson, N.: Logical Foundations of Artificial Intelligence. Morgan Kaufmann, San Francisco (1987)

    MATH  Google Scholar 

  15. Gómez-Pérez, A., Manzano-Macho, D. (eds.): A survey of ontology learning methods and techniques. OntoWeb Deliverable #D1.5, Univ. Politécnica de Madrid (2003)

    Google Scholar 

  16. Grishman, R., John Sterling, J.: Generalizing automatically generated selectional patterns. In: Proceedings of COLING 1994 (1994)

    Google Scholar 

  17. Guarino, N., Giaretta, P.: Ontologies and knowledge bases: Towards a terminological clarification. In: Mars, N. (ed.) Towards Very Large Knowledge Bases: Knowledge Building and Knowledge Sharing, pp. 25–32. IOS Press, Amsterdam (1995)

    Google Scholar 

  18. Humphreys, B., Lindberg, D.: The unified medical language system project: a distributed experiment in improving access to biomedical information. In: Lun K.C. (ed.), Proc. of the 7th World Congress on Medical Informatics (MEDINFO92), pp. 1496–1500 (1992)

    Google Scholar 

  19. Jarrar, M., Meersman, R.: Formal ontology engineering in the dogma approach. In: Meersman, R., Tari, Z., et al. (eds.) CoopIS 2002, DOA 2002, and ODBASE 2002. LNCS, vol. 2519, pp. 1238–1254. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  20. Lin, D.: Automatic retrieval and clustering of similar words. In: Proceedings of COLINGACL 1998 (1998)

    Google Scholar 

  21. Maedche, A., Staab, S.: Discovering conceptual relations from text. Technical Report 399, Institute AIFB, Karlsruhe University (2000)

    Google Scholar 

  22. Maedche, A., Staab, S.: Ontology learning for the semantic web. IEEE Intelligent Systems 16 (2001)

    Google Scholar 

  23. McCarthy, D., Carroll, J., Preiss, J.: Disambiguating noun and verb senses using automatically acquired selectional preferences. SENSEVAL-2 (2001)

    Google Scholar 

  24. Miller, G.: Wordnet: a lexical database for english. Comm. of the ACM 38(11), 39–41 (1995)

    Article  Google Scholar 

  25. Nazarenko, A., Zweigenbaum, P., Bouaud, J., Habert, B.: Corpus-based identification and refinement of semantic classes. In: Masys, R. (ed.) Proceeding of the AMIA Annual Fall Symposium - JAMIA Supplement, pp. 585–589. AMIA (1997)

    Google Scholar 

  26. Pantel, P., Lin, D.: Discoveringword senses from text. In: Proceedings of ACMSIGKDD 2002 (2002)

    Google Scholar 

  27. Peeters, S., Kaufner, S.: State of the art in crosslingual information access for medical information. Technical report, CSLI (2001)

    Google Scholar 

  28. Pustejovsky, J.: The Generative Lexicon. MIT Press, Cambridge (1995)

    Google Scholar 

  29. Reinberger, M.-L., Daelemans, W.: Is shallow parsing useful for the unsupervised learning of semantic clusters? In: Gelbukh, A. (ed.) CICLing 2003. LNCS, vol. 2588. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  30. Reinberger, M.-L., Decadt, B., Daelemans, W.: On the relevance of performing shallow parsing before clustering. In: Computational Linguistics in the Netherlands 2002 (CLIN 2002), Groningen, The Netherlands (2002)

    Google Scholar 

  31. Reiter, R.: Readings in AI and Databases, chapter Towards a Logical Reconstruction of Relational Database Theory. Morgan Kaufmann, San Francisco (1988)

    Google Scholar 

  32. Ribière, M., Charlton, P.: Ontology overview motorola labs. Technical report, Networking and Applications Lab – Centre de Recherche de Motorola Paris (2000)

    Google Scholar 

  33. Spyns, P., Meersman, R., Jarrar, M.: Data modelling versus ontology engineering. SIGMOD Record Special Issue 31(4) (2002)

    Google Scholar 

  34. Tarski, A.: Problems in the philosophy of language, chapter The semantic concept of truth. Holt, Rinehart &Winston, NewYork (1969)

    Google Scholar 

  35. Ushold, M., Gruninger, M.: Ontologies: Principles, methods and applications. The Knowledge Engineering Review 11(2), 93–155 (1996)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Reinberger, ML., Spyns, P., Daelemans, W., Meersman, R. (2003). Mining for Lexons: Applying Unsupervised Learning Methods to Create Ontology Bases. In: Meersman, R., Tari, Z., Schmidt, D.C. (eds) On The Move to Meaningful Internet Systems 2003: CoopIS, DOA, and ODBASE. OTM 2003. Lecture Notes in Computer Science, vol 2888. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39964-3_51

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-39964-3_51

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-20498-5

  • Online ISBN: 978-3-540-39964-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics