Skip to main content

Knowledge acquisition of predicate argument structures from technical texts using Machine Learning: the system Asium

  • Conference paper
  • First Online:
Knowledge Acquisition, Modeling and Management (EKAW 1999)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1621))

Abstract

In this paper, we describe the Machine Learning system, asium1, which learns Subcaterorization Frames of verbs and ontologies from the syntactic parsing of technical texts in natural language. The restrictions of selection in the subcategorization frames are filled by the ontology’s concepts. Applications requiring such knowledge are crucial and numerous. The most direct applications are semantic control of texts and syntactic parsing disambiguation. This knowledge acquisition task cannot be fully automatically performed. Instead,we propose a cooperative ML method which provides the user with a global view of the acquisition task and also with acquisition tools like automatic concepts splitting, example generation, and an ontology view with attachments to the verbs. Validation steps using these features are intertwined with learning steps so that the user validates the concepts as they are learned. Experiments performed on two diérent corpora (cooking domain and patents) give very promising results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. R. Basili and M. T. Pazienza. Lexical Acquisition for Information Extraction. In Maria Teresa Pazienza, editor, Information Extraction: A Multidisciplinary Approach to an Emerging Information Technology, pages 14–18, Frascati, Italy, July 1997. LNAI Tutorial, Springer.

    Google Scholar 

  2. D. Bourigault, I. Gonzalez-Mullier, and C. Gros. LEXTER, a Natural Language Processing Tool for Terminology Extraction. In 7th EURALEX International Congress, Göoteborg, August 1996.

    Google Scholar 

  3. M. R. Brent. Automatic acquisition of subcategorization frames from untagged text. In Proceedings of the 29st annual meeting of the Association for Computational Linguistics, ACL, pages 209–214, 1991.

    Google Scholar 

  4. S. Buchholz. Distinguishing Complements from Adjuncts using Memory-Based Learning. In Proceedings of the ESSLLI’98 workshop on Automated Acquisition of Syntax and Parsing, 1998.

    Google Scholar 

  5. P. Constant. L’analyseur Linguistique SYLEX. In 5éme Éole d’été du CNET, 1995.

    Google Scholar 

  6. D. Faure and C. Nédellec. A Corpus-based Conceptual Clustering Method for Verb Frames and Ontology Acquisition. In Paola Velardi, editor, LREC workshop on Adapting lexical and corpus ressources to sublanguages and applications, pages 5-12, Granada, Spain, May 1998.

    Google Scholar 

  7. G. Grefenstette. Sextant: exploring unexplored contexts for semantic extraction from syntactic analysis. In Proceedings of the 30st annual meeting of the Association for Computational Linguistics, ACL, 1992. 14–18.

    Google Scholar 

  8. R. Grishman and J. Sterling. Generalizing Automatically Generated Selectional Patterns. Proceedings of COLING’ 94 15th International Conference on Computational Linguistics, Kyoto, Japan, August 1994.

    Google Scholar 

  9. Z. Harris. Mathematical Structures of Language. New York: Wiley, 1968.

    MATH  Google Scholar 

  10. D. Hindle. Noun classiffcation from predicate-argument structures. In Proceedings of the 28st annual meeting of the Association for Computational Linguistics, ACL, Pittsburgh, PA, pages 1268–1275, 1990.

    Google Scholar 

  11. H.J. Peat and P. Willet. The limitations of term co-occurrence data for query expansion in document retrieval systems. Journal of the American Society for Information Science, 42(5):378–383, 1991.

    Article  Google Scholar 

  12. F. Pereira, N. Tishby, and L. Lee. Distributional Clustering of English Words. In Proceedings of the 31st annual meeting of the Association for Computational Linguistics, ACL, pages 183–190, 1993.

    Google Scholar 

  13. M.L.G. Shaw and B. R. Gaines. Comparing conceptual structures: consensus, conict, correspondence and contrast. In Knowledge Acquisition, volume 1, pages 341–363, 1989.

    Article  Google Scholar 

  14. C. A. Thompson. Acquisition of a Lexicon from Semantic Representations of Sentences. In 33rd Annual Meeting of the Association of Computational Linguistics, Boston, MA July, (ACL-95)., pages 335–337, 1995.

    Google Scholar 

  15. J. M. Zelle and R. J. Mooney. Learning semantic grammars with constructive inductive logic programming. Proceedings of the Eleventh National Conference on Artificial Intelligence, pages 817–822, 1993.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1999 Springer-Verlag

About this paper

Cite this paper

Faure, D., Nédellec, C. (1999). Knowledge acquisition of predicate argument structures from technical texts using Machine Learning: the system Asium . In: Fensel, D., Studer, R. (eds) Knowledge Acquisition, Modeling and Management. EKAW 1999. Lecture Notes in Computer Science(), vol 1621. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48775-1_22

Download citation

  • DOI: https://doi.org/10.1007/3-540-48775-1_22

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-66044-6

  • Online ISBN: 978-3-540-48775-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics