Skip to main content

Mining the Semantics of Text Via Counter-Training

  • Conference paper
Progress in Artificial Intelligence (EPIA 2005)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3808))

Included in the following conference series:

  • 1467 Accesses

Abstract

We report on a set of experiments in text mining, specifically, finding semantic patterns given only a few keywords. The experiments employ the Counter-training framework for discovery of semantic knowledge from raw text in a weakly supervised fashion. The experiments indicate that the framework is suitable for efficient acquisition of semantic word classes and collocation patterns, which may be used for Information Extraction.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Lin, W., Yangarber, R., Grishman, R.: Bootstrapped learning of semantic classes from positive and negative examples. In: Proc. ICML Workshop, Washington, DC (2003)

    Google Scholar 

  2. Yangarber, R.: Counter-training in discovery of semantic patterns. In: Proc. ACL 2003, Sapporo, Japan (2003)

    Google Scholar 

  3. Riloff, E.: Automatically generating extraction patterns from untagged text. In: Proc. 13th Natl. Conf. on AI, AAAI 1996 (1996)

    Google Scholar 

  4. Califf, M.E., Mooney, R.J.: Bottom-up relational learning of pattern matching rules for information extraction. J. Machine Learning Research 4 (2003)

    Google Scholar 

  5. Wilks, Y., Catizone, R.: Can we make information extraction more adaptive? In: Pazienza, M. (ed.) Information Extraction: Scalable, Adaptable Systems. LNCS (LNAI). Springer, Heidelberg (1999)

    Google Scholar 

  6. Yangarber, R., Grishman, R., Tapanainen, P., Huttunen, S.: Automatic acquisition of domain knowledge for information extraction. In: Proc. 18th Intl. Conf. Computational Linguistics (COLING 2000), Saarbrücken (2000)

    Google Scholar 

  7. Allan, J.: Relevance feedback with too much data. In: Proc. 18th International ACM SIGIR Conf. on R&D in IR, Seattle, Washington (1995)

    Google Scholar 

  8. Thelen, M., Riloff, E.: A bootstrapping method for learning semantic lexicons using extraction pattern contexts. In: Proc. EMNLP (2002)

    Google Scholar 

  9. Tapanainen, P., Järvinen, T.: A non-projective dependency parser. In: Proc. 5th Conf. Applied Natural Language Processing, Washington, D.C. (1997)

    Google Scholar 

  10. Meyers, A., Grishman, R., Kosaka, M.: Formal mechanisms for capturing regularizations. In: Proc. Language Resources and Evaluation Conf (LREC 2002), Las Palmas, Spain (2002)

    Google Scholar 

  11. Riloff, E., Jones, R.: Learning dictionaries for information extraction by multi-level bootstrapping. In: Proc. 16th Natl. Conf. on AI (AAAI 1999), Orlando, FL (1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Yangarber, R. (2005). Mining the Semantics of Text Via Counter-Training. In: Bento, C., Cardoso, A., Dias, G. (eds) Progress in Artificial Intelligence. EPIA 2005. Lecture Notes in Computer Science(), vol 3808. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11595014_63

Download citation

  • DOI: https://doi.org/10.1007/11595014_63

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-30737-2

  • Online ISBN: 978-3-540-31646-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics