Mining the Semantics of Text Via Counter-Training

Yangarber, Roman

doi:10.1007/11595014_63

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3808)

Included in the following conference series:

Portuguese Conference on Artificial Intelligence

Abstract

We report on a set of experiments in text mining, specifically, finding semantic patterns given only a few keywords. The experiments employ the Counter-training framework for discovery of semantic knowledge from raw text in a weakly supervised fashion. The experiments indicate that the framework is suitable for efficient acquisition of semantic word classes and collocation patterns, which may be used for Information Extraction.

Author information

Authors and Affiliations

Department of Computer Science, University of Helsinki, Finland
Roman Yangarber

Editor information

Editors and Affiliations

Portugal Telecom Inovação (PTI), Centro de Informatica e Sistemas da Universidade de Coimbra (CISUC)
Carlos Bento
Department of Informatics Engineering, Coimbra University, Portugal
Amílcar Cardoso
Centre of Human Language Technology and Bioinformatics, University of Beira Interior, 6201-001, Covilhã, Portugal
Gaël Dias

About this paper

Cite this paper

Yangarber, R. (2005). Mining the Semantics of Text Via Counter-Training. In: Bento, C., Cardoso, A., Dias, G. (eds) Progress in Artificial Intelligence. EPIA 2005. Lecture Notes in Computer Science, vol 3808. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11595014_63

DOI: https://doi.org/10.1007/11595014_63
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-30737-2
Online ISBN: 978-3-540-31646-6
eBook Packages: Computer Science, Computer Science (R0)
