Abstract
In this chapter, we present relational learning algorithms for two natural language processing tasks, semantic parsing and information extraction. We describe the algorithms and present experimental results showing their effectiveness. We also describe our application of active learning techniques to these learning systems.We applied certainty-based selective sampling to each system, using fairly simple notions of certainty. We show that these selective sampling techniques greatly reduce the number of annotated examples required for the systems to achieve good generalization performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bennett, S., Aone, C., & Lovell, C. (1997). Learning to tag multilingual texts through observation. In Proceedings of the Second Conference on Empirical Methods in Natural Language Processing, pp. 109–116.
Berwick, B. (1985). The Acquisition of Syntactic Knowledge. MIT Press, Cambridge, MA.
Borland International (1988). Turbo Prolog 2.0 Reference Guide. Borland International, Scotts Valley, CA.
Brill, E. (1994). Some advances in rule-based part of speech tagging. In Proceedings of the Eleventh National Conference on Artificial Intelligence, pp. 722–727 Washington, D.C.
Briscoe, T., & Carroll, J. (1993). Generalized probabilistic LR parsing of natural language (corpora) with unification-based grammars. Computational Linguistics, 19(1), 25–59.
Cali., M., & Mooney, R. (1999). Relational learning of pattern-match rules for information extraction. In Proceedings of the Sixteenth National Conference on Artificial Intelligence, pp. 328–334 Orlando, FL.
Cohn, D., Atlas, L., & Ladner, R. (1994). Improving generalization with active learning. Machine Learning, 15(2), 201–221.
Dagan, I., & Engelson, S. P. (1995). Committee-based sampling for training probabilistic classifiers. In Proceedings of the Twelfth International Conference on Machine Learning, pp. 150–157 San Francisco, CA. Morgan Kaufman.
Fellbaum, C. (1998). WordNet: An Electronic Lexical Database. MIT Press, Cambridge, MA.
Fillmore, C. J. (1968). The case for case. In Bach, E., & Harms, R. T. (Eds.), Universals in Linguistic Theory. Holt, Reinhart and Winston, New York.
Freitag, D. (2000). Machine learning for information extraction in informal domains. Machine Learning, 39(2/3), 169–202.
Freitag, D. (1998). Multi-strategy learning for information extraction. In Proceedings of the Fifteenth International Conference on Machine Learning, pp. 161–169.
Freund, Y., Seung, H. S., Shamir, E., & Tishby, N. (1997). Selective sampling using the query by committee algorithm. Machine Learning, 28, 133–168.
Holte, R. C., Acker, L., & Porter, B. (1989). Concept learning and the problem of small disjuncts. In Proceedings of the Eleventh International Joint Conference on Artificial Intelligence, pp. 813–818 Detroit, MI.
Junker, M., Sintek, M., & Rinck, M. (2000). Learning for text categorization and information extraction with ILP. In This volume.
Lehnert, W., & Sundheim, B. (1991). A performance evaluation of textanalysis technologies. AI Magazine, 12(3), 81–94.
Lewis, D. D., & Catlett, J. (1994). Heterogeneous uncertainty sampling for supervised learning. In Proceedings of the Eleventh International Conference on Machine Learning, pp. 148–156 New Brunswick, NJ. Morgan Kaufman.
Liere, R., & Tadepalli, P. (1997). Active learning with committees for text categorization. In Proceedings of the Fourteenth National Conference on Artificial Intelligence, pp. 591–596 Providence, RI.
Magerman, D. M. (1995). Statistical decision-tree models for parsing. In Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics, pp. 276–283 Cambridge, MA.
Muggleton, S., & Feng, C. (1990). Efficient induction of logic programs. In Proceedings of the First Conference on Algorithmic Learning Theory Ohmsha, Tokyo, Japan.
Plotkin, G. D. (1970). A note on inductive generalization. In Meltzer, B., & Michie, D. (Eds.), Machine Intelligence (Vol. 5). Elsevier North-Holland, New York.
Quinlan, J. (1990). Learning logical definitions from relations. Machine Learning, 5(3), 239–266.
Simmons, R. F., & Yu, Y. (1992). The acquisition and use of context dependent grammars for Engl ish. Computational Linguistics, 18(4), 391–418.
Soderland, S. (1999). Learning information extraction rules for semistructured and free text. Machine Learning, 34, 233–272.
Zelle, J. M., & Mooney, R. J. (1996). Learning to parse database queries using inductive logic programming. In Proceedings of the Thirteenth National Conference on Artificial Intelligence Portland, OR.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Thompson, C.A., Elaine Califf, M. (2000). Improving Learning by Choosing Examples Intelligently in Two Natural Language Tasks. In: Cussens, J., Džeroski, S. (eds) Learning Language in Logic. LLL 1999. Lecture Notes in Computer Science(), vol 1925. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-40030-3_18
Download citation
DOI: https://doi.org/10.1007/3-540-40030-3_18
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41145-1
Online ISBN: 978-3-540-40030-1
eBook Packages: Springer Book Archive