Abstract
In sequence labelling, when the label of a token in the sequence is changed, the output probability of the other tokens in the same sequence would also change. We propose a new active learning framework for sequence labelling which take the change of probability into account. At each iteration of the proposed method, every time the human annotator manually annotates a token, the output probabilities of the other tokens in the sequence are re-estimated. This proposed method is expected to reduce the amount of human annotation required for obtaining a high labelling performance. Through experiments on the NP chunking dataset provided by CoNLL, we empirically show that the proposed method works well.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Cohn, D., Atlas, L., Ladner, R.: Improving generalization with active learning. Machine Learning 15(2), 201–221 (1994)
Tomanek, K., Hahn, U.: Semi-supervised active learning for sequence labeling. In: ACL-IJCNLP 2007: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, Suntec, Singapore, pp. 1039–1047. Association for Computational Linguistics (August 2009)
Settles, B., Craven, M.: An analysis of active learning strategies for sequence labeling tasks. In: EMNLP 2008: Proceedings of the Conference on Empirical Methods in Natural Language Processing, Morristown, NJ, USA, pp. 1070–1079. Association for Computational Linguistics (2008)
Culotta, A., McCallum, A.: Reducing labeling effort for structured prediction tasks. In: AAAI 2005: Proceedings of the 20th National Conference on Artificial Intelligence, pp. 746–751. AAAI Press, Menlo Park (2005)
Lafferty, J.D., McCallum, A., Pereira, F.C.N.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: ICML 2001: Proceedings of the Eighteenth International Conference on Machine Learning, pp. 282–289. Morgan Kaufmann Publishers Inc., San Francisco (2001)
Tjong Kim Sang, E.F., Buchholz, S.: Introduction to the conll-2000 shared task: chunking. In: CoNLL 2000: Proceedings of the 2nd Workshop on Learning Language in Logic and the 4th Conference on Computational Natural Language Learning, Morristown, NJ, USA, pp. 127–132. Association for Computational Linguistics (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wanvarie, D., Takamura, H., Okumura, M. (2010). Active Learning for Sequence Labelling with Probability Re-estimation. In: Zhang, BT., Orgun, M.A. (eds) PRICAI 2010: Trends in Artificial Intelligence. PRICAI 2010. Lecture Notes in Computer Science(), vol 6230. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15246-7_69
Download citation
DOI: https://doi.org/10.1007/978-3-642-15246-7_69
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15245-0
Online ISBN: 978-3-642-15246-7
eBook Packages: Computer ScienceComputer Science (R0)