Active Hidden Markov Models for Information Extraction

Scheffer, Tobias; Decomain, Christian; Wrobel, Stefan

doi:10.1007/3-540-44816-0_31

Tobias Scheffer^5,6,
Christian Decomain &
Stefan Wrobel⁵

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2189))

Included in the following conference series:

International Symposium on Intelligent Data Analysis

1741 Accesses
177 Citations

Abstract

Information extraction from HTML documents requires a classifier capable of assigning semantic labels to the words or word sequences to be extracted. If completely labeled documents are available for training, well-known Markov model techniques can be used to learn such classifiers. In this paper, we consider the more challenging task of learning hidden Markov models (HMMs) when only partially (sparsely) labeled documents are available for training. We first give detailed account of the task and its appropriate loss function, and show how it can be minimized given an HMM. We describe an EM style algorithm for learning HMMs from partially labeled data. We then present an active learning algorithm that selects “difficult” unlabeled tokens and asks the user to label them. We study empirically by how much active learning reduces the required data labeling effort, or increases the quality of the learned model achievable with a given amount of user effort.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Improving Supervised Classification Using Information Extraction

Active Learning and Crowdsourcing: A Survey of Optimization Methods for Data Labeling

Article 01 November 2018

Topics and Label Propagation: Best of Both Worlds for Weakly Supervised Text Classification

References

T. Berners-Lee. Semantic web road map. Internal note, World Wide Web Consortium, 1998.
Google Scholar
T. Brants. Cascaded markov models. In Proceedings of the Ninth Conference of the European Chapter of the Association for Computational Linguistics, 1999.
Google Scholar
D. Cohn, Z. Ghahramani, and M. Jordan. Active learning with statistical models. Journal of Artificial Intelligence Research, 4:129–145, 1996.
MATH Google Scholar
Mark Craven, Dan DiPasquo, Dayne Freitag, Andrew K. McCallum, Tom M. Mitchell, Kamal Nigam, and Seán Slattery. Learning to construct knowledge bases from the World Wide Web. Artificial Intelligence, 118(1-2):69–113, 2000.
Article MATH Google Scholar
L. Eikvil. Information extraction from the world wide web: a survey. Technical Report 945, Norwegian Computing Center, 1999.
Google Scholar
S. Fine, Y. Singer, and N. Tishby. The hierarchical hidden markov model: Analysis and applications. Machine Learning, 32:41–64, 1998.
Article Google Scholar
Ralph Grishman and Beth Sundheim. Message understanding conference-6: A brief history. In Proceedings of the International Conference on Computational Linguistics, 1996.
Google Scholar
Thomas Hofmann and Joachim M. Buhmann. Active data clustering. In Advances in Neural Information Processing Systems, volume 10, 1998.
Google Scholar
N. Hsu and M. Dung. Generating finite-state transducers for semistructured data extraction from the web. Journal of Information Systems, Special Issue on Semistructured Data, 23(8), 1998.
Google Scholar
Anders Krogh and Jesper Vedelsby. Neural network ensembles, cross validation, and active learning. In Advances in Neural Information Processing Systems, volume 7,pages 231–238, 1995.
Google Scholar
N. Kushmerick. Wrapper induction: efficiency and expressiveness. Artificial Intelligence, 118:15–68, 2000.
Article MATH MathSciNet Google Scholar
Andrew McCallum, Dayne Freitag, and Fernando Pereira. Maximum entropy Markov models for information extraction and segmentation. In Proceedings of the Seventeenth International Conference on Machine Learning, 2000.
Google Scholar
L. Rabiner. A tutorial on hidden markov models and selected applications in speech recognition. Proceedings of the IEEE, 77(2):257–285, 1989.
Article Google Scholar
T. Scheffer, S. Hoche, and S. Wrobel. Learning hidden markov models for information extraction actively from partially labeled text. Technical report, University of Magdeburg, 2001.
Google Scholar
Kristie Seymore, Andrew McCallum, and Roni Rosenfeld. Learning hidden markov model structure for information extraction. In AAAI’99 Workshop on Machine Learning for Information Extraction, 1999.
Google Scholar
V. Vapnik. Statistical Learning Theory. Wiley, 1998.
Google Scholar

Download references

Author information

Authors and Affiliations

University of Magdeburg, FIN/IWS, PO Box 4120, 39016, Magdeburg, Germany
Tobias Scheffer & Stefan Wrobel
SemanticEdge, Kaiserin-Augusta-Allee 10-11, 10553, Berlin, Germany
Tobias Scheffer

Authors

Tobias Scheffer
View author publications
You can also search for this author in PubMed Google Scholar
Christian Decomain
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Wrobel
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Royal Institute of Technology, Centre for Autonomous Systems, 10044, Stockholm, Sweden
Frank Hoffmann
Imperial College, Huxley Building 180 Queen’s Gate, London, SW7 2BZ, UK
David J. Hand & Niall Adams &
Department of Computer Science, Vanderbilt University, Box 1679, Station B, Nashville, TN, 37235, USA
Douglas Fisher
Department of Computer Science, New University of Lisbon, 2825-114, Caparica, Portugal
Gabriela Guimaraes

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Scheffer, T., Decomain, C., Wrobel, S. (2001). Active Hidden Markov Models for Information Extraction. In: Hoffmann, F., Hand, D.J., Adams, N., Fisher, D., Guimaraes, G. (eds) Advances in Intelligent Data Analysis. IDA 2001. Lecture Notes in Computer Science, vol 2189. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44816-0_31

Download citation

DOI: https://doi.org/10.1007/3-540-44816-0_31
Published: 03 September 2001
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42581-6
Online ISBN: 978-3-540-44816-7
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Active Hidden Markov Models for Information Extraction

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Improving Supervised Classification Using Information Extraction

Active Learning and Crowdsourcing: A Survey of Optimization Methods for Data Labeling

Topics and Label Propagation: Best of Both Worlds for Weakly Supervised Text Classification

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Active Hidden Markov Models for Information Extraction

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Improving Supervised Classification Using Information Extraction

Active Learning and Crowdsourcing: A Survey of Optimization Methods for Data Labeling

Topics and Label Propagation: Best of Both Worlds for Weakly Supervised Text Classification

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation