Column Segmentation

Sarawagi, Sunita

doi:10.1007/978-1-4614-8265-9_597

Recommended Reading

Agichtein E, Ganti V. Mining reference tables for automatic text segmentation. In: Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; 2004. p. 20–9.
Adelberg B. NoDoSE: a tool for semi-automatically extracting structured and semi-structured data from text documents. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 1998. p. 283–94.
Borkar VR, Deshmukh K, Sarawagi S. Automatic text segmentation for extracting structured records. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2001. p. 175–86.
Chandel A, Nagesh PC, Sarawagi S. Efficient batch top-k search for dictionary-based entity recognition. In: Proceedings of the 22nd International Conference on Data Engineering; 2006.
Cunningham H. Information extraction, automatic. In: Encyclopedia of Language and Linguistics. 2nd ed. 2005.
Kushmerick N, Weld DS, Doorenbos R. Wrapper induction for information extraction. In: Proceedings of the 15th International Joint Conference on AI; 1997. p. 729–37.
Lafferty J, McCallum A, Pereira F. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: Proceedings of the 18th International Conference on Machine Learning; 2001. p. 282–9.
Peng F, McCallum A. Accurate information extraction from research papers using conditional random fields. In: Proceedings of the Human Language Technology Conference and North American Chapter of the Association for Computational Linguistics; 2004. p. 329–36.
Ratnaparkhi A. Learning to parse natural language with maximum entropy models. Mach Learn. 1999;34:151.
Sarawagi S, Cohen WW. Semi-Markov conditional random fields for information extraction. In: Advances in Neural Information Processing Systems 17; 2004.
Seymore K, McCallum A, Rosenfeld R. Learning Hidden Markov Model structure for information extraction. In: Papers from the AAAI-99 Workshop on Machine Learning for Information Extraction; 1999. p. 37–42.

Author information

Authors and Affiliations

IIT Bombay, Mumbai, India
Sunita Sarawagi

Corresponding author

Correspondence to Sunita Sarawagi.

Editor information

Editors and Affiliations

Georgia Institute of Technology College of Computing, Atlanta, GA, USA
Ling Liu
University of Waterloo School of Computer Science, Waterloo, ON, Canada
M. Tamer Özsu

Section Editor information

Microsoft Research, Microsoft Corporation, Redmond, WA, USA
Venkatesh Ganti

About this entry

Cite this entry

Sarawagi, S. (2018). Column Segmentation. In: Liu, L., Özsu, M.T. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8265-9_597

DOI: https://doi.org/10.1007/978-1-4614-8265-9_597
Published: 07 December 2018
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-8266-6
Online ISBN: 978-1-4614-8265-9
eBook Packages: Computer Science; Reference Module Computer Science and Engineering
