Article

Efficient inference on sequence segmentation models

Author: Sunita Sarawagi

ICML '06: Proceedings of the 23rd international conference on Machine learning

Pages 793 - 800

https://doi.org/10.1145/1143844.1143944

Published: 25 June 2006


Abstract

Sequence segmentation is a flexible and highly accurate mechanism for modeling several applications. Inference on segmentation models involves dynamic programming computations that in the worst case can be cubic in the length of a sequence. In contrast, typical sequence labeling models require linear time. We remove this limitation of segmentation models vis-a-vis sequential models by designing a succinct representation of potentials common across overlapping segments. We exploit such potentials to design efficient inference algorithms that are both analytically shown to have a lower complexity and empirically found to be comparable to sequential models for typical extraction tasks.
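As a rough illustration of the dynamic program the abstract refers to, the sketch below implements a generic baseline segment-level (semi-Markov style) Viterbi search. It is not the optimized algorithm of the paper, whose succinct representation of shared potentials is not described in enough detail here to reproduce. The names `segment_viterbi`, `toy_score`, the label set, and the toy scoring rule are all assumptions made for illustration. With n positions, up to n candidate segment lengths per end position, and a segment score that itself takes time proportional to the segment length, the total work can grow cubically in n, which is the limitation the abstract mentions.

```python
from typing import Callable, List, Optional, Sequence, Tuple

Segment = Tuple[int, int, str]  # (start, end_exclusive, label)
ScoreFn = Callable[[int, int, str, Optional[str]], float]


def segment_viterbi(n: int, labels: Sequence[str], score: ScoreFn,
                    max_len: Optional[int] = None) -> Tuple[float, List[Segment]]:
    """Return the best total score and labeled segmentation of positions 0..n-1.

    score(i, j, y, prev_y) is a hypothetical segment potential for labeling
    x[i:j] with y when the previous segment had label prev_y (None at the start).
    """
    if n == 0:
        return 0.0, []
    max_len = n if max_len is None else max_len
    neg_inf = float("-inf")
    # best[j][y]: best score of any segmentation of x[0:j] ending in a y-segment.
    best = [{y: neg_inf for y in labels} for _ in range(n + 1)]
    back = [{y: None for y in labels} for _ in range(n + 1)]

    for j in range(1, n + 1):                      # segment end (exclusive)
        for d in range(1, min(max_len, j) + 1):    # segment length
            i = j - d                              # segment start
            for y in labels:
                if i == 0:
                    candidates = [(score(0, j, y, None), None)]
                else:
                    candidates = [(best[i][yp] + score(i, j, y, yp), yp)
                                  for yp in labels if best[i][yp] > neg_inf]
                for value, yp in candidates:
                    if value > best[j][y]:
                        best[j][y] = value
                        back[j][y] = (i, yp)

    y = max(labels, key=lambda lab: best[n][lab])
    total = best[n][y]
    if total == neg_inf:
        raise ValueError("no finite-score segmentation found")

    # Follow back-pointers from the end of the sequence to recover segments.
    segments: List[Segment] = []
    j = n
    while j > 0:
        i, yp = back[j][y]
        segments.append((i, j, y))
        j, y = i, yp
    segments.reverse()
    return total, segments


# Toy usage (assumed data): reward long capitalized NAME segments,
# lowercase O segments, and charge a small cost per segment.
tokens = ["John", "Smith", "lives", "here"]


def toy_score(i: int, j: int, y: str, prev_y: Optional[str]) -> float:
    span = tokens[i:j]  # scanning the span costs O(j - i) per call
    if y == "NAME":
        s = float(len(span) ** 2) if all(t[0].isupper() for t in span) else -10.0
    else:
        s = float(sum(-1 if t[0].isupper() else 1 for t in span))
    return s - 0.1


total, segments = segment_viterbi(len(tokens), ["NAME", "O"], toy_score)
print(total, segments)
```

Passing a small `max_len` bounds the segment-length loop and is one common way to trade accuracy for speed in this kind of baseline. The abstract describes a different route: sharing potentials across overlapping segments.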



Index Terms

Efficient inference on sequence segmentation models
1. Computing methodologies
  1. Machine learning
2. Theory of computation
  1. Design and analysis of algorithms
    1. Algorithm design techniques
      1. Dynamic programming

Published In

ICML '06: Proceedings of the 23rd international conference on Machine learning

June 2006

1154 pages

ISBN:1595933832

DOI:10.1145/1143844

Program Chairs:
William Cohen,
Andrew Moore

Publisher

Association for Computing Machinery

New York, NY, United States

Acceptance Rates

ICML '06 Paper Acceptance Rate 140 of 548 submissions, 26%;

Overall Acceptance Rate 140 of 548 submissions, 26%
