research-article

Labeling by landscaping: classifying tokens in context by pruning and decorating trees

Authors:

Siddharth Patwardhan,

Branimir Boguraev,

Apoorv Agarwal,

Alessandro Moschitti,

Jennifer Chu-CarrollAuthors Info & Claims

CIKM '12: Proceedings of the 21st ACM international conference on Information and knowledge management

Pages 1133 - 1142

https://doi.org/10.1145/2396761.2398412

Published: 29 October 2012 Publication History

Abstract

State-of-the-art approaches to token labeling within text documents typically cast the problem either as a classification task, without using complex structural characteristics of the input, or as a sequential labeling task, carried out by a Conditional Random Field (CRF) classifier. Here we explore principled ways for structure to be brought to bear on the task. In line with recent trends in statistical learning of structured natural language input, we use a Support Vector Machine (SVM) classification framework deploying tree kernels. We then propose tree transformations and decorations, as a methodology for modeling complex linguistic phenomena in highly multi-dimensional feature spaces. We develop a general purpose tree engineering framework, which enables us to transcend the typically complex and laborious process of feature engineering. We build kernel based classifiers for two token labeling tasks: fine-grained event recognition, and lexical answer type detection in questions. For both, we show that in comparison with a corresponding linear kernel SVM, our method of using tree kernels improves recognition, thanks to appropriately engineering tree structures for use by the tree kernel. We also observe significant improvements when comparing with a CRF-based realization of structured prediction, itself performing at levels comparable to state-of-the-art.

References

[1]

R. K. Ando. Exploiting Unannotated Corpora for Tagging and Chunking. In The Companion Volume to the Proceedings of 42st Annual Meeting of the Association for Computational Linguistics, pages 142--145, Barcelona, Spain, July 2004.

Digital Library

[2]

S. Bethard and J. H. Martin. Identification of Event Mentions and their Semantic Class. In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, pages 146--154, Sydney, Australia, July 2006.

Digital Library

[3]

B. Boguraev and R. K. Ando. TimeML-compliant Text Analysis for Temporal Reasoning. In Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, pages 997--1003, Edinburgh, Scotland, August 2005.

Digital Library

[4]

B. Boguraev and R. K. Ando. Analysis of TimeBank as a Resource for TimeML Parsing. In Proceedings of the Fifth International Conference on Language Resources and Evaluation, pages 71--76, Genoa, Italy, May 2006.

[5]

W. Cohen. MinorThird: Methods for Identifying Names and Ontological Relations in Text using Heuristics for Inducing Regularities from Data. http://minorthird.sourceforge.net, 2004.

[6]

M. Collins and N. Duffy. Convolution kernels for Natural Language. In Advances in Neural Information Processing Systems, Vancouver, Canada, December 2001.

[7]

D. Croce, A. Moschitti, and R. Basili. Structured Lexical Similarity via Convolution Kernels on Dependency Trees. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, pages 1034--1046, Edinburgh, Scotland, July 2011.

Digital Library

[8]

A. Culotta and J. Sorensen. Dependency Tree Kernels for Relation Extraction. In Proceedings of the 42nd Meeting of the Association for Computational Linguistics (ACL'04), pages 423--429, Barcelona, Spain, July 2004.

Digital Library

[9]

D. Haussler. Convolution Kernels on Discrete Structures. Technical Report UCSC-CRL-99--10, University of California at Santa Cruz, July 1999.

[10]

V. Kecman. Learning and Soft Computing. The MIT Press, Cambridge, MA, 2001.

[11]

T. Kudo and Y. Matsumoto. Chunking with Support Vector Machines. In Proceedings of the Second Meeting of the North American Chapter of the Association for Computational Linguistics}, pages 192--199, Pittsburgh, PA, June 2001.

Digital Library

[12]

J. Lafferty, A. McCallum, and F. Pereira. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data. Proceedings of the 18th International Conference on Machine Learning, pages 282--289, June 2001.

Digital Library

[13]

A. Lally, J. Prager, M. McCord, B. Boguraev, S. Patwardhan, J. Fan, P. Fodor, and J. Chu-Carroll. Question Analysis: How Watson Reads a Clue. IBM Journal of Research and Development, 56(3/4):2:1--2:14, May/July 2012.

Digital Library

[14]

X. Li and D. Roth. Learning Question Classifiers: The Role of Semantic Information. In Proceedings of the 19th International Conference on Computational Linguistics, pages 556--562, Taipei, Taiwan, August 2004.

Digital Library

[15]

H. Llorens, E. Saquete, and B. Navarro-Colorado. TimeML Events Recognition and Classification: Learning CRF Models with Semantic Roles. In Proceedings of the 23rd International Conference on Computational Linguistics (COLING 2010), pages 725--733, Beijing, China, August 2010.

Digital Library

[16]

L. Marquez and A. Moschitti. Special Issue on Statistical Learning of Natural Language Structured Input and Output. Natural Language Engineering, 18(2):147--153, April 2012.

Digital Library

[17]

M. McCord. Slot Grammar: A System for Simpler Construction of Practical Natural Language Grammars. In Proceedings of the International Symposium on Natural Language and Logic, pages 118--145, Hamburg, Germany, May 1989.

Digital Library

[18]

Q. McNemar. Note on the Sampling Error of the Difference Between Correlated Proportions or Percentages. Psychometrika, 12(2):153--157, 1947.

[19]

A. Moschitti. A Study on Convolution Kernels for Shallow Statistic Parsing. In Proceedings of the 42nd Meeting of the Association for Computational Linguistics (ACL'04), pages 335--342, Barcelona, Spain, July 2004.

Digital Library

[20]

A. Moschitti. Efficient Convolution Kernels for Dependency and Constituent Syntactic Trees. In Proceedings of the 17th European Conference on Machine Learning, pages 318--329, Berlin, Germany, September 2006.

Digital Library

[21]

A. Moschitti, D. Pighin, and R. Basili. Tree Kernels for Semantic Role Labeling. Computational Linguistics, 34(2):193--224, June 2008. Special Issue on Semantic Role Labeling.

Digital Library

[22]

A. Moschitti, S. Quarteroni, R. Basili, and S. Manandhar. Exploiting Syntactic and Shallow Semantic Kernels for Question Answer Classification. In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pages 776--783, Prague, Czech Republic, June 2007.

[23]

T.-V. T. Nguyen, A. Moschitti, and G. Riccardi. Convolution Kernels on Constituent, Dependency and Sequential Structures for Relation Extraction. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, pages 1378--1387, Singapore, August 2009.

Digital Library

[24]

L. Ramshaw and M. Marcus. Text Chunking using Transformation-Based Learning. In Proceedings of Third Annual Workshop on Very Large Corpora, pages 82--94, Cambridge, MA, June 1995.

[25]

R. Sauri, R. Knippen, M. Verhagen, and J. Pustejovsky. Evita: A Robust Event Recognizer For QA Systems. In Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, pages 700--707, Vancouver, Canada, October 2005.

Digital Library

[26]

V. Vapnik. The Nature of Statistical Learning Theory. Springer, New York, NY, 1995.

Digital Library

[27]

L. Wang, editor. Support Vector Machines: Theory and Applications. Springer, Berlin, Germany, 2005.

Digital Library

[28]

D. Zelenko, C. Aone, and A. Richardella. Kernel Methods for Relation Extraction. In Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing, pages 71--78, Philadelphia, PA, July 2002.

Digital Library

[29]

M. Zhang, J. Zhang, and J. Su. Exploring Syntactic Features for Relation Extraction using a Convolution Tree Kernel. In Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, pages 288--295, New York City, USA, June 2006.

Digital Library

Cited By

Wu JLiu XZhang XHe ZLv P(2018)Master clinical medical knowledge at certificated-doctor-level with deep learning modelNature Communications10.1038/s41467-018-06799-69:1Online publication date: 19-Oct-2018
https://doi.org/10.1038/s41467-018-06799-6

Index Terms

Labeling by landscaping: classifying tokens in context by pruning and decorating trees
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Language resources

Recommendations

Extending twin support vector machine classifier for multi-category classification problems

Twin support vector machine classifier TWSVM was proposed by Jayadeva et al., which was used for binary classification problems. TWSVM not only overcomes the difficulties in handling the problem of exemplar unbalance in binary classification problems, ...
A subpath kernel for rooted unordered trees
PAKDD'11: Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part I

Kernel method is one of the promising approaches to learning with tree-structured data, and various efficient tree kernels have been proposed to capture informative structures in trees. In this paper, we propose a new tree kernel function based on "...
All-in-one multicategory least squares nonparallel hyperplanes support vector machine

A least square version of nonparallel hyperplane support vector machine (LSNHSVM) is proposed.The proposed LSNHSVM is obtained by solving a system of linear equations.LSNHSVM is further generalized to solve multiclass classification problem ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

CIKM '12: Proceedings of the 21st ACM international conference on Information and knowledge management

October 2012

2840 pages

ISBN:9781450311564

DOI:10.1145/2396761

General Chair:
Xuewen Chen
Wayne State University, USA
,
Program Chairs:
Guy Lebanon
Georgia Institute of Technology
,
Haixun Wang
Microsoft Research Asia
,
Mohammed J. Zaki
Rensselaer Polytechnic Institute

Copyright © 2012 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 29 October 2012

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

CIKM'12

Sponsor:

CIKM'12: 21st ACM International Conference on Information and Knowledge Management

October 29 - November 2, 2012

Hawaii, Maui, USA

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Sponsor:
sigir
sigir

The 34th ACM International Conference on Information and Knowledge Management

November 10 - 14, 2025

Seoul , Republic of Korea

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

1
Total Citations
View Citations
178
Total Downloads

Downloads (Last 12 months)2
Downloads (Last 6 weeks)0

Reflects downloads up to 17 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Wu JLiu XZhang XHe ZLv P(2018)Master clinical medical knowledge at certificated-doctor-level with deep learning modelNature Communications10.1038/s41467-018-06799-69:1Online publication date: 19-Oct-2018
https://doi.org/10.1038/s41467-018-06799-6

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten