Abstract
In this paper, we describe systems for automatic labeling of time expressions occurring in English and Chinese text as specified in the ACE Temporal Expression Recognition and Normalization (TERN) task. We cast the chunking of text into time expressions as a tagging problem using a bracketed representation at token level, which takes into account embedded constructs. We adopted a left-to-right, token-by-token, discriminative, deterministic classification scheme to determine the tags for each token. A number of features are created from a predefined context centered at each token and augmented with decisions from a rule-based time expression tagger and/or a statistical time expression tagger trained on different type of text data, assuming they provide complementary information. We trained one-versus-all multi-class classifiers using support vector machines. We participated in the TERN 2004 recognition task and achieved competitive results.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Allwein, E.L., Schapire, R.E., Singer, Y.: Reducing multiclass to binary: A unifying approach for margin classifiers. Journal of Machine Learning Research 1, 113–141 (2000)
Bikel, D.M., Schwartz, R.L., Weischedel, R.M.: An algorithm that learns what’s in a name. Machine Learning 34(1-3), 211–231 (1999)
Burges, C.J.C.: Tutorial on Support Vector Machines for Pattern Recognition. Data Mining and Knowledge Discovery 2(2), 1–47 (1997)
Ferro, L., Gerber, L., Mani, I., Sundheim, B., Wilson, G.: 2003 standard for the annotation of temporal expressions. Technical Report, MITRE (2004)
Kudo, T., Matsumato, Y.: Use of support vector learning for chunk identification. In: Proc. of the 4th Conference on Very Large corpora., pp. 142–144 (2000)
Lin, D.: Dependency-based Evaluation of MINIPAR. In: Workshop on the Evaluation of Parsing Systems, Granada, Spain (May 1998)
Mani, I.: Recent Developments in Temporal Information Extraction. To appear in Proceedings of RANLP 2003 (2004)
Ramhsaw, L.E., Marcus, M.P.: Text Chunking Using Transformation Based Learning. In: Proceedings of the 3rd ACL Workshop on Very Large Corpora., pp. 82–94 (1995)
Reynar, J.C., Ratnaparkhi, A.: A Maximum Entropy Approach to Identifying Sentence Boundaries. In: Proceedings of the Fifth Conference on Applied Natural Language Processing, March 31-April 3 (1997)
Sang, E.F.T.J., Veenstra, J.: Representing text chunks. In: Proceedings of EACL 1999, pp. 173–179 (1999)
Saquete, E., Martinez-Barco, P., Munoz, R., Vicedo, J.L.: Splitting Complex Temporal Questions for Question Answering systems. In: Association for Computational Linguistics (ACL), Barcelona (2004)
Schilder, F., Habel, C.: Temporal information extraction for temporal question answering. In: Proceedings of the 2003 AAAI Spring Symposium in New Directions in Question Answering, Stanford University, Palo Alto, USA (2003)
Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hacioglu, K., Chen, Y., Douglas, B. (2005). Automatic Time Expression Labeling for English and Chinese Text. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2005. Lecture Notes in Computer Science, vol 3406. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30586-6_59
Download citation
DOI: https://doi.org/10.1007/978-3-540-30586-6_59
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24523-0
Online ISBN: 978-3-540-30586-6
eBook Packages: Computer ScienceComputer Science (R0)