Abstract
In this paper, we propose a sentence segmentation model for a semi-automatic tree annotation tool that uses a parsing model. To improve parsing performance and reduce parsing complexity without modifying the parsing model, the tool first segments a sentence and then performs two-phase parsing: one phase for the intra-structure of each segment and one for the inter-structure of the segments. Experimental results show that the proposed sentence segmentation model reduces manual effort by about 28.3%, because the annotator's interventions related to cancellation and reconstruction decrease remarkably.
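The segment-then-parse flow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the names `split_on_boundaries`, `right_branching`, and `two_phase_parse` are hypothetical, the boundary test is a placeholder for the proposed segmentation model, and the right-branching parser is a toy stand-in for the actual parsing model. The sketch only shows how one unmodified parser can be applied twice, first inside each segment and then across the segments.

```python
from typing import Callable, List, Sequence, Tuple, Union

# A tree is either a token (leaf) or a pair of subtrees.
Tree = Union[str, Tuple["Tree", "Tree"]]


def split_on_boundaries(tokens: Sequence[str],
                        is_boundary: Callable[[str], bool]) -> List[List[str]]:
    """Cut a token sequence into segments; each boundary token closes a segment.

    Assumption: `is_boundary` stands in for the segmentation model.
    """
    segments: List[List[str]] = []
    current: List[str] = []
    for tok in tokens:
        current.append(tok)
        if is_boundary(tok):
            segments.append(current)
            current = []
    if current:  # trailing tokens after the last boundary
        segments.append(current)
    return segments


def right_branching(items: Sequence[Tree]) -> Tree:
    """Toy stand-in parser: fold the items into a right-branching binary tree."""
    tree = items[-1]
    for item in reversed(items[:-1]):
        tree = (item, tree)
    return tree


def two_phase_parse(tokens: Sequence[str],
                    is_boundary: Callable[[str], bool],
                    parse: Callable[[Sequence[Tree]], Tree] = right_branching) -> Tree:
    """Segment the sentence, then run the same parser in two phases."""
    if not tokens:
        raise ValueError("cannot parse an empty sentence")
    segments = split_on_boundaries(tokens, is_boundary)
    # Phase 1: intra-structure, one subtree per segment.
    subtrees = [parse(seg) for seg in segments]
    # Phase 2: inter-structure, the unchanged parser over the segment subtrees.
    return parse(subtrees)


if __name__ == "__main__":
    sentence = ["a", "b", ",", "c", "d"]
    print(two_phase_parse(sentence, lambda t: t == ","))
```

Because each parser call sees only one segment or the short list of segment subtrees, its input is never longer than the whole sentence, which is the intuition behind the complexity reduction the abstract mentions.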
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Park, SY., Shin, D., Song, US. (2006). Sentence Segmentation Model to Improve Tree Annotation Tool. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2006. Lecture Notes in Computer Science, vol 3878. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11671299_5
DOI: https://doi.org/10.1007/11671299_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32205-4
Online ISBN: 978-3-540-32206-1
eBook Packages: Computer Science, Computer Science (R0)