Abstract
Event extraction is a technique that automatically extracts key event elements from large-scale texts. In classic event extraction process, recognizing event types is earlier than extracting argument roles in the whole event extraction task. But in practice, multiple-label problems are often encountered (that is, one event sentence corresponds to multiple event types). In order to solve this problem, this paper introduces Binary-Relevance, Classifier-Chain, MLkNN and many other multi-label classification strategies from the perspective of problem transformation and algorithm adaptation, trying to find the best classification method to adapt to our Chinese financial corpus. The experimental results show that the Adaboost method based on single-layer decision tree with Classifier-Chain is the best strategy for the task of recognizing event types in this paper. The micro-F1 score and average-precision value of this strategy are 8.91% and 12.46% higher than the baseline strategy (Binary-Relevance + SVM) respectively. At the same time, this method achieves the lowest value on the three indicators of Hamming-Loss, Coverage-Error and Ranking-Loss. In addition, the results also show: (1) Classifier-Chain strategy is better than Binary-Relevance strategy when the classifiers are the same; (2) Under the same problem transformation strategy, the Adaboost method performs best, followed by KNN and the worst case is SVM; (3) If only single classifier is allowed, the MLkNN strategy based on algorithm adaptation is better than other strategies based on problem transformation.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Ding, X.: Research on sentence level Chinese event extraction. Master dissertation, Harbin Institute of Technology (2011). (in Chinese)
Ahn, D.: The stages of event extraction. In: Proceedings of the Workshop on Annotating and Reasoning about Time and Events, Sydney, Australia, 23 July 2006. Association for Computational Linguistics (2006)
Zhao, Y.Y., Qin, B., Che, W.X., Liu, T.: Research on Chinese event extraction. J. Chin. Inf. Process. 22(1), 3–8 (2008). (in Chinese)
Qin, B., Zhao, Y.Y., Ding, X., et al.: Event type recognition based on trigger expansion. J. Tsinghua Univ. (Sci. Technol.) 15(3), 251–258 (2010)
Tan, H.Y.: Research on Chinese event extraction. Doctoral dissertation, Harbin Institute of Technology (2008). (in Chinese)
Xu, H.L., Chen, J.X., Zhou, C.L., et al.: Research on event type identification for Chinese event extraction. Mind Comput. 4(1), 34–44 (2010). (in Chinese)
Tsoumakas, G., Katakis, I., Taniar, D.: Multi-label classification: an overview. Int. J. Data Warehous. Min. 3(3), 1–13 (2007)
Zhang, M.L., Zhou, Z.H.: ML-KNN: a lazy learning approach to multi-label learning. Pattern Recogn. 40(7), 2038–2048 (2007)
Read, J., Pfahringe, B., Holmes, G., et al.: Classifier chains for multi-label classification. Mach. Learn. 85, 333–359 (2011)
He, J.Y., Chen, R., Xu, M., et al.: Algorithm for Chinese text categorization based on class feature vector representation. Appl. Res. Comput. 25(2), 337–341 (2008)
Xu, Z.G.: A comparative study of multi-label classification approaches. http://lamda.nju.edu.cn/huangsj/dm11/files/xuzg.pdf
Acknowledgement
We thank the National Natural Science Foundation of China (No. 61375053) for financial support.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Luo, S., Wang, Y., Feng, X., Hu, Z. (2018). A Study of Multi-label Event Types Recognition on Chinese Financial Texts. In: Wrycza, S., Maślankowski, J. (eds) Information Systems: Research, Development, Applications, Education. SIGSAND/PLAIS 2018. Lecture Notes in Business Information Processing, vol 333. Springer, Cham. https://doi.org/10.1007/978-3-030-00060-8_11
Download citation
DOI: https://doi.org/10.1007/978-3-030-00060-8_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00059-2
Online ISBN: 978-3-030-00060-8
eBook Packages: Computer ScienceComputer Science (R0)