Abstract
Mentions and brief descriptions of events often appear in a variety of document genres such as news articles containing references to related events, historical accounts or biographies. While event categorization has been previously studied, it was usually done on entire news articles or longer event descriptions. In this work we focus on short descriptions of historical events which are typically in the form of one or a few sentences. We categorize them into 9 general event categories using a range of diverse features and report F-measure close to 80%.
Keywords
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
Usually, only very popular or important events have own names.
- 3.
See Table 1 for examples of events in each class.
- 4.
- 5.
This value was empirically chosen based on analyzing the results on the small held-out development dataset.
- 6.
References
Au Yeung, C.M., Jatowt, A.: Studying how the past is remembered: towards computational history through large scale text mining. In: CIKM 2011, pp. 1231–1240 (2011)
Deerwester, S., Dumais, S.T., Furnas, G.W., Thomas, K.L., Harshman, R.: Indexing by latent semantic analysis. J. Am. Soc. Inform. Sci. 41(6), 391–407 (1990)
Gorrell, G., Petrak, J., Bontcheva, K.: Using @Twitter conventions to improve #LOD-based named entity disambiguation. In: Gandon, F., Sabou, M., Sack, H., d’Amato, C., Cudré-Mauroux, P., Zimmermann, A. (eds.) ESWC 2015. LNCS, vol. 9088, pp. 171–186. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-18818-8_11
Košmerlj, A., Belyaeva, E., Leban, G., Grobelnik, M., Fortuna, B.: Towards a complete event type taxonomy. In: WWW 2015 Companion, pp. 899–902. ACM, New York (2015)
Le, Q., Mikolov, T.: Distributed representations of sentences and documents. In: ICML 2014, vol. 32, pp. 1188–1196, Bejing, China, 22–24 June 2014
Lee, U., Liu, Z., Cho, J.: Automatic identification of user goals in web search. In: WWW 2005, pp. 391–400. ACM, New York (2005)
Chang, M.W., Ratinov, L.A., Roth, D., Srikumar, V.: Importance of semantic representation: dataless classification. In: AAAI, p. 7 (2008)
Nie, L., Wang, M., Zha, Z., Li, G., Chua, T.S.: Multimedia answering: enriching text qa with media information. In: SIGIR 2011, pp. 695–704. ACM, New York (2011)
Phan, X.H., Nguyen, L.M., Horiguchi, S.: Learning to classify short and sparse text & web with hidden topics from large-scale data collections. In: WWW 2008, pp. 91–100. ACM, New York (2008)
Sriram, B., Fuhry, D., Demir, E., Ferhatosmanoglu, H., Demirbas, M.: Short text classification in twitter to improve information filtering. In: SIGIR 2010, pp. 841–842. ACM, New York (2010)
Sun, X., Wang, H., Yu, Y.: Towards effective short text deep classification. In: SIGIR 2011, pp. 1143–1144. ACM, New York (2011)
Zelikovitz, S., Marquez, F.: Transductive learning for short-text classification problems using latent semantic indexing. Int. J. Pattern Recognit Artif Intell. 19(2), 146–163 (2005)
Acknowledgments
This work was supported in part by MEXT Grant-in-Aids (#17H 01828 and #17K12792) and MIC SCOPE (#171507010).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Sumikawa, Y., Jatowt, A. (2018). Classifying Short Descriptions of Past Events. In: Pasi, G., Piwowarski, B., Azzopardi, L., Hanbury, A. (eds) Advances in Information Retrieval. ECIR 2018. Lecture Notes in Computer Science(), vol 10772. Springer, Cham. https://doi.org/10.1007/978-3-319-76941-7_69
Download citation
DOI: https://doi.org/10.1007/978-3-319-76941-7_69
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-76940-0
Online ISBN: 978-3-319-76941-7
eBook Packages: Computer ScienceComputer Science (R0)