Abstract
Human beings generally analyze information with some kinds of semantic expectations. This not only speeds up the processing time, it also helps to put the analysis in the correct context and perspective. To capitalize on this type of intelligent human behavior, this paper proposes a semantic expectationbased knowledge extraction methodology (SEKE) for extracting causation relations from text. In particular, we study the application of a causation semantic template on the Hong Kong Stock market movement (Hang Seng Index) with English financial news from Reuters, South China Morning Post and Hong Kong Standard. With one-month data input and over a two-month testing period, the system shows that it can correctly analyzes single reason sentences with about 76% precision and 74% recall rates. If partial reason extraction (two out of one reason) is included and weighted by a factor of 0.5, the performance is improved to about 83% and 81% respectively. As the proposed framework is language independent, we expect cross lingual knowledge extraction can work better with this semantic expectation-based framework.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Yorick Wilks, Information Extraction as a Core Language Technology, International Summer School on Information Extraction, SCIE’97, Fracscati.
Rau, L.F., Conceptual information extraction from financial news. System Sciences, 1988. Vol.III. Decision Support and Knowledge Based Systems Track, Proceedings of the Twenty-First Annual Hawaii International Conference, 1988 Pg. 501–509.
Costantino, M.; Morgan, R.G.; Collingham, R.J.; Carigliano, R., Natural language processing and information extraction: qualitative analysis of financial news articles. Computational Intelligence for Financial Engineering (CIFEr), 1997., Proceedings of the IEEE/IAFE 1997, 1997 Pg. 116–122.
Grishman, R.; Sterling, J., New York University: Description of the Proteus System as used for MUC-5, Proceedings of the Fifth Message Understanding Conference, 1993 Pgs. 181–194.
Appelt, D.; Hobbs, J.; Bear, J.; Israel, D.; Kameyama, M.; Tyson, M., SRI: Description of the JV-Fastus System Used for MUC-5, Proceedings of the Fifth Message Understanding Conference, 1993, Pg. 221–236.
WORDNET is an on-line lexical reference system whose design is inspired by current psycholinguistic theories of human lexical memory. English nouns, verbs, adjectives and adverbs are organized into synonym sets, each representing one underlying lexical concept. Different relations link the synonym sets. WordNet was developed by the Cognitive Science Laboratory at Princeton University under the direction of Professor George A. Miller (Principal Investigator). http://www.cogsci.princeton.edu/~wn/
Roget’s Thesaurus, a Project for American and French Research on the Treasury of the French Language (ARTFL), University of Chicago http://humanities.uchicago.edu/ARTFL/forms_unrest/ROGET.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Low, BT., Chan, K., Choi, LL., Chin, MY., Lay, SL. (2001). Semantic Expectation-Based Causation Knowledge Extraction: A Study on Hong Kong Stock Movement Analysis. In: Cheung, D., Williams, G.J., Li, Q. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2001. Lecture Notes in Computer Science(), vol 2035. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45357-1_15
Download citation
DOI: https://doi.org/10.1007/3-540-45357-1_15
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41910-5
Online ISBN: 978-3-540-45357-4
eBook Packages: Springer Book Archive