Abstract
Techniques for the automatic acquisition of Information Extraction Pattern are still a crucial issue in knowledge engineering. A semi supervised learning method, based on large scale linguistic resources, such as FrameNet and WordNet, is discussed. In particular, a robust method for assigning conceptual relations (i.e. roles) to relevant grammatical structures is defined according to distributional models of lexical semantics over a large scale corpus. Experimental results show that the use of the resulting knowledge base provide significant results, i.e. correct interpretations for about 90% of the covered sentences. This confirms the impact of the proposed approach on the quality and development time of large scale IE systems.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Fillmore, C.J.: Frames and the semantics of understanding. Quaderni di Semantica 4(2), 222–254 (1985)
Surdeanu, M., Surdeanu, M., Harabagiu, A., Williams, J., Aarseth, P.: Using predicate-argument structures for information extraction. In: Proceedings of ACL 2003 (2003)
Baker, C.F., Fillmore, C.J., Lowe, J.B.: The Berkeley FrameNet project. In: Proc. of COLING-ACL, Montreal, Canada (1998)
Miller, G., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.: Introduction to wordnet: An on-line lexical database. International Journal of Lexicography 13(4), 235–312 (1990)
Basili, R., Giannone, C., De Cao, D.: Learning domain-specific framenets from texts. In: Proc. of 3rd Workshop on Ontology Learning and Population (OLP3), Greece (2008)
Coppola, B., Gangemi, A., Gliozzo, A., Picca, D., Presutti, V.: Frame detection over the semantic web. In: Aroyo, L., Traverso, P., Ciravegna, F., Cimiano, P., Heath, T., Hyvönen, E., Mizoguchi, R., Oren, E., Sabou, M., Simperl, E. (eds.) ESWC 2009. LNCS, vol. 5554, pp. 126–142. Springer, Heidelberg (2009)
Sahlgren, M.: The Word-Space Model. PhD thesis, Stockholm University (2006)
Agirre, E., Rigau, G.: Word sense disambiguation using conceptual density. In: Proc. of COLING 1996, Copenhagen, Denmark (1996)
Lin, D., Pantel, P.: DIRT-discovery of inference rules from text. In: Proceedings of the ACM Conference on Knowledge Discovery and Data Mining (KDD 2001), CA (2001)
Harris, Z.: Distributional structure. In: Katz, J.J., Fodor, J.A. (eds.) The Philosophy of Linguistics. Oxford University Press, New York (1964)
Landauer, T., Dumais, S.: A solution to plato’s problem: The latent semantic analysis theory of acquisition, induction and representation of knowledge. Psychological Review 104 (1997)
Heyer, L., Kruglyak, S., Yooseph, S.: Exploring expression data: Identification and analysis of coexpressed genes. Genome Research (9), 1106–1115 (1999)
Basili, R., De Cao, D., Marocco, P., Pennacchiotti, M.: Learning selectional preferences for entailment or paraphrasing rules. In: Proc. of RANLP 2007 (2007)
De Cao, D., Giannone, C., Basili, R.: Frame-based ontology learning for information extraction (demo paper). In: Proc. of The 16th International Conference on Knowledge Engineering and Knowledge Management Knowledge Patterns (EKAW 2008), Italy (2008)
Johansson, R., Nugues, P.: Semantic structure extraction using nonprojective dependency trees. In: Proceedings of SemEval 2007, Prague, Czech Republic, June 23-24 (2007)
Johansson, R., Nugues, P.: The effect of syntactic representation on semantic role labeling. In: Proceedings of COLING, Manchester, UK, August 18-22 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Basili, R., Croce, D., Giannone, C., De Cao, D. (2010). Acquiring IE Patterns through Distributional Lexical Semantic Models. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2010. Lecture Notes in Computer Science, vol 6008. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12116-6_44
Download citation
DOI: https://doi.org/10.1007/978-3-642-12116-6_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12115-9
Online ISBN: 978-3-642-12116-6
eBook Packages: Computer ScienceComputer Science (R0)