Abstract
Selectional Con straints are usually checked for detecting semantic relations. Previous work usually defined the constraints manually based on hand crafted concept taxonomy, which is time-consuming and impractical for large scale relation extraction. Further, the determination of entity type (e.g. NER) based on the taxonomy cannot achieve sufficiently high accuracy. In this paper, we propose a novel approach to extracting relation instances using the features elicited from Wikipedia, a free online encyclopedia. The features are represented as selectional constraints and further employed to enhance the extrac tion of relations. We conduct case stud ies on the validation of the ex tracted instances for two common relations hasAr tist(album, artist) andhasDirector(film, director). Substantially high extraction precision (around 0.95) and validation accuracy (near 0.90) are obtained.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Girju, R., Badulescu, A., Moldovan, D.: Learning semantic constraints for the automatic discovery of part-whole relations. In: Proceedings of HLT-NAACL (2003)
Sekine, S., Sudo, K., Nobata, C.: Extended Named Entity Hierarchy. In: Proceedings of the LREC-2002 Conference (2002)
Fellbaum, C.: WordNet: An Electronic Lexical Database. MIT Press, Cambridge (1998)
Stevenson, M., Greenwood, M.A.: A Semantic Approach to IE Pattern Induction. In: Proceedings of the 43rd Annual Meeting of the ACL, pp. 379–386 (2005)
Roth, D., Yih, W.: Probabilistic Reasoning for Entity & Relation Recognition. In: Proceedings of 19th International Conference on Computational Linguistics (2002)
Resnik, P.: Selectional constraints: an information-theoretic model and its computational realization. Cognition (1996)
Karambelkar, S.: Acquisition of selectional constraints in natural language processing. Master thesis. University of Sheffield (2001)
Schutz, A., Buitelaar, P.: RelExt: A Tool for Relation Extraction from Text in Ontology Extension. In: Proceedings of the 4th International Semantic Web Conference (2005)
Sekine, S.: On-Demand Information Extraction. In: Proceedings of COLING (2006)
Boer, V., Someren, M., Wielinga, B.J.: Extracting Instances of Relations from Web Documents using Redundancy. In: Sure, Y., Domingue, J. (eds.) ESWC 2006. LNCS, vol. 4011, Springer, Heidelberg (2006)
Stevenson, M.: An Unsupervised WordNet-based Algorithm for Relation Extraction. In: 4th LREC Workshop Beyond Named Entity: Semantic Labeling for NLP Tasks (2004)
Choi, Y., Cardie, C., Riloff, E., Patwardhan, S.: Identifying sources of opinions with CRFs and extraction patterns. In: Proceedings of HLT/EMNLP, pp. 355–362 (2005)
Agichtein, E., Gravano, L.: Snowball: Extracting Relations from Large Plain-text Collections. In: Proceedings of Digital Libraries (2000)
Culotta, A., Sorensen, J.: Dependency tree kernels for relation extraction. In: Proceedings of the 42nd Annual Meeting of the ACL (2004)
Geleijnse, G., Korst, J.: Automatic Ontology Population by Googling. In: Proceedings of the 17th BNAIC, pp. 120–126 (2005)
Giles, J.: Internet Encyclopaedias Go Head to Head. Nature 438, 900–901 (2005)
Ruiz-Casado, M., Alfonseca, E., Castells, P.: Automatic extraction of semantic relationships for WordNet by means of pattern learning from Wikipedia. In: Montoyo, A., Muńoz, R., Métais, E. (eds.) NLDB 2005. LNCS, vol. 3513, Springer, Heidelberg (2005)
Voss, J.: Collaborative Thesaurus Tagging the Wikipedia Way, available at http://arxiv.org/abs/cs/0604036
Evgeniy, G., Shaul, M.: Computing Semantic Relatedness using Wikipedia-Based Explicit Semantic Analysis. In: Proceedings of IJCAI 2007 (2007)
Evgeniy, G., Shaul M.: Overcoming the Brittleness Bottleneck using Wikipedia: Enhancing Text Categorization with Encyclopedic Knowledge. In: Proceedings of AAAI 2006, pp. 1301–1306 (2006)
Strube, M., Ponzetto, S.: WikiRelate! Computing Semantic Relatedness Using Wikipedia. In: Proceedings of AAAI 2006 (2006)
Bunescu, R., Pasca, M.: Using Encyclopedic Knowledge for Named Entity Disambiguation. In: Proceedings of EACL 2006 (2006)
Basu, S., Banerjee, A., Mooney, R.: Semi-Supervised Clustering by Seeding. In: Proceedings of ICML 2002 (2002)
MUC: Voorhees, E.: Introduction to Information Extraction and Message Understanding Conferences, http://www.itl.nist.gov/iaui/894.02/related_projects/muc/
Denoyer, L.: The Wikipedia XML Corpus. SIGIR Forum (2006)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, G., Zhang, H., Wang, H., Yu, Y. (2007). Enhancing Relation Extraction by Eliciting Selectional Constraint Features from Wikipedia. In: Kedad, Z., Lammari, N., Métais, E., Meziane, F., Rezgui, Y. (eds) Natural Language Processing and Information Systems. NLDB 2007. Lecture Notes in Computer Science, vol 4592. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73351-5_29
Download citation
DOI: https://doi.org/10.1007/978-3-540-73351-5_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73350-8
Online ISBN: 978-3-540-73351-5
eBook Packages: Computer ScienceComputer Science (R0)