Abstract
It is well known that lexical knowledge sources such as WordNet, HowNet are very important to natural language processing applications. In those lexical resources, attributes play very important roles for defining and distinguishing different concepts. In this paper, we propose a novel method to automatically discover the attribute hosts of HowNet’s attribute set. Given an attribute, we model the solving of its host as a problem of selectional constraint resolution. The World Wide Web is exploited as a large corpus to acquire the training data for such a model. From the training data, the attribute hosts are discovered by using a statistical measure and a semantic hierarchy. We evaluate our algorithm by comparing the result with the original hand-coded attribute specification in HowNet. Some experimental results about the performance of the method are provided.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Fellbaum, C.: Wordnet: an electronic lexical database. MIT Press, Cambridge (1998)
Dong, Z., Dong, Q.: HowNet and the Computation of Meaning. World Scientific, Singapore (2006)
Amsler, R.A.: The Structure of the Merriam-Webster Pocket Dictionary (1980)
Chodorow, M.S., Byrd, R.J., Heidorn, G.E.: Extracting semantic hierarchies from a large on-line dictionary. In: Proceedings of the 23rd conference on Association for Computational Linguistics, pp. 299–304 (1985)
Wilks, Y., Fass, D., Guo, C., McDonald, J., Plate, T., Slator, B.: A tractable machine dictionary as a resource for computational semantics. Longman Publishing Group White Plains, NY, USA (1989)
Alshawi, H.: Analysing the dictionary definitions. Computational lexicography for natural language processing table of contents, 153–169 (1989)
Richardson, S.D., Dolan, W.B., Vanderwende, L.: MindNet: acquiring and structuring semantic information from text. In: Proceedings of the 17th international conference on Computational linguistics, pp. 1098–1102 (1998)
Ide, N., Veronis, J.: Extracting knowledge bases from machine-readable dictionaries: Have we wasted our time. Proceedings of KB&KS 93, 257–266 (1993)
Hearst, M.A.: Automatic acquisition of hyponyms from large text corpora. In: Proceedings of the 14th conference on Computational linguistics, vol. 2, pp. 539–545 (1992)
Berland, M., Charniak, E.: Finding parts in very large corpora. In: Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, pp. 57–64 (1999)
Poesio, M., Ishikawa, T., im Walde, S.S., Viera, R.: Acquiring lexical knowledge for anaphora resolution. In: Proceedings of the 3rd Conference on Language Resources and Evaluation (LREC) (2002)
Grefenstette, G., Nioche, J.: Estimation of English and non-English Language Use on the WWW. Arxiv preprint cs.CL/0006032 (2000)
Jones, R., Ghani, R.: Automatically building a corpus for a minority language from the web. In: Proceedings of the Student Research Workshop at the 38th Annual Meeting of the Association for Computational Linguistics, pp. 29–36 (2000)
Zhu, X., Rosenfeld, R.: Improving trigram language modeling with the World Wide Web. In: Acoustics, Speech, and Signal Processing, 2001. Proceedings (ICASSP 2001). 2001 IEEE International Conference on, vol. 1 (2001)
Keller, F., Lapata, M.: Using the web to obtain frequencies for unseen bigrams. Computational Linguistics 29(3), 459–484 (2003)
Brin, S.: Extracting patterns and relations from the world wide web. In: WebDB Workshop at 6th International Conference on Extending Database Technology, EDBT 1998, pp. 172–183 (1998)
Pennacchiotti, M., Pantel, P.: A Bootstrapping Algorithm for Automatically Harvesting Semantic Relations. In: Proceedings of Inference in Computational Semantics (ICoS-2006), Buxton, England (2006)
Chen, H.H., Tsai, S.C., Tsai, J.H.: Mining tables from large scale html texts. In: 18th International Conference on Computational Linguistics (COLING), pp. 166–172 (2000)
Yoshida, M., Torisawa, K., Tsujii, J.: A method to integrate tables of the world wide web. In: Proceedings of the International Workshop on Web Document Analysis (WDA 2001), Seattle, US (2001)
Almuhareb, A., Poesio, M.: Attribute-Based and Value-Based Clustering: An Evaluation. In: Proc. of EMNLP, pp. 158–165 (2004)
Woods, W.: What’s in a Link: Foundations for Semantic Networks. Bolt, Beranek and Newman (1975)
Popescu, A.M., Etzioni, O.: Extracting product features and opinions from reviews. In: Proceedings of EMNLP 2005 (2005)
Resnik, P.: Selectional constraints: an information-theoretic model and its computational realization. Cognition 61(1-2), 127–159 (1996)
Framis, F.R.: An experiment on learning appropriate Selectional Restrictions from a parsed corpus. In: Proceedings of the 15th conference on Computational linguistics, vol. 2, pp. 769–774 (1994)
Ribas, F.: On learning more appropriate selectional restrictions. In: Proceedings of the 7th Conference of the European Chapter of the Association for Computational Linguistics, pp. 112–118 (1995)
Wagner, A.: Enriching a lexical semantic net with selectional preferences by means of statistical corpus analysis. In: Proceedings of the ECAI-2000 Workshop on Ontology Learning, pp. 37–42 (2000)
Pustejovsky, J.: The Generative Lexicon. Bradford Books (1998)
Snow, R., Jurafsky, D., Ng, A.: Semantic taxonomy induction from heterogenous evidence. In: Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the ACL, pp. 801–808 (2006)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhao, J., Liu, H., Lu, R. (2007). Automatic Acquisition of Attribute Host by Selectional Constraint Resolution. In: Gelbukh, A., Kuri Morales, Á.F. (eds) MICAI 2007: Advances in Artificial Intelligence. MICAI 2007. Lecture Notes in Computer Science(), vol 4827. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-76631-5_93
Download citation
DOI: https://doi.org/10.1007/978-3-540-76631-5_93
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-76630-8
Online ISBN: 978-3-540-76631-5
eBook Packages: Computer ScienceComputer Science (R0)