Abstract
Laborious construction of large wordnets (lexico-semantic networks) can be supported by automatic wordnet expansion methods. Several methods were proposed but mostly were not thoroughly evaluated and compared. In the paper an evaluation methodology for automated wordnet expansion algorithms is proposed. Basic requirements for it are formulated in relation to the linguistic process. The general scheme based on the idea of automated wordnet reconstruction is presented. The methodology is illustrated by applying it to the comparison of the two top level wordnet expansion algorithms: Algorithm of Activation-area Attachment and the algorithm of Snow et al.. The latter was reimplemented and adopted to the Polish language tools.
Work financed by Innovative Economy Programme project POIG.01.01.02-14-013/09.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Alfonseca, E., Manandhar, S.: Extending a lexical ontology by a combination of distributional semantics signatures. In: Gómez-Pérez, A., Benjamins, V.R. (eds.) EKAW 2002. LNCS (LNAI), vol. 2473, pp. 1–7. Springer, Heidelberg (2002)
BNC: The British National Corpus, version 2 (BNC World), distributed by Oxford University Computing Services on behalf of the BNC Consortium (2001)
Broda, B., Piasecki, M., Szpakowicz, S.: Extraction of polish noun senses from large corpora by means of clustering. Control and Cybernetics 31(2), 401–420 (2010)
Caraballo, S.A.: Automatic construction of a hypernym-labeled noun hierarchy from text. In: Proceedings of ACL 1999, Baltimore, MD, pp. 120–126 (1999)
Fellbaum, C. (ed.): WordNet — An Electronic Lexical Database. The MIT Press (1998)
Harris, Z.S.: Mathematical Structures of Language. Interscience Publishers, New York (1968)
Israel, G.: Determining sample size. Tech. rep., University of Florida (1992)
Lin, D.: Principle-based parsing without overgeneration. In: Proc. ACL 1993, Columbus, Ohio (1993)
Pantel, P.: Clustering by committee. Ph.D. thesis, Edmonton, Alta., Canada (2003), adviser-Dekang Lin
Piasecki, M., Broda, B., Głąbska, M., Marcińczuk, M., Szpakowicz, S.: Semi-automatic expansion of polish wordnet based on activation-area attachment. In: Recent Advances in Intelligent Information Systems, pp. 247–260. EXIT (2009)
Piasecki, M., Szpakowicz, S., Marcińczuk, M., Broda, B.: Classification-based filtering of semantic relatedness in hypernymy extraction. In: Nordström, B., Ranta, A. (eds.) GoTAL 2008. LNCS (LNAI), vol. 5221, pp. 393–404. Springer, Heidelberg (2008)
Piasecki, M., Szpakowicz, S., Broda, B.: A Wordnet from the Ground Up. Oficyna Wydawnicza Politechniki Wrocławskiej, Wrocław (2009)
Przepiórkowski, A.: The IPI PAN Corpus: Preliminary version. Institute of Computer Science PAS (2004)
Ravichandran, D., Pantel, P., Hovy, E.: Randomized algorithms and nlp: using locality sensitive hash function for high speed noun clustering. In: Proc. of the 43rd Annual Meeting on ACL, pp. 622–629 (2005)
Snow, R.: Semantic Taxonomy Induction. Ph.D. thesis (2009)
Snow, R., Jurafsky, D., Ng, A.Y.: Semantic taxonomy induction from heterogenous evidence. In: COLING 2006 (2006)
Weiss, D.: Korpus Rzeczpospolitej, corpus from the online edtion of Rzeczypospolita (2008), http://www.cs.put.poznan.pl/dweiss/rzeczpospolita
Widdows, D.: Unsupervised methods for developing taxonomies by combining syntactic and statistical information. In: Proc. HLT of North American Chapter of the ACL (2003)
Witschel, H.F.: Using decision trees and text mining techniques for extending taxonomies. In: Proc. of Learning and Extending Lexical Ontologies by Using Machine Learning Methods, Workshop at ICML 2005 (2005)
Yang, H., Callan, J.: A metric-based framework for automatic taxonomy induction. In: Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP of the AFNLP, pp. 271–279. ACL (2009)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Broda, B., Kurc, R., Piasecki, M., Ramocki, R. (2012). Evaluation Method for Automated Wordnet Expansion. In: Bouvry, P., Kłopotek, M.A., Leprévost, F., Marciniak, M., Mykowiecka, A., Rybiński, H. (eds) Security and Intelligent Information Systems. SIIS 2011. Lecture Notes in Computer Science, vol 7053. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25261-7_23
Download citation
DOI: https://doi.org/10.1007/978-3-642-25261-7_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25260-0
Online ISBN: 978-3-642-25261-7
eBook Packages: Computer ScienceComputer Science (R0)