Abstract
Blind Signal Separation (BSS) based on Independent Component Analysis (ICA) is an emerging approach which application is not limited to the signal processing research, where its application principle is rather straight forward. For an increasing amount of information processing fields, ICA has meaningful application which are still undiscovered. The aim of this paper is to investigate the ability of linguistic feature extraction based on word context preprocessing by ICA. The work refers to a first brief analysis in which ICA was applied to an English corpus. We continue this analysis depending on the number of components and the amount of syntactical information that we take into account. Furthermore we discuss to which extent the results deliver general linguistic features, or linguistic features giving us information about the text.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Hyvärinen, A., Karhunen, J., Oja, E.: Independent Component Analysis. Wiley & Sons, Chichester (2001)
Honkela, T., Hyvärinen, A., Väyrynen J.: Emergence of Linguistic Features: Independent Component Analysis of Contexts. In: Proc. of the 9th Neural Computation and Psychology Workshop (NCPW9), pp. 129–138 (2005)
Lagus, K., Creutz, M., Virpioja, S.: Latent Linguistic Codes for Morphemes using Independent Component Analysis. In: Proc. of the 9th Neural Computation and Psychology Workshop (NCPW9), pp. 129–138 (2005)
Hyvärinen, A.: Fast and Robust Fixed-Point Algorithms for Independent Component Analysis. IEEE Transactions on Neural Networks 10(3), 626–634 (1999)
Honkela, T., Hyvärinen, A.: Linguistic Feature Extraction using Independent Component Analysis. In: Proc. of Int. Joint Conf. on Neural Networks (IJCNN), pp. 279–284 (2004)
Rapp, R.: Mining Text for Word Senses using Independent Component Analysis. In: Proc. of Int. Conf. on Data Mining (2004)
Yarowsky, D.: Unsupervised Word Sense Disambiguation rivaling Supervised Methods. In: Proc. of the 33rd Annual Meeting of the ACL, pp. 189–196 (1995)
Steiner, P.: Wortarten und Korpus. Dissertation at the Westfälische Wilhelms-Universität Münster. Shaker Verlag (2004)
Rapp, R.: Automatic Identification of Word Translations from unrelated English and German Corpora. In: Proc. of the 37th Annual Meeting of the ACL, pp. 519–526 (1999)
Creutz, M., Lindén, K.: Morpheme Segmentation Gold Standards for Finnish and English. Techreport Helsinki University of Technology, Publications in Computer and Information Science (2005)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Borschbach, M., Pyka, M. (2007). Specific Circumstances on the Ability of Linguistic Feature Extraction Based on Context Preprocessing by ICA. In: Davies, M.E., James, C.J., Abdallah, S.A., Plumbley, M.D. (eds) Independent Component Analysis and Signal Separation. ICA 2007. Lecture Notes in Computer Science, vol 4666. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74494-8_86
Download citation
DOI: https://doi.org/10.1007/978-3-540-74494-8_86
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74493-1
Online ISBN: 978-3-540-74494-8
eBook Packages: Computer ScienceComputer Science (R0)