Abstract
We propose an information filtering system for documents by a user profile using latent semantics obtained by singular value decomposition (SVD) and independent component analysis (ICA). In information filtering systems, it is useful to analyze the latent semantics of documents. ICA is one method to analyze the latent semantics. We assume that topics are independent of each other. Hence, when ICA is applied to documents, we obtain the topics included in the documents. By using SVD remove noises before applying ICA, we can improve the accuracy of topic extraction. By representation of the documents with those topics, we improve recommendations by the user profile. In addition, we construct a user profile with a genetic algorithm (GA) and evaluate it by 11-point average precision. We carried out an experiment using a test collection to confirm the advantages of the proposed method.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
S Deerwester T Dumais T Landauer et al. (1990) ArticleTitleIndexing by latent semantic analysis J Soc Inf Sci 41 391–497 Occurrence Handle10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9
E Bingham M Kabán A Girolami (2003) ArticleTitleTopic identification in dynamical text by complexity pursuit Neural Process Lett 17 IssueID1 69–83 Occurrence Handle10.1023/A:1022990829563
Kolenda T, Hansen LK (2000) Independent components in text. Advances in independent component analysis. Springer, London
Kolenda T, Hansen LK, Larsen J (2001) Signal detection using ICA: application to chat room topic spotting. 3rd International Conference on Independent Component Analysis and Blind Source Separation, Sandiego, California, USA, pp 540–545
NTCIR2 (2000) NII-NACSIS test collection for IR system. http://research.nii.ac.jp/ntcir/index-en.html
A Hyvarinen E Oja (2000) ArticleTitleIndependent component analysis: a tutorial Neural Networks 13 411–430 Occurrence Handle10.1016/S0893-6080(00)00026-5
G Salton MJ McGill (1997) Introduction to modern information retrieval McGraw-Hill New York Occurrence Handle00046828
Matsumoto Y (1997) Japanese morphological analysis system: CHASEN. Information Science Technical Report NAIST-IS-TR97007, Nara Institute of Science Technology
Author information
Authors and Affiliations
Corresponding author
Additional information
This work was presented in part at the 10th International Symposium on Artificial Life and Robotics, Oita, Japan, February 4–6, 2005
About this article
Cite this article
Yokoi, T., Yanagimoto, H. & Omatu, S. Information filtering using SVD and ICA. Artif Life Robotics 10, 116–119 (2006). https://doi.org/10.1007/s10015-005-0372-6
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.1007/s10015-005-0372-6