Abstract:
We are concerned with the use of Factorial Correspondence Analysis (FCA) for image retrieval. FCA is designed for analysing contingency tables. In Textual Data Analysis (TDA), FCA analyses a contingency table crossing terms/words and documents. To adapt FCA to images, we first define "visual words" computed from Scale-Invariant Feature Transform (SIFT) descriptors in images and use them for image quantization. At this step, we can build a contingency table crossing "visual words" as terms/words and images as documents. In spite of its successful applications in information retrieval, FCA suffers from a large-dimension problem because it requires the diagonalization of a large matrix. We propose a new algorithm, CABoost, which overcomes this large-dimension problem of FCA. The data are sampled by column (word) and an FCA is applied to each sample. After several samplings, we finally combine the separate results by a weighted Principal Component Analysis (PCA). The numerical experiments show that our algorithm runs faster than the classical FCA without losing precision.
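The sample-then-combine idea described in the abstract can be illustrated with a minimal sketch. It is not the CABoost implementation: the abstract does not specify the sampling scheme or the weights, so the function names, the uniform column sampling without replacement, and the equal block weights below are all assumptions made for illustration. The table is laid out with images as rows and visual words as columns, and every sampled sub-table is assumed to have no all-zero row or column.

```python
import numpy as np


def correspondence_analysis(table, n_components):
    """Row principal coordinates of a contingency table (images x words)."""
    P = table / table.sum()            # correspondence matrix
    r = P.sum(axis=1)                  # row masses (images)
    c = P.sum(axis=0)                  # column masses (visual words)
    expected = np.outer(r, c)
    # Standardized residuals; their SVD replaces the large diagonalization.
    S = (P - expected) / np.sqrt(expected)
    U, sigma, _ = np.linalg.svd(S, full_matrices=False)
    k = n_components
    coords = (U[:, :k] * sigma[:k]) / np.sqrt(r)[:, None]
    return coords, sigma[:k] ** 2      # coordinates and principal inertias


def sampled_ca(table, n_samples, sample_size, n_components, seed=0):
    """Hypothetical helper: CA on column samples, then PCA on the results."""
    rng = np.random.default_rng(seed)
    blocks = []
    for _ in range(n_samples):
        # Assumption: uniform sampling of word columns without replacement.
        cols = rng.choice(table.shape[1], size=sample_size, replace=False)
        coords, _ = correspondence_analysis(table[:, cols], n_components)
        # Assumption: equal weight per block (placeholder for the weighting).
        blocks.append(coords / np.sqrt(n_samples))
    stacked = np.hstack(blocks)
    centered = stacked - stacked.mean(axis=0)
    # PCA of the stacked coordinates via SVD of the centered matrix.
    U, s, _ = np.linalg.svd(centered, full_matrices=False)
    return U[:, :n_components] * s[:n_components]


if __name__ == "__main__":
    rng = np.random.default_rng(42)
    # Synthetic counts: 20 images x 50 visual words, strictly positive.
    counts = rng.poisson(5, size=(20, 50)) + 1
    embedding = sampled_ca(counts, n_samples=4, sample_size=15, n_components=3)
    print(embedding.shape)
```

Each sub-table has only `sample_size` columns, so the decomposition in every round works on a much smaller matrix than the full table, which is the motivation stated in the abstract.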
Date of Conference: 18-20 June 2008
Date Added to IEEE Xplore: 15 July 2008