Abstract
In recent years, cross-domain learning algorithms have attracted much attention to solve labeled data insufficient problem. However, these cross-domain learning algorithms cannot be applied for subspace learning, which plays a key role in multimedia processing. This paper envisions the cross-domain discriminative subspace learning and provides an effective solution to cross-domain subspace learning. In particular, we propose the cross-domain discriminative locally linear embedding or CDLLE for short. CDLLE connects the training and the testing samples by minimizing the quadratic distance between the distribution of the training samples and that of the testing samples. Therefore, a common subspace for data representation can be preserved. We basically expect the discriminative information to separate the concepts in the training set can be shared to separate the concepts in the testing set as well and thus we have a chance to address above cross-domain problem duly. The margin maximization is duly adopted in CDLLE so the discriminative information for separating different classes can be well preserved. Finally, CDLLE encodes the local geometry of each training samples through a series of linear coefficients which can reconstruct a given sample by its intra-class neighbour samples and thus can locally preserve the intra-class local geometry. Experimental evidence on NUS-WIDE, a popular social image database collected from Flickr, and MSRA-MM, a popular real-world web image annotation database collected from the Internet by using Microsoft Live Search, demonstrates the effectiveness of CDLLE for real-world cross-domain applications.





Similar content being viewed by others
References
Belkin M, Niyogi P, Sindhwani V (2006) Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples. J Mach Learn Res 7:2399–2434
Cai D, He X, Han J (2007) Semi-supervised discriminant analysis. IEEE International Conference on Computer Vision, pp. 1-7
Caruana R (1997) Multitask learning. Mach Lear 28(1):41–75
Chua T-S, Tang J, Hong R, Li H, Luo Z, Zheng Y-T (2009) NUS-WIDE: A real-world web image database from national university of Singapore. ACM International Conference on Image and Video Retrieval, pp. 1-8
Dai W, Yang Q, Xue G, Yu Y (2007) Boosting for transfer learning. Processing of the 24th international conference on Machine learning, pp. 193-200
Duan L, Tsang IW, Xu D, Maybank SJ (2009) Domain transfer svm for video concept detection. Proceeding of the 21th conference on Computer Vision and Pattern Recognition
Fisher RA (1936) The use of multiple measurements in taxonomic problems. Ann Eugen 7(2):179–188
He X, Niyogi P (2003) Locality preserving projections. Adv Neural Inf Process Syst 16:1–8
Li H, Wang M, Hua X-S (2009) MSRA-MM 2.0: A large-scale web multimedia dataset. ICDM Workshop on Internet Multimedia Mining
Ling X, Dai W, Xue G, Yang Q, Yu Y (2008) Spectral domain-transfer learning. Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 488-496
Liu W, Tao D, Liu J (2008) Transductive component analysis. The 8th IEEE International Conference on Data Mining, pp. 433-442
Liu D, Hua XS, Yang L, Wang M, Zhang H-J (2009) Tag ranking. International World Wide Web Conference (WWW)
Mihalkova L, Mooney R (2006) Transfer learning with markov logic networks. ICML Workshop on Structural Knowledge Transfer for Machine Learning
Pan J, Kwok JT, Yang Q (2008) Transfer learning via dimensionality reduction. Proceedings of the 23th AAAI Conference on Artificial Intelligence, pp. 677-682
Parzen E (1962) On estimation of a probability density function and mode. Ann Math Stat 33(3):1065–1076
Roweis ST, Saul LK (2000) Nonlinear dimensionality reduction by locally linear embedding. Science 290:2323–2326
Sebe N, Lew MS, Huijsmans DP (2000) Toward improved ranking metrics. IEEE Trans Pattern Anal Mach Intell 22(10):1132–1143
Si S, Tao D, Chan KP Evolutionary cross-domain discriminative Hessian eigenmaps. IEEE Trans Image Process, to appear
Si S, Tao D, Geng B Bregmann divergence based regularization for transfer subspace learning. IEEE Trans Knowl Data Eng to appear
Snoek CG, Worring M, Smeulders AW (2005) Early versus late fusion in semantic video analysis. Proceeding of the 13th ACM international on Multimedia, pp. 399–402
Snoek CGM, Worring M, Geusebroek JM, Koelma DC, Seinstra FJ, Smeulders AWM (2006) The semantic pathfinder: Using an authoring metaphor for generic multimedia indexing. IEEE Trans Pattern Anal Mach Intell 28(10):1678–1689
Song D, Tao D Biologically inspired feature manifold for scene classification. IEEE Trans Image Process, to appear
Tang J, Yan S, Hong R, Qi GJ, Chua TS (2009) Inferring semantic concepts from community-contributed images and noisy tags. Proceeding of the 17th ACM international on Multimedia, pp. 223–232
Tao D, Li X, Wu X, Maybank SJ (2007) General tensor discriminant analysis and gabor features for gait recognition. IEEE Trans Pattern Anal Mach Intell 29(10):1700–1715
Wang M, Hua XS (2008) Study on the combination of video concept detectors. Proceeding of the 16th ACM International Conference on Multimedia, pp. 47–650
Wang M, Hua XS, Song Y, Yuan X, Li SP, Zhang HJ (2006) Automatic video annotation by semi-supervised learning with kernel density estimation. Proceeding of the 14th ACM International Conference on Multimedia, pp. 967–976
Wang M, Hua XS, Yuan X, Song Y, Dai LR (2007) Optimizing multi-graph learning: Towards a unified video annotation scheme. Proceeding of the 15th International Conference on Multimedia, pp. 862-871
Wang J, Jiang YG, Chang SF (2009) Label diagnosis through self tuning for web image search. Proceeding of the 21th conference on Computer Vision and Pattern Recognition
Wang M, Yang K, Hua XS, Zhang H-J (2009) Visual tag dictionary: interpreting tags with visual words. ACM Workshop on Web-Scale Multimedia Corpus, in association with ACM MM
Wu Z, Ke QF, Isard M, Sun J (2009) Bundling features for large scale partial-duplicate web image search. Proceeding of the 21th conference on Computer Vision and Pattern Recognition
Yang J, Hauptmann AG (2008) A framework for classifier adaptation and its applications in concept detection. Proceedings of the 1st ACM SIGMM International Conference on Multimedia Information Retrieval, pp. 467–474
Yang J, Yan R, Hauptmann AG (2007) Cross-domain video concept detection using adaptive svms. Proceeding of the 15th international conference on Multimedia, pp. 188-197
Zhang T, Tao D, Yang J (2008) Discriminative locality alignment. Proceeding of the 10th European Conference on Computer Vision, pp. 725-738
Zhang T, Tao D, Li X, Yang T (2008) A unifying framework for spectral analysis based dimensionality reduction. IEEE International Joint Conference on Neural Networks 1670-1677, June
Zhang T, Tao D, Yang J (2009) Patch alignment for dimensionality reduction. IEEE Trans Knowl Data Eng 21(9):1299–1313
Zheng V, Yang E, Yang Q, Xiang W, Shen D (2008) Transferring localization models over time. Proceedings of the 23th international conference on Artificial intelligence, pp. 1421-1426
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Si, S., Tao, D., Wang, M. et al. Social image annotation via cross-domain subspace learning. Multimed Tools Appl 56, 91–108 (2012). https://doi.org/10.1007/s11042-010-0567-2
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-010-0567-2