Abstract
Explosive multimedia resources are generated on web, which can be typically considered as a kind of multi-view data in nature. In this paper, we present a Semi-supervised Unified Latent Factor learning approach (SULF) to learn a predictive unified latent representation by leveraging both complementary information among multiple views and the supervision from the partially label information. On one hand, SULF employs a collaborative Nonnegative Matrix Factorization formulation to discover a unified latent space shared across multiple views. On the other hand, SULF adopts a regularized regression model to minimize a prediction loss on partially labeled data with the latent representation. Consequently, the obtained parts-based representation can have more discriminating power. In addition, we also develop a mechanism to learn the weights of different views automatically. To solve the proposed optimization problem, we design an effective iterative algorithm. Extensive experiments are conducted for both classification and clustering tasks on three real-world datasets and the compared results demonstrate the superiority of our approach.
Similar content being viewed by others
Notes
\(\mathbf {w}_k\) is the \(k\)th row of \(\mathbf {W}\). In practice, \(||\mathbf {w}_k||_2\) could be close to zero but not zero. Theoretically, it could be zeros. For this case, we can let \(\varepsilon \) is very small constant, and regularize \(e_{kk}=\frac{1}{2\sqrt{\mathbf {w}_k^T\mathbf {w}_k+\varepsilon }}\).
For convenience, \(\mathbf {A}\) is approximately as constant matrix when requiring the derivatives of \(\frac{\partial {\mathcal {L}}}{\partial \mathbf {V}_l}\).
References
Amini, M.R., Usunier, N., Goutte, C.: Learning from multiple partially observed views—an application to multilingual text categorization. In: Neural Information Processing Systems, pp. 28–36 (2009)
Ando, R.K., Zhang, T.: Two-view feature generation model for semi-supervised learning. In: International Conference on Machine Learning, pp. 25–32 (2007)
Blum, A., Mitchell, T.M.: Combining labeled and unlabeled data with co-training. In: Computational Learning Theory, pp. 92–100 (1998)
Cai, D., He, X., Han, J., Huang, T.S.: Graph regularized nonnegative matrix factorization for data representation. IEEE Trans. Pattern Anal. Mach. Intell. 33, 1548–1560 (2011)
Chaudhuri, K., Kakade, S.M., Livescu, K., Sridharan, K.: Multi-view clustering via canonical correlation analysis. In: International Conference on Machine Learning, pp. 17–136 (2009)
Chen, N., Zhu, J., Sun, F., Xing, E.P.: Large-margin predictive latent subspace learning for multiview data analysis. IEEE Trans. Pattern Anal. Mach. Intell. 34, 2365–2378 (2012)
Chen, X., Chen, S., Xue, H., Zhou, X.: A unified dimensionality reduction framework for semi-paired and semi-supervised multi-view data. Pattern Recognit. 45(5), 2005–2018 (2012)
Chen, Y., Rege, M., Dong, M., Hua, J.: Nonnegative matrix factorization for semi-supervised data clustering. Knowl. Inf. Syst. 17, 355–379 (2008)
Chua, T.S., Tang, J., Hong, R., Li, H., Luo, Z., Zheng, Y.: NUS-WIDE: a real-world web image database from National University of Singapore. In: Conference on Image and Video Retrieval, pp. 1–9 (2009)
Ding, C.H.Q., Li, T., Jordan, M.I.: Convex and semi-nonnegative matrix factorizations. IEEE Trans. Pattern Anal. Mach. Intell. 32, 45–55 (2010)
Ding, C.H.Q., Zhou, D., He, X., Zha, H.: R1PCA: rotational invariant L1-norm principal component analysis for robust subspace factorization. In: International Conference on Machine Learning, pp. 281–288 (2006)
Duygulu, P., Barnard, K., Freitas, J.F.G.D., Forsyth, D.A.: Object recognition as machine translation: learning a Lexicon for a fixed image vocabulary. In: European Conference on Computer Vision, pp. 97–112 (2002)
Hong, R., Tang, J., Tan, H.K., Ngo, C.W., Yan, S., Chua, T.S.: Beyond search: event-driven summarization for web videos. ACM Trans. Multimed. Comput. Commun. Appl. 7(4), 35:1–35:18 (2011)
Hong, R., Wang, M., Li, G., Nie, L., Zha, Z.J., Chua, T.S.: Multimedia question answering. IEEE Trans. Multimed. 19(4), 72–78 (2012)
Hotelling, H.: Relations between two sets of variates. Biometrika 28, 321–377 (1936)
Kumar, A., Rai, P., Daume, III H.: Co-regularized multi-view spectral clustering. In: Neural Information Processing Systems, pp. 1413–1421 (2011)
Lee, D.D., Seung, H.S.: Learning the parts of objects by non-negative matrix factorization. Nature (1999)
Lee, D.D., Seung, H.S.: Algorithms for nonnegative matrix factorization. In: Neural Information Processing Systems, Vol. 13, pp. 556–562 (2000)
Li, Z., Liu, J., Lu, H.: Structure preserving non-negative matrix factorization for dimensionality reduction. Comput. Vis. Image Underst. 117(9), 1175–1189 (2013)
Li, Z., Liu, J., Zhu, X., Liu, T., Lu, H.: Image annotation using multi-correlation probabilistic matrix factorization. In: ACM Multimedia, pp. 1187–1190 (2010)
Li, Z., Yang, Y., Liu, J., Zhou, X., Lu, H.: Unsupervised feature selection using nonnegative spectral analysis. In: AAAI (2012)
Liu, H., Wu, Z., Cai, D., Huang, T.S.: Constrained nonnegative matrix factorization for image representation. IEEE Trans. Pattern Anal. Mach. Intell. 34, 1299–1311 (2012)
Nigam, K., McCallum, A.K., Thrun, S., Mitchell, T.M.: Text classification from labeled and unlabeled documents using EM. Mach. Learn. 39, 103–134 (2000)
van de Sande, K.E.A., Gevers, T., Snoek, C.G.M.: Evaluating color descriptors for object and scene recognition. IEEE Trans. Pattern Anal. Mach. Intell. 32, 1582–1596 (2010)
Sun, T., Chen, S., yu Yang, J., Shi, P.: A novel method of combined feature extraction for recognition. In: IEEE International Conference on Data Mining, pp. 1043–1048 (2008)
Wang, M., Hong, R., Li, G., Zha, Z.J., Yan, S., Chua, T.S.: Event driven web video summarization by tag localization and key-shot identification. IEEE Trans. Multimed. 14(4), 975–985 (2012)
Acknowledgments
This work was supported by 973 Program (2012C B316304) and National Natural Science Foundation of China (61272329, 61202325, and 61070104).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Jiang, Y., Liu, J., Li, Z. et al. Semi-supervised Unified Latent Factor learning with multi-view data. Machine Vision and Applications 25, 1635–1645 (2014). https://doi.org/10.1007/s00138-013-0556-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00138-013-0556-3