Skip to main content
Log in

Semi-supervised Unified Latent Factor learning with multi-view data

  • Special Issue Paper
  • Published:
Machine Vision and Applications Aims and scope Submit manuscript

Abstract

Explosive multimedia resources are generated on web, which can be typically considered as a kind of multi-view data in nature. In this paper, we present a Semi-supervised Unified Latent Factor learning approach (SULF) to learn a predictive unified latent representation by leveraging both complementary information among multiple views and the supervision from the partially label information. On one hand, SULF employs a collaborative Nonnegative Matrix Factorization formulation to discover a unified latent space shared across multiple views. On the other hand, SULF adopts a regularized regression model to minimize a prediction loss on partially labeled data with the latent representation. Consequently, the obtained parts-based representation can have more discriminating power. In addition, we also develop a mechanism to learn the weights of different views automatically. To solve the proposed optimization problem, we design an effective iterative algorithm. Extensive experiments are conducted for both classification and clustering tasks on three real-world datasets and the compared results demonstrate the superiority of our approach.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Notes

  1. \(\mathbf {w}_k\) is the \(k\)th row of \(\mathbf {W}\). In practice, \(||\mathbf {w}_k||_2\) could be close to zero but not zero. Theoretically, it could be zeros. For this case, we can let \(\varepsilon \) is very small constant, and regularize \(e_{kk}=\frac{1}{2\sqrt{\mathbf {w}_k^T\mathbf {w}_k+\varepsilon }}\).

  2. For convenience, \(\mathbf {A}\) is approximately as constant matrix when requiring the derivatives of \(\frac{\partial {\mathcal {L}}}{\partial \mathbf {V}_l}\).

  3. http://cvxr.com/cvx/.

  4. http://koen.me/research/colordescriptors/.

References

  1. Amini, M.R., Usunier, N., Goutte, C.: Learning from multiple partially observed views—an application to multilingual text categorization. In: Neural Information Processing Systems, pp. 28–36 (2009)

  2. Ando, R.K., Zhang, T.: Two-view feature generation model for semi-supervised learning. In: International Conference on Machine Learning, pp. 25–32 (2007)

  3. Blum, A., Mitchell, T.M.: Combining labeled and unlabeled data with co-training. In: Computational Learning Theory, pp. 92–100 (1998)

  4. Cai, D., He, X., Han, J., Huang, T.S.: Graph regularized nonnegative matrix factorization for data representation. IEEE Trans. Pattern Anal. Mach. Intell. 33, 1548–1560 (2011)

    Article  Google Scholar 

  5. Chaudhuri, K., Kakade, S.M., Livescu, K., Sridharan, K.: Multi-view clustering via canonical correlation analysis. In: International Conference on Machine Learning, pp. 17–136 (2009)

  6. Chen, N., Zhu, J., Sun, F., Xing, E.P.: Large-margin predictive latent subspace learning for multiview data analysis. IEEE Trans. Pattern Anal. Mach. Intell. 34, 2365–2378 (2012)

    Article  Google Scholar 

  7. Chen, X., Chen, S., Xue, H., Zhou, X.: A unified dimensionality reduction framework for semi-paired and semi-supervised multi-view data. Pattern Recognit. 45(5), 2005–2018 (2012)

    Article  MATH  Google Scholar 

  8. Chen, Y., Rege, M., Dong, M., Hua, J.: Nonnegative matrix factorization for semi-supervised data clustering. Knowl. Inf. Syst. 17, 355–379 (2008)

    Article  Google Scholar 

  9. Chua, T.S., Tang, J., Hong, R., Li, H., Luo, Z., Zheng, Y.: NUS-WIDE: a real-world web image database from National University of Singapore. In: Conference on Image and Video Retrieval, pp. 1–9 (2009)

  10. Ding, C.H.Q., Li, T., Jordan, M.I.: Convex and semi-nonnegative matrix factorizations. IEEE Trans. Pattern Anal. Mach. Intell. 32, 45–55 (2010)

    Article  Google Scholar 

  11. Ding, C.H.Q., Zhou, D., He, X., Zha, H.: R1PCA: rotational invariant L1-norm principal component analysis for robust subspace factorization. In: International Conference on Machine Learning, pp. 281–288 (2006)

  12. Duygulu, P., Barnard, K., Freitas, J.F.G.D., Forsyth, D.A.: Object recognition as machine translation: learning a Lexicon for a fixed image vocabulary. In: European Conference on Computer Vision, pp. 97–112 (2002)

  13. Hong, R., Tang, J., Tan, H.K., Ngo, C.W., Yan, S., Chua, T.S.: Beyond search: event-driven summarization for web videos. ACM Trans. Multimed. Comput. Commun. Appl. 7(4), 35:1–35:18 (2011)

    Google Scholar 

  14. Hong, R., Wang, M., Li, G., Nie, L., Zha, Z.J., Chua, T.S.: Multimedia question answering. IEEE Trans. Multimed. 19(4), 72–78 (2012)

    Article  Google Scholar 

  15. Hotelling, H.: Relations between two sets of variates. Biometrika 28, 321–377 (1936)

    Article  MATH  Google Scholar 

  16. Kumar, A., Rai, P., Daume, III H.: Co-regularized multi-view spectral clustering. In: Neural Information Processing Systems, pp. 1413–1421 (2011)

  17. Lee, D.D., Seung, H.S.: Learning the parts of objects by non-negative matrix factorization. Nature (1999)

  18. Lee, D.D., Seung, H.S.: Algorithms for nonnegative matrix factorization. In: Neural Information Processing Systems, Vol. 13, pp. 556–562 (2000)

  19. Li, Z., Liu, J., Lu, H.: Structure preserving non-negative matrix factorization for dimensionality reduction. Comput. Vis. Image Underst. 117(9), 1175–1189 (2013)

    Google Scholar 

  20. Li, Z., Liu, J., Zhu, X., Liu, T., Lu, H.: Image annotation using multi-correlation probabilistic matrix factorization. In: ACM Multimedia, pp. 1187–1190 (2010)

  21. Li, Z., Yang, Y., Liu, J., Zhou, X., Lu, H.: Unsupervised feature selection using nonnegative spectral analysis. In: AAAI (2012)

  22. Liu, H., Wu, Z., Cai, D., Huang, T.S.: Constrained nonnegative matrix factorization for image representation. IEEE Trans. Pattern Anal. Mach. Intell. 34, 1299–1311 (2012)

    Article  Google Scholar 

  23. Nigam, K., McCallum, A.K., Thrun, S., Mitchell, T.M.: Text classification from labeled and unlabeled documents using EM. Mach. Learn. 39, 103–134 (2000)

    Article  MATH  Google Scholar 

  24. van de Sande, K.E.A., Gevers, T., Snoek, C.G.M.: Evaluating color descriptors for object and scene recognition. IEEE Trans. Pattern Anal. Mach. Intell. 32, 1582–1596 (2010)

    Article  Google Scholar 

  25. Sun, T., Chen, S., yu Yang, J., Shi, P.: A novel method of combined feature extraction for recognition. In: IEEE International Conference on Data Mining, pp. 1043–1048 (2008)

  26. Wang, M., Hong, R., Li, G., Zha, Z.J., Yan, S., Chua, T.S.: Event driven web video summarization by tag localization and key-shot identification. IEEE Trans. Multimed. 14(4), 975–985 (2012)

    Article  Google Scholar 

Download references

Acknowledgments

This work was supported by 973 Program (2012C B316304) and National Natural Science Foundation of China (61272329, 61202325, and 61070104).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jing Liu.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Jiang, Y., Liu, J., Li, Z. et al. Semi-supervised Unified Latent Factor learning with multi-view data. Machine Vision and Applications 25, 1635–1645 (2014). https://doi.org/10.1007/s00138-013-0556-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00138-013-0556-3

Keywords

Navigation