Skip to main content

An Integrated Robust Graph Regularized Non-negative Matrix Factorization for Multi-dimensional Genomic Data Analysis

  • Conference paper
  • First Online:
Recent Advances in Data Science (IDMB 2019)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1099))

Included in the following conference series:

Abstract

With the generation of “multi-dimensional genomic data”, the multi-platform genomic analysis technology for simultaneous biological samples has been rapidly developed. However, the existing shortcomings of the models for comprehensive analysis of multi-dimensional genomic data are the lack of robustness. We use the integrated matrix factorization model by introducing L2,1-norm to deal with above problem. In this paper, we propose an Integrated Robust Graph Regularization Non-negative Matrix Factorization for multi-dimensional genomic data analysis which named iRGNMF. In the formulation of the objective function, we introduced the local structure to maintain regularization and L2,1-norm to consider the data geometry and model robustness. We also applied this model to three types of data of the same cancer from The Cancer Genome Atlas (TCGA). Experiments shown that iRGNMF has obtained considerable effects in sample clustering, and found suspicious disease genes, which are used to reveal hidden patterns and information of biology in multi-dimensional gene data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Paatero, P., Tapper, U.: Positive matrix factorization: a non-negative factor model with optimal utilization of error estimates of data values. Environmetrics 5(2), 111–126 (2010)

    Article  Google Scholar 

  2. Lee, D.D., Seung, H.S.: Learning the parts of objects by non-negative matrix factorization. Nature 401(6755), 788–791 (1999)

    Article  Google Scholar 

  3. Wang, D., Liu, J.X., Gao, Y.L., Zheng, C.H., Xu, Y.: Characteristic gene selection based on robust graph regularized non-negative matrix factorization. IEEE/ACM Trans. Comput. Biol. Bioinf. 13(99), 1059–1067 (2016)

    Article  Google Scholar 

  4. Zhang, S., Li, Q., Liu, J., Zhou, X.J.: A novel computational framework for simultaneous integration of multiple types of genomic data to identify microRNA-gene regulatory modules. Bioinformatics 27(13), i401–i409 (2011)

    Article  Google Scholar 

  5. Li, Y., Ngom, A.: The non-negative matrix factorization toolbox for biological data mining. Source Code Biol. Med. 8(1), 1–15 (2013)

    Article  Google Scholar 

  6. Grais, E.M., Erdogan, H.: Regularized nonnegative matrix factorization using Gaussian mixture priors for supervised single channel source separation. Comput. Speech Lang. 27(3), 746–762 (2013)

    Article  Google Scholar 

  7. Liu, W., Zheng, N., You, Q.: Nonnegative matrix factorization and its applications in pattern recognition. Chin. Sci. Bull. 51(1), 7–18 (2006)

    Article  MathSciNet  Google Scholar 

  8. Li, X.L., Bai, B., Wu, J.: Transcriptome analysis of early interaction between rice and Magnaporthe oryzae using next-generation sequencing technology. Hereditas 34(1), 102–112 (2012)

    Google Scholar 

  9. Greene, D., Cunningham, P.: A matrix factorization approach for integrating multiple data views. In: Buntine, W., Grobelnik, M., Mladenić, D., Shawe-Taylor, J. (eds.) ECML PKDD 2009. LNCS (LNAI), vol. 5781, pp. 423–438. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-04180-8_45

    Chapter  Google Scholar 

  10. Liping, J., Chao, Z., Ng, M.K.: SNMFCA: supervised NMF-based image classification and annotation. IEEE Trans. Image Process. 21(11), 4508–4521 (2012)

    Article  MathSciNet  Google Scholar 

  11. Zhang, S., Liu, C., Li, W., Shen, H., Peter, W.L., Zhou, X.J.: Discovery of multi-dimensional modules by integrative analysis of cancer genomic data. Nucleic Acids Res. 40(19), 9379–9391 (2012)

    Article  Google Scholar 

  12. Žitnik, M., Zupan, B.: Data fusion by matrix factorization. IEEE Trans. Pattern Anal. Mach. Intell. 36(1), 41–53 (2015)

    Article  Google Scholar 

  13. Stražar, M., Žitnik, M., Zupan, B., Ule, J., Curk, T.: Orthogonal matrix factorization enables integrative analysis of multiple RNA binding proteins. Bioinformatics 32(10), 1527–1535 (2016)

    Article  Google Scholar 

  14. Lee, D.D., Seung, H.S.: Algorithms for non-negative matrix factorization. NIPS 13(6), 556–562 (2001)

    Google Scholar 

  15. Xie, W., Wang, G., Bindel, D.: Fast iterative graph computation with block updates. Proc. VLDB Endow. 6(14), 2014–2025 (2013)

    Article  Google Scholar 

  16. Hein, M., Audibert, J.-Y., von Luxburg, U.: From graphs to manifolds – weak and strong pointwise consistency of graph Laplacians. In: Auer, P., Meir, R. (eds.) COLT 2005. LNCS (LNAI), vol. 3559, pp. 470–485. Springer, Heidelberg (2005). https://doi.org/10.1007/11503415_32

    Chapter  Google Scholar 

  17. Zhu, R., Liu, J.X., Zhang, Y.K., Guo, Y.: A robust manifold graph regularized nonnegative matrix factorization algorithm for cancer gene clustering. Molecules 22(12), 2131–2142 (2017)

    Article  Google Scholar 

  18. Shen, B., Liu, B.D., Wang, Q., Ji, R.: Robust nonnegative matrix factorization via L1 norm regularization by multiplicative updating rules. In: IEEE International Conference on Image Processing, pp. 5282–5286. IEEE, Paris (2014)

    Google Scholar 

  19. Geng, B., Tao, D., Xu, C., Yang, L., Hua, X.S.: Ensemble manifold regularization. IEEE Trans. Pattern Anal. Mach. Intell. 34(6), 1227–1233 (2012)

    Article  Google Scholar 

  20. Yang, S., Zhang, C., Yi, W.: Robust non-negative matrix factorization via joint sparse and graph regularization for transfer learning. Neural Comput. Appl. 23(2), 541–559 (2013)

    Article  Google Scholar 

  21. Nojun, K.: Principal component analysis based on l1-norm maximization. IEEE Trans. Pattern Anal. Mach. Intell. 30(9), 1672–1680 (2008)

    Article  Google Scholar 

  22. Diestel, R.: Graph theory. Math. Gaz. 173(502), 67–128 (2000)

    MATH  Google Scholar 

  23. Tuia, D., Volpi, M., Trolliet, M.: Semisupervised manifold alignment of multimodal remote sensing images. IEEE Trans. Geosci. Remote Sens. 52(12), 7708–7720 (2014)

    Article  Google Scholar 

  24. Bueler, E.L.: The heat kernel weighted Hodge Laplacian on noncompact manifolds. Trans. Am. Math. Soc. 351(2), 683–713 (1999)

    Article  MathSciNet  Google Scholar 

  25. Bonnet, L., Rayez, J.C.: Gaussian weighting in the quasiclassical trajectory method. Chem. Phys. Lett. 397(1), 106–109 (2004)

    Article  Google Scholar 

  26. Gao, L., Alibart, F., Strukov, D.B.: Analog-input analog-weight dot-product operation with Ag/a-Si/Pt memristive devices. In: IEEE/IFIP International Conference on VLSI and System-on-chip, pp. 1–6. IEEE, Santa Cruz (2015)

    Google Scholar 

  27. Cai, D., He, X., Han, J., Huang, T.S.: Graph regularized nonnegative matrix factorization for data representation. IEEE Trans. Pattern Anal. Mach. Intell. 33(8), 1548–1560 (2011)

    Article  Google Scholar 

  28. Danielsson, P.E.: Euclidean distance mapping. Comput. Graph. Image Process. 14(3), 227–248 (1980)

    Article  Google Scholar 

  29. Wu, G.C., Baleanu, D.: Variational iteration method for the Burgers’ flow with fractional derivatives—new Lagrange multipliers. Appl. Math. Model. 37(9), 6183–6190 (2013)

    Article  MathSciNet  Google Scholar 

  30. Facchinei, F., Fischer, A., Kanzow, C., Peng, J.M.: A simply constrained optimization reformulation of KKT systems arising from variational inequalities. Appl. Math. Optim. 40(1), 19–37 (1999)

    Article  MathSciNet  Google Scholar 

  31. Nguyen, T., Duong, T., Phung, D., Venkatesh, S.: Autism blogs: expressed emotion, language styles and concerns in personal and community settings. IEEE Trans. Affect. Comput. 6(3), 312–323 (2015)

    Article  Google Scholar 

  32. Mudelsee, M.: Estimating Pearson’s correlation coefficient with bootstrap confidence interval from serially dependent time series. Math. Geol. 35(6), 651–665 (2003)

    Article  Google Scholar 

  33. Saito, R., et al.: A travel guide to cytoscape plugins. Nat. Methods 9(11), 1069–1076 (2012)

    Article  Google Scholar 

  34. Belvedere, R., et al.: miR-196a is able to restore the aggressive phenotype of annexin A1 knock-out in pancreatic cancer cells by CRISPR/Cas9 genome editing. Int. J. Mol. Sci. 19(7), 1967 (2018)

    Article  Google Scholar 

  35. Alessandro, C., Sophie, T., Steven, Z.: Acetyl-CoA metabolism supports multistep pancreatic tumorigenesis. Cancer Discov. 9(3), 416–435 (2019)

    Article  Google Scholar 

  36. Kasap, E., et al.: Aurora kinase A (AURKA) and never in mitosis gene A-related kinase 6 (NEK6) genes are upregulated in erosive esophagitis and esophageal adenocarcinoma. Exp. Therap. Med. 4(1), 33–42 (2012)

    Article  Google Scholar 

  37. Zhen, L., Yao, Q., Zhao, S., Yin, W., Li, Y., Zhen, W.: Comprehensive analysis of differential co-expression patterns reveal transcriptional dysregulation mechanism and identify novel prognostic lncRNAs in esophageal squamous cell carcinoma. Oncotargets Ther. 10, 3095–3105 (2017)

    Article  Google Scholar 

Download references

Acknowledgments

This work was supported in part by the NSFC under grant Nos. 61872220 and 61572284.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jin-Xing Liu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Hao, YJ., Hou, MX., Zhu, R., Liu, JX. (2020). An Integrated Robust Graph Regularized Non-negative Matrix Factorization for Multi-dimensional Genomic Data Analysis. In: Han, H., Wei, T., Liu, W., Han, F. (eds) Recent Advances in Data Science. IDMB 2019. Communications in Computer and Information Science, vol 1099. Springer, Singapore. https://doi.org/10.1007/978-981-15-8760-3_7

Download citation

  • DOI: https://doi.org/10.1007/978-981-15-8760-3_7

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-15-8759-7

  • Online ISBN: 978-981-15-8760-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics