Abstract
With the generation of “multi-dimensional genomic data”, the multi-platform genomic analysis technology for simultaneous biological samples has been rapidly developed. However, the existing shortcomings of the models for comprehensive analysis of multi-dimensional genomic data are the lack of robustness. We use the integrated matrix factorization model by introducing L2,1-norm to deal with above problem. In this paper, we propose an Integrated Robust Graph Regularization Non-negative Matrix Factorization for multi-dimensional genomic data analysis which named iRGNMF. In the formulation of the objective function, we introduced the local structure to maintain regularization and L2,1-norm to consider the data geometry and model robustness. We also applied this model to three types of data of the same cancer from The Cancer Genome Atlas (TCGA). Experiments shown that iRGNMF has obtained considerable effects in sample clustering, and found suspicious disease genes, which are used to reveal hidden patterns and information of biology in multi-dimensional gene data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Paatero, P., Tapper, U.: Positive matrix factorization: a non-negative factor model with optimal utilization of error estimates of data values. Environmetrics 5(2), 111–126 (2010)
Lee, D.D., Seung, H.S.: Learning the parts of objects by non-negative matrix factorization. Nature 401(6755), 788–791 (1999)
Wang, D., Liu, J.X., Gao, Y.L., Zheng, C.H., Xu, Y.: Characteristic gene selection based on robust graph regularized non-negative matrix factorization. IEEE/ACM Trans. Comput. Biol. Bioinf. 13(99), 1059–1067 (2016)
Zhang, S., Li, Q., Liu, J., Zhou, X.J.: A novel computational framework for simultaneous integration of multiple types of genomic data to identify microRNA-gene regulatory modules. Bioinformatics 27(13), i401–i409 (2011)
Li, Y., Ngom, A.: The non-negative matrix factorization toolbox for biological data mining. Source Code Biol. Med. 8(1), 1–15 (2013)
Grais, E.M., Erdogan, H.: Regularized nonnegative matrix factorization using Gaussian mixture priors for supervised single channel source separation. Comput. Speech Lang. 27(3), 746–762 (2013)
Liu, W., Zheng, N., You, Q.: Nonnegative matrix factorization and its applications in pattern recognition. Chin. Sci. Bull. 51(1), 7–18 (2006)
Li, X.L., Bai, B., Wu, J.: Transcriptome analysis of early interaction between rice and Magnaporthe oryzae using next-generation sequencing technology. Hereditas 34(1), 102–112 (2012)
Greene, D., Cunningham, P.: A matrix factorization approach for integrating multiple data views. In: Buntine, W., Grobelnik, M., Mladenić, D., Shawe-Taylor, J. (eds.) ECML PKDD 2009. LNCS (LNAI), vol. 5781, pp. 423–438. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-04180-8_45
Liping, J., Chao, Z., Ng, M.K.: SNMFCA: supervised NMF-based image classification and annotation. IEEE Trans. Image Process. 21(11), 4508–4521 (2012)
Zhang, S., Liu, C., Li, W., Shen, H., Peter, W.L., Zhou, X.J.: Discovery of multi-dimensional modules by integrative analysis of cancer genomic data. Nucleic Acids Res. 40(19), 9379–9391 (2012)
Žitnik, M., Zupan, B.: Data fusion by matrix factorization. IEEE Trans. Pattern Anal. Mach. Intell. 36(1), 41–53 (2015)
Stražar, M., Žitnik, M., Zupan, B., Ule, J., Curk, T.: Orthogonal matrix factorization enables integrative analysis of multiple RNA binding proteins. Bioinformatics 32(10), 1527–1535 (2016)
Lee, D.D., Seung, H.S.: Algorithms for non-negative matrix factorization. NIPS 13(6), 556–562 (2001)
Xie, W., Wang, G., Bindel, D.: Fast iterative graph computation with block updates. Proc. VLDB Endow. 6(14), 2014–2025 (2013)
Hein, M., Audibert, J.-Y., von Luxburg, U.: From graphs to manifolds – weak and strong pointwise consistency of graph Laplacians. In: Auer, P., Meir, R. (eds.) COLT 2005. LNCS (LNAI), vol. 3559, pp. 470–485. Springer, Heidelberg (2005). https://doi.org/10.1007/11503415_32
Zhu, R., Liu, J.X., Zhang, Y.K., Guo, Y.: A robust manifold graph regularized nonnegative matrix factorization algorithm for cancer gene clustering. Molecules 22(12), 2131–2142 (2017)
Shen, B., Liu, B.D., Wang, Q., Ji, R.: Robust nonnegative matrix factorization via L1 norm regularization by multiplicative updating rules. In: IEEE International Conference on Image Processing, pp. 5282–5286. IEEE, Paris (2014)
Geng, B., Tao, D., Xu, C., Yang, L., Hua, X.S.: Ensemble manifold regularization. IEEE Trans. Pattern Anal. Mach. Intell. 34(6), 1227–1233 (2012)
Yang, S., Zhang, C., Yi, W.: Robust non-negative matrix factorization via joint sparse and graph regularization for transfer learning. Neural Comput. Appl. 23(2), 541–559 (2013)
Nojun, K.: Principal component analysis based on l1-norm maximization. IEEE Trans. Pattern Anal. Mach. Intell. 30(9), 1672–1680 (2008)
Diestel, R.: Graph theory. Math. Gaz. 173(502), 67–128 (2000)
Tuia, D., Volpi, M., Trolliet, M.: Semisupervised manifold alignment of multimodal remote sensing images. IEEE Trans. Geosci. Remote Sens. 52(12), 7708–7720 (2014)
Bueler, E.L.: The heat kernel weighted Hodge Laplacian on noncompact manifolds. Trans. Am. Math. Soc. 351(2), 683–713 (1999)
Bonnet, L., Rayez, J.C.: Gaussian weighting in the quasiclassical trajectory method. Chem. Phys. Lett. 397(1), 106–109 (2004)
Gao, L., Alibart, F., Strukov, D.B.: Analog-input analog-weight dot-product operation with Ag/a-Si/Pt memristive devices. In: IEEE/IFIP International Conference on VLSI and System-on-chip, pp. 1–6. IEEE, Santa Cruz (2015)
Cai, D., He, X., Han, J., Huang, T.S.: Graph regularized nonnegative matrix factorization for data representation. IEEE Trans. Pattern Anal. Mach. Intell. 33(8), 1548–1560 (2011)
Danielsson, P.E.: Euclidean distance mapping. Comput. Graph. Image Process. 14(3), 227–248 (1980)
Wu, G.C., Baleanu, D.: Variational iteration method for the Burgers’ flow with fractional derivatives—new Lagrange multipliers. Appl. Math. Model. 37(9), 6183–6190 (2013)
Facchinei, F., Fischer, A., Kanzow, C., Peng, J.M.: A simply constrained optimization reformulation of KKT systems arising from variational inequalities. Appl. Math. Optim. 40(1), 19–37 (1999)
Nguyen, T., Duong, T., Phung, D., Venkatesh, S.: Autism blogs: expressed emotion, language styles and concerns in personal and community settings. IEEE Trans. Affect. Comput. 6(3), 312–323 (2015)
Mudelsee, M.: Estimating Pearson’s correlation coefficient with bootstrap confidence interval from serially dependent time series. Math. Geol. 35(6), 651–665 (2003)
Saito, R., et al.: A travel guide to cytoscape plugins. Nat. Methods 9(11), 1069–1076 (2012)
Belvedere, R., et al.: miR-196a is able to restore the aggressive phenotype of annexin A1 knock-out in pancreatic cancer cells by CRISPR/Cas9 genome editing. Int. J. Mol. Sci. 19(7), 1967 (2018)
Alessandro, C., Sophie, T., Steven, Z.: Acetyl-CoA metabolism supports multistep pancreatic tumorigenesis. Cancer Discov. 9(3), 416–435 (2019)
Kasap, E., et al.: Aurora kinase A (AURKA) and never in mitosis gene A-related kinase 6 (NEK6) genes are upregulated in erosive esophagitis and esophageal adenocarcinoma. Exp. Therap. Med. 4(1), 33–42 (2012)
Zhen, L., Yao, Q., Zhao, S., Yin, W., Li, Y., Zhen, W.: Comprehensive analysis of differential co-expression patterns reveal transcriptional dysregulation mechanism and identify novel prognostic lncRNAs in esophageal squamous cell carcinoma. Oncotargets Ther. 10, 3095–3105 (2017)
Acknowledgments
This work was supported in part by the NSFC under grant Nos. 61872220 and 61572284.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Hao, YJ., Hou, MX., Zhu, R., Liu, JX. (2020). An Integrated Robust Graph Regularized Non-negative Matrix Factorization for Multi-dimensional Genomic Data Analysis. In: Han, H., Wei, T., Liu, W., Han, F. (eds) Recent Advances in Data Science. IDMB 2019. Communications in Computer and Information Science, vol 1099. Springer, Singapore. https://doi.org/10.1007/978-981-15-8760-3_7
Download citation
DOI: https://doi.org/10.1007/978-981-15-8760-3_7
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-8759-7
Online ISBN: 978-981-15-8760-3
eBook Packages: Computer ScienceComputer Science (R0)