Abstract
Exploring gene-disease associations is of great significance for early prevention, diagnosis and treatment of diseases. Most existing methods depend on specific type of biological evidence and thus are limited in the application. More importantly, these methods ignore some inherent prior sparsity and structure knowledge which is useful for predicting gene-disease associations. To address these challenges, a novel Enhanced Inductive Matrix Completion (EIMC) model is proposed to predict pathogenic genes by introducing the prior sparsity and structure knowledge into the traditional Inductive Matrix Completion (IMC). Specifically, the EIMC model not only employs the sparse regularization to preserve the prior sparsity of gene-disease associations, but also employs the manifold regularization to capture the prior structure information of data distribution. To the best of our knowledge, the proposed EIMC is the first model to simultaneously incorporate both prior sparse and manifold regularizations into the same objective function. Additionally, note that our proposed EIMC model also integrates the features of genes and diseases extracted from various types of biological data, and can predict new genes and diseases by using an inductive learning strategy. Finally, the extensive experimental results demonstrate that our proposed model outperforms other state-of-the-art methods.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Hamosh, A., Scott, A.F., Amberger, J.S., Bocchini, C.A., McKusick, V.A.: Online mendelian inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Res. 33(1), D514–D517 (2005)
Becker, K.G., Barnes, K.C., Bright, T.J., Wang, S.A.: The genetic association database. Nat. Genet. 36(5), 431–432 (2004)
Zhao, J., Yang, T.H., et al.: Ranking candidate disease genes from gene expression and protein interaction: a Katz-centrality based approach. PLoS ONE 6(9), e24306 (2011)
Wu, X., Jiang, R., Zhang, M.Q., Li, S.: Network-based global inference of human disease genes. Mol. Syst. Biol. 4(1), 189 (2008)
Li, Y., Patra, J.C.: Genome-wide inferring gene-phenotype relationship by walking on the heterogeneous network. Bioinformatics 26(9), 1219–1224 (2010)
Singh-Blom, U.M., Natarajan, N., Tewari, A., Woods, J.D., Dhillon, I.S., Marcotte, E.M.: Prediction and validation of gene-disease associations using methods inspired by social network analyses. PLoS ONE 8(5), e5897 (2013)
Natarajan, N., Dhillon, I.S.: Inductive matrix completion for predicting gene–disease associations. Bioinformatics 30(12), 60–68 (2014)
Jain, P., Dhillon, I.S.: Provable inductive matrix completion. arXiv:1306.0626 (2013)
Chen, L., Yang, G., et al.: Correlation consistency constrained matrix completion for web service tag refinement. Neural Comput. Appl. 26(1), 101–110 (2015)
Boyd, S., Parikh, N., Chu, E., Peleato, B., Eckstein, J.: Distributed optimization and statistical learning via the alternating direction method of multipliers. Found. Trends Mach. Learn. 3(1), 1–122 (2010)
Combettes, P.L., Wajs, V.R.: Signal recovery by proximal forward-backward splitting. SIAM J. Multiscale Model. & Simul. 4(4), 1168–1200 (2005)
Cai, J.F., Candès, E.J., Shen, Z.: A singular value thresholding algorithm for matrix completion. SIAM J. Optim. 20(4), 1956–1982 (2010)
He, B.S., Yuan, X.M.: On the O(1/n) convergence rate of Douglas-Rachford alternating direction method. SIAM J. Numer. Anal. 50(2), 700–709 (2012)
Acknowledgements
This work is supported by the National Natural Science Foundation of China (grant number 61572263), the Natural Science Foundation of Jiangsu Province (grant number BK20161516), the Postdoctoral Science Foundation of China (grant number 2015M581794), the Postdoctoral Science Foundation of Jiangsu Province (grant number 1501023C).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Chen, L., Pu, J., Yang, Z., Chen, X. (2018). Prior Knowledge Guided Gene-Disease Associations Prediction: An Enhanced Inductive Matrix Completion Approach. In: Geng, X., Kang, BH. (eds) PRICAI 2018: Trends in Artificial Intelligence. PRICAI 2018. Lecture Notes in Computer Science(), vol 11013. Springer, Cham. https://doi.org/10.1007/978-3-319-97310-4_30
Download citation
DOI: https://doi.org/10.1007/978-3-319-97310-4_30
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-97309-8
Online ISBN: 978-3-319-97310-4
eBook Packages: Computer ScienceComputer Science (R0)