skip to main content
10.1145/3459637.3482110acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
short-paper

GDFM: Gene Vectors Embodied Deep Attentional Factorization Machines for Interaction prediction

Published: 30 October 2021 Publication History

Abstract

Gene Network Graphs (GNGs) are comprised of biomedical data. Deriving structural information from these graphs remains a prime area of research in the domain of biomedical and health informatics. In this paper, we propose Gene Vectors Embodied Deep Attentional Factorization Machines (GDFMs) for the gene to gene interaction prediction. We first initialize GDFM with vector embeddings learned from gene locality configuration and an expression equivalence criterion that preserves their innate similar traits. GDFM uses an attention-based mechanism that manipulates different positions, to learn the representation of sequence, before calculating the pairwise factorized interactions. We further use hidden layers, batch normalization, and dropout to stabilize the performance of our deep structured architecture. An extensive comparison with several state-of-the-art approaches, using Ecoli and Yeast datasets for gene-gene interaction prediction shows the significance of our proposed framework.

References

[1]
Gregorio Alanis-Lobato, Carlo Vittorio Cannistraci, and Timothy Ravasi. 2013. Exploitation of genetic interaction network topology for the prediction of epistatic behavior. Genomics 102, 4 (2013), 202--208.
[2]
Tanya Barrett, Dennis B Troup, Stephen E Wilhite, Pierre Ledoux, et al. 2010.NCBI GEO: archive for functional genomics data sets-10 years on. Nucleic acids research 39, suppl_1 (2010), D1005--D1010.
[3]
Atul J Butte and Isaac S Kohane. 1999. Mutual information relevance networks: functional genomic clustering using pairwise entropy measurements. In Biocomputing 2000. World Scientific, 418--429.
[4]
Kuan-Hsi Chen, Tsai-Feng Wang, and Yuh-Jyh Hu. 2019. Protein-protein interaction prediction using a hybrid feature representation and a stacked generalization scheme. BMC bioinformatics 20, 1 (2019), 1--17.
[5]
Liang Chen, Yang Liu, Zibin Zheng, and Philip Yu. 2018. Heterogeneous neural attentive factorization machine for rating prediction. In Proceedings of ACM International Conference on Information and Knowledge Management (CIKM). 833--842.
[6]
Tong Chen, Hongzhi Yin, Quoc Viet Hung Nguyen, Wen-Chih Peng, Xue Li, and Xiaofang Zhou. 2020. Sequence-aware factorization machines for temporal predictive analytics. In Proceedings of IEEE International Conference on Data Engineering (ICDE). 1405--1416.
[7]
Wen Dai, Xi Liu, Yibo Gao, Lin Chen, Jianglong Song, Di Chen, Kuo Gao, Yongshi Jiang, Yiping Yang, et al. 2015. Matrix factorization-based prediction of novel drug indications by integrating genomic space. Computational and mathematical methods in medicine (2015).
[8]
Aditya Grover and Jure Leskovec. 2016. node2vec: Scalable feature learning for networks. In Proceedings of ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD). 855--864.
[9]
Huifeng Guo, Ruiming Tang, Yunming Ye, Zhenguo Li, and Xiuqiang He. 2017. DeepFM: a factorization-machine based neural network for CTR prediction. arXiv preprint arXiv:1703.04247 (2017).
[10]
Xiangnan He and Tat-Seng Chua. 2017. Neural factorization machines for sparse predictive analytics. In Proceedings of ACM SIGIR. 355--364.
[11]
Kexin Huang, Cao Xiao, Lucas M Glass, Marinka Zitnik, and Jimeng Sun. 2020. SkipGNN: predicting molecular interactions with skip-graph networks. Scientific reports 10, 1 (2020), 1--16.
[12]
Sergey Ioffe and Christian Szegedy. 2015. Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv preprint arXiv:1502.03167 (2015).
[13]
K. Kc, R. Li, F. Cui, and A. Haake. 2021. Predicting Biomedical Interactions with Higher-Order Graph Convolutional Networks. IEEE/ACM Transactions on Computational Biology and Bioinformatics (2021), 1--1.
[14]
Thomas N Kipf and MaxWelling. 2016. Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016).
[15]
KC Kishan, Rui Li, Feng Cui, Qi Yu, and Anne R Haake. 2019. GNE: a deep learning framework for gene network inference by aggregating biological information. BMC systems biology 13, 2 (2019), 38.
[16]
Ying-Ke Lei, Zhu-Hong You, Zhen Ji, Lin Zhu, and De-Shuang Huang. 2012. Assessing and predicting protein interactions by combining manifold embedding with multiple information integration. (2012).
[17]
Jianxun Lian, Xiaohuan Zhou, Fuzheng Zhang, Zhongxia Chen, et al. 2018. xdeepfm: Combining explicit and implicit feature interactions for recommender systems. In Proceedings of ACM SIGKDD. 1754--1763.
[18]
Han Luo, Zhenfeng Lei, and Hanping Ke. 2019. AWGI: An Auto-weighted Framework Integrating Heterogeneous Networks for Gene Interaction Prediction. In Proceedings of ICBIP. 122--127.
[19]
Han Luo, Donghua Wang, Juan Liu, Ying Ju, and Zhe Jin. 2019. A Framework Integrating Heterogeneous Databases for the Completion of Gene Networks. IEEE Access 7 (2019), 168859--168869.
[20]
Tengfei Ma, Cao Xiao, Jiayu Zhou, and Fei Wang. 2018. Drug similarity integration through attentive multi-view graph auto-encoders. arXiv preprint arXiv:1804.10850 (2018).
[21]
Neel S Madhukar, Olivier Elemento, and Gaurav Pandey. 2015. Prediction of genetic interactions using machine learning and network properties. Frontiers in bioengineering and biotechnology 3 (2015), 172.
[22]
Rajiv Pasricha and Julian McAuley. 2018. Translation-based factorization machines for sequential recommendation. In Proceedings of ACM RecSys. 63--71.
[23]
Steffen Rendle. 2010. Factorization machines. In Proceedings of IEEE International Conference on Data Mining (ICDM). 995--1000.
[24]
Tapesh Santra. 2014. A bayesian framework that integrates heterogeneous data for inferring gene regulatory networks. Frontiers in bioengineering and biotechnology 2 (2014), 13.
[25]
Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. 2014. Dropout: a simple way to prevent neural networks from overfitting. Journal of Machine Learning Research 15, 1 (2014), 1929--1958.
[26]
Chris Stark, Bobby-Joe Breitkreutz, Teresa Reguly, Lorrie Boucher, Ashton Breitkreutz, and Mike Tyers. 2006. BioGRID: a general repository for interaction datasets. Nucleic Acids Research 34 (2006), D535--D539.
[27]
Jian Tang, Meng Qu, Mingzhe Wang, Ming Zhang, Jun Yan, and Qiaozhu Mei. 2015. Line: Large-scale information network embedding. In Proceedings of ACM WWW. 1067--1077.
[28]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. arXiv preprint arXiv:1706.03762 (2017).
[29]
Lei Wang, Hai-Feng Wang, San-Rong Liu, Xin Yan, and Ke-Jian Song. 2019. Predicting protein-protein interactions from matrix-based protein sequence using convolution neural network and feature-selective rotation forest. Scientific reports 9, 1 (2019), 1--12.
[30]
Jun Xiao, Hao Ye, Xiangnan He, Hanwang Zhang, Fei Wu, and Tat-Seng Chua. 2017. Attentional factorization machines: Learning the weight of feature interactions via attention networks. (2017).
[31]
Fang Yang, Kunjie Fan, Dandan Song, and Huakang Lin. 2020. Graph-based prediction of Protein-protein interactions with attributed signed graph embedding. BMC bioinformatics 21, 1 (2020), 1--16.
[32]
Jihong Yang, Zheng Li, Xiaohui Fan, and Yiyu Cheng. 2014. Drug--disease association and drug-repositioning predictions in complex diseases using causal inference--probabilistic matrix factorization. Journal of chemical information and modeling 54, 9 (2014), 2562--2569.
[33]
Zhu-Hong You, Xiao Li, and Keith CC Chan. 2017. An improved sequence based prediction protocol for protein-protein interactions using amino acids substitution matrix and rotation forest ensemble classifiers. Neurocomputing 228(2017), 277--282.
[34]
Xiang Yue, Zhen Wang, Jingong Huang, Srinivasan Parthasarathy, et al. 2020. Graph embedding on biomedical networks: methods, applications and evaluations. Bioinformatics 36, 4 (2020), 1241--1251.
[35]
Wen Zhang, Yanlin Chen, Dingfang Li, and Xiang Yue. 2018. Manifold regularized matrix factorization for drug-drug interaction prediction. Journal of biomedical informatics 88 (2018), 90--97.
[36]
Weinan Zhang, Tianming Du, and JunWang. 2016. Deep learning over multi-field categorical data. In Proceedings of ECIR. Springer, 45--57.
[37]
Wen Zhang, Xiang Yue,Weiran Lin,WenjianWu, Ruoqi Liu, et al. 2018. Predicting drug-disease associations by using similarity constrained matrix factorization. BMC bioinformatics 19, 1 (2018), 1--12.
[38]
Lin Zhu, Zhu-Hong You, and De-Shuang Huang. 2013. Increasing the reliability of protein--protein interaction networks via non-convex semantic embedding. Neurocomputing 121 (2013), 99--107.
[39]
Marinka Zitnik, Monica Agrawal, and Jure Leskovec. 2018. Modeling polypharmacy side effects with graph convolutional networks. Bioinformatics 34, 13 (2018), i457--i466.

Cited By

View all

Index Terms

  1. GDFM: Gene Vectors Embodied Deep Attentional Factorization Machines for Interaction prediction

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      CIKM '21: Proceedings of the 30th ACM International Conference on Information & Knowledge Management
      October 2021
      4966 pages
      ISBN:9781450384469
      DOI:10.1145/3459637
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 30 October 2021

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. deep attentional factorization machines
      2. expression information
      3. gene network graphs
      4. interaction prediction

      Qualifiers

      • Short-paper

      Conference

      CIKM '21
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

      Upcoming Conference

      CIKM '25

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 143
        Total Downloads
      • Downloads (Last 12 months)7
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 20 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media