GSAML-DTA: An interpretable drug-target binding affinity prediction model based on graph neural networks with self-attention mechanism and mutual information

doi:10.1016/j.compbiomed.2022.106145

Computers in Biology and Medicine

Volume 150, November 2022, 106145

https://doi.org/10.1016/j.compbiomed.2022.106145 Get rights and content

Highlights

•
We develop GSAML-DTA, an interpretable deep learning framework for DTA prediction.
•
GSAML-DTA integrates a self-attention mechanism and graph neural networks (GNNs) to build representations of drugs and target proteins from the structural information.
•
In addition, mutual information is introduced to filter out redundant information and retain relevant information in the combined representations of drugs and targets.
•
Extensive experimental results demonstrate that GSAML-DTA outperforms state-of-the-art methods for DTA prediction on two benchmark datasets.

Abstract

Identifying drug-target affinity (DTA) has great practical importance in the process of designing efficacious drugs for known diseases. Recently, numerous deep learning-based computational methods have been developed to predict drug-target affinity and achieved impressive performance. However, most of them construct the molecule (drug or target) encoder without considering the weights of features of each node (atom or residue). Besides, they generally combine drug and target representations directly, which may contain irrelevant-task information. In this study, we develop GSAML-DTA, an interpretable deep learning framework for DTA prediction. GSAML-DTA integrates a self-attention mechanism and graph neural networks (GNNs) to build representations of drugs and target proteins from the structural information. In addition, mutual information is introduced to filter out redundant information and retain relevant information in the combined representations of drugs and targets. Extensive experimental results demonstrate that GSAML-DTA outperforms state-of-the-art methods for DTA prediction on two benchmark datasets. Furthermore, GSAML-DTA has the interpretation ability to analyze binding atoms and residues, which may be conducive to chemical biology studies from data. Overall, GSAML-DTA can serve as a powerful and interpretable tool suitable for DTA modelling.

Introduction

Developing a new drug generally takes more than ten years and costs billions of dollars, and less than 12% of the drugs are approved to enter the market [1,2]. The accuracy assessment of drug-target interaction is a crucial step in the early stage of drug development and uncovering their side effects [3]. Binding affinity is the strength of drug-target interaction, which is usually expressed in different metrics such as inhibition constant ( $K_{i}$ ), dissociation constant ( $K_{d}$ ), or the half-maximal inhibitory concentration ( ${I C}_{50}$ ) [4]. Although wet lab experiments to identify the drug-target binding affinity remain the most reliable and effective methods, they are time-consuming and resource-intensive. To mitigate this issue, numerous computational methods have been proposed to accelerate the speed of new drug development and reduce the cost [5].

The existing computational methods mainly fall into two categories: structure-based methods and structure-free methods. Structure-based methods mainly exploit three-dimensional (3D) structure information of small molecules and proteins to explore potential binding poses at the atom level and identify binding affinities. Molecular docking is one of the well-established structure-based methods that integrate various potential binding poses and scoring functions to minimize the free energy of the pose within binding sites [6,7]. Although these methods have achieved relatively attractive predictive performance and provided reasonable biological interpretation, their coverage is limiteddue to the high computational complexity of solving such 3D structures and the scarcity of small molecules and proteins with known 3D structures.

An alternative to structure-based methods is structure-free methods, including feature-based methods and deep learning methods, which only rely on sequence information and require fewer computational resources. Feature-based methods mainly explore primary sequence information to model the binding affinity. Concretely, they focus on extracting discriminative biological features of a drug-target pair and sending extracted features into a machine/deep learning model, such as Naïve Bayes (NB), logistic regression (LR), deep neural network (DNN), and other kernel-based methods, for predicting the binding affinity. For example, Lenselink et al. created and benchmarked a standardized dataset. Based on this dataset, they compared DNN with various traditional classifiers (e.g., NB and LR). It was shown that DNN produced the best results [8]. Rifaioglu et al. integrated multiple protein features, including physicochemical properties and sequential, structural, and evolutionary features, into numerous 2D vectors. They then fed the vectors to state-of-the-art pairwise input hybrid deep neural networks to predict the drug-target interactions [[9], [10], [11]].

Although feature-based methods have a high generalization and sequence sensitivity, they are limited by over-relying on expert knowledge-based hand-crafted feature engineering. Deep learning methods, that is, end-to-end differential models can potentially tackle the above limitations. Indeed, they can automatically learn features and invariances of given data and provide a satisfactory generalization despite a large number of parameters. Inspired by their successful application in various research fields [12,13], numerical deep learning methods are proposed for DTA prediction. For example, Öztürk et al. constructed a deep learning model DeepDTA that employed convolutional neural networks (CNNs) to extract high-latent features of drugs and proteins separately and concatenated the two learned features for final prediction through fully connected layers [14]. Moreover, they proposed another DTA model, WideDTA, which integrated different text-based information to better represent the interaction [15]. DeepCDA [16] proposed a bidirectional attention mechanism to encode the binding strength between each protein substructure-composite substructure pair. And then, a combination of CNN and Long Short Term Memory (LSTM) was built to get good representations of proteins and compounds.

Although CNN-based models have shown satisfactory performance in DTA prediction, these models ignore the structural information. They only use sequences (1-dimensional structure) to represent the input molecules, which may miss the critical spatial information to characterize the intrinsic properties of molecules. To solve this problem, graph neural networks (GNNs), which can extract structural features, are widely used in various DTA prediction models [[17], [18], [19], [20], [21], [22]]. For example, DeepGS [23] first proposed a method to learn the interaction between drugs and targets through the local chemical context and topology structure and then extensive experiments on both large and small benchmark datasets demonstrated the competitiveness and superiority of the proposed DeepGS. GraphDTA [19] represented drug features as graphs and adopted some GNNs, like Graph Convolutional Network (GCN), Graph Attention Network (GAT), and Graph Isomorphic Network (GIN), to extract drug features. The results confirm that deep learning models are beneficial for drug-target binding affinity prediction and representing drugs as graphs is beneficial for model performance improvement. Jiang et al. represented compounds as molecular graphs, utilized contact maps to gain protein graphs through protein sequences, and then built GNN networks to obtain feature representation. The experimental results show that representing proteins through contact maps can improve the prediction performance of the model [24].

Above all, most of the existing deep learning methods fail to consider the contribution of each drug atom and protein residue to the binding affinity and ignore the information hidden in different layers, which will lead to partial information loss during the feature learning process and cause poor prediction performance. Moreover, when concatenating the learned features of drugs and proteins directly, it may introduce much task-irrelevant information without further optimization. To overcome the above limitations, here we propose GSAML-DTA, an interpretable deep learning framework for predicting drug-target binding affinity. First, we construct drug graphs and protein graphs from drug SMILES (Simplified Molecular Input Line Entry System) strings and protein contact maps, respectively. Next, a hybrid network GAT-GCN with a self-attention mechanism is designed to extract layer-wise structural information from drug and protein graphs. The extracted layer-wise features of the drug and target are fused separately, and then fused features are concatenated to obtain a combined representation of a drug-target pair. Finally, the mutual information principle is applied to the combined representation, and the output is fed into fully connected layers to predict binding affinity. Through comprehensive evaluation on two benchmark datasets, we demonstrate that GSAML-DTA outperforms state-of-the-art methods. Additionally, our model can be employed to identify the important binding atoms and residues that contribute most to DTA prediction, thus providing biological interpretability.

Section snippets

Datasets

To perform head-to-head comparisons of GSAML-DTA to existing machine/deep learning-based methods, we evaluate our model on two publicly available DTA datasets, Davis dataset [25] and KIBA dataset [26]. The Davis dataset consists of 442 proteins and 68 compounds forming 30056 drug-target pairs, in which the binding affinity is measured by kinase dissociation constant ( $K_{d}$ ) values. The higher value of $K_{d}$ represent lower binding strength of a drug-target pair. These data are selected from the

Performance evaluation metrics

To assess the performance of the proposed GSAML-DTA, we adopt three commonly used statistical metrics: Concordance Index ( $C I$ ) [36], Mean Squared Error ( $M S E$ ), and $r_{m}^{2}$ [37]. $C I$ is mainly employed to assess the difference between the predicted value and the actual value as follows: $C I = \frac{1}{Z} \sum_{d_{x} - d_{y}} h (b_{x} - b_{y}),$ $h (x) = {\begin{array}{c} 1, i f x > 0 \\ 0.5, i f x = 0 \\ 0, i f x < 0 \end{array},$ where $b_{x}$ is the predicted value of the larger affinity $d_{x}$ , $b_{y}$ is the predicted value of the smaller affinity $d_{y}$ , $Z$ is the normalization constant, and $h (x)$ is the step

Conclusion

In this study, we propose a novel deep-learning model, GSAML-DTA, to predict binding affinities of drug-target pairs, which is a crucial step for rapid virtual drug screening and drug development. We first generate graphs of the drug and target, and then employ a self-attention mechanism and a hybrid graph neural network GAT-GCN to extract structural information of them. Subsequently, to learn an informative representation of the drug-target pair, mutual information is applied to the combined

Funding

This study was supported by the Natural Science Foundation of China (No. 62071278).

Declaration of competing interest

There is no competing financial interest to declare.

References (46)

Q. Yang et al.
MMEASE: online meta-analysis of metabolomic data by enhanced metabolite annotation, marker selection and enrichment analysis
J. Proteonomics
(2021)
W. Xia et al.
PFmulDL: a novel strategy enabling multi-class and multi-label protein function annotation by integrating diverse deep learning methods
Comput. Biol. Med.
(2022)
E.E. Bolton et al.
PubChem: Integrated Platform of Small Molecules and Biological Activities, Annual Reports in Computational Chemistry
(2008)
M. Barratt et al.
An expert system rulebase for identifying contact allergens
(1994)
M.M. Shahzad et al.
Stress effects on FosB-and interleukin-8 (IL8)-driven ovarian cancer growth and metastasis
J Biol Chem.
(2010)
D.J. Newman et al.
Natural Products as Sources of New Drugs over the Nearly Four Decades from 01/1981 to 09/2019
Journal of Natural Products
(2020)
T. Takebe et al.
The current status of drug discovery and development as originated in United States academia
the influence of industrial and academic collaboration on drug discovery and development
(2018)
T. Zhao et al.
Identifying drug-target interactions based on graph convolutional network and deep neural network
Briefings in Bioinformatics
(2021)
J. Tang et al.
Making Sense of Large-Scale Kinase Inhibitor Bioactivity Data Sets: A Comparative and Integrative Analysis,
Journal of Chemical Information and Modeling
(2014)
W. Xue et al.
Computational identification of the binding mechanism of a triple reuptake inhibitor amitifadine for the treatment of major depressive disorder
Phys. Chem. Chem. Phys.
(2018)

P.T. Lang et al.

Dock 6

Combining techniques to model RNA–small molecule complexes

(2009)

G.M. Morris et al.

AutoDock4 and AutoDockTools4: Automated Docking with Selective Receptor Flexibility

Journal of Computational Chemistry

(2009)

E.B. Lenselink et al.

Beyond the hype: deep neural networks outperform established methods using a ChEMBL bioactivity benchmark set

Journal of Cheminformatics

(2017)

A.S. Rifaioglu et al.

MDeePred: novel multi-channel protein featurization for deep learning-based binding affinity prediction in

drug discovery

(2021)

J. Hong et al.

Protein functional annotation of simultaneously improved stability, accuracy and false discovery rate achieved by a sequence-based deep learning

Briefings Bioinf.

(2020)

J. Hong et al.

Convolutional neural network-based annotation of bacterial type IV secretion system effectors with enhanced accuracy and reduced false discovery

Briefings Bioinf.

(2020)

H. Öztürk et al.

DeepDTA: deep drug–target binding affinity prediction

(2018)

H. Öztürk et al.

WideDTA: prediction of drug-target binding affinity

arXiv

(2019 Feb 4)

K. Abbasi et al.

DeepCDA: deep cross-domain compound–protein affinity prediction through

LSTM and convolutional neural networks

(2020)

M. Karimi et al.

DeepAffinity: interpretable deep learning of compound–protein affinity through unified recurrent and convolutional neural networks

(2019)

M. Karimi et al.

Explainable Deep Relational Networks for Predicting Compound- Protein Affinities and Contacts

Journal of Chemical Information and Modeling

(2021)

T. Nguyen et al.

Predicting drug–target binding affinity with graph neural networks

(2021)

W. Torng et al.

Graph Convolutional Neural Networks for Predicting Drug-Target Interactions

Journal of Chemical Information and Modeling

(2019)

Cited by (11)

Prediction of drug-target binding affinity based on deep learning models
2024, Computers in Biology and Medicine
The prediction of drug-target binding affinity (DTA) plays an important role in drug discovery. Computerized virtual screening techniques have been used for DTA prediction, greatly reducing the time and economic costs of drug discovery. However, these techniques have not succeeded in reversing the low success rate of new drug development. In recent years, the continuous development of deep learning (DL) technology has brought new opportunities for drug discovery through the DTA prediction. This shift has moved the prediction of DTA from traditional machine learning methods to DL. The DL frameworks used for DTA prediction include convolutional neural networks (CNN), graph convolutional neural networks (GCN), and recurrent neural networks (RNN), and reinforcement learning (RL), among others. This review article summarizes the available literature on DTA prediction using DL models, including DTA quantification metrics and datasets, and DL algorithms used for DTA prediction (including input representation of models, neural network frameworks, valuation indicators, and model interpretability). In addition, the opportunities, challenges, and prospects of the application of DL frameworks for DTA prediction in the field of drug discovery are discussed.
IIFS: An improved incremental feature selection method for protein sequence processing
2023, Computers in Biology and Medicine
Discrete features can be obtained from protein sequences using a feature extraction method. These features are the basis of downstream processing of protein data, but it is necessary to screen and select some important features from them as they generally have data redundancy.
Here, we report IIFS, an improved incremental feature selection method that exploits a new subset search strategy to find the optimal feature set. IIFS combines nonadjacent sorting features to prevent the drawbacks of data explosion and excessive reliance on feature sorting results. The comparative experimental results on 27 feature sorting data show that IIFS can find more accurate and important features compared to existing methods.The IIFS approach also handles data redundancy more efficiently and finds more representative and discriminatory features while ensuring minimal feature dimensionality and good evaluation metrics. Moreover, we wrap this method and deploy it on a web server for access at http://112.124.26.17:8005/.
Drug–target affinity prediction method based on multi-scale information interaction and graph optimization
2023, Computers in Biology and Medicine
Drug–target affinity (DTA) prediction as an emerging and effective method is widely applied to explore the strength of drug–target interactions in drug development research. By predicting these interactions, researchers can assess the potential efficacy and safety of candidate drugs at an early stage, narrowing down the search space for therapeutic targets and accelerating the discovery and development of new drugs. However, existing DTA prediction models mainly use graphical representations of drug molecules, which lack information on interactions between individual substructures, thus affecting prediction accuracy and model interpretability. Therefore, transformer and diffusion on drug graphs in DTA prediction (TDGraphDTA) are introduced to predict drug–target interactions using multi-scale information interaction and graph optimization. An interactive module is integrated into feature extraction of drug and target features at different granularity levels. A diffusion model-based graph optimization module is proposed to improve the representation of molecular graph structures and enhance the interpretability of graph representations while obtaining optimal feature representations. In addition, TDGraphDTA improves the accuracy and reliability of predictions by capturing relationships and contextual information between molecular substructures. The performance of the proposed TDGraphDTA in DTA prediction was verified on three publicly available benchmark datasets (Davis, Metz, and KIBA). Compared with state-of-the-art baseline models, it achieved better results in terms of consistency index, R-squared, etc. Furthermore, compared with some existing methods, the proposed TDGraphDTA is demonstrated to have better structure capturing capabilities by visualizing the feature capturing capabilities of the model using Grad-AAM toxicity labels in the ToxCast dataset. The corresponding source codes are available at https://github.com/Lamouryz/TDGraph.
GPCNDTA: Prediction of drug-target binding affinity through cross-attention networks augmented with graph features and pharmacophores
2023, Computers in Biology and Medicine
Drug-target affinity prediction is a challenging task in drug discovery. The latest computational models have limitations in mining edge information in molecule graphs, accessing to knowledge in pharmacophores, integrating multimodal data of the same biomolecule and realizing effective interactions between two different biomolecules. To solve these problems, we proposed a method called Graph features and Pharmacophores augmented Cross-attention Networks based Drug-Target binding Affinity prediction (GPCNDTA). First, we utilized the GNN module, the linear projection unit and self-attention layer to correspondingly extract features of drugs and proteins. Second, we devised intramolecular and intermolecular cross-attention to respectively fuse and interact features of drugs and proteins. Finally, the linear projection unit was applied to gain final features of drugs and proteins, and the Multi-Layer Perceptron was employed to predict drug-target binding affinity. Three major innovations of GPCNDTA are as follows: (i) developing the residual CensNet and the residual EW-GCN to correspondingly extract features of drug and protein graphs, (ii) regarding pharmacophores as a new type of priors to heighten drug-target affinity prediction performance, and (iii) devising intramolecular and intermolecular cross-attention, in which the intramolecular cross-attention realizes the effective fusion of different modal data related to the same biomolecule, and the intermolecular cross-attention fulfills the information interaction between two different biomolecules in attention space. The test results on five benchmark datasets imply that GPCNDTA achieves the best performance compared with state-of-the-art computational models. Besides, relying on ablation experiments, we proved effectiveness of GNN modules, pharmacophores and two cross-attention strategies in improving the prediction accuracy, stability and reliability of GPCNDA. In case studies, we applied GPCNDTA to predict binding affinities between 3C-like proteinase and 185 drugs, and observed that most binding affinities predicted by GPCNDTA are close to corresponding experimental measurements.
ColdDTA: Utilizing data augmentation and attention-based feature fusion for drug-target binding affinity prediction
2023, Computers in Biology and Medicine
Accurate prediction of drug-target affinity (DTA) plays a crucial role in drug discovery and development. Recently, deep learning methods have shown excellent predictive performance on randomly split public datasets. However, verifications are still required on this splitting method to reflect real-world problems in practical applications. And in a cold-start experimental setup, where drugs or proteins in the test set do not appear in the training set, the performance of deep learning models often significantly decreases. This indicates that improving the generalization ability of the models remains a challenge. To this end, in this study, we propose ColdDTA: using data augmentation and attention-based feature fusion to improve the generalization ability of predicting drug-target binding affinity. Specifically, ColdDTA generates new drug-target pairs by removing subgraphs of drugs. The attention-based feature fusion module is also used to better capture the drug-target interactions. We conduct cold-start experiments on three benchmark datasets, and the consistency index (CI) and mean square error (MSE) results on the Davis and KIBA datasets show that ColdDTA outperforms the five state-of-the-art baseline methods. Meanwhile, the results of area under the receiver operating characteristic (ROC-AUC) on the BindingDB dataset show that ColdDTA also has better performance on the classification task. Furthermore, visualizing the model weights allows for interpretable insights. Overall, ColdDTA can better solve the realistic DTA prediction problem. The code has been available to the public.
Basing on the machine learning model to analyse the coronary calcification score and the coronary flow reserve score to evaluate the degree of coronary artery stenosis
2023, Computers in Biology and Medicine
To obtain the coronary artery calcium score (CACS) for each branch in coronary artery computed tomography angiography (CCTA) examination combined with the flow fraction reserve (FFR) of each branch in the coronary artery detected by CT and apply a machine learning model (ML) to analyse and predict the severity of coronary artery stenosis.
All patients who underwent coronary computed tomography angiography (CCTA) from January 2019 to April 2022 in the HOSPITAL (T.C.M) AFFILIATED TO SOUTHWEST MEDICAL UNIVERSITY) were retrospectively screened, and their sex, age, characteristics of lipid-containing lesions, coronary calcium score (CACS) and CT-FFR values were collected. Five machine learning models, random forest (RF), k-nearest neighbour algorithm (KNN), kernel logistic regression, support vector machine (SVM) and radial basis function neural network (RBFNN), were used as predictive models to evaluate the severity of coronary stenosis.
Among the five machine learning models, the SVM model achieved the best prediction performance, and the prediction accuracy of mild stenosis was up to 90%. Second, age and male sex were important influencing factors of increasing CACS and decreasing CT-FFR. Moreover, the critical CACS value of myocardial ischemia >200.70 was calculated.
Through computer machine learning model analysis, we prove the importance of CACS and FFR in predicting coronary stenosis, especially the prominent vector machine model, which promotes the application of artificial intelligence computer learning methods in the field of medical analysis.

View all citing articles on Scopus

View full text

GSAML-DTA: An interpretable drug-target binding affinity prediction model based on graph neural networks with self-attention mechanism and mutual information

Highlights

Abstract

Introduction

Section snippets

Datasets

Performance evaluation metrics

Conclusion

Funding

Declaration of competing interest

J. Proteonomics

Comput. Biol. Med.

An expert system rulebase for identifying contact allergens

J Biol Chem.

Natural Products as Sources of New Drugs over the Nearly Four Decades from 01/1981 to 09/2019

Journal of Natural Products

The current status of drug discovery and development as originated in United States academia

the influence of industrial and academic collaboration on drug discovery and development

Identifying drug-target interactions based on graph convolutional network and deep neural network

Briefings in Bioinformatics

Making Sense of Large-Scale Kinase Inhibitor Bioactivity Data Sets: A Comparative and Integrative Analysis,

Journal of Chemical Information and Modeling

Computational identification of the binding mechanism of a triple reuptake inhibitor amitifadine for the treatment of major depressive disorder

Phys. Chem. Chem. Phys.

Dock 6

Combining techniques to model RNA–small molecule complexes

AutoDock4 and AutoDockTools4: Automated Docking with Selective Receptor Flexibility

Journal of Computational Chemistry

Beyond the hype: deep neural networks outperform established methods using a ChEMBL bioactivity benchmark set

Journal of Cheminformatics

MDeePred: novel multi-channel protein featurization for deep learning-based binding affinity prediction in

drug discovery

Protein functional annotation of simultaneously improved stability, accuracy and false discovery rate achieved by a sequence-based deep learning

Briefings Bioinf.

Convolutional neural network-based annotation of bacterial type IV secretion system effectors with enhanced accuracy and reduced false discovery

Briefings Bioinf.

DeepDTA: deep drug–target binding affinity prediction

WideDTA: prediction of drug-target binding affinity

arXiv

DeepCDA: deep cross-domain compound–protein affinity prediction through

LSTM and convolutional neural networks

DeepAffinity: interpretable deep learning of compound–protein affinity through unified recurrent and convolutional neural networks

Explainable Deep Relational Networks for Predicting Compound- Protein Affinities and Contacts

Journal of Chemical Information and Modeling

Predicting drug–target binding affinity with graph neural networks

Graph Convolutional Neural Networks for Predicting Drug-Target Interactions

Journal of Chemical Information and Modeling