GGAC: Multi-relational image gated GCN with attention convolutional binary neural tree for identifying disease with chest X-rays

doi:10.1016/j.patcog.2021.108113

Pattern Recognition

Volume 120, December 2021, 108113

https://doi.org/10.1016/j.patcog.2021.108113

Highlights

• Use a Graph Convolutional Network to learn high-dimensional features.
• Mine the discriminative features and multi-modal relations of images.
• Solve the problem of weight distribution in different neighborhoods.
• Relations between images yield more informative image representations.

Abstract

Using medical images for disease identification is an important application in the medical field. Graph Convolutional Networks (GCNs) have been proposed to model multi-relational images and generate more informative image representations, and the relations between medical images have recently been used to identify diseases. This paper proposes a Gated GCN with Attention Convolutional Binary Neural Tree (GGAC) for multi-relational image disease identification. GGAC extracts the discriminative features of each image, strengthens the ability to model medical images, learns deeper image representations, and captures the multi-modal relations between images. First, an Attention Convolutional Binary Neural Tree based on the attention mechanism is designed to extract fine-grained image features; an attention conversion operation on the edges of the tree structure enhances the network’s acquisition of key image features. Second, a Gated GCN is proposed to improve GCN performance by solving the problem of weight distribution among different neighbors in the same-order neighborhood. Third, a GCN propagation rule is used to pass messages in the multi-relational graph, which solves the message passing problem for high-dimensional feature data in GCN. Finally, we verify GGAC on a multi-relational graph constructed from the ChestX-ray14 dataset. The experiments show that overfitting and underfitting can be mitigated to a certain extent through the extraction and inference of the features of the multi-relational graph, and that GGAC outperforms state-of-the-art methods while maintaining reasonable model complexity.

Introduction

The basic tasks of computer vision include image classification, object localization, semantic segmentation, and instance segmentation. Convolutional neural networks and annotated image datasets have enabled significant progress in learning image representations for classification [1], [2], [3]. However, most existing deep learning methods neglect the relationships among images, even though these relationships can help extract useful semantic information. Chest X-rays in particular share one or more potential relationships, which can be regarded as auxiliary information that helps in further judging and analyzing them. In clinical environments, diagnosis can be made more easily by exploring such intrinsic relationships among chest X-rays.

We model chest X-ray images and the relationships between them as a Multi-Relational Chest X-rays Graph, in which each node corresponds to a chest X-ray image and the edges between two nodes represent the multiple relationships between them. An edge between two chest X-ray images indicates a close relation between the images. The construction of the Multi-Relational Chest X-rays Graph and the definition of the relationships are given in Section 4.2. Fig. 1(a) shows a Chest X-rays Graph with 5 nodes and 3 types of relationships. Our GGCN in Fig. 1(b) processes this multi-relational graph by extracting the features of the images and obtaining the semantic relationships between them. This process provides strong support for the segmentation, classification and object detection of images shown in Fig. 1(d). The experiments in Section 4 show that overfitting and underfitting can be mitigated to a certain extent through the extraction and inference of the features of the multi-relational graph.

There are multiple relationships between the nodes in a Multi-Relational Chest X-rays Graph. For example, two nodes are connected by several edges if the patients corresponding to image 1 and image 2 have the same age and gender. We use these multiple relationships to make the diagnosis of diseases more accurate. Recent research on graph convolution [4], [5], [6], [7], [8], [9] models graph data by using the features and structural information of nodes to represent them. Although applying GCN [4] to Chest X-rays Graphs is extremely challenging, many scholars have made successful attempts in this regard. GraphSAGE [5] solved the problem of inductive learning with GCN on large graphs; Relational GCN [7] solved the multi-relational problem in which each edge has a label and a direction associated with it. Inspired by GraphSAGE and Relational GCN, we use the features of the original images and the relationships between images to update the information in the Chest X-rays Graph. In the original GCN, the messages passed between adjacent nodes are obtained through a linear transformation, but the nodes in the Chest X-rays Graph are images with high-dimensional features, so we use an Encoder to pass messages in the multi-relational graph. In the Encoder, we propose an Attention Convolutional Binary Neural Tree to extract the fine-grained features of the images, and we use an attention conversion operation on the edges of the tree structure to strengthen the network’s ability to acquire key image features. We further design a Gated GCN Block based on GCN and apply it in GGAC. When modeling the relationships between nodes, the Gated GCN solves the problem of weight distribution among different neighbors in the same-order neighborhood and strengthens the generalization ability of the overall architecture on graph structures. Adding the Gated GCN Block makes the model focus more on the neighbor relations that benefit the results. Finally, we select the ChestX-ray14 dataset [10] to verify GGAC because of the rich relationships between its images.
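To make the graph structure concrete, the following is a minimal sketch of how a multi-relational image graph could be assembled from patient metadata. It is an illustration only: the actual relation definitions are those of Section 4.2, which is not reproduced here, and the record fields (`image`, `patient`, `age`, `gender`), the relation names and the helper functions are all hypothetical.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical metadata records; field names and values are illustrative only.
records = [
    {"image": "img_1", "patient": "p1", "age": 58, "gender": "M"},
    {"image": "img_2", "patient": "p2", "age": 58, "gender": "M"},
    {"image": "img_3", "patient": "p1", "age": 59, "gender": "M"},
    {"image": "img_4", "patient": "p3", "age": 41, "gender": "F"},
]

# Assumed relation types: one edge type per shared metadata attribute.
RELATIONS = ("patient", "age", "gender")


def build_multi_relational_graph(records, relations=RELATIONS):
    """Return {relation: set of undirected edges (i, j) with i < j}.

    Nodes are indices into `records`; two nodes share an edge of type
    `relation` when their records have the same value for that attribute.
    """
    edges = {rel: set() for rel in relations}
    for rel in relations:
        groups = defaultdict(list)
        for idx, rec in enumerate(records):
            groups[rec[rel]].append(idx)
        for members in groups.values():
            # Indices are appended in increasing order, so i < j holds.
            edges[rel].update(combinations(members, 2))
    return edges


def relations_between(graph, i, j):
    """List the relation types that connect nodes i and j."""
    key = (min(i, j), max(i, j))
    return sorted(rel for rel, pairs in graph.items() if key in pairs)


graph = build_multi_relational_graph(records)
print(relations_between(graph, 0, 1))  # image 1 and image 2 share age and gender
```

In this toy example the first two images are linked by two edges of different types, which mirrors the same-age, same-gender case described above.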

This paper introduces the gating mechanism into GCN to solve the problem of weight distribution in same-order neighborhoods, whereas Dauphin et al. [11] introduced the gating mechanism into CNNs to reduce gradient dispersion while preserving nonlinearity. Introducing the gating mechanism into GCN enhances the model’s ability to capture the relevance of spatial information and broadens the application of gating in GCN. We integrate an Attention Convolutional Binary Neural Tree into GCN to capture the discriminative features in medical images, whereas Wang et al. [12] focused on the interactions that arise from introducing two additional sources of information, namely the spatial location of candidate objects and their discriminative features. To our knowledge, this is the first time that the gating mechanism and an Attention Convolutional Binary Neural Tree have been merged into GCN. The main contributions of this paper are summarized as follows:

(1) We propose a GCN propagation rule for multi-relational graphs that uses an Encoder to pass messages, which solves the problem of message passing for high-dimensional feature data in GCN.
(2) We use an Attention Convolutional Binary Neural Tree based on the attention mechanism to extract the fine-grained features of the image in the Encoder. For small local differences, this convolutional binary neural tree structure can learn discriminative features that distinguish different types of lung diseases.
(3) We adopt partial parameter sharing to reduce the number of parameters in GGAC, which reduces the complexity of the entire model.
(4) We design the Gated GCN Block to replace the traditional GCN; it solves the problem of weight distribution among different neighbors in the same-order neighborhood and strengthens the model’s ability to capture spatial information correlation.
(5) We construct a multi-relational Chest X-rays Graph on the ChestX-ray14 dataset [10] and verify our model. In our experiments, GGAC consistently outperforms previous methods in disease identification.
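As a rough illustration of contributions (1) and (4), the sketch below combines a relation-wise propagation step with an edge gate, so that neighbors in the same-order neighborhood receive different weights. This excerpt does not give the paper's equations, so the code should not be read as the GGAC formulation: the gate form σ(A h_i + B h_j), the mean aggregation per relation, and all names (`gated_relational_layer`, `params`, the relation keys) are assumptions chosen for illustration. The node features `h` stand in for Encoder outputs, and the partial parameter sharing of contribution (3) is not modeled.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def gated_relational_layer(h, adjacency, params):
    """One gated, relation-aware propagation step (illustrative sketch).

    h:         (n, d_in) node features, e.g. one encoded vector per image.
    adjacency: {relation: (n, n) 0/1 array}; adjacency[r][i, j] = 1 means
               node j is a neighbor of node i under relation r.
    params:    {"self": W0, relation: {"W": ..., "A": ..., "B": ...}},
               every matrix of shape (d_in, d_out).
    """
    out = h @ params["self"]  # self-connection term
    for rel, adj in adjacency.items():
        p = params[rel]
        msg = h @ p["W"]  # (n, d_out): message sent by each node
        # Edge gate: gate[i, j, :] = sigmoid(A h_i + B h_j), one value per feature.
        gate = sigmoid((h @ p["A"])[:, None, :] + (h @ p["B"])[None, :, :])
        weighted = adj[:, :, None] * gate * msg[None, :, :]  # (n, n, d_out)
        degree = np.maximum(adj.sum(axis=1, keepdims=True), 1.0)
        out += weighted.sum(axis=1) / degree  # gated mean over neighbors
    return np.maximum(out, 0.0)  # ReLU


# Toy example: 4 nodes, two assumed relation types, node 3 has no neighbors.
rng = np.random.default_rng(0)
n, d_in, d_out = 4, 8, 3
h = rng.normal(size=(n, d_in))
adjacency = {
    "age": np.array([[0, 1, 0, 0],
                     [1, 0, 0, 0],
                     [0, 0, 0, 0],
                     [0, 0, 0, 0]], dtype=float),
    "gender": np.array([[0, 1, 1, 0],
                        [1, 0, 1, 0],
                        [1, 1, 0, 0],
                        [0, 0, 0, 0]], dtype=float),
}
params = {"self": rng.normal(size=(d_in, d_out))}
for rel in adjacency:
    params[rel] = {k: rng.normal(size=(d_in, d_out)) for k in ("W", "A", "B")}

h_next = gated_relational_layer(h, adjacency, params)
print(h_next.shape)
```

Because the gate depends on both endpoints of an edge, two neighbors of the same node under the same relation can contribute with different strengths, whereas a plain mean aggregation would weight them equally.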

Section snippets

Related work

After the release of the ChestX-ray14 dataset [10], many scholars have made great achievements in disease recognition [13], [14], [15], [16]. However, none of them considered modeling the relationships between images; many studies [17], [18], [19] simply used deep neural networks to extract features from chest X-rays and generated independent pixel representations. For the problem of relationship modeling, [20], [21] proposed a method for modeling pixel-level relationships, [22], [23], [24], [25]

Proposed method

This section first introduces the detailed design and propagation method of the GGAC, and then introduces the relevant strategies (Encoder and Gated GCN) in GGAC.

After building the Chest X-rays Graph, we put forward an end-to-end framework, the Multi-Relational Image Gated GCN with Attention Convolutional Binary Neural Tree (GGAC) for identifying disease. GGAC uses the features of the original image and of the related images to update the information. As shown in Fig. 1(b), the GGAC uses the

Dataset and preprocessing

We used GGAC to conduct disease identification tests on the ChestX-ray14 dataset [10]. Fig. 7 illustrates six selected examples from ChestX-ray14. The dataset contains 112,120 frontal-view CXR images, each of size 1024 $\times$ 1024, covering 14 disease categories. In this dataset, 51,708 images contain one or more symptoms, while the other 60,412 images are normal. The ChestX-ray14 dataset is associated with 30,805 patients and gives information about all of these

Conclusion

This paper proposes GGAC for identifying disease with chest X-rays. Unlike traditional image classification, a chest X-rays graph is first constructed, and the GGCN is then used to learn image representations and the relationships between images. The encoder proposed in this paper is used to pass messages in a multi-relational graph, solving the message passing problem for high-dimensional feature data in GCN. Second, an attention convolutional binary neural tree based on the attention

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgement

This work has been supported by the National Science Foundation of China, Grant No. 61762092, “Dynamic multi-objective requirement optimization based on transfer learning”, and the Open Foundation of the Key Laboratory in Software Engineering of Yunnan Province, China, Grant No. 2020SE303, “Code intelligent recommendation based on automatic neural architecture search under community evolution”, the Major Science and Technology Project of Precious Metal Materials Genome Engineering in Yunnan

Bing Yang received the Bachelor’s degree in Software Engineering from Yunnan University, Kunming, China, in 2018. He is currently pursuing the Master’s degree in Software Engineering at Yunnan University, Kunming, China. His current interests include a wide range of topics concerning machine learning, data mining and big data.

References (44)

Q. Guan et al., Multi-label chest x-ray image classification via category-wise residual attention learning, Pattern Recognit. Lett. (2020)
Y. Xie et al., Graph convolutional networks with multi-level coarsening for graph classification, Knowl. Based Syst. (2020)
E.-Y. Yu et al., Identifying critical nodes in complex networks via graph convolutional networks, Knowl. Based Syst. (2020)
A. Krizhevsky et al., ImageNet classification with deep convolutional neural networks, Advances in Neural Information Processing Systems (2012)
K. He et al., Deep residual learning for image recognition, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2016)
G. Huang et al., Densely connected convolutional networks, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2017)
T.N. Kipf, M. Welling, Semi-supervised classification with graph convolutional networks,...
W. Hamilton et al., Inductive representation learning on large graphs, Advances in Neural Information Processing Systems (2017)
J. Chen, T. Ma, C. Xiao, FastGCN: fast learning with graph convolutional networks via importance sampling,...
M. Schlichtkrull et al., Modeling relational data with graph convolutional networks, European Semantic Web Conference (2018)
Z.-M. Chen et al., Multi-label image recognition with graph convolutional networks, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2019)
C. Mao, L. Yao, Y. Luo, ImageGCN: multi-relational image graph convolutional networks for disease identification with...
X. Wang et al., ChestX-Ray8: hospital-scale chest x-ray database and benchmarks on weakly-supervised classification and localization of common thorax diseases, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2017)
Y.N. Dauphin et al., Language modeling with gated convolutional networks, International Conference on Machine Learning (2017)
X. Wang et al., Symbiotic attention for egocentric action recognition with object-centric alignment, IEEE Trans. Pattern Anal. Mach. Intell. (2020)
L. Yao, E. Poblenz, D. Dagunts, B. Covington, D. Bernard, K. Lyman, Learning to diagnose from scratch by exploiting...
P. Kumar et al., Boosted cascaded convnets for multilabel classification of thoracic diseases in chest radiographs, International Conference Image Analysis and Recognition (2018)
C. Mao et al., Deep generative classifiers for thoracic disease diagnosis with chest x-ray images, 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) (2018)
W. Wang et al., Medical image classification using deep learning, Deep Learning in Healthcare (2020)
M.I.G. Ocaña et al., Medical image detection using deep learning, Deep Learning in Healthcare (2020)
K.L.-L. Román et al., Medical image segmentation using deep learning, Deep Learning in Healthcare (2020)
M. Maire et al., Affinity CNN: learning pixel-centric pairwise relations for figure/ground embedding, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2016)

Yan Kang received the Ph.D. degree in Computer Software and Theory from the Institute of Software, Chinese Academy of Sciences, Beijing, China, in 2003. She is a senior software architect, an associate professor at the Software Institute of Yunnan University, a member of an innovative team of Yunnan Province, and a core member of the Key Laboratory of Software Engineering & Data Science of Yunnan Province. Her research interests include software engineering, system optimization, big data processing and mining.

Lan Zhang received the Bachelor’s degree in Software Engineering from Maanshan University, Maanshan, China, in 2019. She is currently pursuing the Master’s degree in Software Engineering Technology at Yunnan University, Kunming, China. Her research interests include artificial intelligence and natural language processing.

Hao Li is a Professor. He received the MS in Computer Science from the University of Essex, UK, and the Ph.D. in Computer Science from the University of Huddersfield, UK, and was a Visiting Scholar at the California Institute of Technology. He is mainly engaged in research on distributed computing, grid computing and cloud computing; he is familiar with software engineering and has long been researching enterprise ERP and information construction.
