Elsevier

Pattern Recognition

Volume 120, December 2021, 108113
Pattern Recognition

GGAC: Multi-relational image gated GCN with attention convolutional binary neural tree for identifying disease with chest X-rays

https://doi.org/10.1016/j.patcog.2021.108113Get rights and content

Highlights

  • Use Graph Convolutional Network to learn high-dimensional feature.

  • Mining the discriminative features and multi-modal relations of images.

  • Solve the problem of weight distribution in different neighborhoods.

  • Image relations generate more image representations for images.

Abstract

Using medical images for disease identification is an important application in the medical field. Graph Convolutional Network (GCN) is proposed to model multi-relational image and generate more informative image representations. Recently, the relations between medical images are utilized to identify diseases. This paper proposes a Gated GCN with Attention Convolutional Binary Neural Tree (GGAC) for Multi-Relational Image Identifying Disease. GGAC extracts the discriminative features of the image, strengthen the ability to model medical images, understands images representation deeply and then well captures the multi-modal relation between images. Firstly, an Attention Convolutional Binary Neural Tree based on the attention mechanism is designed to extract the fine-grained features of the images, and use the attention conversion operation on the edge of the tree structure to enhance the network’s acquisition of key image features. Secondly, a Gated GCN is proposed to improve GCN performance by solving the problem of the weight distribution of different neighbors in the same-order neighborhood. Thirdly, a GCN propagation rule is used to transfer messages in multi-relational Graph and then solves the message passing problem of high-dimensional feature data in GCN. Finally, we verify GGAC on a multi-relational graph constructed on the Chest X-rays14. It can be seen from the experiment that overfitting and underfitting can be solved to a certain extent through the extraction and inference of the features of the multi-relational graph, and then GGAC has better performance than the state-of-the-art methods, and keeps good in model complexity.

Introduction

The basic tasks of computer vision include image classification, object localization, semantic segmentation, and instance segmentation. Convolutional neural networks and annotated image datasets have made significant progress in learning image representation classification in [1], [2], [3]. However, most of the existing deep learning methods neglect the relationships among images, which can benefit extraction of helpful semantic information from associations among images in reality. Meanwhile, there are one or more potential relationships among chest X-rays which can be regarded as a piece of auxiliary information that is helpful for further judging and analyzing chest X-rays. In clinical environments, diagnosis can be easily made by exploring such intrinsic relationships among chest X-rays.

We modeled Chest X-rays images and the relationships between them as a Multi-Relational Chest X-rays Graph where each node corresponds to a Chest X-ray image and the edges between two nodes represent the multiple relationships. The edges between two Chest X-ray images describe the close relations between the images. The construction of Multi-Relational Chest X-rays Graph and the definition of the relationships are given in Section 4.2. Figure 1(a) shows a Chest X-rays Graph with 5 nodes and 3 types of relationships. Our GGCN in Fig. 1(b) processes this multi-relational graph by extracting the features of the images and obtain the semantic relationships between the images. This process provides strong support for the segmentation, classification and object detection of images shown in Fig. 1(d). It can be seen from the experiment in Section 4 that over fitting and under fitting can be solved to a certain extent through the extraction and inference of the features of the multi-relational graph.

There are multiple relationships between the nodes in a Multi-relational Chest X-rays Graph. For example, there are plenty of edges between the two nodes in the Graph if the patients corresponding to image 1 and image 2 in Chest X-rays Graph have the same age and gender. We use the multiple relationships between Multi-relational Chest X-rays Graph to make the diagnosis of diseases more accurate. At present, some emerging researches on graph convolution [4], [5], [6], [7], [8], [9] modeled graph data which use the characteristics and the structural information of nodes to represent nodes. Although the application of GCN [4] to Chest X-ray Graphs is extremely challenging, many scholars have made attempts and achieve success in this regard. Graph SAGE [5] solved the problem of GCN in inductive learning on large images; Relational GCN [7] solved the multi-relational problem where each edge has a label and direction associated with it. Inspired by the ideas of Graph SAGE and relational GCN, the features of the original images were applied and the relationships between images was used to update the information in the Chest X-rays Graph. In the original GCN, the messages passing between adjacent nodes is obtained through linear transformation, but the nodes in Chest X-rays Graph are graphs with high-dimensional features, so an Encoder was used to transfer in multi-relational Graph. In Encoder, the use of the Attention Convolutional Binary Neural Tree was proposed to extract the fine-grained features of the images, then the attention conversion operation on the edge of the tree structure to strengthen the ability of network to acquire key features of images was used. We further design a Gated GCN Block based on GCN to apply to the GGAC. When modeling the relationships between nodes, Gated GCN solves the problem of weight distribution of different neighbors in the same-order neighborhood, and strengthens the generalization ability of the overall architecture in the graph structure. The addition of Gated GCN Block makes the model focus more on neighbor relations that are beneficial to the results. Finally, we select the Chest X-ray8 data set [10] to verify GGAC due to the rich relationships between these images.

This paper introduces the gating mechanism into GCN to solve the problem of weight distribution in same-order neighborhoods while Dauphin et al. [11] introduced the gating mechanism into CNNs to reduced the gradient dispersion and preserved the ability of nonlinearity. And the gating mechanism into GCN can enhances the ability of the model to capture spatial information relevance, and broadens the application of the gating mechanism in GCN. We designed an Attention Convolutional Binary Neural Tree into GCN to capture the discriminative features in medical images, while Wang et al. [12] focused on the interactions that occurred by introducing two additional sources of information, namely the spatial location of the candidate objects and their discriminative features. As we know, it is the first time that the gating mechanism and Attention Convolutional Binary Neural Tree are merged into GCN. The main contributions of this paper are summarized as follows:

  • (1)

    We propose a GCN propagation rule in multi-relational graphs that uses Encoder to pass messages in multi-relational Graph which solves the problem of messages passing in high-dimensional feature data in GCN.

  • (2)

    We use an Attention Convolutional Binary Neural Tree based on the attention mechanism to extract the fine-grained features of the image in Encoder. For small local differences, this convolutional binary neural tree structure can learn features of discrimination to distinguish different types of lung diseases.

  • (3)

    We adopt the idea of the partial parameter sharing to reduce the number of parameters in the GGAC which can reduce the complexity of the entire model.

  • (4)

    We design the Gated GCN Block to replace the traditional GCN which solves the problem of weight distribution of different neighbors in the same-order neighborhood, and strengthens the model’s ability to capture spatial information correlation.

  • (5)

    We construct a multi-relational Chest X-rays Graph on the Chest X-rays14 dataset [10] and verify our model. GGAC always has better performance than the previous method in disease identification.

Section snippets

Related work

After the release of the Chest X-ray14 data set [10], many scholars have made great achievements in disease recognition [13], [14], [15], [16]. However, none of them considered modeling relationships between images, many researches [17], [18], [19] simply used deep neural networks to extract feature on Chest X-rays and generated independent pixel representations. For the problem of relationship modeling, [20], [21] proposed a method for modeling pixel-level relationships, [22], [23], [24], [25]

Proposed method

This section first introduces the detailed design and propagation method of the GGAC, and then introduces the relevant strategies (Encoder and Gated GCN) in GGAC.

After building the Chest X-rays Graph, an end-to-end framework Multi-Relational Image Gated GCN with Attention Convolutional Binary Neural Tree (GGAC) for Identifying Disease was put forwarded. The GGAC uses the features of the original image and the related images to update the information. As shown in Fig. 1 (b), the GGAC uses the

Dataset and preprocessing

We used GGAC to conduct disease identification tests on ChestX-ray14 dataset [10]. Figure 7 illustrates six selected examples from ChestX-ray14. There are 112,120 frontal-view CXR images in this dataset, the size of all images is 1024 × 1024, and the number of disease categories is 14. In this dataset, 51,708 images contain one or more symptoms, while the other 60,412 images are normal. The ChestX-ray14 dataset is associated with 30,805 patients and gives information about all of these

Conclusion

This paper proposes GGAC for identifying disease with chest x-rays. Different from traditional image classification, a chest x-rays graph was first conducted then the GGCN was used to learn image representations and relationships between images. The encoder proposed in this paper was used to transfer messages in a multi-relational graph to solve the message transfer problem of high-dimensional feature data in GCN. Second, an attention convolutional binary neural tree based on the attention

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgement

This work has been supported by the National Science Foundation of China, Grant No. 61762092, “Dynamic multi-objective requirement optimization based on transfer learning”, and the Open Foundation of the Key Laboratory in Software Engineering of Yunnan Province, China, Grant No. 2020SE303, “Code intelligent recommendation based on automatic neural architecture search under community evolution”, the Major Science and Technology Project of Precious Metal Materials Genome Engineering in Yunnan

Bing Yang received Bachelor’s degree in Software Engineering from Yunnan University in 2018, KunMing, China. He is currently pursuing the Master degree in Software Engineering in Yunnan University, Kunming, China. His current interests include a wide range of topics concerning machine learning, data mining and big data.

References (44)

  • Z.-M. Chen et al.

    Multi-label image recognition with graph convolutional networks

    Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

    (2019)
  • C. Mao, L. Yao, Y. Luo, ImageGCN: multi-relational image graph convolutional networks for disease identification with...
  • X. Wang et al.

    ChestX-Ray8: hospital-scale chest x-ray database and benchmarks on weakly-supervised classification and localization of common thorax diseases

    Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

    (2017)
  • Y.N. Dauphin et al.

    Language modeling with gated convolutional networks

    International Conference on Machine Learning

    (2017)
  • X. Wang et al.

    Symbiotic attention for egocentric action recognition with object-centric alignment

    IEEE Trans. Pattern Anal. Mach. Intell.

    (2020)
  • L. Yao, E. Poblenz, D. Dagunts, B. Covington, D. Bernard, K. Lyman, Learning to diagnose from scratch by exploiting...
  • P. Kumar et al.

    Boosted cascaded convnets for multilabel classification of thoracic diseases in chest radiographs

    International Conference Image Analysis and Recognition

    (2018)
  • C. Mao et al.

    Deep generative classifiers for thoracic disease diagnosis with chest x-ray images

    2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

    (2018)
  • W. Wang et al.

    Medical image classification using deep learning

    Deep Learning in Healthcare

    (2020)
  • M.I.G. Ocaña et al.

    Medical image detection using deep learning

    Deep Learning in Healthcare

    (2020)
  • K.L.-L. Román et al.

    Medical image segmentation using deep learning

    Deep Learning in Healthcare

    (2020)
  • M. Maire et al.

    Affinity CNN: learning pixel-centric pairwise relations for figure/ground embedding

    Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

    (2016)
  • Cited by (6)

    • GraphDPI: Partial label disambiguation by graph representation learning via mutual information maximization

      2023, Pattern Recognition
      Citation Excerpt :

      Recently, GCNs have attracted significant interest and revealed their effectiveness in learning graph representation. Bruna et al. extend convolution operations to neural networks in the graph domain for better supervised node classification or link prediction by mining adjacency information [19]. Further, graph attention networks (GAT) [20,21] adopt the strategy of attention mechanism and assigns different weights based on the importance of different nodes, edges, or models.

    • Multi-Level Graph Neural Network With Sparsity Pooling for Recognizing Parkinson's Disease

      2023, IEEE Transactions on Neural Systems and Rehabilitation Engineering

    Bing Yang received Bachelor’s degree in Software Engineering from Yunnan University in 2018, KunMing, China. He is currently pursuing the Master degree in Software Engineering in Yunnan University, Kunming, China. His current interests include a wide range of topics concerning machine learning, data mining and big data.

    Yan Kang received Ph.D. degree in Computer Software and Theory from Institute of Software Chinese Academy of Sciences, Beijing, China, in 2003. She is a senior software architect, an associate professor of the Software Institute of Yunnan University, an innovative team of Yunnan Province, and a core member of the Key Laboratory of Software Engineering & Data Science of Yunnan Province. She research interests include software engineering, system optimization, big data processing and mining.

    Lan Zhang received Bachelor’s degree in Software Engineering from Maanshan University in 2019, Maanshan, China. She is currently pursuing the Master degree in Software Engineering Technology in Yunnan University, Kunming, China. Her research interests include artificial intelligence and natural language processing.

    Hao Li, Professor, MS in Computer Science from the University of Essex, UK, Ph.D in Computer Science from the University of Huddersfield, UK, Visiting Scholar at California Institute of Technology. Mainly engaged in distributed computing, grid and cloud computing research; familiar with software engineering, and has all been researching enterprise ERP and information construction.

    View full text