Abstract
Convolutional neural networks (CNNs) are vulnerable to adversarial noises, which may lead to disastrous consequences in safety- or security-sensitive systems. This paper proposes a novel mechanism to improve the robustness of medical image classification systems by bringing denoising ability to CNN classifiers through a naturally embedded auto-encoder and high-level feature invariance to general noises. The proposed denoising mechanism can be adapted to many model architectures and can therefore be easily combined with existing models and denoising mechanisms to further improve the robustness of CNN classifiers. The effectiveness of the proposed method has been confirmed by comprehensive evaluations on two medical image classification tasks.
F.-F. Xue and J. Peng: The authors contributed equally to this paper.
1 Introduction
Convolutional neural networks (CNNs) have been widely used in medical image analysis, such as automatic segmentation of tumor regions in MRI [13] and intelligent diagnosis of skin cancers [4]. However, the applicability of a medical image analysis system would be limited if it were sensitive to various noises and varying environments. One way to critically evaluate the robustness of a medical image system is through adversarial attacks [14]. Specifically, clean images can be altered with imperceptible perturbations (called adversarial noises) to generate adversarial examples, and such adversarial examples can fool CNN classifiers into making incorrect predictions with high confidence. Recent studies on natural images clearly demonstrate that CNN classifiers can be easily attacked and fail completely [8]. Adversarial attacks have also been performed on medical images [12], confirming the high sensitivity of CNN diagnosis systems to adversarial noises. Therefore, there is a strong demand for improving the robustness of intelligent diagnosis systems.
System robustness can be improved by equipping the system with the ability to defend against adversarial attacks. Multiple defense approaches have been proposed for this purpose. For example, adversarial training and its variants improve a system's defense ability simply by adding one or more types of adversarial examples into the training data during classifier training [5, 8, 15], while denoising approaches pre-process images, often with certain types of auto-encoders, aiming to remove potential adversarial noises before feeding images to the classifier [1, 10]. Adversarial training requires embedding the adversarial attacking process into classifier training [5, 8, 15], and denoising approaches often suffer from reduced accuracy in classifying clean images [1, 10]. Another approach is to train a distillation network, which can improve defense ability by effectively enlarging the gaps between class distributions in the high-level semantic feature space [11].
While the attack and defense of deep neural networks have been actively investigated on natural images in the past several years, little work has investigated the robustness of medical image analysis and the associated defense ability. This paper proposes a novel defense strategy to improve the robustness of intelligent diagnosis systems. Different from existing approaches, the proposed method directly improves the network classifier's denoising ability with a naturally embedded auto-encoder and a semantic feature invariance strategy for general noises. This novel denoising mechanism can be adapted to many classifier architectures and is independent of any image pre-processing procedure. Therefore, it can be easily combined with existing models and denoising mechanisms to further improve the robustness of network classifiers. Experiments on a skin image dataset and a chest X-ray dataset demonstrate that integrating the proposed denoising mechanism into existing CNN classifiers consistently and significantly improves their robustness, no matter whether the classifiers already employ other defense methods.
2 Methods
In adversarial attacks, image pixel values are manipulated with small and carefully-crafted perturbations, such that the originally imperceptible adversarial noise can be progressively amplified over the layers of a deep neural network, leading to incorrect classifications. Consider an image as a point in the original high-dimensional image space, and the re-ordered output of the last convolutional layer in a CNN classifier as a point in a low-dimensional high-level semantic feature space. For an original (clean) image which can be correctly classified by the neural network classifier, the corresponding adversarial example lies within a small hypersphere centered at the clean image in the image space, while the two images are relatively far from each other in the high-level semantic feature space due to the mis-classification of the adversarial example. To defend against attacks from adversarial examples, one intuitive idea is to ensure that the convolutional layers in the classifier transform all neighboring points around each clean image to the same point in the semantic feature space as that of the clean image. The popular adversarial training, which adds adversarial examples to the training set, can be considered a simplified implementation of this idea. Another idea is to remove (both general and adversarial) noise by projecting images to, and then reconstructing them from, a lower-dimensional space, supposing that clean images lie on a low-dimensional manifold while noisy images do not. With this idea, auto-encoders have been applied to process images, often before they are fed into the classifier. By combining the two ideas, but without using adversarial examples or a pre-processing procedure, we propose a novel plug-and-play mechanism to defend against adversarial attacks, thus improving system robustness.
2.1 Transforming Neighboring Noisy Images to the Same Point
Because each adversarial example falls within a small neighborhood of the corresponding clean image in the image space, a classifier trained additionally with all possible noisy images within the neighborhood of each training image should become more robust to adversarial attacks, in the sense that all noisy images around a clean image would be projected to the same point in the semantic feature space. In practice, however, it is infeasible to collect all such noisy images. Here, by projecting a small subset of general noisy images within the neighborhood of each clean image to the same point as that of the clean image in the semantic feature space, we expect that adversarial examples within the neighborhood would also be more likely to be projected to that point. In this case, the adversarial examples would be more likely to be recognized as the same class as the clean image, thus improving the robustness of the classifier.
Formally, for the i-th clean image \(\varvec{x}_i\) in the original training set, let \(\varvec{x}'_i\) denote a noisy image generated by adding uniform random noise drawn from \([-\sigma, \sigma]\) to each pixel of the clean image \(\varvec{x}_i\), and let \(\varvec{f}(\varvec{x}_i)\) and \(\varvec{f}(\varvec{x}'_i)\) be the corresponding semantic feature vectors given by the output of the final convolutional layer in the classifier. Then the objective of transforming neighboring noisy images to the same point in the semantic feature space can be formulated as an optimization problem, i.e., training the classifier (see Fig. 1) such that the following loss function \(L_n\) (called the neighbor loss) is minimized:
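A plausible form, assuming an average squared Euclidean distance over the \(N\) training images (the exact distance measure used is an assumption here), is

\(L_n = \frac{1}{N}\sum_{i=1}^{N}\big\|\varvec{f}(\varvec{x}_i) - \varvec{f}(\varvec{x}'_i)\big\|_2^2.\)    (1)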
Note that \(\varvec{x}'_i\) can be randomly generated over training iterations such that multiple different noisy images are used for each clean image. Using random noise rather than adversarial noise during model training is one key difference between our approach and existing ones (e.g., [9]).
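As an illustration, the following is a minimal PyTorch sketch (not the authors' implementation) of how such a noisy neighbor \(\varvec{x}'_i\) could be generated; the assumption that pixel intensities are scaled to \([0, 1]\) is ours.

```python
# A minimal sketch (not the authors' code) of generating a noisy neighbor x'
# by adding i.i.d. uniform noise in [-sigma, sigma] to each pixel of a clean
# image x; the [0, 1] pixel range is an assumption.
import torch

def uniform_neighbor(x: torch.Tensor, sigma: float) -> torch.Tensor:
    noise = torch.empty_like(x).uniform_(-sigma, sigma)  # per-pixel uniform noise
    return torch.clamp(x + noise, 0.0, 1.0)              # stay in the valid pixel range
```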
2.2 Embedded Auto-Encoder
Existing denoising approaches often employ a separate auto-encoder to remove potential adversarial noise from images before sending them to the classifier. However, fine details in normal regions of the images can also be altered by the auto-encoder. Such changes in normal regions effectively introduce new noise relative to the original clean images, and this new noise may be progressively amplified over the layers of the classifier. Just like adversarial noise, it may also lead to mis-classification of the images, which has indeed been observed in related studies [10] and in our experiments.
To avoid the downgraded classification performance on clean images while still exploiting the denoising ability of the auto-encoder, we propose to embed the auto-encoder into the network classifier, where the encoder part shares the low-to-middle convolutional layers of the classifier (Fig. 2, left). Because the auto-encoder denoises images mainly by projecting them to a lower-dimensional space via the encoder part, sharing the encoder with the CNN classifier naturally transfers this denoising competence to the classifier. Meanwhile, the classifier still takes the original images rather than reconstructions from the auto-encoder as input. This is clearly different from the existing approach which uses a separate auto-encoder before the CNN classifier [9] (Fig. 2, right).
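As an illustration of this design, the following is a simplified PyTorch sketch of a classifier with an embedded auto-encoder; the specific layer shapes are placeholders of our own choosing rather than the actual ResNet18-based architecture used in the experiments.

```python
# A simplified sketch (our illustration, not the paper's exact architecture) of a
# classifier with an embedded auto-encoder: the encoder re-uses the classifier's
# low-to-middle convolutional layers, a decoder branch reconstructs the input for
# the reconstruction loss, and the classification head continues from the shared
# features. The classifier itself always receives the original image.
import torch
import torch.nn as nn

class EmbeddedAEClassifier(nn.Module):
    def __init__(self, num_classes: int = 4):
        super().__init__()
        # shared encoder = low-to-middle layers of the classifier
        self.encoder = nn.Sequential(
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(inplace=True),
        )
        # decoder branch of the embedded auto-encoder (used only for L_a)
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.ConvTranspose2d(32, 3, 4, stride=2, padding=1), nn.Sigmoid(),
        )
        # remaining convolutional layers and classification head
        self.head_conv = nn.Sequential(
            nn.Conv2d(64, 128, 3, stride=2, padding=1), nn.ReLU(inplace=True),
        )
        self.pool = nn.AdaptiveAvgPool2d(1)
        self.fc = nn.Linear(128, num_classes)

    def forward(self, x):
        z = self.encoder(x)                              # shared low/middle features
        recon = self.decoder(z)                          # reconstruction for L_a
        feat = self.pool(self.head_conv(z)).flatten(1)   # semantic feature f(x) for L_n
        logits = self.fc(feat)                           # class scores for L_c
        return logits, feat, recon
```

Because the decoder only branches off the shared encoder, it adds no cost to the classification path at inference time and can be dropped after training.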
Thus, for the clean image \(\varvec{x}_i\) and one corresponding noisy image \(\varvec{x}'_i\), with their reconstructed results \(\hat{\varvec{x}}_i\) and \(\hat{\varvec{x}}'_i\) from the embedded auto-encoder, the classifier can be trained not only to improve the classification performance, but also to improve the reconstruction performance of the embedded auto-encoder by additionally minimizing the reconstruction error \(L_a\):
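A plausible form, assuming mean squared reconstruction errors with both reconstructions targeting the clean image, is

\(L_a = \frac{1}{N}\sum_{i=1}^{N}\Big(\big\|\hat{\varvec{x}}_i - \varvec{x}_i\big\|_2^2 + \big\|\hat{\varvec{x}}'_i - \varvec{x}_i\big\|_2^2\Big).\)    (2)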
Note that the target of the reconstructed noisy image \(\hat{\varvec{x}}'_i\) is the clean image \(\varvec{x}_i\). As in Eq. (1), multiple noisy images \(\varvec{x}'_i\) can be randomly generated for each clean image.
Combining the two ideas (Eqs. 1 and 2), a more robust classifier can be obtained by simultaneously training the classifier and the embedded auto-encoder, with the constraint that clean and noisy images stay close to each other in the semantic feature space, i.e., by minimizing the total loss function \(L\):
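Consistent with the weighting of the terms described below, \(L\) takes the form

\(L = L_c + \lambda_{n} L_n + \lambda_{a} L_a.\)    (3)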
Here \(L_c\) denotes the cross-entropy loss of the classifier itself, which improves its classification performance on both clean and noisy images, and \(\lambda_{n}\) and \(\lambda_{a}\) are hyper-parameters that respectively control the relative weights of the loss terms \(L_n\) and \(L_a\).
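A hedged sketch of one training step minimizing this combined loss is given below, using a model with the (logits, feature, reconstruction) interface sketched above; the noise level \(\sigma\) and the use of mean squared errors are our assumptions, while \(\lambda_n = 1\) and \(\lambda_a = 10\) follow the trade-off values reported in Sect. 3.

```python
# A hedged sketch of one training step minimizing L = L_c + lambda_n*L_n + lambda_a*L_a.
# sigma and the mean-squared-error distances are assumptions for illustration.
import torch
import torch.nn.functional as F

def training_step(model, optimizer, x, y, sigma=8 / 255, lambda_n=1.0, lambda_a=10.0):
    # noisy neighbor of the clean batch x
    x_noisy = torch.clamp(x + torch.empty_like(x).uniform_(-sigma, sigma), 0.0, 1.0)

    logits_c, feat_c, recon_c = model(x)        # clean branch
    logits_n, feat_n, recon_n = model(x_noisy)  # noisy branch

    L_c = F.cross_entropy(logits_c, y) + F.cross_entropy(logits_n, y)  # classification
    L_n = F.mse_loss(feat_n, feat_c)            # neighbor loss in the semantic feature space
    L_a = F.mse_loss(recon_c, x) + F.mse_loss(recon_n, x)  # both reconstructions target x

    loss = L_c + lambda_n * L_n + lambda_a * L_a
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```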
3 Experiments
3.1 Experimental Settings
The experiments were performed on two medical image datasets: the skin image dataset from the ISIC 2018 Challenge with 7 disease categories (SKIN4) [3], and the chest X-ray dataset with 3 categories. To reduce the class imbalance in the skin dataset, the four classes (MEL, NV, BCC, and BKL) with more than 500 images each were selected, and 1,500 NV images were randomly sampled from the available 6,705, while the other classes were kept unchanged. The selected images were split into a training set and a test set at a ratio of about 5:1. For the chest X-ray dataset, the raw training set was randomly split into two parts, with 21,000 images used as our training set and the remaining 6,000 images as the test set. To generate adversarial images for evaluating the proposed defense approach, different attacking methods, including the Fast Gradient Sign Method (FGSM) [5], the iterative FGSM (IFGSM) [7], and the Carlini and Wagner (C&W) method [2], were applied to four widely used network classifiers, including ResNet18 [6] and VGG-16. For FGSM and IFGSM, two adversarial examples were generated for each training sample, with perturbation levels \(\epsilon \in \{4, 8\}\). For C&W, the number of search steps was set to 5 and the number of iterations to 1,000.
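For reference, the following is a minimal FGSM sketch, not the attack code used in the paper, under the assumptions that the model returns class logits, pixel values lie in \([0, 1]\), and \(\epsilon\) is given on the 0-255 scale (e.g., 4 or 8 as in the evaluation).

```python
# Minimal single-step FGSM sketch (illustrative assumptions as noted above).
import torch
import torch.nn.functional as F

def fgsm_attack(model, x, y, epsilon=8):
    eps = epsilon / 255.0
    x_adv = x.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(x_adv), y)
    loss.backward()
    # single step in the direction of the gradient sign, clipped to valid pixels
    x_adv = x_adv.detach() + eps * x_adv.grad.sign()
    return torch.clamp(x_adv, 0.0, 1.0)
```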
There are mainly two types of adversarial attacks, based on different assumptions about the attacker's knowledge of the target network: white-box and black-box attacks. In a black-box attack, the attacker can observe only the network's outputs for probed inputs, which is the more realistic and applicable setting. In comparison, in a white-box attack, the attacker has detailed knowledge of the network architecture and model parameters. The evaluations here mainly focus on defending against black-box attacks.
All the CNN classifiers used in the experiments were optimized with SGD, with the initial learning rate set to 0.01 and the weight decay set to 0.0001. Each model was trained on a single GPU with a batch size of 64, for 80 epochs. Note that due to limited space, only part of the evaluation results are shown below, and the attacking model was ResNet18 unless otherwise mentioned.
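For concreteness, this configuration could be set up as follows; the momentum value and the use of torchvision's ResNet18 are illustrative assumptions not stated in the text.

```python
# Training configuration as described above; momentum and the torchvision
# ResNet18 backbone are illustrative assumptions.
import torch
from torchvision.models import resnet18

model = resnet18(num_classes=4)
optimizer = torch.optim.SGD(model.parameters(), lr=0.01,
                            momentum=0.9, weight_decay=1e-4)
num_epochs, batch_size = 80, 64
```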
3.2 Evaluations on Skin Dataset
This section evaluates the effect of the proposed approach in improving the robustness of the ResNet18 classifier, with an ablation study on the skin dataset. Table 1 shows that when the neighbor loss was included during training, with the embedded auto-encoder excluded, the trained classifiers (rows 2–4) performed significantly better than the classifier without any defense (first row) when attacked by different methods at different perturbation levels (\(\epsilon\)). It also shows that the defense performance increased with the weight \(\lambda_{n}\) of the neighbor loss. However, a large \(\lambda_{n}\) (10.0) could lead to downgraded performance in classifying clean images. This is reasonable because a larger \(\lambda_{n}\) makes the network pay less attention to the cross-entropy loss during training. As a trade-off, \(\lambda_{n} = 1\) was chosen for the subsequent tests on the skin dataset. Note that a decrease in classification accuracy on clean images is a common phenomenon in most defense methods (e.g., see [7, 10]).
Similarly, when only the embedded auto-encoder was added to the classifier, with the neighbor loss excluded during training, Table 1 (fifth row to second-to-last row) shows that the trained classifiers also performed significantly better than the classifier without any defense (first row) under various attacking scenarios. As a trade-off, \(\lambda_{a} = 10\) was chosen for the subsequent tests; note that \(\lambda_a = 100\) led to downgraded performance in classifying clean images.
By combining both the neighbor loss and the embedded auto-encoder in the classifier, the trained classifier showed superior performance to all the above results (Table 1, last row), suggesting that the two proposed ideas work together to further improve the robustness of the classifier. Similar results were obtained on the chest X-ray dataset (Table 2).
3.3 Combinations with Different Model Structures
To show that our approach can work with different model structures, we combined it with another model, VGG-16. Table 3 again shows that the proposed defense approach improved the robustness of a CNN classifier with a different structure, compared to the classifier without any defense (row 'NA'). Combined with the evaluations on the ResNet18 structure above, this supports that the proposed approach helps improve the robustness of multiple CNN model structures.
3.4 Combinations with Existing Defense Approaches
To show that our approach is complementary to existing defense approaches, we combined it with two existing approaches, the Reformer approach [10] and the HGD approach [9]. Table 4 clearly shows that when one or both of our ideas (\(L_n\), \(L_a\), or \(L_n+L_a\), corresponding to the neighbor loss, the embedded auto-encoder, or both) were combined with the two existing approaches, the combination further improved the robustness of the classifiers compared to the performance of the existing approaches alone.
4 Conclusion
In this paper, we proposed a novel defense mechanism to improve the robustness of medical image classification systems. The mechanism embeds an auto-encoder into the CNN structure and keeps high-level features invariant to general noises. It is complementary to existing defense approaches and can therefore be combined with them to further improve the robustness of CNN classifiers.
References
Akhtar, N., Liu, J., Mian, A.: Defense against universal adversarial perturbations. In: CVPR, pp. 3389–3398 (2018)
Carlini, N., Wagner, D.: Towards evaluating the robustness of neural networks. In: IEEE Symposium on Security and Privacy, pp. 39–57 (2017)
Codella, N.C.F., et al.: Skin lesion analysis toward melanoma detection: a challenge at the 2017 international symposium on biomedical imaging, hosted by the international skin imaging collaboration. In: ISBI, pp. 168–172 (2018)
Esteva, A., et al.: Dermatologist-level classification of skin cancer with deep neural networks. Nature 542(7639), 115 (2017)
Goodfellow, I.J., Shlens, J., Szegedy, C.: Explaining and harnessing adversarial examples. In: ICLR (2015)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: CVPR, pp. 770–778 (2016)
Kurakin, A., Goodfellow, I., Bengio, S.: Adversarial examples in the physical world. arXiv:1607.02533 (2016)
Kurakin, A., Goodfellow, I.J., Bengio, S.: Adversarial machine learning at scale. arXiv:1611.01236 (2016)
Liao, F., Liang, M., Dong, Y., Pang, T., Hu, X., Zhu, J.: Defense against adversarial attacks using high-level representation guided denoiser. In: CVPR, pp. 1778–1787 (2018)
Meng, D., Chen, H.: MagNet: a two-pronged defense against adversarial examples. In: ACM Conference on Computer and Communications Security, pp. 135–147 (2017)
Papernot, N., McDaniel, P., Wu, X., Jha, S., Swami, A.: Distillation as a defense to adversarial perturbations against deep neural networks. In: IEEE Symposium on Security and Privacy, pp. 582–597 (2016)
Paschali, M., Conjeti, S., Navarro, F., Navab, N.: Generalizability vs. robustness: investigating medical imaging networks using adversarial examples. In: Frangi, A.F., Schnabel, J.A., Davatzikos, C., Alberola-López, C., Fichtinger, G. (eds.) MICCAI 2018. LNCS, vol. 11070, pp. 493–501. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00928-1_56
Pereira, S., Pinto, A., Alves, V., Silva, C.A.: Brain tumor segmentation using convolutional neural networks in MRI images. IEEE Trans. Med. Imaging 35(5), 1240–1251 (2016)
Szegedy, C., et al.: Intriguing properties of neural networks. In: ICLR (2014)
Tramèr, F., Kurakin, A., Papernot, N., Goodfellow, I., Boneh, D., McDaniel, P.: Ensemble adversarial training: attacks and defenses. arXiv:1705.07204 (2017)
Acknowledgement
This work is supported in part by the National Key Research and Development Plan (grant No. 2018YFC1315402) and by the Guangdong Key Research and Development Plan (grant No. 2019B020228001).