4.1 Experimental Setups
To answer the research questions above, we design our evaluation from the following perspectives.
Subject Datasets and Repairing Scenarios. Given a deployed DNN trained on a training dataset
\(\mathcal {D}^\text{t}\), we can evaluate it on a testing dataset
\(\mathcal {D}^\text{v}\). In the real world, many scenarios cannot be covered by \(\mathcal {D}^\text{v}\), and the DNN's performance may decrease significantly after it is deployed in its operational environment. For example, there are common corruptions (i.e., noise patterns) in the real world that can significantly affect a DNN [26]: Gaussian noise (GN), shot noise (SN), impulse noise (IN), defocus blur (DB), Gaussian blur (GB), motion blur (MB), zoom blur (ZB), snow (SNW), frost (FRO), fog (FOG), brightness (BR), contrast (CTR), elastic transform (ET), pixelate (PIX), and JPEG compression (JPEG).
According to the aforementioned situations, we consider two repairing scenarios that commonly occur in practice:
—
Repairing the accuracy drift on the testing dataset. When we evaluate the DNN on the testing dataset \(\mathcal {D}^\text{v}\), we can collect a few failure examples (i.e., 1,000 examples) denoted as \(\mathcal {D}^\text{v}_{\text{fail}}\). Then, we set \(\mathcal {D}^{\text{repair}}=\mathcal {D}^\text{v}_{\text{fail}}\cup \mathcal {D}^\text{t}\) and use the proposed or baseline repairing methods to enhance the deployed DNN. We evaluate the accuracy on the testing dataset with \(\mathcal {D}^\text{v}_{\text{fail}}\) excluded (i.e., \(\mathcal {D}^\text{v}\setminus \mathcal {D}^\text{v}_{\text{fail}}\)); a sketch of this data assembly is given after this list. Note that repairing a DNN with only a few testing examples is a meaningful and important setting that has been adopted by recent works [54, 72]. In addition, in many practical scenarios, collecting buggy examples is very difficult or very costly, so only a few buggy examples can be collected in total. Hence, we follow the common choice in recent works [54, 72] and select only 1,000 failure examples from the testing data.
—
Repairing the robustness on corrupted datasets. When we evaluate the DNN on a corrupted testing dataset \(\mathcal {D}^\text{c}\), we can also collect a few failure examples (i.e., 1,000 examples) denoted as \(\mathcal {D}^\text{c}_{\text{fail}}\) and set \(\mathcal {D}^{\text{repair}}=\mathcal {D}^\text{c}_{\text{fail}}\,\cup \,\mathcal {D}^\text{t}\). The repairing goal is to enhance the accuracy on \(\mathcal {D}^\text{c}\setminus \mathcal {D}^\text{c}_{\text{fail}}\) and other corrupted datasets while maintaining the accuracy on the clean testing dataset (i.e., \(\mathcal {D}^\text{v}\setminus \mathcal {D}^\text{v}_{\text{fail}}\)).
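To make the construction of \(\mathcal {D}^{\text{repair}}\) and the held-out evaluation split concrete, the following sketch shows one way to assemble them in PyTorch. The helper names (collect_failures, build_repair_split) and the batch size are illustrative assumptions rather than part of ArchRepair's released implementation; only the 1,000-example budget and the set operations mirror the description above.

```python
import torch
from torch.utils.data import ConcatDataset, DataLoader, Subset

def collect_failures(model, dataset, device, budget=1000):
    """Return indices of up to `budget` misclassified examples (D_fail)."""
    model.eval()
    failure_idx, offset = [], 0
    loader = DataLoader(dataset, batch_size=256, shuffle=False)
    with torch.no_grad():
        for images, labels in loader:
            preds = model(images.to(device)).argmax(dim=1).cpu()
            wrong = (preds != labels).nonzero(as_tuple=True)[0] + offset
            failure_idx.extend(wrong.tolist())
            offset += labels.size(0)
            if len(failure_idx) >= budget:
                break
    return failure_idx[:budget]

def build_repair_split(model, train_set, test_set, device, budget=1000):
    """D_repair = D_fail + D_train; evaluation uses D_test with D_fail removed."""
    fail_idx = collect_failures(model, test_set, device, budget)
    kept = [i for i in range(len(test_set)) if i not in set(fail_idx)]
    d_repair = ConcatDataset([Subset(test_set, fail_idx), train_set])
    d_eval = Subset(test_set, kept)
    return d_repair, d_eval
```

The same routine applies to the second scenario by passing a corrupted testing dataset \(\mathcal {D}^\text{c}\) in place of the clean testing set.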
We choose CIFAR-10 [33], CIFAR-100 [33], Tiny-ImageNet [36], and ImageNet [11] as the evaluation datasets. They are commonly used datasets in recent DNN repair studies, enabling us to perform comparative studies in a relatively fair way. Each dataset contains its respective training dataset
\(\mathcal {D}^\text{t}\) and testing dataset
\(\mathcal {D}^\text{v}\). CIFAR-10 contains a total of 60,000 images in 10 categories, of which 50,000 images are for
\(\mathcal {D}^\text{t}\) and the other 10,000 are for
\(\mathcal {D}^\text{v}\).
CIFAR-100 has 100 classes containing 600 images each. There are 500 images in the training dataset \(\mathcal {D}^\text{t}\) and 100 images in the testing dataset \(\mathcal {D}^\text{v}\) for each class. Tiny-ImageNet has a training dataset
\(\mathcal {D}^\text{t}\) with the size of 100,000 images, and a testing dataset
\(\mathcal {D}^\text{v}\) with the size of 10,000 images.
ImageNet contains over 14 million images. In our experiment, the training dataset \(\mathcal {D}^\text{t}\) uses 1.3 million images, and the testing dataset \(\mathcal {D}^\text{v}\) uses over 50,000 images. In addition, following [26], we obtain corrupted testing datasets \(\lbrace \mathcal {D}^\text{c}_i\rbrace\), where \(i=1,2,\dots , 15\) corresponds to the fifteen corruptions listed above.
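The 15 corruptions are those released with the benchmark of [26] and are normally generated with its published code. Purely as an illustration of how one corrupted copy \(\mathcal {D}^\text{c}_i\) of the testing set can be built, the sketch below hand-rolls Gaussian noise (GN); the severity values are placeholder assumptions and do not reproduce the benchmark's exact parameters.

```python
import numpy as np
import torchvision
import torchvision.transforms as T

# Illustrative corruption: additive Gaussian noise (GN). The actual 15
# corruptions follow [26]; the severity levels here are placeholders.
def gaussian_noise(img, severity=3):
    sigma = [0.04, 0.06, 0.08, 0.09, 0.10][severity - 1]
    x = np.asarray(img, dtype=np.float32) / 255.0
    x = np.clip(x + np.random.normal(scale=sigma, size=x.shape), 0.0, 1.0)
    return (x * 255).astype(np.uint8)

# A corrupted testing dataset D^c_i is the clean testing set with the i-th
# corruption applied to every image before the usual tensor transform.
corrupt_transform = T.Compose([
    T.Lambda(gaussian_noise),
    T.ToTensor(),
])
cifar10_c = torchvision.datasets.CIFAR10(
    root="./data", train=False, download=True, transform=corrupt_transform)
```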
DNN architectures. We select six different DNN architectures, i.e., VGGNet-16 [58], ResNet-18, ResNet-50, ResNet-101 [25], DenseNet-121 [27], and EfficientNet-B0 [60]. Given that ArchRepair is a block-based repairing method, the block-like ResNet architectures are natural research subjects. For a broader comparison, we also choose a non-block-like architecture, DenseNet-121, to examine the repairing capability of ArchRepair.1 For each architecture, we first pre-train the model on the original training dataset \(\mathcal {D}^\text{t}\) (from CIFAR-10, CIFAR-100, Tiny-ImageNet, or ImageNet); the model with the highest accuracy on the corresponding testing dataset \(\mathcal {D}^\text{v}\) is saved as the pre-trained model \(\phi _\theta\). As the original ResNet and DenseNet are not designed for the CIFAR-10 and Tiny-ImageNet datasets, we use the unofficial architecture code offered by a popular GitHub project,2 which has more than 4.1K stars.
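As an illustration of this checkpoint-selection step, the sketch below pre-trains a model and keeps the weights with the highest accuracy on \(\mathcal {D}^\text{v}\) as \(\phi _\theta\). The evaluate helper, the checkpoint path, and the epoch budget are illustrative placeholders; the SGD settings follow the hyper-parameters reported below.

```python
import torch

def evaluate(model, loader, device):
    """Top-1 accuracy of `model` on the given data loader."""
    model.eval()
    correct = total = 0
    with torch.no_grad():
        for images, labels in loader:
            preds = model(images.to(device)).argmax(dim=1).cpu()
            correct += (preds == labels).sum().item()
            total += labels.size(0)
    return correct / total

def pretrain(model, train_loader, test_loader, epochs=500, device="cuda"):
    """Pre-train and keep the checkpoint with the best accuracy on D^v."""
    model.to(device)
    opt = torch.optim.SGD(model.parameters(), lr=0.1, weight_decay=5e-4)
    criterion = torch.nn.CrossEntropyLoss()
    best_acc = 0.0
    for _ in range(epochs):
        model.train()
        for images, labels in train_loader:
            opt.zero_grad()
            loss = criterion(model(images.to(device)), labels.to(device))
            loss.backward()
            opt.step()
        acc = evaluate(model, test_loader, device)
        if acc > best_acc:  # save phi_theta, the best pre-trained model
            best_acc = acc
            torch.save(model.state_dict(), "pretrained_phi_theta.pt")
    return best_acc
```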
Block definition. We divide each of the six selected DNN architectures into several blocks. For each of ResNet-18, ResNet-50, and ResNet-101, we follow its block structure and divide it into four blocks, as shown in Table 2. For DenseNet-121, we treat every two convolutional layers as one block. For VGGNet-16, we manually divide it into six blocks at the max-pooling layers, as Table 2 shows, and select five of them as repairing targets (i.e., Blocks 1~5; Block 6 produces the output, so we leave it out of the repairing targets). For EfficientNet-B0, we follow its block structure and divide it into seven blocks (see Table 2). An illustrative partition of ResNet-18 is sketched below.

NAS method. We select PC-DARTS [70] as the NAS solution for ArchRepair. While all popular NAS techniques should fit into ArchRepair (e.g., DARTS, SNAS, and BayesNAS), these techniques take more time than PC-DARTS to search for a better network architecture. Given that DNN repair is a time-sensitive task, we choose PC-DARTS [70] as it is among the fastest NAS methods. Nonetheless, ArchRepair retains an interface for switching to other NAS techniques when a task prioritizes the performance of repaired models over the time cost of repairing.

Hyper-parameters. Regarding the training setup, we employ stochastic gradient descent (SGD) as the optimizer, setting the batch size to 128, the initial learning rate to 0.1, and the weight decay to 0.0005. We use the cross-entropy loss as the loss function. The maximum number of epochs is 500, and an early-stopping criterion terminates training when the validation loss has not decreased for 10 epochs.
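Following up on the block definition above, the sketch below illustrates such a partition for ResNet-18, assuming that the four residual stages (layer1 to layer4) of the torchvision implementation correspond to the four blocks in Table 2. The freeze_all_but helper is only a simplified stand-in for selecting a block as the repairing target; ArchRepair additionally rewires the selected block's architecture via NAS rather than merely fine-tuning its weights.

```python
from torchvision.models import resnet18

# Assumed mapping: the four residual stages of ResNet-18 correspond to the
# four blocks of Table 2; the stem and classifier head stay outside.
model = resnet18(num_classes=10)  # e.g., CIFAR-10 has 10 classes
blocks = {
    "block1": model.layer1,
    "block2": model.layer2,
    "block3": model.layer3,
    "block4": model.layer4,
}

def freeze_all_but(model, target):
    """Freeze every parameter except those of the block selected for repair."""
    for p in model.parameters():
        p.requires_grad = False
    for p in blocks[target].parameters():
        p.requires_grad = True

freeze_all_but(model, "block3")  # e.g., select the third block as the target
trainable = [n for n, p in model.named_parameters() if p.requires_grad]
print(f"{len(trainable)} parameter tensors are trainable, all within layer3")
```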
Baselines. To demonstrate the repairing capability of the proposed
ArchRepair, we select six SOTA DNN repair methods from two different categories as baselines: neuron-level repairing methods and network-level repairing methods. Neuron-level repairing methods focus on fixing certain neurons' weights in order to repair the DNN; representative methods from this category are MODE [45], Apricot [74], and Arachne [59]. Network-level repairing methods mainly repair DNNs by using augmented datasets to fine-tune the whole network; SENSEI [19], Few-Shot [54], and DeepRepair [72] are the most popular ones. For a fair comparison, we employ the same settings for all six repairing methods and ArchRepair. To fully evaluate the effectiveness of the proposed method, we apply all methods (the six baselines and ArchRepair) to fix four different DNN architectures on large-scale datasets, including the clean version and the 15 corrupted versions of CIFAR-10 and Tiny-ImageNet, to assess the repairing capability.
Other configurations. We implement ArchRepair in Python 3.9 based on the PyTorch framework. All experiments were performed on a server with a 12-core 3.60 GHz Xeon E5-1650 CPU, 128 GB RAM, and four NVIDIA GeForce RTX 3090 GPUs (each with 24 GB of memory), running Ubuntu 18.04.
In summary, for each baseline method and
ArchRepair,
our evaluation consists of 96 configurations (6 DNN architectures \(\times\) 16 versions of a dataset3) on four datasets (i.e., CIFAR-10, CIFAR-100, Tiny-ImageNet, and ImageNet). For the CIFAR-10 dataset, an execution of training and repairing a model under one specific configuration costs about 12 hours on average (the maximum is about 50 hours); for the Tiny-ImageNet dataset, an execution of training and repairing a model takes about 18 hours on average (the maximum is about 64 hours).
We measured the execution time of repairing the six DNN architectures (i.e., VGGNet-16, ResNet-18, ResNet-50, ResNet-101, DenseNet-121, and EfficientNet-B0) on CIFAR-10 within 100 epochs by the different repairing methods. The results are reported in Table 4. According to Table 4, the neuron-level methods use less execution time than the other repairing methods (the cells with green background), and the network-level methods use more execution time than the others. Our method, ArchRepair, uses more execution time than the neuron-level methods but less than the network-level methods. This is because ArchRepair repairs DNN models at the block level, which operates at a larger granularity than the neuron level but a smaller one than the network level. This is also consistent with our expectation in Section 2.2, i.e., block-level repairing makes a good tradeoff between effectiveness and efficiency. Overall, the total execution time of our experiments exceeds two months.