
Introduction to Robustness

The robustness of a model refers to its ability to maintain performance when confronted with various conditions, including different types of anomalies, noise, interference, changes in data distribution, and adversarial attacks. Mathematically, robustness can be described in terms of the stability of the loss function. Specifically, for a given input $X$ and true label $Y$, the model's prediction $f_\theta(X)$ and the loss $L(f_\theta(X), Y)$ between the prediction and the true label should remain within an acceptable limit even in the presence of an input perturbation $\Delta X$, i.e., $L(f_\theta(X + \Delta X), Y) \le \epsilon$, where $\epsilon$ represents the upper limit of the loss. If the loss exceeds this limit, the model may produce incorrect outputs. In the context of adversarial robustness, the model should remain robust even under the worst-case adversarial perturbation, which can be expressed as $\max_{\Delta X} L(f_\theta(X + \Delta X), Y) \le \epsilon$. Enhancing robustness is crucial to ensure that the model can perform reliably in complex and variable real-world environments. Additionally, the constructed perturbation should be inconspicuous to humans: when generating test examples with image noise, $\Delta X$ should make $X + \Delta X$ differ minimally from the original $X$ in human perception, yet still easily lead to errors in the model's output.
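
As a concrete illustration of this definition, the sketch below (assuming a PyTorch classifier `model` and cross-entropy as the loss $L$; all names are illustrative, not part of the benchmark code) checks whether the loss under a given perturbation stays within the bound $\epsilon$:

```python
import torch
import torch.nn.functional as F

def loss_within_bound(model, x, y, delta, epsilon):
    """Check whether L(f_theta(X + Delta X), Y) <= epsilon for a given perturbation.

    model   : classifier f_theta mapping an image batch to logits
    x, y    : input batch X and true labels Y
    delta   : perturbation Delta X, same shape as x
    epsilon : acceptable upper limit on the loss
    """
    with torch.no_grad():
        loss = F.cross_entropy(model(x + delta), y)
    return loss.item() <= epsilon
```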

We assess the robustness of the model by applying two types of perturbations to input images across various computer vision tasks: adversarial perturbations and natural noise perturbations. Specifically, we introduce perturbations to image datasets covering two primary categories of interference. The first is common natural noise present in the physical world, such as Gaussian noise caused by lighting and sensor noise, as well as atmospheric conditions like fog, rain, and snow. The second is targeted adversarial perturbation, where we use surrogate (proxy) models to generate perturbations intended to make the model produce erroneous results. By applying these perturbation methods, we create datasets with varying degrees of perturbation and compute the model's robustness metrics by evaluating its performance on these perturbed datasets. This evaluation enables a better understanding of the model's behavior under various image interferences, thereby helping to enhance its robustness in practical computer vision applications.

Datasets

ImageNet-1K Robustness Dataset

The construction of the robustness dataset involves two aspects. One is the individual addition of various types of natural noise to the original images to create perturbed test images; this primarily includes Gaussian noise, impulse noise, and shot noise. The other is the use of adversarial perturbation algorithms, which generate perturbations through a surrogate model with the goal of disrupting the surrogate model's predictions as much as possible within a fixed number of iterations.

ImageNet-1K_gaussian

Gaussian Noise

Gaussian noise is a common type of natural noise in image processing, typically caused by variations in lighting, random fluctuations in electronic sensors, or interference during the transmission process. It is characterized by the randomness of pixel values and follows a Gaussian distribution. The mathematical expression for Gaussian noise is as follows:

$$\tilde{I}(x, y) = I(x, y) + N(0, \sigma^2)$$
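
A minimal sketch of this perturbation, assuming images are NumPy arrays with pixel values in [0, 1]; the standard deviation `sigma` is an illustrative severity setting, not necessarily the value used to build the dataset:

```python
import numpy as np

def add_gaussian_noise(image, sigma=0.08, rng=None):
    """Add zero-mean Gaussian noise N(0, sigma^2) to an image in [0, 1]."""
    rng = np.random.default_rng() if rng is None else rng
    noisy = image.astype(np.float32) + rng.normal(0.0, sigma, size=image.shape)
    return np.clip(noisy, 0.0, 1.0)  # keep pixel values in the valid range
```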

Sample of Perturbation Dataset

ILSVRC2012_val_00000001_g

ImageNet-1K_impulse

Impulse Noise

Impulse noise, also known as salt and pepper noise, is a common type of natural noise in image processing. It often manifests as sudden bright or dark pixels in an image, simulating abrupt errors in the image sensor or during the transmission process. Its mathematical expression can be represented in the following form:

$$\tilde{I}(x, y) = \begin{cases} A, & \text{with probability } P_a \\ B, & \text{with probability } P_b \\ I(x, y), & \text{with probability } 1 - P_a - P_b \end{cases}$$
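
A minimal sketch of this perturbation, again assuming images in [0, 1]; here A = 1 (salt), B = 0 (pepper), and the probabilities `p_a` and `p_b` are illustrative:

```python
import numpy as np

def add_impulse_noise(image, p_a=0.02, p_b=0.02, rng=None):
    """Salt-and-pepper noise: set a pixel to A=1 with probability p_a,
    to B=0 with probability p_b, and leave it unchanged otherwise."""
    rng = np.random.default_rng() if rng is None else rng
    noisy = image.astype(np.float32).copy()
    u = rng.random(image.shape[:2])                # one draw per pixel location
    noisy[u < p_a] = 1.0                           # A: salt (bright)
    noisy[(u >= p_a) & (u < p_a + p_b)] = 0.0      # B: pepper (dark)
    return noisy
```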

Sample of Perturbation Dataset

ILSVRC2012_val_00000001_i

ImageNet-1K_shot

Shot Noise

Shot noise is a common type of natural noise in image processing. It arises from the discrete nature of light and electric charge: the number of photons (or electrons) collected by each sensor element fluctuates randomly, so the noise approximately follows a Poisson distribution whose variance grows with signal intensity. It manifests as grainy fluctuations in pixel values that are most visible in darker regions of the image. The mathematical expression for shot noise can be represented in the following form, where $S(x, y)$ denotes the shot-noise term:

$$\tilde{I}(x, y) = I(x, y) + S(x, y)$$
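
A minimal sketch under the Poisson view of shot noise, assuming images in [0, 1]; `photon_count` is an illustrative severity parameter, and the additive term $S(x, y)$ is simply the difference between the resampled and original pixel values:

```python
import numpy as np

def add_shot_noise(image, photon_count=60.0, rng=None):
    """Shot (Poisson) noise: resample each pixel from a Poisson distribution
    whose mean is the original intensity scaled by an assumed photon count."""
    rng = np.random.default_rng() if rng is None else rng
    image = np.clip(image.astype(np.float32), 0.0, 1.0)
    noisy = rng.poisson(image * photon_count) / photon_count
    # S(x, y) = noisy - image is the additive shot-noise term in the formula above.
    return np.clip(noisy, 0.0, 1.0)
```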

Sample of Perturbation Dataset

ILSVRC2012_val_00000001_s

ImageNet-1K_pgd

PGD Algorithm

The PGD (Projected Gradient Descent) algorithm is a commonly used optimization-based method for generating adversarial examples. It searches for a perturbation, constrained within a predetermined bound (e.g., an $\ell_\infty$ ball of radius $\epsilon$), that maximizes the model's loss and thus drives it toward incorrect outputs. The algorithm iterates multiple times, taking a small signed-gradient step in each iteration and projecting the perturbation back into the allowed range. Selecting ResNet-50 as the surrogate model, the update can be written in the following form:

$$\delta^{(t+1)} = \mathrm{Clip}_{\epsilon}\!\left(\delta^{(t)} + \alpha \cdot \mathrm{sign}\!\left(\nabla_{\delta} J\!\left(\theta, x + \delta^{(t)}, y\right)\right)\right)$$
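
A minimal PyTorch sketch of this update for an untargeted $\ell_\infty$ attack, assuming inputs in [0, 1]; the radius, step size, and iteration count are illustrative, and any input normalization is assumed to be folded into the model:

```python
import torch
import torch.nn.functional as F
from torchvision.models import resnet50, ResNet50_Weights

def pgd_attack(model, x, y, epsilon=8 / 255, alpha=2 / 255, steps=10):
    """Untargeted L_inf PGD: take a signed-gradient step on the loss w.r.t.
    the perturbation, then clip it back into the epsilon ball."""
    delta = torch.zeros_like(x, requires_grad=True)
    for _ in range(steps):
        loss = F.cross_entropy(model(x + delta), y)
        loss.backward()
        with torch.no_grad():
            delta += alpha * delta.grad.sign()        # ascent step on the loss
            delta.clamp_(-epsilon, epsilon)           # Clip_epsilon projection
            delta.copy_((x + delta).clamp(0, 1) - x)  # keep the image valid
        delta.grad.zero_()
    return (x + delta).detach()

# ResNet-50 serves as the surrogate model for generating the perturbations.
surrogate = resnet50(weights=ResNet50_Weights.IMAGENET1K_V1).eval()
```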

Sample of Perturbation Dataset

ILSVRC2012_val_00000001_p

Robustness Metrics (RB-index)

For the original dataset and the different perturbed datasets we have $Acc_{org}, Acc_{dist_1}, Acc_{dist_2}, Acc_{dist_3}, \dots, Acc_{dist_T}$, where $Acc$ refers to the model's evaluation metric on the given dataset, $org$ denotes the original dataset, and $dist_{1 \dots T}$ denote the $T$ different perturbed datasets.

The robustness index on this dataset is computed as:

$$\text{Robustness} = \frac{1}{T \cdot Acc_{org}} \sum_{i=1}^{T} \left(Acc_{org} - Acc_{dist_i}\right) \cdot Flag$$

If a higher value of the Acc metric indicates a more effective model, then Flag is 1; otherwise, Flag is -1.

Smaller values of the robustness metric indicate better model robustness; the metric can also be negative (this occurs mostly in NLP tasks).
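
A minimal sketch of this computation (the accuracy numbers in the usage example are purely hypothetical):

```python
def rb_index(acc_org, acc_dist, higher_is_better=True):
    """RB-index: mean relative performance drop across T perturbed datasets.

    acc_org          : metric on the original dataset (Acc_org)
    acc_dist         : metrics on the perturbed datasets (Acc_dist_1 .. Acc_dist_T)
    higher_is_better : sets Flag to +1 if a larger metric means a better model,
                       otherwise -1.
    """
    flag = 1 if higher_is_better else -1
    T = len(acc_dist)
    return flag * sum(acc_org - a for a in acc_dist) / (T * acc_org)

# Hypothetical example: clean accuracy 0.761, accuracies on four perturbed sets.
print(rb_index(0.761, [0.412, 0.455, 0.430, 0.188]))
```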