Adversarial Quantum Machine Learning: An Information-Theoretic Generalization Analysis

Petros Georgiou, Sharu Theresa Jose Department of Computer Science,
University of Birmingham, UK
Email: pxg402@student.bham.ac.uk, s.t.jose@bham.ac.uk Osvaldo Simeone KCLIP lab
Centre for Intelligent Information Processing Systems (CIIPS)
Department of Engineering, King’s College London
Email: osvaldo.simeone@kcl.ac.uk

Abstract

In a manner analogous to their classical counterparts, quantum classifiers are vulnerable to adversarial attacks that perturb their inputs. A promising countermeasure is to train the quantum classifier by adopting an attack-aware, or adversarial, loss function. This paper studies the generalization properties of quantum classifiers that are adversarially trained against bounded-norm white-box attacks. Specifically, a quantum adversary maximizes the classifier’s loss by transforming an input state $\rho(x)$ into a state $\lambda$ that is $\epsilon$ -close to the original state $\rho(x)$ in $p$ -Schatten distance. Under suitable assumptions on the quantum embedding $\rho(x)$ , we derive novel information-theoretic upper bounds on the generalization error of adversarially trained quantum classifiers for $p=1$ and $p=\infty$ . The derived upper bounds consist of two terms: the first is an exponential function of the 2-Rényi mutual information between classical data and quantum embedding, while the second term scales linearly with the adversarial perturbation size $\epsilon$ . Both terms are shown to decrease as $1/\sqrt{T}$ over the training set size $T$ . An extension is also considered in which the adversary assumed during training has different parameters $p$ and $\epsilon$ as compared to the adversary affecting the test inputs. Finally, we validate our theoretical findings with numerical experiments for a synthetic setting.

I Introduction

Motivation: Quantum machine learning (QML) has emerged as a design paradigm for current noisy intermediate scale quantum (NISQ) computers [1, 2]. Among the main projected application of QML is data analytics, of which classification is a prototypical example. As shown in Fig. 1(a), in a typical quantum classification problem, a classical input $x$ – such as an image, a text, or a vector of tunable parameters for a physical experiment – is mapped to a quantum state $\rho(x)$ , which is known as a quantum embedding. The quantum embedding map $\rho(x)$ may be implemented by a quantum circuit or by some physical mechanism, possibly encompassing also quantum sensing [3]. The design goal is to find a classifier, consisting of a positive operator valued measure (POVM), that can predict the true class $c$ associated with input $x$ with reasonable accuracy.

Refer to caption — Figure 1: Quantum classification in $(a)$ a non-adversarial setting, in which the quantum measurement $\Pi$ acts on the unperturbed quantum embedding $\rho(x)$ ; $(b)$ an adversarial setting, in which the state $\rho(x)$ is perturbed by a quantum adversary to yield a state $\lambda$ .

Despite quantum classifiers having shown promising results [4], recent works [5, 6, 7] have highlighted their vulnerability to adversarial attacks. A quantum adversary can perturb the input quantum state $\rho(x)$ via the application of a quantum channel, producing a state $\lambda$ for which the classifier is less likely to identify the true class $c$ .

Adversarial training was found to be a promising defense strategy [5, 7]. In adversarial training, the classifier replaces the conventional classification loss with an adversarial loss that accounts for the worst-case effect of an adversarial perturbation of the quantum embedding. This approach results in a min-max optimization problem with outer minimization over POVMs and inner maximization over adversarial perturbations. Our aim is to understand how well an adversarially trained classifier generalizes to new, previously unseen quantum states subjected to a possibly different adversarial attack.

Related Work: While the theory of adversarial generalization has recently garnered attention in classical adversarial machine learning [8, 9, 10], related efforts have not been reported for QML. Indeed, existing works on the generalization analysis of QML models focus on the conventional non-adversarial setting [11, 12, 13]. Our work is particularly inspired by [11], which presented an information-theoretic analysis of generalization for quantum classifier in the absence of quantum adversaries. Our generalization bounds extend those derived in [11] by accounting for the impact of adversarial training and for the presence of a quantum attacker at test time.

Main Contributions: In this work, we study quantum adversarial attacks which perturb the input quantum state $\rho(x)$ to a state that is $\epsilon$ -close to $\rho(x)$ in $p$ -Schatten distance. Our main contributions are as follows:
$\bullet$ We derive new information-theoretic upper bounds on the adversarial generalization error for $p=1$ and $p=\infty$ . The resulting upper bounds consist of two terms: The first, which coincides with the bound in [11], captures the non-adversarial generalization error via the exponentiated 2-Rényi-mutual information between the classical input and the quantum embedding; while the second term accounts for the impact of adversarial perturbations. Specifically, the second term scales as $2\epsilon/\sqrt{T}$ under $p=1$ attack, and as $2d\epsilon/\sqrt{T}$ under $p=\infty$ attack, where $d$ is the dimension of Hilbert space and $T$ is the number of training samples. Accordingly, our results bound the increase in sample complexity caused by the presence of an attacker, and they account for the power of the adversary via parameters $p$ and $\epsilon$ .
$\bullet$ We study a setting in which the classifier is adversarially trained against a $p$ -adversarial attack with $\epsilon$ -perturbation budget, but it is tested against a $p^{\prime}$ -attack with $\epsilon^{\prime}$ -perturbation budget. We show that in the presence of this training-test mismatch, training with a strong adversary is the preferred strategy, as weak training adversaries may incur a positive non-vanishing term that scales as $d^{(1-1/p^{\prime})}\epsilon^{\prime}+d^{(1-1/p)}\epsilon.$
$\bullet$ Finally, we validate our main theoretical findings with numerical experiments.

II Problem Formulation

In this section, we first introduce the quantum classification problem in the absence of quantum adversary, and define the conventional generalization error of a quantum classifier. We then formulate the adversarial setting, and define the generalization error of an adversarially-trained classifier.

II-A Generalization Error of Quantum Classifiers

As illustrated in Fig. 1(a), a classical input $x$ is embedded into a quantum state $\rho(x)$ by a fixed and known quantum embedding map $x\mapsto\rho(x)$ . The state $\rho(x)$ is a density matrix, i.e., a positive semi-definite, unit-trace matrix, defined in a finite-dimensional Hilbert space $\mathcal{H}$ . Let $c\in\{1,\ldots,K\}$ denote the correct label assigned to input $x$ that takes values in one of the $K$ classes. The classical tuple $(x,c)$ is generated from an unknown data distribution $P(x,c)$ . We assume $x$ to be discrete-valued to avoid some technicalities, but the analysis can be extended to continuous-valued inputs $x$ .

The quantum classifier consists of a POVM applied to the quantum embedding $\rho(x)$ . The POVM $\Pi=\{\Pi_{c}\}_{c=1}^{K}$ is defined by positive semi-definite matrices $\Pi_{c}$ , for $c=1,\ldots,K$ , that satisfy the equality $\sum_{c=1}^{K}\Pi_{c}=I$ , where $I$ denotes the identity matrix. We use $\mathcal{M}=\{\Pi:\Pi_{c}\geq 0,\sum_{c=1}^{K}\Pi_{c}=I\}$ to denote the set of all POVMs. By Born’s rule, a POVM $\Pi$ applied to a quantum state $\rho(x)$ yields the output class $c$ with probability ${\rm Tr}(\Pi_{c}\rho(x))$ .

Accordingly, we consider as loss function the probability of error

\displaystyle\ell(\Pi,\rho(x),c)=1-{\rm Tr}(\Pi_{c}\rho(x)),

(1)

which is the probability of misclassifying state $\rho(x)$ given its true label $c$ . The goal of the quantum classification problem is to find the POVM $\Pi\in\mathcal{M}$ that minimizes the population risk,

\displaystyle L(\Pi)=\mathbb{E}_{P(x,c)}[\ell(\Pi,\rho(x),c)],

(2)

which is the expected loss with respect to the distribution $P(x,c)$ .

However, the population risk cannot be evaluated by the classifier, since the data distribution $P(x,c)$ is unknown. Instead, the optimization of POVM is done with respect to the empirical training risk,

\displaystyle\widehat{L}(\Pi,\mathcal{T})=\frac{1}{T}\sum_{n=1}^{T}\ell(\Pi,% \rho(x_{n}),c_{n}),

(3)

which is evaluated using a training set $\mathcal{T}=\{(x_{n},c_{n})\}_{n=1}^{T}$ consisting of $T$ tuples $(x_{n},c_{n})$ generated i.i.d. from distribution $P(x,c)$ . The difference between the population risk and the training risk is defined as the generalization error

\displaystyle\mathcal{G}(\Pi,\mathcal{T})=L(\Pi)-\widehat{L}(\Pi,\mathcal{T})

(4)

obtained by the POVM $\Pi$ .

II-B Adversarial Attacks on Quantum Classifiers

In an adversarial setting, as illustrated in Fig. 1(b), a quantum adversary can perturb the input quantum state $\rho(x)$ via the application of a completely positive trace preserving (CPTP) map, i.e., a quantum channel, with the aim of maximizing the classifier’s loss (1)[5]. Targeting a worst-case scenario, the adversary is assumed to know the quantum classifier $\Pi$ , the loss function (1), as well as the quantum embedding map $x\mapsto\rho(x)$ , resulting in white-box attacks.

To define the power of the adversary, we constrain the distance between the density matrices before and after the perturbation. To this end, we adopt the $p$ -Schatten norm. For two density matrices $\rho_{1}$ and $\rho_{2}$ and $p\in[1,\infty)$ , the $p$ -Schatten distance $D_{p}(\rho_{1},\rho_{2})$ is defined as

D_{p}(\rho_{1},\rho_{2})=\|\rho_{1}-\rho_{2}\|_{p}=({\rm Tr}(|\bar{\rho}|^{p})% )^{1/p},

(5)

where $\bar{\rho}=\rho_{1}-\rho_{2}$ and $|\bar{\rho}|=\sqrt{\bar{\rho}\bar{\rho}^{{\dagger}}}$ . In the limiting case of $p=\infty$ , the distance $D_{\infty}(\rho_{1},\rho_{2})$ is defined as $D_{\infty}(\rho_{1},\rho_{2})=\max(\{\alpha(|\bar{\rho}|)\})$ where $\{\alpha(|\bar{\rho}|)\}$ is the set of eigenvalues of $|\bar{\rho}|$ .

A $p$ -adversarial attack with a perturbation budget $\epsilon\geq 0$ can produce any quantum state $\lambda$ satisfying $D_{p}(\rho(x),\lambda)\leq\epsilon$ . Assuming that the adversary maximizes the loss $\ell(\Pi,\lambda,c)$ incurred by the quantum classifier under this perturbation budget, the resulting adversarial loss of the classifier $\Pi$ on data tuple $(\rho(x),c)$ is given as

\displaystyle\ell_{p,\epsilon}(\Pi,\rho(x),c)

\displaystyle=\underset{\lambda:D_{p}(\rho(x),\lambda)\leq\epsilon}{\max}\ell(% \Pi,\lambda,c),

(6)

where $\ell(\Pi,\lambda,c)$ is defined as in (1). In this paper, we will focus on the extreme cases with $p=1$ and $p=\infty$ adversarial attacks.

In the presence of a $p$ -adversarial attack with perturbation budget $\epsilon$ , the performance of the quantum classifier is measured by the adversarial population risk

\displaystyle L_{p,\epsilon}(\Pi)=\mathbb{E}_{P(x,c)}[\ell_{p,\epsilon}(\Pi,% \rho(x),c)],

(7)

which is the expected adversarial loss with respect to the unknown distribution $P(x,c)$ .

II-C Generalization Error of Adversarially Trained Classifiers

Suppose that the quantum classifier is aware of the presence of a $p$ -adversarial attack with perturbation budget $\epsilon$ . While the adversarial population risk cannot be directly evaluated, the quantum classifier can be trained by optimizing the adversarial training risk

\displaystyle\widehat{L}_{p,\epsilon}(\Pi,\mathcal{T})=\frac{1}{T}\sum_{n=1}^{% T}\ell_{p,\epsilon}(\Pi,\rho(x_{n}),c_{n}),

(8)

which is the empirical average of the adversarial loss (6) over the training set $\mathcal{T}$ . This results in a min-max optimization problem with the outer minimization over POVMs and the inner maximization over perturbations of quantum states [5].

In this work, we are interested in characterizing the adversarial generalization error. The adversarial generalization error of a POVM $\Pi$ is the difference between adversarial population risk (7) and adversarial training loss (8), i.e.,

\displaystyle\mathcal{G}_{p,\epsilon}(\Pi,\mathcal{T})=L_{p,\epsilon}(\Pi)-% \widehat{L}_{p,\epsilon}(\Pi,\mathcal{T}).

(9)

Note that in the limit as $\epsilon\rightarrow 0$ , the adversarial generalization error $\mathcal{G}_{p,\epsilon}(\Pi,\mathcal{T})$ coincides with the standard generalization error $\mathcal{G}(\Pi,\mathcal{T})$ in (4).

III Preliminaries

In this section, we first present the main result of [11], which gives a high-probability, information-theoretic, upper bound on the generalization error (4) for conventional quantum learning. We then outline the key steps in the derivation of the upper bound, which will be useful in the next section to derive the proposed upper bounds on the adversarial generalization error.

Theorem 1 (Banchi et. al [11]).

For any POVM $\Pi\in\mathcal{M}$ , the following upper bound on the generalization error $\mathcal{G}(\Pi,\mathcal{T})$ holds with probability at least $1-\delta$ , for $\delta\in(0,1)$ , with respect to random draws of of the training set $\mathcal{T}$ ,

\displaystyle\mathcal{G}(\Pi,\mathcal{T})

\displaystyle\leq 2\sqrt{\frac{2^{I_{2}(X:Q)}K}{T}}+\sqrt{\frac{2\log(2/\delta% )}{T}}:=\mathcal{B},

(10)

where $I_{2}(X:Q)$ denotes the 2-Renyi mutual information between the quantum state space $Q$ and the classical feature space $X$ under state $\rho^{XQ}=\sum_{x}P(x)|x\rangle\langle x|\otimes\rho(x)$ , which is given by

{I_{2}(X:Q)}=2\log_{2}\Bigg{(}{\rm Tr}\sqrt{\sum_{x}P(x)\rho(x)^{2}}\Bigg{)}.

(11)

The derivation of the upper bound in (10) follows two main steps. In the first step, the generalization error $\mathcal{G}(\Pi,\mathcal{T})$ of a POVM $\Pi\in\mathcal{M}$ is upper bounded as

\displaystyle\mathcal{G}(\Pi,\mathcal{T})\leq\sup_{\Pi\in\mathcal{M}}|L(\Pi)-% \widehat{L}(\Pi,\mathcal{T})|:=\mathcal{U}(\mathcal{M},\mathcal{T}),

(12)

where $\mathcal{U}(\mathcal{M},\mathcal{T})$ denotes the uniform deviation bound that depends on the training set $\mathcal{T}$ and the set $\mathcal{M}$ of POVMs. In the second step, the uniform deviation bound is upper bounded by leveraging a classical result from statistical learning theory. This result, stated next, hinges on the fact that the loss function in (1) satisfies the inequality $0\leq\ell(\cdot,\cdot,\cdot)\leq 1$ .

Lemma 1 (Shalev-Schwartz and Ben David[14]).

With probability at least $1-\delta$ , for $\delta\in(0,1)$ , with respect to random draws of the training set $\mathcal{T}$ , the following inequality holds

\displaystyle\mathcal{U}(\mathcal{M},\mathcal{T})\leq 2\mathcal{R}(\mathcal{M}% )+\sqrt{\frac{2\log(2/\delta)}{T}},

(13)

where

\displaystyle\mathcal{R}(\mathcal{M})=\mathbb{E}_{\mathcal{T}}\mathbb{E}_{% \boldsymbol{\sigma}}\biggl{[}\sup_{\Pi\in\mathcal{M}}\frac{1}{T}\sum_{n=1}^{T}% \sigma_{n}\ell(\Pi,\rho(x_{n}),c_{n})\biggr{]}

(14)

is the Rademacher complexity of the set $\mathcal{M}$ of POVMs. In (14), $\boldsymbol{\sigma}=(\sigma_{1},\ldots,\sigma_{T})$ denotes a vector of $T$ i.i.d Rademacher variables $\sigma_{i}$ that takes value $\pm 1$ with equal probability.

An information-theoretic characterization of the Rademacher complexity (14) then yields the upper bound in (10).

IV Generalization Bounds for Adversarially Trained Quantum Classifiers

In this section, we present our main results, which provides information-theoretic upper bounds on the adversarial generalization error $\mathcal{G}_{p,\epsilon}(\Pi,\mathcal{T})$ defined in (9).

IV-A Key Technical Challenge

To derive upper bounds on the adversarial generalization error $\mathcal{G}_{p,\epsilon}(\Pi,\mathcal{T})$ , one can follow similar steps as discussed in Sec. III, targeting the adversarial uniform deviation bound

\mathcal{U}_{p,\epsilon}(\mathcal{M},\mathcal{T})=\sup_{\Pi\in\mathcal{M}}|L_{% p,\epsilon}(\Pi)-\widehat{L}_{p,\epsilon}(\Pi,\mathcal{T})|

(15)

on the adversarial generalization error $\mathcal{G}_{p,\epsilon}(\Pi,\mathcal{T})$ . The uniform deviation bound can be further upper bounded, as in Lemma 1, as a function of the adversarial Rademacher complexity

\displaystyle\mathcal{R}_{p,\epsilon}(\mathcal{M})=\mathbb{E}_{\mathcal{T}}% \mathbb{E}_{\boldsymbol{\sigma}}\biggl{[}\sup_{\Pi\in\mathcal{M}}\frac{1}{T}% \sum_{n=1}^{T}\sigma_{n}\ell_{p,\epsilon}(\Pi,\rho(x_{n}),c_{n})\biggr{]}.

(16)

Specifically, as in Lemma 1, with probability at least $1-\delta$ , for $\delta\in(0,1)$ , the following inequality holds

\displaystyle\mathcal{U}_{p,\epsilon}(\mathcal{M},\mathcal{T})\leq 2\mathcal{R% }_{p,\epsilon}(\mathcal{M})+\sqrt{\frac{2\log(2/\delta)}{T}}.

(17)

However, evaluating the adversarial Rademacher complexity $\mathcal{R}_{p,\epsilon}(\mathcal{M})$ is challenging. The function $\mathcal{R}_{p,\epsilon}(\mathcal{M})$ in (16) is defined using the adversarial loss $\ell_{p,\epsilon}(\Pi,\rho(x),c)$ , which entails a maximization problem over the set $\{\lambda:\lambda\succeq 0,{\rm Tr}(\lambda)=1,\|\rho(x)-\lambda\|_{p}\leq\epsilon\}$ of density matrices that satisfy the perturbation constraint. In the corresponding problem studied in [8] for classical adversarial learning, the relevant constraint imposes a bound on the $l_{\infty}$ -norm based perturbation of the classical input, and the resulting adversarial loss can be easily evaluated in closed form. In contrast, the constrained optimization underlying the quantum adversarial loss $\ell_{p,\epsilon}(\Pi,\rho(x),c)$ appears not to admit a closed-form solution in general.

IV-B Main Results

To state the main results, we make the following assumption.

Assumption 1.

The quantum embedding map $x\mapsto\rho(x)$ from classical input $x$ to density matrix $\rho(x)$ is such that the minimum eigenvalue of the density matrix $\rho(x)$ satisfies the inequality $\alpha_{\min}(\rho(x))\geq\Delta$ for some $\Delta\in(0,1/d]$ , where $d$ is the dimension of the Hilbert space.

Assumption 1 imposes a constraint on the entropy of the quantum embedding, requiring all quantum states $\rho(x)$ to have all non-zero eigenvalues, and hence maximum Rényi entropy of order zero [15]. In practice, the quantum embedding may be noisy, which is modelled by a CPTP map $\mathcal{E}(\cdot)$ , whereby the input classical data $x$ is mapped to a noisy state $\rho^{\prime}(x)$ as $x\mapsto\mathcal{E}(\rho(x))=\rho^{\prime}(x)$ . The minimal eigenvalue of the resulting noisy state $\rho^{\prime}(x)$ is greater than or equal to that of the clean state, i.e., $\alpha_{\min}(\mathcal{E}(\rho(x)))\geq\alpha_{\min}(\rho(x)).$ Thus, noisy quantum states can satisfy Assumption 1 even when corresponding clean states don’t.

The following theorem gives an upper bound on the adversarial generalization error $\mathcal{G}_{1,\epsilon}(\Pi)$ defined in (9) with $p=1$ .

Theorem 2.

Assume that the $K$ classes are equi-probable and that we have a $p=1$ -adversarial attack with a perturbation budget $\epsilon\leq 2\Delta$ . Under Assumption 1, the following upper bound on the adversarial generalization error $\mathcal{G}_{1,\epsilon}(\Pi,\mathcal{T})$ , for any POVM $\Pi\in\mathcal{M}$ , holds with probability at least $1-\delta$ , for $\delta\in(0,1)$ ,

\displaystyle\mathcal{G}_{1,\epsilon}(\Pi,\mathcal{T})\leq\mathcal{B}+2\sqrt{% \frac{K}{T}}\epsilon,

(18)

where $\mathcal{B}$ is as defined in (10).

The bound in (18) shows that the adversarial generalization error can be upper bounded in terms of the non-adversarial generalization bound (10) with an additional term that is directly proportional to the perturbation budget $\epsilon$ . This term quantifies the impact of the adversarial perturbation on generalization, and it recovers the bound (10) for $\epsilon=0$ . Furthermore, by the upper bound in (18), in the limit of a large number of observations $T$ , the adversarial generalization error vanishes. These results hold under the constraint $\epsilon\leq 2\Delta$ on the power of the adversary, which is more restrictive for less noisy quantum embeddings $\rho(x)$ with a smaller minimum eigenvalue $\alpha_{\min}(\rho(x))$ .

We now present an upper bound on the adversarial generalization error $\mathcal{G}_{\infty,\epsilon}(\Pi,\mathcal{T})$ under $\infty$ -Schatten norm attacks.

Theorem 3.

Assume that the $K$ classes are equi-probable and that we have a $p=\infty$ -adversarial attack with a perturbation budget $\epsilon\leq\Delta$ . Under Assumption 1, the following upper bound on the adversarial generalization error $\mathcal{G}_{\infty,\epsilon}(\Pi,\mathcal{T})$ , for any POVM $\Pi\in\mathcal{M}$ , holds with probability at least $1-\delta$ , for $\delta\in(0,1)$ ,

\displaystyle\mathcal{G}_{\infty,\epsilon}(\Pi,\mathcal{T})\leq\mathcal{B}+2d% \sqrt{\frac{K}{T}}\epsilon,

(19)

where $\mathcal{B}$ is as defined in (10).

For any given perturbation level $\epsilon$ , $p$ -adversarial attacks with $p=\infty$ are stronger than with $p=1$ , since they allow for perturbations in a larger volume of the Hilbert space. In a manner consistent with this observation, the additional term in the bound (19) is larger than in (18), with the relative increase factor equal to the dimension $d$ of the Hilbert space. The result holds under the more restrictive assumption $\epsilon\leq\Delta$ .

The generalization bounds derived in the previous two theorems vanish in the limit of a large number of samples, $T\rightarrow\infty$ . These results hold under Assumption 1, which requires the quantum embeddings to be sufficiently noisy, and on the stated upper bounds for the perturbation $\epsilon$ . As we show next, even when removing these assumptions, it is possible to show that the adversarial generalization error is given by the adversarial generalization bound (10) with the addition of a term proportional to the perturbation level $\epsilon$ . However, these additional terms do not vanish as $T$ increases. We leave it as an open problem to establish tighter bounds in this regime.

Theorem 4.

Assume that the $K$ classes are equi-probable and that we have a $p$ -adversarial attack with any perturbation budget $\epsilon\geq 0$ . For any POVM $\Pi\in\mathcal{M}$ , the following upper bound on the adversarial generalization error $\mathcal{G}_{p,\epsilon}(\Pi,\mathcal{T})$ holds with probability at least $1-\delta$ , for $\delta\in(0,1)$ ,

\displaystyle\mathcal{G}_{p,\epsilon}(\Pi,\mathcal{T})\leq\mathcal{B}+\begin{% cases}\epsilon\sqrt{2d(1+\frac{K-1}{T})},&\text{$p=1$}\\ 2\epsilon d\sqrt{(1+\frac{K-1}{T})},&\text{$p=\infty$}.\end{cases}

(20)

V Generalization Bounds Under Adversarial Mismatch

In the previous sections, we have considered the setting in which the quantum classifier is trained by assuming the same type of attacks encountered during testing. This is seldom true in practice: a quantum classifier $\Pi$ adversarially trained against $p$ -adversarial attacks with an $\epsilon$ -perturbation budget can encounter a generally different $p^{\prime}$ -adversarial attack with $\epsilon^{\prime}$ -budget during testing. In this section, we quantify the adversarial generalization error under adversarial mismatch.

We define the mismatched adversarial generalization error,

\displaystyle\mathcal{G}_{p,p^{\prime},\epsilon,\epsilon^{\prime}}(\Pi,% \mathcal{T})=L_{p^{\prime},\epsilon^{\prime}}(\Pi)-\widehat{L}_{p,\epsilon}(% \Pi,\mathcal{T}),

of a POVM $\Pi$ as the difference between the adversarial population risk $L_{p^{\prime},\epsilon^{\prime}}(\Pi)$ , evaluated under $p^{\prime}$ -adversarial attack with $\epsilon^{\prime}$ -perturbation budget, and the adversarial training risk $L_{p,\epsilon}(\Pi)$ , evaluated under $p$ -adversarial attack with $\epsilon$ -perturbation budget. To characterize the mismatched adversarial generalization error as a function of the generalization error $\mathcal{G}_{p,\epsilon}(\Pi,\mathcal{T})$ , we first define the following notion of relative strength of the adversaries.

Definition 1.

A $p$ -adversarial attack with perturbation budget $\epsilon\geq 0$ is said to be stronger than a $p^{\prime}$ -adversarial attack with perturbation budget $\epsilon^{\prime}\geq 0$ if the following inclusion condition

\{\lambda:D_{p}(\lambda,\rho)\leq\epsilon\}\supset\{\lambda:D_{p^{\prime}}(% \lambda,\rho)\leq\epsilon^{\prime}\}

(21)

holds for all density matrices $\rho$ . In this case, we also say that the second attack is weaker than the first.

The definition above is justified by the fact that a stronger attack, satisfying condition (21), would be able to further increase the adversarial loss (6) as compared to a weaker attack. The following lemma provides sufficient conditions that guarantee an adversary to be stronger than another.

Lemma 2.

A $p$ -adversarial attack with budget $\epsilon>0$ is stronger than a $p^{\prime}$ -adversarial attack with budget $\epsilon^{\prime}>0$ if

\displaystyle\epsilon^{\prime}<\begin{cases}d^{1/p^{\prime}-1/p}\epsilon,&p% \leq p^{\prime}\\ 2d^{1/p^{\prime}-1/p-1}\epsilon,&p>p^{\prime}\end{cases}

With these definitions, we have the following result.

Theorem 5.

Assume that the quantum classifier is adversarially trained assuming a $p$ -adversarial attack with perturbation budget $\epsilon\geq 0$ , while a $p^{\prime}$ -adversarial attack with perturbation budget $\epsilon^{\prime}\geq 0$ affects the quantum embeddings during testing. If the training adversarial attack is stronger than the testing adversarial attack, the following relation holds,

\displaystyle\mathcal{G}_{p,\epsilon}(\Pi,\mathcal{T})-\xi\leq\mathcal{G}_{p,p% ^{\prime}\epsilon,\epsilon^{\prime}}(\Pi,\mathcal{T})\leq\mathcal{G}_{p,% \epsilon}(\Pi,\mathcal{T}),

(22)

where

\xi=d^{(1-1/p^{\prime})}\epsilon^{\prime}+d^{(1-1/p)}\epsilon

is a function of the parameters $(p,p^{\prime},\epsilon,\epsilon^{\prime})$ . If the training adversary is weaker than the testing adversary, we have

\displaystyle\mathcal{G}_{p,\epsilon}(\Pi,\mathcal{T})\leq\mathcal{G}_{p,p^{% \prime}\epsilon,\epsilon^{\prime}}(\Pi,\mathcal{T})\leq\mathcal{G}_{p,\epsilon% }(\Pi,\mathcal{T})+\xi.

(23)

Theorem 5 gives insights on how best to adversarially train the classifier so that it generalizes well when tested against a possibly different adversary. In particular, the upper bound (22) guarantees that if the training adversary is stronger than the testing adversary, the mismatched generalization error is no larger than the generalization error $\mathcal{G}_{p,\epsilon}(\Pi,\mathcal{T})$ obtained when the stronger attacker is also present at test time. From Lemma 2, a way to ensure a stronger attacker at training time is to train assuming $p=\infty$ and a sufficiently large $\epsilon$ . Conversely, by (23), assuming a weaker adversary during training yields a mismatched generalization error that can exceed the generalization error $\mathcal{G}_{p,\epsilon}(\Pi,\mathcal{T})$ with the weaker test-time attacker by a non-vanishing (with $T$ ) term $\xi.$

VI Examples and Final Remarks

Figure 2: True generalization errors for non-adversarial (

\mathcal{G}(\Pi,\mathcal{T})

) and adversarial (

\mathcal{G}_{1,\epsilon}(\Pi,\mathcal{T})

) settings, compared with numerically evaluated uniform deviation bounds (13) and (17) and derived bounds as a function of the training set size

T

: (left)

\epsilon=0.08\leq 2\Delta=0.1

, and (right)

\epsilon=0.12>2\Delta

We consider a quantum binary classification problem with equi-probable class labels $c\in\{0,1\}$ . For each class $c$ , we obtain the discrete-valued input $x$ by finely quantizing a continuous-valued feature input $\tilde{x}\in\mathbb{R}$ so that the discrete sum in (11) can be evaluated via numerical integration [11]. The input $\tilde{x}$ is sampled from the conditional Gaussian distribution $P(x|c)=\mathcal{N}(\mu_{c},1)$ with mean $\mu_{c}=(-1)^{c}$ . We consider a depolarized quantum embedding, with noise strength $q\in(0,1)$ , that maps $x$ to the quantum state $\rho(x)=(1-q)|x\rangle\langle x|+qI/d$ , where the pure state $|x\rangle$ is obtained as

\displaystyle|x\rangle=U_{\theta}(x)|0\rangle,\;U_{\theta}(x)=R_{X}(x){\rm Rot% }_{\theta}R_{X}(x),

(24)

with $\theta=(\theta_{1},\theta_{2},\theta_{3})\in[0,2\pi)^{3}.$ Here, ${\rm Rot}_{\theta}=\exp(-i\vec{\theta}\cdot\vec{\sigma})$ and $R_{X}(x)={\rm Rot}_{(x,0,0)}$ are single qubit rotation gates, where $\vec{\sigma}=(\sigma_{x},\sigma_{y},\sigma_{z})$ denotes the vector of the Pauli matrices. In our experiments, we fix $\theta=(\pi/4,\pi/4,\pi/4)$ and $q=0.05$ , which results in $\Delta=0.05$ .

In Fig. 2, we plot the true non-adversarial and adversarial generalization errors, i.e., $\mathcal{G}(\Pi,\mathcal{T})$ and $\mathcal{G}_{1,\epsilon}(\Pi,\mathcal{T})$ (for $p=1$ ) respectively, for the POVM $\Pi=\{|0\rangle\langle 0|,|1\rangle\langle 1|\}$ when $\epsilon\leq 2\Delta$ (left) and $\epsilon>2\Delta$ (right) as a function of the training set size $T$ . To validate our analysis, we also evaluate numerically the Rademacher complexity based uniform deviation bounds (13) and (17) for non-adversarial and adversarial errors with $\delta=0.8$ ; and we plot the derived adversarial upper bounds (18) (left) and (20) (right), along with the non-adversarial bound in (10).

The true generalization bounds follow a similar trend in both plots, with the adversarial generalization error being larger than the non-adversarial counterpart, and with both errors tending to $0$ for large values of the data set size $T$ . Furthermore, when the adversary’s perturbation is limited as $\epsilon\leq 2\Delta$ , this behaviour is reproduced by the derived upper bound in Theorem 2. From the uniform deviation bounds it can be seen that the adversarial Rademacher complexity exceeds the non-adversarial Rademacher complexity. For the case when $\epsilon>2\Delta$ , i.e., when Assumption 1 is not satisfied, while capturing the general decrease with $T$ of the generalization error, our bound (20) is loose. We leave it as an open problem to derive tighter bounds in this regime. This observation also suggests that Assumption 1 is only instrumental in facilitating the derivation of the bound, which requires the optimization over the attacker’s channel, rather than indicating a “phase transition” in the generalization behavior.

Acknowledgments

The work of OS was supported by an Open Fellowship of the EPSRC with reference EP/W024101/1, by the EPSRC project EP/X011852/1, and by the European Union’s Horizon Europe Project CENTRIC under Grant 101096379. PG wishes to thank Mr. Charalampos Perdikis for the many useful discussions about code optimization and the use of multiprocessing and multithreading in accelerating his code.

References

[1] M. Schuld and F. Petruccione, Machine learning with quantum computers. Springer, 2021.
[2] O. Simeone et al., “An introduction to quantum machine learning for engineers,” Foundations and Trends® in Signal Processing, vol. 16, no. 1-2, pp. 1–223, 2022.
[3] L. Davidovich, “Quantum sensing: Beyond the classical limits of precision,” 2024.
[4] V. Havlíček, A. D. Córcoles, K. Temme, A. W. Harrow, A. Kandala, J. M. Chow, and J. M. Gambetta, “Supervised learning with quantum-enhanced feature spaces,” Nature, vol. 567, no. 7747, pp. 209–212, 2019.
[5] S. Lu, L.-M. Duan, and D.-L. Deng, “Quantum adversarial machine learning,” Physical Review Research, vol. 2, no. 3, aug 2020. [Online]. Available: https://doi.org/10.1103/physrevresearch.2.033212
[6] M. T. West, S. M. Erfani, C. Leckie, M. Sevior, L. C. Hollenberg, and M. Usman, “Benchmarking adversarially robust quantum machine learning at scale,” Physical Review Research, vol. 5, no. 2, p. 023186, 2023.
[7] W. Ren, W. Li, S. Xu, K. Wang, W. Jiang, F. Jin, X. Zhu, J. Chen, Z. Song, P. Zhang et al., “Experimental quantum adversarial learning with programmable superconducting qubits,” Nature Computational Science, vol. 2, no. 11, pp. 711–717, 2022.
[8] D. Yin, R. Kannan, and P. Bartlett, “Rademacher complexity for adversarially robust generalization,” in International conference on machine learning. PMLR, 2019, pp. 7085–7094.
[9] P. Awasthi, N. Frank, and M. Mohri, “Adversarial learning guarantees for linear hypotheses and neural networks,” in International Conference on Machine Learning. PMLR, 2020, pp. 431–441.
[10] J. Xiao, Y. Fan, R. Sun, and Z.-Q. Luo, “Adversarial rademacher complexity of deep neural networks,” arXiv preprint arXiv:2211.14966, 2022.
[11] L. Banchi, J. Pereira, and S. Pirandola, “Generalization in quantum machine learning: A quantum information standpoint,” PRX Quantum, vol. 2, no. 4, p. 040321, 2021.
[12] M. C. Caro, H.-Y. Huang, M. Cerezo, K. Sharma, A. Sornborger, L. Cincio, and P. J. Coles, “Generalization in quantum machine learning from few training data,” Nature communications, vol. 13, no. 1, p. 4919, 2022.
[13] M. C. Caro, H.-Y. Huang, N. Ezzell, J. Gibbs, A. T. Sornborger, L. Cincio, P. J. Coles, and Z. Holmes, “Out-of-distribution generalization for learning quantum dynamics,” Nature Communications, vol. 14, no. 1, p. 3751, 2023.
[14] S. Shalev-Shwartz and S. Ben-David, Understanding machine learning: From theory to algorithms. Cambridge university press, 2014.
[15] M. Müller-Lennert, F. Dupuis, O. Szehr, S. Fehr, and M. Tomamichel, “On quantum rényi entropies: A new generalization and some properties,” Journal of Mathematical Physics, vol. 54, no. 12, 2013.
[16] U. Haagerup, “The best constants in the khintchine inequality,” Studia Mathematica, vol. 70, no. 3, pp. 231–283, 1981.
[17] E. H. L. Keith Ball, Eric A. Karlsen, “Sharp uniform convexity and smoothness inequalities for trace norms.” Inventiones Mathematicae, vol. 115, p. 463–482, 1994.

Appendix A Proof of Theorem 2 and Theorem 3

The key idea of the proofs is to upper bound the adversarial Rademacher complexity (16) via the non-adversarial Rademacher complexity (14) and an additional term that accounts for the impact of perturbation. To this end, we equivalently write the adversarial Rademacher complexity (16) as

	$\displaystyle\mathcal{R}_{p,\epsilon}(\mathcal{M})=\mathbb{E}_{\mathcal{T}}% \mathbb{E}_{\boldsymbol{\sigma}}\biggl{[}\sup_{\Pi\in\mathcal{M}}\frac{1}{T}% \sum_{n=1}^{T}\sigma_{n}\times$
	$\displaystyle\max_{\lambda:D_{p}(\lambda,\rho(x_{n}))\leq\epsilon}\Bigl{(}1-{% \rm Tr}(\Pi_{c_{n}}\rho(x_{n}))-{\rm Tr}(\Pi_{c_{n}}(\lambda-\rho(x_{n})))% \Bigr{)}\biggr{]}.$		(25)

Using the inequality $\sup(f+g)\leq\sup f+\sup g$ in the upper bound (25), we obtain

\displaystyle\mathcal{R}_{p,\epsilon}(\mathcal{M})\leq\mathcal{R}(\mathcal{M})% +\Delta\mathcal{R}_{p,\epsilon}(\mathcal{M}),

(26)

where $\mathcal{R}(\mathcal{M})$ is as defined in (14), and

	$\displaystyle\Delta\mathcal{R}_{p,\epsilon}(\mathcal{M})=\mathbb{E}_{\mathcal{% T}}\mathbb{E}_{\boldsymbol{\sigma}}\biggl{[}\sup_{\Pi\in\mathcal{M}}\frac{1}{T% }\sum_{n=1}^{T}\sigma_{n}\times$
	$\displaystyle\max_{\lambda:D_{p}(\lambda,\rho(x_{n}))\leq\epsilon}{\rm Tr}% \Bigl{(}\Pi_{c_{n}}(\rho(x_{n})-\lambda)\Bigr{)}\biggr{]}$

may be defined as the perturbation Rademacher complexity.

We continue by writing the POVM elements $\Pi_{c}$ in terms of their eigendecomposition as $\Pi_{c}=U_{c}\bar{\Pi}_{c}U_{c}^{\dagger}$ , where $\bar{\Pi}_{c}$ denotes the diagonal matrix of eigenvalues. Using the cyclic property of the trace, $\Delta\mathcal{R}_{p,\epsilon}(\mathcal{M})$ can be equivalently written as

$\displaystyle\Delta\mathcal{R}_{p,\epsilon}(\mathcal{M})$	$\displaystyle=\mathbb{E}_{\mathcal{T}}\mathbb{E}_{\boldsymbol{\sigma}}\biggl{[% }\sup_{\Pi\in\mathcal{M}}\frac{1}{T}\sum_{n=1}^{T}\sigma_{n}\times$
	$\displaystyle\max_{\lambda:D_{p}(\lambda,\rho(x_{n}))\leq\epsilon}{\rm Tr}% \Bigl{(}\bar{\Pi}_{c_{n}}U^{\dagger}_{c_{n}}(\rho(x_{n})-\lambda)U_{c_{n}}% \Bigr{)}\biggr{]}$
	$\displaystyle=\mathbb{E}_{\mathcal{T}}\mathbb{E}_{\boldsymbol{\sigma}}\biggl{[% }\sup_{\Pi\in\mathcal{M}}\frac{1}{T}\sum_{c=1}^{K}\sum_{n=1}^{T}\mathds{1}(c_{% n}=c)\sigma_{n}\times$
	$\displaystyle\max_{\lambda:D_{p}(\lambda,\rho(x_{n}))\leq\epsilon}{\rm Tr}(% \bar{\Pi}_{c}\bar{\tau}(x_{n},c)\Bigr{)}\biggr{]},$	(27)

where we have defined $\bar{\tau}(x_{n},c)=U_{c}^{\dagger}(\rho(x_{n})-\lambda)U_{c}$ .

We now proceed to upper bound $\Delta\mathcal{R}_{p,\epsilon}(\mathcal{M})$ for the case of $p=1$ , which gives the required upper bound in Theorem 2.

A-A $D_{1}(\cdot,\cdot)$ perturbation Rademacher complexity

For fixed $c$ , the inner maximization in (27) is achieved when $\bar{\tau}(x_{n},c)$ is diagonal with entries

\displaystyle(\bar{\tau}(x_{n},c))_{ii}=\begin{cases}+\frac{\epsilon}{2}&\text% {if $(\bar{\Pi}_{c})_{ii}=\alpha_{\max}(\Pi_{c})$}\\ -\frac{\epsilon}{2}&\text{if $(\bar{\Pi}_{c})_{ii}=\alpha_{\min}(\Pi_{c})$}\\ 0&\text{otherwise},\end{cases}

(28)

where $\alpha_{\max}(\cdot)$ and $\alpha_{\min}(\cdot)$ respectively denote the maximum and minimum eigenvalues of ‘ $\cdot$ ’. It can be verified that this choice of $\bar{\tau}(x_{n},c)$ yields a physical density matrix $\lambda$ . In particular, the condition $\epsilon\leq 2\Delta$ guarantees that the minimal eigenvalue of $\lambda$ is positive (for 2 linear operators $A$ , $B$ , we have $\alpha_{\min}(A-B)\geq\alpha_{\min}(A)-\alpha_{\max}(B)$ ).

Now, defining $Q_{\boldsymbol{\sigma},c}=\sum_{n=1}^{T}\sigma_{n}\mathds{1}(c_{n}=c)\bar{\tau% }(x_{n},c),$ we can re-write (27) as

\displaystyle\Delta\mathcal{R}_{1,\epsilon}(\mathcal{M})=\mathbb{E}_{\mathcal{% T}}{\mathbb{E}}_{\boldsymbol{\sigma}}\Bigl{[}\sup_{\Pi}\frac{1}{T}\sum_{c}{\rm Tr% }(\bar{\Pi}_{c}Q_{\boldsymbol{\sigma},c})\Bigr{]}.

Applying Hölder’s inequality, we get the relation ${\rm Tr}(\bar{\Pi}_{c}Q_{\boldsymbol{\sigma},c})\leq\|Q_{\boldsymbol{\sigma},c% }\|_{1}\|\bar{\Pi}_{c}\|_{\infty}\leq\|Q_{\boldsymbol{\sigma},c}\|_{1}$ since $\lVert\bar{\Pi_{c}}\rVert_{\infty}\leq 1$ . Subsequently, we have

\displaystyle\Delta\mathcal{R}_{1,\epsilon}(\mathcal{M})\leq\mathbb{E}_{% \mathcal{T}}{\mathbb{E}}_{\boldsymbol{\sigma}}\Bigl{[}\sup_{\Pi}\frac{1}{T}% \sum_{c}\|Q_{\boldsymbol{\sigma},c}\|_{1}\Bigr{]}.

Since $Q_{\boldsymbol{\sigma},c}$ is diagonal, the trace norm $\|Q_{\boldsymbol{\sigma},c}\|_{1}$ evaluates as the sum of the absolute values of its diagonal elements. We thus have

\displaystyle\Delta\mathcal{R}_{1,\epsilon}(\mathcal{M})\leq\frac{2}{T}{% \mathbb{E}}_{\mathcal{T}}\mathbb{E}_{\boldsymbol{\sigma}}\biggl{[}\sum_{c}% \Bigl{|}\sum_{n=1}^{T}\frac{\epsilon}{2}\sigma_{n}\mathds{1}(c_{n}=c)\Bigr{|}% \biggr{]}.

(29)

Let $T_{c}$ denote the number of examples in the training set $\mathcal{T}$ that belongs to class $c$ . Then, for $K$ equiprobable classes, the upper bound (29) evaluates as

\displaystyle\Delta\mathcal{R}_{1,\epsilon}(\mathcal{M})\leq\frac{K}{T}{% \mathbb{E}}_{\mathcal{T}}\mathbb{E}_{\boldsymbol{\sigma}}\mathbb{E}_{c}\biggl{% [}\Bigl{|}\sum_{n=1}^{T_{c}}\epsilon\sigma_{n}\Bigr{|}\biggr{]}.

Using Khintchine’s inequality (see, e.g., [16]), we have $\mathbb{E}_{\boldsymbol{\sigma}}[|\sum_{n=1}^{T_{c}}\epsilon\sigma_{n}|]\leq% \epsilon\sqrt{T_{c}}$ . This results in

\displaystyle\Delta\mathcal{R}_{1,\epsilon}(\mathcal{M})\leq\epsilon\frac{K}{T% }\mathbb{E}_{\mathcal{T}}\mathbb{E}_{c}[\sqrt{T_{c}}]\leq\epsilon\frac{K}{T}% \sqrt{\mathbb{E}_{\mathcal{T}}\mathbb{E}_{c}[T_{c}]},

where the last inequality is due to Jensen’s inequality. Finally, noting that the classes are equi-probable, the expected value of $T_{c}$ evaluates as $T/K$ , yielding

\displaystyle\Delta\mathcal{R}_{1,\epsilon}(\mathcal{M})\leq\epsilon\sqrt{% \frac{K}{T}}.

Using this in (26), together with the upper bound in (10) returns the upper bound of (18).

We now upper bound $\Delta\mathcal{R}_{p,\epsilon}(\mathcal{M})$ for $p=\infty$ , which gives the required upper bound in Theorem 3.

A-B $D_{\infty}(\cdot,\cdot)$ perturbation Rademacher complexity

To upper bound $\Delta\mathcal{R}_{p,\epsilon}(\mathcal{M})$ in (27), we start by arranging the set $\{\alpha(\Pi_{c})_{i}\}$ of eigenvalues of $\Pi_{c}$ in increasing order in $i$ . Then, define the median eigenvalue as

\displaystyle\alpha_{{\rm med}}=\begin{cases}\frac{\alpha(\Pi_{c})_{d/2}+% \alpha(\Pi_{c})_{d/2+1}}{2}&\mbox{if}\hskip 2.84544ptd\mod 2=0\\ \alpha(\Pi_{c})_{\lceil d/2\rceil}&\mbox{if}\hskip 2.84544ptd\mod 2=1.\end{cases}

For fixed $c$ , the inner maximization in (27) is achieved when $\bar{\tau}(x_{n},c)$ is diagonal with entries

\displaystyle(\bar{\tau}(x_{n},c))_{ii}=\epsilon\operatorname{sgn}({\rm{diag}}% (\bar{\Pi}_{c}-\alpha_{{\rm{med}}}I)_{i}).

It can be verified that this choice of $\bar{\tau}(x_{n},c)$ yields a physical density matrix $\lambda$ . As before, the condition $\epsilon\leq\Delta$ ensures that the minimum eigenvalue of $\lambda$ is positive.

We now proceed with the same steps as in the previous proof for the $D_{1}(\cdot,\cdot)$ attack. We define $Q_{\boldsymbol{\sigma},c}=\sum_{n=1}^{T}\sigma_{n}\mathds{1}(c_{n}=c)\bar{\tau% }(x_{n},c)$ and use it to re-write $\Delta\mathcal{R}_{p,\epsilon}(\mathcal{M})$ . Applying Hölder’s inequality and evaluating the trace norm $\|Q_{\boldsymbol{\sigma},c}\|_{1}$ as the sum of the absolute values of its diagonal elements, we arrive at an inequality analogous to (29), namely

\displaystyle\Delta\mathcal{R}_{\infty,\epsilon}(\mathcal{M})\leq\frac{d}{T}% \mathbb{E}_{\mathcal{T}}\mathbb{E}_{\boldsymbol{\sigma}}\biggl{[}\sum_{c}\Bigl% {|}\sum_{n=1}^{T}\epsilon\sigma_{n}\mathds{1}(c_{n}=c)\Bigr{|}\biggr{]}.

(30)

Again, following the same steps as before, we get

\displaystyle\Delta\mathcal{R}_{\infty,\epsilon}(\mathcal{M})\leq d\epsilon% \sqrt{\frac{K}{T}}.

Using this in (26), together with the upper bound in (10) returns the upper bound of (19).

Appendix B Proof of Theorem 4

To obtain the required bound, we proceed as in the proof of Theorem 2 in Appendix A. An upper bound on the adversarial Rademacher complextiy $\mathcal{R}_{p,\epsilon}(\mathcal{M})$ can be obtained as in (26), in terms of the standard Rademacher complexity and the perturbation Rademacher complexity. The latter then evaluates as in (27). Let $\tau^{*}(x_{n},c)$ denote the perturbation matrix that achieves the inner maximization in (27). Subsequently, defining $Q_{\boldsymbol{\sigma},c}=\sum_{n=1}^{T}\sigma_{n}\mathds{1}(c_{n}=c)\tau^{*}(% x_{n},c),$ we re-write $\Delta\mathcal{R}_{p,\epsilon}(\mathcal{M})$ as

\displaystyle\Delta\mathcal{R}_{p,\epsilon}(\mathcal{M})={\mathbb{E}}_{% \mathcal{T}}{\mathbb{E}}_{\boldsymbol{\sigma}}\biggl{[}{\sup}_{\Pi\in\mathcal{% M}}\frac{1}{T}\sum_{c}\rm Tr(\Pi_{c}Q_{c,\boldsymbol{\sigma}})\biggr{]}.

Employing Hölder’s inequality yields that ${\rm Tr}(\Pi_{c}Q_{c,\boldsymbol{\sigma}})\leq\|\Pi_{c}\|_{2}\|Q_{c,% \boldsymbol{\sigma}}\|_{2}\leq\sqrt{d}\|Q_{c,\boldsymbol{\sigma}}\|_{2}$ , where the last inequality follows since $0\leq\Pi_{c}\leq I$ . This results in the following upper bound

\displaystyle\Delta\mathcal{R}_{p,\epsilon}(\mathcal{M})\leq{\mathbb{E}}_{% \mathcal{T}}{\mathbb{E}}_{\boldsymbol{\sigma}}\Bigl{[}{\sup}_{\Pi\in\mathcal{M% }}\frac{1}{T}\sum_{c}\sqrt{d}\lVert Q_{c,\sigma}\rVert_{2}\Bigr{]}.

(31)

We now evaluate the 2-norm $\lVert Q_{c,\sigma}\rVert_{2}$ , which can be written as

	$\displaystyle\lVert Q_{c,\boldsymbol{\sigma}}\rVert_{2}$
	$\displaystyle=\biggl{(}{\rm Tr}\biggl{(}\sum_{n,m=1}^{T}\sigma_{n}\sigma_{m}% \mathds{1}(c_{m}=c_{n}=c)\tau^{}(x_{n},c)\tau^{}(x_{m},c)\biggr{)}\biggr{)}^% {\frac{1}{2}}$
	$\displaystyle=\Bigg{(}{\rm Tr}\biggl{(}\sum_{n=1}^{T}\mathds{1}(c_{n}=c)\tau^{% *}(x_{n},c)^{2}+\sum_{n\neq m}(\mathds{1}(\sigma_{n}\sigma_{m}=1)$
	$\displaystyle-\mathds{1}(\sigma_{n}\sigma_{m}=-1))(\mathds{1}(c_{m}=c_{n}=c)% \tau^{}(x_{n},c)\tau^{}(x_{m},c))\biggr{)}\Bigg{)}^{\frac{1}{2}}.$		(32)

In the following subsections, we consider the two cases $p=1$ and $p=\infty$ , and obtain respective upper bounds on (32).

Furthermore, using the shorthand notation $\tau_{n}=\tau^{*}(x_{n},c)$ , we note that

\displaystyle-\lVert\tau_{n}\tau_{m}\rVert_{1}\leq\rm Tr(\tau_{n}\tau_{m})\leq% \lVert\tau_{n}\tau_{m}\rVert_{1}

(33)

which will be used to obtain a worst case upper bound on $\lVert Q_{c,\boldsymbol{\sigma}}\rVert_{2}$ .

B-A $D_{1}(\cdot,\cdot)$ perturbation Rademacher complexity

Using Hölder’s inequality, we get that $\lVert\tau_{n}\tau_{m}\rVert_{1}\leq\lVert\tau_{n}\rVert_{1}\lVert\tau_{m}% \rVert_{\infty}$ , where $\lVert\tau_{n}\rVert_{1}\leq\epsilon$ . We now consider $\lVert\tau_{m}\rVert_{\infty}$ . We write $\tau_{m}$ in its diagonal basis via a unitary transform $U_{m}$ as a matrix of positive (P) and a matrix of negative (N) eigenvalues $\tau_{m}=U_{m}(P+N)U_{m}^{\dagger}$ . The trace condition $\rm Tr\tau_{m}=0$ implies that $\|P\|_{1}=\|N\|_{1}\leq\epsilon/2$ . The $\infty$ -Schatten norm gives the eigenvalue of $\tau_{m}$ with maximal absolute value, which according to the norm bounds on $P$ and $N$ cannot exceed $\epsilon/2$ . Thus $\|\tau_{m}\|_{\infty}\leq\epsilon/2$ which together with $\|\tau_{m}|_{1}\leq\epsilon$ yields $\lVert\tau_{n}\tau_{m}\rVert_{1}\leq\epsilon^{2}/2$ . Using this in (33) yields

\displaystyle-\frac{\epsilon^{2}}{2}\leq{\rm Tr}(\tau_{n}\tau_{m})\leq\frac{% \epsilon^{2}}{2}\;\forall\;n,m.

Using this in (32), we get the following worst case upper bound:

\displaystyle\lVert Q_{c,\boldsymbol{\sigma}}\rVert_{2}^{2}\leq\frac{\epsilon^% {2}}{2}\Bigg{(}\sum_{n=1}^{T}\mathds{1}(c_{n}=c)+\sum_{n\neq m}\mathds{1}(c_{m% }=c_{n}=c)\Bigg{)}.

Plugging this in (31) for $p=1$ , and assuming $K$ equiprobable classes, we can now upper bound $\mathcal{R}_{1,\epsilon}(\mathcal{M})$ as

	$\displaystyle\mathcal{R}_{1,\epsilon}(\mathcal{M})\leq\frac{\sqrt{d}K}{T}% \mathbb{E}_{\mathcal{T}}{\mathbb{E}}_{\boldsymbol{\sigma}}\mathbb{E}_{c}$
	$\displaystyle\biggl{[}\left(\frac{\epsilon^{2}}{2}\Bigg{(}\sum_{n=1}^{T}% \mathds{1}(c_{n}=c)+\sum_{n\neq m}\mathds{1}(c_{m}=c_{n}=c)\Bigg{)}\right)^{% \frac{1}{2}}\biggr{]}.$

Finally, taking the expectation over the training set $\mathcal{T}$ inside the square root by application of Jensen’s inequality yields the following upper bound

\displaystyle\mathcal{R}_{1,\epsilon}(\mathcal{M})\leq\sqrt{\frac{\epsilon^{2}% }{2}d(1+\frac{K-1}{T})}.

Plugging this in (26) and using the upper bound in (10) yields the required bound.

B-B $D_{\infty}(\cdot,\cdot)$ perturbation Rademacher complexity

Under the $D_{\infty}(\cdot,\cdot)$ distance we have that $\lVert\tau_{n}\rVert_{\infty}\leq\epsilon$ , which implies that $\lVert\tau_{n}\rVert_{1}\leq\epsilon d$ . Using Hölder’s inequality, we have $\lVert\tau_{n}\tau_{m}\rVert_{1}\leq\lVert\tau_{n}\rVert_{1}\lVert\tau_{m}% \rVert_{\infty},$ which together with (33) yields:

\displaystyle-\epsilon^{2}d\leq{\rm Tr}(\tau_{n}\tau_{m})\leq\epsilon^{2}d\;% \forall\;n,m.

Using this, we get the following worst case upper bound on $\lVert Q_{c,\boldsymbol{\sigma}}\rVert_{2}^{2}$

\displaystyle\lVert Q_{c,\boldsymbol{\sigma}}\rVert_{2}^{2}\leq\epsilon^{2}d% \Bigg{(}\sum_{n=1}^{T}\mathds{1}(c_{n}=c)+\sum_{n\neq m}\mathds{1}(c_{m}=c_{n}% =c)\Bigg{)}.

Retracing the remaining steps of the $D_{1}(\cdot,\cdot)$ adversary derivation yields

\displaystyle\mathcal{R}_{\infty,\epsilon}(\mathcal{M})\leq\epsilon d\sqrt{1+% \frac{K-1}{T}}

which together with (26) and (10) concludes the proof.

Appendix C Proof of Lemma 2

To derive the required relation, we start by noting that $p$ -Schatten distance between the states $\rho$ and $\lambda$ is defined as $D_{p}(\rho,\lambda)=\|\rho-\lambda\|_{p}.$ Thus, defining the matrix $\tau=\rho-\lambda$ , the distance condition can be written as

D_{p}(\rho,\lambda)=\|\tau\|_{p}\leq\epsilon.

Furthermore, we remind ourselves of the generalized version of Hölder’s inequality [17] for matrices $A,\;B$

\|AB\|_{r}\leq\|A\|_{p}\|B\|_{q},\;1/r=1/p+1/q.

Firstly we prove the inequality for the case $p\leq p^{\prime}$ . Assume that $D_{p}(\rho,\lambda)\leq\epsilon$ for some $\epsilon\geq 0$ . Then the following set of relations hold,

	$\displaystyle\\|\tau\\|_{p^{\prime}}=\\|I\;\tau\\|_{p^{\prime}}$	$\displaystyle\leq\\|\tau\\|_{p}^{\prime}\\|I\\|_{pp^{\prime}/(p^{\prime}-p)}$
		$\displaystyle=d^{1/p-1/p^{\prime}}\\|\tau\\|_{p},$

where in the first line we inserted the identity operator $I$ and in the second applied Hölder’s inequality. This in turn implies that

\displaystyle D_{p}(\rho,\lambda)\geq d^{1/p^{\prime}-1/p}D_{p^{\prime}}(\rho,\lambda)

(34)

The inequality (34) implies that a $p$ -adversary with perturbation budget $\epsilon$ can access all states within the set

\{\lambda:D_{p^{\prime}}(\rho,\lambda)\leq d^{1/p^{\prime}-1/p}\epsilon\}.

Thus, if we have a $p$ -adversary with perturbation budget $\epsilon$ at training, and a $p^{\prime}$ -adversary with perturbation budget $\epsilon^{\prime}$ during testing, the training adversary is stronger than the testing adversary if

\displaystyle\epsilon^{\prime}<d^{1/p^{\prime}-1/p}\epsilon.

(35)

We now proceed to the case of $p>p^{\prime}$ . Consider the set $\{\lambda:D_{p}(\lambda,\rho)\leq\epsilon\}$ . With the definitions above, this can be equivalently written as $\lVert\tau\rVert_{p}\leq\epsilon$ . Holder’s inequality implies that $\lVert\tau\rVert_{1}\leq\lVert\tau\rVert_{p}d^{1-1/p}$ . Furthermore, as shown in appendix B, $\lVert\tau_{\infty}\leq\lVert\tau\rVert_{1}/2$ . Thus

\lVert\tau\rVert_{p}\leq\epsilon\implies\lVert\tau\rVert_{\infty}\leq\frac{% \epsilon}{2}d^{(1-1/p)}.

To lower bound $\lVert\tau\rVert_{\infty}$ we again use Hölder’s inequality; $\lVert\tau\rVert_{p}\leq\lVert\tau\rVert_{\infty}\lVert I\rVert_{p}$ . Thus

\lVert\tau\rVert_{p}\leq\epsilon\implies\lVert\tau\rVert_{\infty}\geq\epsilon d% ^{(-1/p)}.

Therefore a sufficient condition to enforce the inclusion condition is that the lower bound on the $\infty$ -Schatten norm of the $p$ -adversarial attack is greater than the upper bound on the $\infty$ -Schatten norm of the $p^{\prime}$ -adversarial attack

\displaystyle\epsilon^{\prime}<2d^{(1/p^{\prime}-1/p-1)}\epsilon

(36)

This concludes the proof.

Appendix D Proof of Theorem 5

To obtain bounds on the mismatched adversarial generalization error, we first decompose it as

\displaystyle\mathcal{G}_{p,p^{\prime},\epsilon,\epsilon^{\prime}}(\Pi,% \mathcal{T})=\nu_{p,p^{\prime};\epsilon,\epsilon^{\prime}}(\Pi)+\mathcal{G}_{p% ,\epsilon}(\Pi,\mathcal{T}),

(37)

where $\nu_{p,p^{\prime};\epsilon,\epsilon^{\prime}}(\Pi)=L_{p^{\prime},\epsilon^{% \prime}}(\Pi)-L_{p,\epsilon}(\Pi)$ is the difference in adversarial population risks due to adversarial mismatch and $\mathcal{G}_{p,\epsilon}(\Pi,\mathcal{T})$ , defined as in (9), is the adversarial generalization error with no adversarial mismatch.

We now derive the bounds stated in Theorem 5. To this end we consider the following two cases:

•

The training adversary is stronger than the testing adversary, as defined in Definition 1, i.e. $\{\lambda:D_{p}(\lambda,\rho(x))\leq\epsilon\}\supset\{\lambda:D_{p^{\prime}}(% \lambda,\rho(x))\leq\epsilon^{\prime}\}$ . Then we have

	$\displaystyle\underset{P(x,c)}{\mathbb{E}}\Bigl{[}\underset{\lambda:D_{p}(% \lambda,\rho(x))\leq\epsilon}{\min}({\rm Tr}\Pi_{c}\lambda)\Bigr{]}$
	$\displaystyle\leq\underset{P(x,c)}{\mathbb{E}}\Bigl{[}\underset{\lambda^{% \prime}:D_{p^{\prime}}(\lambda^{\prime},\rho(x))\leq\epsilon^{\prime}}{\min}({% \rm Tr}\Pi_{c}\lambda^{\prime})\Bigr{]}.$

This implies that $\nu_{p,p^{\prime};\epsilon,\epsilon^{\prime}}(\Pi)\leq 0$ . Using this in (37) yields the following upper bound $\mathcal{G}_{p,p^{\prime},\epsilon,\epsilon^{\prime}}(\Pi,\mathcal{T})\leq% \mathcal{G}_{p,\epsilon}(\Pi,\mathcal{T})$ .

•

The training adversary is weaker than the testing adversary i.e. $\{\lambda:D_{p}(\lambda,\rho(x))\leq\epsilon\}\subset\{\lambda:D_{p^{\prime}}(% \lambda,\rho(x))\leq\epsilon^{\prime}\}$ . Then we have

	$\displaystyle\underset{P(x,c)}{\mathbb{E}}\Bigl{[}\underset{\lambda:D_{p}(% \lambda,\rho(x))\leq\epsilon}{\min}({\rm Tr}\Pi_{c}\lambda)\Bigr{]}$
	$\displaystyle\geq\underset{P(x,c)}{\mathbb{E}}\Bigl{[}\underset{\lambda^{% \prime}:D_{p^{\prime}}(\lambda^{\prime},\rho(x))\leq\epsilon^{\prime}}{\min}({% \rm Tr}\Pi_{c}\lambda^{\prime})\Bigr{]}$

This in turn implies that $\nu_{p,p^{\prime};\epsilon,\epsilon^{\prime}}(\Pi)\geq 0.$ Combining this with (37) gives the following lower bound on the mismatched adversarial generalization error $\mathcal{G}_{p,p^{\prime},\epsilon,\epsilon^{\prime}}(\Pi,\mathcal{T})\geq% \mathcal{G}_{p,\epsilon}(\Pi,\mathcal{T})$ in the case of a weak training adversary

To obtain a lower bound in the case of a strong training adversary, and an upper bound in the case of a weak adversary, we need to bound the term $\nu_{p,p^{\prime};\epsilon,\epsilon^{\prime}}(\Pi)$ . To this end, we start by expressing the mismatch as

	$\displaystyle\nu_{p,p^{\prime};\epsilon,\epsilon^{\prime}}(\Pi)$	$\displaystyle=\mathbb{E}_{P(x,c)}[\ell_{p^{\prime},\epsilon^{\prime}}(\Pi,\rho% (x),c)-\ell_{p,\epsilon}(\Pi,\rho(x),c)]$
		$\displaystyle=\mathbb{E}_{P(x,c)}\Bigl{[}{\rm Tr}\Bigl{(}\Pi_{c}(\lambda^{}_{% p,\epsilon}(x,c)-\lambda^{}_{p^{\prime},\epsilon^{\prime}}(x,c))\Bigr{)}\Bigr% {]}$
		$\displaystyle=\mathbb{E}_{P(x,c)}\Bigl{[}{\rm Tr}(\Pi_{c}\Delta\lambda_{p,p^{% \prime};\epsilon,\epsilon^{\prime}}(x,c))\Bigr{]}=C,$

where $\lambda^{*}_{p,\epsilon}(x,c)$ is the optimal adversarial example for a $p$ -adversarial attack with perturbation budget $\epsilon$ , and likewise for $\lambda^{*}_{p^{\prime},\epsilon^{\prime}}(x,c)$ . The matrix $\Delta\lambda_{p,p^{\prime};\epsilon,\epsilon^{\prime}}(x,c)$ is a trace zero matrix thus $C$ may take both positive and negative values. Using this, we write

\displaystyle-|C|\leq\nu_{p,p^{\prime},\epsilon,\epsilon^{\prime}}(\Pi)\leq|C|.

(38)

We now proceed to upper bound the term $C$ .

	$\displaystyle\|C\|=\|\mathbb{E}_{P(x,c)}[{\rm Tr}(\Pi_{c}\Delta\lambda_{p,p^{% \prime};\epsilon,\epsilon^{\prime}}(x,c))]\|$
	$\displaystyle\leq\mathbb{E}_{P(x,c)}[\|{\rm Tr}(\Pi_{c}\Delta\lambda_{p,p^{% \prime};\epsilon,\epsilon^{\prime}}(x,c))\|]$
	$\displaystyle\leq\mathbb{E}_{P(x,c)}[\\|\lambda^{}_{p,\epsilon}(x,c)-\lambda^{% }_{p^{\prime},\epsilon^{\prime}}(x,c)\\|_{1}],$		(39)

where the first inequality follows from Jensen’s inequality, and the second inequality follows from Hölder’s using $\|\Pi_{c}\|_{\infty}\leq 1$ . The trace norm $\|\lambda^{*}_{p,\epsilon}(x,c)-\lambda^{*}_{p^{\prime},\epsilon^{\prime}}(x,c% )\|_{1}$ can be further upper bounded as

	$\displaystyle\\|\lambda^{}_{p,\epsilon}(x,c)-\lambda^{}_{p^{\prime},\epsilon^% {\prime}}(x,c)\\|_{1}$
	$\displaystyle\leq\\|\lambda^{}_{p,\epsilon}(x,c)-\rho(x)\\|_{1}+\\|\rho(x)-% \lambda^{}_{p^{\prime},\epsilon^{\prime}}(x,c)\\|_{1}$
	$\displaystyle\leq d^{(1-1/p)}\\|\lambda^{*}_{p,\epsilon}(x,c)-\rho(x)\\|_{p}$
	$\displaystyle+d^{(1-1/p^{\prime})}\\|\lambda^{*}_{p^{\prime},\epsilon^{\prime}}% (x,c)-\rho(x)\\|_{p^{\prime}}$
	$\displaystyle\leq d^{(1-1/p)}\epsilon+d^{(1-1/p^{\prime})}\epsilon^{\prime}=\xi,$		(40)

where the first inequality follows by adding and subtracting the term $\rho(x)$ and then applying the triangle inequality. The second inequality is an application of Hölder’s inequality with $\|I\|_{p/(p-1)}\leq d^{1-1/p}$ . The last inequality follows since $\lambda^{*}_{p,\epsilon}(x,c)$ is the optimal perturbed quantum state satisfying the constraint, $\|\lambda^{*}_{p,\epsilon}(x,c)-\rho(x)\|_{p}\leq\epsilon.$

Thus we can bound the mismatch $\nu_{p,p^{\prime},\epsilon,\epsilon^{\prime}}(\Pi)$ as follows:

-\xi\leq\nu_{p,p^{\prime},\epsilon,\epsilon^{\prime}}(\Pi)\leq\xi.

Using this again in (37) gives the following lower bound for when the training adversary is stronger,

\mathcal{G}_{p,\epsilon}(\Pi,\mathcal{T})-\xi\leq\mathcal{G}_{p,p^{\prime},% \epsilon,\epsilon^{\prime}}(\Pi,\mathcal{T}),

and the following upper bound for when the training adversary is weaker,

\mathcal{G}_{p,p^{\prime},\epsilon,\epsilon^{\prime}}(\Pi,\mathcal{T})\leq% \mathcal{G}_{p,\epsilon}(\Pi,\mathcal{T})+\xi.

This concludes the proof.

Appendix E Noisy Quantum Embedding Satisfies Assumption 1

In this section, we show that the minimum eigenvalue of the quantum state $\rho^{\prime}(x)=\mathcal{E}(\rho(x))$ resulting due to a noisy quantum embedding $x\mapsto\mathcal{E}(\rho(x))$ is at least the minimum eigenvalue of the noiseless state $\rho(x)$ . To this end, we note that the CPTP map $\mathcal{E}(\cdot)$ can be equivalently written as $\mathcal{E}(\rho)=\sum_{i}E_{i}\rho E_{i}^{{\dagger}}$ where $\{K_{i}\}$ is the set of Kraus operators satisfsying the completeness relation, $\sum_{i}K_{i}^{{\dagger}}K_{i}=I$ .

To compute the minimal eigenvalue of the noisy state $\mathcal{E}(\rho)$ , we use the variational principle as

	$\displaystyle\alpha_{\min}(\mathcal{E}(\rho))$	$\displaystyle=\underset{\|\psi\rangle}{\min}\sum_{i}\frac{\langle\psi\|E_{i}\rho E% _{i}^{\dagger}\|\psi\rangle}{\langle\psi\|\psi\rangle}$
		$\displaystyle=\underset{\|\psi\rangle}{\min}\sum_{i}\frac{\langle\psi\|E_{i}\rho E% _{i}^{\dagger}\|\psi\rangle}{\langle\psi\|E_{i}E_{i}^{\dagger}\|\psi\rangle}\frac% {\langle\psi\|E_{i}E_{i}^{\dagger}\|\psi\rangle}{\langle\psi\|\psi\rangle}.$		(41)

In (41), the first term is an upper bound on the minimal eigenvalue of $\rho$ . Using this, we have

\displaystyle\alpha_{\min}(\mathcal{E}(\rho))\geq\sum_{i}\alpha_{\min}(\rho)% \frac{\langle\psi|E_{i}E_{i}^{\dagger}|\psi\rangle}{\langle\psi|\psi\rangle}=% \alpha_{\min}(\rho).

Furthermore, equality holds only if there exists a state $|\alpha^{\min}\rangle$ such that

\displaystyle|\alpha^{\min}\rangle=\arg\min_{|\psi\rangle}\frac{\langle\psi|E_% {i}\rho E^{\dagger}_{i}|\psi\rangle}{\langle\psi|E_{i}E_{i}^{\dagger}|\psi% \rangle}\;\forall\;i.

This is because in equation (41) if $|\alpha^{\min}\rangle$ does not exist, then

\displaystyle\frac{\langle\psi|E_{i}\rho E^{\dagger}_{i}|\psi\rangle}{\langle% \psi|E_{i}E_{i}^{\dagger}|\psi\rangle}=\alpha_{\min}(\rho)

is not attainable for all $i$ simultaneously, hence $\alpha_{\min}(\mathcal{E}(\rho))=\alpha_{\min}(\rho)$ cannot be achieved.

	$\displaystyle\|C\|=\|\mathbb{E}_{P(x,c)}[{\rm Tr}(\Pi_{c}\Delta\lambda_{p,p^{% \prime};\epsilon,\epsilon^{\prime}}(x,c))]\|$
	$\displaystyle\leq\mathbb{E}_{P(x,c)}[\|{\rm Tr}(\Pi_{c}\Delta\lambda_{p,p^{% \prime};\epsilon,\epsilon^{\prime}}(x,c))\|]$
	$\displaystyle\leq\mathbb{E}_{P(x,c)}[\\|\lambda^{}_{p,\epsilon}(x,c)-\lambda^{% }_{p^{\prime},\epsilon^{\prime}}(x,c)\\|_{1}],$		(39)

	$\displaystyle\\|\lambda^{}_{p,\epsilon}(x,c)-\lambda^{}_{p^{\prime},\epsilon^% {\prime}}(x,c)\\|_{1}$
	$\displaystyle\leq\\|\lambda^{}_{p,\epsilon}(x,c)-\rho(x)\\|_{1}+\\|\rho(x)-% \lambda^{}_{p^{\prime},\epsilon^{\prime}}(x,c)\\|_{1}$
	$\displaystyle\leq d^{(1-1/p)}\\|\lambda^{*}_{p,\epsilon}(x,c)-\rho(x)\\|_{p}$
	$\displaystyle+d^{(1-1/p^{\prime})}\\|\lambda^{*}_{p^{\prime},\epsilon^{\prime}}% (x,c)-\rho(x)\\|_{p^{\prime}}$
	$\displaystyle\leq d^{(1-1/p)}\epsilon+d^{(1-1/p^{\prime})}\epsilon^{\prime}=\xi,$		(40)

Adversarial Quantum Machine Learning: An Information-Theoretic Generalization Analysis

Abstract

I Introduction

II Problem Formulation

II-A Generalization Error of Quantum Classifiers

II-B Adversarial Attacks on Quantum Classifiers

II-C Generalization Error of Adversarially Trained Classifiers

III Preliminaries

Theorem 1 (Banchi et. al [11]).

Lemma 1 (Shalev-Schwartz and Ben David[14]).

IV Generalization Bounds for Adversarially Trained Quantum Classifiers

IV-A Key Technical Challenge

IV-B Main Results

Assumption 1.

Theorem 2.

Theorem 3.

Theorem 4.

V Generalization Bounds Under Adversarial Mismatch

Definition 1.

Lemma 2.

Theorem 5.

VI Examples and Final Remarks

Acknowledgments

References

Appendix A Proof of Theorem 2 and Theorem 3

A-A D1⁢(⋅,⋅)subscript𝐷1⋅⋅D_{1}(\cdot,\cdot)italic_D start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( ⋅ , ⋅ ) perturbation Rademacher complexity

A-B D∞⁢(⋅,⋅)subscript𝐷⋅⋅D_{\infty}(\cdot,\cdot)italic_D start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ( ⋅ , ⋅ ) perturbation Rademacher complexity

Appendix B Proof of Theorem 4

B-A D1⁢(⋅,⋅)subscript𝐷1⋅⋅D_{1}(\cdot,\cdot)italic_D start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( ⋅ , ⋅ ) perturbation Rademacher complexity

B-B D∞⁢(⋅,⋅)subscript𝐷⋅⋅D_{\infty}(\cdot,\cdot)italic_D start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ( ⋅ , ⋅ ) perturbation Rademacher complexity

Appendix C Proof of Lemma 2

Appendix D Proof of Theorem 5

Appendix E Noisy Quantum Embedding Satisfies Assumption 1

A-A $D_{1}(\cdot,\cdot)$ perturbation Rademacher complexity

A-B $D_{\infty}(\cdot,\cdot)$ perturbation Rademacher complexity

B-A $D_{1}(\cdot,\cdot)$ perturbation Rademacher complexity

B-B $D_{\infty}(\cdot,\cdot)$ perturbation Rademacher complexity