Abstract
The Boolean multireference alignment problem consists in recovering a Boolean signal from multiple shifted and noisy observations. In this paper we obtain an expression for the error exponent of the maximum a posteriori (MAP) decoder. This expression is used to characterize the number of measurements needed for signal recovery in the low SNR regime, in terms of higher-order autocorrelations of the signal. The characterization is explicit for various signal dimensions, such as prime and even dimensions.
I. Introduction
The Boolean multireference alignment (BMA) problem consists of estimating an unknown signal x ∈ ℤ_2^L from noisy, cyclically shifted copies Y_1, …, Y_N ∈ ℤ_2^L, i.e.,
(1)  Y_i = R^{S_i} x ⊕ Z_i,   i ∈ [N],
where the errors Z_i ~ Ber(p)^{⊗L}, the product measure of L Bernoulli variables with parameter p; ⊕ denotes entrywise addition mod 2; R is the cyclic shift operator that shifts a vector one element to the right, (x_1, …, x_L) ↦ (x_L, x_1, …, x_{L−1}); R^{S_i} corresponds to applying the operator R S_i times; and the shifts S_i ~ 𝒰(ℤ_L), the uniform distribution on ℤ_L.
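As a concrete illustration, the following Python sketch simulates the observation model (1); the signal, the parameters L, N, p, and the helper name observe are our own illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(0)
L, N, p = 8, 5, 0.3                  # illustrative signal length, sample size, flip probability
x = rng.integers(0, 2, size=L)       # unknown Boolean signal in Z_2^L

def observe(x, N, p, rng):
    """Generate N observations Y_i = R^{S_i} x XOR Z_i, as in (1)."""
    L = len(x)
    S = rng.integers(0, L, size=N)            # shifts S_i ~ Uniform(Z_L)
    Z = (rng.random((N, L)) < p).astype(int)  # errors Z_i ~ Ber(p)^L
    Y = np.array([np.roll(x, int(s)) for s in S]) ^ Z
    return Y

print(observe(x, N, p, rng))
```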
The motivation to study this problem comes from the classical multireference alignment problem, where the signal and observations are real valued vectors, and the error is Gaussian white noise. Several algorithms were recently proposed to solve the problem, including angular synchronization [1], semidefinite program relaxations of the maximum likelihood decoder [2] and reconstruction using the bispectrum [3]. This problem is also an instance of a larger class of problems, called Non-Unique Games, which also includes the orientation estimation problem in cryo-electron microscopy [4].
Despite these advancements in algorithmic development, not much progress has been made in understanding the fundamental limits of signal recovery. The recent paper [5] investigated fundamental limits of shift recovery in multireference alignment, but not those of signal recovery. We note that estimating the shifts is impossible at low signal-to-noise ratio (SNR) even if an oracle presents us with the true signal. Also, the goal of many applications is signal recovery rather than shift estimation. Our paper aims to fill the gap on signal recovery, by studying the Boolean case.
We focus on the low SNR regime, since pairwise alignment performs well in the high SNR regime, while in applications, such as cryo-electron microscopy, the low regime is predominant. We show here that signal recovery is possible at arbitrarily low SNR, if sufficiently many measurements are available, and quantify this tradeoff. We do not consider here the problem of determining the sample complexity of multireference alignment in the real-valued Gaussian noise case, which is a topic of ongoing research [6], [7].
In BMA the search space is finite, and the maximum a posteriori (MAP) decoder minimizes the probability of error. Our main contribution is an expression for the error exponent of MAP in the low SNR regime, given in Theorem III.2 and Corollary III.3. Our results characterize how many measurements are needed, as a function of the SNR, in order to accurately estimate the signal.
The expression depends on the autocorrelations of the signal, defined in (6). Our results connect the order of autocorrelations needed to reconstruct the signal to the number of measurements needed to estimate the signal. This has some connections with previous theoretical work on uniqueness of the bispectrum [8].
We also consider some generalizations of the original problem in order to model some aspects of multireference alignment that arise in applications, such as the introduction of deletions.
II. BMA Problem
In the BMA problem, the errors are i.i.d. Bernoulli of parameter p. If p = 1/2, then the observations are uniformly distributed on ℤ_2^L, regardless of the original signal, and signal recovery is impossible. This corresponds to the case SNR = 0. On the other hand, p = 0 or p = 1 corresponds to the noiseless case. Thus we define
(2)  SNR := (1 − 2p)^2.
In contrast to proposing an algorithm to solve the BMA problem, our paper focuses on its sample complexity in the low SNR regime, when p → 1/2 and SNR → 0.
Note that the observations Yi, i ∈ [N], given the signal x, are i.i.d., since both the shifts Si and the errors Zi are i.i.d. For that reason we will drop the index i when it is more convenient. We rewrite (1), denoting by x(j) the j-th entry of x.
(3)  Y(j) = x(S + j) ⊕ Z(j),   j ∈ ℤ_L,
where ‘+’ is addition mod L.
Our paper also considers the sample complexity of the following variations of the basic BMA problem:
- BMA Problem with consecutive deletions: In this case the measurements Y_1, …, Y_N are in ℤ_2^K, with K ≤ L, and
(4)  Y_i(j) = x(S_i + j) ⊕ Z_i(j),   j ∈ [K].
When K = L we obtain the original BMA problem.
- BMA Problem with known deletions: Let V ⊂ ℤ_L be an ordered set of non-deletions, i.e., the set of deletions is ℤ_L \ V. Now the measurements Y_1, …, Y_N are in ℤ_2^K, with K = |V|, and
(5)  Y_i(j) = x(S_i + V_j) ⊕ Z_i(j),   j ∈ [K],
where V_j denotes the j-th element of V. When V = [K] we recover the BMA problem with consecutive deletions.
- BMA Problem (and variations) with non-uniform rotations: Similar to the previous problems, but now the shifts follow some distribution ξ on ℤ_L.
These variations are motivated by problems similar to multireference alignment. The case of possible deletions is intended to model instances where the observations are only partial, whereas the extension to non-uniform shifts attempts to represent a non-symmetric version of the problem.
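A minimal sketch of the generalized observation model (5), with known deletions and a non-uniform shift distribution ξ; the particular V, ξ, and parameters below are arbitrary illustrations.

```python
import numpy as np

rng = np.random.default_rng(1)
L, p = 8, 0.25
x = rng.integers(0, 2, size=L)
V = [0, 1, 3, 6]                                              # ordered set of non-deleted indices, K = |V|
xi = np.array([0.4, 0.2, 0.1, 0.1, 0.05, 0.05, 0.05, 0.05])   # shift distribution on Z_L

def observe_general(x, V, xi, N, p, rng):
    """Y_i(j) = x(S_i + V_j) XOR Z_i(j), with S_i ~ xi and i.i.d. Ber(p) errors, as in (5)."""
    L, K = len(x), len(V)
    S = rng.choice(L, size=N, p=xi)
    Z = (rng.random((N, K)) < p).astype(int)
    Y = np.array([[x[(int(s) + v) % L] for v in V] for s in S]) ^ Z
    return Y

print(observe_general(x, V, xi, N=4, p=p, rng=rng))
```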
III. Results
We start by introducing the following notion of autocorrelation of a signal that is central to our main results.
Definition III.1
The (ξ, k)-autocorrelation of x, with respect to a distribution ξ on ℤ_L and k ∈ ℤ_L^d, is defined as
(6)  A_k^ξ(x) := E_{S∼ξ} [ ∏_{j=1}^{d} x(S + k_j) ] = Σ_{s ∈ ℤ_L} ξ(s) ∏_{j=1}^{d} x(s + k_j).
We refer to d = |k| as the order of the autocorrelation. When ξ ~ 𝒰(ℤ_L), we simply write k-autocorrelation and A_k. Notice that A_k is shift invariant, that is, A_k(x) = A_k(R^s x), and in this case we may assume k_1 = 0.
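The autocorrelation (6) can be computed by direct summation. A small sketch (with illustrative values) that also checks the shift invariance A_k(x) = A_k(R^s x) under uniform ξ:

```python
import numpy as np

def autocorrelation(x, k, xi=None):
    """(xi, k)-autocorrelation of x, cf. (6): sum_s xi(s) prod_j x(s + k_j)."""
    L = len(x)
    if xi is None:                                # uniform shift distribution
        xi = np.full(L, 1.0 / L)
    return sum(xi[s] * np.prod([x[(s + kj) % L] for kj in k]) for s in range(L))

x = np.array([1, 0, 1, 1, 0, 0, 1, 0])
L = len(x)
k = (0, 2, 5)                                     # an order-3 index vector

# shift invariance under the uniform distribution
for s in range(L):
    assert np.isclose(autocorrelation(x, k), autocorrelation(np.roll(x, s), k))

# all order-2 autocorrelations A_(0, m)
print([round(float(autocorrelation(x, (0, m))), 3) for m in range(L)])
```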
We define the minimum autocorrelation order necessary to distinguish x1 and x2 under ξ and V as
(7)  t_{ξ,V}(x_1, x_2) := min{ d ≥ 1 : A_k^ξ(x_1) ≠ A_k^ξ(x_2) for some k ∈ V^d },
where V^d denotes the vectors in ℤ_L^d with entries in V. The minimum autocorrelation order necessary to describe all signals in 𝒳 is defined as
(8)  t_{ξ,V}(𝒳) := max_{x_1, x_2 ∈ 𝒳, x_1 ≠ x_2} t_{ξ,V}(x_1, x_2).
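Definition (7) can be evaluated by brute force for toy sizes: search, for increasing order d, over all k ∈ V^d for an autocorrelation that separates x_1 from x_2. A sketch (exponential in d, so only for small instances; all names are ours):

```python
import numpy as np
from itertools import product

def autocorrelation(x, k, xi):
    L = len(x)
    return sum(xi[s] * np.prod([x[(s + kj) % L] for kj in k]) for s in range(L))

def min_separating_order(x1, x2, V, xi, d_max=4):
    """Smallest d with A_k(x1) != A_k(x2) for some k in V^d, cf. (7)."""
    for d in range(1, d_max + 1):
        for k in product(V, repeat=d):
            if not np.isclose(autocorrelation(x1, k, xi), autocorrelation(x2, k, xi)):
                return d
    return None       # not separated by any autocorrelation of order <= d_max

L = 6
xi = np.full(L, 1.0 / L)                  # uniform shifts
V = range(L)                              # no deletions
x1 = np.array([1, 1, 0, 1, 0, 0])
x2 = np.array([1, 1, 0, 0, 1, 0])         # not a cyclic shift of x1, same order-1,2 autocorrelations
print(min_separating_order(x1, x2, V, xi))   # prints 3 for this pair
```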
Given a prior distribution on the signals PX, with support 𝒳, denote by X the random variable with distribution PX. Given an algorithm for BMA the probability of error is defined as
(9)  P_e := ℙ(X̂ ≠ X),
where X̂ is the answer given by the algorithm. In the BMA problem the search space is finite, thus the MAP estimator minimizes the probability of error (9). We obtain results that do not depend on the prior distribution; they depend only on its support.
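Since the search space is finite, the MAP rule can be evaluated exactly on toy instances: compute the likelihood μ_x(y; p) by summing over the shift, and maximize the posterior over a candidate set. The sketch below is an illustration only; the sizes, noise level, and helper names are our own choices.

```python
import numpy as np
from itertools import product

def likelihood(y, x, p, xi):
    """mu_x(y; p) = sum_s xi(s) p^{w(y XOR R^s x)} (1 - p)^{L - w(...)}."""
    L = len(x)
    total = 0.0
    for s in range(L):
        w = int(np.sum(y ^ np.roll(x, s)))        # Hamming weight of the mismatch
        total += xi[s] * p**w * (1 - p)**(L - w)
    return total

def map_decode(Y, candidates, prior, p, xi):
    """argmax over x of P_X(x) * prod_i mu_x(Y_i; p), computed with log-likelihoods."""
    scores = [np.log(pr) + sum(np.log(likelihood(y, x, p, xi)) for y in Y)
              for x, pr in zip(candidates, prior)]
    return candidates[int(np.argmax(scores))]

rng = np.random.default_rng(2)
L, N, p = 6, 300, 0.1
xi = np.full(L, 1.0 / L)
candidates = [np.array(c) for c in product((0, 1), repeat=L)]     # all of Z_2^L
prior = np.full(len(candidates), 1.0 / len(candidates))           # uniform prior

x_true = np.array([1, 1, 0, 1, 0, 0])
Y = [np.roll(x_true, int(s)) ^ (rng.random(L) < p).astype(int)
     for s in rng.integers(0, L, size=N)]
x_hat = map_decode(Y, candidates, prior, p, xi)
print(x_true, x_hat)    # with uniform shifts, the signal is recovered only up to a cyclic shift
```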
Theorem III.2
Consider the BMA problem with known deletions ℤ_L \ V and shift distribution ξ. Let 𝒳 ⊂ ℤ_2^L be the support of the prior distribution of the signals and μ_x the conditional distribution on ℤ_2^K of the observations Y given the signal x, where K = |V|. The probability of error of the MAP estimator, denoted by P_e, has the following asymptotic behavior
(10) |
with
(11) |
and t = tξ,V (x1, x2).
The theorem implies that the exponent on SNR is t_{ξ,V}(𝒳). In the original problem, with uniform shifts and no deletions, the recovery of the original signal is possible only up to a shift, i.e., we can only recover R^k x, where x is the original signal and k is some shift in ℤ_L. For that reason, we consider 𝒳 to have exactly one element of each class of all the shifts of a signal, i.e., there are no two elements in 𝒳 where one is a shift of the other (for example, if L is prime, then there are (2^L − 2)/L + 2 such elements).
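One concrete way to build such a set 𝒳 is to keep, for every signal, the lexicographically smallest of its cyclic rotations; for prime L the resulting number of classes is (2^L − 2)/L + 2. A small sketch (the choice of representative is arbitrary):

```python
from itertools import product

def shift_class_representative(x):
    """Lexicographically smallest cyclic rotation of the tuple x."""
    L = len(x)
    return min(tuple(x[(j + s) % L] for j in range(L)) for s in range(L))

def representatives(L):
    return sorted({shift_class_representative(x) for x in product((0, 1), repeat=L)})

for L in (5, 7, 8):
    print(L, len(representatives(L)))
# for prime L the count is (2**L - 2) // L + 2: 8 classes for L = 5, 20 classes for L = 7
```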
Corollary III.3
Consider the original problem, with V = [L], ξ ~ 𝒰(ℤL) and 𝒳 as defined above. By inspection one can obtain the error exponent for L ≤ 5. For L ≥ 6, we either have
(12) |
Also, the first case occurs when L is prime, and the second when L ≥ 12 is even. The other values of L remain open.
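For small L, the relevant order can be probed numerically by computing, for every pair of shift-class representatives, the smallest autocorrelation order that separates them, and taking the maximum over pairs. A brute-force sketch (feasible only for very small L; all helper names are ours):

```python
from math import prod
from itertools import product, combinations

def count_autocorr(x, k):
    """L times the k-autocorrelation of a 0/1 tuple (an integer, so comparisons are exact)."""
    L = len(x)
    return sum(prod(x[(s + kj) % L] for kj in k) for s in range(L))

def representative(x):
    L = len(x)
    return min(tuple(x[(j + s) % L] for j in range(L)) for s in range(L))

def max_separating_order(L, d_max=5):
    """Max over pairs of shift classes of the minimal autocorrelation order separating them."""
    reps = sorted({representative(x) for x in product((0, 1), repeat=L)})
    worst = 1
    for x1, x2 in combinations(reps, 2):
        for d in range(1, d_max + 1):
            if any(count_autocorr(x1, k) != count_autocorr(x2, k)
                   for k in product(range(L), repeat=d)):
                worst = max(worst, d)
                break
        else:
            return None   # some pair is not separated up to order d_max
    return worst

for L in (5, 6, 7):
    print(L, max_separating_order(L))
```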
IV. Proof Techniques
Proof of Theorem III.2
The proof consists of two main parts: the next theorem gives a formula for the error exponent, and Claim IV.2 makes the connection with autocorrelations.
Theorem IV.1
Consider the BMA problem with known deletions ℤ_L \ V and shift distribution ξ. Let 𝒳 ⊂ ℤ_2^L be the space of possible signals and μ_x := P_{Y|X}(·|x) the conditional distribution on ℤ_2^K of the observations given the signal x. The probability of error of the MAP estimator (P_e) has the following asymptotic behavior
(13) |
with
(14) |
where μ_x^{(m)}(y; p) denotes the m-th derivative of μ_x(y; p) in p, i.e., the derivative of the conditional distribution in y given x with respect to the Bernoulli parameter p, and s(x_1, x_2) is the smallest order m for which these derivatives, evaluated at p = 1/2, differ for x_1 and x_2.
This theorem follows from Theorems 1 and 2 in [9]. Theorem 1 is a corollary of Sanov's theorem, and expression (13) is in fact the Chernoff information between the distributions μ_{x_1} and μ_{x_2} [10]. In Theorem 2 of [9] we Taylor expand the Chernoff information (13) and obtain (14).
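For reference, the Chernoff information between the two conditional distributions is, in our notation, the standard quantity [10]

C(μ_{x_1}, μ_{x_2}) = − min_{0 ≤ λ ≤ 1} log Σ_y μ_{x_1}(y)^λ μ_{x_2}(y)^{1−λ},

so the exponent is governed by the pair of signals in 𝒳 that is hardest to distinguish, and Taylor expanding this quantity around p = 1/2 yields (14).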
Claim IV.2
If
(15) |
then the following expressions are equal:
(16) |
and
(17) |
In fact, since the expressions (16) and (17) are both sums of squares, the claim implies that t_{ξ,V}(x_1, x_2) = s(x_1, x_2), which concludes the proof of Theorem III.2.
Proof of Claim IV.2
The claim is proved by induction on n. Note that condition (15) is equivalent to (16) vanishing for m < n, which implies (17) also vanishes by applying the claim with m. In general (17) is a function of (16) and lower order terms, which vanish when we enforce condition (15). The rigorous proof follows.
Denote by x(V) the vector in ℤ_2^K that consists of the values of x with indices in V, i.e., the j-th element of x(V) is x(V_j). Also, given s ∈ ℤ_L, denote by s + V the ordered set obtained by adding s to each element of V mod L. Equation (5) can then be rewritten as
(18)  Y = x(S + V) ⊕ Z.
Then, since the entries of Z are i.i.d. Ber(p), we have
P(Y = y | S = s, X = x) = p^{w(y ⊕ x(s+V))} (1 − p)^{K − w(y ⊕ x(s+V))},
where w denotes the Hamming weight, and since S ~ ξ,
(19)  μ_x(y; p) = Σ_{s ∈ ℤ_L} ξ(s) p^{w(y ⊕ x(s+V))} (1 − p)^{K − w(y ⊕ x(s+V))}.
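As a quick sanity check (an illustration only, with arbitrary parameters), one can verify numerically that (19) defines a probability distribution on ℤ_2^K:

```python
import numpy as np
from itertools import product

rng = np.random.default_rng(3)
L, p = 7, 0.2
x = rng.integers(0, 2, size=L)
V = [0, 2, 3, 5]                              # non-deleted indices, K = |V|
K = len(V)
xi = rng.dirichlet(np.ones(L))                # an arbitrary shift distribution on Z_L

def mu(y, x, V, xi, p):
    """The mixture likelihood (19)."""
    total = 0.0
    for s in range(len(x)):
        w = sum(int(y[j]) ^ int(x[(s + V[j]) % len(x)]) for j in range(len(V)))
        total += xi[s] * p**w * (1 - p)**(len(V) - w)
    return total

total_mass = sum(mu(y, x, V, xi, p) for y in product((0, 1), repeat=K))
print(total_mass)     # equals 1 up to floating point error
```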
In the statement of the theorem we have x ∈ ℤ_2^L; however, it is convenient for the proof to consider the entries of x to be in {−1, 1}, changed by the rule a ↦ 1 − 2a. We will call
(20)  u := 1 − 2x
the corresponding element of x with ±1 values, where Σ2 := {−1, 1}, and v := 1 − 2y. In analogy to the Hamming weight, we define
(21) |
With this we rewrite (19)
(22) |
where μu(v; p) := μx(y; p). For simplicity of notation denote
By properties of Jacobi polynomials [11] we have
where Pm is a polynomial with the following property
(23) |
where Qm has degree at most m − 1, and Q0 ≡ Q1 ≡ 0. Thus
(24) |
Then when m = 1
Now, by the induction hypothesis if for all k ≤ n − 1,
for all since Qn has degree at most n − 1. Thus by (23) and (24)
(25) |
Now splitting the square of the sum on the RHS into a product of two sums and expanding, we obtain terms of the form
(26) |
where α and β are 1 or 2. By Lemma IV.3 we get
(27) |
where u_{a_{ij}} is u_α(s_1 + V) if a_{ij} ≤ n, and is u_β(s_2 + V) otherwise. So, since |a_i| is even, as A is an even partition, and the entries of u_{a_{ij}} are ±1,
if |ai ∩ [n]| is odd, and it is K otherwise. Then
where R_n is a polynomial of degree n (with coefficients possibly depending on K and n), and R_1(b) = 2^K b. It cannot have degree n + 1, since |A| ≤ n, because A is an even partition of [2n]. For a term of order n, we need |A| = n, so |a_i| = 2 for i = 1, …, n, and thus C_A = 1 by the Lemma. Also, |a_i ∩ [n]| must be odd for all i, thus |a_i ∩ [n]| = 1. There are exactly n! partitions with this property, so the leading coefficient of R_n is 2^K n!. We also have
(28) |
The equation holds for n = 1, since R_1(b) = 2^K b. As in (25), the induction hypothesis implies that the lower-order terms in R_n cancel and only the leading coefficient is of interest. We get
(29) |
Now through some algebraic manipulation, and using again the argument of the leading coefficient, if |k| = n, then
(30) |
Lemma IV.3
For any partition A = {a1, …, a|A|} of the set {1, 2, …, m}, denote by aij the j-th entry of ai and M[m] the set of all such partitions. If
(31) |
where A is even if all |ai| are even for i ∈ {1, …, |A|}. Moreover, CA is a constant that depends only on the partition A and is always 1 if |ai| = 2 for all i ∈ {1, …, |A|}.
Proof
Recall (21). We have
(32) |
The last sum is 2^K when A is even, and 0 otherwise. Using a combinatorial argument we can rewrite (32) without the 'all-distinct' condition, at the cost of a constant C_A, which is 1 when |a_i| = 2 for i ∈ {1, …, |A|}. We get
Proof of Corollary III.3
We first prove equation (12). Recall (6), and denote by
B_m(x_1, x_2) := Σ_{k ∈ ℤ_L^m} (A_k(x_1) − A_k(x_2))^2
and
B_m(L) := min_{x_1 ≠ x_2 ∈ 𝒳} B_m(x_1, x_2).
Note that B_m(x_1, x_2) = 0 if m < t_{ξ,V}(x_1, x_2), by (7). For convenience let B(x_1, x_2) := B_{t_{ξ,V}(x_1,x_2)}(x_1, x_2) and B(L) := B_{t_{ξ,V}(𝒳)}(L). Using this notation we rewrite (10) and (11)
Now equation (12) is equivalent to having tξ,V(𝒳) ≥ 3 and B3(L) either or 0. Turns out, for L ≥ 6, if we take
then and . Also we cannot have . This implies there exist x_1 and x_2 in 𝒳 such that . Since it is positive, there is k* such that A_{k*}(x_1) ≠ A_{k*}(x_2). But by definition (6), since ξ ~ 𝒰(ℤ_L), L·A_{k*}(x) is an integer for x ∈ ℤ_2^L, and L^2 (A_{k*}(x_1) − A_{k*}(x_2))^2 ∈ ℤ.
Now by the definition we also have A_{σ(k*)}(x) = A_{k*}(x), where σ permutes the entries of k*. Also, for s ∈ ℤ_L, let s + k* denote the entrywise sum mod L; then A_{s+k*}(x) = A_{k*}(x). There are 6 permutations and L possible values for s ∈ ℤ_L, so B(x_1, x_2) is an integer multiple of . (We can also have non-trivial s and σ such that s + k* = σ(k*), but that case also has the property mentioned.) However we cannot have . That means there exists only one k* (up to permutations and shifts) such that A_{k*}(x_1) ≠ A_{k*}(x_2). Then
(33) |
On the other hand
where A_0 denotes the k-autocorrelation with k = 0. Since t_L > 1, A_0(x_1) = A_0(x_2), so equation (33) must be 0, and equation (12) follows by contradiction. Now, if L ≥ 12 is even, choose
and x_2 the vector obtained by reversing the entries of x_1. Since one is the reverse of the other, they have the same autocorrelations of orders 1 and 2. Recall (20) and (6) and notice that in this case both A_k(u_1) and A_k(u_2) are 0 when |k| is odd, since half of the signal is the symmetric of the other half, i.e., u(j + L/2) = −u(j). Now, because of (30), we have A_k(x_1) = A_k(x_2) when |k| = 3, so t_L ≥ 4, and B_3(L) = 0.
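This construction can be checked numerically: take any x_1 ∈ ℤ_2^L with x_1(j + L/2) = 1 − x_1(j) whose reversal is not one of its cyclic shifts, and verify that no autocorrelation of order up to 3 separates the two. The sketch below uses L = 12 and an x_1 of our own choosing with this property (not necessarily the vector used in the paper):

```python
from math import prod
from itertools import product

L = 12
h = (1, 1, 0, 1, 1, 0)                        # an arbitrary half with the required property
x1 = h + tuple(1 - a for a in h)              # x1(j + L/2) = 1 - x1(j)
x2 = tuple(reversed(x1))                      # the reversed signal

def count_autocorr(x, k):
    return sum(prod(x[(s + kj) % L] for kj in k) for s in range(L))

# x1 and x2 lie in different shift classes ...
assert all(tuple(x1[(j + s) % L] for j in range(L)) != x2 for s in range(L))
# ... yet no autocorrelation of order at most 3 distinguishes them
for d in (1, 2, 3):
    assert all(count_autocorr(x1, k) == count_autocorr(x2, k)
               for k in product(range(L), repeat=d))
print("orders 1-3 coincide for", x1, "and its reversal")
```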
Finally, let L ≥ 6 be prime. We prove by contradiction that t_L = 3 and . If this is not true, then there exist x_1 and x_2 in 𝒳 such that A_k(x_1) = A_k(x_2) for all |k| ≤ 3, so
(34) |
By Theorem 2 of [8], if the Fourier coefficients of x_1 and x_2 are non-zero, then equation (34) implies one is a shift of the other. Denote by x̂_1 and x̂_2 the Fourier coefficients of x_1 and x_2, respectively, which are given by
(35)  x̂_1(j) = Σ_{n ∈ ℤ_L} x_1(n) ω_L^{jn},
(36)  x̂_2(j) = Σ_{n ∈ ℤ_L} x_2(n) ω_L^{jn},
where ω_L is the L-th root of unity. Since x_1 is not a shift of x_2, equation (34) then implies that x̂_1 or x̂_2 has zeros, and x̂_i(j) is 0 only if ω_L^j is a root of the polynomial
(37)  x_i(0) + x_i(1) z + ⋯ + x_i(L−1) z^{L−1}.
However, since L is prime, the minimal polynomial of ω_L^j in ℚ[z], for L > j > 0, is 1 + z + ⋯ + z^{L−1} [12], so this polynomial must divide (37). Thus x_1 and x_2 must be the all-zeros and all-ones signals, but these signals also do not satisfy (34).
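The cyclotomic step can be spot-checked numerically for small primes: apart from the all-zeros and all-ones signals, no Boolean signal of prime length has a vanishing Fourier coefficient. A small sketch (an illustration, not part of the proof):

```python
import numpy as np
from itertools import product

# For prime L, a Boolean signal with a vanishing Fourier coefficient
# must be the all-zeros or the all-ones signal.
for L in (5, 7):
    for x in product((0, 1), repeat=L):
        xhat = np.fft.fft(np.array(x, dtype=float))
        has_zero = bool(np.any(np.isclose(np.abs(xhat), 0.0, atol=1e-9)))
        is_constant = sum(x) in (0, L)
        assert has_zero == is_constant
    print(f"L = {L}: only the constant signals have a vanishing Fourier coefficient")
```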
Acknowledgments
A. S. and J. P. were partially supported by Award Number R01GM090200 from the NIGMS, FA9550-12-1-0317 from AFOSR, Simons Foundation Investigator Award and Simons Collaborations on Algorithms and Geometry, and the Moore Foundation Data-Driven Discovery Investigator Award.
E. A. is partially supported by NSF CAREER Award CCF-1552131 and ARO grant W911NF-16-1-0051.
Contributor Information
Emmanuel Abbe, Princeton University.
João M. Pereira, Princeton University
Amit Singer, Princeton University.
References
1. Singer A. Angular synchronization by eigenvectors and semidefinite programming. Applied and Computational Harmonic Analysis. 2011;30(1):20–36. doi: 10.1016/j.acha.2010.02.001.
2. Bandeira AS, Charikar M, Singer A, Zhu A. Multireference alignment using semidefinite programming. Proceedings of the 5th Conference on Innovations in Theoretical Computer Science; ACM; 2014. pp. 459–470.
3. Sadler BM, Giannakis GB. Shift- and rotation-invariant object reconstruction using the bispectrum. JOSA A. 1992;9(1):57–69.
4. Bandeira AS, Chen Y, Singer A. Non-unique games over compact groups and orientation estimation in cryo-EM. 2015. arXiv preprint arXiv:1505.03840. doi: 10.1088/1361-6420/ab7d2c.
5. Aguerrebere C, Delbracio M, Bartesaghi A, Sapiro G. Fundamental limits in multi-image alignment. IEEE Transactions on Signal Processing. 2016;64(21):5707–5722.
6. Bandeira AS, Rigollet P, Weed J. Optimal rates of estimation for multi-reference alignment. In preparation.
7. Perry A, Weed J, Bandeira AS, Rigollet P, Singer A. The sample complexity of multi-reference alignment. In preparation.
8. Kakarala R. The bispectrum as a source of phase-sensitive invariants for Fourier descriptors: a group-theoretic approach. Journal of Mathematical Imaging and Vision. 2012;44(3):341–353.
9. Abbe E, Pereira JM, Singer A. Very noisy Sanov's theorem and applications to alignment problems. 2017. Preprint.
10. Cover TM, Thomas JA. Elements of Information Theory. John Wiley & Sons; 2006.
11. Szegő G. Orthogonal Polynomials. Vol. 23. American Mathematical Society; 1975.
12. Jacobson N. Lectures in Abstract Algebra III: Theory of Fields and Galois Theory. Vol. 32. Springer-Verlag; 1964.