A method for verifying the generalized Riemann hypothesis

Ghaith Hiary, Summer Ireland, Megan Kyi GH: Department of Mathematics, The Ohio State University, 231 West 18th Ave, Columbus, OH 43210, USA hiary.1@osu.edu SI: Department of Mathematics, The Ohio State University, 231 West 18th Ave, Columbus, OH 43210, USA ireland.118@buckeyemail.osu.edu MK: Oberlin College, 135 West Lorain Street, Oberlin, OH 44074-1081, USA mkyi@oberlin.edu

Abstract.

Riemann numerically approximated at least three zeta zeros. According to Edwards, Riemann even took steps to verify that the lowest zero he computed was indeed the first zeta zero. This approach to verification is developed, improved, and generalized to a large class of $L$ -functions. Results of numerical calculations demonstrating the efficacy of the method are presented.

Key words and phrases:

Riemann hypothesis, Turing test, Riemann zeta function, L-functions.

2020 Mathematics Subject Classification:

Primary: 11M06, 11Y35.

1. Introduction

Let $s=\sigma+it$ be a complex variable, where $\sigma$ and $t$ are real numbers. The Riemann zeta function $\zeta(s)$ is defined by the Dirichlet series

(1)

\zeta(s)=\sum_{n\geq 1}\frac{1}{n^{s}},

which converges absolutely in the half-plane $\sigma>1$ . Zeta can be analytically continued to the entire complex plane except for a simple pole at $s=1$ , and has zeros (i.e., roots) at $s=-2,-4,-6,\ldots$ , which are called the trivial zeros. Zeta also has an infinite number of nontrivial zeros $\rho=\beta+i\gamma$ in the critical strip $0<\sigma<1$ , none of which is real. We call $|\gamma|$ the height of $\rho$ and order the $\rho$ ’s by increasing height. The trivial and nontrivial zeros account for all the zeta zeros. The Riemann Hypothesis (RH) is that all the $\rho$ ’s are on the critical line $\sigma=1/2$ , or equivalently that $\beta=1/2$ for all $\rho$ .

It is frequently asserted that Riemann numerically approximated the first few zeta zeros by hand, citing unpublished notes by Riemann. See in particular Edwards [edwards_riemann_2001, §7.6] as well as the Clay Mathematics Institute page [clay]. As Figure 1 shows, Riemann numerically approximated three zeta zeros on the critical line, corresponding to those with ordinates (i.e., imaginary parts)

\begin{split}\gamma_{1}=14.1347251417\ldots,\\ \gamma_{2}=21.0220396387\ldots,\\ \gamma_{3}=25.0108575801\ldots.\end{split}

Riemann approximated $\gamma/(2\pi)$ rather than $\gamma$ as the former quantity appeared naturally in various formulas.

The closest approximation of $\gamma_{1}/(2\pi)$ that we found in Riemann’s notes was $2.250466$ , so that $\gamma_{1}$ is $\approx 14.140095$ . (Footnote 1: This differs from what is stated in [edwards_riemann_2001, 159], which gave the approximation $14.1386$ .) For $\gamma_{2}$ and $\gamma_{3}$ , Riemann computed the approximations $3.287195$ and $4.0287$ , respectively. Both of these approximations were noticeably far from the true values $3.34576152\ldots$ and $3.98060161\ldots$ , and as can be seen in Figure 1, Riemann had other intermediate approximations that were slightly better. Nevertheless, Riemann’s approximation of $\gamma_{1}$ , which was long unknown to the outside world, remained closest to the true value of $\gamma_{1}$ for nearly five decades. (Footnote 2: As far as we can tell, the first circulated approximation of $\gamma_{1}$ was in 1887 by Stieltjes [bailaud_bourget_1905, 450], who gave the approximation $14.5$ . Eight years later, Gram [gram_1895] gave the approximation $14.135$ , which Gram [gram_1903] improved to $14.13472$ in 1903. Around the same time, Lindelöf [lindelof_1903] devised a different method to approximate the $\rho$ ’s and proved that $14\leq\gamma_{1}\leq 14.25$ .)

According to Edwards [edwards_riemann_2001, §7.6], Riemann even attempted to verify that his numerical approximation of $1/2+i\gamma_{1}$ indeed corresponded to the first zeta zero (i.e., to the $\rho$ with smallest positive ordinate). This verification relied on the Hadamard product for $\zeta(s)$ together with a positivity argument and a known special value of zeta. However, unlike Riemann’s method to numerically compute pointwise values of $\zeta(s)$ , which became a standard method known as the Riemann–Siegel formula [siegel_1932], Riemann’s approach to verifying the RH remained little known. It is worth remarking, though, that Gram [gram_1895, gram_1903] considered the power series of the logarithm of the Hadamard product, as Riemann did, but often appeared to assume the RH. One may also compare the Riemann approach to verification with the Li criterion [li_1997] for the RH equivalence.

After Riemann’s 1859 paper, various efficient methods for verifying the RH were derived. Backlund [backlund_1911, backlund_1916] devised a verification method that relied on a clever application of the argument principle from complex analysis together with the Euler–Maclaurin summation for $\zeta(s)$ . This was eventually surpassed by a highly efficient method due to Turing [turing_1953], which has since become the standard method for verifying the RH, provided one is high enough on the critical line.

Refer to caption — Figure 1. Riemann’s approximation of the first three zeta zeros. Reproduced from [riemann3] with permission.

In comparison, the Riemann method, cited in Edwards [edwards_riemann_2001, §7.6], is time consuming at large heights. It can be expected to require $\sim\frac{1}{2\pi^{2}}(t_{0}\log t_{0})^{2}$ initial zeta zeros to verify the RH up to height $t_{0}$ . Nevertheless, this method is reasonably efficient at low heights while offering great simplicity. (Footnote 3: For example, $10$ zeta zeros suffice to verify the RH up to height $\gamma_{1}$ via this method, and $51$ zeta zeros suffice to verify the RH up to height $\gamma_{2}$ .) Therefore, it is worthwhile to generalize the Riemann method to families of $L$ -functions where even the “first” zero of $L(s)$ is deeply interesting. Such a generalization is one of the main goals of this paper.

Specifically, rather than fall back on a generalized Backlund method, which would require using a numerically-involved application of the argument principle, we re-examine and develop the Riemann method in a more general setting. Our generalization works naturally with already available databases and software for $L$ -functions. Our generalization is also simple to derive and justify, requiring a single numerical evaluation of a logarithmic derivative of the $L$ -function at a special point in the region of absolute convergence.

Although the main goal of this paper is to provide a simple RH verification for a large class of $L$ -functions at low heights, a secondary goal is to improve the Riemann method for zeta so that it functions efficiently at large heights. This results in a conceptually straightforward verification method that can verify the RH over larger windows. Specifically, given zeros data in a window of size $\tau$ around height $y$ , the improved method in Theorem 13 is expected to succeed in verifying the RH in a window of size $\eta\gg\tau/\sqrt{\log y}$ around $y$ .

To illustrate our main results, let us state two corollaries. Corollary 1 provides an example of an RH verification test for low zeros of the Dirichlet $L$ -function $L(s,\chi_{d})$ , where $\chi_{d}$ is any real primitive character of fundamental discriminant $d$ . This corollary is obtained from Theorem 7, part (i), on setting $\delta=-1$ and $m=1$ , and using the formula for $w_{1,\delta}$ in Corollary 5.

Note the required value of the logarithmic derivative of $L(s,\chi_{d})$ that appears in Corollary 1 is well inside the region of absolute convergence of $L(s,\chi_{d})$ . So, the required value can be computed easily by truncating the Dirichlet series for the logarithmic derivative, even if $d$ is very large. Lemma 6 furnishes an explicit bound on the corresponding truncation error.

To state the next two corollaries, we will make use of the following quantity.

\iota(\eta):=\min\left(\frac{1}{1+\eta^{2}}+\frac{2}{4+\eta^{2}},\,\frac{12}{9+4\eta^{2}}\right).

Corollary 1.

Let $d$ be a positive fundamental discriminant, $\tau$ be a real positive number, and $\mathcal{Z}$ be a set of nonempty disjoint subintervals of the form $[\gamma_{-},\gamma_{+}]\subseteq[0,\tau]$ or of the form $[-\gamma_{0},\gamma_{0}]\subseteq[-\tau,\tau]$ . Suppose that $L(1/2+it,\chi_{d})$ has a zero of odd multiplicity in each subinterval in $\mathcal{Z}$ . Further, define

C(\mathcal{Z}):=\sum_{[\gamma_{-},\gamma_{+}]\in\mathcal{Z}}\frac{12}{9+4\gamma_{+}^{2}}+\sum_{[-\gamma_{0},\gamma_{0}]\in\mathcal{Z}}\frac{6}{9+4\gamma_{0}^{2}}.

Let $\lambda_{0}=0.57721566\ldots$ be the Euler constant. For any real positive number $\eta$ , if

2\iota(\eta)+C(\mathcal{Z})>\frac{1}{2}\log\frac{d}{\pi e^{\lambda_{0}}}+\frac{L^{\prime}}{L}(2,\chi_{d}),

then the RH holds for all the nontrivial zeros of $L(s,\chi_{d})$ with positive height $\leq\eta$ .

Here, one may think of $\tau$ as the width of the window where known zeros data is available, and of $\eta$ as the width of the window where one would like to verify the RH. The quantity $C(\mathcal{Z})$ is the minimal contribution from known zeros (i.e. from supplied zeros data), and $\iota(\eta)$ is the minimal contribution of a hypothetical counter-example of positive height $\leq\eta$ . The displayed inequality indicates that a contradiction has been reached, so that a hypothetical counter-example of positive height $\leq\eta$ cannot exist.
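The test in Corollary 1 can be sketched numerically as follows. This is only an illustration in floating point, not the interval-arithmetic computation of §7. The function names are hypothetical, and the value of the logarithmic derivative at $s=2$ is assumed to be supplied by the caller (for instance from a truncated Dirichlet series).

```python
import math

EULER_GAMMA = 0.5772156649015329  # the Euler constant, lambda_0 in the text


def iota(eta):
    """iota(eta): minimal contribution of a hypothetical counter-example."""
    edge_of_strip = 1.0 / (1.0 + eta**2) + 2.0 / (4.0 + eta**2)  # beta = 0 case
    critical_line = 12.0 / (9.0 + 4.0 * eta**2)                  # beta = 1/2 case
    return min(edge_of_strip, critical_line)


def known_zeros_contribution(positive_intervals=(), symmetric_intervals=()):
    """C(Z) from Corollary 1.

    positive_intervals: pairs (gamma_minus, gamma_plus) inside [0, tau].
    symmetric_intervals: endpoints gamma_0 of intervals [-gamma_0, gamma_0].
    """
    total = sum(12.0 / (9.0 + 4.0 * hi**2) for _, hi in positive_intervals)
    total += sum(6.0 / (9.0 + 4.0 * g0**2) for g0 in symmetric_intervals)
    return total


def corollary1_margin(d, eta, log_deriv_at_2,
                      positive_intervals=(), symmetric_intervals=()):
    """Left side minus right side of the inequality in Corollary 1.

    log_deriv_at_2 is an ASSUMED input: an approximation of L'/L(2, chi_d).
    A positive margin indicates that the test passes (up to rounding).
    """
    lhs = 2.0 * iota(eta) + known_zeros_contribution(
        positive_intervals, symmetric_intervals)
    rhs = 0.5 * math.log(d / (math.pi * math.exp(EULER_GAMMA))) + log_deriv_at_2
    return lhs - rhs
```

A positive return value of `corollary1_margin` corresponds to the displayed inequality holding. A rigorous verification would additionally need to control the rounding and truncation errors.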

Similarly, by combining Theorem 11, part (i) and Corollary 9, we obtain an RH verification test for zeta zeros at large heights. This is stated in Corollary 2. But in this case we can improve the basic verification test substantially by considering the behavior of $S(u)$ , which is the fluctuating part of the counting function of zeta zeros; see (23). Specifically, in Theorem 13, we incorporate the explicit bounds

|S(u)|\leq\ell(u)\qquad\text{and}\qquad\left|\int_{u_{0}}^{u}S(\nu)\,d\nu\right|\leq\ell_{1}(u),

where, according to [trudgian_2014, trudgian_2011], we may take

(2)

\begin{split}\ell(u)&:=0.112\log u+0.278\log\log u+2.510,\\ \ell_{1}(u)&:=0.059\log u+2.067,\end{split}

provided $u$ is large enough. ( $u>u_{0}>168\pi$ suffices.) We note that an explicit bound on $|\int_{u_{0}}^{u}S(\nu)\,d\nu|$ is a main ingredient in the Turing method as well, but we use this bound differently in our case. Also, without additional knowledge or analysis, the explicit bound on $|S(u)|$ is typically far more impactful for us than the explicit bound on $|\int_{u_{0}}^{u}S(\nu)\,d\nu|$ .

We will make use of the following notation and quantity. Let $g(s)=(s-1)\zeta(s)$ , $\psi_{0}$ denote the digamma function, and define

\kappa(y,\tau):=\frac{0.57}{\tau y^{2}}+\frac{3\log 2}{2\pi y}+\frac{2\log(y/2\pi)}{\pi\tau^{3}}+\frac{3\log(y/2\pi)}{\pi y}+\frac{12\,\ell(2y)}{y^{2}}+\frac{6\,\ell_{1}(2y)}{\tau^{3}}.
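For orientation, the quantities $\ell$ , $\ell_{1}$ and $\kappa$ defined above can be evaluated directly. The following is a minimal floating-point sketch with hypothetical function names. It does not check the ranges in which the bounds (2) are valid.

```python
import math


def ell(u):
    """Explicit bound on |S(u)| from (2); valid only for u large enough."""
    return 0.112 * math.log(u) + 0.278 * math.log(math.log(u)) + 2.510


def ell1(u):
    """Explicit bound on the integral of S from (2); valid for u large enough."""
    return 0.059 * math.log(u) + 2.067


def kappa(y, tau):
    """kappa(y, tau), term by term as in the displayed definition."""
    log_term = math.log(y / (2.0 * math.pi))
    return (0.57 / (tau * y**2)
            + 3.0 * math.log(2.0) / (2.0 * math.pi * y)
            + 2.0 * log_term / (math.pi * tau**3)
            + 3.0 * log_term / (math.pi * y)
            + 12.0 * ell(2.0 * y) / y**2
            + 6.0 * ell1(2.0 * y) / tau**3)
```

Each term of `kappa` decays in $y$ or in $\tau$ , which is consistent with the regime described in the text where $y$ is very large and $\tau$ is large.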

Corollary 2.

Let $y$ and $\tau$ be real numbers such that $3<\tau\leq y/2$ and $336\pi<y-\tau$ . Let $\mathcal{Z}$ be a set of nonempty disjoint subintervals of the form $[\gamma_{-},\gamma_{+}]\subseteq[y-\tau,y+\tau]$ such that $y$ does not belong to any of the subintervals in $\mathcal{Z}$ . Suppose that $\zeta(1/2+it)$ has a zero of odd multiplicity in each subinterval in $\mathcal{Z}$ . Further, define

D(\mathcal{Z}):=\sum_{\substack{[\gamma_{-},\gamma_{+}]\in\mathcal{Z}\\ \gamma_{-}>y}}\frac{6}{9+4(\gamma_{+}-y)^{2}}+\sum_{\substack{[\gamma_{-},\gamma_{+}]\in\mathcal{Z}\\ \gamma_{+}<y}}\frac{6}{9+4(y-\gamma_{-})^{2}}.

For any real positive number $\eta\leq y$ , if

\begin{split}\iota(\eta)+D(\mathcal{Z})+\frac{3}{2\pi\tau}\log\frac{y}{2\pi}-\frac{6\,\ell(2y)}{2\tau^{2}}-\kappa(y,\tau)>{}&-\frac{1}{2}\log\pi+\frac{1}{2}\textrm{Re}\,\psi_{0}\left(2-\frac{iy}{2}\right)\\ &+\textrm{Re}\,\frac{g^{\prime}}{g}(2-iy),\end{split}

then the RH holds for all the nontrivial zeros of $\zeta(s)$ with height in $[y-\eta,y+\eta]$ .

Remark.

Here, one may think of $y$ as very large, of $\tau$ as large but much smaller than $y$ , and of $\eta$ as somewhat smaller than $\tau$ . The expression for $\kappa(y,\tau)$ is obtained from Theorem 13 by setting $x=-1$ and $c=y/2$ .

Like before, the special value of the logarithmic derivative of $g(s)$ appearing in Corollary 2 is well inside the region of absolute convergence of $\zeta(s)$ . So this value can be approximated easily and fairly accurately via a truncated sum over primes and prime powers using our Lemma 10, even at very large heights.
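To illustrate how the right-hand side of the inequality in Corollary 2 might be approximated, here is a floating-point sketch. It uses the standard identity $g^{\prime}/g(s)=1/(s-1)-\sum_{n}\Lambda(n)n^{-s}$ for $\sigma>1$ , truncated at an assumed cutoff `K`, and the standard asymptotic series for the digamma function. The function names and the cutoff are hypothetical, and the sketch makes no use of the explicit error bound of Lemma 10.

```python
import cmath
import math


def von_mangoldt_table(K):
    """Return lam with lam[n] = Lambda(n) for 0 <= n <= K (simple sieve)."""
    lam = [0.0] * (K + 1)
    is_composite = [False] * (K + 1)
    for p in range(2, K + 1):
        if is_composite[p]:
            continue
        for multiple in range(p * p, K + 1, p):
            is_composite[multiple] = True
        q = p
        while q <= K:          # prime powers p, p^2, ... carry weight log p
            lam[q] = math.log(p)
            q *= p
    return lam


def log_deriv_g(s, K=10**4):
    """Truncated g'/g(s) for g(s) = (s-1) zeta(s), assuming Re(s) > 1."""
    lam = von_mangoldt_table(K)
    prime_sum = sum(lam[n] * cmath.exp(-s * math.log(n))
                    for n in range(2, K + 1) if lam[n])
    return 1.0 / (s - 1.0) - prime_sum


def digamma(z):
    """psi_0(z) for Re(z) > 0: shift by the recurrence, then asymptotic series."""
    z = complex(z)
    shift = 0j
    while abs(z) < 10.0:
        shift -= 1.0 / z       # psi_0(z) = psi_0(z + 1) - 1/z
        z += 1.0
    z2 = 1.0 / (z * z)
    return (shift + cmath.log(z) - 0.5 / z
            - z2 * (1.0 / 12 - z2 * (1.0 / 120 - z2 / 252)))


def corollary2_rhs(y, K=10**4):
    """Right-hand side of the inequality in Corollary 2 (floating point only)."""
    return (-0.5 * math.log(math.pi)
            + 0.5 * digamma(2.0 - 0.5j * y).real
            + log_deriv_g(2.0 - 1j * y, K).real)
```

Since the evaluation point has real part $2$ , the tail of the prime sum beyond `K` is of size roughly $1/K$ regardless of $y$ . A rigorous computation would replace this heuristic by the explicit bound and by interval arithmetic.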

Our main theorems, Theorem 7 and Theorem 11, additionally enable verifying the simplicity of zeros in a given range as well as verifying the completeness of a given list of zeros. In addition, Theorem 13 gives a counterpart that allows one to still draw a conclusion in some situations where the RH might not be verified using the Turing method. For example, if the given zeros list is incomplete (i.e. there is a zero with ordinate in $[y-\tau,y+\tau]$ that is missing from the list), then the Turing method might not prove that the zeros list is indeed incomplete. In this case, the counterpart in Theorem 13 will typically enable proving that the given zeros list is indeed incomplete.

Lastly, it is completely reasonable to expect that a similar method to the one described here may be derived using the framework of the explicit formula [iwaniec_kowalski_2004, §5.5]. By choosing a suitable test function in the explicit formula, one may even accelerate the convergence of the associated series over the primes and prime powers. At the same time, though, one must ensure, under no assumption, that the individual terms in the sum over the zeros appearing in the explicit formula are nonnegative. We favored the current derivation due to its simplicity, its historical connection, and because we already have good control over the convergence of the said series in the region of absolute convergence. Additionally, the current derivation gives us access to several useful exact values and exacting relations as well as to long-studied sums in the theory of the Riemann zeta function, which benefits the practicality of our derivation. For example, we can directly benefit from exact values of the polygamma function and, if we wish, of exact values of $L$ -functions at special points such as the class number formula for Dirichlet $L$ -functions.

Overview. In §2, we provide background and set up some notation. In §3, we outline the Riemann approach to verifying the RH following the description in [edwards_riemann_2001]. In §4, we generalize the Riemann approach to a class of $L$ -functions with real Dirichlet coefficients. In §5, we treat the case of $\zeta$ separately, both because $\zeta(s)$ is outside our class of $L$ -functions (in view of the pole at $s=1$ ) and because our focus for zeta will be on large heights. In §6, we discuss substantial improvements in the case of zeta. In §7, we present results of numerical computations implemented in interval arithmetic for a variety of examples of $L$ -functions.

2. Background and notation

Using the Dirichlet series (1), we see that

(3)

\zeta(\overline{s})=\overline{\zeta(s)}.

So, $\rho=\beta+i\gamma$ is a zeta zero if and only if the complex conjugate $\overline{\rho}=\beta-i\gamma$ is a zeta zero, or equivalently the $\rho$ ’s are symmetric about the real axis. The $\rho$ ’s are also symmetric about the critical line. This is seen by using the zeta functional equation, which in its simplest form states that the entire function

\xi(s):=\pi^{-s/2}\Gamma(s/2+1)(s-1)\zeta(s),

where $\Gamma$ is the Gamma function (Footnote 4: The poles of $\Gamma(s/2+1)$ are all simple and coincide with the trivial zeros of zeta, all of which are simple as well. So, the poles of $\Gamma(s/2+1)$ cancel the trivial zeros of zeta. The simple pole of zeta at $s=1$ coincides with the zero of the factor $s-1$ in the definition of $\xi$ .), satisfies the functional equation

(4)

\xi(s)=\xi(1-s).

Therefore, $\xi(s)$ is even about $s=1/2$ . For example, $\xi(0)=\xi(1)=-\zeta(0)=1/2$ . Since $\Gamma$ and $\pi^{-s/2}$ have no zeros at all, the zeros of $\xi$ are the same as the nontrivial zeros of $\zeta$ . Hence, by the functional equation (4), $\rho$ is a zeta zero if and only if $1-\rho$ is a zeta zero.

Furthermore, by the functional equation (4) and the symmetry relation (3),

\xi(1/2+it)=\xi(1/2-it)=\overline{\xi(1/2+it)}.

So, $\xi$ is real-valued on the critical line (as well as on the real axis). It follows by the intermediate value theorem that the simple (or odd multiplicity) nontrivial zeta zeros on the critical line correspond to sign changes of $\xi(1/2+it)$ . In particular, one can numerically prove the existence of zeta zeros of odd multiplicity on the critical line by detecting sign changes of $\xi(1/2+it)$ .
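The sign-change detection described above can be sketched in floating point as follows. Instead of $\xi(1/2+it)$ itself, the sketch uses the Hardy function $Z(t)=e^{i\theta(t)}\zeta(1/2+it)$ , which is real and differs from $\xi(1/2+it)$ by a factor of constant sign, so it has the same sign changes. The evaluation of $\zeta$ by Borwein's alternating-series algorithm, the truncated asymptotic series for $\theta(t)$ , and all function names are choices made for this illustration. They are not the method used in this paper, and the result is not a rigorous proof without error control.

```python
import cmath
import math
from fractions import Fraction


def borwein_weights(n):
    """Weights (d_k - d_n)/d_n, k = 0..n-1, of Borwein's algorithm."""
    d, acc = [], Fraction(0)
    for i in range(n + 1):
        acc += Fraction(n * math.factorial(n + i - 1) * 4**i,
                        math.factorial(n - i) * math.factorial(2 * i))
        d.append(acc)
    return [float((d[k] - d[n]) / d[n]) for k in range(n)]


_WEIGHTS = borwein_weights(80)  # assumed adequate for |t| up to about 50


def zeta(s):
    """Approximate zeta(s) for Re(s) > 0, s != 1, and moderate |Im(s)|."""
    total = sum((-1) ** k * w * cmath.exp(-s * math.log(k + 1))
                for k, w in enumerate(_WEIGHTS))
    return -total / (1.0 - 2.0 ** (1.0 - s))


def theta(t):
    """Riemann-Siegel theta function via its asymptotic series (t >= 10, say)."""
    return (0.5 * t * math.log(t / (2.0 * math.pi)) - 0.5 * t - math.pi / 8.0
            + 1.0 / (48.0 * t) + 7.0 / (5760.0 * t**3))


def hardy_z(t):
    """Hardy Z-function; real-valued, same sign changes as xi(1/2 + it)."""
    return (cmath.exp(1j * theta(t)) * zeta(0.5 + 1j * t)).real


def sign_change_brackets(f, a, b, step):
    """Scan [a, b] on a uniform grid and return intervals where f changes sign."""
    n = int(round((b - a) / step))
    brackets = []
    prev_t, prev_v = a, f(a)
    for k in range(1, n + 1):
        t = a + k * step
        v = f(t)
        if prev_v * v < 0:
            brackets.append((prev_t, t))
        prev_t, prev_v = t, v
    return brackets
```

For instance, `sign_change_brackets(hardy_z, 10.0, 30.0, 0.1)` returns a list of short intervals, each of which contains a zero of odd multiplicity on the critical line, provided the floating-point signs are trusted.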

3. Riemann and verifying the RH

Being an entire function of order 1, $\xi$ has a Hadamard product given by

(5)

\xi(s)=\xi(0)\prod_{\rho}(1-s/\rho),

where the product is taken by pairing the terms for $\rho$ and $\overline{\rho}$ (or pairing the terms for $\rho$ and $1-\rho$ ), which ensures correct convergence. Starting with (5), Riemann obtained the following formula

(6)

\sum_{\rho}\frac{1}{\rho}=v_{1}\qquad\text{where}\qquad v_{1}:=\frac{1}{2}\lambda_{0}+1-\frac{1}{2}\log 4\pi,

and the sum over the $\rho$ ’s is executed by pairing the terms for $\rho$ and $\overline{\rho}$ . Therefore,

v_{1}=2\sum_{\gamma>0}\textrm{Re}\,\frac{1}{\rho}.

As seen in Figure 2, Riemann correctly computed the value of $v_{1}$ up to 20 digits, obtaining $v_{1}=0.02309570896612103381\ldots$ .

According to Edwards [edwards_riemann_2001, §7.6], Riemann even attempted to use the numerical value of $v_{1}$ to verify that the Riemann approximation of $\rho_{1}=1/2+i\gamma_{1}$ indeed corresponded to the first zeta zero (zeta zero of lowest height). This attempt is described essentially as follows.

Using the first $10$ zeros in the upper half-plane, $2\,\textrm{Re}\,(\rho_{1}^{-1}+\ldots+\rho_{10}^{-1})\approx 0.0136$ . On the other hand, if there is a zero $\rho_{0}$ in the upper half-plane of height $<\gamma_{1}$ , then there must be a second such zero. Indeed, either $\rho_{0}$ is off the critical line, in which case $1-\overline{\rho_{0}}$ is a distinct zeta zero in the upper half-plane that is also of height $<\gamma_{1}$ , or $\rho_{0}$ is on the critical line, in which case, considering that $\xi(1/2+it)$ has the same sign at both $t=0$ and $t=14.1$ , there must be a second zero on the critical line with a positive ordinate $<\gamma_{1}$ . (Footnote 5: More precisely, the argument in [edwards_riemann_2001, §7.6] only works if $\rho_{0}$ has height $<14.1$ . Since the possibility that $\rho_{0}$ has height $\geq 14.1$ is not yet ruled out, this argument does not force the existence of a second zero on the critical line in this case.) Therefore, if $\rho_{0}$ existed, then it would force an additional contribution of at least $2\,\textrm{Re}(\rho_{1}^{-1})$ , causing the zeros sum to exceed $v_{1}$ and hence giving a contradiction.

Although not stated explicitly, it is critical to the last part of the argument that the terms

\textrm{Re}\,\frac{1}{\rho}=\frac{\beta}{\beta^{2}+\gamma^{2}}

are all nonnegative. This ensures that the tail of the zeros sum contributes a nonnegative amount to $v_{1}$ . Therefore, we can drop the tail of the zeros sum and still obtain a valid lower bound on $v_{1}$ .
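The numbers in this argument are easy to reproduce in floating point. In the sketch below, the first three ordinates are those quoted in §1. The remaining seven are standard published values rounded to six decimals and are supplied here only for illustration. The lower bound used for a hypothetical pair $\rho_{0}$ , $1-\overline{\rho_{0}}$ off the critical line with height below $\gamma_{1}$ is $2\phi(0,\gamma_{1},0)=2/(1+\gamma_{1}^{2})$ , with $\phi$ as in (7). This follows from Lemma 3, parts (ii) and (iv), with $x=0$ .

```python
import math

EULER_GAMMA = 0.5772156649015329  # lambda_0 in the text

# Ordinates of the first ten zeta zeros (rounded; assumed input data).
ORDINATES = [14.134725, 21.022040, 25.010858, 30.424876, 32.935062,
             37.586178, 40.918719, 43.327073, 48.005151, 49.773832]

# v_1 from formula (6).
v1 = 0.5 * EULER_GAMMA + 1.0 - 0.5 * math.log(4.0 * math.pi)

# For rho = 1/2 + i*gamma, 2 Re(1/rho) = 1 / (1/4 + gamma^2).
partial_sum = sum(1.0 / (0.25 + g * g) for g in ORDINATES)

# Minimal contribution to v_1 of a hypothetical off-line pair below gamma_1.
extra = 2.0 / (1.0 + ORDINATES[0] ** 2)

print(f"v1                  = {v1:.10f}")
print(f"known zeros         = {partial_sum:.10f}")
print(f"known + hypothetical = {partial_sum + extra:.10f}")
print("contradiction reached:", partial_sum + extra > v1)
```

The known zeros alone stay below $v_{1}$ , as they must, while adding the hypothetical pair pushes the lower bound above $v_{1}$ .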

More generally, in this paper, we will consider the behavior of the function

(7)

\phi(\beta,\eta,x):=\frac{\beta-x}{(\beta-x)^{2}+\eta^{2}}+\frac{1-\beta-x}{(1-\beta-x)^{2}+\eta^{2}}.

If $z=x+iy$ is a complex number then we have

\textrm{Re}\,\left[\frac{1}{\rho-z}+\frac{1}{1-\overline{\rho}-z}\right]=\phi(\beta,\gamma-y,x).

Note that $\phi(\beta,\eta,x)$ is nonnegative for $0\leq\beta\leq 1$ and $x\leq 0$ . To analyze the behavior of $\phi$ in detail, we will often invoke the following lemma.

Lemma 3.

Let $\beta$ be a real number such that $0\leq\beta\leq 1$ . Let $x$ be a real nonpositive number, and let $\eta$ be a real positive number. Then $\phi(\beta,\eta,x)\geq 0$ . Furthermore, we have the following.

(i)

If $\displaystyle\eta\leq\sqrt{\frac{x(x-1)}{3}}$ , then $\phi(\beta,\eta,x)$ is minimized at $\displaystyle\beta=\frac{1}{2}$ .
(ii)

If $\displaystyle\eta>\sqrt{\frac{x(x-1)}{3}}$ , then $\phi(\beta,\eta,x)$ is minimized at $\beta=0$ (or $\beta=1$ ).
(iii)

If $\displaystyle\eta>\frac{1-2x}{2\sqrt{3}}$ , then $\phi(\beta,\eta,x)$ is maximized at $\displaystyle\beta=\frac{1}{2}$ .
(iv)

$\displaystyle\frac{\partial}{\partial u}\phi(\beta,u,x)$ is negative. Additionally, if $\displaystyle u>\frac{1-2x}{2\sqrt{3}}$ then $\displaystyle\frac{\partial}{\partial u}\phi(1/2,u,x)$ is increasing, and if $\displaystyle u>\frac{2-2x}{2\sqrt{3}}$ then $\displaystyle\frac{\partial}{\partial u}\phi(0,u,x)$ is increasing.

Proof.

See §9. ∎
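Parts (i) to (iii) of Lemma 3 can be spot-checked numerically. The following sketch evaluates $\phi$ from (7) on a grid in $\beta$ and locates the minimizer. The grid search and the function names are illustrative only and prove nothing.

```python
import math


def phi(beta, eta, x):
    """phi(beta, eta, x) as defined in (7)."""
    a, b = beta - x, 1.0 - beta - x
    return a / (a * a + eta * eta) + b / (b * b + eta * eta)


def argmin_beta(eta, x, steps=1000):
    """Grid minimizer of phi over beta in [0, 1/2].

    The range [1/2, 1] is covered by the symmetry beta <-> 1 - beta.
    """
    betas = [k / steps for k in range(steps // 2 + 1)]
    return min(betas, key=lambda beta: phi(beta, eta, x))


x = -1.0
threshold_min = math.sqrt(x * (x - 1.0) / 3.0)          # parts (i) and (ii)
threshold_max = (1.0 - 2.0 * x) / (2.0 * math.sqrt(3.0))  # part (iii)

print("eta = 0.5 -> minimizer", argmin_beta(0.5, x))  # below threshold_min
print("eta = 2.0 -> minimizer", argmin_beta(2.0, x))  # above threshold_min
```

With $x=-1$ the two thresholds are close but distinct, so there is a narrow range of $\eta$ where $\beta=1/2$ is neither the minimizer nor the maximizer of $\phi$ .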

4. Generalization to a class of $L$ -functions

In the sequel, we use the analytic normalization of $L$ -functions, so the critical line is $\sigma=1/2$ . We consider $L$ -functions of order $1$ only. The following notation and assumptions are used throughout this section. Let $L(s)$ be a Dirichlet series

L(s)=\sum_{n\geq 1}\frac{a(n)}{n^{s}},

absolutely convergent in the half-plane $\sigma>1$ . We suppose that the Dirichlet coefficients $a(n)$ are real, so that

L(\overline{s})=\overline{L(s)},

and the zeros of $L(s)$ must be symmetric about the real axis.

Following the notation in Booker [booker_2006], specialized to our context, we state a number of assumptions satisfied by the set of $L$ -functions we consider. (Footnote 6: In particular, we require that the $\mu_{j}$ are real and $\mu_{j}\geq 0$ instead of $\textrm{Re}\,(\mu_{j})\geq-1/2$ . We also write the formulas for $\Gamma_{\mathbb{R}}(s)$ and $\overline{f}(z)$ explicitly as $\pi^{-s/2}\Gamma(s/2)$ and $\overline{f(\overline{z})}$ , respectively, as well as drop a scaling factor by $N^{-1/4}$ in the definition of $\gamma(s)$ in [booker_2006, 387] as this does not interfere with any of our calculations.) $L(s)$ has an Euler product of degree $r$ absolutely convergent in the half-plane $\sigma>1$ ,

L(s)=\prod_{p\,\text{prime}}\frac{1}{(1-\alpha_{p,1}p^{-s})\cdots(1-\alpha_{p,r}p^{-s})},

where the $\alpha_{p,j}$ satisfy the conditions in [booker_2006, 387]. We will further assume that $|\alpha_{p,j}|\leq 1$ . Note that by the absolute convergence of the Euler product, $L(s)$ has no zeros in the half-plane $\sigma>1$ .

Suppose further there are positive integers $r$ and $N$ , a complex number $\epsilon$ of modulus $1$ , and real nonnegative numbers $\mu_{1},\ldots,\mu_{r}$ , such that the function $\xi_{L}(s)$ defined by

(8)

\xi_{L}(s):=\gamma(s)L(s),\qquad\gamma(s):=\epsilon N^{s/2}\pi^{-sr/2}\prod_{j=1}^{r}\Gamma(s/2+\mu_{j}/2),

extends to an entire function and satisfies the functional equation

(9)

\xi_{L}(s)=\overline{\xi_{L}(1-\overline{s})}.

Note that by the functional equation, $\xi_{L}(1/2+it)$ is real. Also, $\xi_{L}$ is real on the real axis. If $\epsilon=\pm 1$ , then the functional equation simplifies to $\xi_{L}(s)=\xi_{L}(1-s)$ , which means that $\xi_{L}(1/2+it)$ is even in $t$ . If instead $\epsilon=\pm i$ , then $\xi_{L}(s)=-\xi_{L}(1-s)$ , which means that $\xi_{L}(1/2+it)$ is odd in $t$ , and hence must have a zero of odd multiplicity at $t=0$ .

Since $\xi_{L}(s)$ is entire, $L(s)$ must have zeros at the poles of $\gamma(s)$ , which are the trivial zeros of $L(s)$ . Since $L(s)$ has no zeros in the half-plane $\sigma>1$ , it follows by the functional equation that the trivial zeros of $L(s)$ in $\sigma<0$ have the same multiplicities as the poles of $\gamma(s)$ . Moreover, the nontrivial zeros of $L(s)$ , which we denote by $\rho=\beta+i\gamma$ , are in the critical strip $0\leq\sigma\leq 1$ .

We assume $L(1)\neq 0$ , so that $\xi_{L}(1)\neq 0$ , and hence $\xi_{L}(0)\neq 0$ . Therefore, the zeros of $\xi_{L}(s)$ are exactly the nontrivial zeros of $L(s)$ . Also, just like $\xi(s)$ , $\xi_{L}(s)$ being of order $1$ has a Hadamard product

\xi_{L}(s)=\xi_{L}(0)\prod_{\rho}(1-s/\rho),

where we pair the terms for $\rho$ and $\overline{\rho}$ (or for $\rho$ and $1-\rho$ ). The RH for $L(s)$ is the assertion that all the $\rho$ ’s are on the critical line $\sigma=1/2$ .

To state the next lemma, we recall the $j$ -th order polygamma function $\psi_{j}(s)$ , defined as the $j$ -th derivative of $\psi_{0}(s)=\Gamma^{\prime}(s)/\Gamma(s)$ . Also, for any real number $\delta<1$ such that $L(1-\delta)\neq 0$ , let us write

(10)

\log L(s-\delta)=\sum_{j\geq 0}d_{j,\delta}(s-1)^{j},\qquad d_{j,\delta}=\frac{1}{j!}\left[\frac{d^{j}}{ds^{j}}\log L(s-\delta)\right]_{s=1}

for $s$ sufficiently close to $1$ .

Lemma 4.

Let $k$ be a positive integer. Let $\delta$ be a real number such that $\delta<1$ and $\xi_{L}(\delta)\neq 0$ . Define

w_{k,\delta}:=\sum_{\rho}\frac{1}{(\rho-\delta)^{k}},

where the sum is ordered by pairing each term with its conjugate. Then $w_{k,\delta}$ is a real number. If $k>1$ , then

w_{k,\delta}=(-1)^{k-1}\left[\frac{1}{2^{k}(k-1)!}\sum_{j=1}^{r}\psi_{k-1}(1/2-\delta/2+\mu_{j}/2)+kd_{k,\delta}\right].

And if $k=1$ , then the same formula holds but there is an additional term of

\frac{1}{2}\log N-\frac{r}{2}\log\pi.

Proof.

Since $\delta$ is a real number and the $\rho$ ’s are symmetric about the real axis, $w_{k,\delta}$ is real. By the Hadamard product for $\xi_{L}$ ,

\log\xi_{L}(s+\delta)=\log\xi_{L}(\delta)-\sum_{k\geq 1}\frac{w_{k,\delta}}{k}s^{k},

provided $s$ is sufficiently close to $0$ . Therefore,

(11)

-\frac{w_{k,\delta}}{k}=\frac{1}{k!}\left[\frac{d^{k}}{ds^{k}}\log\xi_{L}(s+\delta)\right]_{s=0}.

By the functional equation (9),

(12)

\frac{1}{k!}\left[\frac{d^{k}}{ds^{k}}\log\xi_{L}(s+\delta)\right]_{s=0}=\frac{(-1)^{k}}{k!}\left[\frac{d^{k}}{ds^{k}}\log\xi_{L}(s-\delta)\right]_{s=1}.

On the other hand, recalling the definition of $\xi_{L}(s)$ ,

\log\xi_{L}(s)=\log\epsilon+\frac{s}{2}\log N-\frac{sr}{2}\log\pi+\sum_{j=1}^{r}\log\Gamma(s/2+\mu_{j}/2)+\log L(s)

for $s$ away from zeros or poles of both sides. Therefore, replacing $s$ with $s-\delta$ , and using the series expansion (10), we obtain

(13)

\begin{split}\frac{1}{k!}\left[\frac{d^{k}}{ds^{k}}\log\xi_{L}(s-\delta)\right]_{s=1}={}&\mathds{1}_{k=1}\left(\frac{1}{2}\log N-\frac{r}{2}\log\pi\right)\\ &+\frac{1}{2^{k}k!}\sum_{j=1}^{r}\psi_{k-1}(1/2-\delta/2+\mu_{j}/2)+d_{k,\delta},\end{split}

where $\mathds{1}_{k=1}$ is the indicator function of the condition $k=1$ . Substituting (13) back into (12), then back into (11), yields the lemma. ∎

Since our numerical experiments in §7 will focus on the case $k=1$ , we provide a version of Lemma 4 in this special case.

Corollary 5.

When $k=1$ , we have

w_{1,\delta}=\frac{1}{2}\log N-\frac{r}{2}\log\pi+\frac{1}{2}\sum_{j=1}^{r}\psi_{0}\left(\frac{1-\delta+\mu_{j}}{2}\right)+\frac{L^{\prime}(1-\delta)}{L(1-\delta)}.
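The formula of Corollary 5 translates directly into a short routine. The sketch below is a floating-point illustration with hypothetical function names. The digamma function is computed by the standard recurrence and asymptotic series (the Python standard library does not provide it), and the value of the logarithmic derivative $L^{\prime}/L(1-\delta)$ is assumed to be supplied by the caller.

```python
import math


def digamma(x):
    """psi_0(x) for real x > 0: recurrence up to x >= 10, then asymptotic series."""
    acc = 0.0
    while x < 10.0:
        acc -= 1.0 / x         # psi_0(x) = psi_0(x + 1) - 1/x
        x += 1.0
    inv2 = 1.0 / (x * x)
    return (acc + math.log(x) - 0.5 / x
            - inv2 * (1.0 / 12 - inv2 * (1.0 / 120 - inv2 / 252)))


def w1(N, mus, delta, log_deriv):
    """w_{1,delta} from Corollary 5.

    N: the positive integer N in (8); mus: the list mu_1, ..., mu_r;
    delta: real number < 1; log_deriv: ASSUMED approximation of L'/L(1 - delta).
    """
    r = len(mus)
    gamma_part = 0.5 * sum(digamma((1.0 - delta + mu) / 2.0) for mu in mus)
    return (0.5 * math.log(N) - 0.5 * r * math.log(math.pi)
            + gamma_part + log_deriv)
```

For example, with $r=1$ , $\mu_{1}=0$ , $N=d$ and $\delta=-1$ , the digamma term is $\psi_{0}(1)/2=-\lambda_{0}/2$ , and `w1` reduces to the right-hand side of the inequality in Corollary 1.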

Let us note that many special values of $\psi_{0}(s)$ can be expressed exactly in terms of known constants. (Footnote 7: For example, when $s=1/2$ , $\psi_{0}(1/2)=-2\log 2-\lambda_{0}$ , $\psi_{1}(1/2)=\pi^{2}/2$ , $\psi_{2}(1/2)=-14\zeta(3)$ , $\psi_{3}(1/2)=\pi^{4}$ , and more generally for $j\geq 1$ , $\psi_{j}(1/2)=(-1)^{j+1}j!(2^{j+1}-1)\zeta(j+1)$ . As another example, when $s=1$ , we have $\psi_{0}(1)=-\lambda_{0}$ .) In general, there are efficient ways for computing $\psi_{0}(x)$ for $x>0$ ; see for example [johansson_2021] for a discussion of methods to compute $\Gamma$ , $\psi_{0}$ , and related functions. Therefore, for the purpose of computing $w_{1,\delta}$ , we may focus our attention on the logarithmic derivative of $L(s)$ at $s=1-\delta$ . The next lemma supplies a simple formula for doing this, provided $\delta<0$ . We make use of the following notation: if $n=p^{m}$ for a prime $p$ and a natural number $m$ , then

\Lambda_{L}(n):=\log p\sum_{j=1}^{r}\alpha_{p,j}^{m},

and we set $\Lambda_{L}(n)=0$ otherwise. In particular, since $|\alpha_{p,j}|\leq 1$ ,

|\Lambda_{L}(n)|\leq r\Lambda(n),

where $\Lambda(n)$ is the von Mangoldt function. This is defined by $\Lambda(n)=\log p$ if $n=p^{m}$ for a prime $p$ and a natural number $m$ , and $\Lambda(n)=0$ otherwise.

Lemma 6.

Let $K\geq 18$ be an integer. If $\delta<0$ , then

\frac{L^{\prime}(1-\delta)}{L(1-\delta)}=-\sum_{k=1}^{K}\frac{\Lambda_{L}(k)}{k^{1-\delta}}+\mathcal{R}_{L}(K,\delta),

where

\left|\mathcal{R}_{L}(K,\delta)\right|<\frac{rK^{\delta}}{\delta}\left(2.85\cdot\frac{2\delta-1}{\log K}-1\right).

Proof.

Suppose $\sigma>1$ . By the Euler product for $L(s)$ ,

(14)

\frac{L^{\prime}}{L}(s)=-\sum_{k\geq 1}\frac{\Lambda_{L}(k)}{k^{s}}.

So, by Stieltjes integration and the bound $|\Lambda_{L}(k)|\leq r\Lambda(k)$ , the tail $\mathcal{R}_{L}(K,\delta)$ of the Dirichlet series (14) for $k>K$ and $s=1-\delta$ satisfies

(15)

\left|\mathcal{R}_{L}(K,\delta)\right|\leq r\int_{K}^{\infty}\frac{1}{u^{1-\delta}}\,d\psi(u)\quad\text{where}\quad\psi(u)=\sum_{k\leq u}\Lambda(k).

Using integration by parts,

(16)

\int_{K}^{\infty}\frac{1}{u^{1-\delta}}\,d\psi(u)=-\frac{\psi(K)}{K^{1-\delta}}+(1-\delta)\int_{K}^{\infty}\frac{\psi(u)}{u^{2-\delta}}\,du.

Furthermore, by [rosser_1941, 227], we have for $u\geq K$ the double inequality

0<u\left(1-\frac{2.85}{\log K}\right)\leq\psi(u)\leq u\left(1+\frac{2.85}{\log K}\right).

Substituting this into (16), then back into (15), and integrating yields the result. ∎
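Lemma 6 can be turned into a short computation. The sketch below treats the degree $r=1$ case where $\Lambda_{L}(p^{m})=\chi(p)^{m}\log p$ for a real character $\chi$ . The character `chi5` (the Legendre symbol modulo $5$ , which is the real primitive character of discriminant $5$ ) is an illustrative choice, and the function names are hypothetical. Everything is in floating point, so rounding errors are not controlled.

```python
import math


def chi5(n):
    """Real primitive character modulo 5 (Legendre symbol); illustrative choice."""
    return (0, 1, -1, -1, 1)[n % 5]


def truncated_log_deriv(chi, K, delta):
    """-sum_{k <= K} Lambda_L(k) / k^(1 - delta) for a degree-1 L-function.

    Assumes Lambda_L(p^m) = chi(p)^m * log p, i.e. alpha_{p,1} = chi(p).
    """
    total = 0.0
    is_composite = [False] * (K + 1)
    for p in range(2, K + 1):
        if is_composite[p]:
            continue
        for multiple in range(p * p, K + 1, p):
            is_composite[multiple] = True
        q, coeff = p, chi(p)
        while q <= K:                      # prime powers q = p^m
            total -= math.log(p) * coeff / q ** (1.0 - delta)
            q *= p
            coeff *= chi(p)
    return total


def lemma6_bound(K, delta, r=1):
    """Upper bound on |R_L(K, delta)| from Lemma 6 (needs K >= 18, delta < 0)."""
    if K < 18 or delta >= 0:
        raise ValueError("Lemma 6 assumes K >= 18 and delta < 0")
    return (r * K**delta / delta) * (2.85 * (2.0 * delta - 1.0) / math.log(K) - 1.0)


K, delta = 10**4, -1.0
approx = truncated_log_deriv(chi5, K, delta)
print(f"L'/L(2, chi_5) is approximately {approx:.6f} "
      f"with truncation error below {lemma6_bound(K, delta):.2e}")
```

Both factors in `lemma6_bound` are negative when $\delta<0$ and $K\geq 18$ , so the returned bound is positive.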

Remark.

A simpler version of Lemma 6 is obtained by using the trivial bound $|\Lambda_{L}(k)|\leq r\log k$ . This gives

\left|\mathcal{R}_{L}(K,\delta)\right|\leq r\sum_{k>K}\frac{\log k}{k^{1-\delta}}<r\int_{K}^{\infty}\frac{\log u}{u^{1-\delta}}\,du=rK^{\delta}\cdot\frac{1-\delta\log K}{\delta^{2}}.

Although usually not as precise as Lemma 6, this estimate is sharper than Lemma 6 if $|\delta|$ is very large compared to $\log K$ .

Remark.

Lemma 6 generalizes easily to higher order logarithmic derivatives of $L(s)$ at $s=1-\delta$ . For example,

\left[\frac{d^{2}}{ds^{2}}\log L(s)\right]_{s=1-\delta}=\sum_{k=1}^{K}\frac{\Lambda_{L}(k)\log k}{k^{1-\delta}}+\mathcal{R}_{L,2}(K,\delta),

where

\left|\mathcal{R}_{L,2}(K,\delta)\right|<rK^{\delta}\left(2.85-\log K+\frac{1/(1-\delta)-\delta\log K}{\delta^{2}}(1-\delta)(1+2.85/\log K)\right).

Theorem 7 next is our main result in this section. Unlike the case of zeta, where none of the $\rho$ ’s is real, $L(s)$ might have real nontrivial zeros. So, care is needed to allow for this possibility. The following lemma will facilitate the proof of Theorem 7. Recall that the function $\phi$ was defined in (7), and that

\textrm{Re}\,\left[\frac{1}{\rho-\delta}+\frac{1}{1-\rho-\delta}\right]=\phi(\beta,\gamma,\delta).

Theorem 7.

Let $\delta$ be a real nonpositive number and let $\tau$ be a real positive number. Let $\mathcal{Z}$ be a set of nonempty disjoint subintervals of the form $[\gamma_{-},\gamma_{+}]\subseteq[0,\tau]$ or of the form $[-\gamma_{0},\gamma_{0}]\subseteq[-\tau,\tau]$ . Suppose that $\xi_{L}(1/2+it)$ has a sign change in each subinterval in $\mathcal{Z}$ . (Footnote 8: This means $\xi_{L}(1/2+i\tau_{1})<0<\xi_{L}(1/2+i\tau_{2})$ for some $\tau_{1},\tau_{2}$ in each subinterval in question.) Define

C(\mathcal{Z},\delta):=\sum_{[\gamma_{-},\gamma_{+}]\in\mathcal{Z}}\frac{1-2\delta}{(1/2-\delta)^{2}+\gamma_{+}^{2}}+\sum_{[-\gamma_{0},\gamma_{0}]\in\mathcal{Z}}\frac{1/2-\delta}{(1/2-\delta)^{2}+\gamma_{0}^{2}}.

For any real positive number $\eta$ , any positive integer $m$ , and with $\phi$ as in (7), define

\begin{split}f_{1}(\eta,\delta,m)&:=2m\cdot\min\left(\phi(0,\eta,\delta),\phi(1/2,\eta,\delta)\right),\\ f_{2}(\eta,\delta,m)&:=m\cdot\phi(1/2,\eta,\delta),\\ h_{1}(\delta,m)&:=m\cdot\phi(1/2,0,\delta),\\ h_{2}(\delta,m)&:=m/2\cdot\phi(1/2,0,\delta),\\ F(\eta,\delta)&:=\min\left(2\cdot\phi(0,\eta,\delta),\phi(1/2,\eta,\delta),1/2\cdot\phi(1/2,0,\delta)\right).\end{split}

Then, we have the following, where zeros are counted with multiplicity in all cases.

(i)

If $f_{1}(\eta,\delta,m)+C(\mathcal{Z},\delta)>w_{1,\delta}$ , then there are strictly fewer than $4m$ non-real $\rho$ ’s off the critical line of height $\leq\eta$ .
(ii)

If $f_{2}(\eta,\delta,m)+C(\mathcal{Z},\delta)>w_{1,\delta}$ , then there are strictly fewer than $2m$ non-real $\rho$ ’s on the critical line of height $\leq\eta$ not accounted for in $\mathcal{Z}$ .
(iii)

If $h_{1}(\delta,m)+C(\mathcal{Z},\delta)>w_{1,\delta}$ , then there are strictly fewer than $2m$ real $\rho$ ’s off the critical line.
(iv)

If $h_{2}(\delta,m)+C(\mathcal{Z},\delta)>w_{1,\delta}$ , then a zero at the central point $s=1/2$ has multiplicity strictly less than $m$ .
(v)

If $F(\eta,\delta)+C(\mathcal{Z},\delta)>w_{1,\delta}$ , then the list of zeros in $\mathcal{Z}$ is complete. This means that every $\rho$ in the upper half-plane of height $\leq\eta$ is on the critical line, is simple, and belongs to some subinterval in the set $\mathcal{Z}$ .

Remark.

There are important cases where the subintervals in $\mathcal{Z}$ should be allowed to appear with multiplicity. In such cases, the conclusions about the simplicity of zeros in parts (ii) and (iv–v) should be modified so as to account for any nonsimple zeros already present in $\mathcal{Z}$ . For example, the $L$ -function of an elliptic curve with analytic rank $>1$ has by definition a zero at $s=1/2$ of multiplicity $>1$ . Note that in this case, part (iv) of the theorem gives an unconditional upper bound on the analytic rank of the elliptic curve. Bober [bober_2013] gave a method to bound the analytic rank of elliptic curve $L$ -functions via the “explicit formula,” conditional on the RH for the corresponding $L$ -function.

Remark.

Let us explicitly note that if $\epsilon=\pm 1$ , then $\xi_{L}(1/2+it)$ is even, so any zero at the central point $s=1/2$ has even multiplicity. In this situation, it is unclear how a nonsimple zero at $s=1/2$ can be detected rigorously by numerical means via the intermediate value theorem, as there is no sign change to detect. Consequently, if $\epsilon=\pm 1$ and the zeros of height $\leq\tau$ have been sufficiently resolved, then the sum over intervals of the form $[-\gamma_{0},\gamma_{0}]\in\mathcal{Z}$ is expected to be empty.

Proof.

We prove part (i). Let $\{\rho_{1},\ldots,\rho_{m}\}$ be a set of $m$ zeros $\rho_{j}=\beta_{j}+i\gamma_{j}$ off of the critical line of height $\leq\eta$ with $\textrm{Re}(\rho_{j})>\frac{1}{2}$ and $\Im(\rho_{j})>0$ , possibly with repetition up to multiplicity. For each $\rho_{j}$ , there are necessarily 4 symmetric, distinct zeros $\rho_{j}$ , $1-\rho_{j}$ , $\overline{\rho_{j}}$ , and $1-\overline{\rho_{j}}$ . These 4 counterexample zeros will collectively contribute to $w_{1,\delta}$ a value of

\phi(\beta_{j},\gamma_{j},\delta)+\phi(\beta_{j},-\gamma_{j},\delta)=2\phi(\beta_{j},\gamma_{j},\delta).

Using Lemma 3 and the monotonicity of $\phi(\beta,\eta,x)$ in $\eta$ , we find that the contribution from these 4 counterexample zeros is at least

2\cdot\min_{\beta\in[0,1]}\phi(\beta,\eta,\delta)=2\cdot\min(\phi(0,\eta,\delta),\phi(1/2,\eta,\delta))=f_{1}(\eta,\delta,1).

Note that this lower bound is independent of $\beta_{j}$ and $\gamma_{j}$ . Thus, the $m$ counterexample zeros $\rho_{1},\ldots,\rho_{m}$ along with their $3m$ symmetric zeros contribute at least $m\cdot f_{1}(\eta,\delta,1)=f_{1}(\eta,\delta,m)$ to the value of $w_{1,\delta}$ . Thus, if

f_{1}(\eta,\delta,m)+C(\mathcal{Z},\delta)>w_{1,\delta},

then we have a contradiction, so there are strictly fewer than $4m$ zeros $\rho$ off the critical line of positive height $\leq\eta$ . In the case of $m=1$ this means the RH holds in the interval $(0,\eta]$ . In the case of $m=2$ , there is at most one set of 4 symmetric, non-real $\rho$ off the critical line of height $\leq\eta$ and they must be simple.

Next, we prove part (ii). Note that the minimum contribution to $w_{1,\delta}$ from a non-real zero $\rho$ on the critical line of height $\leq\eta$ together with its symmetric part $1-\rho$ is $\phi(1/2,\eta,\delta)$ . Similarly, the contribution from $m$ such zeros on the critical line is at least $m\cdot\phi(1/2,\eta,\delta)=f_{2}(\eta,\delta,m)$ . Therefore, if

f_{2}(\eta,\delta,m)+C(\mathcal{Z},\delta)>w_{1,\delta},

then we have a contradiction, so there are strictly fewer than $2m$ zeros $\rho$ on the critical line of positive height $\leq\eta$ which are not accounted for in the subintervals of $\mathcal{Z}$ . In the case of $m=1$ this means the intervals in $\mathcal{Z}$ account for all non-real $\rho$ on the critical line of height $\leq\eta$ . In the case of $m=2$ , at most one pair of non-real $\rho$ and $1-\rho$ on the critical line of height $\leq\eta$ are not accounted for in $\mathcal{Z}$ . Since each subinterval of $\mathcal{Z}$ contains a sign-change (so corresponds to a zero of odd multiplicity), this case implies that all non-real $\rho$ on the critical line of height $\leq\eta$ are simple (including the possible pair $\rho$ and $1-\rho$ missed by $\mathcal{Z})$ .

We prove part (iii). By Lemma 3, the minimum of the contribution to $w_{1,\delta}$ from a real zero $\rho$ off the critical line and its symmetric zero $1-\rho$ is $\phi(1/2,0,\delta)$ . Thus, the contribution from $m$ pairs of real zeros off the critical line $\{\rho_{1},1-\rho_{1},\ldots,\rho_{m},1-\rho_{m}\}$ is at least $m\cdot\phi(1/2,0,\delta)=h_{1}(\delta,m)$ . So if

h_{1}(\delta,m)+C(\mathcal{Z},\delta)>w_{1,\delta},

there are strictly fewer than $m$ such pairs of real zeros, so fewer than $2m$ total real zeros off the critical line.

We prove part (iv). Each repetition of the zero $\rho=1/2$ (possibly none) contributes $1/2\cdot\phi(1/2,0,\delta)$ to $w_{1,\delta}$ . So, if the zero at $s=1/2$ has multiplicity $m$ , then these zeros have total contribution $m/2\cdot\phi(1/2,0,\delta)=h_{2}(\delta,m)$ . By the same argument as before, if

h_{2}(\delta,m)+C(\mathcal{Z},\delta)>w_{1,\delta},

then the multiplicity of the zero at $s=1/2$ is strictly smaller than $m$ . In the case $m=1$ , this means we have non-vanishing of $L(s)$ on the real line. In the case $m=2$ , and combined with part (iii), any real zero must be at the central point $s=1/2$ and must be simple.

Lastly, we prove part (v). Suppose $F(\eta,\delta)+C(\mathcal{Z},\delta)>w_{1,\delta}$ . By the definition of $F(\eta,\delta)$ , we have

F(\eta,\delta)\leq\min\{f_{1}(\eta,\delta,1),f_{2}(\eta,\delta,1),h_{2}(\delta,1)\}.

Therefore, by part (i) of the theorem, all non-real $\rho$ are on the critical line. By part (ii), all non-real $\rho$ on the critical line belong to some subinterval in the set $\mathcal{Z}$ and are thus simple. By parts (iii) and (iv), $L(s)$ is non-vanishing on the real line (except for possibly a simple zero at $\rho=1/2$ included in $\mathcal{Z}$ ). These three cases leave no room for zeros outside of the simple zeros within the subintervals in the set $\mathcal{Z}$ . Thus, the list of zeros in $\mathcal{Z}$ accounts for all zeros of $L(s)$ of height $\leq\eta$ and they are all simple, i.e., the list $\mathcal{Z}$ is complete. ∎
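The quantities appearing in Theorem 7 are simple to evaluate. The following Python sketch is only an illustration and is not the authors' implementation (which uses interval arithmetic): the names `phi`, `C_lower`, and `thresholds` are hypothetical, $\phi$ is computed directly from the identity recalled before the theorem, and plain floating point is used, so the outcome is indicative rather than rigorous.

```python
def phi(beta, gamma, delta):
    """Re[1/(rho - delta) + 1/(1 - rho - delta)] for rho = beta + i*gamma."""
    rho = complex(beta, gamma)
    return (1 / (rho - delta) + 1 / (1 - rho - delta)).real

def C_lower(intervals, central, delta):
    """C(Z, delta): intervals = [(gamma_minus, gamma_plus), ...] inside [0, tau],
    central = [gamma_0, ...] for intervals of the form [-gamma_0, gamma_0]."""
    a = 0.5 - delta
    total = sum(2 * a / (a * a + gp * gp) for _, gp in intervals)
    total += sum(a / (a * a + g0 * g0) for g0 in central)
    return total

def thresholds(eta, delta, m=1):
    """The quantities f_1, f_2, h_1, h_2, F defined in Theorem 7."""
    p0 = phi(0, eta, delta)
    ph = phi(0.5, eta, delta)
    pc = phi(0.5, 0, delta)
    return {
        "f1": 2 * m * min(p0, ph),
        "f2": m * ph,
        "h1": m * pc,
        "h2": m / 2 * pc,
        "F": min(2 * p0, ph, pc / 2),
    }

# Usage with the interval endpoints reported in Section 7.2 (delta = -1):
# worst case is the upper endpoint of w_{1,delta} and the lower endpoint of C.
w_upper, C_reported_lower = 6.4702573982, 6.4644405451
gap = w_upper - C_reported_lower
ok_32 = thresholds(32, -1)["f1"] > gap  # part (i) with m = 1 and eta = 32
ok_33 = thresholds(33, -1)["f1"] > gap  # part (i) with m = 1 and eta = 33
```

Under these assumptions, the inequality of part (i) holds for $\eta=32$ and fails for $\eta=33$, which is consistent with the window reported in §7.2.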

5. Generalization in the zeta case

Let $g(s):=(s-1)\zeta(s)$ . So, $g$ is an entire function. The series expansion of $g$ at $s=1$ is given by

(17)

g(s)=1+\sum_{j\geq 0}\frac{(-1)^{j}\lambda_{j}}{j!}(s-1)^{j+1},

where $\lambda_{0},\lambda_{1},\lambda_{2},\ldots$ are the Stieltjes constants. (Footnote 9: For instance, $\lambda_{0}=0.57721566\ldots$ is Euler's constant, $\lambda_{1}=-0.07281584\ldots$ , $\lambda_{2}=-0.00969036\ldots$ , $\lambda_{3}=0.00205383\ldots$ , and so on.) For any complex number $z$ such that $\zeta(1-z)\neq 0$ , we may write

\log g(s-z)=\sum_{j\geq 0}c_{j,z}(s-1)^{j},\qquad c_{j,z}=\frac{1}{j!}\left[\frac{d^{j}}{ds^{j}}\log g(s-z)\right]_{s=1},

for $s$ sufficiently close to $1$ . (Footnote 10: If $z=0$ , then the coefficients $c_{j}:=c_{j,0}$ can be calculated easily in terms of the $\lambda_{j}$ ’s. For example, $c_{0}=0,\,c_{1}=\lambda_{0},\,c_{2}=-\lambda_{0}^{2}/2-\lambda_{1},\,c_{3}=\lambda_{0}^{3}/3+\lambda_{0}\lambda_{1}+\lambda_{2}/2,\ldots$ .)
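The coefficients $c_{j}$ for $z=0$ can be checked mechanically. The sketch below is an illustration under stated assumptions: it hard-codes rounded values of the first three Stieltjes constants, builds the power series of $g$ from (17), and takes its formal logarithm with a standard recurrence. The function names `g_coeffs` and `log_series` are hypothetical.

```python
from math import factorial

# Rounded Stieltjes constants lambda_0, lambda_1, lambda_2 (lambda_0 is Euler's constant).
LAM = [0.5772156649015329, -0.0728158454836767, -0.0096903631928723]

def g_coeffs(lam):
    """Coefficients a_0, a_1, ... of g(s) in powers of w = s - 1, following (17)."""
    return [1.0] + [(-1) ** j * lam[j] / factorial(j) for j in range(len(lam))]

def log_series(a):
    """Coefficients of log(sum_k a_k w^k), assuming a[0] == 1.

    Uses A' = A * (log A)', i.e. k*a_k = sum_{j=1..k} j*c_j*a_{k-j}.
    """
    n = len(a)
    c = [0.0] * n
    for k in range(1, n):
        s = sum(j * c[j] * a[k - j] for j in range(1, k))
        c[k] = a[k] - s / k
    return c

c = log_series(g_coeffs(LAM))  # c[0], c[1], c[2], c[3]
```

The computed `c[1]`, `c[2]`, `c[3]` can then be compared with the closed forms $\lambda_{0}$, $-\lambda_{0}^{2}/2-\lambda_{1}$, and $\lambda_{0}^{3}/3+\lambda_{0}\lambda_{1}+\lambda_{2}/2$.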

Lemma 8.

Let $k$ be a positive integer. Let $z=x+iy$ be a complex number such that $x<1$ and $z$ does not coincide with any zero $\rho$ of $\xi(s)$ . If $k>1$ then

v_{k,z}:=\sum_{\rho}\frac{1}{(\rho-z)^{k}}=(-1)^{k-1}\left[\frac{\psi_{k-1}(3/2-z/2)}{2^{k}(k-1)!}+kc_{k,z}\right].

If $k=1$ , then there is an additional term of

\displaystyle-\frac{1}{2}\log\pi.

Proof.

The proof is similar to that of Proposition 4 except the coefficients $c_{j,z}$ are defined differently than the analogous coefficients $d_{j,\delta}$ due to the pole of zeta. ∎

Corollary 9.

When $k=1$ , we have

v_{1,z}=-\frac{1}{2}\log\pi+\frac{1}{2}\psi_{0}\left(\frac{3-z}{2}\right)+\frac{g^{\prime}(1-z)}{g(1-z)}.

One can compute $g^{\prime}(1-z)/g(1-z)$ using the Euler–Maclaurin summation formula; see for example [rubinstein_2005]. However, if $x<0$ and $|x|$ is large enough, then the following simpler formula could suffice; it has the same proof as Lemma 6.

Lemma 10.

If $z=x+iy$ and $x<0$ , then

\frac{g^{\prime}(1-z)}{g(1-z)}=-\frac{1}{z}-\sum_{k=1}^{K}\frac{\Lambda(k)}{k^{1-z}}+\mathcal{R}(K,x),

where $\mathcal{R}(K,x)$ satisfies the same bound as in Lemma 6 but with $r=1$ and $\delta=x$ .
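As an illustration of Lemma 10, the following Python sketch evaluates the main terms $-1/z-\sum_{k\leq K}\Lambda(k)k^{-(1-z)}$ with a simple sieve for the von Mangoldt function. It is a minimal sketch in floating point: the remainder $\mathcal{R}(K,x)$ is simply dropped rather than bounded, and the helper names `von_mangoldt_table` and `log_deriv_g` are hypothetical.

```python
from math import log

def von_mangoldt_table(K):
    """Return lam with lam[k] = Lambda(k) for 0 <= k <= K (simple sieve)."""
    lam = [0.0] * (K + 1)
    is_prime = [True] * (K + 1)
    for p in range(2, K + 1):
        if is_prime[p]:
            for q in range(2 * p, K + 1, p):
                is_prime[q] = False
            pk = p
            while pk <= K:          # Lambda(p^j) = log p for every prime power
                lam[pk] = log(p)
                pk *= p
    return lam

def log_deriv_g(z, K):
    """Main terms of Lemma 10 for g'(1-z)/g(1-z); the remainder R(K, x) is omitted."""
    lam = von_mangoldt_table(K)
    s = 1 - z
    return -1 / z - sum(lam[k] * k ** (-s) for k in range(2, K + 1) if lam[k])

approx = log_deriv_g(-2.0, 2000)  # real shift z = -2, i.e. s = 3
```

For a real shift such as $z=-2$ the result can be compared with $1/2+\zeta^{\prime}(3)/\zeta(3)$ computed from the defining Dirichlet series; a complex `z` with negative real part may be passed in the same way.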

Theorem 11 is the main result in this section. Since the main interest in the case of zeta is at large heights, we expand about a complex number $z=x+iy$ where $y>0$ is typically large. Therefore, the advantage provided by the symmetry of the $\rho$ ’s about the real axis is mostly lost.

Theorem 11.

Let $z=x+iy$ be a complex number such that $x\leq 0$ and $y>0$ . Let $\tau$ be a real positive number such that $\tau\leq y$ . Let $\mathcal{Z}$ be a set of nonempty disjoint subintervals $[\gamma_{-},\gamma_{+}]\subseteq[y-\tau,y+\tau]$ such that $\xi(1/2+it)$ has a sign change in each subinterval. Suppose further that $y$ does not belong to any of the subintervals in $\mathcal{Z}$ . Define

D(\mathcal{Z},z):=\sum_{\begin{subarray}{c}[\gamma_{-},\gamma_{+}]\in\mathcal{Z}\\ \gamma_{-}>y\end{subarray}}\frac{1/2-x}{(1/2-x)^{2}+(\gamma_{+}-y)^{2}}+\sum_{\begin{subarray}{c}[\gamma_{-},\gamma_{+}]\in\mathcal{Z}\\ \gamma_{+}<y\end{subarray}}\frac{1/2-x}{(1/2-x)^{2}+(y-\gamma_{-})^{2}}.

Further, for any real $\eta$ such that $0<\eta\leq y$ , and with $\phi$ as in (7), define

	$\displaystyle g_{1}(\eta,x)$	$\displaystyle:=\min\left(\phi(0,\eta,x),\phi(1/2,\eta,x)\right),$
	$\displaystyle g_{2}(\eta,x)$	$\displaystyle:=\phi(1/2,\eta,x),$
	$\displaystyle g_{3}(\eta,x)$	$\displaystyle:=\min\left(\phi(0,\eta,x),1/2\cdot\phi(1/2,\eta,x)\right).$

Then, for any real positive number $\eta$ such that $\eta\leq y$ we have the following.

(i)

If $g_{1}(\eta,x)+D(\mathcal{Z},z)>{\textrm{Re}}\,(v_{1,z})$ , then all the $\rho$ ’s with height in $[y-\eta,y+\eta]$ are on the critical line. That is, the RH holds in the interval $[y-\eta,y+\eta]$ .
(ii)

If $g_{2}(\eta,x)+D(\mathcal{Z},z)>{\textrm{Re}}\,(v_{1,z})$ , then all the $\rho$ ’s on the critical line with height in $[y-\eta,y+\eta]$ are simple.
(iii)

If $g_{3}(\eta,x)+D(\mathcal{Z},z)>{\textrm{Re}}\,(v_{1,z})$ , then the list $\mathcal{Z}$ is complete. This means that every $\rho$ in the upper half-plane with height in $[y-\eta,y+\eta]$ is on the critical line, is simple, and belongs to some subinterval in the set $\mathcal{Z}$ .

Proof.

Let us prove part (i). Suppose there is a counter-example $\rho=\beta+i\gamma$ such that $\gamma\in[y-\eta,y+\eta]$ . Then $1-\overline{\rho}$ is a counter-example distinct from $\rho$ . The contribution of $\rho$ and $1-\overline{\rho}$ to $\textrm{Re}(v_{1,z})$ is $\phi(\beta,\gamma-y,x)$ . Since $|\gamma-y|\leq\eta$ , it follows by Lemma 3 that this contribution is at least $g_{1}(\eta,x)$ . Moreover, the zeros from the set $\mathcal{Z}$ already contribute at least $D(\mathcal{Z},z)$ to $\textrm{Re}(v_{1,z})$ . So, if the inequality in (i) holds, and considering that any remaining zeros will contribute a nonnegative amount to $\textrm{Re}(v_{1,z})$ , then we obtain a contradiction. Hence, the counter-example $\rho$ cannot exist.

We prove part (ii). Suppose there is a nonsimple zero $\rho=1/2+i\gamma$ of multiplicity $m$ such that $\gamma\in[y-\eta,y+\eta]$ . If $\rho$ is already in the set $\mathcal{Z}$ , then $m\geq 3$ , since the zeros in $\mathcal{Z}$ have odd multiplicity (as they correspond to sign changes of $\xi(1/2+it)$ ). If $\rho$ is not in $\mathcal{Z}$ , then $m\geq 2$ . In either case, there are at least two zeros on the critical line with ordinates in $[y-\eta,y+\eta]$ that are missing from $\mathcal{Z}$ . So, arguing as in part (i) and using Lemma 3, the contribution of these missing zeros to $\textrm{Re}(v_{1,z})$ is at least $g_{2}(\eta,x)$ . So, if the inequality in (ii) holds, then we obtain a contradiction since any remaining zeros will contribute a nonnegative amount to $\textrm{Re}(v_{1,z})$ . Hence, such a nonsimple $\rho$ cannot exist.

Lastly, we prove part (iii). Note that $g_{1}(\eta,x)\geq g_{3}(\eta,x)$ and $g_{2}(\eta,x)\geq g_{3}(\eta,x)$ . So, if the inequality in (iii) holds, then all the zeros with height in $[y-\eta,y+\eta]$ are on the critical line and are simple. Thus, in seeking a contradiction we may assume without loss of generality that there is a simple zero $\rho=1/2+i\gamma$ such that $\gamma\in[y-\eta,y+\eta]$ and $\gamma$ is not in any subinterval $[\gamma_{-},\gamma_{+}]\in\mathcal{Z}$ . But the contribution of such $\rho$ to $\textrm{Re}(v_{1,z})$ is at least $1/2\cdot\phi(1/2,\eta,x)$ . Hence, if the inequality in (iii) holds, then we obtain a contradiction, like before. So, such a missing $\rho$ cannot exist. ∎

Remark.

By using the shift $z=-1/2+i14.1$ in Theorem 11 along with the $12$ initial zeros of $\zeta(s)$ , one can verify that $\rho_{1}=1/2+i\gamma_{1}$ and $\rho_{2}=1/2+i\gamma_{2}$ are the only zeta zeros with ordinates in the window $[6.5360,21.6640]$ . Since the value $v_{1}=0.0230957\ldots$ that Riemann computed already tells us that there are no zeta zeros of height less than $6.56$ , this yields that $\rho_{1}$ and $\rho_{2}$ are indeed the first two zeta zeros. By comparison, verifying $\rho_{1}$ and $\rho_{2}$ are the first two zeta zeros using just the value $v_{1}$ requires accounting for the contribution of $52$ initial zeros of zeta.
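The value $v_{1}=0.0230957\ldots$ quoted in this remark can be recovered from Corollary 9 at $z=0$. The short Python sketch below is an illustration only; it relies on two standard closed forms that are not stated in the text, namely $\psi_{0}(3/2)=2-\lambda_{0}-2\log 2$ and $g^{\prime}(1)/g(1)=c_{1}=\lambda_{0}$, and the function name is hypothetical.

```python
from math import log, pi

EULER_GAMMA = 0.5772156649015329  # Euler's constant, equal to lambda_0

def v1_at_zero():
    """Evaluate Corollary 9 at z = 0 using closed forms.

    psi_0(3/2) = 2 - gamma - 2*log(2), and g'(1)/g(1) = c_1 = lambda_0 = gamma.
    """
    psi_three_halves = 2 - EULER_GAMMA - 2 * log(2)
    return -0.5 * log(pi) + 0.5 * psi_three_halves + EULER_GAMMA

v1 = v1_at_zero()
```

Algebraically this is $1+\lambda_{0}/2-\tfrac{1}{2}\log(4\pi)$, which agrees with the quoted digits.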

6. Improvements

Instead of using nonnegativity to simply drop the contribution to $\textrm{Re}(v_{1,z})$ of the tail of the zeros sum, we derive a lower bound on the contribution of the tail. Incorporating this into Theorem 11 greatly improves the efficiency of our verification method at large heights (i.e. when $y$ is large). Hence, the RH can be verified via our method in a much wider window than before (i.e. for a much larger $\eta$ ). Specifically, whereas the basic verification method in Theorem 11 is only expected to succeed in windows of size $\eta\ll\sqrt{\tau/\log y}$ , the improved method in Theorem 13 is expected to succeed in windows of size $\eta\gg\tau/\sqrt{\log y}$ .

In addition, we derive an upper bound on the contribution of the tail of the zeros sum. This can sometimes allow us to prove the incompleteness of a supplied list of zeros in a given range, as shown in Theorem 13.

Proposition 12.

Let $z=x+iy$ be a complex number and $\tau$ be a real number. Suppose that $x<0$ and $1-2x<\tau<y$ . For any real number $c$ such that $168\pi<c<y-\tau$ , we have

-b(z,\tau,c)\leq\sum_{\begin{subarray}{c}\rho\\ |\gamma-y|>\tau\end{subarray}}{\textrm{Re}}\frac{1}{\rho-z}-\frac{1-2x}{2\pi\tau}\log\frac{y}{2\pi}\leq B(z,\tau,c),

where

	$\displaystyle b(z,\tau,c):=\frac{1}{2\pi}\cdot\left[\epsilon_{1}\frac{1-2x}{\tau}+\epsilon_{2}+\epsilon_{3}\log\frac{y}{2\pi}\right]+\frac{\epsilon_{4}+\epsilon_{5}}{2},$
	$\displaystyle B(z,\tau,c):=\frac{1}{2\pi}\cdot\left[\epsilon_{1}\frac{1-2x}{\tau}+\epsilon_{2}\right]+\frac{\epsilon_{4}+\epsilon_{5}}{2}+\epsilon_{6},$

and defining $\ell(u)$ and $\ell_{1}(u)$ as in (2) we have

	$\displaystyle\epsilon_{1}(y,\tau,c):=4\pi^{2}\cdot 0.006\cdot\left[\frac{1}{(y+\tau)^{2}}+\frac{1}{c^{2}}\right],$
	$\displaystyle\epsilon_{2}(z,c):=\frac{1-2x}{2y}\cdot\log\frac{2y}{c},$
	$\displaystyle\epsilon_{3}(z,\tau,c):=\left[\frac{(1-x)^{2}}{3\tau^{3}}+\frac{1}{y-c}\right]\cdot(1-2x),$
	$\displaystyle\epsilon_{4}(z,\tau,c):=\left(\frac{2-4x}{\tau^{2}}+\frac{2-4x}{(y-c)^{2}}\right)\cdot\ell(2y),$
	$\displaystyle\epsilon_{5}(z,\tau):=\frac{4-8x}{\tau^{3}}\cdot\ell_{1}(2y),$
	$\displaystyle\epsilon_{6}(z,c):=\frac{1-2x}{2y}\cdot\frac{(2y-c)^{2}\log(2y-c)-(y-c)^{2}\log(y-c)}{\pi(y-c)^{2}}.$

Proof.

See §8. ∎

Theorem 13.

Let $z=x+iy$ , $\tau$ , $c$ , and the functions $b(z,\tau,c)$ and $B(z,\tau,c)$ all be given as in Proposition 12. Furthermore, let $\mathcal{Z}$ and the functions $D(\mathcal{Z},z)$ , $\phi(\beta,\eta,x)$ , $g_{1}(\eta,x)$ , $g_{2}(\eta,x)$ , and $g_{3}(\eta,x)$ be given as in Theorem 11. Define

	$\displaystyle r(z,\tau,c)$	$\displaystyle:=\frac{1-2x}{2\pi\tau}\log\frac{y}{2\pi}-b(z,\tau,c)$
	$\displaystyle R(z,\tau,c)$	$\displaystyle:=\frac{1-2x}{2\pi\tau}\log\frac{y}{2\pi}+B(z,\tau,c)$

For any real positive number $\eta$ such that $\eta\leq y$ we have the following improvements to Theorem 11.

(i)

If $g_{1}(\eta,x)+D(\mathcal{Z},z)+r(z,\tau,c)>{\textrm{Re}}\,(v_{1,z})$ , then all the $\rho$ ’s with height in $[y-\eta,y+\eta]$ are on the critical line. That is, the RH holds in $[y-\eta,y+\eta]$ .
(ii)

If $g_{2}(\eta,x)+D(\mathcal{Z},z)+r(z,\tau,c)>{\textrm{Re}}\,(v_{1,z})$ , then all the $\rho$ ’s on the critical line with height in $[y-\eta,y+\eta]$ are simple.
(iii)

If $g_{3}(\eta,x)+D(\mathcal{Z},z)+r(z,\tau,c)>{\textrm{Re}}\,(v_{1,z})$ , then every $\rho$ in the upper half-plane with height in $[y-\eta,y+\eta]$ is on the critical line, simple, and belongs to some subinterval in the set $\mathcal{Z}$ .

In addition to these improvements, the upper bound in Proposition 12 yields the following counterpart.

(iv)

If $D(\mathcal{Z},z)+R(z,\tau,c)<{\textrm{Re}}\,(v_{1,z})$ , then $\mathcal{Z}$ does not account for all the $\rho$ ’s with height in $[y-\tau,y+\tau]$ . This means there is a subinterval in $\mathcal{Z}$ that contains the ordinates of at least three $\rho$ ’s (including multiplicity), or there is $\rho=1/2+i\gamma$ such that $\gamma\in[y-\tau,y+\tau]$ and $\gamma$ is not in any subinterval in $\mathcal{Z}$ , or there is $\rho$ off the critical line with height in $[y-\tau,y+\tau]$ .

Proof.

Parts (i)–(iii) follow directly from the arguments in Theorem 11, except that these bounds account for the contribution from zeros $\rho$ outside of the ordinate window $[y-\tau,y+\tau]$ for which we have zeros data.

For part (iv), if $D(\mathcal{Z},z)+R(z,\tau,c)<{\textrm{Re}}\,(v_{1,z})$ , then there necessarily are zeros whose (positive) contribution to ${\textrm{Re}}\,(v_{1,z})$ is not being accounted for. More explicitly, since

\sum_{\begin{subarray}{c}\rho\\ |\gamma-y|>\tau\end{subarray}}{\textrm{Re}}\left(\frac{1}{\rho-z}\right)\leq R(z,\tau,c),

$R(z,\tau,c)$ already accounts for the maximum possible contribution from all zeros $\rho$ with $|\gamma-y|>\tau$ . Therefore, any deficiency in contribution to $\textrm{Re}(v_{1,z})$ must arise from some $\rho=\beta+i\gamma$ satisfying $|\gamma-y|\leq\tau$ that has not been already accounted for in $\mathcal{Z}$ . ∎
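The tail bounds $r(z,\tau,c)$ and $R(z,\tau,c)$ are explicit once the functions $\ell$ and $\ell_{1}$ from (2) are available. The following Python sketch transcribes the formulas of Proposition 12 and Theorem 13 in floating point. It is illustrative only: `ell` and `ell1` are passed in as callables, and the stand-in `demo_ell` used in the usage lines is a hypothetical placeholder, not the function defined in (2).

```python
from math import log, pi

def prop12_bounds(x, y, tau, c, ell, ell1):
    """Return (r, R) from Theorem 13 using the epsilon terms of Proposition 12.

    ell, ell1: callables standing in for the functions l(u) and l_1(u) of (2).
    """
    if not (x < 0 and 1 - 2 * x < tau < y and 168 * pi < c < y - tau):
        raise ValueError("hypotheses of Proposition 12 are not satisfied")
    a = 1 - 2 * x
    e1 = 4 * pi ** 2 * 0.006 * (1 / (y + tau) ** 2 + 1 / c ** 2)
    e2 = a / (2 * y) * log(2 * y / c)
    e3 = ((1 - x) ** 2 / (3 * tau ** 3) + 1 / (y - c)) * a
    e4 = (2 * a / tau ** 2 + 2 * a / (y - c) ** 2) * ell(2 * y)
    e5 = 4 * a / tau ** 3 * ell1(2 * y)
    e6 = a / (2 * y) * ((2 * y - c) ** 2 * log(2 * y - c)
                        - (y - c) ** 2 * log(y - c)) / (pi * (y - c) ** 2)
    main = a / (2 * pi * tau) * log(y / (2 * pi))
    b = (e1 * a / tau + e2 + e3 * log(y / (2 * pi))) / (2 * pi) + (e4 + e5) / 2
    B = (e1 * a / tau + e2) / (2 * pi) + (e4 + e5) / 2 + e6
    return main - b, main + B

# Usage with a hypothetical placeholder for l and l_1 (NOT the paper's (2)):
demo_ell = lambda u: 0.1 * log(u) + 3.0
r_demo, R_demo = prop12_bounds(-2.0, 1e28, 501575.4, 0.5e28, demo_ell, demo_ell)
```

Since the main term depends on $y$ only through $\log(y/2\pi)$, rounding $y$ to a double is harmless for a rough check, although a rigorous computation would use interval arithmetic as in §7.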

Remark.

It is possible that a further small improvement could be made by incorporating explicit zero-density estimates, in addition to the explicit bounds on $S(u)$ and its integral that are already included.

7. Numerical examples

The examples in this section are meant to illustrate how the method behaves in practice on representative cases. The data in this section were obtained from [lmfdb] and [zeta_zeros], and by using LCALC [lcalc] as well as SageMath [sagemath]. Our working assumption is that the zeros ordinates from [lmfdb] and [zeta_zeros] are accurate to within $\pm 10^{-10}$ , and the zeros ordinates obtained using [lcalc] and [sagemath] are accurate to within $\pm 10^{-8}$ , though it is possible the accuracy is higher. We used this assumption to determine the interval $[\gamma_{-},\gamma_{+}]$ corresponding to each zero ordinate $\gamma$ . Numerical calculations were done using the interval arithmetic package in mpmath [mpmath]. We also used FLINT [flint] to compute the polygamma function when no exact value was available. The code for the implementation is available as a GitHub repository [github].

7.1. The Riemann zeta function

We used Theorem 11 and Theorem 13 for verification using

y=10^{28}+501675.8,\qquad x=-2,\qquad\tau=501575.4,\qquad c=y/2.

Our set $\mathcal{Z}$ was obtained from [zeta_zeros]. For $z=x+iy$ , and with the aid of Corollary 9 and Lemma 10, applied with $K=10^{7}$ , we computed

\textrm{Re}(v_{1,z})\in[31.418062627034752,31.418062627034846],

D(\mathcal{Z},z)\in[31.417963253430945,31.417963255019071],

r(z,\tau,c)\in[0.000099372589781012325291744466523344471948495\pm 5\cdot 10^{-45}].

Based on this input data, Theorem 11, part (i), succeeded in verifying the RH for $\eta=224$ , and Theorem 13, part (i), succeeded in verifying the RH for $\eta=70216$ , a much larger window that contains $1399910$ zeros of the zeta function. Theorem 13, part (iii), also succeeded in verifying the completeness of the subset

\mathcal{Z}\cap[y-\eta,y+\eta],\qquad\eta=49650,

a window that contains $989881$ zeros. In the opposite direction, we applied Theorem 13, part (iv), to the subset $\mathcal{Z}_{0}$ , which is the same as $\mathcal{Z}$ except the subinterval $[\gamma_{-},\gamma_{+}]$ corresponding to the ordinate $\gamma=10^{28}+521738.816$ was removed. We computed

D(\mathcal{Z}_{0},z)\in[31.417963247220145,31.417963248808271],

R(z,\tau,c)\in[0.00009937291681087140202410471243137201884323\pm 10^{-44}].

Based on this input data, Theorem 13 succeeded in proving that the set $\mathcal{Z}_{0}$ was indeed incomplete.
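The window sizes quoted in this subsection can be recomputed from the reported interval endpoints alone. The Python sketch below is an illustration, not the authors' code: it uses exact rational arithmetic via `fractions.Fraction`, takes the worst-case endpoints (upper endpoint of $\textrm{Re}(v_{1,z})$, lower endpoints of $D$ and $r$), and searches for the largest integer $\eta$ satisfying each inequality. The helper names are hypothetical, and $\phi$ is evaluated from its definition as a real part.

```python
from fractions import Fraction as Fr

def phi(beta, eta, x):
    """Re[1/(rho - x) + 1/(1 - rho - x)] for rho = beta + i*eta and real x."""
    return ((beta - x) / ((beta - x) ** 2 + eta ** 2)
            + (1 - beta - x) / ((1 - beta - x) ** 2 + eta ** 2))

def g1(eta, x):
    return min(phi(Fr(0), eta, x), phi(Fr(1, 2), eta, x))

def g3(eta, x):
    return min(phi(Fr(0), eta, x), phi(Fr(1, 2), eta, x) / 2)

def largest_eta(g, x, gap, hi=10 ** 7):
    """Largest integer eta in [1, hi] with g(eta, x) > gap (g decreases in eta)."""
    lo = 0
    while lo < hi:
        mid = (lo + hi + 1) // 2
        if g(Fr(mid), x) > gap:
            lo = mid
        else:
            hi = mid - 1
    return lo

x = Fr(-2)
RE_V = (Fr("31.418062627034752"), Fr("31.418062627034846"))
D = (Fr("31.417963253430945"), Fr("31.417963255019071"))
R_LOW = Fr("0.000099372589781012325291744466523344471948495") - Fr(5, 10 ** 45)
D0_HI = Fr("31.417963248808271")
R_HI = Fr("0.00009937291681087140202410471243137201884323") + Fr(1, 10 ** 44)

eta_thm11_i = largest_eta(g1, x, RE_V[1] - D[0])             # Theorem 11 (i)
eta_thm13_i = largest_eta(g1, x, RE_V[1] - D[0] - R_LOW)     # Theorem 13 (i)
eta_thm13_iii = largest_eta(g3, x, RE_V[1] - D[0] - R_LOW)   # Theorem 13 (iii)
incomplete = D0_HI + R_HI < RE_V[0]                          # Theorem 13 (iv) for Z_0
```

With these inputs the search reproduces the values $\eta=224$, $\eta=70216$, and $\eta=49650$ quoted above, and the inequality of part (iv) holds for $\mathcal{Z}_{0}$.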

7.2. Real Dirichlet $L$ -function

Let $d$ be a fundamental discriminant, $\chi_{d}$ be the corresponding real primitive character, and $L(s,\chi_{d})$ the corresponding Dirichlet $L$ -function. In the notation of §4, we have $r=1$ , $N=|d|$ , $\epsilon=1$ , and if $d<0$ then $\mu_{1}=1$ . We applied Theorem 7, part (i), to verify the RH using

d=-1159523,\qquad\delta=-1,\qquad\tau=1692.8.

The coefficients arising from the Euler product are given by $\alpha_{1,p}=\chi_{d}(p)$ . Our set $\mathcal{Z}$ was obtained using [lcalc]. With the aid of Corollary 5 as well as Lemma 6 applied with $K=10^{5}$ we computed

w_{1,\delta}\in[6.4702225452,6.4702573982],

C(\mathcal{Z},\delta)\in[6.4644405451,6.4644405588].

Based on this input data, Theorem 7 succeeded in verifying the RH for $L(s,\chi_{d})$ for $\eta=32$ , a window containing $74$ zeros with nonnegative ordinates.
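The Euler product coefficients $\chi_{d}(p)$ used in this example are values of the Kronecker symbol. A minimal Python sketch follows, assuming $d$ is a fundamental discriminant and $p$ is prime; the function name `chi_d` is hypothetical, and Euler's criterion is used for odd $p$ as a simple (not optimized) choice.

```python
def chi_d(d, p):
    """Kronecker symbol (d/p) for a fundamental discriminant d and a prime p."""
    if p == 2:
        if d % 2 == 0:
            return 0
        # (d/2) = 1 if d = +-1 mod 8, and -1 if d = +-3 mod 8.
        return 1 if d % 8 in (1, 7) else -1
    r = d % p
    if r == 0:
        return 0
    # Euler's criterion: r^((p-1)/2) is 1 or p-1 modulo an odd prime p.
    return 1 if pow(r, (p - 1) // 2, p) == 1 else -1

d = -1159523
first_values = [chi_d(d, p) for p in (2, 3)]
```

These values feed the sum over prime powers that Lemma 6 truncates at $K$.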

7.3. The Ramanujan $\tau$ $L$ -function

Let $\tau$ be the Ramanujan tau function (Footnote 11: So, $\tau(1)=1,\tau(2)=-24,\tau(3)=252,\tau(4)=-1472,\ldots$ .), and let $L(s)$ be the Ramanujan tau $L$ -function. (Footnote 12: Therefore, $L(s)$ is given by the Dirichlet series $L(s)=1+a_{2}2^{-s}+a_{3}3^{-s}+a_{4}4^{-s}+\cdots$ where $a_{n}=\tau(n)n^{-11/2}$ , at least when $\sigma>1/2$ .) In the notation of §4, we have $r=2$ , $N=1$ , $\epsilon=1$ , $\mu_{1}=11/2$ , and $\mu_{2}=13/2$ . We applied Theorem 7 to verify the RH using

\delta=-1,\qquad\tau=9877.3.

The coefficients $\alpha_{1,p}$ and $\alpha_{2,p}$ arising from the Euler product for $L(s)$ are given by the roots of the polynomial $x^{2}-\tau(p)p^{-11/2}x+1$ . Our set $\mathcal{Z}$ was obtained using [lcalc] and [lmfdb]. With the aid of Corollary 5 and Lemma 6, applied with $K=10^{5}$ , we computed

w_{1,\delta}\in[0.1671717623,0.1672414682],

C(\mathcal{Z},\delta)\in[0.1663983945,0.1663983946].

Based on this input data, Theorem 7 succeeded in verifying the RH for $L(s)$ for $\eta=84$ , a window which includes $46$ zeros with nonnegative ordinates.
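The inputs for this example are easy to generate. The sketch below is an illustration under stated assumptions: it computes $\tau(n)$ from the classical product $q\prod_{n\geq 1}(1-q^{n})^{24}$ (a standard identity not restated in the text) and then the two roots of $x^{2}-\tau(p)p^{-11/2}x+1$. The function names are hypothetical, and the quadratic-time expansion is chosen for clarity rather than speed.

```python
import cmath

def ramanujan_tau(N):
    """Return [tau(1), ..., tau(N)] from q * prod_{n>=1} (1 - q^n)^24."""
    coeffs = [0] * N          # coeffs[k] = coefficient of q^k in the product
    coeffs[0] = 1
    for n in range(1, N):
        for _ in range(24):   # multiply by (1 - q^n), truncated at degree N-1
            for k in range(N - 1, n - 1, -1):
                coeffs[k] -= coeffs[k - n]
    return coeffs             # tau(n) is the coefficient of q^(n-1)

def euler_roots(tau_p, p):
    """Roots of x^2 - tau(p) p^(-11/2) x + 1."""
    a = tau_p * p ** (-5.5)
    root = cmath.sqrt(a * a - 4)
    return (a + root) / 2, (a - root) / 2

taus = ramanujan_tau(5)
alpha1, alpha2 = euler_roots(taus[1], 2)  # p = 2
```

The first values agree with the footnote, and for $p=2$ the two roots have product $1$ and lie on the unit circle.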

7.4. Elliptic curve $L$ -function

Let $E$ be an elliptic curve over $\mathbb{Q}$ of conductor $N=\Delta_{E}$ . Let $L(s,E)$ be the corresponding elliptic curve $L$ -function. In the notation of §4, we have $r=2$ , $N=\Delta_{E}$ , $\epsilon=1$ or $\epsilon=i$ , $\mu_{1}=1/2$ , and $\mu_{2}=3/2$ . We applied Theorem 7 with

\delta=-1,\qquad\tau=90,

to verify the RH for the elliptic curve $E$ with minimal Weierstrass equation

(18)

E:y^{2}+y=x^{3}-x.

According to [lmfdb, 37.a1], $E$ has conductor $37$ so that $N=37$ , and the sign of the functional equation of $L(s,E)$ is $-1$ so that, in the notation of §4, $\epsilon=i$ . To calculate the Euler factors of $L(s,E)$ , let $|E(\mathbb{F}_{p})|$ denote the number of solutions $(x,y)\in\mathbb{F}_{p}\times\mathbb{F}_{p}$ that satisfy the minimal Weierstrass equation (18) together with the point at infinity that lies on $E$ . Define

(19)

b(p):=p+1-|E(\mathbb{F}_{p})|.

Then the coefficients $\alpha_{1,p}$ and $\alpha_{2,p}$ arising from the Euler product for $L(s,E)$ are the roots of the polynomial $x^{2}-b(p)p^{-1/2}x+1$ , provided $p\neq 37$ . If $p=37$ , then $\alpha_{1,p}=-1/\sqrt{p}$ and $\alpha_{2,p}=0$ . Our set $\mathcal{Z}$ was obtained from [lmfdb]. With the aid of Corollary 5 and Lemma 6, applied with $K=10^{5}$ , we computed

w_{1,\delta}\in[1.2186382841,1.21870798992],

C(\mathcal{Z},\delta)\in[1.160632197991927,1.160632199964985].

Based on this input data, Theorem 7 succeeded in verifying the RH for $L(s,E)$ for $\eta=10$ , a window which contains $5$ zeros with nonnegative ordinates.
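For small primes, the counts $|E(\mathbb{F}_{p})|$ entering (19) can be obtained by brute force. The following Python sketch is a minimal illustration for the curve (18) only; the function names are hypothetical, and a practical computation up to $K=10^{5}$ would use a faster point-counting method.

```python
def count_points(p):
    """|E(F_p)| for E: y^2 + y = x^3 - x, including the point at infinity."""
    lhs_counts = {}
    for y in range(p):
        v = (y * y + y) % p
        lhs_counts[v] = lhs_counts.get(v, 0) + 1
    # For each x, add the number of y with y^2 + y equal to x^3 - x modulo p.
    affine = sum(lhs_counts.get((x ** 3 - x) % p, 0) for x in range(p))
    return affine + 1

def b(p):
    """The quantity b(p) = p + 1 - |E(F_p)| of (19)."""
    return p + 1 - count_points(p)

small = [b(p) for p in (2, 3, 5, 7)]
```

For the primes $2,3,5,7$ this gives $b(p)=-2,-3,-2,-1$, and each value satisfies the Hasse bound $b(p)^{2}\leq 4p$, so the roots of $x^{2}-b(p)p^{-1/2}x+1$ lie on the unit circle.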

8. Proof of Proposition 12

Recall the function $\phi$ defined in (7). For $\rho=\beta+i\gamma$ , we have

\textrm{Re}\,\left[\frac{1}{\rho-z}+\frac{1}{1-\overline{\rho}-z}\right]=\phi(\beta,\gamma-y,x).

Also, if $|\gamma-y|>\tau>1-x$ , then Lemma 3, parts (ii–iii) give

(20)

\phi(0,\gamma-y,x)\leq\phi(\beta,\gamma-y,x)\leq\phi(1/2,\gamma-y,x).

Now, for $u\geq 0$ let $N(u)$ be the number of zeta zeros with ordinates in $[0,u]$ , and extend $N(u)$ to $u<0$ by requiring it to be odd. Using Stieltjes integrals, we define

	$\displaystyle L(z,\tau)$	$\displaystyle:=\int_{|u-y|>\tau}\phi(0,u-y,x)\,\frac{dN(u)}{2},$
	$\displaystyle U(z,\tau)$	$\displaystyle:=\int_{|u-y|>\tau}\phi(1/2,u-y,x)\,\frac{dN(u)}{2}.$

The double inequality (20) thus gives

(21)

L(z,\tau)\leq\sum_{\begin{subarray}{c}\rho\\ |\gamma-y|>\tau\end{subarray}}\textrm{Re}\,\frac{1}{\rho-z}\leq U(z,\tau).

We bound $L$ and $U$ from below and above, respectively, starting with $L$ .

To this end, since the integrand in $L$ is nonnegative, a lower bound on $L$ can be obtained by restricting the integration interval to $\tau<|u-y|<y-c$ . Doing so, followed by the change of variable $u\leftarrow u-y$ , gives

(22)

L(z,\tau)\geq\frac{1}{2}\cdot\int_{\tau<u<y-c}\phi(0,u,x)\,d[N(y+u)-N(y-u)].

On the other hand, it is known [davenport_1967] that

(23)

N(u)=\frac{1}{\pi}\theta(u)+1+S(u),

where $\theta(u)=\arg[\pi^{-iu/2}\Gamma(1/4+iu/2)]$ and, if $u$ does not coincide with the ordinate of any nontrivial zero, $S(u)=\pi^{-1}\arg\zeta(1/2+iu)$ . (Footnote 13: The arguments are defined by a continuous variation starting at $s=2$ , going up vertically to $s=2+iu$ , and then horizontally to $s=1/2+iu$ .) Also, $\theta(u)$ is a smooth odd function and $S(u)$ is right-continuous with jump discontinuities at the zeros ordinates. Thus, combining (22) and (23), and defining

(24)		$\displaystyle L_{\theta}(z,\tau)$	$\displaystyle:=\int_{\tau<u<y-c}\phi(0,u,x)\,(\theta^{\prime}(y+u)+\theta^{\prime}(y-u))\,du,$
(25)		$\displaystyle L_{S}(z,\tau)$	$\displaystyle:=\int_{\tau<u<y-c}\phi(0,u,x)\,d[S(y+u)-S(y-u)],$

where $\theta^{\prime}$ is the derivative of $\theta$ with respect to $u$ , we obtain

(26)

L(z,\tau)\geq\frac{1}{2\pi}L_{\theta}(z,\tau)-\frac{1}{2}|L_{S}(z,\tau)|.

We first bound $L_{\theta}$ from below. By [lehman_1970, Lemma 10], if $u>0$ , then

(27)

\left|\theta^{\prime}(u)-\frac{1}{2}\log\frac{u}{2\pi}\right|\leq\frac{4\pi^{2}\cdot 0.006}{u^{2}}.

So, in view of (24), we are led to consider

(28)

\frac{1}{2}\log\frac{y+u}{2\pi}+\frac{1}{2}\log\frac{y-u}{2\pi}=\log\frac{y}{2\pi}+\frac{1}{2}\log\left(1-\frac{u^{2}}{y^{2}}\right).

Expanding the right-side in (28) about $u=0$ , and using the Lagrange form of the remainder, as well as (27), we see that for $\tau<u<y-c$ ,

(29)

\displaystyle\left|\theta^{\prime}(y+u)+\theta^{\prime}(y-u)-\log\frac{y}{2\pi}\right|\leq\frac{u^{2}}{y^{2}-u^{2}}+\epsilon_{1}(y,\tau,c),

where

\epsilon_{1}(y,\tau,c)=4\pi^{2}\cdot 0.006\cdot\left[\frac{1}{(y+\tau)^{2}}+\frac{1}{c^{2}}\right].

On the other hand, substituting the following simple bound into the integral in (31) below,

(30)

0\leq\phi(0,u,x)\leq\frac{1-2x}{u^{2}},

and evaluating the resulting integral in closed-form gives the inequality

(31)

0\leq\int_{\tau}^{y-c}\phi(0,u,x)\cdot\frac{u^{2}}{y^{2}-u^{2}}\,du\leq\epsilon_{2}(z,c),

where

\epsilon_{2}(z,c)=\frac{1-2x}{2y}\cdot\log\frac{2y}{c}.

Additionally, using the anti-derivative formula

(32)

\displaystyle\int\phi(0,u,x)\,du=\arctan\left(\frac{u}{1-x}\right)-\arctan\left(\frac{u}{x}\right),

together with the following double inequality (from the Laurent series for $\arctan$ ), which is valid for $u>\tau>1-x$ ,

(33)

0\leq\left[\arctan\left(\frac{u}{1-x}\right)-\arctan\left(\frac{u}{x}\right)\right]-\left[\pi-\frac{1-2x}{u}\right]\leq\frac{(1-x)^{3}-x^{3}}{3u^{3}},

we obtain

(34)

\int_{\tau}^{y-c}\phi(0,u,x)\,du\geq\frac{1-2x}{\tau}-\epsilon_{3}(z,\tau,c),

where, after using the elementary inequality $(1-x)^{3}-x^{3}<(1-x)^{2}(1-2x)$ ,

\epsilon_{3}(z,\tau,c)=\left[\frac{(1-x)^{2}}{3\tau^{3}}+\frac{1}{y-c}\right]\cdot(1-2x).
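The elementary bounds (30) and (33) used in this step can be spot-checked numerically. The Python sketch below is a sanity check on a finite grid in floating point, not a proof; the names `phi0` and `check_bounds` are hypothetical, and a small tolerance absorbs rounding error.

```python
from math import atan, pi

def phi0(u, x):
    """phi(0, u, x) for real x < 0, written out from the definition."""
    return -x / (x * x + u * u) + (1 - x) / ((1 - x) ** 2 + u * u)

def check_bounds(x, us, tol=1e-12):
    """Check (30) and (33) at each u in us (all u > 1 - x); True if all hold."""
    for u in us:
        if not (0 <= phi0(u, x) <= (1 - 2 * x) / u ** 2):
            return False
        # Left side of (33): antiderivative from (32) minus [pi - (1-2x)/u].
        excess = (atan(u / (1 - x)) - atan(u / x)) - (pi - (1 - 2 * x) / u)
        upper = ((1 - x) ** 3 - x ** 3) / (3 * u ** 3)
        if not (-tol <= excess <= upper + tol):
            return False
    return True

grid_ok = check_bounds(-2.0, [3.5 + 0.5 * k for k in range(200)])
```

The same check also confirms the elementary inequality $(1-x)^{3}-x^{3}<(1-x)^{2}(1-2x)$ for any negative $x$ one cares to try.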

Therefore, combining (24), (29), (31), and (34), we obtain

(35)

L_{\theta}(z,\tau)\geq\left(\frac{1-2x}{\tau}-\epsilon_{3}\right)\left(\log\frac{y}{2\pi}-\epsilon_{1}\right)-\epsilon_{2}.

We now calculate an upper bound on $L_{S}$ , which is defined in (25). Let $\phi^{\prime}$ denote the derivative of $\phi$ with respect to $u$ . Using integration by parts and Lemma 3, part (iv), together with the intermediate value theorem, we obtain

	$\displaystyle\left|L_{S}(z,\tau)\right|$	$\displaystyle\leq 2\,\left[\phi(0,\tau,x)+\phi(0,y-c,x)\right]\cdot\sup_{c<u<2y}|S(u)|$
		$\displaystyle\quad-2\,\phi^{\prime}(0,\tau,x)\cdot\sup_{c<u_{1}<u_{2}<2y}\left|\int_{u_{1}}^{u_{2}}S(u)\,du\right|.$

If $c>e$ , then [trudgian_2014, Theorem 1] gives the bound $|S(u)|\leq\ell(u)$ . And if $c>168\pi$ , then [trudgian_2011, Theorem 2.2] gives the bound $|\int_{u_{0}}^{u}S(t)\,dt|\leq\ell_{1}(u)$ . So, using the simple bound (30) for $\phi$ as well as the bound

(36)

\frac{-2+4x}{u^{3}}\leq\phi^{\prime}(\beta,u,x)\leq 0,

valid for $0\leq\beta\leq 1$ , we obtain

(37)

\left|L_{S}(z,\tau)\right|\leq\epsilon_{4}(z,\tau,c)+\epsilon_{5}(z,\tau),

where

	$\displaystyle\epsilon_{4}(z,\tau,c)=\left(\frac{2-4x}{\tau^{2}}+\frac{2-4x}{(y-c)^{2}}\right)\cdot\ell(2y),$
	$\displaystyle\epsilon_{5}(z,\tau)=\frac{4-8x}{\tau^{3}}\cdot\ell_{1}(2y).$

Combining (26), (35), and (37) yields the lower bound in the proposition.

To derive the upper bound in the proposition, we bound $U(z,\tau)$ in (21) from above. Let us write $U(z,\tau)=I_{1}(z,\tau)+I_{2}(z,\tau)$ where

	$\displaystyle I_{1}(z,\tau)$	$\displaystyle:=\int_{\tau<|u-y|\leq y-c}\phi(1/2,u-y,x)\,\frac{dN(u)}{2},$
	$\displaystyle I_{2}(z,\tau)$	$\displaystyle:=\int_{|u-y|>y-c}\phi(1/2,u-y,x)\,\frac{dN(u)}{2}.$

$I_{1}$ is estimated by an analogous calculation to that used for $L(z,\tau)$ . The difference is that the formula (32) and the double inequality (33) are replaced with the formula

\int\phi(1/2,u,x)\,du=2\arctan\left(\frac{2u}{1-2x}\right),

and the following double inequality, valid for $u>\tau>1-2x$ ,

0\leq 2\arctan\left(\frac{2u}{1-2x}\right)-\left[\pi-\frac{1-2x}{u}\right]\leq\frac{(1-2x)^{3}}{12u^{3}}.

Consequently, the formula (34) is replaced with

\int_{\tau}^{y-c}\phi(1/2,u,x)\,du\leq\frac{1-2x}{\tau}.

Thus the term $\epsilon_{3}$ in (35) may be replaced with zero. Also, since we are looking for an upper bound, the $-$ signs in (35) should be replaced with $+$ signs. Putting this together,

(38)

I_{1}(z,\tau)\leq\frac{1-2x}{2\pi\tau}\left(\log\frac{y}{2\pi}+\epsilon_{1}\right)+\frac{\epsilon_{2}}{2\pi}+\frac{\epsilon_{4}+\epsilon_{5}}{2}.
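The arctangent double inequality and the resulting integral bound used for $I_{1}$ can be illustrated numerically. The sketch below is a hedged example only: the helper names and the values of $x$, $\tau$, and the upper limit (standing in for $y-c$) are arbitrary choices of ours, subject to $\tau>1-2x$.

```python
import math


def primitive(u, x):
    """Antiderivative 2*arctan(2u/(1-2x)) of phi(1/2, u, x) with respect to u."""
    return 2.0 * math.atan(2.0 * u / (1.0 - 2.0 * x))


def arctan_gap(u, x):
    """Middle expression of the double inequality: primitive minus [pi - (1-2x)/u]."""
    return primitive(u, x) - (math.pi - (1.0 - 2.0 * x) / u)


def integral_phi_half(tau, upper, x):
    """Integral of phi(1/2, u, x) over [tau, upper], via the antiderivative."""
    return primitive(upper, x) - primitive(tau, x)


# Illustrative values only; 'upper' plays the role of y - c.
x, tau, upper = -1.0, 4.0, 50.0
gap = arctan_gap(tau, x)
cap = (1.0 - 2.0 * x) ** 3 / (12.0 * tau ** 3)
print(0.0 <= gap <= cap)
print(integral_phi_half(tau, upper, x) <= (1.0 - 2.0 * x) / tau)
```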

Next, we bound $I_{2}$ . After the change of variable $u\leftarrow u-y$ we obtain

I_{2}(z,\tau)=\frac{1}{2}\int_{y-c}^{\infty}\phi(1/2,u,x)\,dN(y+u)-\frac{1}{2}\int_{y-c}^{\infty}\phi(1/2,u,x)\,dN(y-u).

Using integration by parts together with the observation $\phi(1/2,u,x)\ll 1/u^{2}$ and the facts that $N(u)\ll u\log u$ and that $N(u)$ is non-decreasing, we obtain

(39)

I_{2}(z,\tau)\leq\int_{y-c}^{\infty}\left|\phi^{\prime}(1/2,u,x)\right|N(y+u)\,du.

Therefore, on substituting the bound on $\phi^{\prime}$ given in (36) and the bound

|N(u)|\leq\frac{u}{2\pi}\log\frac{u}{2\pi},\qquad(u\geq\gamma_{1})

which follows from e.g. [trudgian_2014, Corollary 1], we obtain after a small calculation

I_{2}(z,\tau)\leq\epsilon_{6}(z,c),

where $\epsilon_{6}(z,c)$ is defined as in the statement of the proposition. The claimed upper bound then follows on combining this with (38).

9. Proof of Lemma 3

Proof of parts (i)–(iii): Taking the partial derivative of $\phi$ with respect to $\beta$ , we find with the aid of a computer algebra system that

(40)

\frac{\partial}{\partial\beta}\phi(\beta,\eta,x)=-\frac{(2\beta-1)(2x-1)G(\beta,\eta,x)}{[((\beta-x)^{2}+\eta^{2})((1-\beta-x)^{2}+\eta^{2})]^{2}},

where $G$ is a degree $4$ monic polynomial in $\beta$ ,

(41)

G(\beta,\eta,x):=\beta^{4}-2\beta^{3}+(1-2\eta^{2}+2x-2x^{2})\beta^{2}+2(\eta^{2}-x+x^{2})\beta+\eta^{2}(2x-2x^{2}-1-3\eta^{2})+x^{2}(1-2x+x^{2}),

satisfying $G(\beta,\eta,x)=G(1-\beta,\eta,x)$ . Note that the sign of the partial derivative of $\phi$ with respect to $\beta$ is the same as the sign of $(2\beta-1)G(\beta,\eta,x)$ .
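For readers without a computer algebra system at hand, the identity (40) and the symmetry of $G$ can be checked in exact rational arithmetic. The sketch below assumes that $\phi(\beta,\eta,x)=\frac{\beta-x}{(\beta-x)^{2}+\eta^{2}}+\frac{1-\beta-x}{(1-\beta-x)^{2}+\eta^{2}}$, which is the form consistent with (48). The function names and sample points are hypothetical, and the check is a spot test rather than a proof.

```python
from fractions import Fraction as F


def G(beta, eta, x):
    """The quartic polynomial G of (41)."""
    e2 = eta * eta
    return (beta**4 - 2 * beta**3
            + (1 - 2 * e2 + 2 * x - 2 * x * x) * beta**2
            + 2 * (e2 - x + x * x) * beta
            + e2 * (2 * x - 2 * x * x - 1 - 3 * e2)
            + x * x * (1 - 2 * x + x * x))


def dphi_dbeta_direct(beta, eta, x):
    """Quotient-rule derivative of the assumed two-term form of phi."""
    a, b, e2 = beta - x, 1 - beta - x, eta * eta
    return (e2 - a * a) / (a * a + e2) ** 2 - (e2 - b * b) / (b * b + e2) ** 2


def dphi_dbeta_40(beta, eta, x):
    """Right-hand side of (40)."""
    a, b, e2 = beta - x, 1 - beta - x, eta * eta
    denom = ((a * a + e2) * (b * b + e2)) ** 2
    return -(2 * beta - 1) * (2 * x - 1) * G(beta, eta, x) / denom


# Arbitrary rational sample points with eta > 0 and x <= 0.
samples = [
    (F(k, 4), eta, x)
    for k in range(5)
    for eta in (F(1, 3), F(2))
    for x in (F(0), F(-1, 2), F(-3))
]
all_match = all(
    dphi_dbeta_direct(b, e, x) == dphi_dbeta_40(b, e, x) for b, e, x in samples
)
symmetric = all(G(b, e, x) == G(1 - b, e, x) for b, e, x in samples)
print(all_match, symmetric)
```

Exact fractions are used so that equality can be tested without a floating-point tolerance.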

We have the formulas

(42)

G(\beta,0,x)=(\beta-x)^{2}(1-\beta-x)^{2},

(43)

\frac{\partial}{\partial\beta}G(\beta,\eta,x)=2(2\beta-1)(\beta-r_{+})(\beta-r_{-}),

where

(44)

r_{\pm}:=\frac{1\pm\sqrt{4\eta^{2}+(2x-1)^{2}}}{2}\qquad\text{so that}\qquad r_{-}<0\text{ and }1<r_{+},

as well as the formulas

(45)

\left[\frac{\partial^{2}}{\partial\beta^{2}}G(\beta,\eta,x)\right]_{\beta=\frac{1}{2}}=-4\eta^{2}-(2x-1)^{2},

(46)

\left[\frac{\partial^{2}}{\partial\beta^{2}}G(\beta,\eta,x)\right]_{\beta=r_{\pm}}=8\eta^{2}+2(2x-1)^{2}.
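The formulas (42), (43), (45), and (46) can likewise be spot-checked exactly by differentiating the polynomial (41) term by term. In the sketch below the helper names are our own, the argument `eta2` stands for $\eta^{2}$, and we use the product $(\beta-r_{+})(\beta-r_{-})=\beta^{2}-\beta+x-x^{2}-\eta^{2}$, which follows from (44), to avoid square roots.

```python
from fractions import Fraction as F


def coeffs(eta2, x):
    """Coefficients (c0, ..., c4) of G in powers of beta, read off from (41)."""
    c0 = eta2 * (2 * x - 2 * x * x - 1 - 3 * eta2) + x * x * (1 - 2 * x + x * x)
    c1 = 2 * (eta2 - x + x * x)
    c2 = 1 - 2 * eta2 + 2 * x - 2 * x * x
    return (c0, c1, c2, F(-2), F(1))


def deriv(beta, eta2, x, order):
    """Exact derivative of G of the given order (0, 1 or 2) with respect to beta."""
    total = F(0)
    for k, c in enumerate(coeffs(eta2, x)):
        if k < order:
            continue
        factor = 1
        for j in range(order):
            factor *= k - j
        total += factor * c * beta ** (k - order)
    return total


def check_formulas(beta, eta2, x):
    """Return True when (42), (43) and (45) hold exactly at the given point."""
    ok_42 = deriv(beta, F(0), x, 0) == (beta - x) ** 2 * (1 - beta - x) ** 2
    quad = beta * beta - beta + x - x * x - eta2  # (beta - r_plus)(beta - r_minus)
    ok_43 = deriv(beta, eta2, x, 1) == 2 * (2 * beta - 1) * quad
    ok_45 = deriv(F(1, 2), eta2, x, 2) == -4 * eta2 - (2 * x - 1) ** 2
    return ok_42 and ok_43 and ok_45


# Arbitrary rational sample points (eta2 > 0, x <= 0).
points = [
    (F(b, 7), F(e, 5), F(-x, 3))
    for b in range(8)
    for e in (1, 4, 9)
    for x in (0, 1, 5)
]
all_ok = all(check_formulas(b, e, x) for b, e, x in points)

# With x = -1 and eta = 2 the roots in (44) are rational: r_plus = 3, r_minus = -2.
second_at_roots = [deriv(F(r), F(4), F(-1), 2) for r in (3, -2)]
expected_46 = 8 * F(4) + 2 * (2 * F(-1) - 1) ** 2
print(all_ok, second_at_roots == [expected_46, expected_46])
```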

Taking (42) as our “starting point” in some sense, and viewing it as a function of $\beta$ , we see that $G(\beta,0,x)$ has two local minima (and $\beta$ -axis intercepts) at $\beta=x$ and $\beta=1-x$ with a (positive) local maximum at $\beta=1/2$ . As $\eta>0$ increases, we see from (43) and (44) together with (46) that the two local minima locations $r_{\pm}$ move away from $1/2$ . In comparison, as follows from (43) and (45), $\beta=1/2$ remains a local maximum of $G(\beta,\eta,x)$ (and a global maximum on the $\beta$ -interval $[0,1]$ ), albeit with a monotonically decreasing value of $G(1/2,\eta,x)$ . The latter claim can be seen from the negativity of the partial derivative

(47)

\frac{\partial}{\partial\eta}G(\beta,\eta,x)=-\eta\left(12\eta^{2}+(2\beta-1)^{2}+(2x-1)^{2}\right).

We now consider three cases.

Case (1): Suppose $G(\beta,\eta,x)>0$ on the $\beta$-interval $(0,1)$, so that, by considering the sign of $\partial\phi/\partial\beta$ in (40), we find $\beta=1/2$ is a local minimum of $\phi(\beta,\eta,x)$. In evaluating this case, we note that, by the negativity of the partial derivative in (47), if $\eta$ and $x$ are such that $\beta=0$ is a root of $G(\beta,\eta,x)$, then $\beta=0$ cannot be a root for any greater $\eta$ (with the same $x$). Also, $\beta=0$ is a root if and only if the constant term of the polynomial $G(\beta,\eta,x)$ in (41) is $0$, hence if and only if $3\eta^{4}+(2x^{2}-2x+1)\eta^{2}+x^{2}(-x^{2}+2x-1)=0$. Solving for $\eta$ and recalling the discussion following (46), we therefore find $G(\beta,\eta,x)>0$ throughout $\beta\in(0,1)$ if and only if

\eta\leq\sqrt{\frac{-2x^{2}+2x-1+\sqrt{16x^{4}-32x^{3}+20x^{2}-4x+1}}{6}}=:v(x).

For such $\eta$ , $\phi(\beta,\eta,x)$ has no local extrema in the interval $\beta\in(0,1)$ except at $\beta=1/2$ where it has a local minimum. Thus, if $0\leq\beta\leq 1$ , then $\phi(\beta,\eta,x)$ is minimized at $\beta=1/2$ in this case.

Case (2): Suppose $G(\beta,\eta,x)<0$ on the $\beta$ -interval $(0,1)$ , so that, by considering the sign of $\partial\phi/\partial\beta$ in (40), we find $\beta=1/2$ is a local maximum of $\phi(\beta,\eta,x)$ . On the other hand, $G(\beta,\eta,x)$ maintains a negative sign throughout $\beta\in(0,1)$ if and only if the local extremum of $G(\beta,\eta,x)$ that occurs at $\beta=1/2$ is negative. By direct calculation, we have

G(1/2,\eta,x)=\frac{1}{16}(1-2x)^{4}-\frac{1}{2}(1-2x)^{2}\eta^{2}-3\eta^{4}.

Setting $G(1/2,\eta,x)=0$ and solving for $\eta$ , we find that $G(\beta,\eta,x)<0$ throughout $\beta\in(0,1)$ if and only if

\eta>\frac{1-2x}{2\sqrt{3}}.

For such $\eta$ , $\phi(\beta,\eta,x)$ has no local extrema in the interval $\beta\in(0,1)$ except at $\beta=1/2$ where it has a local maximum. Thus, if $0\leq\beta\leq 1$ , then $\phi(\beta,\eta,x)$ is minimized at the boundary points $\beta=0,1$ and maximized at $\beta=1/2$ in this case.
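The two thresholds appearing in Cases (1) and (2) can be checked numerically against the polynomial (41): $G$ should vanish at $\beta=0$ when $\eta=v(x)$, and at $\beta=1/2$ when $\eta=(1-2x)/(2\sqrt{3})$. The following floating-point sketch uses hypothetical helper names and a few arbitrary values of $x<0$. It is an illustration under those assumptions, not a substitute for the argument above.

```python
import math


def G(beta, eta, x):
    """The quartic polynomial G of (41), in floating point."""
    e2 = eta * eta
    return (beta**4 - 2.0 * beta**3
            + (1.0 - 2.0 * e2 + 2.0 * x - 2.0 * x * x) * beta**2
            + 2.0 * (e2 - x + x * x) * beta
            + e2 * (2.0 * x - 2.0 * x * x - 1.0 - 3.0 * e2)
            + x * x * (1.0 - 2.0 * x + x * x))


def v(x):
    """Case (1) threshold: the largest eta for which G > 0 throughout (0, 1)."""
    inner = math.sqrt(16 * x**4 - 32 * x**3 + 20 * x**2 - 4 * x + 1)
    return math.sqrt((-2 * x * x + 2 * x - 1 + inner) / 6.0)


def eta_star(x):
    """Case (2) threshold (1 - 2x) / (2 sqrt 3)."""
    return (1.0 - 2.0 * x) / (2.0 * math.sqrt(3.0))


# For each sample x: residual of G at each threshold, and the ordering of thresholds.
report = [
    (x, G(0.0, v(x), x), G(0.5, eta_star(x), x), v(x) < eta_star(x))
    for x in (-0.5, -1.0, -3.0)
]
for row in report:
    print(row)
```

For each sample value the two residuals should be zero up to rounding, and the ordering $v(x)<(1-2x)/(2\sqrt{3})$ leaves a non-empty range of $\eta$ for Case (3).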

Case (3): Suppose $G(\beta,\eta,x)$ does not maintain its sign throughout the $\beta$-interval $(0,1)$, so that $G(\beta,\eta,x)$ has at least one root at some $\beta\in(0,1)$. By the symmetry of $G(\beta,\eta,x)$ about $\beta=1/2$ as well as the discussion following (46), there can be at most two such roots, one in the subinterval $(0,1/2]$ and another symmetric root in the subinterval $[1/2,1)$. By the work done thus far, such roots occur if and only if

v(x)<\eta\leq\frac{1-2x}{2\sqrt{3}}.

When these roots occur, they each correspond to local maxima of $\phi(\beta,\eta,x)$ as seen by considering the sign of $\partial\phi/\partial\beta$ in (40). Therefore in this case, $\phi(\beta,\eta,x)$ is minimized either at the boundary $\beta=0$ (equivalently $\beta=1$ ), or at the center $\beta=1/2$ . To find the point of transition between these two situations, we set $\phi(0,\eta,x)=\phi\left(1/2,\eta,x\right)$ and find that the transition occurs when

\eta=\sqrt{\frac{x(x-1)}{3}}.

Summary: From the above 3 cases, we have the following behavior of the minimum of $\phi(\beta,\eta,x)$ for $\beta\in[0,1]$ .

(a)

If $\eta\leq\sqrt{\frac{x(x-1)}{3}}$, then $\phi(\beta,\eta,x)$ is minimized at $\beta=\frac{1}{2}$.
(b)

If $\eta>\sqrt{\frac{x(x-1)}{3}}$ , then $\phi(\beta,\eta,x)$ is minimized at $\beta=0$ (equivalently $\beta=1$ ).

This covers claims (i) and (ii) in the statement of the lemma. Claim (iii) in the lemma follows from Case (2) above.
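The summary rule (a)/(b) can be compared with a brute-force search over a grid of $\beta\in[0,1]$. The sketch below again assumes the two-term form $\phi(\beta,\eta,x)=\frac{\beta-x}{(\beta-x)^{2}+\eta^{2}}+\frac{1-\beta-x}{(1-\beta-x)^{2}+\eta^{2}}$ consistent with (48). The function names, the grid size, and the test values are illustrative choices of ours, taken away from the transition point to avoid ties.

```python
import math


def phi(beta, eta, x):
    """Assumed two-term form of phi, consistent with (48)."""
    a, b = beta - x, 1.0 - beta - x
    return a / (a * a + eta * eta) + b / (b * b + eta * eta)


def transition(x):
    """Transition height sqrt(x(x-1)/3) from the summary."""
    return math.sqrt(x * (x - 1.0) / 3.0)


def predicted_offset(eta, x):
    """|beta - 1/2| of the minimizer according to rule (a)/(b)."""
    return 0.0 if eta <= transition(x) else 0.5


def brute_force_offset(eta, x, steps=200):
    """|beta - 1/2| of the grid point in [0, 1] minimizing phi."""
    grid = [k / steps for k in range(steps + 1)]
    best = min(grid, key=lambda beta: phi(beta, eta, x))
    return abs(best - 0.5)


# x = -1 gives a transition near 0.8165; the eta values probe all three cases.
x = -1.0
for eta in (0.3, 0.8, 0.84, 2.0):
    print(eta, predicted_offset(eta, x), brute_force_offset(eta, x))
```

An offset of $0$ means the minimum is at $\beta=1/2$, and an offset of $1/2$ means it is at the boundary $\beta=0$ (equivalently $\beta=1$).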

Proof of part (iv): We write $\phi^{\prime}(\beta,u,x)$ for the partial derivative of $\phi(\beta,u,x)$ with respect to $u$, and define $\phi^{\prime\prime}(\beta,u,x)$ similarly. We have

(48)

\phi^{\prime}(\beta,u,x)=\frac{2(x-\beta)u}{[(x-\beta)^{2}+u^{2}]^{2}}+\frac{2(x-(1-\beta))u}{[(x-(1-\beta))^{2}+u^{2}]^{2}}.

Since $x\leq 0$ , $\beta\in[0,1]$ , and $u>0$ , by (48) we have that $\phi^{\prime}(\beta,u,x)<0$ for any such $\beta,u$ , and $x$ .

For the claims regarding the increasing nature of $\phi^{\prime}(\beta,u,x)$ , we begin by evaluating $\phi^{\prime\prime}$ at $\beta=1/2$ and find

\phi^{\prime\prime}(1/2,u,x)=\frac{2(1-2x)[3u^{2}-(\frac{1}{2}-x)^{2}]}{[(\frac{1}{2}-x)^{2}+u^{2}]^{3}}.

From this, we see that $\phi^{\prime}(1/2,u,x)$ is increasing if and only if

u>\frac{\frac{1}{2}-x}{\sqrt{3}}=\frac{1-2x}{2\sqrt{3}},

yielding the first of these claims.

Similarly, evaluating $\phi^{\prime\prime}$ at $\beta=0$ gives us

(49)

\phi^{\prime\prime}(0,u,x)=\frac{2x(x^{2}-3u^{2})}{(x^{2}+u^{2})^{3}}+\frac{2(x-1)((x-1)^{2}-3u^{2})}{((x-1)^{2}+u^{2})^{3}}.

The first term in (49) is positive only when $u>|x|/\sqrt{3}$ and the second term only when $u>|x-1|/\sqrt{3}$ . So, since $x\leq 0$ , $\phi^{\prime}(0,u,x)$ is increasing when

u>\frac{1-x}{\sqrt{3}}=\frac{2-2x}{2\sqrt{3}}.
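The sign conditions in part (iv) can be illustrated by evaluating the two displayed expressions for $\phi^{\prime\prime}$ directly. In the sketch below the function names and the sample values are arbitrary choices of ours. The last evaluation suggests that, at $\beta=0$, the condition $u>(1-x)/\sqrt{3}$ is sufficient but need not be sharp.

```python
import math


def phi_second_at_half(u, x):
    """phi''(1/2, u, x) as displayed above."""
    m = (0.5 - x) ** 2
    return 2.0 * (1.0 - 2.0 * x) * (3.0 * u * u - m) / (m + u * u) ** 3


def phi_second_at_zero(u, x):
    """phi''(0, u, x) as written in (49)."""
    y = x - 1.0
    first = 2.0 * x * (x * x - 3.0 * u * u) / (x * x + u * u) ** 3
    second = 2.0 * y * (y * y - 3.0 * u * u) / (y * y + u * u) ** 3
    return first + second


x = -1.0
half_threshold = (1.0 - 2.0 * x) / (2.0 * math.sqrt(3.0))  # about 0.866
zero_threshold = (1.0 - x) / math.sqrt(3.0)                # about 1.155

# Sign change of phi''(1/2, u, x) across its threshold.
print(phi_second_at_half(0.8, x) < 0.0, phi_second_at_half(0.9, x) > 0.0)
# Positivity of phi''(0, u, x) above its threshold, on a small sample.
print(all(phi_second_at_zero(zero_threshold + d, x) > 0.0 for d in (0.01, 0.5, 5.0)))
# Just below the threshold the sum can still be positive.
print(phi_second_at_zero(1.0, x) > 0.0)
```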

10. Conclusions and future directions

We presented a method to verify the RH for zeta at large heights and to verify the RH for a general class of $L$ -functions at low heights. The method is simple to understand and implement and we demonstrated its efficacy on a variety of $L$ -functions using interval arithmetic. We also presented a significant improvement to the method in the case of zeta by incorporating explicit bounds on $S(t)$ and integrals of $S(t)$ .

In forthcoming work, we will develop and detail further generalizations of this verification method. These generalizations include, among other things, consideration of $w_{k,\delta}$ and $v_{k,z}$ when $k>1$, further improvements in the case of zeta at large heights, the special case of Dirichlet $L$-functions to real primitive characters, as well as extending the improvements in §6 to a more general setting.

Acknowledgements

We are grateful to Georg-August-Universität Göttingen, Niedersächsische Staats- und Universitätsbibliothek Göttingen, for helping us locate some of Riemann’s unpublished notes and giving us permission to reproduce them in this paper. Megan Kyi thanks the OH5-OSU SURE Undergraduate Research program at the Ohio State University, Columbus, for their support.

