A method for verifying the generalized Riemann hypothesis
Ghaith Hiary, Summer Ireland, Megan Kyi
GH: Department of Mathematics, The Ohio State University, 231 West 18th
Ave, Columbus, OH 43210, USA
hiary.1@osu.edu
SI: Department of Mathematics, The Ohio State University, 231 West 18th
Ave, Columbus, OH 43210, USA
ireland.118@buckeyemail.osu.edu
MK: Oberlin College, 135 West Lorain Street, Oberlin, OH 44074-1081, USA
mkyi@oberlin.edu
Abstract.
Riemann numerically approximated at least three zeta zeros. According to Edwards, Riemann even took steps to verify that the lowest zero he computed was indeed the first zeta zero. This approach to verification is developed, improved, and generalized to a large class of -functions. Results of numerical calculations demonstrating the efficacy of the method are presented.
Let be a complex variable, where and are real numbers.
The Riemann zeta function is defined by the Dirichlet series
(1)
which converges absolutely in the half-plane .
Zeta can be analytically continued to the entire complex plane except for a simple pole at , and has zeros (i.e., roots) at , which are called the trivial zeros. Zeta also has an infinite number of nontrivial zeros in the critical strip , none of which is real. We call the height of and order the ’s by increasing height.
The trivial and nontrivial zeros account for all the zeta zeros.
The Riemann Hypothesis (RH) is that all the ’s are on the critical line ,
or equivalently that for all .
It is frequently asserted that Riemann numerically approximated the first few zeta zeros by hand, citing unpublished notes by Riemann. See in particular Edwards [edwards_riemann_2001, §7.6] as well as the Clay Mathematics Institute page [clay].
As Figure 1 shows,
Riemann numerically approximated three zeta zeros on the critical line,
corresponding to those with ordinates (i.e., imaginary parts)
Riemann approximated rather than as the former
quantity appeared naturally in various formulas.
The closest approximation of that we found in Riemann’s notes was , so that is .111This differs from what is stated in [edwards_riemann_2001, 159] which gave the approximation . For and , Riemann computed the approximations and
, respectively. Both of these approximations were
noticeably far from the true values and , and
as can be seen in Figure 1, Riemann had other intermediate approximations that were
slightly better.
Nevertheless, Riemann’s approximation of , which was
long unknown to the outside world,
remained closest to the true value of for nearly five decades.222As far as we can tell, the first
circulated approximation of was in 1887 by
Stieltjes [bailaud_bourget_1905, 450] who gave the approximation .
Eight years later, Gram [gram_1895]
gave the approximation , which Gram [gram_1903] improved to
in 1903. Around the same time, Lindelöf [lindelof_1903] devised a different method
to approximate the ’s and
proved that .
According to Edwards [edwards_riemann_2001, §7.6], Riemann even attempted
to verify that Riemann’s numerical approximation of indeed
corresponded to the first zeta zero (i.e., to the with smallest positive ordinate).
This verification relied on the Hadamard product for together with
a positivity argument and a known special value of zeta.
However, unlike Riemann’s method to numerically compute pointwise values of , which
became a standard method known as the Riemann–Siegel formula [siegel_1932],
Riemann’s approach to verifying the RH remained little known.
It is worth remarking, though,
that Gram [gram_1895, gram_1903]
considered the power series of the logarithm of the Hadamard product, like Riemann did,
but often appeared to assume the RH. One may also
compare the Riemann approach to verification with the Li
criterion [li_1997] for the RH equivalence.
After Riemann’s 1859 paper, various efficient
methods for verifying the RH were derived.
Backlund [backlund_1911, backlund_1916]
devised a verification method that relied on
a clever application of
the argument principle from complex analysis
together with the Euler–Maclaurin summation for .
This was eventually surpassed by
a highly efficient method due to Turing [turing_1953],
which has since become the standard method for verifying the RH,
provided one is high enough on the critical line.
In comparison, the Riemann method, cited in Edwards [edwards_riemann_2001, §7.6],
is time consuming at large heights. It can be expected to require
initial zeta zeros to verify the RH up to height .
Nevertheless, this method is reasonably efficient at low heights333For example, zeta zeros suffice to verify the RH up to height via this method,
and zeta zeros suffice to verify the RH up to height . while
offering great simplicity.
Therefore, it is worthwhile to generalize the Riemann method
to families of -functions
where even the “first” zero of is deeply interesting. Such a generalization is one of the main goals of this paper.
Specifically, rather
than fall back on a generalized Backlund method, which would require using
a numerically-involved application of the argument
principle,
we re-examine and develop the Riemann method in a more general setting.
Our generalization
works naturally with
already available databases and software for -functions.
Our generalization
is also simple to derive and justify,
requiring a single
numerical evaluation of a logarithmic derivative of the -function at a special point in the region of absolute convergence.
Although the main goal of this paper is to provide a simple RH verification for a large class of -functions at low heights, a secondary goal is to improve the Riemann method for zeta
so that it functions efficiently at large heights. This results in a conceptually straightforward verification method that can verify the RH over larger windows. Specifically, given zeros data in a window of size around height , the improved method in Theorem 13
is expected to succeed in verifying the RH in a window of size around .
To illustrate out main results, let us state two
corollaries.
Corollary 1 provides an example of
an RH verification test for low zeros of
the Dirichlet -function , where is any real primitive character of fundamental discriminant .
This corollary is obtained
from Theorem 7, part (i), on setting and ,
and using the formula for in Corollary 5.
Note
the required value of the logarithmic derivative of
that appears in Corollary 1
is well inside the region of absolute convergence of .
So, the required value can be computed easily by truncating the
Dirichlet series for the logarithmic derivative, even if is very large.
Lemma 6 furnishes an explicit bound on the corresponding truncation error.
To state the next two corollaries, we will make use of the following the quantity.
Corollary 1.
Let be a positive fundamental discriminant, be a real positive number,
and be a set of nonempty disjoint subintervals of the form or of the form . Suppose that has a zero of odd multiplicity in each subinterval in .
Further, define
Let be the Euler constant. For any real positive number , if
then then RH holds for all the nontrivial zeros of with positive height .
Here, one may think of as the width of the window where known zeros data is available,
and of as the width of the window where one would like to verify the RH. The quantity is the minimal contribution from known zeros (i.e. from supplied zeros data),
and is the minimal contribution of a hypothetical counter-example of positive height . The displayed inequality indicates that a contradiction has been reached, so that a hypothetical counter-example of positive height cannot exist.
Similarly, by combining Theorem 11, part (i) and Corollary 9,
we obtain an RH verification test for zeta zeros at large heights.
This is stated in Corollary 2.
But in this case we can improve the basic verification test substantially
by considering the behavior
of , which is the fluctuating part of the counting function of zeta zeros - see (23).
Specifically, in Theorem 13, we incorporate the explicit bounds
where, according to [trudgian_2014, trudgian_2011], we may take
(2)
provided is large enough. ( suffices.)
We note that an explicit bound on is a main ingredient in the Turing method as well, but we use this bound differently in our case.
Also, without additional knowledge or analysis, the explicit bound on is typically
far more impactful for us than the explicit bound on .
We will make use of the following notation and quantity. Let
, denote the digamma function, and define
Corollary 2.
Let and be real numbers such that and .
Let be a set of nonempty disjoint subintervals of the form
such that does not belong to any of the subintervals in .
Suppose that has a zero of odd multiplicity in each subinterval in .
Further, define
For any real positive number , if
then RH holds for all the nontrivial zeros of with height in .
Remark.
Here, one may think of as very large, is large but much smaller than ,
and with somewhat smaller than . The expression for
is obtained from Theorem 13 by setting and .
Like before,
the special value of the logarithmic derivative of appearing in Corollary 2
is well inside the region of absolute convergence of .
So this value can be approximated easily and fairly accurately
via a truncated sum over primes and prime powers
using our Lemma 10, even at very large heights.
Our main theorems, Theorem 7 and Theorem 11, additionally
enable verifying the simplicity of zeros in a given range as well as
verifying the completeness of a given list of zeros.
In addition, Theorem 13 gives
a counterpart that allows
one to still draw a conclusion
in some situations where the RH might not be verified using the
Turing method. For example, if the given zeros list is
incomplete (i.e. there is a zero with ordinate in that is missing from the list), then the
the Turing method
might not prove that the zeros list is indeed incomplete.
In this case, the counterpart in Theorem 13
will typically enable proving that the given zeros list is indeed incomplete.
Lastly, it completely reasonable to expect that a similar method to the one described here may be derived using the framework of the explicit formula [iwaniec_kowalski_2004, §5.5]. By choosing a suitable test function in the explicit formula, one may even accelerate the convergence of the associated series over the prime and prime powers.
At the same time though one must ensure, under no assumption, that the individual terms in
the sum over the zeros appearing in the explicit formula are nonnegative. We favored
the current derivation due to its simplicity, its historical connection, and because we already have good control
over the convergence of the said series in the region of absolute convergence.
Additionally, the current derivation gives us access to several useful exact values and exacting relations as well
as to long-studied sums in the theory of the Riemann zeta function, which benefits the practicality of our derivation.
For example, we can directly benefit from exact values of the polygamma function and, if we wish,
of exact values of -functions at special points such as the class number formula for Dirichlet -functions.
Overview.
In §2, we provide background and set up some notation.
In §3, we outline the Riemann approach to verifying the RH following the description in
[edwards_riemann_2001].
In §4, we generalize the Riemann approach to a class of -functions with real Dirichlet coefficients.
In §5, we treat the case of separately,
both because is outside our class of -functions (in view of the pole at ) and
because our focus for zeta will be on large heights.
In §6, we discuss substantial improvements in the case of zeta.
In §7, we present results of numerical computations
implemented in interval arithmetic
for a variety of examples of -functions.
So, is a zeta zero if and only if the complex conjugate is a zeta zero, or equivalently the ’s are symmetric about the real axis.
The ’s are also symmetric about the critical line. This
is seen by using the zeta functional equation, which
in its simplest form states that
the entire function
where is the Gamma function444The poles of are all simple
and coincide with the trivial zeros of zeta, all of which are simple as well. So, the poles of cancel the trivial zeros of zeta.
The simple pole of zeta at coincides with the zero of the factor in the definition of ., satisfies the functional equation
(4)
Therefore, is even about .
For example, .
Since and have no zeros at all,
the zeros of are the same as the nontrivial zeros of .
Hence, by the functional equation (4), is a zeta zero if and only if is a zeta zero.
Furthermore, by the functional equation (4) and the symmetry relation (3),
So, is real-valued on the critical line (as well as on the real axis).
It follows by the intermediate value theorem that the simple (or odd multiplicity)
nontrivial zeta zeros on the critical line correspond to sign changes of . In particular, one can numerically prove the existence of zeta zeros of odd multiplicity
on the critical line by detecting sign changes of .
3. Riemann and verifying the RH
Being an entire function of order 1, has
a Hadamard product given by
(5)
where
the product is taken by pairing the terms for and (or pairing the terms for and ), which ensures correct convergence.
Starting with (5), Riemann obtained the following formula
(6)
and the sum over the ’s is executed by pairing the terms for and .
Therefore,
As seen in Figure 2, Riemann correctly computed the value of
up to 20 digits,
obtaining .
According to Edwards [edwards_riemann_2001, §7.6], Riemann even
attempted to use the
numerical value of to verify that the Riemann approximation of
indeed corresponded to the first zeta zero (zeta zero of lowest height). This attempt is described essentially as follows.
Using the first zeros in the upper half-plane, .
On the other hand,
if there is a zero in the upper half-plane of height ,
then there must be a second such zero. This is because either
is off the critical line, in which case is a distinct zeta zero in the upper half-plane that is also of
height . Or is on the critical line, in which case,
considering that has the same sign at both and ,
there must be a second zero on the critical line with a positive ordinate .555More precisely,
the argument in [edwards_riemann_2001, §7.6]
only works if has height .
Since the possibility that has height is not yet ruled out,
this argument
does not force the existence of a second zero on the critical line in this case.
Therefore, if existed, then it would force an additional contribution of at least ,
causing the zeros sum to exceed and hence gives a contradiction.
Although not stated explicitly, it is critical to the last part of the argument that the terms
are all nonnegative.
This ensures that the tail of the zeros sum contributes a nonnegative amount to .
Therefore, we can drop the tail of the zeros sum and still obtain
a valid lower bound on .
More generally, in this paper,
we will consider the behavior of the function
(7)
If is a complex number then we have
Note that is nonnegative for and .
To analyze the behavior of in detail, we will often invoke the following lemma.
Lemma 3.
Let be a real number such that . Let
be a real nonpositive number, and let be a real positive number. Then . Furthermore, we have the following.
(i)
If , then is minimized at .
(ii)
If , then is minimized at (or ).
(iii)
If , then is maximized at .
(iv)
is negative. Additionally, if then is increasing, and if then is increasing.
In the sequel,
we use the analytic normalization of -functions, so the critical line is .
We consider -functions of order only.
The following notation and assumptions are used throughout this section.
Let be a Dirichlet series
absolutely convergent in the half-plane .
We suppose that the Dirichlet coefficients are real,
so that
and the zeros of
must be symmetric about the real axis.
Following the notation in Booker [booker_2006],
specialized to our context666In particular, we require that the are real and
instead of . We also write the formulas
for and explicitly as
and , respectively, as well as
drop a scaling factor by in the definition of in [booker_2006, 387] as this
does not interfere with any of our calculations., we state a number of assumptions satisfied
by the set of -functions we consider.
has an Euler product of degree absolutely convergent in the half-plane ,
where the satisfy the conditions in [booker_2006, 387]. We will further assume
that .
Note that by the absolute convergence of the Euler product, has no zeros in the half-plane .
Suppose further there
are positive integers and , a complex number of modulus , and real nonnegative numbers , such that the function defined by
(8)
extends to an entire function and satisfies the functional equation
(9)
Note that by the functional equation, is real.
Also, is real on the real axis.
If ,
then the functional equation simplifies to which means that
is even in . While if , then
which means that is odd in ,
and hence must have a zero of
odd multiplicity at .
Since is entire, must have zeros at the poles of ,
which are the trivial zeros of .
Since has no zeros in the half-plane ,
it follows by the functional equation that the trivial zeros of in
have the same multiplicities as the poles of . Moreover, the
nontrivial zeros of , which we
denote by , are in the critical strip .
We assume , so that , and hence .
Therefore, the zeros of are exactly the nontrivial zeros of .
Also, just like , being of order has a Hadamard product
where we pair the terms for and (or for and ).
The RH for is the assertion that all the ’s are on the critical line .
To state the next proposition, we recall the -th order polygamma function ,
defined as the -th derivative of
.
Also, for any real number such that , let us write
(10)
for sufficiently close to .
Lemma 4.
Let be a positive integer.
Let be a real number such that
and .
Define
where the sum is ordered by pairing each term with its conjugate.
Then is a real number.
If , then
And if , then the same formula holds but there is an additional term of
Proof.
Since is a real number and the ’s are symmetric about the real axis, is real.
By the Hadamard product for ,
for away from zeros or poles of both sides. Therefore, replacing with ,
and using the series expansion (10), we obtain
(13)
where is the indicator function of the condition .
Substituting (13) back into (12),
then back into (11), yields the proposition.
∎
Since our numerical experiments in §7
will focus on the case , we provide a version of Lemma 4 in this special case.
Corollary 5.
When , we have
Let us note that many special values can be expressed exactly
in terms of known constants.777For example, when ,
, ,
, ,
and more generally for ,
. As another example, when , we have .
In general, there are
efficient ways for computing for ;
see for example [johansson_2021] for a discussion of methods to compute , , and related functions.
Therefore, for the purpose of computing , we may focus our attention on the logarithmic derivative
of at .
The next lemma supplies a simple formula for doing this, provided .
We make use of the following notation:
if for a prime and a natural number , then
and we set otherwise. In particular,
since ,
where is the von Mangoldt function. This is defined by
if for a prime and a natural number ,
and otherwise.
Lemma 6.
Let be an integer. If , then
where
Proof.
Suppose . By the Euler product for ,
(14)
So, by Stietljes integration and the bound ,
the tail
of the Dirichlet series (14) for and satisfies
(15)
Using integration by parts,
(16)
Furthermore, by [rosser_1941, 227], we have for
the double inequality
Substituting this into
(16), then back into (15), and integrating yields the result.
∎
Remark.
A simpler version of Lemma 6 is obtained by using the trivial bound
. This gives
Although usually not as precise as Lemma 6,
this estimate is sharper than Lemma 6
if is very large compared to .
Remark.
Lemma 6 generalizes easily to higher order
logarithmic derivatives of at . For example,
where
Theorem 7 next is our main result in this section. Unlike the case of zeta, where none of the ’s is real, might have real nontrivial zeros. So,
care is needed to allow for this possibility.
The following lemma will facilitate the proof of Theorem 7. Recall
that the function was defined in (7), and that
Theorem 7.
Let be a real nonpositive number and let be a real positive number. Let be a set of nonempty disjoint subintervals of the form or of the form . Suppose that has a sign change in each subinterval in .888This means for some in each subinterval in question.. Define
For any real positive number , any positive integer , and with as in (7),
define
Then, we have the following, where zeros are counted with multiplicity in all cases.
(i)
If
,
then there are strictly fewer than non-real ’s off the critical line of height .
(ii)
If
,
then there are strictly fewer than non-real ’s on the critical line of height not accounted for in .
(iii)
If ,
then there are strictly fewer than real ’s off the critical line.
(iv)
If
,
then a zero at the central point has multiplicity strictly less than .
(v)
If ,
then the list of zeros in is complete. This means that
every in the upper half-plane of height
is on the critical line, is simple, and belongs to some subinterval in the set .
Remark.
There are important cases where the subintervals in
should be allowed to appear with multiplicity. In such cases,
the conclusions about the simplicity of zeros in parts (ii) and (iv–v) should be modified
so as to account for
any nonsimple zeros already present in .
For example, the -function of an elliptic curve with analytic rank has by definition
a zero at of multiplicity .
Note that in this case, part (iv) of the theorem
gives an unconditional upper bound on the analytic rank of the elliptic curve.
Bober [bober_2013] gave a
method to bound the analytic rank of elliptic curve -functions via the “explicit formula,”
conditional on the RH for the corresponding -function.
Remark.
Let us explicitly note that if , then is even, so
any zero at the central point has even multiplicity.
Thus, in this situation, it is unclear how the non-simple
zero at can be detected rigorously by numerical means, via the intermediate value theorem,
as there will be no sign change to detect. All this is to say that
if and the zeros of height
have been sufficiently resolved, then the sum over intervals of the form
is expected to be empty.
Proof.
We prove part (i). Let be a set of zeros off of the critical line of height with and , possibly with repetition up to multiplicity. For each , there are necessarily 4 symmetric, distinct zeros , , , and . These 4 counterexample zeros will collectively contribute to a value of
Using lemma 3 and the monotonicity of in , we find that the minimum of the possible contribution from these 4 counterexample is at least
Note that this lower bound is independent of and . Thus, the counterexample zeros along with their symmetric zeros contribute at least to the value of . Thus, if
then we have a contradiction, so there are strictly fewer than zeros off the critical line of positive
height . In the case of this means the RH holds in the interval . In the case of ,
there is at most one set of 4 symmetric, non-real off the critical line of height and they must be simple.
Next, we prove part (ii). Note that the minimum contribution to from a non-real zero on the critical line of height together with its symmetric part is . Similarly, the contribution from such zeros on the critical line is at least . Therefore, if
then we have a contradiction, so there are strictly fewer than zeros on the critical line of positive height which are not accounted for in the subintervals of . In the case of this means the intervals in account for all non-real on the critical line of height . In the case of , at most one pair of non-real and on the critical line of height are not accounted for in . Since each subinterval of contains a sign-change (so corresponds to a zero of odd multiplicity), this case implies that all non-real on the critical line of height are simple (including the possible pair and missed by .
We prove part (iii). By Lemma 3, the minimum of the contribution to from a real zero off the critical line and its symmetric zero is . Thus, the contribution from pairs of real zeros off the critical line is at least . So if
there are strictly fewer than such pairs of real zeros, so fewer than total real zeros off the critical line.
We prove part(iv). Each repetition of the zero (possibly none)
contributes to . So, if the zero at has multiplicity , then these zeros have total contribution . By the same arguments thus far, if
then the multiplicity of the zero at is strictly smaller than .
In the case , this means we have non-vanishing of on the real line. In the case , and combined with part (iii),
any real zero must be at the central point and must be simple.
Lastly, we prove part (v). Suppose
.
By the definition of , we have
Therefore, by part (i) of the theorem, all non-real are on the critical line. By part (ii), all non-real on the critical line belong to some subinterval in the set and are thus simple. By parts (iii) and (iv), is non-vanishing on the real line (except for possibly a simple zero at included in ). These three cases leave no room for zeros outside of the simple zeros within the subintervals in the set . Thus, the list of zeros in account for all zeros of of height and they are all simple, i.e. the list is complete.
∎
5. Generalization in the zeta case
Let
. So, is an entire function.
The series expansion of at is given by
(17)
where are the Stieltjes constants.999For instance,
, , ,
and so on. For any complex number such that , we may write
for sufficiently close to .101010If , then the coefficients can be calculated
easily in terms of the ’s. For example, .
Lemma 8.
Let be a positive integer. Let be a complex number such that and does not
coincide with any zero of . If then
If , then there is an additional term of
Proof.
The proof is similar to that of Proposition 4
except the coefficients are defined differently than the
analogous coefficients
due to the pole of zeta.
∎
Corollary 9.
When , we have
One can compute the
using the Euler–Maclaurin summation formula; see for example [rubinstein_2005].
However, if is large enough, then the following simpler formula could suffice,
and has the same proof as that for Lemma 6.
Lemma 10.
If and , then
where satisfies the same bound as in Lemma 6 but with and .
Theorem 11 is the main result in this section. Since the main interest in the case of zeta is at large heights, we expand about a complex number where is typically large. Therefore,
the advantage provided by the symmetry of the ’s about the real axis is mostly lost.
Theorem 11.
Let be a complex number such that and .
Let be a real positive number such that
. Let be a set of nonempty disjoint subintervals such that has a sign change in each subinterval.
Suppose further that does not belong to any of the subintervals in .
Define
Further, for any real such that , and with as in (7), define
Then, for any real positive number such that we have the following.
(i)
If
,
then all the ’s with height in are on the critical line.
That is, the RH holds in the interval .
(ii)
If
,
then all the ’s on the critical line with height in are simple.
(iii)
If ,
then the list is complete. This means that
every in the upper half-plane with height in
is on the critical line, is simple, and belongs to some subinterval in the set .
Proof.
Let us prove part (i). Suppose there is a counter-example such that . Then is a counter-example distinct from . The contribution of and to is . Since , it follows by Lemma 3 that this contribution is at least . Moreover, the zeros from the set already contribute at least to . So, if the inequality in (i) holds, and considering that any remaining zeros will contribute a nonnegative amount to , then we obtain a contradiction. Hence, the counter-example cannot exist.
We prove part (ii). Suppose there is a nonsimple zero of multiplicity such that . If is already in the set , then , since the zeros in have odd multiplicity (as they correspond to sign changes of ). If is not in , then . In either case, there are at least two zeros on the critical line with ordinates in that are missing from . So, arguing as in part (i) and using Lemma 3, the contribution of these missing zeros to is at least . So, if the inequality in (ii) holds, then we obtain a contradiction since any remaining zeros will contribute a nonnegative amount to . Hence, such a nonsimple cannot exist.
Lastly, we prove part (iii). Note that and . So, if the inequality in (iii) holds, then all the zeros with height in are on the critical line and are simple. Thus,
in seeking a contradiction we may assume without loss of generality that there is a simple zero such that and is not in any subinterval .
But the contribution of such to is at least . Hence, if the inequality in (iii) holds, then we obtain a contradiction, like before. So, such a missing cannot exist.
∎
Remark.
By using the shift in Theorem 11 along with the initial zeros of , one can verify that and are the only zeta zeros with ordinates in the window . Since the value that Riemann computed already tells us that there are no zeta zeros of height less than , this yields that and are indeed the first two zeta zeros. By comparison, verifying and are the first two zeta zeros using just the value requires accounting for the contribution of initial zeros of zeta.
6. Improvements
Instead of using nonnegativity to simply drop the contribution to
of the tail of the zeros sum,
we derive a lower bound on the contribution of the tail.
Incorporating this into Theorem 11 greatly
improves the efficiency of our verification method
at large heights (i.e. when is large). Hence,
the RH can be verified via our method
in a much wider window than before (i.e. for a much larger ). Specifically, whereas the basic verification method in Theorem 11 is only expected to succeed
in windows of size , the improved method in Theorem 13
is expected to succeed in windows of size .
In addition, we derive an upper bound on the contribution of the tail of the zeros sum.
This can sometimes allow us to prove the incompleteness of a supplied
list of zeros in a given range, as shown in Theorem 13.
Proposition 12.
Let be a complex number and be a real number.
Suppose that and .
For any real number such that , we have
Let , , , and the functions and all be given as in Proposition 12. Furthermore, let and the functions , , , , and be given as in Theorem 11. Define
For any real positive number such that we have the following improvements to Theorem 11.
(i)
If
,
then all the ’s with height in are on the critical line. That is, the RH holds in .
(ii)
If
,
then all the ’s on the critical line with height in are simple.
(iii)
If ,
then every in the upper half-plane with height in
is on the critical line, simple, and belongs to some subinterval in the set .
In addition to these improvements, the upper bound in Proposition 12 yields the following counterpart.
(iv)
If , then does not account for all the ’s with height in . This means there is a subinterval in that contains the ordinates of at least three ’s (including multiplicity), or there is such that and is not in any subinterval in , or there is off the critical line with height in .
Proof.
Parts (i)–(iii) follow directly from the arguments in Theorem 10, except that these bounds account for the contribution from zeros outside of the ordinate window for which we have zeros data.
For part (iv), if , then there necessarily are zeros whose (positive) contribution to is not being accounted for. More explicitly, since
already accounts for the maximum possible contribution from all zeros with . Therefore, any deficiency in contribution to must arise from some satisfying
that has not been already accounted for
in .
∎
Remark.
It is possible that a further small improvement would be made by incorporating explicit zeros-density estimates, in addition to the explicit bounds on and its integral that are already included.
7. Numerical examples
The examples in this section are meant for illustration, to show how the method we described behaves
in practice on representative examples.
The data in this section was obtained from [lmfdb] and [zeta_zeros],
and using LCALC [lcalc] as well as SageMath [sagemath].
Our working assumption is that the zeros ordinates from [lmfdb] and [zeta_zeros] are accurate within , and the zeros ordinates obtained using [lcalc] and [sagemath]
are accurate to within , though it is possible the accuracy is higher.
We used this assumption to determine the interval corresponding to each zero ordinate .
Numerical calculations were done using the interval arithmetic package in mpmath [mpmath]. We also used FLINT [flint] to compute the polygamma function when no exact value was available. The code for the implementation is available as a GitHub repository [github].
7.1. The Riemann zeta function
We used Theorem 11 and Theorem 13
for verification using
Our set was obtained from [zeta_zeros]. For ,
and with the aid of Corollary 9 and Lemma 10, applied with , we computed
Based on this input data, Theorem 11, part(i), succeeded in verifying the RH
for , and Theorem 13, part (i), succeeded
verifying the RH for , which is much larger and contains
zeros of the zeta function. Theorem 13, part (iii), also succeeded in
verifying that completeness of
the subset
a window that contains zeros.
In the opposite direction, we applied Theorem 13, part (iv), to
the subset , which is the same as except
the subinterval corresponding to the ordinate
was removed. We computed
Based on this input data,
Theorem 13 succeeded in proving that the set was indeed incomplete.
7.2. Real Dirichlet -function
Let be a fundamental discriminant,
be the corresponding real primitive character, and
the corresponding Dirichlet -function.
In the notation of §4, we have , , , and
if then .
We applied Theorem 7, part (i),
to verify the RH using
The coefficients arising from the Euler product are given by
.
Our set was obtained using [lcalc].
With the aid of Corollary 5
as well as Lemma 6 applied with
we computed
Based on this input data, Theorem 7 succeeded in verifying the RH
for for , a window containing zeros with nonnegative ordinates.
7.3. The Ramanujan -function
Let be the Ramanujan tau function111111So, .,
and let be the Ramanujan tau -function.121212Therefore,
is given by the Dirichlet series
where ,
at least when .
In the notation of §4, we have , , , , and .
We applied Theorem 7
to verify the RH using
The coefficients and arising from the Euler product for
are given by the roots of the polynomial .
Our set was obtained using [lcalc] and [lmfdb].
With the aid of Corollary 5
and Lemma 6, applied with ,
we computed
Based on this input data, Theorem 7 succeeded in verifying the RH for
for , a window which includes zeros with nonnegative ordinates.
7.4. Elliptic curve -function
Let be an elliptic curve over of conductor .
Let be the corresponding elliptic curve -function.
In the notation of §4, we have , , or ,
, and .
We applied Theorem 7 with
to verify the RH for the elliptic curve with
minimal Weierstrass equation
(18)
According to [lmfdb, 37.a1], has conductor so that ,
and the sign of the functional equation of is so that,
in the notation of §4, .
To calculate the Euler factors of , let
denote
the number of solutions
that satisfy the minimal Weierstrass equation (18)
together with the point at infinity that lies on . Define
(19)
Then
the coefficients and arising from the Euler product for
are the roots of the polynomial , provided .
If , then and .
Our set was obtained from [lmfdb].
With the aid of Corollary 5
and Lemma 6, applied with ,
we computed
Based on this input data, Theorem 7 succeeded in verifying the RH for
for , a window which contains zeros with nonnegative ordinates.
We bound and from below and above, respectively,
starting with .
To this end, since the integrand in is nonnegative, a lower bound on
can be obtained by restricting the integration interval to .
Doing so, followed by the change of variable , gives
(22)
On the other hand, it is known [davenport_1967] that
(23)
where and,
if does not coincide with the ordinate of
any nontrivial zero, .131313The arguments are defined by a continuous variation starting at , going up vertically to , and then horizontally to . Also, is a smooth odd function and is right-continuous with
jump discontinuities at the zeros ordinates. Thus, combining (22) and (23), and defining
(24)
(25)
where is the derivative of with respect to ,
we obtain
(26)
We first bound from below.
By [lehman_1970, Lemma 10], if , then
Expanding the right-side in (28) about , and using the Lagrange form of the remainder, as well as (27), we see that for ,
(29)
where
On the other hand, substituting the following simple bound into the integral in (31) below,
(30)
and evaluating the resulting integral in closed-form gives the inequality
(31)
where
Additionally, using the anti-derivative formula
(32)
together with the following double inequality (from the Laurent series for ), which is valid for ,
(33)
we obtain
(34)
where, after using the elementary inequality ,
Therefore, combining (24), (29), (31),
and (34), we obtain
(35)
We now calculate an upper bound on , which is defined in (25).
Let denote the derivative of with respect to .
Using integration by parts and Lemma 3, part (iv), together with the intermediate value theorem, we obtain
If , then [trudgian_2014, Theorem 1] gives the bound . And
if , then [trudgian_2011, Theorem 2.2] gives the bound .
So, using the simple bound (30) for as well as the bound
(36)
valid for , we obtain
(37)
where
Combining (26), (35), (37) yields
the lower bound in the proposition.
To derive the upper bound in the proposition, we bound in (21) from above.
Let us write where
is estimated by an analogous calculation to that used for .
The difference is that the formula (32) and the double inequality (33) are replaced with
the formula
So that the term in (35) may be replaced with zero.
Also, since we are looking for an upper bound, the signs in (35) should be replaced
with signs. Put together,
(38)
Next, we bound . After the change of variable we obtain
Using integration by parts together with the observation
and the facts that and non-decreasing, we obtain
(39)
Therefore, on substituting the bound on given in (36) and the bound
which follows from e.g. [trudgian_2014, Corollary 1], we obtain after a small calculation
where is defined as in the statement of the proposition.
The claimed upper bound then follows on combining this with (38).
Proof of parts (i)–(iii): Taking the partial derivative of with respect to , we find with the aid of a computer algebra system that
(40)
where is a degree monic polynomial in ,
(41)
satisfying .
Note that the sign of the partial derivative of with respect to
is the same as the sign of .
We have the formulas
(42)
(43)
where
(44)
as well as the formulas
(45)
(46)
Taking (42) as our “starting point” in some sense, and viewing it as a function of , we see that
has two local minima (and -axis intercepts) at and with a (positive) local maximum at . As increases, we see from (43) and (44) together with (46)
that the two local minima locations move away from .
In comparison, as follows from (43) and (45),
remains a local maximum of (and a global maximum on the -interval ),
albeit with a monotonically decreasing value of . The latter claim can be seen from the negativity of the partial derivative
(47)
We now consider three cases.
Case (1): Suppose on the -interval , so that, by
considering the sign of in (40), we find
is a local minimum of .
In evaluating this case, we note by the negativity of the partial derivative in (47),
if and are such that is a root of , then cannot
be a root for any greater (with the same ). Also, is a root if and only if
the constant term of the polynomial
in (41) is , hence if and only if
. Solving for and
recalling the discussion following (46), we therefore
find throughout if and only if
For such , has no local extrema in the interval
except at where it
has a local minimum. Thus, if , then is minimized at in this case.
Case (2): Suppose on the -interval ,
so that, by considering the sign of in (40), we find
is a local maximum of .
On the other hand, maintains a negative sign throughout if and only if
the local extremum of that occurs at is negative.
By direct calculation, we have
Setting and solving for , we find that throughout if and only if
For such , has no local extrema in the interval except
at where it
has a local maximum. Thus, if , then
is minimized at the boundary points and maximized at in this case.
Case (3): Suppose does not maintain its sign throughtout the -interval , so that
has at least one root at some . By the symmetry of about
as well as the discussion following (46), there can be at most two such roots, one
in the subinterval and another symmetric root in the subinterval .
By the work done thus far, such roots occur if and only if
When these roots occur, they each correspond to local maxima of as seen by considering the sign of in (40). Therefore in this case, is minimized either at the boundary (equivalently ), or at the center . To find the point of transition between these two situations, we set and find that the transition
occurs when
Summary: From the above 3 cases, we have the following behavior of the minimum of
for .
(a)
If , then is minimized at
(b)
If , then is minimized at (equivalently ).
This covers claims (i) and (ii) in the statement of the lemma. Claim (iii) in the lemma
follows from Case (2) above.
Proof of part (iv):
Writing as the partial derivative of with respect to and similarly.
We have
(48)
Since , , and , by (48) we have that for any such , and .
For the claims regarding the increasing nature of , we begin by evaluating at and find
From this, we see that is increasing if and only if
yielding the first of these claims.
Similarly, evaluating at gives us
(49)
The first term in (49) is positive only when and the second term only when .
So, since , is increasing when
10. Conclusions and future directions
We presented a method to verify the RH for zeta at large heights and to verify the RH for a general class of -functions at low heights. The method is simple to understand and implement and we demonstrated its efficacy on a variety of -functions using interval arithmetic. We also presented a significant improvement to the method in the case of zeta by incorporating explicit bounds on and integrals of .
In forthcoming work, we will develop and detail further generalizations of this verification method. These generalizations include, among other things, consideration of and when , further improvements in the case of zeta at large heights, the special case of Dirichlet -functions to real primitive characters, as well as the extending of the improvements in §6 to
a more general setting.
Acknowledgements
We are grateful to Georg-August-Universität Göttingen,
Niedersächsische Staats- und Universitätsbibliothek Göttingen, for helping us locate some of
Riemann’s unpublished notes and giving us permission to reproduce them in this paper.
Megan Kyi thanks the OH5-OSU SURE Undergraduate Research program at the Ohio State University, Columbus, for
their support.