For $x \gt 0$, let $\pi(x)$ denote the number of primes $p \leq x$ (in these notes, $p$ is always used to denote prime numbers). The celebrated Prime Number Theorem is the asymptotic statement
where we recall that $f(x) \sim g(x)$ means $\underset{x \to \infty}{\lim} \frac{f(x)}{g(x)} = 1$.
This was first proven in 1896 independently by Hadamard and de la Vallée-Poussin, first by proving a result on the zeroes of the Riemann zeta function $\zeta(s)$ (that none occur on the line $Re(s) = 1$), which can be proven without too much difficulty. They then deduced the prime number theorem from this result, by a fairly intricate analysis.
Over the decades this analysis was whittled down more and more until it reached a fairly definitive form, in the form of a succinct “Tauberian theorem” due to D.J. Newman (1980). A streamlined account was written up in the American Mathematical Monthly by Don Zagier (1997), who managed to produce a proof virtually from scratch in a little less than three pages (it requires just a bit of background in complex analysis, not much more than Cauchy’s theorem).
As in his famous one-sentence proof that primes of the form $4n+1$ are sums of two squares, Zagier seemed to be aiming explicitly for maximal compression while still keeping the argument (arguably) self-contained. It’s an impressive performance, alright, but its terseness will probably force most readers to follow up with pen and paper on the side to verify his arguments in detail. In this article we will mostly follow Zagier’s line, but with some slight expansions and glosses so that pen and paper is (hopefully) unnecessary.
In this section we prove a result essentially due to Chebyshev which puts us already in the neighborhood of the prime number theorem. Introduce a function
By the following result, our job is done once we establish the relation $\theta(x) \sim x$.
The asymptotic statement $\theta(x) \sim x$ implies the prime number theorem.
In one direction, we have an obvious inequality
Assuming that $\theta(x) \sim x$, this implies that given $\epsilon \gt 0$, the inequality
holds for all sufficiently large $x$.
In the other direction, for any $\epsilon \gt 0$ we have
From the last line and (1) (which implies $\frac{\pi(x)\log(x)}{x}$ is bounded below by a positive constant), we obtain
for sufficiently large $x$. Thus for any small $\epsilon \gt 0$ (say less than $1/3$) and the assumption $\theta(x) \sim x$ we have
or simply
for all sufficiently large $x$.
Proving the asymptotic result $\theta(x) \sim x$ will involve complex analysis applied to the Riemann zeta function. An easier preliminary result gives information on the order of $\theta(x)$; it will later be used as a technical hypothesis in the Tauberian theorem with which we conclude. The order result is essentially due to Chebyshev, who actually established sharper estimates (which we won’t need). In fact, he also showed that if $\frac{\pi(x)\log(x)}{x}$ has a limit, then that limit must be $1$.
$\theta(x) = \mathrm{O}(x)$.
First let us observe that
so that
If $n \leq x \lt n+1$, then $\theta(x) = \theta(n)$ and $\theta(2x) \leq \theta(2n) + \log(2x)$, so that
or simply
Taking a telescopic sum, we have
where $n$ is chosen so that $1 \leq x/2^n \lt 2$ (so $n = floor(\log_2(x))$). The first term on the right is bounded above by $\log(2) x$. The second term is bounded above by $\log_2(x) \cdot \log(x) = K\log(x)^2$ (where $K = \log_2(e)$), which considered asymptotically is small compared to $x$. Thus, for any $C \gt \log(2)$, we have
for all sufficiently large $x$, which completes the proof.
Let
define the usual Riemann zeta function on the region $Re(s) \gt 1$. First we establish a meromorphic continuation of $\zeta(s)$ to the region $Re(s) \gt 0$, or in fact a holomorphic continuation of $\zeta(s) - \frac1{s-1}$ to this region.
The function $\Psi$ defined by the expression
converges absolutely for $Re(s) \gt 0$, giving a holomorphic function in that region, and
over the region $Re(s) \gt 1$.
Let us bound the summands in the definition of $\Psi(s)$:
Since $\sum_{n=1}^\infty \frac1{n^{Re(s) + 1}}$ converges over the region $Re(s) \gt 0$, it follows that the series for $\Phi(s)$ converges absolutely, and $\Psi(s)$ is a holomorphic function, in that region.
Over the region $Re(s) \gt 1$, we have
as asserted.
Now we relate $\theta(x)$ to $\zeta(s)$ by introducing an auxiliary function $\Phi$ that is connected with the logarithmic derivative of $\zeta(s)$. For $Re(s) \gt 1$ put
Over $Re(s) \gt 1$, we calculate
which upon differentiating yields
with the last sum converging over the region $Re(s) \gt \frac1{2}$. From this and the analytic continuation of $\zeta(s)$ (Proposition ), we see that $\Phi(s)$ admits a meromorphic continuation over $Re(s) \gt \frac1{2}$, and holomorphic in that region except possibly where $\zeta(s)$ has zeroes, or at the simple pole $s = 1$.
$\Phi(s) - \frac1{s-1}$ is holomorphic over the region $Re(s) \geq 1$, i.e., $\zeta(s)$ has no zeroes over $Re(s) \geq 1$.
The case $Re(s) \gt 1$ is easily handled by the product formula (2).
For a fixed $t \gt 0$, let $\mu$ be the order of the zero of $\zeta(1 + i t)$ (if $\zeta(1 + i t) \neq 0$ then of course $\mu = 0$); it is the same as the order of the zero at $s = 1 - i t$. Let $\nu$ be the order of the zero at $1 \pm 2i t$. We then have
$\underset{\epsilon \searrow 0}{\lim} \epsilon \cdot \Phi(1 + \epsilon) = 1$ (since $\zeta(s)$ has a pole of order $1$ at $s = 1$), and similarly
$\underset{\epsilon \searrow 0}{\lim} \epsilon \cdot \Phi(1 + \epsilon \pm i t) = -\mu$,
$\underset{\epsilon \searrow 0}{\lim} \epsilon \cdot \Phi(1 + \epsilon \pm 2i t) = -\nu$.
Observe that for $\epsilon \gt 0$,
from which, in conjunction with 1., 2., 3. above, we deduce $0 \leq 6 - 8\mu - 2\nu$, so that $\mu = 0$.
The function $\theta(x)$ is related to the function $\Phi(s)$ by way of a Riemann-Stieltjes integral:
for $Re(s) \gt 1$. An integration by parts yields
where $\underset{M \to \infty}{\lim} \frac{\theta(M)}{M^s} = 0$ by Lemma . Thus
The asymptotic result $\theta(x) \sim x$ will be established by appeal to the following result:
Suppose that the integral
converges. Then $\theta(x) \sim x$.
Suppose that for some $\lambda \gt 1$ there exist arbitrarily large $x$ such that $\lambda x \leq \theta(x)$. The monotonicity of $\theta$ implies the first inequality that follows:
This is a contradiction, because convergence of $\int_1^\infty \; \frac{\theta(t) - t}{t^2}\; d t$ implies that $\int_x^{\lambda x} \; \frac{\theta(t) - t}{t^2}\; d t$ can be made arbitrarily small by taking $x$ large.
Similarly, if we suppose that for some $\lambda \lt 1$ there exist arbitrarily large $x$ such that $\theta(x) \leq \lambda x$, then
which again leads to a contradiction.
Thus we focus on convergence of
For $Re(s) \gt 1$ we have, from (3),
or what is the same, for the region $Re(s) \gt 0$,
where the left side is in fact holomorphic over $Re(s) \geq 0$, hence takes a finite value at $s = 0$, say $L$.
The Tauberian theorem we will establish permits the conclusion
which in turn will yield the Prime Number Theorem:
The integral $\int_0^\infty \; \frac{\theta(e^t) - e^t}{e^t} \; d t$ converges.
(The prime number theorem follows from this by (4), Theorem , and Lemma .)
Here is the statement of the particular Tauberian theorem used (recalling that “Tauberian” signifies a family of theorems that roughly speaking give conditions for convergence of series or integral expressions occurring at natural boundaries of regions where series or integral representations are defined). This particular form was given by D.J. Newman, and allows a dramatic shortening of the proof of the prime number theorem, as shown convincingly by Zagier, who as we said compressed it down to under three pages.
Let $f(t)$ be a function that is bounded and locally integrable over the domain $t \geq 0$, and suppose that its Laplace transform, defined for $Re(s) \gt 0$ by the equation
extends to a function $g$ that is holomorphic over the region $Re(s) \geq 0$. Then the integral $\int_0^\infty\; f(t)\; d t$ converges and equals $g(0)$.
The proof below involves some fairly refined complex analysis. It may be well to recall that we say a function $g$ is holomorphic at a point $z_0$ if $g$ is defined over a small neighborhood $\{z: {|z-z_0|} \lt \delta\}$ of the point and in that neighborhood is expressible as a convergent Taylor series
Then we say $g$ is holomorphic over a (not necessarily open) region if it is holomorphic at every point in the region.
In the case of interest $f(t) = \frac{\theta(e^t) - e^t}{e^t}$, the order estimate Lemma guarantees that $f$ is bounded and locally integrable, and we have already observed that its Laplace transform, which is
by (5), is holomorphic in the region $Re(s) \geq 0$ by Proposition . Thus all the hypotheses of the theorem are satisfied in this case, and its conclusion in this case is Theorem . So all that remains is to prove Theorem .
For $T \geq 0$, put
Clearly $g_T$ is an entire (i.e., everywhere holomorphic) function. We are trying to show that $\underset{T \to \infty}{\lim} g_T(0) = g(0)$.
A straightforward application of Cauchy’s theorem yields
for contours $C$ having $0$ as an interior point, such that $g_T - g$ is holomorphic on $C$ and its interior. We will choose $C$ to be the boundary of the closed region $D = \{z: {|z|} \leq R\; and\; Re(z) \geq -\delta\}$ where for a given $R$, we choose $\delta$ small enough that $g_T - g$ is holomorphic in $D$ (this uses compactness of the interval $I = \{i t: -R \leq t \leq R\}$ and holomorphicity of $g_T - g$ along $I$).
The idea would be to estimate the contribution of the integral over $C_+$, the part of $C$ where the real part is nonnegative, and over $C_-$ where it is nonpositive, hoping to show these contributions are small when $R$ is large. This almost works, except that we don’t have good control over the size of the contributions near the area where $Re(z) = 0$. To offset this, a mollifying factor $(1 + \frac{z^2}{R^2})$ is introduced in the integrand (“Carleman’s formula”): this is close to zero near $Re(z) = 0$ and has the desired mitigating effect.
Thus we instead use
Suppose now that ${|f(t)|}$ is globally bounded by $B$. For $Re(z) \gt 0$ we have
and we also have, over the circle ${|z|} = R$,
so that putting the factors together, the integral over $C_+$ is bounded in absolute value by
(and of course this hasn’t taken into account the factor $\frac1{2\pi i}$ outside the integral). In any case, with the Carleman factor, the contribution from the $C_+$ part is $B/R$, which is small if $R$ is large.
Now we bound the contribution from $C_-$ by considering $g_T$ and $g$ separately. First let us bound
For this, we can take advantage of the fact that $g_T$ is entire, and (courtesy of Cauchy’s theorem) replace $C_-$ by the semicircle where ${|z|} = R, Re(z) \lt 0$. The estimates are similar in form to those above, starting with
Bear in mind here that we are now considering $Re(z) \lt 0$ and $-Re(z) = {|Re(z)|}$; the bound produced tends to be very large. But, large as it may be, it is offset by the bound on the product of the remaining factors in the integrand, which is $e^{Re(z)T} \cdot {|2Re(z)|}/R^2$. We conclude as before that over $C_-$ the overall contribution from $g_T$ has upper bound $B/R$.
Finally, we bound
Holding $R$ fixed, the expression $g(z)\cdot \left(1 + \frac{z^2}{R^2}\right) \cdot \frac1{z}$ taken over $C_-$ is bounded in absolute value by a number $K$. Whereas along $Re(z) = -\delta$, by taking $T$ very large we can make ${|e^{z T}|}$ as small as we like, so that considering $g$ separately, the contribution over $C_-$ can be discarded as vanishingly small (in other words, first pick $R$, then pick $\delta$, and finally pick $T$, to get this contribution to be as small as we like). This completes the proof.
Donald J. Newman, Simple analytic proof of the prime number theorem, Amer. Math. Monthly 87 (1980), 693-696.
Don Zagier, Newman’s short proof of the prime number theorem, Amer. Math. Monthly 104 (1997), 705-708. (pdf)
Don Zagier, A one-sentence proof that every prime $p \equiv 1 (\mod 4)$ is a sum of two squares, Amer. Math. Monthly, Vol. 97 No. 2 (1990), p. 144. (link)
Last revised on June 15, 2019 at 15:53:29. See the history of this page for a list of all contributions to it.