Prime Number Theorem And ...

10 Tertiary explicit estimates

10.1 The least common multiple sequence is not highly abundant for large \(n\)

10.1.1 Problem statement and notation

Definition 10.1.1
#

\(\sigma (n)\) is the sum of the divisors of \(n\).

Definition 10.1.2
#

A positive integer \(N\) is called highly abundant (HA) if

\[ \sigma (N) {\gt} \sigma (m) \]

for all positive integers \(m {\lt} N\), where \(\sigma (n)\) denotes the sum of the positive divisors of \(n\).

Informally, a highly abundant number has an unusually large sum of divisors.

Definition 10.1.3
#

For each integer \(n \ge 1\), define

\[ L_n := \mathrm{lcm}(1,2,\dots ,n). \]

We call \((L_n)_{n \ge 1}\) the least common multiple sequence.

Clearly \(L_n\) has a lot of divisors, and numerical experiments for small \(n\) suggests that \(L_n\) is often highly abundant. This leads to the following question:

Original MathOverflow question. Is it true that every value in the sequence \(L_n = \mathrm{lcm}(1,2,\dots ,n)\) is highly abundant? Equivalently,
\[ \{ L_n : n \ge 1\} \subseteq HA? \]

Somewhat surprisingly, the answer is no: not every \(L_n\) is highly abundant.

It has previously been verified in Lean that \(L_n\) is highly aboundant for \(n \leq 70\), \(81 \leq n \leq 96\), \(125 \leq n \leq 148\), \(169 \leq n \leq 172\), and not highly abondant for all other \(n ≤ 10^{10}\). The arguments here establish the non-highly-abundance of \(L_n\) for all \(n \geq 89683^2\) sufficiently large \(n\), thus completing the determination in Lean of all \(n\) for which \(L_n\) is highly abundant. This argument was taken from this MathOverflow answer.

10.1.2 A general criterion using three medium primes and three large primes

The key step in the proof is to show that, if one can find six primes \(p_1,p_2,p_3,q_1,q_2,q_3\) obeying a certain inequality, then one can find a competitor \(M {\lt} L_n\) to \(L_n\) with \(\sigma (M) {\gt} \sigma (L_n)\), which will demonstrate that \(L_n\) is not highly abundant. More precisely:

Definition 10.1.4
#

In the next few subsections we assume that \(n \geq 1\) and that \(p_1,p_2,p_3,q_1,q_2,q_3\) are primes satisfiying

\[ \sqrt{n} {\lt} p_1 {\lt} p_2 {\lt} p_3 {\lt} q_1 {\lt} q_2 {\lt} q_3 {\lt} n \]

and the key criterion

\begin{equation} \label{eq:main-ineq} \prod _{i=1}^3\Bigl(1+\frac{1}{q_i}\Bigr) \le \Biggl( \prod _{i=1}^3 \Bigl(1+\frac{1}{p_i(p_i+1)}\Bigr) \Biggr) \Bigl(1 + \frac{3}{8n}\Bigr) \Biggl(1 - \frac{4 p_1 p_2 p_3}{q_1 q_2 q_3}\Biggr). \end{equation}
1

NOTE: In the Lean formalization of this argument, we index the primes from 0 to 2 rather than from 1 to 3.

Lemma 10.1.1
#

We have \(4 p_1 p_2 p_3 {\lt} q_1 q_2 q_3\).

Proof

Obvious from the non-negativity of the left-hand side of 1.

10.1.3 Factorisation of \(L_n\) and construction of a competitor

Lemma 10.1.2 Factorisation of \(L_n\)

There exists a positive integer \(L'\) such that

\[ L_n = q_1 q_2 q_3 \, L' \]

and each prime \(q_i\) divides \(L_n\) exactly once and does not divide \(L'\).

Proof

Since \(q_i {\lt} n\), the prime \(q_i\) divides \(L_n\) exactly once (as \(q_i^2 {\gt} n\)). Hence we may write \(L_n = q_1 q_2 q_3 L'\) where \(L'\) is the quotient obtained by removing these prime factors. By construction, \(q_i \nmid L'\) for each \(i\).

Lemma 10.1.3 Normalised divisor sum for \(L_n\)

Let \(L'\) be as in Lemma 10.1.2. Then

\begin{equation} \label{eq:sigmaLn} \frac{\sigma (L_n)}{L_n} \; =\; \frac{\sigma (L')}{L'} \prod _{i=1}^3 \Bigl(1 + \frac{1}{q_i}\Bigr). \end{equation}
2

Proof

Use the multiplicativity of \(\sigma (\cdot )\) and the fact that each \(q_i\) occurs to the first power in \(L_n\). Then

\[ \sigma (L_n) = \sigma (L') \prod _{i=1}^3 \sigma (q_i) = \sigma (L') \prod _{i=1}^3 (1+q_i). \]

Dividing by \(L_n = L' \prod _{i=1}^3 q_i\) gives

\[ \frac{\sigma (L_n)}{L_n} = \frac{\sigma (L')}{L'} \prod _{i=1}^3 \frac{1+q_i}{q_i} = \frac{\sigma (L')}{L'} \prod _{i=1}^3 \Bigl(1 + \frac{1}{q_i}\Bigr). \]

There exist integers \(m \ge 0\) and \(r\) satisfying \(0 {\lt} r {\lt} 4 p_1 p_2 p_3\) and

\[ q_1 q_2 q_3 = 4 p_1 p_2 p_3 m + r \]
Proof

This is division with remainder.

Definition 10.1.5
#

With \(m,r\) as above, define the competitor

\[ M := 4 p_1 p_2 p_3 m L'. \]

With notation as above, we have:

  1. \(M {\lt} L_n\).

  2. \[ 1 {\lt} \frac{L_n}{M} = \Bigl(1 - \frac{r}{q_1 q_2 q_3}\Bigr)^{-1} {\lt} \Bigl(1 - \frac{4 p_1 p_2 p_3}{q_1 q_2 q_3}\Bigr)^{-1}. \]
Proof

The first item is by construction of the division algorithm. For the second, note that

\[ L_n = q_1 q_2 q_3 L' = (4 p_1 p_2 p_3 m + r) L' {\gt} 4 p_1 p_2 p_3 m L' = M, \]

since \(r{\gt}0\). For the third,

\[ \frac{L_n}{M} = \frac{4 p_1 p_2 p_3 m + r}{4 p_1 p_2 p_3 m} = 1 + \frac{r}{4 p_1 p_2 p_3 m} = \Bigl(1 - \frac{r}{4 p_1 p_2 p_3 m + r}\Bigr)^{-1} = \Bigl(1 - \frac{r}{q_1 q_2 q_3}\Bigr)^{-1}. \]

Since \(0 {\lt} r {\lt} 4p_1p_2p_3\) and \(4p_1p_2p_3 {\lt} q_1q_2q_3\), we have

\[ 0{\lt}\frac{r}{q_1 q_2 q_3}{\lt}\frac{4p_1p_2p_3}{q_1 q_2 q_3}, \]

so

\[ \Bigl(1-\frac{r}{q_1 q_2 q_3}\Bigr)^{-1} {\lt} \Bigl(1-\frac{4p_1p_2p_3}{q_1 q_2 q_3}\Bigr)^{-1}. \]

10.1.4 A sufficient condition

We give a sufficient condition for \(\sigma (M) \geq \sigma (L_n)\).

Suppose

\[ \frac{\sigma (M)}{M} \Bigl(1 - \frac{4 p_1 p_2 p_3}{q_1 q_2 q_3}\Bigr) \; \ge \; \frac{\sigma (L_n)}{L_n}. \]

Then \(\sigma (M) \ge \sigma (L_n)\), and so \(L_n\) is not highly abundant.

Proof

By Lemma 10.1.5,

\[ \frac{L_n}{M} {\lt} \Bigl(1 - \frac{4 p_1 p_2 p_3}{q_1 q_2 q_3}\Bigr)^{-1}. \]

Hence

\[ \frac{\sigma (M)}{M} \ge \frac{\sigma (L_n)}{L_n} \Bigl(1 - \frac{4 p_1 p_2 p_3}{q_1 q_2 q_3}\Bigr)^{-1} {\gt} \frac{\sigma (L_n)}{L_n} \cdot \frac{M}{L_n}. \]

Multiplying both sides by \(M\) gives

\[ \sigma (M) {\gt} \sigma (L_n) \cdot \frac{M}{L_n} \]

and hence

\[ \sigma (M) \ge \sigma (L_n), \]

since \(M/L_n{\lt}1\) and both sides are integers.

Combining Lemma 10.1.6 with Lemma 10.1.3, we see that it suffices to bound \(\sigma (M)/M\) from below in terms of \(\sigma (L')/L'\):

Lemma 10.1.7 Reduction to a lower bound for \(\sigma (M)/M\)

If

\begin{equation} \label{eq:sigmaM-lower} \frac{\sigma (M)}{M} \; \ge \; \frac{\sigma (L')}{L'} \Biggl( \prod _{i=1}^3 \Bigl(1+\frac{1}{p_i(p_i+1)}\Bigr) \Biggr) \Bigl(1 + \frac{3}{8n}\Bigr), \end{equation}
3

then \(L_n\) is not highly abundant.

Proof

Insert 3 and 2 into the desired inequality and compare with the assumed bound 1; this is a straightforward rearrangement.

10.1.5 Conclusion of the criterion

Lemma 10.1.8 Lower bound for \(\sigma (M)/M\)

With notation as above,

\[ \frac{\sigma (M)}{M} \ge \frac{\sigma (L')}{L'} \Biggl( \prod _{i=1}^3 \Bigl(1 + \frac{1}{p_i(p_i+1)}\Bigr) \Biggr) \Bigl(1 + \frac{3}{8n}\Bigr). \]
Proof

By multiplicativity, we have

\[ \frac{\sigma (M)}{M} = \frac{\sigma (L')}{L'} \prod _p \frac{1+p^{-1}+\dots +p^{-\nu _p(M)}}{1+p^{-1}+\dots +p^{-\nu _p(L')}}. \]

The contribution of \(p=p_i\) is

\[ \frac{(1+p_i^{-1}+p_i^{-2})}{1+p^{-1}_i} = 1 + \frac{1}{p_i(p_i+1)}. \]

The contribution of \(p=2\) is

\[ \frac{1+2^{-1}+\dots +2^{-k-2}}{1+2^{-1}+\dots +2^{-k}}, \]

where \(k\) is the largest integer such that \(2^k \le n\). A direct calculation yields

\[ \frac{(1+2^{-1}+\dots +2^{-k-2})}{1+2^{-1}+\dots +2^{-k}} = \frac{2^{k+3}-1}{2^{k+3}-4} = 1 + \frac{3}{2^{k+3}-4}, \]

Finally, since \(2^k \le n {\lt} 2^{k+1}\), we have \(2^{k+3} {\lt} 8n\), so

\[ \frac{3}{2^{k+3}-4} \ge \frac{3}{8n}, \]

So the contribution from the prime \(2\) is at least \(1 + 3/(8n)\).

Finally, the contribution of all other primes is at least \(1\).

We have thus completed the key step of demonstrating a sufficient criterion to establish that \(L_n\) is not highly abundant:

Let \(n \geq 1\). Suppose that primes \(p_1,p_2,p_3,q_1,q_2,q_3\) satisfy

\[ \sqrt{n} {\lt} p_1 {\lt} p_2 {\lt} p_3 {\lt} q_1 {\lt} q_2 {\lt} q_3 {\lt} n \]

and the key criterion 1. Then \(L_n\) is not highly abundant.

Proof

By Lemma 10.1.8, the condition 3 holds. By Lemma 10.1.7 this implies

\[ \frac{\sigma (M)}{M} \Bigl(1 - \frac{4 p_1 p_2 p_3}{q_1 q_2 q_3}\Bigr) \ge \frac{\sigma (L_n)}{L_n}. \]

Applying Lemma 10.1.6, we obtain \(\sigma (M) \ge \sigma (L_n)\) with \(M {\lt} L_n\), so \(L_n\) cannot be highly abundant.

Remark 10.1.1
#

Analogous arguments allow other pairs \((c,\alpha )\) in place of \((4,3/8)\), such as \((2,1/4)\), \((6,17/36)\), \((30,0.632\dots )\); or even \((1,0)\) provided more primes are used on the \(p\)-side than the \(q\)-side to restore an asymptotic advantage.

10.1.6 Choice of six primes \(p_i,q_i\) for large \(n\)

To finish the proof we need to locate six primes \(p_1,p_2,p_3,q_1,q_2,q_3\) obeying the required inequality. Here we will rely on the prime number theorem of Dusart [ 12 ] .

Lemma 10.1.9 Choice of medium primes \(p_i\)
#

Let \(n \ge X_0^2\). Set \(x := \sqrt{n}\). Then there exist primes \(p_1,p_2,p_3\) with

\[ p_i \le x \Bigl(1 + \frac{1}{\log ^3 x}\Bigr)^i \]

and \(p_1 {\lt} p_2 {\lt} p_3\). Moreover, \(\sqrt{n} {\lt} p_1\)

Proof

Apply Proposition 9.9.4 successively with \(x, x(1+1/\log ^3 x), x(1+1/\log ^3 x)^2\), keeping track of the resulting primes and bounds. For \(n\) large and \(x = \sqrt{n}\), we have \(\sqrt{n} {\lt} p_1\) as soon as the first interval lies strictly above \(\sqrt{n}\); this can be enforced by taking \(n\) large enough.

Lemma 10.1.10 Choice of large primes \(q_i\)
#

Let \(n \ge X_0^2\). Then there exist primes \(q_1 {\lt} q_2 {\lt} q_3\) with

\[ q_{4-i} \ge n \Bigl(1 + \frac{1}{\log ^3 \sqrt{n}}\Bigr)^{-i} \]

for \(i = 1,2,3\), and \(q_1 {\lt} q_2 {\lt} q_3 {\lt} n\).

Proof

Apply Theorem ?? with suitable values of \(x\) slightly below \(n\), e.g. \(x = n(1+1/\log ^3\sqrt{n})^{-i}\), again keeping track of the intervals. For \(n\) large enough, these intervals lie in \((\sqrt{n},n)\) and contain primes \(q_i\) with the desired ordering.

10.1.7 Bounding the factors in 1

Lemma 10.1.11 Bounds for the \(q_i\)-product
#

With \(p_i,q_i\) as in Lemmas 10.1.9 and 10.1.10, we have

\begin{equation} \label{eq:qi-upper} \prod _{i=1}^3 \Bigl(1 + \frac{1}{q_i}\Bigr) \le \prod _{i=1}^3 \Bigl(1 + \frac{\bigl(1 + \frac{1}{\log ^3 \sqrt{n}}\bigr)^i}{n} \Bigr). \end{equation}
4

Proof

By Lemma 10.1.10, each \(q_i\) is at least

\[ n\Bigl(1 + \frac{1}{\log ^3 \sqrt{n}}\Bigr)^{-j} \]

for suitable indices \(j\), so \(1/q_i\) is bounded above by

\[ \frac{\bigl(1 + \frac{1}{\log ^3 \sqrt{n}}\bigr)^i}{n} \]

after reindexing. Multiplying the three inequalities gives 4.

Lemma 10.1.12 Bounds for the \(p_i\)-product
#

With \(p_i\) as in Lemma 10.1.9, we have for large \(n\)

\begin{equation} \label{eq:pi-lower} \prod _{i=1}^3 \Bigl(1 + \frac{1}{p_i(p_i+1)}\Bigr) \ge \prod _{i=1}^3 \Bigl( 1 + \frac{1}{\bigl(1 + \frac{1}{\log ^3 \sqrt{n}}\bigr)^{2i} (n + \sqrt{n})} \Bigr). \end{equation}
5

Proof

By Lemma 10.1.9, \(p_i \le \sqrt{n} (1+1/\log ^3\sqrt{n})^i\). Hence

\[ p_i^2 \le n\bigl(1 + \tfrac {1}{\log ^3 \sqrt{n}}\bigr)^{2i} \quad \text{and}\quad p_i+1 \le p_i(1 + 1/\sqrt{n}) \le (1+1/\sqrt{n}) p_i. \]

From these one gets

\[ p_i(p_i+1) \le \bigl(1 + \tfrac {1}{\log ^3 \sqrt{n}}\bigr)^{2i} (n + \sqrt{n}), \]

and hence

\[ \frac{1}{p_i(p_i+1)} \ge \frac{1}{\bigl(1 + \tfrac {1}{\log ^3 \sqrt{n}}\bigr)^{2i} (n + \sqrt{n})}. \]

Taking \(1+\cdot \) and multiplying over \(i=1,2,3\) gives 5.

Lemma 10.1.13 Lower bound for the product ratio \(p_i/q_i\)
# Discussion

With \(p_i,q_i\) as in Lemmas 10.1.9 and 10.1.10, we have

\begin{equation} \label{eq:pq-ratio} 1 - \frac{4 p_1 p_2 p_3}{q_1 q_2 q_3} \ge 1 - \frac{4 \bigl(1 + \frac{1}{\log ^3 \sqrt{n}}\bigr)^{12}}{n^{3/2}}. \end{equation}
6

Proof

We have \(p_i \le \sqrt{n} (1+1/\log ^3 \sqrt{n})^i\), so

\[ p_1 p_2 p_3 \le n^{3/2} \bigl(1 + \tfrac {1}{\log ^3 \sqrt{n}}\bigr)^{6}. \]

Similarly, \(q_i \ge n(1+1/\log ^3 \sqrt{n})^{-i}\), so

\[ q_1 q_2 q_3 \ge n^3 \bigl(1 + \tfrac {1}{\log ^3 \sqrt{n}}\bigr)^{-6}. \]

Combining,

\[ \frac{p_1 p_2 p_3}{q_1 q_2 q_3} \le \frac{n^{3/2} \bigl(1 + \frac{1}{\log ^3 \sqrt{n}}\bigr)^{6}}{n^3 \bigl(1 + \frac{1}{\log ^3\sqrt{n}}\bigr)^{-6}} = \frac{\bigl(1 + \frac{1}{\log ^3 \sqrt{n}}\bigr)^{12}}{n^{3/2}}. \]

This implies 6.

10.1.8 Reduction to a small epsilon-inequality

Lemma 10.1.14 Uniform bounds for large \(n\)

For all \(n \ge X_0^2 = 89693^2\) we have

\[ \frac{1}{\log ^3 \sqrt{n}} \le 0.000675, \qquad \frac{1}{n^{3/2}} \le \frac{1}{89693}\cdot \frac{1}{n}. \]

and

\[ \frac{1}{n+\sqrt{n}} \ge \frac{1}{1 + 1/89693}\cdot \frac{1}{n}. \]
Proof

This is a straightforward calculus and monotonicity check: the left-hand sides are decreasing in \(n\) for \(n \ge X_0^2\), and equality (or the claimed upper bound) holds at \(n=X_0^2\). One can verify numerically or symbolically.

Lemma 10.1.15 Polynomial approximation of the inequality
#

For \(0 \le \varepsilon \le 1/89693^2\), we have

\[ \prod _{i=1}^3 (1 + 1.000675^i \varepsilon ) \le \Bigl(1 + 3.01\varepsilon + 3.01\varepsilon ^2 + 1.01\varepsilon ^3\Bigr), \]

and

\[ \prod _{i=1}^3 \Bigl(1 + \frac{\varepsilon }{1.000675^{2i}}\frac{1}{1 + 1/89693}\Bigr) \Bigl(1 + \frac{3}{8}\varepsilon \Bigr) \Bigl(1 - \frac{4 \times 1.000675^{12}}{89693}\varepsilon \Bigr) \ge 1 + 3.36683\varepsilon - 0.01\varepsilon ^2. \]
Proof

Expand each finite product as a polynomial in \(\varepsilon \), estimate the coefficients using the bounds in Lemma 10.1.14, and bound the tails by simple inequalities such as

\[ (1+C\varepsilon )^k \le 1 + k C\varepsilon + \dots \]

for small \(\varepsilon \). All coefficients can be bounded numerically in a rigorous way; this step is a finite computation that can be checked mechanically.

Lemma 10.1.16 Final polynomial comparison
#

For \(0 \le \varepsilon \le 1/89693^2\), we have

\[ 1 + 3.01\varepsilon + 3.01\varepsilon ^2 + 1.01\varepsilon ^3 \le 1 + 3.36683\varepsilon - 0.01\varepsilon ^2. \]
Proof

This is equivalent to

\[ 3.01\varepsilon + 3.01\varepsilon ^2 + 1.01\varepsilon ^3 \le 3.36683\varepsilon - 0.01\varepsilon ^2, \]

or

\[ 0 \le (3.36683 - 3.01)\varepsilon - (3.01+0.01)\varepsilon ^2 - 1.01\varepsilon ^3. \]

Factor out \(\varepsilon \) and use that \(0{\lt}\varepsilon \le 1/89693^2\) to check that the resulting quadratic in \(\varepsilon \) is nonnegative on this interval. Again, this is a finite computation that can be verified mechanically.

Proposition 10.1.1 Verification of 1 for large \(n\)

For every integer \(n \ge X_0^2 = 89693^2 \approx 8.04\times 10^9\), the primes \(p_i,q_i\) constructed in Lemmas 10.1.9 and 10.1.10 satisfy the inequality 1.

Proof

Combine Lemma 10.1.11, Lemma 10.1.12, and Lemma 10.1.13. Then apply Lemma 10.1.14 and set \(\varepsilon = 1/n\). The products are bounded by the expressions in Lemma 10.1.15, and the inequality in Lemma 10.1.16 then ensures that 1 holds.

10.1.9 Conclusion for large \(n\)

Theorem 10.1.2 Non-highly abundant for large \(n\)

For every integer \(n \ge 89693^2\), the integer \(L_n\) is not highly abundant.

Proof

By Proposition 10.1.1, there exist primes \(p_1,p_2,p_3,q_1,q_2,q_3\) satisfying the hypotheses of Theorem 10.1.1. Hence \(L_n\) is not highly abundant.

10.2 Erdos problem 392

The proof here is adapted from https://www.erdosproblems.com/forum/thread/392#post-2696 which in turn is inspired by the arguments in https://arxiv.org/abs/2503.20170.

Definition 10.2.1
#

We work with (approximate) factorizations \(a_1 \dots a_t\) of a factorial \(n!\).

Definition 10.2.2
#

The waste of a factorizations \(a_1 \dots a_t\) is defined as \(\sum _i \log (n / a_i)\).

The balance of a factorization \(a_1 \dots a_t\) at a prime \(p\) is defined as the number of times \(p\) divides \(a_1 \dots a_t\), minus the number of times \(p\) divides \(n!\).

If a factorization has zero total imbalance, then it exactly factors \(n!\).

Proof

The waste of a factorization is equal to \(t \log n - \log n!\), where \(t\) is the number of elements.

Proof
Definition 10.2.4

The score of a factorization (relative to a cutoff parameter \(L\)) is equal to its waste, plus \(\log p\) for every surplus prime \(p\), \(\log (n/p)\) for every deficit prime above \(L\), \(\log L\) for every deficit prime below \(L\) and an additional \(\log n\) if one is not in total balance.

If one is in total balance, then the score is equal to the waste.

Proof

If there is a prime \(p\) in surplus, one can remove it without increasing the score.

Proof

Locate a factor \(a_i\) that contains the surplus prime \(p\), then replace \(a_i\) with \(a_i/p\).

If there is a prime \(p\) in deficit larger than \(L\), one can remove it without increasing the score.

Proof

Add an additional factor of \(p\) to the factorization.

If there is a prime \(p\) in deficit less than \(L\), one can remove it without increasing the score.

Proof

Without loss of generality we may assume that one is not in the previous two situations, i.e., wlog there are no surplus primes and all primes in deficit are at most \(L\). If all deficit primes multiply to \(n\) or less, add that product to the factorization (this increases the waste by at most \(\log n\), but we save a \(\log n\) from now being in balance). Otherwise, greedily multiply all primes together while staying below \(n\) until one cannot do so any further; add this product to the factorization, increasing the waste by at most \(\log L\).

One can bring any factorization into balance without increasing the score.

Proof

Apply strong induction on the total imbalance of the factorization and use the previous three sublemmas.

Starting from any factorization \(f\), one can find a factorization \(f'\) in balance whose cardinality is at most \(\log n!\) plus the score of \(f\), divided by \(\log n\).

Proof

Combine Lemma 10.2.4, Lemma 10.2.3, and Lemma 10.2.2.

Definition 10.2.5
#

Now let \(M,L\) be additional parameters with \(n {\gt} L^2\); we also need the minor variant \(\lfloor n/L \rfloor {\gt} \sqrt{n}\).

Definition 10.2.6

We perform an initial factorization by taking the natural numbers between \(n-n/M\) (inclusive) and \(n\) (exclusive) repeated \(M\) times, deleting those elements that are not \(n/L\)-smooth (i.e., have a prime factor greater than or equal to \(n/L\)).

Sublemma 10.2.4

The number of elements in this initial factorization is at most \(n\).

Proof

The total waste in this initial factorization is at most \(n \log \frac{1}{1-1/M}\).

Proof

A large prime \(p \geq n/L\) cannot be in surplus.

Proof

No such prime can be present in the factorization.

A large prime \(p \geq n/L\) can be in deficit by at most \(n/p\).

Proof

This is the number of times \(p\) can divide \(n!\).

A medium prime \(\sqrt{n} {\lt} p ≤ n/L\) can be in surplus by at most \(M\).

Proof

Routine computation using Legendre’s formula.

A medium prime \(\sqrt{n} {\lt} p ≤ n/L\) can be in deficit by at most \(M\).

Proof

The number of times \(p\) divides \(a_1 \dots a_t\) is at least \(M \lfloor n/Mp \rfloor ≥ n/p - M\) (note that the removal of the non-smooth numbers does not remove any multiples of \(p\)). Meanwhile, the number of times \(p\) divides \(n!\) is at most \(n/p\).

A small prime \(p \leq \sqrt{n}\) can be in surplus by at most \(M\log n\).

Proof

Routine computation using Legendre’s formula, noting that at most \(\log n / \log 2\) powers of \(p\) divide any given number up to \(n\).

A small prime \(L {\lt} p \leq \sqrt{n}\) can be in deficit by at most \(M\log n\).

Proof

Routine computation using Legendre’s formula, noting that at most \(\log n / \log 2\) powers of \(p\) divide any given number up to \(n\).

A tiny prime \(p \leq L\) can be in deficit by at most \(M\log n + ML\pi (n)\).

Proof

In addition to the Legendre calculations, one potentially removes factors of the form \(plq\) with \(l \leq L\) and \(q \leq n\) a prime up to \(M\) times each, with at most \(L\) copies of \(p\) removed at each factor.

The initial score is bounded by

\[ n \log (1-1/M)^{-1} + \sum _{p \leq n/L} M \log n + \sum _{p \leq \sqrt{n}} M \log ^2 n / \log 2 + \sum _{n/L {\lt} p \leq n} \frac{n}{p} \log \frac{n}{p} + \sum _{p \leq L} (M \log n + M L \pi (n)) \log L. \]
Proof

Combine Lemma 10.2.5, Sublemma 10.2.5, Sublemma 10.2.6, Sublemma 10.2.7, Sublemma 10.2.8, Sublemma 10.2.9, Sublemma 10.2.10, and Sublemma 10.2.11, and combine \(\log p\) and \(\log (n/p)\) to \(\log n\).

Sublemma 10.2.12

If \(M\) is sufficiently large depending on \(\varepsilon \), then \(n \log (1-1/M)^{-1} \leq \varepsilon n\).

Proof

Use the fact that \(\log (1-1/M)^{-1}\) goes to zero as \(M \to \infty \).

Sublemma 10.2.13

If \(L\) is sufficiently large depending on \(M, \varepsilon \), and \(n\) sufficiently large depending on \(L\), then \(\sum _{p \leq n/L} M \log n \leq \varepsilon n\).

Proof

Use the prime number theorem (or the Chebyshev bound).

Sublemma 10.2.14

If \(n\) sufficiently large depending on \(M, \varepsilon \), then \(\sum _{p \leq \sqrt{n}} M \log ^2 n / \log 2 \leq \varepsilon n\).

Proof

Crude estimation.

Lemma 10.2.6
#
\[ \pi (n) = o(n) \quad \text{as } n \to \infty . \]
Proof

Given \(\varepsilon {\gt} 0\), choose \(a \neq 0\) with \(\varphi (a)/a {\lt} \varepsilon /2\) (using \(\prod _{p \leq n}(1 - 1/p) \to 0\)). For \(n \geq a + 2\),

\[ \pi (n) \leq \frac{\varphi (a)}{a} \cdot n + \varphi (a) + \pi (a+1) + 1. \]

Since \(\varphi (a)/a {\lt} \varepsilon /2\), for \(n\) large enough the constant terms are absorbed, giving \(\pi (n) {\lt} \varepsilon n\).

Sublemma 10.2.15

If \(n\) sufficiently large depending on \(L, \varepsilon \), then \(\sum _{n/L {\lt} p \leq n} \frac{n}{p} \log \frac{n}{p} \leq \varepsilon n\).

Proof

Bound \(\frac{n}{p}\) by \(L\) and use the prime number theorem (or the Chebyshev bound).

Sublemma 10.2.16
#

For all \(n \geq 2\), one has

\[ \pi (n) \leq \sqrt{n} + \frac{2n \log 4}{\log n}. \]
Proof

By Chebyshev’s bound, \(\prod _{p \leq n} p \leq 4^n\), so \(\sum _{p \leq n} \log p \leq n \log 4\). The number of primes \(p \leq \sqrt{n}\) is trivially at most \(\sqrt{n}\). For primes \(p {\gt} \sqrt{n}\), we have \(\log p {\gt} \frac{1}{2} \log n\), hence

\[ \bigl(\pi (n) - \pi (\sqrt{n})\bigr) \cdot \tfrac {1}{2} \log n {\lt} \sum _{\sqrt{n} {\lt} p \leq n} \log p \leq n \log 4, \]

giving \(\pi (n) - \pi (\sqrt{n}) {\lt} \frac{2n \log 4}{\log n}\). Adding \(\pi (\sqrt{n}) \leq \sqrt{n}\) yields the result.

Sublemma 10.2.17

If \(n\) sufficiently large depending on \(M, L, \varepsilon \), then \(\sum _{p \leq L} (M \log n + M L \pi (n)) \log L \leq \varepsilon n\).

Proof

Use the prime number theorem (or the Chebyshev bound).

The score of the initial factorization can be taken to be \(o(n)\).

Proof

Pick \(M\) large depending on \(\varepsilon \), then \(L\) sufficiently large depending on \(M, \varepsilon \), then \(n\) sufficiently large depending on \(M,L,\varepsilon \), so that the bounds in Sublemma 10.2.12, Sublemma 10.2.13, Sublemma 10.2.14, Sublemma 10.2.15, and Sublemma 10.2.17 each contribute at most \((\varepsilon /5) n\). Then use Proposition 10.2.2.

One can find a balanced factorization of \(n!\) with cardinality at most \(n - n / \log n + o(n / \log n)\).-

Proof

Combine Proposition 10.2.3 with Proposition 10.2.1 and the Stirling approximation.

Theorem 10.2.2
# Discussion

One can factorize \(n!\) into at most \(n/2 - n / 2\log n + o(n / \log n)\) numbers of size at most \(n^2\).-

Proof

Group the factorization arising in Theorem 10.2.1 into pairs, using Lemma 10.2.1.