The Series of Primes
1. Prime Numbers
A number \(p\) is said to be prime if
- \(p\) > 1
- \(p\) has no positive divisors except \(1\) and \(p\).
A number greater than 1 and not prime is called composite.
2. Theorem 1
Every positive integer, except 1, is a product of primes
2.1. Proof
Either \(n\) is prime, when there is nothing to prove, or \(n\) has divisors between 1 and n. If \(m\) is the least of these divisors, \(m\) is prime; for otherwise \[ \exists~l \ . \ 1 < l < m \ . \ l \mid m; \] and \[ l \mid m \rightarrow l \mid n, \] which contradicts the definition of \(m\).
Hence \(n\) is prime or divisible by a prime less than \(n\), say \(p_{1}\), in which case \[ n = p_{1}n_{1}, \quad 1 < n_{1} < n. \] Here either \(n_{1}\) is prime, in which case the proof is completed, or it is divisible by a prime \(p_{2}\) less than \(n_{1}\), in which case \[ n = p_{1}n_{1} = p_{1}p_{2}n_{2}, \quad 1 < n_{2} < n_{1} < n. \] Repeating the argument, we obtain a sequence of decreasing numbers \(n, n_{1}, \dots , n_{k-1}, \dots\), all greater than \(1\), for each of which the same alternative presents itself. Sooner or later we must accept the first alternative, that \(n_{k-1}\) is a prime, say \(p_{k}\), and then \[ n = p_{1}p_{2}\dots p_{k} \] thus \(666 = 2\cdot 3 \cdot 3 \cdot 37\).
If \(ab = n\), then \(a\) and \(b\) cannot both exceed \(\sqrt{n}\). Hence any composite n is divisible by a prime \(p\), which does not exceed \(\sqrt{n}\).
The primes in the permutation are not necessarily distinct, nor arranged in any particular order.
If we arrange them in increasing order, associate sets of equal primes into single factors, and change the notation appropriately, we obtain \[ n = p^{a_{1}}_{1}p^{a_{2}}_{2}\dots p^{a_{k}}_{k} \quad (a_{1}>0, a_{2}>0, \dots,\quad p_{1} < p_{2} < \dots). \] We then say that \(n\) is expressed in standard form.
3. Theorem 2
The standard form of \(n\) is unique; apart from rearrangement of factors, \(n\) can be expressed as a product of primes in one way only.
4. Theorem 3: (EUCLID’S FIRST THEOREM).
If p is prime, and \(p \mid ab\), then \(p \mid a\) or \(p \mid b\).
4.1. Deduction
If we take Theorem 3 for granted for the moment, we can deduce Theorem 2:
It is an obvious corollary of Theorem 3 that \[ p \mid abc \dots l \rightarrow p \mid a \ or \ p \mid b \ or \ p\mid c \dots or \ p\mid l, \] and in particular that, if \(a, b,\dots,l\) are primes, then \(p\) is one of \(a,b,c,\dots,l\).
Suppose now that \[ n = p^{a_{1}}_{1}p^{a_{2}}_{2}\dots p^{a_{k}}_{k} = q^{b_{1}}_{1}q^{b_{2}}_{2} \dots q^{b_{j}}_{j}, \] each product being a product of primes in standard form. Then \(p_{i}\mid q^{b_{1}}_{1}q^{b_{2}}_{2}\dots q^{b_{k}}_{k}\) for every \(i\), so that every \(p\) is a \(q\), and similarly every \(q\) is a \(p\).
Hence \(k=j\) and, since both sets are arranged in increasing order, \(p_{i} = q_{i}\) for every \(i\).
If \(a_{i} > b_{i}\), and we divide by \(p^{a_{i}}_{i}\), we obtain \[ p^{a_{1}}_{1}\dots p^{a_{i}-b_{i}} \dots p^{a_{k}}_{k} = p^{b_{1}}_{1} \dots p^{b_{i-1}}_{i-1} p^{b_{i+1}}_{i+1} \dots p^{b_{k}}_{k} \]. The left-hand side is divisible by \(p_{i}\) while the right-hand side is not; a contradict.
Similarity, \(b_{i}>a_{i}\) yields a contradict. It follows that \(a_{i}=b_{i}\), and this completes the proof of Theorem 2.
It will now be obvious why \(1\) should not be counted as a prime. If it were, Theorem 2 would be false, since we could insert any number of unit factors.
5. Theorem 4: (EUCLID’S SECOND THEOREM).
The number of primes is infinite.
5.1. First Proof (Euclid’s own proof)
Let \(2, 3, 5, \dots, p\) be the aggregate of primes up to \(p\), and let \[ q = 2 \cdot 3 \cdot 5 \cdots p + 1 \] Then \(q\) is not divisible by any of the numbers \(2, 3, 5, \dots, p\). It is therefore either prime, or divisible by a prime between \(p\) and \(q\). In either case there is a prime greater than \(p\), which proves the theorem.
The theorem is equivalent to \[ \pi(x) \rightarrow \infty \]
5.2. Deduction
If \(p\) is the \(n\)th prime \(p_n\), and \(q\) is defined as in (2.1.1), it is plain that \[ q < p_n^n + 1 \] for \(n > 1\), and so that \[ p_{n+1} < p_n^n + 1. \]
This inequality enables us to assign an upper limit to the rate of increase of \(p_n\), and a lower limit to that of \(\pi(x)\).
We can, however, obtain better limits as follows. Suppose that
\begin{equation} p_n < 2^{2^n} \tag{2.2.1} \end{equation}for \(n = 1, 2, \ldots, N\). Then Euclid’s argument shows that
\begin{equation} p_{N+1} \le p_1 p_2 \cdots p_N + 1 < 2^{2+4+\cdots+2^N} + 1 < 2^{2^{N+1}}. \tag{2.2.2} \end{equation}Since (2.2.1) is true for \(n = 1\), it is true for all \(n\).
Suppose now that \(n \ge 4\) and \[ e^{e^{n-1}} < x \le e^{e^n}. \] Then \[ e^{n-1} > 2^n, \qquad e^{e^{n-1}} > 2^{2^n}; \] and so \[ \pi(x) \ge \pi(e^{e^{n-1}}) \ge \pi(2^{2^n}) \ge n, \] by (2.2.1). Since \(\log\log x \le n\), we deduce that \[ \pi(x) \ge \log\log x \] for \(x > e^{e^3}\); and it is plain that the inequality holds also for \(2 \le x \le e^{e^3}\). We have therefore proved Theorem 10.
5.3. Second Proof (Pólya)
5.3.1. Theorem 16: No two Fermat’s numbers have a common divisor greater than 1.
Fermat’s numbers are defined by \[ F_{n} = 2^{2^{n}} + 1 \] For suppose that \(F_{n}\) and \(F_{n+k}\), where \(k>0\), are two Fermat’s numbers, and that \[ m\mid F_{n}, \qquad m\mid F_{n+k} \] If \(x=2^{2^{n}}\), we have \[ \frac{F_{n+k}-2}{F_{n}}=\frac{2^{2^{n+k}}-1}{2^{2^{n}}+1}=\frac{x^{2^{k}}-1}{x+1}=x^{2^{k}-1}-x^{2^{k}-2}+\dots -1 \] and so \(F_{n}\mid F_{n+k}-2\). Hence \[ m\mid F_{n+k}, \qquad m\mid F_{n+k}-2 \] and therefore \(m\mid 2\). Since \(F_{n}\) is odd, \(m=1\), which proves the theorem.
5.3.2. Second Proof depending on Fermat’s numbers
It follows that each of the numbers \(F_{1}, F_{2}, \dots, F_{n}\) is divisible by an odd prime number which does not divide any of the others; and therefore that there are at least \(n\) odd primes not exceeding \(F_{n}\). This proves Euclid’s theorem. Also \[ p_{n+1} \leq F_{n} = 2^{2^{n}}+1 , \] and it is plain that this inequality, which is a little stronger than (2.2.1), leads to a proof of Theorem 10.
5.4. Third Proof
Suppose that \(2, 3, \ldots, p_j\) are the first \(j\) primes and let \(N(x)\) be the number of \(n\) not exceeding \(x\) which are not divisible by any prime \(p > p_j\). If we express such an \(n\) in the form \[ n = n_1^2 m, \] where \(m\) is `quadratfrei’, i.e. is not divisible by the square of any prime, we have \[ m = 2^{b_1} 3^{b_2} \cdots p_j^{b_j}, \] with every \(b\) either \(0\) or \(1\). There are just \(2^j\) possible choices of the exponents and so not more than \(2^j\) different values of \(m\).
Again, \[ n_1 \le \sqrt{n} \le \sqrt{x}, \] and so there are not more than \(\sqrt{x}\) different values of \(n_1\).
Hence
\begin{equation} N(x) \le 2^j \sqrt{x}. \tag{2.6.1} \end{equation}If Theorem 4 is false, so that the number of primes is finite, let the primes be \(2, 3, \ldots, p_j\). In this case \(N(x) = x\) for every \(x\) and s \[ x \le 2^j \sqrt{x}, \qquad x \le 2^{2j}, \] which is false as we can always find a larger number \(x\) that is greater than \(2^{2j}\).
We can use this argument to prove two further results.
5.4.1. Theorem 19
The series \[ \sum \frac{1}{p} = \frac{1}{2} + \frac{1}{3}+\frac{1}{5}+ \frac{1}{7} + \cdots \] is divergent.
Proof:
If the series is convergent, we can choose \(j\) so that the remainder after \(j\) terms is less than \(\frac{1}{2}\), i.e., \[ \frac{1}{p_{j+1}}+ \frac{1}{p_{j+2}}+ \cdots < \frac{1}{2}. \]
The number of \(n \le x\) which are divisible by \(p\) is at most \(x/p\).
Hence \(x-N(x)\), the number of \(n \le x\) divisible by one or more of \(p_{j+1}, p_{j+2}, \dots\), is not more than \[ \frac{x}{p_{j+1}}+ \frac{x}{p_{j+2}}+ \cdots < \frac{1}{2} x. \] Hence, by (2.6.1), \[ \frac{1}{2} x < N(x) \le 2^{j}\sqrt{x}, \qquad x < 2^{2j+2}, \] which is false as we can always find a \(x\) not less than \(2^{2j+2}\). Hence the series diverges.
5.4.2. Theorem 20
\[ \pi(x) \ge \frac{\ln(x)}{2\ln(2)} \quad (x \ge 1); \qquad p_{n} \le 4^{n}. \]
Proof:
We take \(j=\pi(x)\), so that \(p_{j+1}>x\) and \(N(x)=x\). We have \[ x=N(x) \le 2^{\pi(x)}\sqrt{x}, \qquad 2^{\pi(x)} \ge \sqrt{x} \] and the first part of Theorem 20 follows on taking logarithms.
If we put \(x=p_{n}\), so that \(\pi(x)=n\), the second part is immediate.
6. Moduli of Integers
A modulus is a system \(S\) of numbers such that the sum and difference of any two members of \(S\) are themselves members of \(S\): i.e., \[ m \in S \; . \; n \in S \rightarrow (m \pm n) \in S \] The single number 0 forms a modulus (the null modulus).
It follows from the definition of \(S\) that: \[ a \in S \Rightarrow 0=a-a \in S \; . \; 2a = a+a \in S \] Repeating that argument, we see that \(na\in S\) for any integral \(n\) (positive or negative). More generally, \[ a \in S \; . \; b\in S \Rightarrow xa+yb \in S \] for any integral \(x,y\).
On the other hand, it is obvious that, if \(a\) and \(b\) are given, the aggregate of values of \(xa + yb\) forms a modulus.
It is plain that any modulus \(S\), except the null modulus, contains some positive numbers. Suppose that \(d\) is the smallest positive number of \(S\). If \(n\) is any positive number of \(S\), then \(n - zd \in S\) for all \(z\). If \(c\) is the remainder when \(n\) is divided by \(d\) and
\[ n = zd + c, \]
then \(c \in S\) and \(0 \le c < d\). Since \(d\) is the smallest positive number of \(S\), \(c = 0\) and \(n = zd\). Hence
6.1. Theorem 23
Any modulus, other than the null modulus, is the aggregate of integral multiples of a positive number \(d\).
We define the highest common divisor \(d\) of two integers \(a\) and \(b\), not both zero, as the largest positive integer which divides both \(a\) and \(b\); and write \[ d = (a,b). \] Thus \((0,a) = |a|\). We may define the highest common divisor \[ (a,b,c,\ldots,k) \] of any set of positive integers \(a, b, c, \ldots, k\) in the same way.
The aggregate of numbers of the form \[ xa + yb, \] for integral \(x, y\), is a modulus which, by Theorem 23, is the aggregate of multiples \(zc\) of a certain positive \(c\). Since \(c\) divides every number of \(S\), it divides \(a\) and \(b\), and therefore \[ c \le d. \]
On the other hand, \[ d \mid a,\quad d \mid b \;\Rightarrow\; d \mid xa + yb, \] so that \(d\) divides every number of \(S\), and in particular \(c\). It follows that \[ c = d \] and that \(S\) is the aggregate of multiples of \(d\). Thus we have proved:
6.2. Theorem 24
The modulus \(xa+yb\) is the aggregate of multiples of \(d=(a,b)\)
6.3. Theorem 25
The equation \[ ax+by=n \] /is soluble in integers \(x, y\) if and only if \(d \mid n\). In particular, \[ ax+by=d \] is soluble.
Proof:
\(ax+by=n\) is soluble in integers \(x, y\), which is equivalent to that \(n\) can be written to some form of \(ax+by\) where \(x, y \in \mathbb{Z}\). That is: \[ n \in \{ax+by : x, y \in \mathbb{Z}\} = S \] We have proved that \[ S = d\mathbb{Z}. \] Hence \[ n \in S \Longleftrightarrow n \in d\mathbb{Z} \Longleftrightarrow d\mid n \]
6.4. Theorem 26
Any common divisor of \(a\) and \(b\) divides \(d\).
Proof:
By Theorem 25, there must exist interal \(x,y\) such that \[ ax+by = d. \] Now if \(c\mid a\) and \(c\mid b\), then \[ c \mid ax, \qquad c\mid by, \] then \[ c \mid (ax+by) = d \] Thus any common divisor of \(a\) and \(b\) divides \(d\).
7. Theorem 10:
\[ \pi(x) \ge \log\log x \qquad (x \ge 2). \] we have thus found a lower limit for the order of magnitude of \(\pi(x)\).
8. Theorem 5
- There are blocks of consecutive composite numbers whose length exceeds any given number \(N\).
- There are infinitely many prime-pairs \((p, p+2)\)
- The numbers \(p, p+2, p+4\) cannot all be prime, since one of them must be divisible by 3.
9. Some questions concerning primes
9.1. how many primes are there less than a given number x?
Suppose that, as in usual, we define \[ \pi(x) \] to be the number of primes which do not exceed x, so that \(\pi(1) = 0, \pi(2)=1, \pi(20)=8\).
If \(p_{n}\) is the n-th prime then \[ \pi(p_{n}) = n, \] so that \(\pi(x)\), as function of \(x\), and \(p_{n}\), as function of \(n\), are inverse functions.
10. Some Notations
- \(f = O(\phi)\) means that \(\left| f\right| < A\phi\) where \(A\) is independent of \(n\) or \(x\), for all values of \(n\) or \(x\) in question.
- \(f=o(\phi)\) means that \(f/\phi \rightarrow 0\).
- \(f \sim \phi\) means that \(f/\phi \rightarrow 1\)
- \(f \prec \phi\) means \(f/\phi \rightarrow 0\), and is equivalent to \(f=o(\phi)\)
- \(f \succ \phi\) means \(f/\phi \rightarrow \infty\)
- \(f \asymp \phi\) means \(A\phi < f < A'\phi\),
We agree that `\(O(\phi)\)’ denotes an unspecified f such that \(f=O(\phi)\). We can then write, for example, \[ O(1)+O(1) = O(1) = o(x) \] when \(x \rightarrow \infty\). It is to be observed that the relation \(=\), asserted between \(O\) or \(o\) symbols, is not usually symmetrical. Thus \(o(1)=O(1)\) is always true, but \(O(1)=o(1)\) is usually false. If we have \(f \sim \phi\), then according to definition \[ \frac{f}{\phi} \rightarrow 1 \] holds, which means \[ \frac{f-\phi}{\phi} \rightarrow 0, \] then we get \[ f-\phi = o(\phi), \] it is equivalent to \[ f=\phi + o(\phi) \] It means that the difference between \(f\) and \(\phi\) is much less than \(\phi\) itself. Next we extract a \(\phi\) from it: \[ f = \phi(1+\frac{o(\phi)}{\phi}) \] and because of \(\frac{o(\phi)}{\phi}=o(1)\), so \[ f = \phi \cdot (1+o(1)) \] In these circumstances we say that \(f\) and \(\phi\) are asymptotically equivalent, or that \(f\) is asymptotic to \(\phi\).
11. Theorem 6 (THE PRIME NUMBER THEOREM)
The number of primes not exceeding \(x\) is asymptotic to \(x/\ln(x)\)
12. Theorem 7 (TCHEBYCHEF’S THEOREM)
The order of magnitude of \(\pi(x)\) is \(x/\ln(x)\): \[ \pi(x) \asymp \frac{x}{\ln(x)} \]
12.1. A remark
If \[ y = \frac{x}{\ln(x)} \] then \[ \ln(y) = \ln(x)-\ln(\ln(x)) \] and \(\ln(\ln(x))=o(\ln(x))\) so that \[ \ln(y) \sim \ln(x), \qquad x = y\ln(x) \sim y\ln(y) \] From this remark we infer that Theorem 6 is equivalent to
12.2. Theorem 8
\[ p_{n} \sim n\ln(n) \] Similarly, Theorem 7 is equivalent to to
12.3. Theorem 9
\[ p_{n} \asymp n\ln(n) \]