(7975.89.92) Proofs of the infinitude of primes
#proof-theory #philosophy-of-math
Theorem 1 : There are infinitely many primes.
Euclid's proof of the infinitude of primes is one of the first examples of mathematical proof that students encounter. It goes roughly as follows:
Proof 1 (Euclid)
Suppose there are only finitely many primes, so that is the largest prime. Define . Then none of the divide , so must be prime, as every positive integer greater than admits a prime factorization. But , contradicting the assumption that is the largest prime. Therefore, there are infinitely many primes.
It is elegant: it is succinct, effective, simple in structure, and uses few assumptions. It is methodologically pure: the techniques and auxiliary lemmas used are number-theoretic in nature. And it promotes understanding: the explicit construction of
Of course, according to the peculiar mathematical tradition of re-proving established results, dozens of alternate proofs of Theorem 1 have since been proposed. A particularly striking example is Harry Furstenburg's 1955 proof, relying on (what he claimed was) a topological argument. Here is a sketch of Furstenburg's version:
Proof 2 (Furstenburg)
Observe that the collection of arithmetic progressions fromto furnishes a basis for a topology on . Each arithmetic progression is also closed, as its complement is the union of finitely many other arithmetic progressions. Therefore, the union of finitely many arithmetic progressions is closed. Define , where is the set of multiples of and varies over the set of primes. Then . Since is not open, cannot be closed; in particular, is not a finite union of closed sets. Therefore, there are infinitely many primes.
While Furstenburg's argument hinges on properties of open and closed sets, it is not immediately clear that this appeal to a topological structure is actually essential to the argumentative structure of this proof. The integers are not an object that topological methods are "meant" to analyze: the intuitive schema of continuous transformations between continuous spaces, that motivated the the axioms of topological spaces in the first place, does not apply to the integers. Indeed, the integers are in a sense, (if viewed as an extension of the Peano characterization of the natural numbers) a fundamentally discrete, inductively generated collection. This suggest that Furstenburg's proof is not methodologically pure; the language of the proof differs significantly from the language used to formulate the proposition.
This is not to say that this sort of methodological impurity should be discouraged in mathematics. Divergence from purity of method is, in many cases, precisely that which drives mathematical discovery and insight. This is related to the concept of mathematical depth; a proof might be said to be deep if it unites ideas from different areas of mathematics in a novel or surprising way. Indeed, many of the most important (and deepest) branches of modern mathematical research are founded explicitly on cross-disciplinary connections that defy methodological boundaries. The abstraction that results from formalizing intuition into general mathematical definitions is a powerful tool; mathematics would certainly be impoverished if we could only apply "methodologically pure" tools to a given problem.
This note seeks to tease out the coincident underlying structure of Euclid's and Furstenburg's proofs, which exists independently of method. The ultimate aim of this study is to uncover structural and epistemic invariants of informal mathematical proofs, in analog to Dominic Hughes' combinatorial proofs as an invariant on proofs in propositional logic.
My claim is that Euclid's proof is a "minimal" representative in the equivalence class of proofs establishing Theorem 1, and that the mapping of any other proof in this class (Furstenburg's, say) to the proposition it establishes factors through Euclid's proof. That is, there is an explicit "structural" reduction of Furstenburg's proof that reduces it to Euclid's argument, which in turn establishes Theorem 1.
There is a straightforward notion of equivalence for propositions. Two propositions are equivalent if each entails the other. For equivalent propositions
This flexibility is simply a consequence of the way these equivalences behave; they are defined such that the mapping from proofs to propositions is stable on equivalence classes. So why do mathematicians bother to re-prove theorems at all? This question has been well-treated by mathematicians and philosophers alike, with a prevailing point of view being that this practice promotes mathematical understanding, which can in turn catalyze the development of new techniques to enable mathematical discovery. More relevant to this context, new proofs of already established theorems may also differ from old proofs along dimensions of purity, depth, rigor, cognitive efficiency, or other qualitative features of informal proof.
In the case of purity, different methodological or syntactic instantiations of the same minimal proof structure are more (or less) well-suited to prove different members of the corresponding equivalence class of propositions. This inverts the usual notion of methodological purity; rather than the statement of the proposition determining the purity of proofs, the minimal proof of a class of propositions determines the minimal, or "purest," representative of that class. This suggests that the meaning of a proposition (rather, an equivalence class of propositions), is in this sense determined by the minimal representative of the the associated class of proofs.
These two claims are quite strong, and they require much substantiation: we will use Euclid's proof of the infinitude of primes as a case study. The goal of this note is to suggest that Euclid's proof is the minimal representative of the appropriate class of proofs, with a view toward finding a mathematical procedure to reduce Furstenburg's proof to Euclid's. In this way, the problem of the infinitude of primes is (unsurprisingly) number-theoretic in nature rather than, say, topological. On this basis, Euclid's argument is the most qualified candidate for a "Book" proof of the infinitude of primes.
Furstenburg Mercer
Over half a century after Furstenburg's proof, Idris Mercer, perhaps with similar considerations in mind, claimed to give a "variant of Furstenburg's proof that avoids topological language." This wording is striking: after removing the topological language, what remains of Furstenburg's proof? Mercer makes an implicit claim that Furstenburg's proof is not essentially topological. Thus, Mercer commits to the existence of an underlying argument structure that is independent of method. This attitude is not uncommon in mathematics. Authors often refer to "standard ___ arguments" to describe families of well-known proof techniques, even in contexts quite distant from those in which they first arose. The reader has surely encountered diagonal arguments, compactness arguments, fixed-point arguments, symmetry arguments, counting arguments, and so on.
Here is a sketch of Mercer's variant of Furstenburg's proof. He includes two lemmas, which are preserved here for later reference.
Proof 3 (Mercer)
Lemma 1: A finite intersection of arithmetic progressions is either empty or infinite.
Proof. Letbe an intersection of a finite collection of arithmetic progressions. We may assume that has an element, as is trivial to construct an empty intersection. Let , so that for all . Then for all , as well.
Lemma 2: For a collection of sets, a finite intersection of finite unions of sets in is a finite union of finite intersections of sets in .
Proof. This is immediate from the fact that set intersection distributes over union.
Letdenote the set of integers not divisible by , and note that each is a finite union of arithmetic progressions: . Assume that there are finitely many primes . Then But by Lemma 2, this set is a finite union of finite intersections of arithmetic progressions, which is either empty or infinite by Lemma 1. This is a contradiction. Therefore, there are infinitely many primes.
Why does Mercer claim that his version is a variant of Furstenburg's, rather than a different proof altogether? The situation is clarified if the proofs are compared in a more standardized form, resembling the style of "natural formalization" Sieg uses to compare proofs of the Cantor-Bernstein theorem.
The "dest. H" and "contra. H" rules correspond to the introduction of the underlined hypothesis and the contradiction derived therefrom. Here, the
Note that
This correspondence between auxiliary lemmas is likely what Mercer had in mind when he claimed that his proof is a variant of Furstenburg's, rather than a different proof altogether. There are at most as many lemmas used in Mercer's proof as Furstenburg's, with each being at most as general as the corresponding statement in Furstenburg's proof. This correspondence between auxiliary lemmas suggests that the topological language employed by Furstenburg was not essential to the structure of his proof, and was either more general than needed (in the case of
Mercer Euclid
We can also construct a "natural formalization" for Euclid's proof of the infinitude of primes.
where the auxiliary lemmas
and the penultimate iff expresses the property of
The implication
To clarify this discussion, we can make the implication
As demonstrated in Mercer's proof,
This witness of the intersection, guaranteed by the proof of
The idea of constructing an implication from
As a final note, though both Mercer and Euclid's proofs are presented as proofs by contradiction, the fact that they each rely on only the Fundamental Theorem of Arithmetic and this inductive algorithm for generating a larger prime from any finite set of primes means that either could be construed as a direct proof; just demonstrating this algorithm implies Theorem 1, without a need to pass to a contradiction.
Summary: Furstenburg Euclid
We can compose the implications
and to assert that Furstenburg's proof is also a more general version of Euclid's. We thus obtain the following picture
If, for example, these were the only possible proofs for the infinitude of primes, then the strong claim that there is a "terminal object" in the equivalence class of proofs of a proposition would hold in this case: Euclid's proof is the "minimal" proof of the infinitude of primes, on the basis that it is the least general proof needed to establish the truth of the proposition.
However, the difficulty lies in showing that this chain of implications in an equivalence class of proofs actually terminates in a single, minimal, proof object. As Euclid's proof is number-theoretic in method (it relies only on the Fundamental Theorem of Arithmetic and an algorithm for constructing prime numbers), this would suggest, on the basis of the discussion in the introduction, that the statement of Theorem 1 is the "minimal" representative in the class of equivalent propositions, and that, no matter the language used to formulate such equivalent propositions, this property is essentially number-theoretic in nature. This is why Furstenburg's proof might "feel" less natural to a mathematician than Euclid's. Whether this determination of the methodological nature of a problem is applicable to more complicated propositions, or even whether it's a meaningful property in the first place, is also a topic for further study.
The discussion in this note is quite suggestive that this claim is true, but this analysis should be taken with a grain of salt; as discussed, Euclid's proof is exceptionally simple, elegant, and pure compared to the proofs of most mathematical propositions. And, though the method here examines the lemmas in detail, simply repeating this ad hoc approach for every mathematical proposition is not a viable approach. Realistically, this procedure for finding minimal proofs of a proposition cannot rely on the particular properties of the auxiliary lemmas, in the way that we discussed the intricacies of, say,
Even without justification for this more general claim, that Euclid's proof is the "minimal" proof of the infinitude of primes, we can still make the more modest assertion that Furstenburg's and Mercer's proofs are really just generalizations of the structure of Euclid's proof, disguised in topological and set-theoretic clothing. The hope is that this method of analysis of Theorem 1 will lead to a suitable generalization of structural comparison between proofs.