1 The consistency strength hierarchy
The starting points of this story are Gödel’s incompleteness theorems. These theorems establish fundamental restrictions on what can be proven on the basis of any reasonable axiomatic theory. By a reasonable axiomatic theory I mean a consistent, recursively axiomatized theory that interprets a modicum of arithmetic.
Theorem 1.1 [Reference Gödel11, Reference Rosser30]
No reasonable axiomatic theory is complete.Footnote 1
Theorem 1.2 [Reference Gödel11]
No reasonable axiomatic theory proves its own consistency.
The first incompleteness theorem shows that, no matter what principles we endorse, there will be questions that cannot be resolved on the basis of those principles. This means that there is no universal axiom system within which mathematics can be developed. Instead, we are left with a vast array of axiomatic theories. The second incompleteness theorem yields the primary method for mapping out this vast array of theories. Fix some reasonable base theory B. Let’s now recall some standard definitions.
Definition 1.3. For reasonable theories T and U, we say that
$T \leq _{\mathsf {Con}}U $
if B proves that the consistency of U implies the consistency of T.
$T\equiv _{\mathsf {Con}}U$
if
$T \leq _{\mathsf {Con}} U $
and
$U \leq _{\mathsf {Con}} T$
.
$T <_{\mathsf {Con}} U$
if
$T \leq _{\mathsf {Con}} U$
and
$U \nleq _{\mathsf {Con}} T$
.
The notion of strength that
$<_{\mathsf {Con}}$
engenders is known as consistency strength. The structure of all reasonable theories ordered by consistency strength is quite complicated. As far as I know, the following theorems are folklore.
Theorem 1.4 (Folklore)
The ordering
$<_{\mathsf {Con}}$
is not pre-linear, i.e., there are theories T and U such that
$T\nleq _{\mathsf {Con}}U$
and
$U \nleq _{\mathsf {Con}} T$
.
Theorem 1.5 (Folklore)
The ordering
$<_{\mathsf {Con}}$
is ill-founded, i.e., there is a sequence
$T_0>_{\mathsf {Con}} T_1 >_{\mathsf {Con}} T_2 >_{\mathsf {Con}} \dots $
where each
$T_i$
is consistent.
All known instances of non-linearity and ill-foundedness have been discovered by defining theories in an ad-hoc manner using self-reference and other logical tricks. The restriction of
$<_{\mathsf {Con}}$
to natural axiomatic theories—those that arise in practice—is a pre-well-ordering.Footnote
2
Here is a tiny snapshot of such theories:
The well-ordering phenomenon persists, taking a very liberal view of what constitutes a “natural” theory.Footnote 3 Note that the theories just cited run the gamut from weak fragments of arithmetic to subsystems of analysis and all the way to strong extensions of set theory. These theories come from different areas of mathematics (e.g., arithmetic, analysis, and set theory) and often codify different conceptions of mathematics. Indeed, many of the natural extensions of set theory that have been investigated are jointly inconsistent, yet comparable according to consistency strength.
Explaining the contrast between natural axiomatic theories and axiomatic theories in general is widely regarded as a major outstanding conceptual problem in mathematical logic. The following passage is representative:
The fact that “natural” theories, i.e., theories which have something like an “idea” to them, are almost always linearly ordered with regard to logical strength has been called one of the great mysteries of the foundations of mathematics. ([Reference Friedman, Rathjen and Weiermann9, p. 382])
One can find similar quotes elsewhere in the literature. For instance, Shelah writes that explaining “the phenomena of linearity of consistency strength” is a “logical dream” [Reference Shelah34].Footnote 4
If it is true that natural axiomatic theories are pre-well-ordered by consistency strength, and not an illusion engendered by a paucity of examples, then one would like to prove that it is true. However, the claim that natural axiomatic theories are pre-well-ordered by consistency strength is not a strictly mathematical claim. The problem is that we lack a definition of “natural axiomatic theory.” Without a precise definition of “natural,” it is not clear how to prove this claim, or even how to state it mathematically.
Here is my plan for the rest of the article. In Section 2 I will discuss the relevance of the pre-well-ordering phenomenon to the search for new axioms in set theory. In Sections 3–5 I will discuss two research programs that emerged from the incompleteness theorems, namely, ordinal analysis and the study of iterated reflection principles. Both programs will play an important role in the approaches to the pre-well-ordering problem discussed in this article. In Section 6 I will discuss a phenomenon from recursion theory, namely, the well-ordering of natural Turing degrees by Turing reducibility; this phenomenon is analogous in many ways to the pre-well-ordering of natural theories by consistency strength. In Sections 8 and 9 I will discuss approaches to the pre-well-ordering problem that have emerged in recent work. Finally, in Section 10 I will discuss remaining questions and directions for future research.
2 The search for new axioms
In this section I will discuss how the pre-well-ordering phenomenon informs the search for new axioms in set theory.
Set theory has been developed in an explicitly axiomatic fashion, at least since Zermelo isolated the axioms that are the precursors to the standard
$\mathsf {ZFC}$
axioms. The progenitors of set theory discovered that various mathematical structures could be realized in set-theoretic terms and that their properties could be established on the basis of set-theoretic reasoning. Logicians often formulate this aspect of set theory in terms of the technical notion of interpretability. Roughly, a theory
$T_1$
is interpreted in another theory
$T_2$
if there is a translation
$\tau $
from the language of
$T_1$
to the language of
$T_2$
such that, for each sentence
$\varphi $
of the language of
$T_1$
, if
$T_1\vdash \varphi $
then
$T_2\vdash \tau (\varphi )$
.Footnote
5
As Koellner notes, interpretability is ubiquitous in mathematics: “Poincaré provided an interpretation of two dimensional hyperbolic geometry in the Euclidean geometry of the unit circle; Dedekind provided an interpretation of analysis in set theory; and Gödel provided an interpretation of the theory of formal syntax in arithmetic” [Reference Koellner15]. We may restate the aforementioned discovery that large swathes of mathematics can be developed in set theory as the discovery that
$\mathsf {ZFC}$
axioms interpret the axioms of various other branches of mathematics.
Despite the fact that
$\mathsf {ZFC}$
interprets many other theories, many of the central problems of set theory (such as the Continuum Hypothesis, the Projective Measure problem, and Suslin’s Hypothesis) cannot be resolved on the basis of the
$\mathsf {ZFC}$
axioms. That is, the
$\mathsf {ZFC}$
axioms suffice for resolving mainstream problems in all of the branches of mathematics with the conspicuous exception of set theory itself. This has motivated the search for new axioms for set theory.
Set theorists have investigated many extensions of
$\mathsf {ZFC}$
, including large cardinal axioms, determinacy axioms, forcing axioms, and more. Is it possible to make rational judgments about these principles and their consequences? Some point to the interpretability strength of
$\mathsf {ZFC}$
as its core virtue. Indeed, it is the interpretability strength of
$\mathsf {ZFC}$
that allows us to carry out so much mathematics on the basis of set-theoretic principles and to regard
$\mathsf {ZFC}$
as a “foundations” for mathematics. Perhaps, so the thought goes, we should embrace those axiomatic extensions of set theory that most enhance its interpretability power, thereby enriching the domain of mathematics as much as possible.Footnote
6
Steel has promoted the maxim “maximize interpretability strength” as a guiding principle in the search for new axioms [Reference Steel39]. Let’s call this principle steel’s maxim.Footnote
7
For a sentence
$\varphi $
that is independent of
$\mathsf {ZFC}$
, one can imagine the following four possibilities:
-
1.
$\varphi $
increases interpretability strength but
$\neg \varphi $
does not. -
2.
$\neg \varphi $
increases interpretability strength but
$\varphi $
does not. -
3. Neither
$\varphi $
nor
$\neg \varphi $
increases interpretability strength. -
4. Both
$\varphi $
and
$\neg \varphi $
increase interpretability strength.
It turns out that all four possibilities are realized.Footnote 8 In the fourth case we cannot follow steel’s maxim or we would land ourselves in inconsistency. However, this is not typically taken as a refutation of steel’s maxim. The key point here is that when one restricts oneself to natural theories, only the first three possibilities are realized.Footnote 9 Indeed, in the fourth case we have an instance of non-linearity in the interpretability hierarchy, and there are apparently no natural instances of such incomparability. This is generally viewed as a kindred phenomenon to the pre-well-ordering phenomenon since, for natural theories, interpretability strength is intimately connected with consistency strength.Footnote 10
Consider once more the variety of extensions of
$\mathsf {ZFC}$
that are investigated by set theorists: large cardinal axioms, axioms of definable determinacy, forcing axioms, and more. These axiom systems have different motivations and often codify different foundational conceptions of mathematics. Nevertheless, they are pre-well-ordered by consistency strength. Indeed, they are pre-well-ordered according to all the common notions of proof-theoretic strength (such as 1-consistency strength,
$\Pi ^1_1$
reflection strength, and so on). A consequence is that these axiom systems all converge on arithmetic statements and even analytic statements. As Steel writes:
Thus the well-ordering of natural consistency strengths corresponds to a well-ordering by inclusion of theories of the natural numbers. There is no divergence at the arithmetic level, if one climbs the consistency strength hierarchy in any natural way we know of…. Natural ways of climbing the consistency strength hierarchy do not diverge in their consequences for the reals. Let
$T, U$
be natural theories of consistency strength at least that of “there are infinitely many Woodin cardinals”; then either
$(\Pi ^1_\omega )_T\subseteq (\Pi ^1_\omega )_U$
or
$(\Pi ^1_\omega )_U\subseteq (\Pi ^1_\omega )_T$
. ([Reference Steel40, p. 159])
That is, at the level of
$\Pi ^1_\omega $
statements—i.e., statements about
$\mathbb {R}$
—all natural theories converge. Moreover, steel’s maxim and its variants suggest that we ought to endorse the sentences on which they converge. Thus, the apparent pre-well-ordering of theories by consistency strength and other notions of proof-theoretic strength plays a central role in the search for new axioms in set theory.
3 Ordinal analysis
At this point it will be helpful to discuss two research programs that deal with questions of systematically reducing incompleteness. One is Gentzen’s program of ordinal analysis and the other is Turing’s program (significantly extended by Feferman) of completeness via iterated reflection principles. Both programs can be understood as reactions to the incompleteness theorems. The practitioners of ordinal analysis have attempted to reduce incompleteness by proving consistency statements in a systematic way, namely, from ever stronger transfinite induction principles. Turing’s program, on the other hand, uses the second incompleteness theorem as the major engine for overcoming incompleteness; the practitioners of Turing’s program try to systematically effect reductions in incompleteness by successively adding consistency statements to theories as new axioms. Both programs will play a major role in the approach to the “great mystery” discussed here.
Ordinal analysis was developed in the context of Hilbert’s Program, an early twentieth-century research program pioneered by David Hilbert. To combat skepticism about the cogency of infinitary mathematics, Hilbert proposed to (i) axiomatize infinitary mathematics and (ii) prove the consistency of the axioms by finitary means. In 1931, Hilbert’s program reached a major obstacle in the form of Gödel’s second incompleteness theorem [Reference Gödel11]. Indeed, it follows from Gödel’s theorem that if the principles of finitistic mathematics are codifiable in a reasonable axiomatic theory, then they do not prove their own consistency, much less the consistency of stronger theories. Thus, it is generally agreed that Hilbert’s program failed.
Gentzen was apparently undeterred, however. Not long after Gödel proved the incompleteness theorems, Gentzen produced a consistency proof of arithmetic.
Theorem 3.1 [Reference Gentzen10]
If
$\varepsilon _0$
is well-founded, then arithmetic is consistent.
The principles invoked in Gentzen’s proof are nearly universally regarded as finitistically acceptable, except for the principle that the ordinal number
$\varepsilon _0$
is well-founded. Gentzen’s consistency proof marked the beginning of ordinal analysis, a research program whereby similar consistency proofs have been discovered for a wide range of axiomatic theories. Developing such a consistency proof for a theory T involves, among other things, determining the proof-theoretic ordinal of T. Informally, the proof-theoretic ordinal of T is the least ordinal
$\alpha $
such that induction along
$\alpha $
suffices to prove the consistency of T. Making this informal definition precise is no easy task, as we shall see.
The methods of ordinal analysis have been used to analyze many theories of interest, including subsystems of first-order arithmetic, second-order arithmetic, and set theory. Ordinal analysis has not yet reached the level of full second-order arithmetic. Nevertheless, it is reasonable to expect that the existing results are part of a general connection between ordinals and consistency. Do Gentzen-style methods suffice for proving the consistency of any axiomatic theory? The following result might seem to suggest a positive answer. (
$\mathsf {TI}^\prec _{\Pi _1}$
is a sentence expressing the validity of induction for
$\Pi _1$
predicates along
$\prec $
.)
Theorem 3.2 [Reference Kreisel, Shoenfield and Wang18]
For any reasonable theory T there is a primitive recursive presentation
$\prec $
of an ordinal such that
$\mathsf {PA}+\mathsf {TI}^\prec _{\Pi _1}\vdash \mathsf {Con}(T)$
.Footnote
11
This theorem is proved by showing that, given any true consistency statement
$\varphi $
, one can “encode” the truth of
$\varphi $
into an ordinal notation system
$\prec $
. One would presumably not recognize the validity of
$\Pi _1$
transfinite induction along such a notation system without knowing the truth of
$\varphi $
, so one could not use such a transfinite induction principle to prove
$\varphi $
. Thus, the epistemic value of this theorem is limited.
As we will see, this is one version of a pervasive problem known as the canonicity problem. The ordinal notations that are devised to prove Theorem 3.2 are not natural notations. They form a notation system that one would introduce only in an ad-hoc manner to solve a problem in proof theory. Does the distinction between the pathological notation systems and the natural notation systems reflect some intrinsic mathematical properties of the notation systems? If one had a convincing definition of “natural,” one might conjecture that, for any reasonable theory T, there is a natural presentation
$\prec $
of a recursive ordinal such that
$\mathsf {PA}+\mathsf {TI}^\prec _{\Pi _1}\vdash \mathsf {Con}(T)$
. However, at present there is no convincing evidence that it is possible to precisely define the “natural” notation systems.
The canonicity problem also makes it difficult to define “proof-theoretic ordinal,” as suggested earlier. One might try to define the proof-theoretic ordinal of T as the least ordinal
$\alpha $
such that induction along
$\alpha $
(along with finitary methods) suffices to prove the consistency of T. The problem with this definition is that, in formalized theories, transfinite induction principles are stated in terms of presentations of ordinals, not the ordinals themselves. Kreisel [Reference Kreisel17] shows that it is always possible to prove the consistency of a reasonable theory by induction along a sufficiently pathological presentation of
$\omega $
. Conversely, Beklemishev [Reference Beklemishev1] shows that, for any reasonable theory T and recursive ordinal
$\alpha $
, there is a sufficiently pathological presentation of
$\alpha $
such that transfinite induction along that presentation (along with finitary methods) does not prove the consistency of T. When one restricts one’s attention to “natural” presentations of ordinals, the naïve definition seems to work, but the current state of affairs is vexing and unsatisfactory.
At present, many conflicting definitions of the “proof-theoretic ordinal” of a theory have been proposed; these definitions often coincide in crucial cases. Perhaps the most common definition is this: The proof-theoretic ordinal of a theory T is the supremum of the order-types of the primitive recursive well-orderings whose well-foundedness is provable in T. This is sometimes called the
$\Pi ^1_1$
ordinal of a theory. Of course, this notion is useful only for measuring the proof-theoretic strength of theories in which well-foundedness is expressible, so it does not apply, e.g., to fragments of first-order arithmetic. Moreover, it is a somewhat coarse notion of strength, since
$\Pi ^1_1$
ordinals are invariant under the addition of true
$\Sigma ^1_1$
sentences to the object theory. Nevertheless, the
$\Pi ^1_1$
ordinal of a theory is an ordinal, not a presentation of an ordinal, so this is a somewhat robust notion.
4 Turing–Feferman progressions
4.1 Turing’s completeness theorem
The second incompleteness theorem suggests a method for dealing with the first incompleteness theorem. According to Turing, if we endorse the axioms of a reasonable axiomatic theory T, then there is a principled way of extending T, namely, by adopting T’s consistency statement as an axiom. Of course, if one adopts the statement
$\mathsf {Con}(T)$
as an axiom, the statement
$\mathsf {Con}(T+\mathsf {Con}(T))$
remains unprovable. However, there is a principled way of resolving this problem, namely, by adopting
$\mathsf {Con}(T+\mathsf {Con}(T))$
as an axiom. Of course, this engenders a new problem, but it just as easily engenders a new solution, whence this process can be iterated ad infinitum. Given presentations of recursive ordinals in the language of arithmetic, one can even extend this process into the effective transfinite.
Naïvely, we might try to define the iterations of consistency over a theory T as follows:
-
•
$T_0:=T;$
-
•
$T_{\alpha +1} := T_\alpha + \mathsf {Con}(T_\alpha );$
-
•
$T_\lambda :=\bigcup _{\alpha <\lambda }T_\alpha $
for
$\lambda $
a limit.
However, such a definition does not even pin down the theory
$T_{\omega +1}$
. According to the definition,
$T_{\omega +1}$
is just
$T_\omega +\mathsf {Con}(T_\omega )$
, but for
$\mathsf {Con}(T_\omega )$
to be a statement of arithmetic we must have some effective presentation of
$T_\omega $
, and there are many choices for the latter. Accordingly, when one defines iterated consistency statements, one must first fix an ordinal notation system
$\prec $
. One can then define the iterations of consistency along
$\prec $
within arithmetic via Gödel’s fixed point lemma. Let me briefly indicate how this is done; thenceforth I will only give the intuitive “definitions” by transfinite recursion, trusting the reader to fill in the details.Footnote
12
We fix a formula
$Ax(x,T)$
which formalizes that x is an axiom of T. We now want to define a formula
$Ax(x,\alpha ,T)$
that defines the axioms of the theory
$T_\alpha $
. We define the formula as the following fixed point:
Put a bit more explicitly:
$$ \begin{align*}&\mathsf{PA} \vdash Ax(x,\alpha,T) \\&\quad\!\!\leftrightarrow \kern-1pt\Big(Ax(x,T) \vee \exists \beta\prec \alpha \; x=\ulcorner \forall z \mathsf{Con}\big(\bigwedge \{ w \mid w\leq z \wedge Ax(w,\beta,T) \}\big)\urcorner\Big).\end{align*} $$
Note that the three-place formula
$Ax(\cdot ,\cdot ,\cdot )$
appears on the right but only in Gödel quotes, whence this is a legitimate application of the fixed point lemma.
Does iteratively endorsing consistency statements in this manner effect a significant reduction in incompleteness? The following result may seem to suggest a positive answer.
Theorem 4.1 [Reference Turing41]
For any true
$\Pi _1$
sentence
$\varphi $
, there is a presentation
$\dot \alpha $
of
$\omega +1$
such that
$\mathsf {PA}_{\dot \alpha }\vdash \varphi $
.
This result is known as Turing’s Completeness Theorem. At first glance this theorem seems epistemically significant: We can come to know the truth of any
$\Pi _1$
statement
$\varphi $
simply by iterating consistency statements. How are the consistency statements used in the proof of
$\varphi $
? The disappointing response is that they are not used at all. Instead, the truth of
$\varphi $
is encoded into a non-standard description of the base theory
$\mathsf {PA}$
, which becomes available at iteration
$\omega +1$
. Given this non-standard description of
$\mathsf {PA}$
, discerning what the theory
$\mathsf {PA}_{\dot \alpha }$
is committed to requires knowing the truth-value of
$\varphi $
. This is another appearance of the canonicity problem. As far as I know, this was the initial appearance of the problem, and Turing was the first to identify it.
Turing was careful not to overstate the epistemic significance of his theorem. Indeed, he wrote:
This completeness theorem as usual is of no value. Although it shows, for instance, that it is possible to prove Fermat’s last theorem…(if it is true) yet the truth of the theorem would really be assumed by taking a certain formula as an ordinal formula. ([Reference Turing41, p. 213])
Nevertheless, one might hope that iterating
$\mathsf {Con}$
along natural ordinal notations would also suffice to prove any true
$\Pi _1$
statement. Once again, if we had a precise definition of “natural” ordinal notation systems, we would be able to formulate a precise conjecture; however, no such characterization of “natural” ordinal notation systems is currently available.
Turing discussed the possibility of using iterated consistency progressions to measure the strength of
$\Pi _1$
statements. He wrote:
We might also expect to obtain an interesting classification of number-theoretic theorems according to “depth”. A theorem which required an ordinal
$\alpha $
to prove it would be deeper than one which could be proved by the use of an ordinal
$\beta $
less than
$\alpha $
. However, this presupposes more than is justified. ([Reference Turing41, p. 200])
Naïvely, one approach would be to assign to each true
$\Pi _1$
sentence
$\varphi $
the least ordinal
$\alpha $
such that
$\mathsf {PA}_\alpha \vdash \varphi $
as its measure of complexity. One could then extend this measure of complexity to theories: the complexity of T is the least ordinal such that
$\mathsf {PA}_\alpha \vdash \varphi $
for each
$\Pi _1$
theorem
$\varphi $
of T. Clearly, Turing’s completeness theorem shows that iterations of consistency depend on presentations of ordinals and not just on ordinals themselves. The possibility remains, however, that one could prove informative results of this sort for large swathes of interesting theories by antecedently fixing some natural notation system. Beklemishev has recently pursued this possibility, as we will discuss in Section 5.
4.2 Reflection principles
Turing’s work was later pursued and greatly extended by Feferman [Reference Feferman6]. Feferman shifted the focus from iterated consistency to iterated reflection principles of other sorts. We pause here to describe these reflection principles.
Feferman’s focus was on the full uniform reflection schema, denoted
$\mathsf {RFN}(T)$
. To discuss this schema, we must review some notation. In what follows,
$\overline {m}$
is the standard numeral of the number m and
$\ulcorner \varphi \urcorner $
is the Gödel number of the formula
$\varphi $
. The expression
$\ulcorner \varphi (\dot {x_1},\dots ,\dot {x_n})\urcorner $
denotes an elementary definable term for the function taking
$k_1,\dots ,k_n$
to the Gödel number
$\ulcorner \varphi (\overline {k_1},\dots , \overline {k_n} )\urcorner $
of the result of substituting the numerals
$\overline {k_1},\dots , \overline {k_n}$
for the variables
$x_1,\dots ,x_n$
in
$\varphi $
.
Assuming that T is a theory in the language
$\mathcal {L}_A$
of arithmetic, the full uniform reflection principle for T is the schema:
We sometimes consider restrictions of the full uniform reflection schema to various complexity classes. One such schema is the restriction of the uniform reflection principle to
$\Sigma _n$
formulas, i.e.,
In fact, this principle is axiomatizable over weak systemsFootnote 13 by a single sentence, namely,
Likewise, the schema of uniform
$\Pi _n$
reflection
also has a finite axiomatization:
The last reflection principle that we introduce is that of n-consistency. We say that a theory is n-consistent if it is consistent with the true
$\Pi _n$
theory of arithmetic. We can express that a theory T is n-consistent as follows:Footnote
14
In fact, over
$\mathsf {EA}$
, the statements
$\mathsf {RFN}_{\Sigma _n}(T)$
,
$\mathsf {RFN}_{\Pi _{n+1}}(T)$
, and
$ (n+1) \mathsf {Con}(T)$
are all provably equivalent.
4.3 Feferman’s completeness theorem
Feferman’s theorem concerns the iteration of the full uniform reflection schema. After fixing a presentation
$\prec $
of a recursive ordinal, we can then define iterations of reflection over a theory T so as to satisfy the following conditions:
-
•
$T^{\mathsf {RFN}}_0:=T;$
-
•
$T^{\mathsf {RFN}}_{\alpha +1} := T^{\mathsf {RFN}}_\alpha + \mathsf {RFN}(T^{\mathsf {RFN}}_\alpha );$
-
•
$T^{\mathsf {RFN}}_\lambda :=\bigcup _{\alpha <\lambda }T^{\mathsf {RFN}}_\alpha $
for
$\lambda $
a limit.
Feferman proved that, merely by iterating the uniform reflection schema along presentations of recursive ordinals, one can accrue resources sufficient, not only for proving any true
$\Pi _1$
statement, but for proving any true arithmetical statement.
Theorem 4.2 [Reference Feferman6]
For any true arithmetical sentence
$\varphi $
, there is a presentation
$\dot \alpha $
of an ordinal
$\alpha <\omega ^{\omega ^\omega +1}$
such that
$\mathsf {PA}^{\mathsf {RFN}}_{\dot \alpha }\vdash \varphi $
.
This result is known as Feferman’s Completeness Theorem. In the proof of this theorem, Feferman used Turing’s technique of encoding the truth of statements into presentations of recursive ordinals, among other things. Feferman also proved that there are paths P through Kleene’s
$\mathcal {O}$
such that iterating uniform reflection along P is arithmetically complete [Reference Feferman6]. However, no such path is even
$\Sigma ^1_1$
definable [Reference Feferman and Spector7]. That is, identifying such a path is more difficult than identifying the set of arithmetical truths. Thus, the epistemic significance of these results—as Feferman emphasized—is limited.
5 Ordinal analysis via iterated reflection
In recent years there has been interest in the interface between ordinal analysis and Turing progressions. This research has been motivated by a number of drawbacks of the research discussed in the previous two sections.
As discussed in Section 3, the standard notion of the
$\Pi ^1_1$
ordinal of a theory is (i) only applicable to theories in which “well-foundedness” is expressible and (ii) insensitive to the true
$\Sigma ^1_1$
consequences of a theory. Building on [Reference Schmerl32], Beklemishev [Reference Beklemishev2] introduced the notion of the
$\Pi _1$
ordinal of a theory. This notion of “ordinal analysis” is both (i) suitable for subsystems of first-order arithmetic and (ii) sensitive to the
$\Pi _1$
consequences of theories. Iterations of consistency in the style of Turing play a central role in the definition of
$\Pi _1$
ordinals.
First we fix a base theory: Beklemishev uses
$\mathsf {EA}$
, a weak subsystem of arithmetic.
$\mathsf {EA}$
proves the totality of exponentiation but not superexponentiation;
$\mathsf {EA}$
is just strong enough to carry out arithmetization of syntax in the standard way. We then fix some natural ordinal notation system and an elementarily defined extension T of
$\mathsf {EA}$
. We define the iterations of T so as to satisfy the following conditions:
-
1.
$\mathsf {EA}_0:=\mathsf {EA;}$
-
2.
$\mathsf {EA}_{\alpha +1}:=\mathsf {EA}_\alpha +\mathsf {Con}(\mathsf {EA}_\alpha );$
-
3.
$\mathsf {EA}_\lambda : =\bigcup _{\alpha <\lambda }\mathsf {EA}_\alpha $
for
$\lambda $
a limit.
Given a target theory T, the
$\Pi _1$
ordinal of T (relative to the base theory
$\mathsf {EA}$
and notation system
$\prec $
) is defined as follows:
This definition yields interesting information for theories that are conservatively approximated by iterations of consistency over
$\mathsf {EA}$
; that is, for any T such that
$T\equiv _{\Pi _1} \mathsf {EA}_\alpha $
, where
$\alpha =|T|_{\Pi _1}$
. Knowing that T is so approximated is useful, because there are elegant equations, first discovered in [Reference Schmerl32] but refined in [Reference Beklemishev2], which spell out conservation relations between iterated reflection principles.
$\Pi _1$
ordinals are well-defined for theories of first-order arithmetic in which well-foundedness is not directly expressible. The definition is also sensitive to
$\Pi _1$
sentences. For instance, whereas the
$\Pi _1$
ordinal of
$\mathsf {PA}$
is
$\varepsilon _0$
, the
$\Pi _1$
ordinal of
$\mathsf {PA}+\mathsf {Con}(\mathsf {PA})$
is
$\varepsilon _0\times 2$
.
Note that the definition of
$\Pi _1$
ordinals achieves Turing’s goal of providing a metric of strength for theories in terms of the number of iterations of consistency required to capture their
$\Pi _1$
theorems. In the cases that have been investigated, it turns out that Turing’s metric of strength is the same one provided by ordinal analysis. That is, the definition of
$\Pi _1$
ordinals coincides with other standard definitions of proof-theoretic ordinal in standard cases. To provide such a metric of strength, we must fix (in advance) a natural ordinal notation system. Moreover, the definition works only for theories whose
$\Pi _1$
fragments can be conservatively approximated by iterated consistency statements. This is apparently a feature of “natural theories” but not of all theories. This approach to ordinal analysis was a major inspiration for the approach to the main topic of this article discussed in Sections 8 and 9.
These ideas are closely related to Beklemishev’s approach to the canonicity problem for ordinal notation systems [Reference Beklemishev3, Reference Beklemishev4]. I will briefly describe the terms that make up Beklemishev’s ordinal notation system for analyzing
$\mathsf {PA}$
and its fragments. The terms are generated by the constant symbol
$\top $
and the function symbols
$\Diamond _0(\cdot ), \Diamond _1(\cdot ), \Diamond _2(\cdot ),$
etc. The ordering on these terms is defined in terms of a variable-free polymodal logic
$\mathsf {GLP}_0$
.Footnote
15
In particular,
Let me emphasize one aspect of this term system that is relevant to present concerns. First, we define a translation
$\star $
from the variable-free polymodal language into the language of arithmetic such that:
-
•
$\top ^\star := 0=0;$
-
•
$\Diamond _n(\varphi )^\star := n\mathsf {Con}(\mathsf {EA}+\varphi ^\star );$
-
•
$\star $
commutes with Boolean connectives.
For every
$\varphi $
of the variable-free polymodal language,
$\mathsf {GLP}_0$
proves
$\varphi $
if and only if
$\mathsf {EA}$
proves
$\varphi ^\star $
.Footnote
16
So the ordering on terms corresponds with consistency strength over
$\mathsf {EA}$
in the following sense:
This notation system is well-suited to ordinal analysis via iterated reflection.
$\mathsf {PA}$
can be conservatively approximated over
$\mathsf {EA}$
by the interpretations of the terms of the notation system.
The connection with the canonicity problem is this: If these reflection principles are regarded as a canonical means of specifying the theory
$\mathsf {PA}$
, then the ordinal notation system and not just the ordinal has been extracted from a canonical presentation of
$\mathsf {PA}$
. The results discussed in this article support the idea that reflection principles are canonical. Moreover, these results support the idea that natural theories can be approximated by iterating reflection principles. It remains to be seen whether the canonicity of reflection principles can be established in a way that bears on the problem of canonical ordinal notations. That is, the results discussed in this article, though they show that reflection principles are in some sense canonical, seem too coarse grained to isolate Beklemishev’s notation system as canonical.Footnote
17
6 Turing degree theory
The pre-well-ordering of natural theories is paralleled by phenomena in Turing degree theory. Let’s start with a discussion of Post’s Problem. Post asked whether there is a recursively enumerable degree d such that
$0 <_T d <_T 0'$
. This question was answered positively by Friedberg and Muchnik. However, they constructed such intermediate degrees by deploying ad hoc methods; the degrees that they constructed are widely regarded as “unnatural.” Is there a natural recursively enumerable degree d such that
$0 <_T d <_T 0'$
? Once again, it is not entirely clear how to address this problem. It is not a wholly mathematical problem since there is no precise mathematical definition of “natural” Turing degrees.
Sacks clarified this situation by asking whether there is a degree-invariant solution to Post’s Problem. Let me briefly explain.
Definition 6.1. A function
$W : 2^\omega \to 2^\omega $
is a recursively enumerable operator if there is an
$e\in \mathbb {N}$
such that, for each A,
$W(A)=W_e^A$
.
Definition 6.2. A function
$f:2^\omega \to 2^\omega $
is degree-invariant if for all reals A and B,
$A\equiv _T B$
implies
$f(A)\equiv _T f(B)$
.
One oft-noted feature of natural Turing degrees is that their definitions relativize. Relativizing the definition of a Turing degree yields a degree-invariant function on the reals. For instance, relativizing the definition of
$0'$
yields the Turing jump
$\lambda X.X'$
. We are now in a position to state Sacks’ question.
Question 6.3 [Reference Sacks31]
Is there a degree-invariant solution to Post’s Problem? That is, is there a degree-invariant recursively enumerable operator W such that for every real A,
$A<_T W(A) <_T A'$
?
This question is still unresolved. A negative answer would explain why the only natural recursively enumerable degrees are
$0$
and
$0'$
. Relativizing the definition of any natural recursively enumerable degree should also yield a degree-invariant recursively enumerable operator. Thus, if there are no degree-invariant recursively enumerable operators strictly between the identity and the jump, it suggests that there are no natural recursively enumerable degrees other than
$0$
and
$0'$
.
Sacks’ question is actually a special case of a more wide-reaching question that has motivated a considerable amount of research into the Turing degrees. The Turing degrees are not linearly ordered by
$<_T$
. That is, there are distinct degrees a and b such that
$a\nleq _Tb$
and
$b\nleq _Ta$
. The Turing degrees are also ill-founded. That is, there are infinite sequences
$(a_n)_{n<\omega }$
such that
$a_k>_Ta_{k+1}$
for all k. These two results mean that it is neither possible to compare nor to rank Turing degrees in general. This behavior is exhibited below
$0'$
, but it is in fact realized throughout the Turing degrees.
The degrees that exhibit these pathological properties have been constructed using ad-hoc recursion-theoretic techniques, like the priority method. When one restricts one’s attention to natural Turing degrees, the resulting structure is a well-ordering:
This state of affairs is remarkably similar to the state of affairs on the proof-theoretic side. Once again, it is not entirely clear how to address this problem. It is not a wholly mathematical problem since there is no precise mathematical definition of “natural” Turing degrees.
As already mentioned, relativizing the definition of natural Turing degrees yields a degree-invariant function on the reals. Just as relativizing the definition of
$0'$
yields the Turing jump
$\lambda X.X'$
, relativizing the definition of Kleene’s
$\mathcal {O}$
yields the hyperjump
$\lambda X.\mathcal {O}^X$
, relativizing the definition of
$0^\sharp $
yields the sharp function
$\lambda X.X^\sharp $
, and so on. The identity function and constant functions are also degree-invariant. Are these all the degree-invariant functions that there are? There are indeed many others. Indeed, for any real A, we can concoct a degree-invariant function
$\lambda X. A \oplus X'$
. Yet this function is clearly just a modification of the Turing jump, and if we restrict our attention to the Turing degrees above A, this function behaves exactly as the Turing jump does. Other examples of degree-invariant functions also seem to be mere variations of the functions already described. So one might suspect that—modulo a suitable equivalence relation—the identity, the constant functions, and generalizations of the Turing jump are the only degree-invariant functions.
Martin proposed a classification of the degree-invariant functions that makes sense of this informal suspicion. Martin’s Conjecture classifies the degree-invariant functions in terms of their behavior almost everywhere in the sense of a measure known as Martin Measure.Footnote
18
To state the definition of Martin Measure, recall that a cone in the Turing degrees is any set of the form
$\{a: a\geq _T b\}$
. Assuming
$\mathsf {AD}$
, Martin proved that every degree-invariant set of reals either contains a cone or is disjoint from a cone. Moreover the intersection of countably many cones contains a cone. Thus, assuming
$\mathsf {AD}$
, the function
$$\begin{align*}\mu(A)= \begin{cases} 1 & \text{ if } A \text{ contains a cone} \\ 0 & \text{ if } A \text{ is disjoint from a cone} \end{cases} \end{align*}$$
is a countably additive measure on the
$\sigma $
algebra of degree-invariant sets. This measure is called Martin Measure. In the statement of Martin’s Conjecture, almost everywhere means almost everywhere with respect to Martin Measure. If
$f(x)\leq _T g(x)$
almost everywhere (in the sense of Martin Measure), then we say that f is Martin below g and write
$f\leq _M g$
.
Conjecture 6.4 (Martin)
Assume
$\mathsf {ZF}+\mathsf {DC}+\mathsf {AD}$
. Then:
-
I. If
$f:2^\omega \rightarrow 2^\omega $
is degree-invariant, and f is not increasing almost everywhere then f is constant almost everywhere. -
II.
$\leq _M$
pre-well-orders the set of degree-invariant functions that are increasing almost everywhere. If f has
$\leq _M$
rank
$\alpha $
, then
$f'$
has
$\leq _M$
rank
$\alpha +1$
, where
$f'(x)=f(x)'$
for all x.
Martin’s Conjecture is stated under the Axiom of Determinacy,
$\mathsf {AD}$
. Though one can refute
$\mathsf {AD}$
in
$\mathsf {ZFC}$
, one cannot refute restrictions of
$\mathsf {AD}$
to sets of reals with sufficiently simple definitions.Footnote
19
Indeed, using
$\mathsf {ZFC}$
, one can prove Borel Determinacy (the restriction of the Axiom of Determinacy to Borel sets). Assuming appropriate large cardinal axioms, one can even prove that
$L(\mathbb {R})$
satisfies
$\mathsf {AD}$
. That is, as we make stronger and stronger assumptions, we can prove restrictions of determinacy to the “definable” sets for ever more liberal notions of definability. Hence, Martin’s Conjecture is often understood as a conjecture about the “definable” functions. Understood this way, Martin’s Conjecture roughly states that the only definable degree-invariant functions (up to almost everywhere equivalence) are constant functions, the identity function, and iterates of the Turing Jump.
Though Martin’s Conjecture is presently open, many informative partial results and special cases are known. Let me briefly mention some.Footnote 20
Definition 6.5. A function f is order-preserving if
$A\leq _T B$
implies
$f(A)\leq _T f(B)$
.
Theorem 6.6 [Reference Lutz and Siskind21]
Part I of Martin’s Conjecture holds for order-preserving functions.
Theorem 6.7 [Reference Slaman and Steel35]
Part II of Martin’s Conjecture holds for order-preserving Borel functions.
There are other results that speak to this phenomenon of well-foundedness. For instance, Steel proved that there cannot be any “simple” descending sequences in the Turing jump hierarchy, in the following sense.
Theorem 6.8 [Reference Steel37]
Let
$P\subset \mathbb {R}^2$
be arithmetic. Then there is no sequence
$(x_n)_{n<\omega }$
such that for every n,
-
(i)
$x_n\geq _T x^{\prime }_{n+1}$
and -
(ii)
$x_{n+1}$
is the unique y such that
$P(x_n,y)$
.
We will explore the analogy between the recursion-theoretic and proof-theoretic well-ordering phenomena throughout the rest of this article. In particular, we will look at proof-theoretic results inspired by Martin’s Conjecture and its partial realizations.
7 Interlude
My goal is to bring precision to the question of the pre-well-ordering of natural theories and to offer (at least partial) solutions. Before summarizing this research, I would like to mention three themes that will be interwoven.
The first theme is that many natural theories can be axiomatized by reflection principles over natural base theories. Indeed, the fragments of natural theories corresponding to different syntactic complexity classes can often be conservatively approximated by iterated reflection principles of the appropriate complexity class.
The second theme is that ordinal analysis is an expression of the well-ordering phenomenon. For different notions of “proof-theoretic strength,” there are corresponding notions of “proof-theoretic ordinals.” Insofar as the proof-theoretic ordinal afforded by some definition of “proof-theoretic ordinal” measures the proof-theoretic strength of theories, the attendant method of ordinal analysis well-orders the theories within its ken according to that notion of strength. The heuristic that we will try to vindicate in this article is the following: To calculate the proof-theoretic ordinal of a theory T is to determine T’s rank in the hierarchy of natural theories ordered by proof-theoretic strength.
The third theme is that reflection principles play a role in proof theory analogous to the role jumps play in recursion theory. Just as natural Turing degrees are apparently equivalent to ordinal iterates of the Turing jump, natural theories are apparently equivalent to ordinal iterates (along natural presentations of well-orderings) of reflection principles. Just as natural Turing degrees can be obliquely studied in terms of jumps, natural axiomatic theories can be obliquely studied in terms of reflection principles.
8 Analogues of Martin’s conjecture
Ordinal analysis is a means of well-ordering axiom systems, so one might expect some connections between ordinal analysis and the well-ordering phenomenon. Recall that Beklemishev’s method (see Section 5) for calculating the proof-theoretic ordinal of a theory T involves demonstrating that T can be approximated over a weak base theory by a class of formulas that are well-behaved. In particular, the
$\Pi _1$
fragments of natural theories are often proof-theoretically equivalent to iterated consistency statements over a weak base theory, whence these theories are amenable to ordinal analysis. Why are the
$\Pi _1$
fragments of natural theories proof-theoretically equivalent to iterated consistency statements?
Our approach to this problem is inspired by Martin’s Conjecture and similar problems in recursion theory. In this section we will formulate and discuss proof-theoretic versions of these problems. Let’s fix some notation. Throughout this section we will let T be a sound (i.e., true in the standard model), recursively axiomatized same-language extension of elementary arithmetic. We use the following notation frequently.
Definition 8.1.
$[\varphi ]_T$
is the equivalence class of
$\varphi $
modulo T provable equivalence, i.e.,
$[\varphi ]_T = \{\psi : T\vdash \varphi \leftrightarrow \psi \}$
.
Finally, in this section, whenever we write “true” we mean “true according to the standard interpretation
$\mathbb {N}$
of first-order arithmetic.”
Now, let’s consider an analogue of Post’s Problem. Are there theories whose deductive strength lies strictly between that of T and
$T+\mathsf {Con}(T)$
? The answer is positive. Indeed, there is a true T-independent
$\Pi _1$
sentence
$\varphi $
such that
$T+\mathsf {Con}(T)\vdash \varphi $
but
$T+\varphi \nvdash \mathsf {Con}(T)$
. Indeed, by the second incompleteness theorem
$T+\neg \mathsf {Con}(T)$
is consistent. Thus, by Rosser’s Theorem there is some sentence that is neither provable nor refutable in
$T+\neg \mathsf {Con}(T)$
. In particular, Rosser’s theorem shows that the sentence R is
$T+\neg \mathsf {Con}(T)$
-independent, where:
R “says” that all
$T+\neg \mathsf {Con}(T)$
proofs of R (if there are any) are preceded by
$T+\neg \mathsf {Con}(T)$
proofs of
$\neg R$
. An elementary argument then shows that
$T+ \big (\mathsf {Con}(T) \vee R \big ) $
has deductive strength strictly between T and
$T+\mathsf {Con}(T)$
.
The theory
$T+ \big (\mathsf {Con}(T) \vee R \big )$
has the properties we were looking for, but it is a highly unnatural theory. Nobody would have proposed
$\mathsf {Con}(T) \vee R$
as a principle of arithmetic; it was defined in an unusual way to exhibit quirky meta-mathematical features. Is there any way of explaining the unnaturalness of this formula in a mathematically exact way?
Recall that R was defined in terms of the theory
$T+\neg \mathsf {Con}(T)$
. Indeed, Rosser proved that for any reasonable theory U there is a U-independent sentence
$R_U$
where:
Note that
$R_U$
depends on some seemingly arbitrary details about the way that the theory U is presented. In particular, R makes a claim about the order in which proofs from the theory U are encoded by natural numbers. Yet there are many different ways of numerating proofs from the theory U. Some of these differences are merely differences in how U itself is presented or described (for instance, the order in which the axioms of U are listed). So, for instance, there could be two sentences
$\varphi $
and
$\psi $
such that
$T\vdash \varphi \leftrightarrow \psi $
but
$T\nvdash R_{T+\varphi } \leftrightarrow R_{T+\psi }.$
All of this is to say that the function
$\varphi \mapsto R_{T+\varphi }$
is not extensional over T.
Definition 8.2. A function
$f: \mathbb {N}\to \mathbb {N}$
is extensional over a theory T if for all
$\varphi $
and
$\psi $
:
Remark 8.3. Note that the previous definition technically involves an abuse of notation. Since f is a function from
$\mathbb {N}$
to
$\mathbb {N}$
, it should map the Gödel numbers of
$\varphi $
and
$\psi $
to the Gödel numbers of other formulas. If
$\theta _0,\theta _1,\dots $
is a fixed Gödel numbering of the formulas of arithmetic, then the following gives an exact description of the condition of extensionality:
We find that this particular abuse of notation is a useful aid for intuition and we henceforth use it without comment.
The output of an extensional function with input
$\varphi $
depends only on
$[\varphi ]_T$
. The identity function and constant functions are all extensional. So is the consistency operator
$\varphi \mapsto \mathsf {Con}(T+\varphi )$
and its iterates into the effective transfinite. We introduce the following notation to reduce clutter in what follows:
We will mostly be concerned with functions that are not only extensional but also monotone over T.
Definition 8.4. A function
$f: \mathbb {N}\to \mathbb {N}$
is monotone over a theory T if for all
$\varphi $
and
$\psi $
:
Note that all monotone functions are extensional but not vice-versa. The goal of this approach is to show that the consistency operator and its iterates are canonical monotone functions.
8.1 A positive theorem
Let’s begin with a positive result.Footnote 21 This result shows that sufficiently constrained recursive monotone functions must coincide with the consistency operator in the limit within the ultrafilter of true sentences. Note that “in the limit” plays a role analogous to that which “a.e. in the sense of Martin measure” does in Martin’s Conjecture. Here is how we precisely define “in the limit”:
Definition 8.5. A cone is any set of sentences of the form
$\{\varphi \mid T+\varphi \vdash \psi \}$
, where
$\psi $
is a sentence. A true cone is a cone that contains a sentence that is true according to the standard interpretation of the language of arithmetic.
One may think of true cones as “large” sets. Note that one must limit one’s attention to true cones for this to make sense. Indeed,
$[\bot ]_T$
is a cone but it is not large since it is a single Lindenbaum degree.
We must introduce one more technical condition before stating the theorem.
Definition 8.6. A function
$\mathfrak {g}$
is bounded if there exists an
$n\in \mathbb {N}$
such that for all
$\varphi $
,
$\mathfrak {g}(\varphi )$
is
$\Pi _n$
.
Theorem 8.7 [Reference Walsh44]
Let
$\mathfrak {g}$
be recursive, monotone, and bounded. Then one of the following holds:
-
• There is a true cone
$\mathfrak {C}$
such that for all
$\varphi \in \mathfrak {C}$
,
$T+\varphi \vdash \mathfrak {g}(\varphi ).$
-
• There is a true cone
$\mathfrak {C}$
such that for all
$\varphi \in \mathfrak {C}$
,
$T+\varphi +\mathfrak {g}(\varphi ) \vdash \mathsf {Con}_T(\varphi ).$
Roughly, this result states that any bounded recursive monotone operator must either be as weak as the identity operator in the limit or as strong as the consistency operator in the limit. By analogy with Sacks’ question, this result confirms the suspicion that there are no natural theories of deductive strength strictly between T and
$T+\mathsf {Con}(T)$
. If there were one, then relativizing it would produce a monotone operator of strictly intermediate strength, but there is no such operator.Footnote
22
Logicians have studied many variants of the consistency operator that display interesting logical behavior. Sometimes these variants are mistaken for counter-examples to Theorem 8.7, so it is worth pausing to discuss why this is not the case.
First, let’s consider the case of slow reflection over
$\mathsf {PA}$
[Reference Friedman, Rathjen and Weiermann9]. The slow consistency statement for
$\mathsf {PA}$
has the following form:
Note that
$F_{\varepsilon _0}$
is a recursive function whose totality is not provable in
$\mathsf {PA}$
. Since
$F_{\varepsilon _0}$
is recursive, its graph is
$\Sigma _1$
-definable, whence
$\mathsf {Con}^\diamondsuit (\mathsf {PA})$
is a
$\Pi _1$
sentence. The slow consistency statement is not provable in
$\mathsf {PA}$
, but it is strictly weaker statement than the ordinary consistency statement
$\mathsf {Con}(\mathsf {PA})$
. That is:
Theorem 8.8 [Reference Friedman, Rathjen and Weiermann9]
We have each of the following:
-
•
$\mathsf {PA}\vdash \mathsf {Con}(\mathsf {PA}) \to \mathsf {Con}^\diamondsuit (\mathsf {PA});$
-
•
$\mathsf {PA}\nvdash \mathsf {Con}^\diamondsuit (\mathsf {PA}) \to \mathsf {Con}(\mathsf {PA});$
-
•
$\mathsf {PA}\nvdash \mathsf {Con}^\diamondsuit (\mathsf {PA}).$
Relativizing the definition of slow consistency yields a slow consistency operator:
So why doesn’t the existence of the slow consistency operator refute Theorem 8.7? The key point is that the slow consistency operator “catches up” to the standard consistency operator for all inputs
$\varphi $
such that
$\mathsf {PA}+\varphi \vdash F_{\varepsilon _0}\downarrow $
. That is, if
$\mathsf {PA}+\varphi \vdash F_{\varepsilon _0}\downarrow $
then also
$\mathsf {PA}\vdash \big (\varphi \wedge \mathsf {Con}_{\mathsf {PA}}(\ulcorner \varphi \urcorner )\big )\leftrightarrow \big (\varphi \wedge \mathsf {Con}^\diamondsuit _{\mathsf {PA}}(\ulcorner \varphi \urcorner )\big ).$
Similar remarks explain why cut-free consistency is not a counter-example over the base theory
$\mathsf {EA}$
. Cut-free consistency just means consistency with respect to proofs that do not use the cut rule. Of course, consistency and cut-free consistency coincide by the cut-elimination theorem. However, the cut-elimination algorithm is superexponential, and it cannot be proved to terminate in
$\mathsf {EA}$
. In fact, cut-free consistency exhibits the following curious behavior within
$\mathsf {EA}$
.
Why isn’t this a counter-example to Theorem 8.7? For any
$\varphi $
which implies the cut-elimination theorem over
$\mathsf {EA}$
:
Slow consistency and cut-free consistency may seem to obviously be variants of the consistency operator. Indeed, both are defined by modifying the definition of consistency (in one instance by “slowing” it down and in the other by removing a rule from the proof system). Theorem 8.7 suggests one way in which this claim can be made precise. Slow consistency and cut-free consistency belong to the same equivalence class (modulo agreeing on a true cone) as the consistency operator.
Modulo an appropriate equivalence relation, Theorem 8.7 illustrates that the consistency operator is canonical. It thereby indicates that there are no natural theories with deductive strength strictly between that of T and
$T+\mathsf {Con}(T)$
. Nevertheless, there are many ways one could imagine improving Theorem 8.7. For instance: Must
$\mathfrak {g}$
be monotone, or would it be enough for
$\mathfrak {g}$
to be extensional? Must
$\mathfrak {g}$
be recursive, or would it be enough for
$\mathfrak {g}$
to be limit-recursive? Finally, can the result be generalized to the iterates of the consistency operator? Let’s look at each of these questions in turn.
8.2 Extensional operators
One conspicuous feature of Theorem 8.7 is that it concerns only monotone functions. Recall that Sacks’ question and Martin’s Conjecture concern degree-invariant functions, not order-preserving functions. Whereas monotone functions are analogues of order-preserving functions, extensional functions are analogues of degree-invariant functions. Indeed, extensional functions are merely functions that are invariant with respect to degrees in Lindenbaum algebras. So a strengthening of Theorem 8.7 to extensional functions would yield a more faithful proof-theoretic analogue of Martin’s Conjecture.
By a result of Shavrukov and Visser, Theorem 8.7 cannot be strengthened to cover extensional functions in this way. S. Friedman asked them, in conversation, whether “the consistency operator is, in some sense, the least way to strengthen theories.” In response, Shavrukov and Visser exhibited an extensional density function on the Lindenbaum algebra of
$\mathsf {PA}$
. Their result actually extends to all reasonable theories by a result of Kripke and Pour-El [Reference Kripke and Pour-El19].Footnote
23
Theorem 8.10 [Reference Shavrukov and Visser33]
For any reasonable theory T, the Lindenbaum algebra of T admits a recursive extensional density function.
Let me briefly explain the significance of this result. Rosser’s theorem yields a recursive density function on the Lindenbaum algebra of
$\mathsf {PA}$
, yet this function is not extensional.Footnote
24
Here is how Shavrukov and Visser find an extensional density function: To start, they introduced the following formula (note that
$\mathsf {Con}^2_T(\ulcorner \varphi \urcorner )=\mathsf {Con}_T(\ulcorner \varphi \wedge \mathsf {Con}_T(\ulcorner \varphi \urcorner ))=\mathsf {Con}\big (T+\varphi + \mathsf {Con}(T+\varphi )\big )$
):
The function
$\varphi \mapsto \mathsf {SV}(\varphi )$
is extensional in the sense that whenever
$\mathsf {PA}\vdash \varphi \leftrightarrow \psi $
then also
$\mathsf {PA}\vdash \mathsf {SV}(\varphi ) \leftrightarrow \mathsf {SV}(\psi )$
. This formula yields a recursive extensional density function on the Lindenbaum algebra of
$\mathsf {PA}$
. Indeed, we may define:
This is an extensional function in the sense that:
It is also a density function in the sense that for any
$\varphi $
and
$\psi $
such that
$\mathsf {PA}+\varphi \vdash \psi $
but
$\mathsf {PA}+\psi \nvdash \varphi $
:
-
1.
$\mathsf {PA}+ \varphi \vdash \mathsf {SV}^\star (\varphi ,\psi );$
-
2.
$\mathsf {PA}+ \mathsf {SV}^\star (\varphi ,\psi ) \nvdash \varphi ;$
-
3.
$\mathsf {PA}+\mathsf {SV}^\star (\varphi ,\psi ) \vdash \psi ;$
-
4.
$\mathsf {PA}+\psi \nvdash \mathsf {SV}^\star (\varphi ,\psi ).$
As a corollary we infer that the generalization of Theorem 8.7 to extensional functions is false. Indeed, the function
$\varphi \mapsto \mathsf {SV}^\star \big (\varphi \wedge \mathsf {Con}_{\mathsf {PA}}(\ulcorner \varphi \urcorner ), \varphi \big )$
is everywhere stronger than the identity but weaker than the consistency operator. Thus, it is neither as weak as the identity operator nor as strong as the consistency operator on a true cone. In particular, Theorem 8.10 shows that the function
$\varphi \mapsto \mathsf {SV}^\star \big (\varphi \wedge \mathsf {Con}_{\mathsf {PA}}(\ulcorner \varphi \urcorner ),\varphi \big )$
is an extensional function that is strictly stronger than the identity operator but strictly weaker than the consistency operator. Shavrukov and Visser wrote that this suggests a negative answer to S. Friedman’s question whether the consistency operator is “the least way to strengthen theories.”
8.3 Recursiveness and limit-recursiveness
Must
$\mathfrak {g}$
be recursive? Note that the restriction to recursive functions in the statement of Theorem 8.7 is analogous to the restriction to
$\mathsf {AD}$
in Martin’s Conjecture. In both cases, these restrict our attention to “simple” functions. Note that if we dropped the restriction to recursive functions in the statement of Theorem 8.7 we could easily find a counter-example using the Axiom of Choice. Yet there is a long distance in between recursive functions and arbitrary functions. Can Theorem 8.7 be strengthened by assuming only that
$\mathfrak {g}$
is arithmetic? Or by assuming only that
$\mathfrak {g}$
is
$0'$
-recursive?
In [Reference Walsh44] it is shown that the assumption that the function is recursive is necessary in the statement of Theorem 8.7. We do this by exhibiting a function f that is limit-recursive but not recursive which meets the other hypotheses of Theorem 8.7 but does not satisfy the conclusion. In particular, f oscillates between behaving like the identity operator and the consistency operator, without converging on either. To make this precise, let’s introduce some terminology.
Definition 8.11. Given a set
$\mathfrak {A}$
, we say that cofinally many true sentences belong to
$\mathfrak {A}$
if for every true
$\varphi $
there is a true
$\psi $
such that
$T+\psi \vdash \varphi $
and
$\psi \in \mathfrak {A}$
.
The motivation for the previous definition is this: If cofinally many true sentences belong to
$\mathfrak {A}$
, then the set of Lindenbaum degrees of members of
$\mathfrak {A}$
is cofinal in the ultrafilter of Lindenbaum degrees of true sentences.
Theorem 8.12 [Reference Walsh44]
There is a
$0'$
-recursive monotone
$\mathfrak {g}$
such that, for all
$\varphi $
,
$\mathfrak {g}(\varphi )$
is
$\Pi _1$
, and both of the following hold:
-
1. For cofinally many true
$\varphi $
:
$[\varphi \wedge \mathfrak {g}(\varphi )]_T = [\varphi ]_T.$
-
2. For cofinally many true
$\varphi $
:
$[\varphi \wedge \mathfrak {g}(\varphi )]_T = [\varphi \wedge \mathsf {Con}_T(\ulcorner \varphi \urcorner )]_T.$
By Gödel’s second incompleteness theorem,
$[\varphi ]_T\neq [\varphi \wedge \mathsf {Con}_T(\ulcorner \varphi \urcorner )]_T$
for T-consistent
$\varphi $
. So a function that oscillates between these two outputs cannot converge on either. That is, such a function can neither be as weak as the identity operator in the limit nor as strong as the consistency operator in the limit.
8.4 Iterates of consistency
For the rest of this section we will consider the extent to which the features that make the consistency operator canonical generalize to iterates of the consistency operator. We define iterates of consistency so that they are always finite extensions of the base theory T (that is, theories of the form
$T+\varphi $
for some sentence
$\varphi $
). To define our iterates of consistency, we fix an effective presentation
$\prec $
of a recursive ordinal. We informally define the iterates of
$\mathsf {Con}_T$
so that for any sentence
$\varphi $
:
-
•
$\mathsf {Con}_T^0(\ulcorner \varphi \urcorner ):=\top ;$
-
•
$\mathsf {Con}_T^{\alpha +1}(\ulcorner \varphi \urcorner ):=\mathsf {Con}_T(\ulcorner \varphi \wedge \mathsf {Con}_T^\alpha (\ulcorner \varphi \urcorner )\urcorner );$
-
•
$\mathsf {Con}_T^\lambda (\ulcorner \varphi \urcorner ):=\forall \alpha \prec \lambda \mathsf {Con}_T\big (\ulcorner \varphi \wedge \mathsf {Con}_T^\alpha (\ulcorner \varphi \urcorner )\urcorner \big )$
.
Strictly speaking, we define the iterates of
$\mathsf {Con}_T$
via Gödel’s fixed point lemma:
Remark 8.13. Let us reiterate that all of the iterates of the consistency operator we have defined are sentences. This is appropriate for the current investigation since we are interested in functions that induce functions on the Lindenbaum algebra of T. For instance, given our definition, for a limit
$\lambda $
,
$\mathsf {Con}^\lambda _T(\ulcorner \varphi \urcorner )$
is the sentence
$\forall \alpha \prec \lambda \mathsf {Con}_T\big (\ulcorner \varphi \wedge \mathsf {Con}_T^\alpha (\ulcorner \varphi \urcorner )\urcorner \big )$
. This clashes with the conventions adopted in some other papers, wherein
$\mathsf {Con}^\lambda _T(\ulcorner \varphi \urcorner )$
is used as a name for the infinite set
$\{ \mathsf {Con}^\alpha _T(\ulcorner \varphi \urcorner ) \mid \alpha \prec \lambda \}$
. For the same reason, it also comes apart from Turing’s definition of theories axiomatized by iterated consistency statements, as introduced in Section 4.
Does Theorem 8.7 generalize to iterates of the consistency operator along suitable ordinal notations?Footnote 25 An optimist might hope to prove the following.
Naïve Conjecture: Fix a suitable ordinal notation system
$\prec $
. Let
$\alpha \succ 0$
. Let
$\mathfrak {g}$
be recursive and monotone such that, for all
$\varphi $
,
$\mathfrak {g}(\varphi )$
is
$\Pi _1$
. Then one of the following holds:
-
1. There is a true cone
$\mathfrak {C}$
such that for all
$\varphi \in \mathfrak {C}$
,
$T+\varphi +\mathfrak {g}(\varphi ) \vdash \mathsf {Con}^\alpha _T(\ulcorner \varphi \urcorner ).$
-
2. For some
$\beta \prec \alpha $
, there is a true cone
$\mathfrak {C}$
such that for all
$\varphi \in \mathfrak {C}$
,
$T+\varphi +\mathsf {Con}^\beta _T(\ulcorner \varphi \urcorner )\vdash \mathfrak {g}(\varphi ).$
As the conjecture’s label suggests, the answer is negative. In fact, the proposed classification fails already at the very next step.
Theorem 8.14 [Reference Walsh47]
Fix a suitable ordinal notation system
$\prec $
. For every
$\alpha \succ 0$
, there is a recursive monotone
$\mathfrak {g}$
such that, for all
$\varphi $
,
$\mathfrak {g}(\varphi )$
is
$\Pi _1$
, and both of the following hold:
-
1. For cofinally many true
$\varphi $
:
$[\varphi \wedge \mathfrak {g}(\varphi )]_T = [\varphi \wedge \mathsf {Con}^\alpha _T(\ulcorner \varphi \urcorner )]_T.$
-
2. For cofinally many true
$\varphi $
:
$[\varphi \wedge \mathfrak {g}(\varphi )]_T=[\varphi \wedge \mathsf {Con}_T(\ulcorner \varphi \urcorner )]_T.$
This is a dramatic failure of the naïve conjecture. Indeed, for
$\alpha \succ 1$
and true
$\varphi $
,
$[\varphi \wedge \mathsf {Con}^\alpha _T(\ulcorner \varphi \urcorner )]_T \neq [\varphi \wedge \mathsf {Con}_T(\ulcorner \varphi \urcorner )]_T$
by Gödel’s second incompleteness theorem. So a function that oscillates between mimicking consistency and mimicking
$\alpha $
-iterated consistency will neither be as weak as the consistency operator on a true cone nor as strong as any non-trivial iterate of the consistency operator on a true cone.
8.5 More positive theorems
Theorem 8.14 rules out a straightforward generalization of Theorem 8.7 to the iterates of the consistency operator. Yet the counter-examples it produces still coincide with the iterates of the consistency operator on a non-trivial set of values. This is no coincidence. Indeed, every function that is, e.g., stronger than the consistency operator but as weak as the 2-iterated consistency operator must coincide with the latter on cofinally many true inputs. This latter claim generalizes upwards to all ordinal notations.
Theorem 8.15 [Reference Montalbán and Walsh26]
Let
$\mathfrak {g}$
be recursive and monotone. Let
$\prec $
be a suitable ordinal notation system and let
$\alpha $
be an ordinal notation from the field of
$\prec $
. Suppose that for all
$\varphi $
:
-
•
$T+\varphi + \mathsf {Con}^\alpha _T(\ulcorner \varphi \urcorner )\vdash \mathfrak {g}(\varphi )$
. -
• For all
$\beta \prec \alpha $
:-
–
$T+\varphi + \mathfrak {g}(\varphi )\vdash \mathsf {Con}^\beta (\ulcorner \varphi \urcorner )$
. -
–
$T+\varphi + \mathsf {Con}^\beta (\ulcorner \varphi \urcorner ) \nvdash \mathfrak {g}(\varphi ) $
.
-
Then for cofinally many true sentences
$\varphi $
:
Note that this rules out the possibility of recursive monotone functions with strength strictly between
$\mathsf {Con}_T^\alpha $
and
$\mathsf {Con}_T^{\alpha +1}$
. Note, moreover, that Theorem 8.15 includes no restriction to bounded functions.
Theorem 8.15 says that if the range of a monotonic function
$\mathfrak {g}$
is sufficiently constrained, then for some
$\varphi $
and some
$\alpha $
,
This property still holds even when these constraints on the range of
$\mathfrak {g}$
are relaxed considerably. Indeed, the main theorem of Montalbán and Walsh [Reference Montalbán and Walsh26] states that if the strength of any sufficiently nice function
$\mathfrak {g}$
is “bounded” by some iterate of the consistency operator, then
$\mathfrak {g}$
must somewhere coincide with an iterate of the consistency operator.
Theorem 8.16 [Reference Montalbán and Walsh26]
Let
$\mathfrak {g}$
be recursive and monotone such that, for all
$\varphi $
,
$\mathfrak {g}(\varphi )$
is
$\Pi _1$
. Let
$\prec $
be a suitable ordinal notation system and let
$\alpha $
be an ordinal notation from the field of
$\prec $
.
Suppose that for all
$\varphi $
,
$T+\varphi +\mathsf {Con}^\alpha _T(\ulcorner \varphi \urcorner )\vdash \mathfrak {g}(\varphi )$
. Then, for some
$\beta \preceq \alpha $
and some
$\varphi $
:
Theorem 8.16 was inspired by a recursion-theoretic result.
Theorem 8.17 [Reference Slaman and Steel35]
Assume
$\mathsf {AD}$
. Suppose
$f:2^\omega \to 2^\omega $
is order-preserving and increasing a.e. Then either:
-
(i)
$x^\alpha <_T f(x)$
on a cone. -
(ii) For some
$\beta \leq \alpha $
,
$f(x)\equiv _T x^\beta $
on a cone.
Theorem 8.17 is much stronger than Theorem 8.16 insofar as it achieves coincidence on a cone rather than mere coincidence at some non-trivial point. Theorem 8.14 rules out the possibility of strengthening Theorem 8.16 to secure coincidence on a cone. Yet other questions remain.
Question 8.18. Let
$\mathfrak {g}$
be recursive and monotone such that, for all
$\varphi $
,
$\mathfrak {g}(\varphi )$
is
$\Pi _1$
. Suppose that for all
$\varphi $
,
$T+\varphi +\mathsf {Con}^\alpha _T(\ulcorner \varphi \urcorner )\vdash \mathfrak {g}(\varphi )$
. Is there some true
$\varphi $
such that
The problem of formulating analogues of these results for reflection principles that are not
$\Pi _1$
-axiomatizable (such as 1-consistency) also remains open.
9 Reflection principles and ordinal analysis
In this section we turn from consistency to other uniform reflection principles. For definitions of these principles see Section 4.2.
Recall that when Beklemishev (see Section 5) calculates the proof-theoretic ordinal of T, he first axiomatizes the
$\Pi _{n+1}$
consequences of T in terms of iterated n-consistency statements over a weak base theory. Not all theories can be studied using such methods, since not all theories are
$\Pi _{n+1}$
equivalent to iterations of n-consistency. Nevertheless, “natural” theories seem to be
$\Pi _{n+1}$
equivalent to iterations of n-consistency along “natural” well-orderings. The problem of characterizing the “natural” well-orderings is just as difficult as the problem of characterizing the “natural” theories. So the situation at present is this: There are many specific examples of connections between natural theories and iterations of reflection principles, but the latter depend on choices of ordinal notation system, so it is not clear whether these are part of any general connection. To prove such a general connection, we would need to study iterations of reflection principles in a way that is independent of ordinal notation systems. This is the theme that we pursue in this section.
9.1 Descending sequences in first-order arithmetic
To motivate our approach to this question, let’s begin by discussing the apparent well-foundedness of natural theories by proof-theoretic strength. There is a contrast to note between ascending sequences of theories and descending sequences of theories. Sequences of consistent theories that are ascending in consistency strength are quite easy to define: just iterate a strength-increasing principle along a well-ordering. Descending sequences—by contrast—seem to require more ingenuity to define. Does this reflect some inherent complexity of the descending sequences of theories in proof-theoretic hierarchies? In this section we turn to results, inspired by Steel’s Theorem 6.8, to the effect that such descending sequence must be complex. Indeed, these results show that descending sequences in proof-theoretic hierarchies are either complex or contain elements that are unsound.
Gaifman asked whether there is a recursive sequence
$\langle T_0,T_1,\dots \rangle $
of consistent r.e. extensions of
$\mathsf {PA}$
such that for all n,
$T_n\vdash \mathsf {Con}(T_{n+1})$
. The answer is positive; however, there are none that are
$\mathsf {PA}$
-provably descending.
Theorem 9.1 (H. Friedman, Smoryński, and Solovay)
Both of the following hold:
-
1. There is a recursive sequence
$(T_n)_{n<\omega }$
of consistent r.e. extensions of
$\mathsf {PA}$
such that for each n,
$T_n\vdash \mathsf {Con}(T_{n+1}).$
-
2. There is no r.e. sequence
$(T_n)_{n<\omega }$
of consistent r.e. extensions of
$\mathsf {PA}$
such that
$\mathsf {PA}\vdash \forall x \mathsf {Pr}_{T_x}\big (\ulcorner \mathsf {Con}(T_{x+1})\urcorner \big )$
.Footnote
26
The second part of Theorem 9.1 takes a step towards the goal of this section. Indeed, it shows that any descending sequences in the consistency strength hierarchy over
$\mathsf {PA}$
must be complicated in the following sense: Either it is not r.e. or it cannot be proved within
$\mathsf {PA}$
to be a descending sequence.
The first part of Theorem 9.1 was improved by Visser.
Theorem 9.2 [Reference Visser42]
There is a recursive sequence
$(T_n)_{n<\omega }$
of consistent r.e. extensions of
$\mathsf {PA}$
such for each n,
$T_n$
proves each instance of the uniform reflection principle for
$T_{n+1}$
.
Recall that our goal in this section is to argue that descending sequences in reflection hierarchies are either complex or contain elements that are unsound. Visser’s result shows that—without some condition of uniform provability—no such results can be proved with respect to complexity alone. However, if we keep both complexity and unsoundness in mind, we can rule out descending sequences in reflection hierarchies, as the following result demonstrates:
Theorem 9.3 [Reference Pakhomov and Walsh27]
There is no recursively enumerable sequence
$(T_n)_{n\in \mathbb {N}}$
of
$\Sigma _2$
-sound extensions of
$B\Sigma _1$
such that, for each n,
$T_n$
proves the
$\Sigma _2$
-soundness of
$T_{n+1}$
.Footnote
27
Theorem 9.3 was inspired by an alternative proof, discovered by H. Friedman, of Steel’s Theorem 6.8 [Reference Friedman8, Reference Steel37]. For more recursion-theoretic applications of this technique see [Reference Lutz and Walsh22]. The adaptability of this proof arguably strengthens the analogy between the proof-theoretic and recursion-theoretic well-foundedness phenomena.
There are a few ways that one can imagine refining Theorem 9.3. In the first place, Theorem 9.1 concerns consistency but Theorem 9.3 concerns
$\Sigma _2$
-soundness. These results skip right over
$\Sigma _1$
-soundness. So one might wonder whether the case of
$\Sigma _1$
-soundness is more like Theorem 9.1 or Theorem 9.3. In fact, the situation with
$\Sigma _1$
-soundness is just like the situation with consistency.
Theorem 9.4 [Reference Pakhomov and Walsh27]
There exists a recursive sequence
$(T_n)_{n\in \mathbb {N}}$
of
$\Sigma _1$
-sound extensions of elementary arithmetic such that, for each n,
$T_n$
proves the
$\Sigma _1$
-soundness of
$T_{n+1}$
.
This demonstrates that Theorem 9.3 is the best possible theorem with respect to the syntactic complexity classes it concerns.
In the second place, Theorem 9.3 is not clearly optimal with respect to its complexity assumptions. In particular, Theorem 9.3 shows that descending sequences in a certain proof-theoretic hierarchy cannot be simple in the sense that they cannot be recursively enumerable. However, the following question remains:
Question 9.5. Is there a limit-recursive sequence
$(T_n)_{n\in \mathbb {N}}$
of
$\Sigma _2$
-sound extensions of
$B\Sigma _1$
such that, for each n,
$T_n$
proves the
$\Sigma _2$
-soundness of
$T_{n+1}$
?
9.2 Descending sequences in second-order arithmetic
The method introduced to prove Theorem 9.3 easily adapts to rule out all descending sequences of sufficiently sound theories in another hierarchy of proof-theoretic strength, namely, the one given by
$\Pi ^1_1$
-reflection. Recall that a theory is
$\Pi ^1_1$
-sound if all its
$\Pi ^1_1$
consequences are true.
Theorem 9.6 [Reference Pakhomov and Walsh27]
There is no sequence
$(T_n)_{n\in \mathbb {N}}$
of
$\Pi ^1_1$
-sound r.e. extensions of
$\mathsf {ACA}_0$
such that, for each n,
$T_n$
proves the
$\Pi ^1_1$
-soundness of
$T_{n+1}$
.
Theorem 9.6 rules out descending sequences of theories according to a certain metric of logical strength. It is worth pausing to note that when one proves that U proves the consistency of T, one often also establishes the stronger fact that U proves the
$\Pi ^1_1$
-soundness of T. Consider, for instance, the construction of
$\omega $
-models of theories via forcing. Moreover, of course, many natural theories are
$\Pi ^1_1$
-sound extensions of
$\mathsf {ACA}_0$
. Accordingly, Theorem 9.6 delivers some insight into the phenomenon that the natural axiomatic theories are pre-well-ordered by consistency strength.
The following result provides an alternate perspective on the well-foundedness phenomenon.
Theorem 9.7 [Reference Walsh45]
There is no sequence
$(T_n)_{n<\omega }$
of
$\Pi ^1_1$
-sound and
$\Sigma ^1_1$
-definable extensions of
$\Sigma ^1_1\text {-}\mathsf {AC}_0$
such that for each n,
$T_n \vdash \mathsf {RFN}_{\Pi ^1_1}(T_{n+1})$
.
Note that whereas Theorem 9.6 concerns r.e. theories, Theorem 9.7 concerns all
$\Sigma ^1_1$
-definable theories. On the other hand, Theorem 9.6 covers extensions of
$\mathsf {ACA}_0$
whereas Theorem 9.7 applies only to extensions of the stronger system
$\Sigma ^1_1\text {-}\mathsf {AC}_0$
. So Theorem 9.7 is neither stronger nor weaker than Theorem 9.6 but rather incomparable with it.
It is notable that Theorems 9.6 and 9.7 have totally different proofs. Theorem 9.6 is proved by appealing to Gödel’s second incompleteness, but Theorem 9.7 is proved using concepts from ordinal analysis. One additional benefit of Theorem 9.7 is that it yields, as a corollary, a version of Gödel’s second incompleteness theorem.
Theorem 9.8 [Reference Walsh45]
There is no
$\Pi ^1_1$
-sound
$\Sigma ^1_1$
-definable extension T of
$\Sigma ^1_1\text {-}\mathsf {AC}_0$
such that
$T \vdash \mathsf {RFN}_{\Pi ^1_1}(T)$
.
For comparison, Gödel’s original theorem states that no sufficiently strong
$\Pi ^0_1$
-sound
$\Sigma ^0_1$
-definable theory T proves
$\mathsf {RFN}_{\Pi ^0_1}(T)$
.Footnote
28
9.3 Ordinal analysis
Recall that in ordinal analysis, values called proof-theoretic ordinals are systematically assigned to theories. According to conventional wisdom, proof-theoretic ordinals measure the proof-theoretic strength of theories. We will now turn to results that vindicate this conventional wisdom. Indeed, Theorem 9.6 makes it possible to rank theories according to their strength in terms of provable
$\Pi ^1_1$
-reflection. This leads to the following definition.
Definition 9.9. The reflection rank of T is the rank of T in the restriction of the relation
$\{ (T,U) : U\vdash \mathsf {RFN}_{\Pi ^1_1}(T) \}$
to
$\Pi ^1_1$
-sound extensions of
$\mathsf {ACA}_0$
.Footnote
29
For most theories T, this “reflection rank” of T equals the proof-theoretic ordinal of T. We make the notion of “most theories” precise—just as in Martin’s Conjecture—by restricting our attention to an appropriate “cone” of theories. In particular, we will focus on extensions of
$\mathsf {ACA}_0^+$
, which is itself an axiomatic theory extending
$\mathsf {ACA}_0$
with the axiom “every set is contained in an
$\omega $
-model of
$\mathsf {ACA}_0$
.”
Theorem 9.10 [Reference Pakhomov and Walsh27, Reference Walsh48]
For any
$\Pi ^1_1$
-sound extension T of
$\mathsf {ACA}_0^+$
, the reflection rank of T equals the
$\Pi ^1_1$
proof-theoretic ordinal of T.Footnote
30
Theorem 9.10 makes contact with the well-foundedness phenomenon but not the pre-linearity phenomenon. Now we will turn to results that make contact with both of these phenomena.
9.3.1 Two proof-theoretic orderings
In the introduction, we introduced the consistency strength ordering modulo a fixed theory B. In different contexts, logicians choose different theories for the base theory B. Common choices include
$\mathsf {EA}$
,
$\mathsf {PRA}$
,
$\mathsf {PA}$
, and
$\mathsf {ZFC}$
. In this section, we have to choose a specific base theory. Note that since
$\mathsf {ACA}_0$
is conservative over
$\mathsf {PA}$
, consistency strength over
$\mathsf {ACA}_0$
and consistency strength over
$\mathsf {PA}$
are actually the same. Thus, we refine our definition of consistency strength as follows.
Definition 9.11.
$T\leq _{\mathsf {Con}} U \Leftrightarrow \mathsf {ACA}_0 \vdash \mathsf {Con}(U) \to \mathsf {Con}(T).$
Another common way to compare the strength of theories is to compare their
$\Pi ^0_1$
consequences.
Definition 9.12.
$T\subseteq _{\Pi ^0_1} U \Leftrightarrow $
For every
$\varphi \in \Pi ^0_1$
, if
$T\vdash \varphi $
then
$U\vdash \varphi $
.
It is often claimed that, when we restrict our attention to “natural” theories, the
$\subseteq _{\Pi ^0_1}$
ordering coincides with relative consistency strength [Reference Steel40]. However, these notions do not coincide in general. For a counter-example, consider any consistent T and let
$R_T$
be the Rosser sentence for T. Then
$T+R_T\not \subseteq _{\Pi ^0_1} T$
but
$\mathsf {ACA}_0\vdash \mathsf {Con}(T) \to \mathsf {Con}(T+R_T)$
, i.e.,
$T+R_T \leq _{\mathsf {Con}}T$
.
The orderings
$\leq _{\mathsf {Con}}$
and
$\subseteq _{\Pi ^0_1}$
are neither pre-linear nor pre-well-founded. That is, in both orderings, there are incomparable elements and infinite descending sequences. Remarkably, both of these features disappear when we restrict our attention to the natural theories. The restriction of
$\leq _{\mathsf {Con}}$
to natural theories coincides with the restriction of
$\subseteq _{\Pi ^0_1}$
to natural theories, and these restrictions are pre-well-orderings.
Does this phenomenon we observe when we restrict to natural theories reflect some mathematical reality or is it an illusion? We will present results in the rest of this section that speak in favor of the former answer. Recall that in Sections 4 and 5 we discussed the possibility that the natural theories can be characterized in terms of the natural well-orderings.Footnote
31
Yet we do not have a convincing mathematical definition of the “natural” well-orderings. In the rest of this section we will introduce analogues of the aforementioned consistency strength and
$\Pi ^0_1$
inclusion orderings. These analogues move us into a context that abstracts away from the problem of pathological ordinal notation systems. These analogues of the classical notions of proof-theoretic strength actually coincide and actually pre-well-order theories. This suggests that the problem of pathological ordinal notation systems truly is the impediment to pre-well-ordering in the original context.
9.3.2 An analogue of
$\subseteq _{\Pi ^0_1}$
First, let’s introduce our analogue of
$\subseteq _{\Pi ^0_1}$
. The key idea is that we replace the notion of provability with the notion of provability in the presence of an oracle for
$\Gamma $
truth. A theory T will prove a sentence
$\varphi $
in the presence of such an oracle if
$T+\psi $
proves
$\varphi $
for some true
$\Gamma $
sentence
$\psi $
. We introduce the following notation to capture this idea.
Definition 9.13. We write
$T\vdash ^{\Gamma }\varphi $
if there is a true
$\psi \in \Gamma $
such that
$T+\psi \vdash \varphi $
.
Just as we focus on provability in the presence of an oracle, we focus not on inclusion of
$\Gamma $
theorems but on inclusion of
$\Gamma $
theorems in the presence of an oracle. We introduce the following notation for this notion.
Definition 9.14. We write
$T\subseteq ^{\Gamma }_{\Delta } U$
if for all
$\varphi \in \Delta $
, if
$T\vdash ^{\Gamma } \varphi $
then
$U\vdash ^{\Gamma } \varphi $
.
The only theories we will consider are extensions of
$\mathsf {ACA}_0$
. Since
$\mathsf {ACA}_0$
proves every true
$\Sigma ^0_1$
sentence,Footnote
32
we have the following equivalence:
Thus, for all the theories we consider, we have the following equivalence:
The analogue of
$\subseteq ^{\Sigma ^0_1}_{\Pi ^0_1}$
that we are interested in is
$\subseteq ^{\Sigma ^1_1}_{\Pi ^1_1}$
, where:
Note that the latter results from the former merely by replacing
$\Sigma ^0_1$
with
$\Sigma ^1_1$
and
$\Pi ^0_1$
with
$\Pi ^1_1$
.
9.3.3 An analogue of
$\leq _{\mathsf {Con}}$
Before stating the analogue of
$\leq _{\mathsf {Con}}$
, let’s note that (provably in
$\mathsf {ACA}_0$
) a theory is consistent just in case all of its
$\Pi ^0_1$
consequences are true.Footnote
33
Accordingly, we have the following equivalence:
where
$\mathsf {RFN}_{\Pi ^0_1}(T)$
is a formula expressing that all of T’s
$\Pi ^0_1$
consequences are true.
We thus generalize the notion of consistency strength as follows.
Definition 9.15. Fixing a formula
$\mathsf {RFN}_\Delta (T)$
expressing
$\Delta $
-soundness of T in
$\mathsf {ACA}_0$
:
We will be particularly interested in
$\Pi ^1_1$
-soundness, where a theory is
$\Pi ^1_1$
-sound just in case all its
$\Pi ^1_1$
consequences are true. We can formalize the
$\Pi ^1_1$
-soundness of T with a single sentence in
$\mathsf {ACA}_0$
:
Note that
$\mathsf {Pr}_T$
here picks out ordinary provability from T, not provability in the presence of an oracle. Hence, we have the following analogue of
$\leq _{\mathsf {Con}}$
:
Note once again that the definition of
$\leq ^{\Sigma ^1_1}_{\mathsf {RFN_{\Pi ^1_1}}}$
results from our equivalence characterizing
$\leq _{\mathsf {Con}}$
merely by replacing
$\Sigma ^0_1$
with
$\Sigma ^1_1$
and
$\Pi ^0_1$
with
$\Pi ^1_1$
.
9.3.4 Connecting the orderings via ordinal analysis
Recall that, according to conventional wisdom, calculating the proof-theoretic ordinal of a theory is a means of measuring its logical strength. However, note that the ordering of theories induced by ordinal analysis:
is a pre-well-ordering since the ordinals are well-ordered. Hence,
$\leq _{\mathsf {WF} }$
cannot strictly coincide with either
$\leq _{\mathsf {Con}}$
or
$\subseteq _{\Pi ^0_1}$
.
Nevertheless, in the presence of an oracle for
$\Sigma ^1_1$
truths, we can vindicate the common wisdom that ordinal analysis is a means of measuring the logical strength of theories.
Theorem 9.16 [Reference Walsh46]
For
$\Pi ^1_1$
-sound arithmetically definable T and U extending
$\mathsf {ACA}_0:$
Theorem 9.16 cannot be extended to
$\Sigma ^1_1$
-definable theories. Since the ordinals are well-ordered, this immediately yields the following corollary.
Corollary 9.16.1. The relations
$\subseteq _{\Pi ^1_1}^{\Sigma ^1_1}$
and
$\leq ^{\Sigma ^1_1}_{\mathsf {RFN}_{\Pi ^1_1}}$
pre-well-order the
$\Pi ^1_1$
-sound arithmetically definable extensions of
$\mathsf {ACA}_0$
.
Note that in the statement of Theorem 9.16 and its corollary, we have dropped the non-mathematical quantification over “natural” theories. Jeon and Walsh [Reference Jeon and Walsh14] extend Theorem 9.16 to set-theoretic reflection principles using a generalization of ordinal analysis due to [Reference Pohlers29].
10 Conclusions
Why are the natural axiomatic theories pre-well-ordered by proof-theoretic strength? We have described a few results that point towards an answer, interweaving the three themes discussed in Section 7. The emerging picture is that natural theories are proof-theoretically equivalent to iterated reflection principles. Moreover, ordinal analysis is a means of calculating the ranks of natural theories in these hierarchies. I write “hierarchies” since there will be different hierarchies attending different notions of proof-theoretic strength, depending, e.g., on the reflection principle they are based on. One may have to adapt the notion of ordinal analysis to investigate these different notions of proof-theoretic strength. The claim that ordinal analysis is a means of calculating the ranks of natural theories should be refined by further developing the systematic connections between iterated reflection and proof-theoretic ordinals.
Acknowledgments
Thanks to Antonio Montalbán, John Steel, and anonymous referee for comments.