1 Introduction
We begin with a precise formulation of the Kruskal–Friedman theorem for trees. Let
$\mathcal T[\nu ]$
be the set of finite rooted trees with vertex labels from a given ordinal
$\nu $
(identified with the set of its predecessors). Elements of
$\mathcal T[\nu ]$
will be written in the form
$\alpha \star [t_0,\ldots ,t_{n-1}]$
, where
$\alpha <\nu $
is the label of the root and
$t_0,\ldots ,t_{n-1}$
are the immediate subtrees. We ignore the order of subtrees but not their multiplicities. To determine a partial order on
$\mathcal T[\nu ]$
by recursion, we declare that

holds precisely if we have
$\alpha \leq \beta $
and one of the following clauses applies:
-
(i) there is an injective function
$f:\{0,\ldots ,m-1\}\to \{0,\ldots ,n-1\}$ such that we have
$s_i\leq _{\mathcal T[\nu ]}t_{f(i)}$ for all
$i<m$ ,
-
(ii) we have
$s\leq _{\mathcal T[\nu ]}t_j$ for some
$j<n$ .
Note that clause (i) corresponds to an embedding that sends the root to the root and subtrees of s into different subtrees of t, which means that infima with respect to the tree order are preserved. In clause (ii), the root of s is sent to an internal node of t. All labels below this node must be at least
$\alpha $
, as the above condition
$\alpha \leq \beta $
applies recursively. If
$\alpha _i$
is the root label of
$s_i$
in clause (i), it follows that all labels below the image of
$s_i$
in
$t_{f(i)}$
are at least
$\alpha _i$
(while we may have
$\alpha \leq \beta <\alpha _i$
at the root of t). Recursively, the condition propagates to all ‘gaps’ that the image of s leaves in t. Hence our order on
$\mathcal T[\nu ]$
coincides with H. Friedman’s notion of strong gap embeddability, as determined by [Reference Kříž22, Condition (1.2.5)] (see also [Reference Simpson, Harrington, Morley, Sčědrov and Simpson30]).
By the Kruskal–Friedman theorem, we shall mean the statement that
$\mathcal T[\nu ]$
is a well partial order (
$\mathsf {WPO}$
) for any ordinal
$\nu $
, i.e., that any infinite sequence
$t_0,t_1,\ldots $
in
$\mathcal T[\nu ]$
involves an inequality
$t_i\leq _{\mathcal T[\nu ]}t_j$
for some
$i<j$
. Note that
$\nu =1$
amounts to the case without labels, which was proved by J. Kruskal [Reference Kruskal24]. The extension to all finite
$\nu <\omega $
is due to Friedman (see the presentation by S. Simpson [Reference Simpson, Harrington, Morley, Sčědrov and Simpson30]). It has been used in N. Robertson and P. Seymour’s proof of their graph minor theorem (see [Reference Friedman, Robertson, Seymour and Simpson14]). The Kruskal–Friedman theorem for general
$\nu $
was conjectured by Friedman and proved by I. Kříž [Reference Kříž22]. For a somewhat different version of the gap condition, a proof of the general case was given by L. Gordeev [Reference Gordeev18].
The original Kruskal theorem is unprovable in a fairly strong axiom system
$\mathsf {ATR}_0$
that is associated with ‘predicative mathematics’, while the extension to all
$\nu <\omega $
is unprovable in the even stronger system
$\Pi ^1_1\text {-}\mathsf {CA}_0$
, as shown by Friedman [Reference Simpson, Harrington, Morley, Sčědrov and Simpson30]. Together with Robertson and Seymour [Reference Friedman, Robertson, Seymour and Simpson14], the latter has deduced that
$\Pi ^1_1\text {-}\mathsf {CA}_0$
cannot prove the graph minor theorem (not even for bounded tree width). This is a particularly striking manifestation of the incompleteness phenomenon from Gödel’s theorems, for the following two reasons: First, the axiom system
$\Pi ^1_1\text {-}\mathsf {CA}_0$
is strong enough to prove the vast majority of mathematical theorems that have been analyzed in the research program of reverse mathematics (see [Reference Simpson32] for a comprehensive introduction). Secondly, the graph minor theorem has been described as one of “the deepest theorems that mathematics has to offer” [Reference Diestel7, Chapter 12]. It is also of great importance for theoretical computer science, as it guarantees that certain tasks can be solved by polynomial time algorithms (see, e.g., [Reference Robertson and Seymour29]).
For Kruskal’s original theorem, we have two proofs that are optimal in different respects: On the one hand, C. Nash-Williams [Reference Nash-Williams26] has given an extremely elegant proof, for which he introduced his influential notion of ‘minimal bad sequence’. This proof, however, uses axioms that are stronger than necessary. Specifically, A. Marcone [Reference Marcone, Hodges, Hyland, Steinhorn and Truss25] has shown that a fundamental existence principle for minimal bad sequences is equivalent to
$\Pi ^1_1$
-comprehension, the main axiom of
$\Pi ^1_1\text {-}\mathsf {CA}_0$
. This axiom is strictly stronger than Kruskal’s theorem. On the other hand, M. Rathjen and A. Weiermann [Reference Rathjen and Weiermann28] have analyzed a proof via ‘reifications’, which uses just the right axioms but is less straightforward. The tension between different forms of optimality—elegance and the desire to use the weakest possible axioms—is resolved in a recent paper by Rathjen, Weiermann, and the present author [Reference Freund, Rathjen and Weiermann13]. There we investigate a uniform version of Kruskal’s theorem, which extends the original result from trees to general recursive data types (see below for details). By the main result of [Reference Freund, Rathjen and Weiermann13], this uniform Kruskal theorem is equivalent to
$\Pi ^1_1$
-comprehension (over
$\mathsf {RCA}_0$
with the chain antichain principle). So in the uniform case, the elegant proof via minimal bad sequences uses just the right axioms. This provides formal support for the intuition that Nash-Williams has given ‘the canonical’ proof of Kruskal’s theorem. In the present article, we show that a uniform Kruskal–Friedman theorem is equivalent to
$\Pi ^1_1$
-transfinite recursion (explained below). We also show that both are equivalent to a minimality principle for bad sequences under the gap condition, which Kříž [Reference Kříž22] has established in proving the Kruskal–Friedman theorem for trees.
Let us now recall the terminology from [Reference Freund, Rathjen and Weiermann13] that is needed to state the uniform Kruskal–Friedman theorem. To motivate the somewhat abstract definitions, we will use the case of trees as a running example. First, a map
$f:X\to Y$
between partial orders is called a quasi embedding if it is order reflecting, i.e., if
$f(x)\leq _Yf(x')$
implies
$x\leq _X x'$
. It is called an embedding if it is also order preserving, i.e., if the converse implication holds as well. Let us write
$\mathsf {PO}$
for the category with the partial orders as objects and the quasi embeddings as morphisms. We say that a functor
$W:\mathsf {PO}\to \mathsf {PO}$
preserves embeddings if
$W(f):W(X)\to W(Y)$
is an embedding whenever the same holds for
$f:X\to Y$
. By
$[X]^{<\omega }$
we denote the set of finite subsets of a given set X. To make the construction functorial, we declare that
$f:X\to Y$
maps to
$[f]^{<\omega }:[X]^{<\omega }\to [Y]^{<\omega }$
with
$[f]^{<\omega }(a):=\{f(x)\,|\,x\in a\}$
. Let us also agree to write
$\operatorname {rng}(f)=\{f(x)\,|\,x\in X\}$
for the range or image. The forgetful functor from partial orders to their underlying sets will be left implicit. Conversely, we will often consider subsets as suborders. We can now recall a central definition from [Reference Freund, Rathjen and Weiermann13], which adapts J.-Y. Girard’s [Reference Girard15] notion of dilator on linear orders.
Definition 1.1. A
$\mathsf {PO}$
-dilator is a functor
$W:\mathsf {PO}\to \mathsf {PO}$
that preserves embeddings and admits a natural transformation
$\operatorname {supp}:W\Rightarrow [\cdot ]^{<\omega }$
such that the following ‘support condition’ holds: for any embedding
$f:X\to Y$
and any
$\sigma \in W(Y),$
we have

We call W a
$\mathsf {WPO}$
-dilator if
$W(X)$
is a
$\mathsf {WPO}$
whenever X is one.
Concerning the support condition, we point out that the converse implication is automatic by naturality, as
$\sigma =W(f)(\sigma _0)$
entails

If the required transformation exists, then
$\operatorname {supp}_Y(\sigma )$
is uniquely determined as the smallest
$X\in [Y]^{<\omega }$
such that we have
$\sigma \in \operatorname {rng}(W(f))$
for the inclusion
$f:X\hookrightarrow Y$
. To give an example that will be relevant for the case of trees, we define
$W(X)$
as the set of finite multisets
$[x_0,\ldots ,x_{n-1}]$
with entries
$x_i\in X$
. Let us declare that

holds precisely if there is an injective function
$f:\{0,\ldots ,m-1\}\to \{0,\ldots ,n-1\}$
with
$x_i\leq _X y_{f(i)}$
for all
$i<m$
. Note that the transformation
$X\mapsto W(X)$
preserves
$\mathsf {WPO}$
s, due to Higman’s lemma. We obtain a
$\mathsf {WPO}$
-dilator by setting

For subsets a and b of a partial order X, we shall abbreviate

We use analogous notation with
$<$
at the place of
$\leq $
. In the case of singletons, we write
$x\leq ^{\operatorname {fin}}_X b$
and
$a\leq ^{\operatorname {fin}} y$
rather than
$\{x\}\leq ^{\operatorname {fin}}_X b$
and
$a\leq ^{\operatorname {fin}}_X\{y\}$
, respectively. The multiset construction that we have just described has the following property.
Definition 1.2. A
$\mathsf {PO}$
-dilator W is called normal if we have

for any partial order X and all elements
$\sigma ,\tau \in W(X)$
.
In the case of linear orders, the given condition ensures that normal dilators induce normal functions on the ordinals (essentially due to P. Aczel [Reference Aczel1]; see also Girard’s [Reference Girard15] notion of flower and the discussion in [Reference Freund and Rathjen11]). There is no apparent connection with our use of normality, which is explained below. The following notion will be at the heart of our uniform Kruskal–Friedman theorem.
Definition 1.3. For an ordinal
$\nu $
and a normal
$\mathsf {PO}$
-dilator W, a
$\nu $
-Kruskal fixed point of W consists of a partial order X and a bijection
$\kappa :\nu \times W(X)\to X$
with

We say that
$(X,\kappa )$
is initial if any other
$\nu $
-Kruskal fixed point
$(Y,\pi )$
of W admits a unique quasi embedding
$f:X\to Y$
such that the diagram

commutes, where
$\nu \times W(f)$
denotes the map
$(\alpha ,\sigma )\mapsto (\alpha ,W(f)(\sigma ))$
.
Let us remark that the unique f in the definition is always an embedding, due to the last claim in Theorem 2.8 below. When W is the multiset construction from above, a
$\nu $
-Kruskal fixed point is given by the set
$\mathcal T[\nu ]$
of finite rooted trees with vertex labels from
$\nu $
and the function

Here the given value of
$\kappa $
is the tree with root label
$\alpha $
and immediate subtrees
$t_i$
. One should observe that the equivalence in Definition 1.3 coincides with the recursive characterization of gap embeddability via clauses (i) and (ii) above. The given fixed point is initial, since each finite tree can be generated by finitely many applications of the function
$\kappa $
(starting with the empty multiset to generate leaves). Formally, this can be confirmed via condition (ii) in Theorem 2.8. Let us point out that the example motivates all features of our abstract definitions: Functoriality is used to state the universal property that captures the intuitive idea of a recursively generated structure. Supports provide a general notion of immediate subtree, not least in the equivalence in Definition 1.3. Concerning normality, we first note that no tree s can be embedded into a tree t of smaller height. This can be verified by induction over the recursive definition of embeddability, as given at the beginning of the present article. In clause (ii) of that definition, the induction hypothesis will ensure that each subtree
$s_i$
of s has at most the height of some subtree
$t_{f(i)}$
of t. The role of normality is to provide the function f on indices. Lemma 2.4 below is a generalization of the given argument.
In the next section, we will present a construction that yields an initial
$\nu $
-fixed point of any normal
$\mathsf {PO}$
-dilator. This fixed point must be isomorphic to any other initial
$\nu $
-fixed point of the same
$\mathsf {PO}$
-dilator, by a standard argument about universal properties (see the paragraph after Corollary 2.9 below). Let us now give a precise definition of an expression that was used above: by a recursive data type with labels from
$\nu $
, we shall mean the initial
$\nu $
-fixed point of some normal
$\mathsf {WPO}$
-dilator W (note the requirement that W preserves
$\mathsf {WPO}$
s). As demonstrated by our running example, the
$\nu $
-labelled trees with gap condition arise as the special case where W is the multiset construction. The following asserts that the Kruskal–Friedman theorem holds not only for trees but for general recursive data types.
Theorem 1.4 (Uniform Kruskal–Friedman Theorem for
$\nu $
)
When W is a normal
$\mathsf {WPO}$
-dilator, the initial
$\nu $
-Kruskal fixed point of W is a
$\mathsf {WPO}$
.
In addition to proving the theorem, we aim to analyze it from the viewpoint of reverse mathematics. As a precondition, we need to explain how our definitions are to be formalized in that framework. The crucial observation is that any
$\mathsf {PO}$
-dilator is determined by its restriction to the category of finite partial orders. For the case of dilators on linear orders, the analogous result is due to Girard [Reference Girard15]. It rests on the idea that any comparison
$\sigma _0\leq _{W(X)}\sigma _1$
is already determined on the finite suborder
$a:=\bigcup _{i\leq 1}\operatorname {supp}_X(\sigma _i)$
. Indeed, the support condition from Definition 1.1 allows us to write
$\sigma _i=W(\iota )(\sigma _i')$
for the inclusion
$\iota :a\hookrightarrow X$
. Since
$W(\iota )$
is an embedding,
$\sigma _0\leq _{W(X)}\sigma _1$
is equivalent to
$\sigma _0'\leq _{W(a)}\sigma _1'$
. This argument also reveals why Definition 1.1 includes the condition that
$\mathsf {PO}$
-dilators preserve embeddings. In fact, some readers may wonder why quasi embeddings are considered at all. The reason is that a linearization of an order Y can be represented by a surjective quasi embedding
$f:X\to Y$
where X is linear. For the purpose of formalization, the point is that the restrictions of
$\mathsf {PO}$
-dilators to finite orders are sets rather than proper classes. To represent them in reverse mathematics, we presume that these restrictions are countable and indeed coded by subsets of
$\mathbb N$
. In [Reference Freund, Rathjen and Weiermann13] it is carefully shown how all arguments can be carried out on the level of representations. The latter are not made explicit in the present article, in order to improve readability.
As we work in the framework of reverse mathematics, we shall identify ordinals with countable well orders. The principle of
$\Pi ^1_1$
-recursion along an ordinal
$\nu $
allows us to define hierarchies
$\langle Y_\alpha \,|\,\alpha <\nu \rangle $
by recursive clauses

where
$\varphi $
must be a
$\Pi ^1_1$
-formula (i. e., of the form
$\forall X\!\subseteq \!\mathbb N\,\forall /\exists \, x_1\!\in \!\mathbb N\ldots \forall /\exists \, x_k\!\in \!\mathbb N.\,\theta $
with no quantifiers in
$\theta $
). By
$\Pi ^1_1$
-transfinite recursion, we shall mean the statement that
$\Pi ^1_1$
-recursion is available along any (countable) ordinal. For a more detailed discussion of transfinite recursion in reverse mathematics, we refer to [Reference Simpson32]. Finally, we are ready to state our main result. Concerning the following statement (i), we note that
$\mathsf {RCA}_0$
proves the existence and uniqueness of initial
$\nu $
-Kruskal fixed points (but not the fact that they are
$\mathsf {WPO}$
s), as shown in the next section.
Theorem 1.5. Over
$\mathsf {RCA}_0$
extended by the chain antichain principle (
$\mathsf {CAC}$
), the following are equivalent for any infinite ordinal
$\nu $
:
-
(i) the uniform Kruskal–Friedman theorem for
$\nu $ ,
-
(ii)
$\Pi ^1_1$ -recursion along
$\nu $ .
The restriction to infinite
$\nu $
is inherited from [Reference Freund and Rathjen12], where it saves a tedious case distinction. It seems likely that the present theorem and the result from [Reference Freund and Rathjen12] extend to all finite
$\nu $
, but this has not been verified. The important case of
$\nu =1$
is the main result of [Reference Freund, Rathjen and Weiermann13] (with [Reference Freund8, Reference Freund9] replacing [Reference Freund and Rathjen12]). The chain antichain principle in the base theory of Theorem 1.5 ensures that different definitions of
$\mathsf {WPO}$
are equivalent. Specifically, we adopt the definition that X is a
$\mathsf {WPO}$
if any infinite sequence
$x_0,x_1,\ldots \subseteq X$
involves an inequality
$x_i\leq _X x_j$
for some
$i<j$
. In the presence of
$\mathsf {CAC}$
, one may equivalently ask for a strictly increasing
$f:\mathbb N\to \mathbb N$
such that
$x_{f(i)}\leq _X x_{f(i+1)}$
holds for all
$i\in \mathbb N$
(see [Reference Cholak, Marcone and Solomon6] for a detailed analysis). Note that
$\mathsf {CAC}$
follows from Ramsey’s theorem for pairs, which is an extremely weak consequence of our statement (ii). It is somewhat surprising that no proof of
$\mathsf {CAC}$
from statement (i) has been found so far.Footnote
1
According to Lemma 4.2 of the present article,
$\mathsf {CAC}$
follows when (i) is available for all ordinals. This improves the base theory in the following (where finite
$\nu $
may be included due to Corollary 2.10).
Corollary 1.6. The following are equivalent over
$\mathsf {RCA}_0$
:
-
(i) the uniform Kruskal–Friedman theorem for all
$\nu $ ,
-
(ii)
$\Pi ^1_1$ -transfinite recursion.
In order to obtain another conclusion of interest, we recall that the following statement (ii) is equivalent to
$\Pi ^1_1$
-recursion along
$\omega $
(see, e.g., [Reference Freund and Rathjen12, Section 9]).
Corollary 1.7. The following are equivalent over
$\mathsf {RCA}_0+\mathsf {CAC}$
:
-
(i) the uniform Kruskal–Friedman theorem for
$\nu =\omega $ ,
-
(ii) every subset of
$\mathbb N$ is contained in a countable
$\beta $ -model of
$\Pi ^1_1$ -comprehension.
In the rest of this introduction, we summarize how Theorem 1.5 will be proved. Let us first observe that the functions
$\kappa :\nu \times W(X)\to X$
from Definition 1.3 are order preserving, if we agree that
$(\alpha ,\sigma )\leq (\beta ,\tau )$
is equivalent to the conjunction of
$\alpha \leq \beta $
and
$\sigma \leq \tau $
. Now consider a dilator D on linear orders. To make
$\nu \times D(Z)$
linear, we add that
$(\alpha ,\sigma )\leq (\beta ,\tau )$
holds whenever we have
$\alpha <\beta $
. But then we cannot expect to get an order preserving map
$\psi :\nu \times D(Z)\to Z$
, as the domain may always have larger order type than the range. One way out is to allow
$\psi $
to be partial. If
$\psi $
is bijective on its domain, we can represent it by its total inverse

In [Reference Freund and Rathjen12, Definition 1.4] we have formulated a condition which ensures that the range of
$\pi $
is large. If
$\pi $
satisfies this condition, it is called a
$\nu $
-collapse of D. The main result of [Reference Freund and Rathjen12] shows that
$\Pi ^1_1$
-recursion along
$\nu $
is equivalent to the statement that any dilator D on (linear) well orders admits a
$\nu $
-collapse for some well order Y. We now argue that this last statement follows from (i) of Theorem 1.5. Given a dilator D, one can find a normal
$\mathsf {WPO}$
-dilator W that admits a natural family of order reflecting maps
$\eta _X:D(X)\to W(X)$
, one for each linear order X. This is a non-trivial result established in [Reference Freund, Rathjen and Weiermann13]. As in the case of partial orders, it is not hard to construct a
$\nu $
-collapse
$\pi :Y\to \nu \times D(Y)$
for some linear order Y. The crucial task is to show that the latter is well founded. Given (i) of Theorem 1.5, the
$\nu $
-Kruskal fixed point Z of W is a
$\mathsf {WPO}$
. We will construct an order reflecting function
$f:Y\to Z$
(i. e., a partial linearization of Z by Y) such that

is a commutative diagram. Given that Z is a
$\mathsf {WPO}$
, the map f ensures that Y is a well order. Now (ii) of Theorem 1.5 follows via the cited result from [Reference Freund and Rathjen12]. It may be interesting to observe that the given argument involves ‘abstract versions’ of classical results from ordinal analysis: First, the partial inverse
$\psi :\nu \times D(Y)\to Y$
of
$\pi $
is intimately connected to the proof theoretic collapsing functions of W. Buchholz [Reference Buchholz2]. Secondly, the proof in [Reference Freund and Rathjen12] relativizes the ordinal analysis of
$\Pi ^1_1$
-recursion, for which we refer to [Reference Buchholz, Aczel, Simmons and Wainer3, Reference Buchholz, Feferman, Pohlers and Sieg5, Reference Jäger and Pohlers21, Reference Rathjen and Schindler27]. Finally, the embedding
$f:Y\to X$
corresponds to linearizations of Friedman’s gap condition on trees by traditional ordinal notation systems, as given in [Reference Buchholz, Sieg, Sommer and Talcott4, Reference Simpson, Harrington, Morley, Sčědrov and Simpson30].
Let us now sketch the proof that (ii) implies (i) in Theorem 1.5. As we will see, statement (i) follows from a result due to Kříž [Reference Kříž22]. More precisely, it follows from a restriction of this result to finite structures (see Section 3 for the precise statement). We will refer to this restriction as ‘Kříž’s minimality principle’, even though the main result of Kříž covers some infinite structures as well. The logical strength of the full result is not analyzed in the present article. It will be shown that the restricted result can be derived from
$\Pi ^1_1$
-recursion. This closes a circle of implications between statements (i) and (ii) of Theorem 1.5 and Kříž’s minimality principle.
While the original proof by Kříž relies on excessively strong axioms at the level of
$\Pi ^1_2$
-comprehension, it was already known that
$\Pi ^1_1$
-recursion proves statement (i) from Theorem 1.5 for trees, due to work of Gordeev [Reference Gordeev18, Reference Gordeev, Schuster, Seisenberger and Weiermann19] and M. Krombholz (see [Reference Krombholz23, Section 5.1]). The extension from trees to general data types is routine. Nevertheless, we provide a complete proof, which seems particularly transparent in that it stays very close to the original argument of Kříž (except for a small but crucial modification). Our main result is that (i) implies (ii) in Theorem 1.5. Here it was already known that
$\Pi ^1_1$
-recursion has the same consistency strength as the Kruskal–Friedman theorem for trees (see again [Reference Gordeev, Schuster, Seisenberger and Weiermann19]). The point of our result is that we increase the quantifier complexity of the Kruskal–Friedman theorem by extending it from trees to general data types. This allows us to turn the result about consistency strength into an actual equivalence. The latter cannot be derived by traditional ordinal analysis at the level of provable well orders. Instead, it applies results of functorial ordinal analysis, which goes back to Girard’s work in the 1980s (see [Reference Girard15, Reference Girard16]) but has seen important new developments in the last years.
2 Existence of Kruskal fixed points
In this section, we prove the existence of initial
$\nu $
-Kruskal fixed points via an explicit construction. We also provide a criterion that helps to decide whether a given fixed point is initial. Unless otherwise noted, we work over the base theory
$\mathsf {RCA}_0$
. The presentation in this section is similar to the one in [Reference Freund10, Reference Freund, Rathjen and Weiermann13], where the analogous results for
$\nu =1$
are proved.
We begin with a definition and lemma that will be used to approximate
$\nu $
-Kruskal fixed points via their finite suborders.
Definition 2.1. Given a
$\mathsf {PO}$
-dilator W and a finite partial order a, we put

Let us point out that the given definition is closely related to the notion of trace, which is due to Girard [Reference Girard15] for linear orders and was transferred to partial orders in [Reference Freund, Rathjen and Weiermann13]. The following can be seen as a variant of Girard’s normal form theorem.
Lemma 2.2. For each partial order X and each
$\sigma \in W(X)$
, there are unique
$a\in [X]^{<\omega }$
and
$\sigma _0\in W_0(a)$
with
$\sigma =W(\iota )(\sigma _0)$
, where
$\iota :a\hookrightarrow X$
is the inclusion.
Proof. To obtain uniqueness, note that
$\sigma =W(\iota )(\sigma _0)$
with
$\sigma _0\in W_0(a)$
entails

due to the naturality of supports. Once a is determined, so is
$\sigma _0$
, as the embedding
$W(\iota )$
is injective. For existence, set
$a:=\operatorname {supp}_X(\sigma )$
and use the support condition from Definition 1.1 to get
$\sigma =W(\iota )(\sigma _0)$
for some
$\sigma _0\in W(a)$
. We have
$\sigma _0\in W_0(a)$
by the same computation as in the uniqueness part.
The following definition yields an initial
$\nu $
-Kruskal fixed point of W, as we will prove below. It extends the construction for
$\nu =1$
given in [Reference Freund, Rathjen and Weiermann13, Definition 3.4]. To allow for applications of W, the definition requires that the arising relation is a partial order on certain subsets. In hindsight, this requirement turns out to be redundant: we will see that we get a partial order on the entire fixed point.
Definition 2.3. For an ordinal
$\nu $
and a normal
$\mathsf {PO}$
-dilator W, we use simultaneous recursion to generate a set
$\mathcal T W[\nu ]$
of terms and a binary relation
$\leq _{\mathcal TW[\nu ]}$
on it:
-
• Whenever we have constructed a finite set
$a\subseteq \mathcal TW[\nu ]$ that is partially ordered by the relation
$\leq _{\mathcal TW[\nu ]}$ , we add a term
$\alpha \star (a,\sigma )\in \mathcal TW[\nu ]$ for each ordinal
$\alpha <\nu $ and each element
$\sigma \in W_0(a)$ .
-
• The relation
$\alpha \star (a,\sigma )\leq _{\mathcal TW[\nu ]}\beta \star (b,\tau )$ holds precisely when we have
$\alpha \leq \beta $ and one of the following conditions is satisfied:
-
(i) the set
$a\cup b$ is partially ordered by
$\leq _{\mathcal TW[\nu ]}$ and we have
$$ \begin{align*} W(\iota_a)(\sigma)\leq_{W(a\cup b)} W(\iota_b)(\tau) \end{align*} $$
$\iota _a:a\hookrightarrow a\cup b$ and
$\iota _b:b\hookrightarrow a\cup b$ ,
-
(ii) we have
$\alpha \star (a,\sigma )\leq _{\mathcal TW[\nu ]} r$ for some
$r\in b$ .
-
More explicitly, the given clauses decide
$r\in \mathcal T W[\nu ]$
and
$s\leq _{\mathcal T W[\nu ]}t$
by simultaneous recursion on
$l(r)$
and
$l(s)+l(t)$
, respectively, for the length function

To prove that the relation
$\leq _{\mathcal TW}$
is a partial order, we first show how it relates to the height function

As indicated in the introduction, the assumption that W is normal is crucial for the following argument.
Lemma 2.4. From
$s\leq _{\mathcal TW[\nu ]}t$
we get
$h(s)\leq h(t)$
.
Proof. We argue by induction on
$l(s)+l(t)$
. Write
$s=\alpha \star (a,\sigma )$
and
$t=\beta \star (b,\tau )$
. If
$s\leq _{\mathcal TW[\nu ]} t$
holds by clause (ii) from Definition 2.3, we get
$h(s)\leq h(r)<h(t)$
by induction. If it holds by clause (i), then normality as in Definition 1.2 yields

Here the equalities rely on
$\sigma \in W_0(a)$
and
$\tau \in W_0(b)$
due to the definition of
$\mathcal TW[\nu ]$
. For any
$s'\in a$
we thus find a
$t'\in b$
with
$s'\leq _{\mathcal TW[\nu ]}t'$
. By induction hypothesis this gives
$h(s')\leq h(t')<h(t)$
. As
$s'\in a$
was arbitrary, we get
$h(s)\leq h(t)$
.
The previous lemma ensures that clause (ii) of Definition 2.3 cannot ‘break’ antisymmetry. Indeed, if
$s\leq _{\mathcal TW[\nu ]}t$
holds by said clause, we get
$h(s)<h(t)$
by the proof of the lemma. But then the latter excludes
$t\leq _{\mathcal TW[\nu ]} s$
. Apart from this observation—which motivates the notion of normality—we do not provide a proof of the following result: instead we refer to the proof of [Reference Freund, Rathjen and Weiermann13, Proposition 3.6], which treats the case
$\nu =1$
and generalizes without essential changes.
Lemma 2.5. The relation
$\leq _{\mathcal TW[\nu ]}$
is a partial order on
$\mathcal TW[\nu ]$
.
We now come to a crucial construction, which is justified by Lemma 2.2.
Definition 2.6. Given an ordinal
$\nu $
and a normal
$\mathsf {PO}$
-dilator W, we define

by stipulating
$\kappa _W^\nu (\alpha ,\sigma )=\alpha \star (a,\sigma _0)$
for
$\sigma =W(\iota )(\sigma _0)$
with
$\sigma _0\in W_0(a)$
, where we write
$\iota :a\hookrightarrow \mathcal TW[\nu ]$
for the inclusion.
Let us show that our definition yields the desired result.
Proposition 2.7. The partial order
$\mathcal TW[\nu ]$
and the function
$\kappa _W^\nu $
form a
$\nu $
-Kruskal fixed point of W, whenever the latter is a normal
$\mathsf {PO}$
-dilator.
Proof. First note that
$\kappa _W^\nu $
has an obvious inverse and is thus bijective. To establish the equivalence from Definition 1.3, we look at two values

Consider the inclusions
$\iota _a:a\hookrightarrow a\cup b$
and
$\iota _b:b\hookrightarrow a\cup b$
as well as
$\iota :a\cup b\hookrightarrow \mathcal TW[\nu ]$
. According to the definition of
$\kappa _W^\nu $
, we have

In view of
$\tau _0\in W_0(b)$
we get
$\operatorname {supp}_{\mathcal TW[\nu ]}(\tau )=b$
, as in the proof of Lemma 2.2. Given that
$W(\iota )$
is an embedding, the desired equivalence from Definition 1.3 amounts to

This coincides with the recursive characterization of
$\leq _{\mathcal TW[\nu ]}$
in Definition 2.3.
In the following result, the equivalence between (i) and (ii) is the criterion for initial fixed points that was promised at the beginning of this section. In particular, this criterion can be used to confirm that the uniform Kruskal–Friedman theorem entails the original version for trees, as noted in the introduction. Statement (iii) will allow us to relate Kruskal fixed points for different ordinals. A similar criterion for the case
$\nu =1$
is provided by [Reference Freund10, Theorem 3.5]
Theorem 2.8. Assume that X and
$\kappa :\nu \times W(X)\to X$
form a
$\nu $
-Kruskal fixed point of a normal
$\mathsf {PO}$
-dilator W. The following are equivalent:
-
(i) The
$\nu $ -Kruskal fixed point
$(X,\kappa )$ is initial.
-
(ii) There is a function
$h:X\to \mathbb N$ such that we have
$$ \begin{align*} h(x)<h(\kappa(\alpha,\sigma))\quad\text{whenever}\quad x\in\operatorname{supp}_X(\sigma). \end{align*} $$
-
(iii) For any embedding
$I:\nu \to \mu $ and any
$\mu $ -Kruskal fixed point
$(Y,\pi )$ of W, there is a unique quasi embedding
$f:X\to Y$ such that the diagram
$I\times W(f)$ maps
$(\alpha ,\sigma )$ to
$(I(\alpha ),W(f)(\sigma ))$ .
Furthermore, the unique f in (iii) is always an embedding.
Proof. First, a glance at Definition 1.3 reveals that (iii) entails (i) as the special case where I is the identity on
$\nu =\mu $
. Next, we assume (i) and derive (ii). In view of Proposition 2.7, we get a quasi embedding
$f:X\to \mathcal TW[\nu ]$
with

Let
$h:\mathcal TW[\nu ]\to \mathbb N$
be the function defined before Lemma 2.4. We show that (ii) holds with
$h\circ f:X\to \mathbb N$
at the place of h. Given any
$\alpha <\nu $
and
$\sigma \in W(X)$
, invoke Lemma 2.2 to write
$W(f)(\sigma )=W(\iota )(\sigma _0)$
with
$\iota :a\hookrightarrow \mathcal TW[\nu ]$
and
$\sigma _0\in W_0(a)$
. For an arbitrary element
$x\in \operatorname {supp}_X(\sigma )$
, we get

where the last equality is established as in the proof of Lemma 2.2. Considering the definition of h, we can conclude

as needed to establish (ii) for
$h\circ f$
. Finally, we assume (ii) and deduce (iii). To see that there is at most one f with the required property, note that an arbitrary element of X can be uniquely written as
$\kappa (\alpha ,\sigma )$
, since
$\kappa $
is bijective by Definition 1.3. Commutativity of the diagram in (iii) amounts to

where we take
$\iota :a\hookrightarrow X$
and
$\sigma _0\in W_0(a)$
to be determined by Lemma 2.2. As in the proof of the latter, we learn that the domain a of
$f\circ \iota $
is equal to
$\operatorname {supp}_X(\sigma )$
. In view of (⋆), we can thus invoke induction over
$h(x)$
to see that
$f(x)$
is uniquely determined. For existence, the idea is to read (⋆) as a recursive definition of f. To ensure that the application of W is explained, we must simultaneously verify that
$f\circ \iota $
is a (quasi) embedding. In order to make this precise, we use recursion over h to define a length function

One can now use (⋆) to define
$f(x)$
by recursion on
$l(x)$
, while showing

by a simultaneous induction on
$l(x_0)+l(x_1)<l(x)$
. This last induction is based on the fact that
$\kappa $
and
$\pi $
satisfy the same equivalence from Definition 1.3. For details, we refer to the proof of the case
$\nu =1$
in [Reference Freund10, Theorem 3.5].
With respect to the constructions from Definitions 2.3 and 2.6, we can now strengthen Proposition 2.7 as follows.
Corollary 2.9. The
$\nu $
-Kruskal fixed point
$(\mathcal TW[\nu ],\kappa _W^\nu )$
of W is initial, for any normal
$\mathsf {PO}$
-dilator W.
Proof. Statement (ii) of Theorem 2.8 holds for the function
$h:\mathcal T W[\nu ]\to \mathbb N$
defined before Lemma 2.4, as shown in the proof of the theorem.
As pointed out in the introduction, all initial
$\nu $
-Kruskal fixed points of W are isomorphic, by a general argument about universal properties: Given two initial fixed points
$(X,\kappa )$
and
$(Y,\pi )$
, we get quasi embeddings f and g such that

is a commutative diagram. Now there can only be one quasi embedding
$g\circ f$
such that the outer rectangle commutes, by the uniqueness condition in Definition 1.3. Hence
$g\circ f$
must be the identity on X. In the same way, we see that
$f\circ g$
is the identity on Y, which makes f an isomorphism with inverse g. Having shown existence and essential uniqueness, we may speak of ‘the’ initial
$\nu $
-Kruskal fixed point of a normal
$\mathsf {PO}$
-dilator W. Furthermore, we can use
$\mathcal TW[\nu ]$
as notation for ‘this’ fixed point, without committing to the specific construction from Definition 2.3. The formulation of Theorem 1.5 from the introduction is now fully explained and justified. We conclude this section with the following observation.
Corollary 2.10. For ordinals
$\nu <\mu $
and any normal
$\mathsf {PO}$
-dilator W, we have an order embedding of
$\mathcal TW[\nu ]$
into
$\mathcal TW[\mu ]$
.
Proof. The assumption
$\nu <\mu $
is witnessed by an embedding
$I:\nu \to \mu $
. We can conclude via statement (iii) of Theorem 2.8 (which applies due to Corollary 2.9).
It follows that the uniform Kruskal–Friedman theorem holds for all finite
$\nu <\omega $
if it holds for some infinite
$\nu $
, as needed to infer Corollary 1.6 from Theorem 1.5.
3 Kříž’s minimality principle
In this section, we derive the uniform Kruskal–Friedman theorem from a minimal bad sequence principle for structures with gap condition, which is due to Kříž [Reference Kříž22]. We also show that this principle can be derived from
$\Pi ^1_1$
-recursion, by presenting a modified version of Kříž’s original argument. As mentioned in the introduction, Gordeev [Reference Gordeev18, Reference Gordeev, Schuster, Seisenberger and Weiermann19] and Krombholz [Reference Krombholz23] have previously used
$\Pi ^1_1$
-recursion to derive an analogous result for trees.
The following coincides with [Reference Kříž22, Definition 1.4], except that clause (v) has been added (cf. the proof of Theorem 3.5 and the remark before Theorem 3.7).
Definition 3.1. For an ordinal
$\nu $
, a
$\nu $
-gap order consists of a quasiorder
$(X,\leq _X)$
, a function
$q:X\to \nu $
and a binary relation
$\ll $
on X, such that the following holds:
-
(i) if we have
$s\leq _X t$ , then we must also have
$q(s)\leq q(t)$ ,
-
(ii) from
$r\leq _X s\ll t$ and
$q(r)\leq q(t)$ , we can infer
$r\leq _X t$ ,
-
(iii) given
$s_0\ll \ldots \ll s_n$ with
$\min _{i\leq n}q(s_i)\in \{q(s_0),q(s_n)\}$ , we get
$s_0\ll s_n$ ,
-
(iv) there is no infinite sequence
$f:\mathbb N\to X$ with
$f(i+1)\ll f(i)$ for all
$i\in \mathbb N$ ,
-
(v) the set
$\{s\in X\,|\,s\ll t\}$ is finite for each
$t\in X$ .
We shall write
$\ll ^*$
for the reflexive and transitive closure of
$\ll $
.
Let us now show that any initial
$\nu $
-Kruskal fixed point (see the previous section) can be seen as a
$\nu $
-gap order. With respect to the following definition, we point out that
$\kappa :\nu \times W(X)\to X$
is a bijection by Definition 1.3. The recursion is justified due to part (ii) of Theorem 2.8 (with
$\operatorname {supp}_X:W(X)\to [X]^{<\omega }$
as in Definition 1.1).
Definition 3.2. We consider a normal
$\mathsf {PO}$
-dilator W and its initial
$\nu $
-Kruskal fixed point
$(X,\kappa )$
. Let the function
$q:X\to \nu $
be given by
$q(\kappa (\alpha ,\sigma )):=\alpha $
. For
$\gamma <\nu $
we define
$K^W_\gamma :X\to [X]^{<\omega }$
and
$K_\gamma :W(X)\to [X]^{<\omega }$
by the recursive clauses

To obtain a binary relation
$\ll $
on X, we now declare that
$\kappa (\alpha ,\sigma )\ll \kappa (\beta ,\tau )$
is equivalent to
$\kappa (\alpha ,\sigma )\in K_\gamma (\tau )$
for
$\gamma =\min \{\alpha ,\beta \}$
.
Intuitively, we have
$s\ll t$
when s is a proper subtree of t and the minimal label on the path between the roots of s and t occurs at one of these roots. The following extends [Reference Kříž22, Lemma 1.6] from trees to general recursive data types.
Lemma 3.3. The previous definition yields a
$\nu $
-gap order
$(X,\leq _X,q,\ll )$
.
Proof. Condition (i) of Definition 3.1 follows from the equivalence in Definition 1.3. To show condition (ii), we note that
$r\leq _Xs\ll t$
and
$q(r)\leq q(t)$
entail
$q(r)\leq q(s)$
and hence
$s\in K^W_\gamma (t)$
with
$q(r)\leq \gamma $
. So it suffices to prove

We employ induction on
$h(t)$
, for
$h:X\to \mathbb N$
as in part (ii) of Theorem 2.8. In the non-trivial case of
$s\neq t$
, write
$t=\kappa (q(t),\tau )$
to get
$s\in K^W_\gamma (t')$
with
$t'\in \operatorname {supp}_X(\tau )$
. We obtain
$r\leq _X t'$
by induction hypothesis. Given
$q(r)\leq q(t)$
, the equivalence from Definition 1.3 yields
$r\leq _X t$
, as required. As preparation for the other conditions, one uses induction on
$h(t)$
to verify

One can infer that
$s\ll t$
entails
$h(s)<h(t)$
, which yields condition (iv). Furthermore, it follows that any
$s\ll t=\kappa (q(t),\tau )$
is contained in the finite set
$K_0(\tau )$
, so that condition (v) is satisfied. To derive condition (iii), we write
$s_i=\kappa (q(s_i),\sigma _i)$
. Given
$\min _{i\leq n}q(s_i)\in \{q(s_0),q(s_n)\}$
, we have

for all
$i<n$
. Assuming
$s_0\ll \ldots \ll s_n$
, we also get

This entails
$K^W_\gamma (s_i)\subseteq K^W_\gamma (s_{i+1})$
and then
$s_0\in K^W_\gamma (s_n)$
. As we have
$h(s_0)<h(s_n)$
and hence
$s_0\neq s_n$
, we even get
$s_0\in K_\gamma (\sigma _n)$
, which amounts to
$s_0\ll s_n$
.
Let us agree to write
$[N]^\omega $
for the set of infinite subsets of a given set
$N\subseteq \mathbb N$
. To formulate the minimality principle of Kříž, we need the following terminology.
Definition 3.4. Consider a
$\nu $
-gap order
$(X,\leq _X,q,\ll )$
. A sequence
$f:M\to X$
with
$M\in [\mathbb N]^\omega $
is called bad if there are no
$i,j\in M$
with
$i<j$
and
$f(i)\leq _X f(j)$
. It is called regular if we have
$q(f(i))\leq q(f(j))$
for all
$i<j$
in M.
The following coincides with [Reference Kříž22, Theorem 1.5], except that our Definition 3.1 is more restrictive than [Reference Kříž22, Definition 1.4]. This means that the present statement is somewhat weaker than Kříž’s original result.
Theorem 3.5 (Kříž’s minimality principle for
$\nu $
)
Assume that
$(X,\leq _X,q,\ll )$
is a
$\nu $
-gap order. Given any bad sequence
$f:\mathbb N\to X$
, one can find an
$M\in [\mathbb N]^\omega $
and a regular bad sequence
$g:M\to X$
such that
-
(i) we have
$g(i)\ll ^*f(i)$ for all
$i\in M$ ,
-
(ii) there is no bad
$h:N\to X$ with
$N\in [M]^\omega $ and
$h(i)\ll g(i)$ for all
$i\in N$ .
Below, we show that Kříž’s minimality principle can be proved by
$\Pi ^1_1$
-recursion. Together with the following result, this entails that (ii) implies (i) in Theorem 1.5. We note that the following extends [Reference Kříž22, Theorem 1.7] from trees to general recursive data types. A related generalization is given in [Reference Kříž22, Section 3].
Theorem 3.6. The uniform Kruskal–Friedman theorem for
$\nu $
follows from Kříž’s minimality principle for
$\nu $
, for any ordinal
$\nu $
and over
$\mathsf {RCA}_0$
.
Proof. Given a normal
$\mathsf {WPO}$
-dilator W, we consider its initial
$\nu $
-Kruskal fixed point
$(X,\kappa )$
and the
$\nu $
-gap order
$(X,\leq _X,q,\ll )$
that arises from Lemma 3.3. To establish the uniform Kruskal–Friedman theorem, we must show that X is a
$\mathsf {WPO}$
. Aiming at a contradiction, we assume that there is a bad sequence
$f:\mathbb N\to X$
. By Kříž’s minimality principle, we get a regular bad
$g:M\to X$
as in Theorem 3.5. Let us write
$g(i)=\kappa (q(g(i)),\sigma _i)$
in order to define

We show that Y is a
$\mathsf {WPO}$
. If not, we find a bad sequence
$h:N\to Y$
with
$N\in [M]^\omega $
and
$h(i)\in \operatorname {supp}_X(\sigma _i)$
, as the supports are finite by Definition 1.1. In view of Definition 3.2, we put
$\gamma :=\min \{q(g(i)),q(h(i))\}$
to get

and hence
$h(i)\ll g(i)$
. We have reached a contradiction with (ii) from Theorem 3.5. Let us now consider the inclusion
$\iota :Y\hookrightarrow X$
. We may write
$\sigma _i=W(\iota )(\tau _i)$
with
$\tau _i\in W(Y)$
, due to the support condition from Definition 1.1. Given that Y is a
$\mathsf {WPO}$
, so is
$W(Y)$
, since W is a
$\mathsf {WPO}$
-dilator. We thus get
$\tau _i\leq _{W(Y)}\tau _j$
for some
$i<j$
in M. Since W preserves embeddings (see again Definition 1.1), we can conclude
$\sigma _i\leq _{W(X)}\sigma _j$
. The latter entails
$g(i)\leq _X g(j)$
by the equivalence from Definition 1.3, as g is regular. This is in contradiction with g being bad.
We now present Kříž’s proof [Reference Kříž22] of his minimality principle with some small but crucial modifications (which will be pointed out). Due to these modifications, the principle of
$\Pi ^1_1$
-recursion suffices to carry out the proof, as we shall argue below.
Proof of Theorem 3.5
Let us begin with some preparations. For each ordinal
$\beta $
, we truncate the given
$q:X\to \nu $
into a function

In the presence of Kőnig’s lemma, conditions (iv) and (v) of Definition 3.1 yield a rank function
$r:X\to \omega $
such that
$s\ll t$
entails
$r(s)<r(t)$
. While condition (v) will be crucial later, it is not essential at this point, as one could also work with infinite ordinal ranks. We now consider

At stage
$\beta +1$
of the recursive construction below, we minimize with respect to
$p_\beta $
where Kříž uses q. We will see that this forces our construction to close after
$\nu +1$
stages, while Kříž gives no information on the closure ordinal, except that it must be countable. We now make precise what we mean by minimizing. For
$k\in \mathbb N$
and
$g:M\to X$
with
$M\subseteq \mathbb N$
, we put
$[k]:=\{0,\ldots ,k-1\}$
and define
$g[k]:M\cap [k]\to X$
as the restriction of g. Given
$g_i:M_i\to X$
with
$M_i\in [\mathbb N]^\omega $
for
$i\in \{0,1\}$
, we write
$g_0\mathbin {\angle }_\beta g_1$
if we have
$M_0\subseteq M_1$
and there is a
$k\in M_1$
with
$g_0[k]=g_1[k]$
and

There is a close connection with the relation
$\mathbin {\angle }$
from [Reference Kříž22, Definition 2.1] and the minimal bad sequences of Nash-Williams [Reference Nash-Williams26]. For the purpose of this proof, we declare that a tree is a non-empty set T of functions
$\sigma :a\to X$
with finite domain
$a\subseteq \mathbb N$
, such that
$\sigma \in T$
entails
$\sigma [k]\in T$
. By a branch of
$T,$
we mean an infinite function
$g:M\to X$
with
$g[k]\in T$
for all
$k\in \mathbb N$
. As usual, we write
$[T]$
for the set of branches of T (even though this overloads the notation
$[\cdot ]$
). Given a non-empty set
$\mathcal S$
of infinite functions
$g:M\to X$
, we define a tree
$T(\mathcal S)$
by stipulating

We shall use the following variant of [Reference Kříž22, Lemma 2.2], which is standard but will also be verified in the proof of Theorem 3.7 below:

Aiming at a contradiction, assume that Theorem 3.5 fails for a given bad
$f:\mathbb N\to X$
. Let
$\mathcal S$
be the set of all infinite bad sequences
$g:M\to X$
with
$M\subseteq \mathbb N$
such that
$g(i)\ll ^*f(i)$
holds for all
$i\in M$
. To see that (†) can be applied, we first note that
$\mathcal S$
contains f and is thus non-empty. The remaining condition
$[T(\mathcal S)]\subseteq \mathcal S$
expresses that membership in
$\mathcal S$
can be refuted on finite initial segments, which is indeed the case. More explicitly, if
$g:M\to X$
lies in
$g\in [T(\mathcal S)]$
, then any finite segment
$g[k]$
is equal to
$g'[k]$
for some
$g'\in \mathcal S$
. For
$i,j\in M$
with
$i,j<k$
, the corresponding properties of
$g'$
ensure
$g(i)\not \leq _X g(j)$
and
$g(i)\ll ^*f(i)$
, as needed to get
$g\in \mathcal S$
. Now (†) yields a minimal bad
$g_0:M_0\to X$
with
$g_0(i)\ll ^*f(i)$
for all
$i\in M_0\in [\mathbb N]^\omega $
. More explicitly, minimality means that
$g\mathbin {\angle }_0 g_0$
fails for any bad
$g:M\to X$
with
$g(i)\ll ^*f(i)$
for
$i\in M\in [\mathbb N]^\omega $
. Let us observe that this minimal
$g_0$
is regular: If not, then we have
$q(g_0(k'))<q(g_0(k))$
for some
$k<k'$
in
$M_0$
. Now let

be the restriction of
$g_0$
to the indicated domain. Given that
$q_0$
coincides with q, we get
$p_0(g(k'))<p_0(g_0(k))$
and hence
$g\mathbin {\angle }_0 g_0$
, which contradicts the minimality of
$g_0$
. Starting with
$k_0:=\min (M_0)$
and
$g_0:M_0\to X$
as specified, we shall use recursion on the ordinal
$\alpha $
to construct functions
$g_\alpha :M_\alpha \to X$
and elements
$k_\alpha \in M_\alpha $
that validate the following properties:
-
(i.α) we have
$M_\alpha \in [\mathbb N]^\omega $ , and
$g_\alpha :M_\alpha \to X$ is regular and bad,
-
(ii.α) for any
$\beta <\alpha $ and all
$i\in M_\alpha $ , we have
$i\in M_\beta $ and
$q(g_\alpha (i))\geq q(g_\beta (i))$ , as well as
$g_\alpha (i)\ll g_\beta (i)$ or
$g_\alpha (i)=g_\beta (i)$ ,
-
(iii.α) if
$\alpha $ is a limit, any
$i\in M_\alpha =\bigcap _{\beta <\alpha }M_\beta $ admits a
$\beta <\alpha $ with
$g_\alpha (i)=g_\beta (i)$ ,
-
(iv.α) if
$\alpha =\beta +1$ , then we have
$g_\alpha (i)=g_\beta (i)$ for all
$i\in M_\alpha \cap [k_\alpha ]=M_\beta \cap [k_\alpha ]$ , as well as
$g_\alpha (i)\ll g_\beta (i)$ for all
$i\in M_\alpha \backslash [k_\alpha ]$ ,
-
(v.α) if
$\alpha =\beta +1$ and
$g^{\prime }_\alpha :M^{\prime }_\alpha \to X$ is the restriction of
$g_\alpha $ to
$M^{\prime }_\alpha :=M_\alpha \backslash [k_\alpha ]$ , then
$g\mathbin {\angle }_\beta g^{\prime }_\alpha $ fails for any bad sequence
$g:M\to X$ with
$q(g(i))\geq q(g_\beta (i))$ and
$g(i)\ll g_\beta (i)$ for all
$i\in M\in [M^{\prime }_\alpha ]^\omega $ .
We note that
$k_\alpha $
is only relevant when
$\alpha $
is a successor. In this case, (v.α) asserts that
$g_\alpha '$
is minimally bad with the properties in (ii.α) and (iv.α). All conditions are valid (and mostly void) when
$\alpha =0$
. For the recursion step towards
$\alpha>0$
, assume that
$g_\gamma :M_\gamma \to X$
and
$k_\gamma \in M_\gamma $
are defined and that (i.
$\gamma $
) to (v.
$\gamma $
) hold for
$\gamma <\alpha $
.
Let us first consider the case where
$\alpha $
is a limit. The crucial task is to show that the set
$M_\alpha :=\bigcap _{\beta <\alpha }M_\beta $
is infinite. As in the proof of Kříž, we map
$\beta <\alpha $
to

Note that
$\alpha \ni \beta \mapsto k(\beta )\in \mathbb N$
is non-decreasing. Let us show that it is unbounded: If not, pick a
$\beta _0<\alpha $
such that
$k:=k(\beta _0)=k(\beta )$
holds for
$\beta _0\leq \beta <\alpha $
. We then find a sequence
$\beta _0<\gamma (0)<\gamma (1)<\ldots <\alpha $
with
$k_{\gamma (i)+1}=k$
for all
$i\in \mathbb N$
. Due to conditions (iv) and (ii) for
$\gamma (i+1)+1$
, we obtain

which contradicts the well foundedness of
$\ll $
, i.e., property (iv) from Definition 3.1. For any
$\beta <\alpha $
, we now show
$k(\beta )\in M_\gamma $
by induction on
$\gamma <\alpha $
. When
$\gamma \leq \beta +1$
, it suffices to observe that there is a
$\delta>\beta $
with
$k(\beta )=k_{\delta +1}\in M_{\delta +1}\subseteq M_\gamma $
, where the inclusion relies on (ii.
$\delta +1$
). The induction step towards a limit
$\gamma $
is immediate by condition (iii.
$\gamma $
). Finally, for
$\gamma =\delta +1>\beta +1$
we have
$k(\beta )\leq k_\gamma $
by definition. If this is an equality, the claim is immediate. Otherwise, we get

by the induction hypothesis and (iv.
$\gamma $
). We can now conclude that

is infinite, as desired. By condition (ii) and the well foundedness of
$\ll $
, each
$i\in M_\alpha $
admits a
$\beta (i)<\alpha $
such that
$g_\beta (i)=g_{\beta (i)}(i)$
holds for
$\beta (i)\leq \beta <\alpha $
. To complete the limit case of the recursion step, we define
$g_\alpha :M_\alpha \to X$
by
$g_\alpha (i):=g_{\beta (i)}(i)$
. It is straightforward to check that conditions (i.
$\alpha $
) to (v.
$\alpha $
) are satisfied.
Let us continue with the recursion step towards a successor ordinal
$\alpha =\beta +1$
. As preparation, we consider an arbitrary bad
$h:M\to X$
such that
$h(i)\ll g_\beta (i)$
holds for all
$i\in M\in [M_\beta ]^\omega $
. By induction over
$\gamma \leq \beta $
, we show that we have

The induction step towards a limit
$\gamma $
is immediate by condition (iii.
$\gamma $
). To establish the claim for
$\gamma =0$
and arbitrary
$k\in M$
, we consider the sequence

For
$j\in M\backslash [k],$
we can invoke condition (ii.
$\beta $
) to get
$g_\beta (j)=g_0(j)$
or
$g_\beta (j)\ll g_0(j)$
and
$q(g_\beta (j))\geq q(g_0(j))$
. In either case, we obtain
$h(j)\ll g_0(j)$
, due to property (iii) from Definition 3.1. It follows that g is bad: if we had
$g(i)\leq _X g(j)$
for
$i\in M_0\cap [k]$
and
$j\in M\backslash [k]$
, we could conclude
$g_0(i)\leq _X g_0(j)$
by (ii) of Definition 3.1, as
$g_0$
is regular. Now
$g\mathbin {\angle }_0 g_0$
must fail by the choice of
$g_0$
. We thus get

Since
$h(k)\ll g_0(k)$
entails
$r(h(k))<r(g_0(k))$
, this yields
$q_0(h(k))>q_0(g_0(k))$
and hence
$q(h(k))>q(g_0(k))$
. In the induction step towards a successor
$\gamma =\delta +1$
, we first assume that we have
$k\in M\cap [k_\gamma ]$
. By condition (iv.
$\gamma $
), we get
$g_\gamma (k)=g_\delta (k)$
, so that it suffices to invoke the induction hypothesis. Now assume
$k\in M\backslash [k_\gamma ]$
and define
$g^{\prime }_\gamma :M^{\prime }_\gamma \to X$
as in condition (v.
$\gamma $
). We observe
$M\backslash [k]\subseteq M^{\prime }_\gamma $
and consider

As in the case of
$\gamma =0$
, one can show that
$g'$
is bad. Using the induction hypothesis and conditions (ii.
$\gamma $
) and (iv.
$\gamma $
) with
$\gamma =\delta +1$
, one obtains
$q(g'(i))\geq q(g_\delta (i))$
and
$g'(i)\ll g_\delta (i)$
for any i in the domain of
$g'$
. Invoking the minimality condition (v.
$\gamma $
), we can infer
$p_\delta (h(k))\geq p_\delta (g_\gamma (k))$
. As before, we combine this with
$h(k)\ll g_\gamma (k)$
to get
$q_\delta (h(k))>q_\delta (g_\gamma (k))$
and then
$q(h(k))>q(g_\gamma (k))$
, as needed.
After these preparations, we carry out the recursion step towards
$\alpha =\beta +1$
. We shall first describe the construction of a sequence
$g_\alpha :M_\alpha \to X$
relative to a given set
$N_\beta \in [M_\beta ]^\omega $
. Afterwards, we will show that conditions (i.
$\alpha $
) to (v.
$\alpha $
) hold for a suitable choice of
$N_\beta $
, which depends on the form of
$\beta $
. As the first step in the construction of
$g_\alpha $
, we declare that
$g^\circ _\beta :N_\beta \to X$
is the restriction of
$g_\beta $
. Condition (i.
$\beta $
) ensures that
$g^\circ _\beta $
is regular and bad. By (ii.
$\beta $
) and the choice of
$g_0$
, we have
$g^\circ _\beta (i)\ll ^*f(i)$
for all
$i\in N_\beta $
. Due to the assumption that Theorem 3.5 fails for f, condition (ii) of the theorem must be false for
$g=g^\circ _\beta $
. Hence we get a bad sequence
$h:N\to X$
with
$h(i)\ll g_\beta (i)$
for all
$i\in N\in [N_\beta ]^\omega $
. The preparatory claim above yields
$q(h(i))\geq q(g_\beta (i))$
. By the principle (3.1), we may assume that h is minimal with the given properties, in the sense that (v.
$\alpha $
) holds with
$h:N\to X$
at the place of
$g^{\prime }_\alpha :M^{\prime }_\alpha \to X$
. For
$k_\alpha :=\min (N)$
and
$M_\alpha :=(M_\beta \cap [k_\alpha ])\cup N$
, we now consider the sequence

This does indeed yield
$M^{\prime }_\alpha =M_\alpha \backslash [k_\alpha ]=N$
and
$g^{\prime }_\alpha =h$
. So (iv.
$\alpha $
) and (v.
$\alpha $
) hold by construction, while (iii.
$\alpha $
) is void for a successor. Invoking (iii) of Definition 3.1, one readily derives (ii.
$\alpha $
) from (ii.
$\beta $
). If we had
$g_\beta (i)\leq _X h(j)$
with
$i\in M_\beta \cap [k_\alpha ]$
and
$j\in N$
, we would get
$g_\beta (i)\leq _X g_\beta (j)$
by (iii) of Definition 3.1, as
$g_\beta $
is regular due to (i.
$\beta $
). So the sequence
$g_\alpha $
is bad. This reduces (i.
$\alpha $
) to the claim that h is regular. For indices
$j<k$
in N, the minimality condition (v.
$\alpha $
) entails
$q_\beta (h(j))\leq q_\beta (h(k))$
, by the argument that we have used to prove the regularity of
$g_0$
. To conclude the recursion step, we now describe a choice of
$N_\beta \in [M_\beta ]^\omega $
that ensures

Let us point out that the previous arguments are essentially due to Kříž [Reference Kříž22]. The latter, however, had no need to show (‡), as he worked with q at the place of
$p_\beta $
. This means that the following is a new contribution of the present article. To satisfy (‡) for
$\beta =0$
, we can simply take
$N_0:=M_0$
, as
$q_0$
and q coincide. When
$\beta =\gamma +1$
is a successor itself, we put
$N_\beta :=M^{\prime }_\beta =M_\beta \backslash [k_\beta ]$
. Note that in this case, we get

Property (‡) is immediate when we have
$q_\beta (h(j))>0$
and hence
$q_\beta (h(j))=q(h(j))$
. To cover the case where we have
$q_\beta (h(j))=0$
and thus
$q(h(j))<\beta =\gamma +1$
, we establish
$q(h(k))\geq \gamma $
. Aiming at a contradiction, we assume
$q(h(k))<\gamma $
, so that we get
$q_\gamma (h(k))=0$
. From the above, we have
$h(k)\ll g_\beta (k)$
and thus
$r(h(k))<r(g_\beta (k))$
, which yields

One readily derives a contradiction with (v.
$\beta $
), by considering the sequence

For
$\alpha =\beta +1=\gamma +2$
and
$k\in N=M_\alpha \backslash [k_\alpha ]$
, we have seen
$q(g_\alpha (k))= q(h(k))\geq \gamma $
. We now consider
$\alpha =\beta +1$
for a limit
$\beta $
. For all
$\gamma <\beta $
, the given argument yields

With the aim of setting
$N_\beta :=\{i(j)\,|\,j\in \mathbb N\}$
, we define
$i(0)<i(1)<\ldots \subseteq M_\beta $
as follows: Assuming that
$i(j')$
is defined for
$j'<j$
, put

Crucially, the new condition (v) in Definition 3.1 ensures
$\gamma (j)<\beta $
. We now pick a next index
$i(j)\in M_\beta $
with
$i(j)\geq k_{\gamma (j)+2}$
. Let us point out that this choice of
$N_\beta $
is loosely inspired by an argument of Gordeev [Reference Gordeev18]. Consider (‡) with
$i(j')$
and
$i(j)$
at the place of j and k, respectively. In the non-trivial case of
$q_\beta (h(i(j')))=0$
, we invoke
$h(i(j'))\ll g_\beta (i(j'))$
and the definition of
$\gamma (j)$
to get the first inequality in

Note that the penultimate inequality comes from condition (ii.
$\beta $
), while the remaining two inequalities were observed during the previous construction. This completes the proof of (‡) and hence the last case in the recursive construction of our sequences
$g_\alpha :M_\alpha \to X$
and elements
$k_\alpha \in M_\alpha $
.
Recall that we aim at a contradiction with the assumption that Theorem 3.5 fails for the given f. To conclude his proof, Kříž [Reference Kříž22] argues that the recursive construction cannot be successful up to an ordinal
$\alpha $
that has uncountable cofinality. Indeed, we have seen that the limit case gives rise to a map
$\alpha \ni \beta \mapsto k(\beta )\in \mathbb N$
that is non-decreasing and unbounded. It follows that
$\mathbb N\ni k\mapsto \min \{\beta <\alpha \,|\,k(\beta )\geq k\}$
is cofinal in
$\alpha $
. Our modified construction cannot even be successful up to
$\alpha =\nu +2$
. Otherwise, we would get
$q(g_{\nu +2}(k))\geq \nu $
for sufficiently large
$k\in M_{\nu +2}$
, as we have seen above. This would yield a value beyond the range of
$q:X\to \nu $
.
To conclude this section, we discuss the formalization of the previous proof. Let us recall that Definition 3.1 is more restrictive than the notion of ‘quasiordering with gap-condition’ in [Reference Kříž22], as we have added the new condition (v). Thus our version of ‘Kříž’s minimality principle’ is somewhat weaker than the original result in [Reference Kříž22]. It is still strong enough for a large class of applications, as Theorem 3.6 demonstrates. We do not know whether the following remains valid for Kříž’s original principle.
Theorem 3.7. Kříž’s minimality principle for
$\nu $
(as formulated in Theorem 3.5) follows from
$\Pi ^1_1$
-recursion along
$\nu $
, where
$\nu $
can be any ordinal.
Proof. We build on the notation and arguments from the previous proof. Our first aim is to show that the sequence h in (†) is arithmetically definable from
$T(\mathcal S)$
and
$\beta $
. In particular, we will confirm that (†) is correct. Let us say that a tree T is perfect if we have
$T\subseteq T([T])$
and hence
$T=T([T])$
(in particular
$[T]\neq \emptyset $
). Note that this is equivalent to the arithmetical condition that T has no leaves. Given any perfect tree T, we can apply (†) with
$\mathcal S:=[T]$
, which yields
$T=T(\mathcal S)$
. Conversely,
$T(\mathcal S)$
is perfect for any set
$\mathcal S\neq \emptyset $
of infinite
$g:M\to X$
, as we have
$\mathcal S\subseteq [T(\mathcal S)]$
. Under the assumption of (†), we can recover
$\mathcal S=[T]$
from
$T:=T(\mathcal S)$
. So (†) is equivalent to the statement that any perfect tree T has a branch
$g_{T,\beta }\in [T]$
such that
$g\mathbin {\angle }_\beta g_{T,\beta }$
fails for all
$g\in [T]$
, where
$\beta $
can be any ordinal. To construct
$g_{T,\beta }$
explicitly, we consider
$\sigma \in T$
and write it as
$\sigma :\{a_i\,|\,i<k\}\to X$
with
$a_0<\ldots <a_{k-1}$
in
$\mathbb N$
. We say that
$\sigma $
is
$\beta $
-minimal in T if the following holds for each index
$j<k$
, every number
$a^{\prime }_j\geq \sup \{a_i+1\,|\,i<j\}$
and any sequence
$\tau :\{a_i\,|\,i<j\}\cup \{a^{\prime }_j\}\to X$
in T such that we have
$\sigma (a_i)=\tau (a_i)$
for all
$i<j$
:
-
(i) we have
$p_\beta (\sigma (a_j))\leq p_\beta (\tau (a^{\prime }_j))$ ,
-
(ii) if equality holds in (i), then we have
$a_j\leq a^{\prime }_j$ ,
-
(iii) if equality holds in (i) and (ii), then the numerical code of
$\sigma (a_j)$ is smaller than or equal to the one of
$\tau (a^{\prime }_j)$ .
Note that these conditions are arithmetical. Given that T is perfect, a straightforward induction shows that it contains a unique
$\beta $
-minimal
$\sigma _k:\{a_{k,i}\,|\,i<k\}\to X$
of each length
$k\in \mathbb N$
, where we still assume
$a_{k,0}<\ldots <a_{k,k-1}$
. When we have
$j<k$
, the restriction
$\sigma _k[a_{k,j}]$
is
$\beta $
-minimal and thus equal to
$\sigma _j$
. So by uniqueness, the numbers
$a_i:=a_{k,i}$
and the values
$\sigma _k(a_i)$
do not depend on k. We now consider

The construction ensures that we have
$g_{T,\beta }\in [T]$
and that
$g\mathbin {\angle }_\beta g_{T,\beta }$
fails for any branch
$g\in [T]$
. Due to the considerations above, we obtain (†) with
$h=g_{T(\mathcal S),\beta }$
. Crucially, our presentation demonstrates that
$g_{T,\beta }:M_{T,\beta }\to X$
is arithmetically definable from T and
$\beta $
. For example, we have
$a\in M_{T,\beta }$
precisely when a lies in the domain of some or every
$\beta $
-minimal sequence of suitable finite length.
In the notation from the previous proof, let
$\mathcal S_\beta $
be the set of all bad
$h:N\to X$
with
$q(h(i))\geq q(g_\beta (i))$
and
$h(i)\ll g_\beta (i)$
for all
$i\in N\in [N_\beta ]^\omega $
. By considering the recursive construction in this proof, we see that the sequence from condition (v.
$\alpha $
) can be given as

Let us agree that
$\overline T(\mathcal S)$
denotes the complement of
$T(\mathcal S)$
, say, within the set of all finite sequences
$\sigma :a\to X$
with
$a\subseteq \mathbb N$
. We argue that
$\Pi ^1_1$
-recursion on
$\alpha $
allows for a simultaneous construction of the sequences
$g_\alpha :M_\alpha \to X$
, the numbers
$k_\alpha \in M_\alpha $
and the auxiliary sets
$\overline T(\mathcal S_\alpha )$
. Indeed, we have just seen that
$g^{\prime }_{\beta +1}$
is arithmetical in
$T(\mathcal S_\beta )$
and hence in
$\overline T(\mathcal S_\beta )$
. One can conclude that
$g_\alpha $
and
$k_\alpha $
are arithmetically definable from the previous stages of the recursion. Also, the relation
$h\in \mathcal S_\alpha $
is arithmetical with parameters
$g_\alpha $
and
$k_\beta $
for
$\beta \leq \alpha $
(which determine
$N_\alpha $
). It follows that
$T(\mathcal S_\alpha )$
is
$\Sigma ^1_1$
-definable from these parameters, due to the defining equivalence for
$T(\mathcal S)$
in the paragraph before statement (†). Hence the complement
$\overline T(\mathcal S_\alpha )$
is
$\Pi ^1_1$
-definable from the previous stages. Given that
$\Pi ^1_1$
-recursion is permitted along
$\nu $
and hence along
$\nu +3$
(in the non-trivial case of
$\nu>0$
), we can thus carry out the recursive construction from the previous proof. Once this is achieved, it remains to accommodate the inductive verification of conditions (i.
$\alpha )$
to (v.
$\alpha $
). Since the latter are
$\Pi ^1_1$
, the induction principle is justified by
$\Pi ^1_1$
-recursion (along
$1\leq \nu $
).
4 From Kruskal–Friedman theorem to
$\Pi ^1_1$
-recursion
In this section, we prove the remaining implication of Theorem 1.5, which asserts that the uniform Kruskal–Friedman theorem entails
$\Pi ^1_1$
-recursion. A sketch of the proof can be found in the introduction.
The following result allows us to work over a stronger base theory. It will be superseded by the aforementioned implication from Theorem 1.5.
Lemma 4.1. If the uniform Kruskal–Friedman theorem holds for some
$\nu>0$
, then we get arithmetical comprehension, over the base theory
$\mathsf {RCA}_0+\mathsf {CAC}$
.
Proof. Using Corollary 2.10, we can reduce the claim to the case of
$\nu =1$
, which is covered by [Reference Freund, Rathjen and Weiermann13, Lemma 4.6]. We recall the proof of the cited lemma, so that the reader can compare it with the next proof below. Let us write
$\operatorname {Seq}(Z)$
for the set of finite sequences
$\sigma =\langle z_1,\ldots ,z_n\rangle $
with entries
$z_i\in Z$
. For another element
$z_0\in Z$
and
$\sigma \in \operatorname {Seq}(Z)$
as given, we put
${z_0}^\frown \sigma :=\langle z_0,\ldots ,z_n\rangle $
. If Z is a partial order, we declare that
$\langle \rangle \leq _{\operatorname {Seq}(Z)}\sigma $
holds for all
$\sigma \in \operatorname {Seq}(Z)$
and that we have

This is a recursive characterization of the order from Higman’s lemma, which asserts that
$\operatorname {Seq}(Z)$
is a
$\mathsf {WPO}$
whenever the same holds for Z. The latter is equivalent to arithmetical comprehension, as shown by Simpson [Reference Simpson31]. To complete the proof, we show that Higman’s lemma follows from the uniform Kruskal–Friedman theorem for
$\nu =1$
. Assume Z is a
$\mathsf {WPO}$
. For each partial order X, put

Let us agree that the only inequalities in
$W[Z](X)$
are
$0\leq 0$
and
$(z,x)\leq (z',x')$
for
$z\leq _Z z'$
and
$x\leq _X x'$
. For a quasi embedding
$f:X\to Y$
, we define functions

by setting
$W[Z](f)(0):=0$
and
$W[Z](f)(z,x):=(z,f(x))$
as well as
$\operatorname {supp}_X(0):=\emptyset $
and
$\operatorname {supp}_X(z,x):=\{x\}$
. It is straightforward to check that this turns
$W[Z]$
into a normal
$\mathsf {PO}$
-dilator. To learn that we have a
$\mathsf {WPO}$
-dilator, we invoke the chain antichain principle, which ensures that the
$\mathsf {WPO}$
s are closed under taking products (see [Reference Cholak, Marcone and Solomon6]). Now consider the function

with
$\kappa (0):=\langle \rangle $
and
$\kappa (z,\sigma ):=z^\frown \sigma $
. Our recursive characterization of Higman’s order coincides with the equivalence from Definition 1.3. Hence
$\operatorname {Seq}(Z)$
and
$\kappa $
form a
$1$
-Kruskal fixed point of
$W[Z]$
. This fixed point is initial, since the usual length function
$h:\operatorname {Seq}(Z)\to \mathbb N$
validates condition (ii) of Theorem 2.8. Now the uniform Kruskal–Friedman theorem entails that
$\operatorname {Seq}(Z)$
is a
$\mathsf {WPO}$
.
We do not know if
$\mathsf {CAC}$
follows from the uniform Kruskal–Friedman theorem for fixed
$\nu $
, even though the theorem has enormous strength when
$\mathsf {CAC}$
is present. At the same time, the next result shows that
$\mathsf {CAC}$
is not needed when we admit arbitrary
$\nu $
, as the latter can then replace the order Z from the previous proof. This accounts for the weaker base theory in Corollary 1.6.
Lemma 4.2. If the uniform Kruskal–Friedman theorem holds for all
$\nu $
, then we get arithmetical comprehension (and in particular
$\mathsf {CAC}$
), over the base theory
$\mathsf {RCA}_0$
.
Proof. Let
$\omega (\nu )$
be the set of sequences
$\langle \alpha _0,\ldots ,\alpha _{n-1}\rangle $
with
$\nu>\alpha _0\geq \ldots \geq \alpha _{n-1}$
(think of Cantor normal forms). The lexicographic order on
$\omega (\nu )$
can then be characterized by
$\langle \rangle \leq _{\omega (\nu )}\sigma $
and

Arithmetical comprehension is equivalent to the statement that this order is well founded for every ordinal
$\nu $
, by a result of Girard [Reference Girard17, Section II.5] and Hirst [Reference Hirst20]. To apply the uniform Kruskal–Friedman theorem, consider the transformation

of partial orders, where we declare that
$0\leq _{W(X)}0$
and
$1+x\leq _{W(X)}1+y$
for
$x\leq _X y$
are the only inequalities in
$W(X)$
. For a quasi embedding
$f:X\to Y$
, we define
$W(f):W(X)\to W(Y)$
and
$\operatorname {supp}_X:W(X)\to [X]^{<\omega }$
by setting

It is straightforward to check that this turns W into a normal
$\mathsf {WPO}$
-dilator. By the uniform Kruskal–Friedman theorem for
$\nu $
, we get a
$\mathsf {WPO} \ X$
and a bijection
$\kappa :\nu \times W(X)\to X$
that validates the equivalence from Definition 1.3. Assuming
$\nu>0$
, we define
$f:\omega (\nu )\to X$
by the recursive clauses

One can show that
$f(\sigma )\leq _X f(\tau )$
entails
$\sigma \leq _{\omega (\nu )}\tau $
, by a straightforward induction on the combined lengths of
$\sigma $
and
$\tau $
(for
$\tau =\beta ^\frown \rho $
prove
$\rho \leq _{\omega (\nu )}\tau $
as preparation). Given that X is a
$\mathsf {WPO}$
, it follows that
$\omega (\nu )$
is well founded, as needed to get arithmetical comprehension via the cited result of Girard and Hirst.
As indicated in the introduction, we now connect the linear and the partial case. A quasi embedding
$f:X\to Y$
of a linear order X into a partial order Y can be seen as a linearization of its range. If we know that Y is a
$\mathsf {WPO}$
, then we can conclude that X is a well order. To exploit this crucial fact, we shall extend the notion from single orders to order transformations, i.e., to dilators. By a predilator D on linear orders we mean a predilator in the sense of [Reference Freund and Rathjen12, Definition 1.1] (which is the obvious linear counterpart of Definition 1.1 above). If
$D(X)$
is well founded for any well order X, we say that D is a dilator on linear orders.
Definition 4.3. Consider a
$\mathsf {PO}$
-dilator W and a predilator D on linear orders. A quasi embedding
$\eta :D\Rightarrow W$
consists of a quasi embedding
$\eta _X:D(X)\to W(X)$
for each linear order X, such that the naturality condition
$W(f)\circ \eta _X=\eta _Y\circ D(f)$
is satisfied for any embedding
$f:X\to Y$
of linear orders.
Recall that we write
$\mathcal TW[\nu ]$
to denote the initial
$\nu $
-Kruskal fixed point of a normal
$\mathsf {PO}$
-dilator W. For each predilator D on linear orders, we have a unique linear order
$\psi _\nu (D)$
that is a
$\nu $
-fixed point in the sense of [Reference Freund and Rathjen12, Definition 1.4], due to Corollary 2.2 and Theorem 2.9 from the same reference. The cited definition of
$\nu $
-fixed points in the linear case will be recalled at the beginning of the following proof. Note that a related result for the simpler case
$\nu =1$
has been shown in [Reference Freund, Rathjen and Weiermann13].
Theorem 4.4. Given a quasi embedding of a predilator D on linear orders into a normal
$\mathsf {PO}$
-dilator W, we obtain a quasi embedding of
$\psi _\nu (D)$
into
$\mathcal T W[\nu ]$
.
Proof. According to [Reference Freund and Rathjen12, Definition 1.4], the
$\nu $
-fixed point
$\psi _\nu (D)$
comes with a map

that satisfies certain conditions. In order to state the latter, we consider the support functions
$\operatorname {supp}^D_X:D(X)\to [X]^{<\omega }$
that are associated with the given predilator D (see [Reference Freund and Rathjen12, Definition 1.1]). Let the binary relation
$\vartriangleleft $
on
$Y:=\psi _\nu (D)$
be given by

By the first condition from [Reference Freund and Rathjen12, Definition 1.4], this relation
$\vartriangleleft $
must be well founded. For any
$\gamma <\nu $
, we can thus use recursion along
$\vartriangleleft $
to define
$G^D_\gamma :Y\to [D(Y)]^{<\omega }$
and simultaneously
$G_\gamma :D(Y)\to [D(Y)]^{<\omega }$
with

Now the final and crucial condition from [Reference Freund and Rathjen12, Definition 1.4] asserts that we have

It may be instructive to consider the similarity with Definition 3.2. The rest of the present proof will shed further light on the significance of
$G_\alpha $
. Let us also point out that the well foundedness of
$\vartriangleleft $
relates to criterion (ii) from Theorem 2.8. Indeed, all
$\nu $
-fixed points of dilators on linear orders are initial by [Reference Freund and Rathjen12, Proposition 2.1].
Let us write
$\eta :D\Rightarrow W$
for the given quasi embedding. As indicated in the introduction, we shall define a quasi embedding f such that the diagram

commutes, for
$Y=\psi _\nu (D)$
and
$(\nu \times g)(\alpha ,\tau )=(\alpha ,g(\tau ))$
. The point is that this diagram can be rewritten as a recursive equation. To see this, consider
$t\in \psi _\nu (D)$
and write
$\pi (t)=(\alpha ,\tau )$
. Due to the support condition from [Reference Freund and Rathjen12, Definition 1.1], we have a unique normal form

where
$\iota _a:a\hookrightarrow Y$
denotes the inclusion. In view of
$\eta _Y\circ D(\iota _a)=W(\iota _a)\circ \eta _a$
, our diagram commutes precisely when we have

As
$s\in a$
means
$s\vartriangleleft t$
, this equation can be read as a recursive clause, which is satisfied by at most one f. Concerning existence, we note that our clause can only be applied when
$f\circ \iota _a$
is a quasi embedding, so that
$W(f\circ \iota _a)$
is defined. To complete the recursive definition regardless, we agree to assign some default value
$f(t)\in \mathcal T W[\nu ]$
in the hypothetical case that
$f\circ \iota _a$
is no quasi embedding.
We want to show that this hypothetical case does indeed not occur and, in doing so, that f is a quasi embedding. For this purpose, we first use recursion over
$\vartriangleleft $
to define a length function
$l:\psi _\nu (D)\to \mathbb N$
with

We shall now use induction on
$l(t_0)+l(t_1)$
to prove

Let us write
$\pi (t_i)=(\alpha _i,\tau _i)$
and
$\tau _i\mathbin {=_{\operatorname {NF}}} D(\iota _{a(i)})(\tau ^{\prime }_i)$
. By the induction hypothesis, f is a quasi embedding on
$a(0)\cup a(1)$
. In particular, we are not in the hypothetical case that was mentioned above, so that
$f(t_i)$
is defined by the intended recursive clause. This means that
$f(t_0)\leq f(t_1)$
amounts to

By the equivalence from Definition 1.3, we must have
$\alpha _0\leq \alpha _1$
. If the inequality is strict, we get
$\pi (t_0)<\pi (t_1)$
in the linear order
$\nu \times D(Y)$
and hence
$t_0<_Y t_1$
in Y, as
$\pi $
is an embedding by [Reference Freund and Rathjen12, Definition 1.4]. In the following, we assume
$\alpha _0=\alpha _1$
. Let us begin with the case where
$f(t_0)\leq f(t_1)$
holds by the first disjunct in the equivalence from Definition 1.3, so that we have

We factor the inclusions
$\iota _{a(i)}$
as in the commutative diagram

As noted above, the induction hypothesis ensures that
$f\circ \iota $
is a quasi embedding. We can thus form
$W(f\circ \iota )$
and compute

Since
$W(f\circ \iota )\circ \eta _{a(0)\cup a(1)}$
is a quasi embedding while
$D(\iota )$
is an embedding, we get

Once again this yields
$\pi (t_0)\leq \pi (t_1)$
and hence
$t_0\leq t_1$
. We now consider the case where the second disjunct from Definition 1.3 applies, which means that we have

Here we write
$\operatorname {supp}^W_X:W(X)\to [X]^{<\omega }$
for the support functions that come with the
$\mathsf {PO}$
-dilator W. As explained in the paragraph after Definition 1.1, the supports are uniquely determined by W as a functor. It follows that any quasi embedding such as
$\eta :D\Rightarrow W$
respects supports, as shown in [Reference Freund, Rathjen and Weiermann13, Lemma 4.2]. Together with naturality, we obtain

We thus get
$f(t_0)\leq f(t)$
for some
$t\in \operatorname {supp}^D_Y(\tau _1)=a(1)$
. Let us put

Alternatively, one could take c to be the transitive closure of
$a(0)\cup a(1)$
under
$\vartriangleleft $
(which can be seen as a subterm relation). Write
$\iota ':c\hookrightarrow Y$
for the inclusion and observe that
$f\circ \iota '$
is a quasi embedding by induction hypothesis (due to the factor
$2$
in the definition of l). We will show that any
$s\in c$
validates the following.
Whenever we have
$\kappa (\gamma ,\rho )\leq _{\mathcal T W[\nu ]} f(s)$
, we get

for some
$\sigma \in D(c)$
with
$D(\iota ')(\sigma )\in G^D_\gamma (s)$
.
Before we prove this statement, we show how the desired inequality
$t_0\leq t_1$
follows. To this end, we apply (§) with
$\kappa (\gamma ,\rho ):=f(t_0)$
and
$s:=t\in a(1)\subseteq c$
from above. Let us extend our earlier diagram of inclusions into

Due to the definition of
$f(t_0)$
, we get
$\gamma =\alpha _0=\alpha _1=:\alpha $
as well as

For
$\sigma $
as provided by (§), we can derive

In view of the characterization of
$\operatorname {rng}(\pi )\ni \pi (t_1)=(\alpha ,\tau _1)$
from [Reference Freund and Rathjen12, Definition 1.4], which was recalled above, we also have

We obtain
$\tau _0\leq D(\iota ')(\sigma )<\tau _1$
and hence
$\pi (t_0)<\pi (t_1)$
and
$t_0<t_1$
.
To complete the proof, we establish (§) by induction on
$s\in c$
in the well order
$\vartriangleleft $
. Let us write
$\pi (s)=(\beta ,\sigma _0)$
with
$\sigma _0\mathbin {=_{\operatorname {NF}}} D(\iota _b)(\sigma ^{\prime }_0)$
, so that the premise of (§) reads

Analogous to the above, we factor
$\iota _b=\iota '\circ \iota ^{\prime }_b:b\hookrightarrow Y$
with
$\iota _b':b\hookrightarrow c$
. In view of Definition 1.3, we have
$\gamma \leq \beta $
and one of the following two cases applies. First assume that we have

Then (§) is valid with
$\sigma :=D(\iota ^{\prime }_b)(\sigma ^{\prime }_0)$
, as
$\pi (s)=(\beta ,\sigma _0)$
and
$\beta \geq \gamma $
yield

In the remaining case from Definition 1.3, we have

where equality is shown as above. We get
$\kappa (\gamma ,\rho )\leq f(s')$
for some
$s'\in \operatorname {supp}^D_Y(\sigma _0)$
. Since the latter entails
$s'\vartriangleleft s$
and in particular
$s'\in c$
, we inductively obtain an element
$\sigma \in D(c)$
with
$\rho \leq W(f\circ \iota ')\circ \eta _c(\sigma )$
and

as needed to complete the proof of (§).
Let us deduce the remaining direction of Theorem 1.5 from the introduction.
Theorem 4.5. The uniform Kruskal–Friedman theorem for any infinite ordinal
$\nu $
entails
$\Pi ^1_1$
-recursion along the same
$\nu $
, over the base theory
$\mathsf {RCA}_0+\mathsf {CAC}$
.
Proof. According to [Reference Freund and Rathjen12, Theorem 1.6], the principle of
$\Pi ^1_1$
-comprehension along
$\nu $
reduces to the claim that every dilator D on linear orders has a well founded
$\nu $
-fixed point in the sense of [Reference Freund and Rathjen12, Definition 1.4]. We note that the restriction to infinite
$\nu $
is inherited from the cited theorem (but see [Reference Freund and Rathjen12, Corollary 4.4] for the case of externally fixed
$\nu <\omega $
). To establish the claim about
$\nu $
-fixed points, we assume the uniform Kruskal–Friedman theorem for
$\nu $
and consider an arbitrary dilator D on linear orders. One can find a quasi embedding

into a normal
$\mathsf {WPO}$
-dilator W, as shown in [Reference Freund, Rathjen and Weiermann13, Section 5] (where dilators on linear orders are called
$\mathsf {WO}$
-dilators). The cited reference uses arithmetical comprehension, which we can accommodate by Lemma 4.1 above. We note that substantial work goes into the proof that W preserves
$\mathsf {WPO}$
s (see [Reference Freund, Rathjen and Weiermann13, Theorem 5.11]). Due to Theorem 4.4 above, we can convert
$\eta $
into a quasi embedding

where the linear order
$\psi _\nu (D)$
is the unique
$\nu $
-fixed point of D while the partial order
$\mathcal T W[\nu ]$
is the initial
$\nu $
-Kruskal fixed point of W (see Section 2 of [Reference Freund and Rathjen12] and of the present article, respectively). So any infinite sequence
$t_0,t_1,\ldots $
in
$\psi _\nu (D)$
gives rise to a sequence
$f(t_0),f(t_1),\ldots $
in
$\mathcal T W[\nu ]$
. The latter is a
$\mathsf {WPO}$
by the uniform Kruskal–Friedman theorem. Thus there must be
$i<j$
with
$f(t_i)\leq f(t_j)$
and hence
$t_i\leq t_j$
, which allows us to conclude that
$\psi _\nu (D)$
is a well order. As indicated at the beginning of this proof, we can now invoke [Reference Freund and Rathjen12, Theorem 1.6] to secure
$\Pi ^1_1$
-recursion along
$\nu $
.
We have just shown that (i) implies (ii) in Theorem 1.5. To conclude this article, we recall how the converse implication is obtained by combining previous results: From Theorem 3.7, we know that
$\Pi ^1_1$
-comprehension along
$\nu $
implies Kříž’s minimality principle for
$\nu $
. The latter entails the uniform Kruskal–Friedman theorem for
$\nu $
, due to Theorem 3.6. The circle of implications shows that
$\Pi ^1_1$
-recursion is equivalent not only to the uniform Kruskal–Friedman theorem but also to Kříž’s minimality principle (up to the paragraph before Theorem 3.7), which is an interesting result in its own right.
Funding Statement
Funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation)—Project number 460597863.