1 Introduction
There is no doubt that the continuum problem is one of the driving forces of set theory. The attempts to determine the cardinality of the continuum has accompanied the history of set theory, from its very beginning to the present day, and motivated many of its most significant advances. Cantor’s definition of the Perfect Set Property was prompted by a partial solution to the continuum problem; more in general, the initial developments of descriptive set theory were also driven by an attempt to confirm the continuum hypothesis ( ${\mathsf {CH}}$ : $2^{\aleph _0}=\aleph _1$ ) at least for the definable subsets of $\mathbb {R}$ . In more recent times, Gödel’s constructible universe and Cohen’s method of forcing were devised to show the independence of ${\mathsf {CH}}$ from $\mathsf {ZFC}$ (the standard first order axiomatization of set theory).
The techniques developed by Gödel and Cohen clarified the intrinsic limitations of the axiomatic approach to the continuum problem, and profoundly influenced the subsequent development of set theory. As a matter of fact, after the 1960s, the many independence results obtained combining the methods of inner models and forcing partially shifted the main focus of set theory: set theorists progressively devoted more and more efforts to the study of the models of set theory, and only derivatively to the study of sets (understood as autonomous mathematical objects).
The present paper presents a new approach to the quest of a solution of the continuum problem and—more generally—to the search of new set theoretic truths; this approach stems from a model-theoretic perspective on set theory, and aims to clarify the philosophical import of the results in [Reference Venturi and Viale38, Reference Viale41]. As is standard, we recognize the intrinsic limitations of $\mathsf {ZFC}$ in capturing set-theoretic validities, but, to overcome these limitations, we bring forward the idea that a powerful criterion to validate a new axiom is given by the nice model theoretic properties of the first order presentations of set theory which include it.
The central notion we borrow from model theory and that guides the present approach is that of model companionship. This concept was developed by Abraham Robinson in the 1960s, and it is meant to capture, in an abstract setting, the closure properties that algebraically closed fields display with respect to commutative rings with no zero divisors. As algebraically closed fields contain solutions for all Diophantine equations, the models of the model companions of set theory will have solutions for all “simple” set theoretical problems.
A fundamental property of model companionship is its dependence on the signature.Footnote 1 Loosely speaking: there can be distinct first order axiomatizations of a mathematical theory T, one in signature $\sigma $ and the other in signature $\tau $ , such that T admits a model companion when axiomatized according to $\sigma $ , but does not when axiomatized according to $\tau $ .Footnote 2
The dependence on the signature of model companionship motivates a detailed discussion aimed to single out the relevant signatures for set theory. We will start by clarifying some fundamental aspects of set-theoretical practice. In particular, we will explain why bounded formulae (i.e., $\Delta _0$ -formulae) express set-theoretic concepts of low complexity and therefore should be included in any reasonably rich signature for set theory. We will then show how a careful choice of the signatures for set theory allows one to classify the complexity of set-theoretic concepts and to clarify the informal notion of “simplicity of a set-theoretic assertion.” Our analysis will provide precise criteria able to link logic complexity, forcing invariance, and simplicity of set-theoretic concepts. Once we have outlined clear criteria to select signatures, we will study set theory under different expansions of $\{\in \}$ . By varying the signature, we will show that the corresponding model companions of set theory describe possible theories of the structures $H_{\kappa ^+}$ , for $\kappa $ an infinite cardinal.Footnote 3 Moreover, given an appropriate signature $\tau _\kappa $ for set theory (relative to a cardinal $\kappa $ ), the corresponding $\tau _{\kappa }$ -theory of $H_{\kappa ^+}$ will describe a structure which is closed-off with respect to specific constraints. Roughly speaking: much alike an algebraically closed field K contains solutions to all polynomial equations with coefficients in K, the $\tau _\kappa $ -structures realizing the model companion of set theory in signature $\tau _\kappa $ will make true all $\Sigma _1$ -properties for $\tau _\kappa $ (with the relevant parameters from $H_{\kappa ^+}$ ) which are not “outward inconsistent.”
A precise syntactic description of this closing-off procedure will be given by the notion of absolute model companionship (AMC). We defer to Section 5.2 a thorough presentation of this notion, we can briefly describe it right away by saying that AMC describes those $\tau $ -theories S for which a model companion exists and is axiomatized by the $\Pi _2$ -sentences for $\tau $ which are consistent with the universal and existential fragment of any completion of S.Footnote 4
Building on the properties of AMC, the sought solution to the continuum problem can be presented as follows:
-
(i) $\neg {\mathsf {CH}}$ and even a definable versionFootnote 5 $\theta $ of $2^{\aleph _0}=\aleph _2$ can be expressed by $\Pi _2$ -formulae in any signature for set theory which contains the $\Delta _0$ -properties among its atomic formulae and has a parameter for .
-
(ii) For each infinite (and definable) cardinal $\kappa $ there is at least one (actually several) signature(s) $\tau _\kappa $ containing the $\Delta _0$ -properties among its atomic formulae, a constant to interpret the cardinal $\kappa $ , and such that set theory axiomatized in the corresponding $\tau _\kappa $ admits an AMC.Footnote 6
-
(iii) Given any signature $\tau $ for set theory, containing the $\Delta _0$ -properties among its atomic formulae, any first order $\tau $ -axiomatization T of set theory admitting an AMC S for $\tau $ will not have neither ${\mathsf {CH}}$ nor $\neg \theta $ among the $\tau $ -theorems of S, unless T models ${\mathsf {CH}}$ or $\neg \theta $ (furthermore for many choices of $\tau $ and T, even if T models ${\mathsf {CH}}$ and/or $\neg \theta $ , S will not).
-
(iv) There is at least one (recursive and naturalFootnote 7 ) signature $\tau ^*$ for set theory which contains the $\Delta _0$ -properties among its atomic formulae and such that all $\tau ^*$ -axiomatizations T of any set theory enriched with large cardinal axiomsFootnote 8 admit an AMC S for $\tau ^*$ that contains $\neg {\mathsf {CH}}$ and $\theta $ among its axioms. Furthermore S describes the $\tau ^*$ -theory of $H_{\aleph _2}$ in models of strong forcing axioms.
Therefore, these results single out (a definable version of) $2^{\aleph _0}=\aleph _2$ as the unique solution of the continuum problem which can fall in at least one AMC of set theory enriched with large cardinal axioms (for example the AMC with respect to signature $\tau ^*$ of item (iv) above).Footnote 9
The present model-theoretic approach to set theoretic validities complements well standard strategies for producing axiomatic extensions of $\mathsf {ZFC}$ . Actually we see our approach as an implementation of Gödel’s program. Given the stratification of V as the increasing union of its initial segments provided by the various $H_{\kappa ^+}$ (as $\kappa $ ranges over the infinite cardinals), we can interpret the AMCs of set theory given by item (ii) as tentative descriptions of the theories of these $H_{\kappa ^+}$ , theories which coherently extend each other.Footnote 10 Assuming this perspective, these $\tau _\kappa $ -theories of $H_{\kappa ^+}$ are gradually closing-off the universe of sets with respect to all “simple” set theoretical problems (with each $\tau _\kappa $ giving its own description of what is a “simple” set theoretic concept, and with each $\tau _\kappa $ making more and more set theoretic concepts becoming “simple” as $\kappa $ increases). In this sense, the nice model-theoretic properties displayed by the theories of $H_{\kappa ^+}$ (when they describe possible model companions of set theory) realize a form of maximality that we dare to call Hilbertian completeness. As is well-known, Hilbert’s axiomatization of geometry expresses a form of maximality for geometrical objects, by stating that the domain of geometry could not be extended if it were to preserve the axioms of the Grundlagen der Geometrie. Similarly, the closing-off processes that each model companionship result carries along can (and will) be described as a way to assert the maximization of set-theoretical objects. Moreover, important axiomatic extensions of $\mathsf {ZFC}$ (considered candidates for the realization of Gödel’s programFootnote 11 ) like large cardinals and forcing axioms find, in the present approach, a further justification. As a matter of fact, the generic absoluteness results that we can obtain from large cardinal hypotheses are now of pivotal importance in detecting the suitable signature expansions able to allow the existence of a(n absolute) model companion for set theory. Another surprising fact which shows that model companionship results and forcing axioms complement each other is that the theory of $H_{\aleph _2}$ in models of strong forcing axioms describes the (absolute) model companion of $\mathsf {ZFC}+$ large cardinals (as expressed in the natural signature $\tau ^*$ of item (iv) above); conversely the existence of a model companion for the theory of V in this signature $\tau ^*$ already implies that a strong form of bounded Martin’s maximum (which is one of the strongest forcing axioms) and Woodin’s axiom $(*)$ both hold in V.
The paper is structured as follows: in Sections 2 and 3 we gently analyze the role that signatures can play in outlining the properties of a mathematical theory T (with a focus on how the morphisms between models of T can be used to detect the right signatures for T). In Section 4 we present a few standard notions from model theory (i.e., existentially closed structures, model completeness, and model companionship), which are discussed and generalized in Section 5, where we define the notions of partial Morleyization, absolute model companionship, and (absolute) model companionship spectrum. From Section 6 onwards we shift our attention to set theory. We first present precise mathematical criteria to detect what are the “simple” set-theoretical concepts. Then we analyze various signatures for set theory in Section 6.1. In Section 6.2 we show that the intended models of the model companions of set theory are structures of the form $H_{\kappa ^+}$ , for $\kappa $ an infinite cardinal. Finally, in Section 7 we present the main results regarding the model companions of set theory and discuss the information they convey on the cardinality of the continuum. We conclude, in Section 8, by comparing our approach with the various notions of maximality we find in the literature and with the debate on the justification of new axioms in set theory.
This paper aims to reach scholars with interests at the crossroads of philosophy and mathematics. We tried for this reason to minimize the mathematical and philosophical prerequisites needed to follow it. Those who made through this introduction should encounter no serious obstacles to read the remainder of this article. The model companionship results we discuss in the present paper are presented without proofs. The reader interested in them is referred to [Reference Venturi and Viale38, Reference Viale41].
Clearly the opinions expressed (and the errors occurring) in this article are the sole responsibility of the authors and do not extend in any way to those who had the patience to give some advice to improve our presentation.
2 What is the right signature for a mathematical theory?
One of the great successes of mathematical logic consists in providing an efficient formalization of mathematics: by means of first order logic it is possible to render mathematical theories the objects of a mathematical investigation. In this way logic is able to produce unexpected and non-trivial mathematical results as well as novel insights on a variety of mathematical fields. It is a matter of facts that there can be many distinct first order formalizations of a mathematical theory: varying the linguistic presentation of a theory (i.e., its signature) we obtain different axiomatic presentations of the same set of theorems. In order to appreciate the variety of possibilities we can encounter, let us consider the concrete case of group theory. This will help us to gently introduce one of the main themes of the present present: the role of the signature in detecting the properties of a theory formalized in it.
We can formalize group theory in first order logic using the signature $\left \lbrace \cdot \right \rbrace $ , which simply consists of a binary function symbol and the following axioms:
Notice that the third axiom, which expresses the existence of a multiplicative inverse, represents a rather complicated assertion, both from the point of view of its syntactic readability and of its Lévy complexity (being a $\Pi _3$ -sentence). The reason is that in this basic signature we lack a constant symbol to denote the neutral element of a group. Enriching the language to $\left \lbrace \cdot ,e \right \rbrace $ , with e a constant symbol, we can now formalize group theory with a simpler set of axioms:
We increased readability and decreased (Lévy) complexity, as we are now dealing with a $\Pi _2$ -axiomatization. But we can do better. Indeed, if we consider the signature $\left \lbrace \cdot ,e,{}^{-1} \right \rbrace $ , which further adds a unary operation symbol for the inverse operation, we can axiomatize group theory with a set of universal equations ( $\Pi _1$ -sentences):
On the other hand, we could follow a completely different route and axiomatize group theory avoiding the use of function symbols. For example we can consider the signature $\left \lbrace R,e \right \rbrace $ consisting of a ternary relation symbol R and a constant symbol e and produce the following axiomatization for group theory:
At the cost of further complicating our axiomatization, we could even drop the use of the constant symbol e and formalize group theory in the signature $\{R\}$ . The latter is clearly an artificial solution; moreover, the minimality of the signature does not help the perspicuity of the axiomatization.
Of all the above formalizations, the one mathematicians use more frequently is certainly the one in signature $\left \lbrace \cdot ,e,{}^{-1} \right \rbrace $ . On the contrary, to recognize that the system in signature $\left \lbrace R,e \right \rbrace $ is an axiomatization of group theory would surely require a bit of thought.
These considerations suggest that among the many possible signatures in which we can formalize a mathematical theory some are better than others. Our aim is to unfold criteria which allow us to detect the best signatures $\tau $ for the formalization of a mathematical theory T. To do so we appeal to two sets of arguments.Footnote 12
-
(intCrit) On the one hand, we can select $\tau $ on the basis of specific considerations internal to T. For example, by checking the adherence of a $\tau $ -formalization of T to the standard informal practice (e.g., the signature $\left \lbrace \cdot , e,{}^{-1} \right \rbrace $ clearly gives the best presentation of group theory in terms of its basic operations).
-
(AbsCrit) On the other hand, we can give abstract criteria for the choice of $\tau $ based on the structural properties that the $\tau $ -axiomatization of T displays, disregarding any consideration on the adherence of the $\tau $ -axiomatization of T with the informal one.
It occurs for many T that the reasons internal to T for choosing a signature $\tau $ for its axiomatization cohere with the abstract criteria, and that the two approaches produce the same outcome. This is desirable as abstract criteria are easily expressible with a sufficient degree of mathematical precision: we only need to define precisely what structural properties are preferable; then for a given theory T we can select the signatures $\sigma $ for which the $\sigma $ -axiomatization of T has the preferred property. On the other hand it is less transparent how to give a precise mathematical formulation of the reasons internal to T for choosing a signature. For example, which mathematical criteria allow us to recognize $\left \lbrace \cdot ,e \right \rbrace $ as a better signature than $\left \lbrace R,e \right \rbrace $ for the formalization of group theory? For example both give $\Pi _2$ -axiomatizations,Footnote 13 although it is clear that any mathematician would regard the $\left \lbrace \cdot ,e \right \rbrace $ -axiomatization as more natural than the one given in $\left \lbrace R,e \right \rbrace $ . Can we turn this qualitative preference into a precise mathematical criterion?
3 Simple and complicated concepts for a mathematical theory
When dealing with a first order theory T in a signature $\tau $ , the $\tau $ -formulae naturally correspond to the concepts/notions T intends to formalize. The well-known Lévy complexity then gives a measure gauged by $\tau $ of the complexity of a concept/notion of T by analyzing the number and patterns of the quantifiers that appear in a $\tau $ -formula which should formalize it. The Lévy hierarchy of the $\tau $ -formulae with respect to T-equivalence stratifies the concepts/notions of T as follows: the basic notions are those formalized by a boolean combination of atomic $\tau $ -formulae. The complexity of a notion then increases according to the number of alternations of $\forall ,\exists $ quantifiers that a formula $\phi $ formalizing it displays when expressed in a prenex normal form. In assigning a Lévy complexity to a notion of T, we consider, among all the $\tau $ -formulae which are T-equivalent and formalize it, those in prenex normal form with the least number of quantifier alternations. In this way we can assign complexity $\Pi _n$ , $\Sigma _n$ or $\Delta _n$ to the notions of T that are expressible by means of $\tau $ -formulas. More precisely, $\Pi _0=\Sigma _0=\Delta _0$ is the complexity of notions formalized by boolean combinations of atomic formulae; $\Pi _{n+1}$ represents the complexity of those notions formalized by a formula of type $\forall \vec {x}\,\psi (\vec {x},\vec {y})$ with $\psi (\vec {x},\vec {y}) \Sigma _n$ ; $\Sigma _{n+1}$ represents the complexity of those notions formalized by a formula of type $\exists \vec {x}\,\phi (\vec {x},\vec {y})$ with $\phi (\vec {x},\vec {y}) \Pi _n$ , and $\Delta _n$ represents the complexity of the notions whose complexity lies in $\Pi _n\cap \Sigma _n$ . However, Levy complexity for a notion of T does not always match with the intuitive complexity we attribute in practice to it, and is clearly dependent on the signature $\tau $ we use to express it. For example the process of Morleyization (see [Reference Tent and Ziegler34, sec. 3.2] or Definition 5.5) produces a mathematically equivalent axiomatization $T^*$ of a first order $\tau $ -theory T in a new signature $\tau ^*\supseteq \tau $ , such that every $\tau ^*$ -formula is $T^*$ -equivalent to an atomic formula; hence all notions of T become of the same Levy complexity when expressed in signature $\tau ^*$ according to the mathematically equivalent theory $T^*$ .
To better understand the relevance of the signature in describing the correct logical complexity of the notions of a theory, let us examine the case of the theory of commutative semi-rings with no zero-divisors. Consider the signature $\left \lbrace +,\cdot ,0,1 \right \rbrace $ , which is standardly used to axiomatize rings and fields. In this signature we can provide the axioms of commutative semi-rings with no zero-divisors by means of the following universal sentences:
By further extending the above list we can obtain the theory of commutative rings with no zero-divisors (e.g., integral domains), adding the $\Pi _2$ -axiom
and the theory of fields by further adding the $\Pi _2$ -axiom
and finally the theory of algebraically closed fields by supplementing the above axioms with the following $\Pi _2$ -sentencesFootnote 14 for all $n\geq 1$
A common feature of the above set of $\Pi _2$ -axioms is that they assert the existence of solutions to certain basic equations of the theory expressible in terms of sum, multiplication, $0$ , $1$ , i.e., the atomic formulae of $\left \lbrace \cdot ,+,0,1 \right \rbrace $ . Indeed, any commutative semi-ring $\mathcal {M}$ without zero divisors can be extended to an algebraically closed field simply by extending it with solutions to the basic polynomial equations with parameters in $\mathcal {M}$ (and which may not exist in $\mathcal {M}$ ).
In the signature $\left \lbrace +,\cdot ,0,1 \right \rbrace $ we have as basic operations $+,\cdot $ . Using this signature for integral domains we can also define the additive inverse, $-$ , using the atomic formula $(x+z=0)$ , while for fields we can add the multiplicative inverse, ${}^{-1}$ , using the boolean combination of atomic formulae $(x=0\wedge z=0)\vee (x\neq 0\wedge x\cdot z=1)$ . Notice that even when subtraction and division are partially defined on a semiring, we can still meaningfully interpret these two operations in it, just by adding two unary function symbols $-,{}^{-1}$ for them and by adopting the convention that the corresponding inverse operations are trivially defined on the non invertible elements; this is captured for example by the axioms
to interpret ${}^{-1}$ , and
to interpret $-$ .
It is clear that the class of $\left \lbrace +,\cdot ,0,1,-,{}^{-1} \right \rbrace $ -structures satisfying axioms (1), (5), and (6) are exactly the commutative semi-rings with no zero-divisors, however, expanding the signatures and introducing the new axioms which correctly interpret the new symbols, we perturbed the notion of morphism. To see this, notice that the inclusion of $\mathbb {N}$ into $\mathbb {Z}$ is a $\left \lbrace +,\cdot ,0,1 \right \rbrace $ -morphism but not a $\left \lbrace +,\cdot ,0,1,-,{}^{-1} \right \rbrace $ -morphism as ${-}(2)=0$ when computed in $\mathbb {N}$ seen as a model of (1), (5), and (6), while ${-}(2)= {-}2$ when computed in $\mathbb {Z}$ seen as a model of (1), (5), and (6). Similarly the inclusion of $\mathbb {Z}$ into $\mathbb {Q}$ is a $\left \lbrace +,\cdot ,0,1,- \right \rbrace $ -morphism but not a $\left \lbrace +,\cdot ,0,1,-,{}^{-1} \right \rbrace $ -morphism.
Moreover, we can observe that in the class of integral domains as formalized in signature $\left \lbrace +,\cdot ,0,1,- \right \rbrace $ subtraction is axiomatized now by the universal sentence $\forall x\,[x+(-x)=0]$ rather than the $\Pi _2$ -sentence (6), which defines it in the larger class of commutative semirings. Similarly, if we consider the class of $\left \lbrace +,\cdot ,0,1,-,{}^{-1} \right \rbrace $ -structures which are fields, i.e., models of (1)–(3), then (5) becomes logically equivalent, modulo the other axioms, to $\forall x\,[x\neq 0\rightarrow x\cdot x^{-1}=1]$ , which is a universal $\left \lbrace +,\cdot ,0,1,-,{}^{-1} \right \rbrace $ -sentence. On the algebraic side note that the $\left \lbrace \cdot ,+,0,1 \right \rbrace $ -morphisms between integral domains naturally extend to $\left \lbrace \cdot ,+,0,1,- \right \rbrace $ -morphisms, as the operation $x\mapsto -x$ is preserved by additive morphisms on rings. Similarly the $\left \lbrace \cdot ,+,0,1 \right \rbrace $ -morphisms between fields naturally extend to $\left \lbrace +,\cdot ,0,1,-,{}^{-1} \right \rbrace $ -morphisms, as the operation $x\mapsto x^{-1}$ is preserved by additive and multiplicative morphisms between fields.
More generally, when a $\tau $ -theory T can define an operation by means of universal $\tau $ -equations, the $\tau $ -morphisms between $\tau $ -models of T preserve the operation. In our concrete example, $\left \lbrace +,\cdot ,0,1 \right \rbrace $ -morphisms between integral domains naturally extends to $\left \lbrace +,\cdot ,0,1,- \right \rbrace $ -morphisms, while $\left \lbrace +,\cdot ,0,1 \right \rbrace $ -morphisms between fields naturally extends to $\left \lbrace +,\cdot ,0,1,-,^{-1} \right \rbrace $ -morphisms exactly because the operations $-,^{-1}$ are defined by universal equations in the respective theories.
These examples suggest the following observation: when considering the first order axiomatization of a theory, we should carefully consider not only its class of models, but also the class of morphisms between these models. Dealing with the theory of commutative semi-rings with no zero-divisors, the notion of additive inverse has a complexity which exceeds that of $+$ and $\cdot $ . We can detect this by noticing that the class of morphisms between these structures shrinks when we impose the preservation of this operation. On the other hand, in the context of commutative rings, the notion of additive inverse has the same complexity of $+,\cdot $ . But, again, this is not the case for the notion of multiplicative inverse: we need to focus on the theory of field in order to regard it as a concept with the same complexity as addition and multiplication. We can sum up these considerations as follows:
-
• The signature $\left \lbrace +,\cdot ,0,1 \right \rbrace $ is suitable for commutative (semi)rings with no zero-divisors, and fields.
-
• The signature $\left \lbrace +,\cdot ,0,1,- \right \rbrace $ is better suited for commutative rings with no zero-divisors, and fields.
-
• The signature $\left \lbrace +,\cdot ,0,1,-,{}^{-1} \right \rbrace $ is better suited (only) for fields.
The algebraic case we discussed is instructive and guides us towards more general considerations on the interplay between the complexity of concepts/notions of a theory T, the first order language $\tau $ in which to express them, and the properties of the $\tau $ -morphisms between models of T. First of all, the Lévy hierarchy gives a potentially useful hierarchy by which we can stratify the complexity of concepts/notions of a theory; furthermore our discussion suggests that for a given a signature $\tau $ , the basic concepts of a $\tau $ -theory T are not only those expressed by (boolean combinations of) atomic $\tau $ -formulae, but also those expressed by universal $\tau $ -equations (e.g., the additive inverse is a simple operation for ring theory, being axiomatized in signature $\left \lbrace +,\cdot ,0,1,- \right \rbrace $ by universal quantification on an equation, while it is not for semi-ring theory as axiomatized in signature $\left \lbrace +,\cdot ,0,1,- \right \rbrace $ ). The reason is that if $\mathcal {M}\sqsubseteq \mathcal {N}$ are $\tau $ -structures which model the same $\tau $ -theory T, the graph of a relation/operation $R^{\mathcal {M}}$ defined on $\mathcal {M}$ as prescribed by the universal axiom $\psi _R$ of T is exactly the restriction to $\mathcal {M}$ of the graph of that same relation/operation $R^{\mathcal {N}}$ defined now on $\mathcal {N}$ as prescribed by $\psi _R$ (in our example, $-$ is preserved by $\left \lbrace +,\cdot ,0,1,- \right \rbrace $ -morphisms of rings but not by $\left \lbrace +,\cdot ,0,1,- \right \rbrace $ -morphisms of semi-rings).
Summing up, when dealing with a mathematical theory T for which we have a clear picture of its intended class of models, the choice of a signature $\tau $ in which to formalize T can be made on the basis of which morphism between models of T we would like to be the $\tau $ -morphisms between the $\tau $ -models of T, or, correspondingly, on the basis of which concepts of T we would like to be of low logical complexity in the Lévy hierarchy induced by $\tau $ on T. Once this decision is made, this affects the type of signature $\tau $ in which the theory can be formalized, what are the $\tau $ -morphisms between the T-models, and what are the universal $\tau $ -axioms of T. Therefore, we can get different stratifications of the complexity of concepts for a theory T by selecting the appropriate class of morphisms between its models and by selecting which concepts of T we require to be axiomatized by universal sentences.
4 Existentially closed models, model completeness, and model companionship
It is now the time to introduce the notion of existentially closed model and of model companionship.Footnote 15 These model-theoretic notions will be the structural properties of first order theories we will use to select good signatures for set theory. Before giving the precise mathematical definitions of these notions, let us start by introducing these concepts in the context of (semi-)ring theory.
A common trait of the set of axioms (2)–(4) is that they assert the existence of solutions to certain basic equations that are expressible by atomic formulae in the $\left \lbrace +,\cdot ,0,1 \right \rbrace $ -theory given by axiom (1). Indeed, any commutative (semi-)ring $\mathcal {M}$ without zero divisors can be enlarged to an algebraically closed field simply by extending it with solutions to the basic polynomial equations with parameters in $\mathcal {M}$ (which in principle may not exist in $\mathcal {M}$ ). Notice that the process of adding new solutions to a commutative (semi-)ring can be seen as closing it with respect to its basic operations. It is here that the choice of a signature $\tau $ for the formalization of a theory T becomes relevant: the richer $\tau $ is, the more complex are the concepts expressible by atomic $\tau $ -formulae (expressing the basic operations and relations) and universal $\tau $ -sentences (expressing the basic properties of the T-definable relations and operations). Therefore, the richer $\tau $ is, the more closed-off are the models of a universal $\tau $ -theory T that have solutions for the atomic $\tau $ -formulae with parameters in the model.
This closing-off process is one of the driving forces of mathematics. For example it is by adding new solutions to basic equations that commutative semi-rings with no zero-divisors (like $\mathbb {N}$ ) have been extended to commutative rings (like $\mathbb {Z}$ , which contains all additive inverses), and then to fields (like $\mathbb {Q}$ , which contains all multiplicative inverses), and finally to algebraically closed fields (like $\mathbb {C}$ , in which all diophantine equations have a solution).
Of course, the choice of the signature $\tau $ in which a theory T is formalized affects the outcome of this closing-off process. A basic request is that the $\tau $ -theory $T'$ describing the validities in the closed-off structures, should agree with T at least with respect to the $\Pi _1$ -consequences of T in signature $\tau $ (as the latter express the properties of the basic concepts and operations of T according to $\tau $ ). Indeed, only in this case we can say that the closing-off process has been performed with respect to the basic concepts and operations of T. The notion of existentially closed model defines precisely which structures are the outcome of this closing-off process.
Definition 4.1 Let $\tau $ be a first order signature and T be a $\tau $ -theory. A $\tau $ -structure $\mathcal {M}$ is T-ec if:
-
• There is some $\mathcal {N} \tau $ -superstructure of $\mathcal {M}$ which models T.
-
• $\mathcal {M}$ is a $\Sigma _1$ -substructure of $\mathcal {P}$ whenever $\mathcal {P}$ is a $\tau $ -superstructure of $\mathcal {M}$ which models T.
A caveat is in order: while a mathematical theory T can be unambiguously defined regardless of any of its possible first order axiomatizations, to define unambiguously the notion of T-ec model we commit ourselves to choosing a specific signature $\tau $ in which to formalize T.
Notation 4.2 Given a signature $\tau $ and a $\tau $ -theory T, $T_\forall ^\tau $ is the set of universal $\tau $ -sentences which follows from T (similarly we define $T^\tau _{\exists }$ , $T^\tau _{\forall \exists }$ , etc.).
While the set of logical consequences of T is independent of the signature in which we axiomatize it, the set $T_\forall ^\tau $ heavily depends on the choice of $\tau $ .
Notice that any T-ec $\tau $ -structure is a model of $T^\tau _\forall $ . A less trivial observation is the following.
Fact 4.3 [Reference Viale43, fact 8] Given a $\tau $ -theory T, a $\tau $ -structure $\mathcal {M}$ is T-ec if and only if it is $T^\tau _\forall $ -ec. Hence a $\tau $ -structure $\mathcal {M}$ which is T-ec is also S-ec for any $\tau $ -theory S such that $S^\tau _\forall =T^\tau _\forall $ .
It is not hard to see that algebraically closed fields, in the signature $\sigma =\left \lbrace +,\cdot ,0,1 \right \rbrace $ , are S-ec structures for S the $\sigma $ -theory of commutative (semi-)rings with no zero-divisors. Consider the two $\sigma $ -theories U and R, respectively, of fields and of integral domains that are not fields. Then, any $\sigma $ -structure $\mathcal {M}$ which is an algebraically closed field is automatically S-ec, U-ec, and R-ec at the same time (since $R^\sigma _\forall =S^\sigma _\forall =U^\sigma _\forall $ ).
In general, given a $\tau $ -theory T, the class of T-ec models might not be elementary. In the specific case of algebraically closed fields this is the case, since this class of structures is axiomatized by the $\left \lbrace +,\cdot ,0,1 \right \rbrace $ -theory $\mathsf {ACF}$ given by axioms (1) and (4); but this is a rather peculiar case, which also depends on the specific choice of the signature. On the contrary, if we consider the $\left \lbrace \cdot ,e,{}^{-1} \right \rbrace $ -theory T of groups, then the T-ec $\left \lbrace \cdot ,e,{}^{-1} \right \rbrace $ -structures do not form an elementary classFootnote 16 [Reference Chen Chang and Keisler13, example 3.5.16]. Abraham Robinson introduced the notion of model companionship to describe exactly when the collection of T-ec $\tau $ -models form an elementary class:
Definition 4.4 Let $\tau $ be a signature.
-
• A $\tau $ -theory T is model complete Footnote 17 if for any $\tau $ -structure $\mathcal {M}$
$$\begin{align*}\mathcal{M}\models T \text{ if and only if }\mathcal{M}\text {is}\ T^\tau_\forall{-ec}. \end{align*}$$ -
• A $\tau $ -theory T is the model companion of a $\tau $ -theory S if:
-
(i) S and T are jointly consistent: i.e., $T^\tau _\forall =S^\tau _\forall $ , (or equivalently—by [Reference Viale41, Lemma 2.1.1] or [Reference Chen Chang and Keisler13, Remark 3.5.6(2)]—every model of T embeds in a model of S, and vice versa).
-
(ii) T is model complete.
-
It is useful to recall that model complete theories are axiomatized by their $\Pi _2$ -consequences (see [Reference Chen Chang and Keisler13, Proposition 3.5.10]), and that the model companion of a $\tau $ -theory T (if it exists) is unique (by [Reference Chen Chang and Keisler13, Proposition 3.5.13]).
Conforming to our previous notation of R being the $\left \lbrace +,\cdot ,0,1 \right \rbrace $ -theory of integral domains which are not fields, and U being the $\left \lbrace +,\cdot ,0,1 \right \rbrace $ -theory of fields, we can observe that no algebraically closed field is a model of R, and, conversely, no model of R is an algebraically closed field. As a matter of fact, the notion of model companionship for a $\tau $ -theory T does not necessarily isolate a proper subclass of the models of T, but rather the models that are closed-off with respect to the basic operations and relations definable in T by universal $\tau $ -sentences.
Given the connection between model companions and existentially closed structures, it should be no surprise that the notion of model companionship is dependent on the choice of the signature. For example, consider the language $\left \lbrace +,\cdot ,0,1,{}^{-1} \right \rbrace $ and the theory $S'$ which results from adding axiom (5) to axioms (1) and (2). We obtain that $S'$ is still the theory of commutative rings with no zero-divisors, now formalized in the signature $\left \lbrace +,\cdot ,0,1,{}^{-1} \right \rbrace $ . However, the $\left \lbrace +,\cdot ,0,1,{}^{-1} \right \rbrace $ -theory of algebraically closed fields $\mathsf {ACF}'$ given by axioms (1), (4), (5) is not the model companion of the $\left \lbrace +,\cdot ,0,1,{}^{-1} \right \rbrace $ -theory $S'$ . In order to see this, let $\mathcal {M}$ be the field of complex numbers and let $\mathcal {N}$ be $\mathbb {C}[X]$ (the ring of polynomials in complex coefficients and variable X), both seen as $\left \lbrace +,\cdot ,0,1,{}^{-1} \right \rbrace $ -structures. Then it is the case that $\mathcal {M}\sqsubseteq \mathcal {N}$ , but now $\mathcal {N}$ models the $\Sigma _1$ -sentence for $\left \lbrace +,\cdot ,0,1,{}^{-1} \right \rbrace $ $\exists y\,(y\neq 0\wedge y^{-1}=0)$ (as witnessed by the polynomial X), while $\mathcal {M}$ does not. In particular, $\mathcal {M}$ is not $S'$ -ec, as $\mathcal {M}$ is not a $\Sigma _1$ -substructure of $\mathcal {N}$ , which is a model of $S'$ .
Given that the existence of a(n absolute) model companion for a theory T depends so much on the choice of the signature, we can use the former to select the latter:
-
Model-theoretic criterion for selecting signatures: Given a mathematical theory T and two signatures $\tau $ and $\sigma $ in which T can be axiomatized, $\tau $ is preferable to $\sigma $ for T if the $\tau $ -axiomatization of T admits a(n absolute) model companion, while its $\sigma $ -axiomatization does not.
Clearly this criterion is in the mold of (AbsCrit). Let us see with a concrete example why criteria of this kind can cohere with others that are internal to a theory. Recall the example of integral domains: in Section 3 we argued on the basis of considerations internal to this theory that the signature $\left \lbrace \cdot ,+,0,1,- \right \rbrace $ is better suited for it than $\left \lbrace \cdot ,+,0,1,-,^{-1} \right \rbrace $ . Let us apply the above abstract criteria to select which among the two signatures is best suited for the theory of integral domains. Once again the answer is $\left \lbrace \cdot ,+,0,1,- \right \rbrace $ , in this case because in this signature the theory of integral domains has a natural and simple to compute model companion (which is the $\left \lbrace \cdot ,+,0,1,- \right \rbrace $ -theory of algebraically closed fields).Footnote 18 It is actually the case (at least for concrete examples) that we can prove that some theory T have a(n absolute) model companion in a signature $\tau $ only when we outline reasons internal to T of why the universal $\tau $ -theory of T is catching certain central and basic concepts of T.
A category theoretic perspective can also be useful to justify this model-theoretic criterion. We already remarked that for a mathematical theory T, its semantic extension might be given not only by its class of models, but also by its intended class of morphisms between these models, i.e., by a certain category $\mathcal {E}_T$ of models and morphisms (this is the standard approach with which ring theory is presented in a basic undergraduate course in algebra). By axiomatizing T in some signature $\tau $ we univocally describe the objects of $\mathcal {E}_T$ as the $\tau $ -models of T, but the arrows of $\mathcal {E}_T$ may not coincide with the $\tau $ -morphisms between $\tau $ -models of T (consider again the case of integral domains with the two signatures $\left \lbrace \cdot ,+,0,1,- \right \rbrace $ , $\left \lbrace \cdot ,+,0,1,-,^{-1} \right \rbrace $ ). How do we detect the most natural family of morphisms between models of a mathematical theory T (i.e., the arrows of $\mathcal {E}_T$ )? Can we even say that there is always a most natural family of morphisms between models of T (e.g., that $\mathcal {E}_T$ can be meaningfully and univocally defined for any mathematical theory T)? Our model theoretic criterion to select signatures is a useful tool to address these questions.
5 Partial Morleyizations, absolute model companionship, and the AMC-spectrum of a theory
We can now give a more formal treatment of the ideas presented in the previous sections, by introducing the central concepts of (partial) Morleyization and absolute model companionship.
5.1 Partial Morleyizations
Notation 5.1 Given a signature $\tau $ , let $\phi (x_0,\dots ,x_n)$ be a $\tau $ -formula.
We let:
-
• $R_\phi $ be a new $n+1$ -ary relation symbols,
-
• $f_\phi $ be a new n-ary function symbols,Footnote 19
-
• $c_\tau $ be a new constant symbol.
We also let
for $\phi (x_0,\dots ,x_n)$ having at least two free variables, and
for $\phi (x)$ having exactly one free variable.
Let $\mathsf {Form}_\tau $ denotes the set of $\tau $ -formulae. For $A\subseteq \mathsf {Form}_\tau \times 2$ :
-
• $\tau _A$ is the signature obtained by adding to $\tau $ all relation symbols $R_\phi $ , for $(\phi ,0)\in A$ , and all function symbols $f_\phi $ , for $(\phi ,1)\in A$ (together with the special symbol $c_\tau $ if at least one $(\phi ,1)$ is in A).
-
• $T_{\tau ,A}$ is the $\tau _A$ -theory having as axioms the sentences ${\mathsf {AX}}^i_\phi $ for $(\phi ,i)\in A$ .
We let $\tau ^*=\tau _C$ for $C=\mathsf {Form}_\tau \times \left \lbrace 0 \right \rbrace $ and $T^*$ be the $\tau ^*$ -theory $T_{\tau ,C}$ .
Roughly speaking the axioms ${\mathsf {AX}}^i_\phi $ are meant to formalize the idea that the (potentially complicated) concept formalized by $\phi $ according to $\tau $ becomes an elementary concept if we formalize instead it using $\tau _{\left \lbrace (\phi ,i) \right \rbrace }$ . More precisely, given a $\tau $ -theory S:
-
• ${\mathsf {AX}}^0_\phi $ transforms the concept of S formalized by the $\tau $ -formula $\phi $ (which could have a high Levy complexity according to $\tau $ ) in a concept formalized by an atomic $\tau \cup \left \lbrace R_\phi \right \rbrace $ -formula according to $S+{\mathsf {AX}}^0_\phi $ .
-
• For any $\tau $ -theory S, and for $C=\mathsf {Form}_\tau \times \left \lbrace 0 \right \rbrace $ , $S^*=S+T_{\tau ,C}$ is a $\tau ^*=\tau _C$ -theory admitting quantifier elimination (the Morleyization of T).Footnote 20
-
• ${\mathsf {AX}}^1_\phi $ performs a delicate Skolemization process. The standard Skolemization introduces for each formula $\phi $ a Skolem term $f_\phi $ together with an axiom stating $\forall \vec {x}\,[\exists y\,\phi (\vec {x},y)\leftrightarrow \phi (\vec {x},f_\phi (\vec {x}))]$ . However, this has serious drawbacks. Consider the following situation: we have a $\tau $ -structure $\mathcal {M}$ which is such that for all $b\in \mathcal {M} \exists y\,\phi (b,y)$ holds, and also such that for some $a\in \mathcal {M}$ , we have that $\phi (a,y)$ has more than one solution in $\mathcal {M}$ . In this case we can find distinct functions $h_0,h_1:\mathcal {M}\to \mathcal {M}$ such that $\forall x\,[\exists y\,\phi (x,y)\leftrightarrow \phi (x,f_\phi (x))]$ is satisfied in the expansions of $\mathcal {M}$ to a $\tau \cup \left \lbrace f_\phi \right \rbrace $ -structure which interprets $f_\phi $ by either one of the $h_i$ . Which one of the two expansions should we consider the natural Skolemization of $\mathcal {M}$ for the formula $\phi $ ? ${\mathsf {AX}}^1_\phi $ rules out this source of ambiguity by saying that when $\phi (a,y)$ has more than one solution in $\mathcal {M}$ , then $f_\phi ^{\mathcal {M}}$ does not select any of them and instead behaves trivially by assigning to a the value $c_\tau ^{\mathcal {M}}$ . In this way, given a formula $\psi (\vec {x},y)$ , once we choose the interpretation of $c_\tau $ , any $\tau $ -structure has a unique expansion to a $\tau \cup \left \lbrace f_\psi ,c_\tau \right \rbrace $ -structure which is a model of ${\mathsf {AX}}^1_\psi $ and has that interpretation of $c_\tau $ .
Notice that with respect to the signatures expanding $\left \lbrace \cdot ,+,0,1 \right \rbrace $ discussed in the previous section, axiom (5) is (equivalent to) ${\mathsf {AX}}^1_\phi $ for $\phi (x,y)$ being $x\cdot y=1$ (assuming $c_\tau $ is interpreted by $0$ ). Similarly axiom (6) is (equivalent to) ${\mathsf {AX}}^1_\psi $ for $\psi (x,y)$ being $x+ y=0$ . In what follows we want to analyze what happens when the Morleyization process is performed on arbitrary subsets of $\mathsf {Form}_\tau \times 2$ .
5.2 Absolute model companionship
The following properties will bring us to introduce the notion of absolute model companionship.
-
• A $\tau $ -structure $\mathcal {M}$ is T-ec if and only if it is $T^\tau _\forall $ -ec (by [Reference Viale43, fact 8]).
-
• If T is complete, a T-ec structure $\mathcal {M}$ realizes any $\Pi _2$ -sentence which is consistent with $T^\tau _\forall $ (by [Reference Viale43, fact 12]).
-
• If a $\tau $ -theory S is the model companion of a $\tau $ -theory T, S is axiomatized by its $\Pi _2$ -consequences for signature $\tau $ (by Robinson’s test, e.g., [Reference Viale43, lemma 14]).
These results are outlining “syntactic properties” of T-ec models which reflect the “semantic fact” that T-ec models are the “closed-off” structures for T; furthermore they outline that the axiomatization of the model companion of a complete theory can be detected by separately solving a consistency task for each $\Pi _2$ -sentence for $\tau $ . One might therefore wonder whether these syntactic properties can give another equivalent characterization of model companionship, i.e., if the model companion S of a $\tau $ -theory T (whenever it exists) can be axiomatized by the family of all $\Pi _2$ -sentences for $\tau $ which holds in some model of the universal and existential fragment of T. This is not always the case; it holds when T is a complete model companionable theory, however there is a standard counterexample: the $\left \lbrace +,\cdot ,0,1 \right \rbrace $ -theory $\mathsf {Fields}_0$ of fields of characteristic $0$ , given by axioms (1)–(3) and the infinitely many axioms granting that the characteristic of its models is $0$ (note that $\mathsf {Fields}_0$ is not a complete theory) has as its model companion $\mathsf {ACF}_0$ , which adds axiom (4) to $\mathsf {Fields}_0$ (note that $\mathsf {ACF}_0$ is a complete theory). Indeed, the sentence $\forall x \neg (x^2+1=0)$ is a $\Pi _2$ -sentence (actually $\Pi _1$ ) which is not in $\mathsf {ACF}_0$ but holds in $\mathbb {Q}$ , hence is consistent with the universal and existential fragment of $\mathsf {Fields}_0$ .
Absolute model companionship rules out counterexamples of this kind to the above tentative characterization of model companionship.
Notation 5.2 Given a $\tau $ -theory T, $T_{\forall \vee \exists }^\tau $ denotes the boolean combinations of universal $\tau $ -sentences which follow from $\tau $ .
Note that $T_{\forall \vee \exists }^\tau $ may convey more information than $T_{\forall }^\tau \cup T_{\exists }^\tau $ , as there could be $\psi \vee \phi $ (with $\psi $ universal and $\phi $ existential) which is in the former but does not follow from the latter.
Definition 5.3 [Reference Viale41, definition 2.2.4, lemma 2.3.4]
Given a $\tau $ -theory T, we say that a $\tau $ -sentence $\psi $ is strongly $T_{\forall \vee \exists }^\tau $ -consistent for $\tau $ whenever $\psi +R^\tau _{\forall \vee \exists }$ is consistent for any $\tau $ -theory R which extends T.
A $\tau $ -theory S is the absolute model companion (AMC) of a $\tau $ -theory T if it is the model companion of T and it is axiomatized by the $\Pi _2$ -sentences (for $\tau $ ) which are strongly $T_{\forall \vee \exists }^\tau $ -consistent for $\tau $ .
Equivalently (by [Reference Viale43, lemma 22]) T is the AMC of S if:
-
• $T^\tau _{\forall \vee \exists }=S^\tau _{\forall \vee \exists }$ ,
-
• T is model complete.
Note that a model complete $\tau $ -theory T is the model companion of $T^\tau _\forall $ and the AMC of $T^\tau _{\forall \vee \exists }$ .
A $\tau $ -theory S can have an AMC if, to some extent, it has already maximized the family of existential sentences which are consistent with its universal consequences. Let us elaborate more on this point. Given a $\tau $ -theory T, the family of strongly $T_{\forall \vee \exists }^\tau $ -consistent $\Pi _2$ -sentences for $\tau $ describes a fragment of the $\Pi _2$ -sentences which hold in a T-ec $\tau $ -structure (see [Reference Viale41, fact 2.2.5]). An AMC describes the set-up in which the choice of $\tau $ is such that the family of strongly $T_{\forall \vee \exists }^\tau $ -consistent $\Pi _2$ -sentences axiomatizes a model complete theory. This leads to another peculiar aspect separating model-companionship from absolute model companionship:
Lemma 5.4 [Reference Viale41, lemma 1.5]
Assume $S,S'$ are $\tau $ -theories such that $S'$ is the AMC of S. Then for all $T\supseteq S$ , $S'+T_{\forall }$ is the AMC of T.
On the contrary this property fails for model companionship: the universal fragment of the theory of $(\mathbb {Q},+,\cdot ,0,1)$ is inconsistent with the $\left \lbrace +,\cdot ,0,1 \right \rbrace $ -theory of algebraically closed fields. As we already mentioned, complete first order $\tau $ -theories are model companionable if and only if they admit an AMC. Indeed, for a complete theory S, the information conveyed by $S^\tau _\forall $ and by $S^\tau _{\forall \vee \exists }$ is the same. On the other hand we have already seen that there are non-complete $\tau $ -theories admitting a model companion but not an AMC, e.g., the $\left \lbrace +,\cdot ,0,1 \right \rbrace $ -theory of fields. Let us briefly analyze more in details what is the obstruction for the $\left \lbrace +,\cdot ,0,1 \right \rbrace $ -theory of fields of characteristic $0$ to have the $\left \lbrace +,\cdot ,0,1 \right \rbrace $ -theory of algebraically closed fields of characteristic $0$ as its AMC. The problem is that Diophantine equations can be expressed by atomic formulae of $\left \lbrace +,\cdot ,0,1 \right \rbrace $ and not all of them have rational solutions. On the other hand the existence of solutions for these equations does not contradict the field axioms. Note that the smallest existentially closed $\left \lbrace +,\cdot ,0,1 \right \rbrace $ -model $\mathcal {M}$ for the theory of fields is given by the algebraically closed field consisting exactly of the solutions of diophantine equations. Its standard construction builds it as a direct limit of Galois extensions of the rationals each adding new solutions to polynomial equations with coefficients in $\mathbb {N}$ but without rational solutions. By performing this construction we can preserve the field axioms and the characteristic $0$ (which holds in $\mathbb {Q}$ as well as in all the Galois extensions of $\mathbb {Q}$ under consideration), but at each stage we may invalidate some universal sentence that is true in $\mathbb {Q}$ (i.e., a universal sentence asserting that a certain diophantine equation does not have rational solutions). In particular, the closing-off process performed in signature $\sigma =\left \lbrace +,\cdot ,0,1 \right \rbrace $ which brings from the theory of fields T to that of algebraically closed fields cannot be described only by the $\Pi _2$ -sentences $\psi $ which are strongly $T_{\forall \vee \exists }^\tau $ -consistent for $\sigma $ . Indeed, there are other $\Pi _2$ -sentences which need to be realized in an algebraically closed field and which are not strongly $T_{\forall \vee \exists }^\tau $ -consistent for $\sigma $ (i.e., the existential statements asserting the existence of solutions for diophantine equations). In particular, the $\left \lbrace +,\cdot ,0,1 \right \rbrace $ -theory of algebraically closed fields is the AMC only of the theory of integral domains which extends the field of algebraic numbers (i.e., which have solutions to all polynomial equations expressible in the signature $\left \lbrace +,\cdot ,0,1 \right \rbrace $ ).
On the other hand we will argue that in the case of set theory, AMC faithfully describes the closing-off process of models of set theory with respect to basic set-theoretic operations.
5.3 The (A)MC-spectra of a first order theory
Model theory has been extremely successful in classifying the complexity of a mathematical theory according to its structural properties, and has produced a variety of properties and criteria to separate the (so-called) tame mathematical theories from the (so-called) untame ones (e.g., o-minimality, stability, simplicity, NIP), see [Reference Baldwin8] for a philosophical appraisal of this theme. Typically a mathematical theory is considered hard to classify (and thus untame) if it can code in itself first order arithmetic. In this respect the $\in $ -theory $\mathsf {ZFC}$ is untame.
We have already observed that most mathematical theories admit many different first order axiomatizations in (almost) as many distinct signatures. A common characteristic of tameness properties such as o-minimality, stability, simplicity, and NIP is that they are signature invariant. More precisely, if we take a $\tau $ -theory T and we consider its Morleyization $T^*$ in the signature $\tau ^*$ (see Notation 5.1), T is o-minimal (stable, simple, and NIP) if and only if so is $T^*$ . In contrast, this is not the case for Robinson’s notion of model companionship, as we have already observed for the theory of integral domains. We instantiate the model-theoretic criterion to select signatures formulated in the last part of Section 4 by means of the following:
Definition 5.5 Let T be a $\tau $ -theory.
-
• Its AMC-spectrum ( $\mathfrak {spec}_{\mathsf {AMC}}\left (T,\tau \right )$ ) is given by the sets $A\subseteq \mathsf {Form}_\tau \times 2$ such that $T+T_{\tau ,A}$ has an AMC (which we denote by $\mathsf {AMC}(T,A)$ ).
-
• Its MC-spectrum ( $\mathfrak {spec}_{\mathsf {MC}}(T,\tau )$ ) is given by the sets $A\subseteq \mathsf {Form}_\tau \times 2$ such that $T+T_{\tau ,A}$ has a model companion (which we denote by $\mathsf {MC}(T,A)$ ).
Clearly $\mathfrak {spec}_{\mathsf {MC}}(T,\tau )$ is a superset of $\mathfrak {spec}_{\mathsf {AMC}}\left (T,\tau \right )$ and the two can be distinct. Observe that $C=\mathsf {Form}_\tau \times \left \lbrace 0 \right \rbrace $ is always in the AMC-spectrum of a theory T, as $T+T_{\tau ,C}$ admits quantifier elimination and therefore it is model complete and its own AMC in signature $\tau _C=\tau ^*$ . Moreover, $\emptyset $ is in the (A)MC-spectrum of T if and only if T has an (absolute) model companion. For $\sigma =\left \lbrace +,\cdot ,0,1 \right \rbrace $ and the various $\sigma $ -theories considered in Section 3 we have that:
-
• $\emptyset $ is in $\mathfrak {spec}_{\mathsf {MC}}(T,\sigma )\setminus \mathfrak {spec}_{\mathsf {AMC}}(T,\sigma )$ for T the $\sigma $ -theory of commutative (semi)rings with no $0$ -divisors or the theory of fields.
-
• For $\phi $ the $\sigma $ -formula defining the additive inverse, $\left \lbrace (\phi ,1) \right \rbrace $ is in $\mathfrak {spec}_{\mathsf {MC}}(T,\sigma )$ if T is the theory of commutative rings with no $0$ -divisors or the theory of fields, while $\left \lbrace (\phi ,1) \right \rbrace $ is not in $\mathfrak {spec}_{\mathsf {MC}}(S,\sigma )$ if S is the $\sigma $ -theory of commutative semirings with no $0$ -divisors.
-
• For $\psi $ the $\sigma $ -formula defining the multiplicative inverse, $\left \lbrace (\psi ,1) \right \rbrace $ is in $\mathfrak {spec}_{\mathsf {MC}}(T,\sigma )$ if T is the theory of fields, while $\left \lbrace (\psi ,1) \right \rbrace $ is not in $\mathfrak {spec}_{\mathsf {MC}}(S,\sigma )$ if S is the $\sigma $ -theory of commutative (semi)rings with no $0$ -divisors.
-
• It can be shown that $\mathfrak {spec}_{\mathsf {AMC}}(T,\sigma )=\emptyset $ for T the $\sigma $ -theory of commutative (semi)rings with no $0$ -divisors or the theory of fields.
A category theoretic perspective on the notion of (A)MC spectrum can again be enlightening: given a $\tau $ -theory T, consider the category $\mathcal {C}_T$ given by the elementary class of its $\tau $ -models as objects and the $\tau $ -morphisms (e.g., maps which preserve the atomic $\tau $ -formulae) between them as arrows. By taking a subset A of $\mathsf {Form}_\tau \times 2$ we can pass to the category $\mathcal {C}_{T+T_{\tau ,A}}$ whose objects are $\tau _A$ -models of $T+T_{\tau ,A}$ and whose arrows are the $\tau _A$ -morphisms (in accordance with Notation 5.1). There is a natural identification of the objects of $\mathcal {C}_T$ and those of $\mathcal {C}_{T+T_{\tau ,A}}$ , but the arrows of $\mathcal {C}_{T+T_{\tau ,A}}$ are now a possibly much narrower subfamily of the arrows of $\mathcal {C}_{T}$ . Important structural information on a $\tau $ -theory T is given (at least from a categorial point of view) by classifying which sets A produce an elementary class of $\tau _A$ -models for which the $T+T_{\tau ,A}$ -ec models constitute themselves an elementary class. The model companionship spectrum of T gives exactly this structural information on the arrows of $\mathcal {C}_T$ . Similar considerations apply for the AMC-spectrum of a theory. On the other hand simplicity, stability, and NIP provide fundamental structural informations on the class of objects of $\mathcal {C}_T$ , but it is not transparent whether they also convey information on its class of arrows.
6 The (A)MC-spectra of set theory
From now on we focus our attention on set theory. Our main goal is to study the model companions of set theory and the relevant signatures for which they exist. We will not only provide a new set of arguments in favour of $2^{\aleph _0}=\aleph _2$ , but will also present a new method to detect set theoretic validities. The realist-minded readers can consider the following part of the paper as an attempt to produce a complete (first order) axiomatization of set theory. Although a realist stance towards mathematics is not needed to understand and prove the results we are going to present, it is within a realist agenda that we can better appreciate their meaning. We can make this perspective more explicit by outlining a principle and a goal that will motivate the present model-theoretic approach.
-
Semantic realism: The universe of all sets $(V,\in )$ is sufficiently well-defined so that its $\in $ -theory is a well-defined (complete and non-recursive) first order theory. Therefore, any set theoretical statement axiomatizable in first order logic receives, in $(V,\in )$ , a well-defined truth value.
-
Hilbertian completeness: $(V,\in )$ should offer a complete picture of what exists in set theory. That is to say, $(V,\in )$ should contain any set whose existence is not in direct contradiction with already recognized truths of set theory (e.g., $\mathsf {ZFC}$ ).Footnote 21
In the context of a realist conception of sets, a consistent formalization of Hilbertian completeness would come along with a notion of maximality for set theory. Notice that the quest for maximality is what motivates many set-theoretical investigations; for this reason we believe it is appropriate to analyze the role that model companionship can play in its investigation; however, we defer this task to the the last section of this paper. Meanwhile, we can propose to implement Hilbertian completeness by means of a more concrete (but still partially informal) principle clearly inspired by it.
-
(IEC) Informal Existential Completeness If $P(x,Y)$ is a “simple” property formalizable in the $\in $ -signature in parameter some set Y of V and $\exists x\,P(x,Y)$ is consistent with the basic principles of set theory, then in V a witness of this existential statement should exist.
Our task now consists in spelling out a precise mathematical definition of which set theoretic properties are to be regarded as “simple” and which sets Y of V we can accept as parameters for the property P (also attention must be paid to singling out what we exactly mean by “consistent with the basic principles of set theory” to avoid trivial counterexamples to (IEC)). Towards this end we can take advantage of our previous discussion about signatures and AMC.
-
(a) We determine that a property of sets P is “simple” by analyzing the peculiarity of set theory as a mathematical theory. After choosing $A\subseteq \mathsf {Form}_{\left \lbrace \in \right \rbrace }\times 2$ and performing a partial Morleyization with respect to A, we can check whether $P(x,y)$ can be formalized by a quantifier-free formula in signature $\left \lbrace \in \right \rbrace _A$ . If this is the case we consider $P(x,y)$ simple (with respect to A). We clearly need criteria for the choice of A. Indeed, if $A=\mathsf {Form}_{\left \lbrace \in \right \rbrace }\times 2$ , then any $\in $ -formalizable set theoretic property becomes simple, since it is expressible by an atomic formula of $\left \lbrace \in \right \rbrace _A$ . In order to avoid trivialities, we use set theoretic practice and forcing as a guiding tool to select the appropriate sets A and, consequently, the simple set theoretic properties. We regard a property $P(\vec {x})$ as simple if one or more of the following conditions are met:
-
(i) Its $\in $ -formalization can be expressed by a $\Delta _0$ -formula.
-
(ii) Its meaning is invariant across forcing extensions: i.e., $P(\vec {a})$ holds in V if and only if it holds in any generic extension of V.Footnote 22
-
(iii) The basic truths about $P(\vec {x})$ are forcing invariant, which roughly amounts to require that, whenever $\psi (\vec {x})$ is an $\in $ -formula formalizing $P(\vec {x})$ and $\phi (\vec {y})$ is a boolean combination of $\Delta _0$ -formulae and $\psi $ , then the truth value of $\forall \vec {y}\,\phi (\vec {y})$ cannot be changed by forcing.Footnote 23
Notice that the second and the third conditions are implied by the first. There are properties $P(x)$ which are not expressible using $\Delta _0$ -formulae but for which nonetheless the second or third (or both) conditions apply; for example, the second condition applies to the provably $\Delta _1$ -properties for $\mathsf {ZFC}^-$ and (assuming large cardinals) to the property “x belongs to a certain universally Baire set B”, while the third holds (again assuming large cardinals) for $P(x)$ being “x is a stationary subset of ” (see Theorem 7.4 and the comments following it).We can give separate motivations for why each of the above condition is a simplicity criterion: (i) is motivated by set theoretic practice (we expand on this in Section 6.1). (ii) and (iii) are simplicity criteria as they state that we cannot use forcing to change the basic truths regarding properties satisfying them. We consider forcing invariance a legitimate simplicity check, since forcing is the most efficient method to produce independence results and (by the results we aim to expose in this paper) also the only method at all, at least for a large class of statements; we expand more on this in Section 6.3. Furthermore, notice that assuming large cardinals many set theoretic concepts become forcing invariant (this is the content of generic absoluteness results). In this respect if one is eager to accept large cardinal axioms as correct set theoretical principles (as we do), it is then natural to regard as “simple” all set theoretic concepts and properties which are made forcing invariant by large cardinals.
-
-
(b) Rather than focusing on the particularly delicate task of establishing which sets Y do exist in V and which do not (in order to argue whether $\exists x P(x,Y)$ can be consistent or valid), we just investigate if the $\Pi _2$ -sentence $\forall x\exists y\,P(x,y)$ is consistent with the basic principles of set theory. This task can be realised by checking whether the sentence $\forall x\exists y\,P(x,y)$ belongs to $\mathsf {AMC}(T,A)$ , for T some $\in $ -theory extending $\mathsf {ZFC}+$ large cardinals, and $A\subseteq \mathsf {Form}_{\left \lbrace \in \right \rbrace }\times 2$ such that $P(x,y)$ is formalizable by a quantifier-free formula in signature $\left \lbrace \in \right \rbrace _A$ . Moreover (assuming (IEC)), if $P(x,y)$ satisfies any of the above conditions for simplicity listed in (a), we can regard a $\Pi _2$ -sentence $\forall x\exists y P(x,y)$ which is consistent with the basic principles of set theory as a sentence expressing a true property of V. Indeed, by its being $\Pi _2$ , such a sentence would assert that the (suitable fragment of the) universe of sets in which it holds is closed-off with respect to the simple operations and relations described by P.
Now that we have provided insights on which mathematical criteria we use to detect “simplicity” and that we have clarified what we mean by “consistent with the basic principles of set theory,” we can give a first general outline of the main results on the model companions of set theory.
-
(A) The (A)MC-spectrum of set theory can be used to characterize theories of the various $H_{\kappa ^+}$ as $\kappa $ ranges over the infinite cardinals. More precisely, for any signature $\tau $ that extends $\left \lbrace \in \right \rbrace $ and includes all $\Delta _0$ -formulae among its atomic formulae, if the $\tau $ -theory of sets admits a model companion, this model companion is a model of Extensionality, Foundation, Union, it is closed under rudimentary functions or Gödel operations, and satisfies the replacement axiom for a very large family of $\Sigma _1$ -formulae for $\tau $ . Furthermore, for any extensions T of $\mathsf {ZFC}$ and any infinite cardinal $\kappa $ definable in T we can (uniformly in $\kappa $ ) cook up at least one signature $\tau _\kappa $ (extending $\left \lbrace \in \right \rbrace $ and including all $\Delta _0$ -formulae among its atomic formulae) such that the $\tau _\kappa $ -theory of $H_{\kappa ^+}^{\mathcal {M}}$ common to all $\mathcal {M}$ which model T is the AMC of T. Moreover these $\tau _\lambda $ -theories of $H_{\lambda ^+}$ cohere; more precisely for $\lambda <\kappa $ both definable in T we can further assume that in any model $\mathcal {M}$ of T the $\tau _\lambda $ -theory of $H_{\lambda ^+}^{\mathcal {M}}$ is correctly computed in the structure $(H_{\kappa ^+}^{\mathcal {M}},\tau _\kappa ^{\mathcal {M}})$ , being $(H_{\lambda ^+}^{\mathcal {M}},\tau _\lambda ^{\mathcal {M}})$ a definable structure in $(H_{\kappa ^+}^{\mathcal {M}},\tau _\kappa ^{\mathcal {M}})$ (see Theorem 7.3 and Notation 6.3 for details).
-
(B) There is (more than one, and even recursive ones) $B\subseteq \mathsf {Form}_{\left \lbrace \in \right \rbrace }\times 2$ such that:
-
• $T_{\left \lbrace \in \right \rbrace ,B}$ makes any $\Delta _0$ -formula logically equivalent to some $\left \lbrace \in \right \rbrace _B$ -atomic formula, and $\in _B$ has a constant symbol which is interpreted as in any $\left \lbrace \in \right \rbrace _B$ -model $\mathcal {M}$ of $\mathsf {ZFC}+T_{\left \lbrace \in \right \rbrace ,B}$ .
-
• All quantifier-free formulae of $\left \lbrace \in \right \rbrace _B$ express simple properties according to simplicity criterion (a) (assuming large cardinals). Furthermore the signature $\in _B$ introduces relations, constants, and function symbols which denote natural and central concepts in the set-theoretical practice of the last 40 years.Footnote 24
-
• Whenever S is any extension of the $\left \lbrace \in \right \rbrace $ -theory $\mathsf {ZFC}+$ there are class many supercompact cardinals,Footnote 25 B in $\mathfrak {spec}_{\mathsf {AMC}}\left (S,\left \lbrace \in \right \rbrace \right )$ .
-
• Whenever S is as in the previous item, the corresponding theory $\mathsf {AMC}(S,B)$ describes a theory of , and is axiomatized by the $\Pi _2$ -sentences (in the signature $\left \lbrace \in \right \rbrace _B$ ) which hold in the of a model of strong forcing axioms.Footnote 26 Among these $\Pi _2$ -sentences there are $\neg {\mathsf {CH}}$ and a definable version $\theta $ of $2^{\aleph _0}=\aleph _2$ .
-
-
(C) ForFootnote 27 any theory R extending $\mathsf {ZFC}$ such that $R+\neg {\mathsf {CH}}$ is consistent, ${\mathsf {CH}}$ cannot be in $\mathsf {AMC}(R,A)$ for any A in $\mathfrak {spec}_{\mathsf {AMC}}\left (R,\left \lbrace \in \right \rbrace \right )$ such that the $\Delta _0$ -formulae are ( $T_{\left \lbrace \in \right \rbrace ,A}$ -equivalent to) atomic $\left \lbrace \in \right \rbrace _A$ -formulae.
-
(D) For anyFootnote 28 $S,B$ as in (B), any $\Pi _2$ -sentence $\psi $ of $\left \lbrace \in \right \rbrace _B$ is in $\mathsf {AMC}(S,B)$ if and only if S proves that is forcible.
-
(E) The model completeness results for the theory of $H_{\aleph _2}$ of (B) mesh perfectly with certain forms of strong forcing axioms; for example one can prove that if one is in a model $(V,\in )$ of S where the $\left \lbrace \in \right \rbrace _B$ -theory of $H_{\aleph _2}^V$ is model complete (hence the AMC of the $\in _B$ -theory of V), then a strong form of bounded Martin’s maximum holds.
The resulting picture of V that emerges from (A) is that of a cumulative hierarchy whose initial segments (the $H_{\kappa ^+}$ ’s) are closed off with respect to more and more complex set-theoretic operations and produce a coherent step-by-step approximations of the theory of V given by the various theories of $H_{\lambda ^+}$ as $\lambda $ ranges over the infinite cardinals.
In this way, set theory is treated algebraically like field theory: once we choose the cardinal $\kappa $ and a corresponding signature $\tau _\kappa $ , the initial segment $H_{\kappa ^+}$ of the universe V has a theory which behaves, with respect to $\mathsf {ZFC}_{\tau _\kappa }$ (e.g., $\mathsf {ZFC}$ enriched with the replacement axiom for $\tau _\kappa $ -formulae, see 6.1 for details), as algebraically closed fields behave with respect to field theory as axiomatized in $\left \lbrace \cdot ,+,0,1 \right \rbrace $ .
Moreover, by (B) above, this analogy between set theory and algebra is exemplified for the cardinal $\aleph _1$ by a signature $\in _B$ which renders the theory of $H_{\aleph _2}$ the result of a closing-off process with respect to simple set-theoretical concepts (according to our simplicity criteria (a)), and to the standard set theoretic practice of the last 40 years.Footnote 29 Consequently, the algebraic completeness displayed by $H_{\aleph _2}$ , as the model companion of set theory with respect to $\in _B$ , satisfies the conditions described in (IEC) and substantiates the Hilbertian completeness encoded by it. Furthermore, (C) shows that the study of this model companion of set theory gives trustworthy information on the value of the continuum, while (D) suggests an explanation of the incredible success of the method of forcing (and of forcing axioms) in providing a consistent solution to problems formalizable at the level of $H_{\aleph _2}$ as $\Pi _2$ -sentences for $\in _B$ . Finally condition (E) shows that the present model-theoretic approach and the use of forcing axioms are two sides of the same coin. This, in turn, can be seen as a further justification for extending the axioms of set theory with forcing axioms and, consequently, a justification of Gödel’s program at least at the level of (a large fragment of) third order arithmetic.
6.1 What is the right signature for set theory?
The standard axioms of $\mathsf {ZFC}$ in the $\in $ -signature are clearly sufficient to provide a first order axiomatization of set theory. However a closer inspection reveals that many simple set-theoretic concepts are not formalized by simple $\in $ -formulae.
Consider, for example, the notion of ordered pair. While we informally write $x=\langle y,z \rangle $ to mean that x is the ordered pair with first component y and second component z, in set theoretic terms this statement hides a non-trivial coding of the concept of ordered pair (for example) by means of Kuratowski’s definition: $x=\left \lbrace \left \lbrace y \right \rbrace ,\left \lbrace y,z \right \rbrace \right \rbrace $ . A proper definition of the concept of ordered pair in the $\in $ -signature can then be given by the following $\in $ -formula:
It is clear that the meaning of this $\in $ -formula is hardly recognizable with a rapid glance (unlike $x=\langle y,z \rangle $ ). Moreover, from a purely logical perspective, its Lévy complexity is already $\Sigma _2$ . This clashes with our understanding that the concept of ordered pair is simple. Indeed, we do not regard the notion of ordered pair as a complex concept, contrary to other more complicated and theoretically loaded ones like that of uncountability, or many of the properties of the continuum (such as its correct place in the hierarchy of uncountable cardinals). In a similar vein other very basic notions such as being a function, a binary relation, or the domain or the range of a function are formalized by rather complicated $\in $ -formulae, both from the point of view of their readability and of their Lévy complexity.
The standard solution adopted in set theory textbooksFootnote 30 is to regard as basic all those $\in $ -formulae in which the quantifiers are bounded to range over the elements of some set, that is the $\Delta _0$ -formulae. In order to make these observations precise we need to be extremely cautious on our notational conventions.
Notation 6.1 For any $A\subseteq \mathsf {Form}_{\left \lbrace \in \right \rbrace }\times 2$ we write $\in _A$ rather than $\left \lbrace \in \right \rbrace _A$ , and we let $T_{\in ,A}$ be the $\in _A$ -theory
where the theory $T_{\left \lbrace \in \right \rbrace ,A}$ (according to Notation 5.1 for $\left \lbrace \in \right \rbrace $ and A) is reinforced by an axiom asserting that the interpretation of the constant symbol $c_{\left \lbrace \in \right \rbrace }$ is the empty set. We use the abbreviations $\in _{\Delta _0}$ and $T_{\Delta _0}$ to denote what, according to the above conventions, should rather be slight extensions ofFootnote 31 $\in _{\Delta _0\times \left \lbrace 0 \right \rbrace }$ and $T_{\in ,\Delta _0\times \left \lbrace 0 \right \rbrace }$ .
For any $\tau \supseteq \in _{\Delta _0}$ we also let $\mathsf {ZFC}_\tau $ denote $\mathsf {ZFC}+T_{\Delta _0}$ enriched with the replacement axiomFootnote 32 for all $\tau $ -formulae, and $\mathsf {ZFC}^-_\tau $ denote $\mathsf {ZFC}_\tau $ without the powerset axiom. $\mathsf {ZFC}_{\Delta _0}$ denotes $\mathsf {ZFC}_\tau $ for $\tau $ being $\in _{\Delta _0}$ , and similarly we define $\mathsf {ZFC}^-_{\Delta _0}$ .
The reason why set-theoretic practice regards the concepts expressed by quantifier-free formulae of $\in _{\Delta _0}$ as “simple” is the fact that the truth value of these formulae is invariant among transitive models of large enough fragments of $\mathsf {ZF}$ (e.g., [Reference Kunen25, corollary IV.3.6]), and thus also forcing invariant (e.g., [Reference Jech22, lemma 14.21]). Furthermore $\in _{\Delta _0}$ is a signature which allows one to formalize many fundamental set theoretic concepts using formulae whose Lévy complexity is in accordance to our intuitive understanding (e.g., [Reference Jech22, chap. 13, lemma 13.10]).
For example consider the following notions and their logical complexity:
-
• $(x$ is a cardinal $)$ is the $\Pi _1$ -formula (for $\in _{\Delta _0}$ )
-
• $(x$ is $\aleph _1)$ is the boolean combination of $\Sigma _1$ -formulae
-
• ${\mathsf {CH}}$ is the $\Sigma _2$ -sentence
-
• $(x\ {is}\ \aleph _2)$ is the $\Sigma _2$ -formula
$$ \begin{align*} \hspace{255pt}(x\text{ is a cardinal})\wedge \end{align*} $$$\hspace{-30pt}\wedge\exists F\exists y\,\left[ (y\ {is}\ \aleph_1)\wedge (y\in x)\wedge(F:y\times x\to x) \wedge \forall\alpha\in x\,(F\restriction y\times\left\lbrace \alpha \right\rbrace\text{ is a surjection on }\alpha) \right]. $ -
• $2^{\aleph _0}>\aleph _2$ is the boolean combination of $\Pi _2$ -sentences
-
• $2^{\aleph _0}\leq \aleph _2$ is the $\Sigma _2$ -sentence
Let us also introduce the notation we will use to handle the substructure relation over expanded signatures. The following conventions supplement Notation 5.1.
Notation 6.2 Given $\tau $ -structures $\mathcal {M}$ , $\mathcal {N}$ , $\mathcal {M}\sqsubseteq \mathcal {N}$ denotes the substructure relation, $\mathcal {M}\prec _n \mathcal {N}$ denotes the $\Sigma _n$ -substructure relation, and $\mathcal {M}\prec \mathcal {N}$ denotes the elementary substructure relation.
Notation 6.3 Let $\tau \supseteq \in _{\Delta _0}\cup \left \lbrace \kappa \right \rbrace $ be a signature with $\kappa $ a constant symbol, $(M,\tau ^M)$ a $\tau $ -structure, and B a subset of $\mathsf {Form}_{\tau }\times 2$ . Then, $(M,\tau _B^M)$ is the unique extension of $(M,\tau )$ defined in accordance with Notations 5.1 which satisfies $T_{\tau ,B}$ . In particular, $(M,\tau _B^M)$ is a shorthand for $(M,S^M:S\in \tau _B)$ . If $(N,\tau ^N)$ is a substructure of $(M,\tau ^M)$ we also write $(N,\tau _B^M)$ as a shorthand for $(N,S^M\restriction N:S\in \tau _B)$ .
6.2 Lévy absoluteness and the possible AMCs of set theory
Recall that for an infinite cardinal $\lambda $ , $H_{\lambda }$ is the initial transitive fragment of V given by those sets whose transitive closure has size less than $\lambda $ and is by itself a set in V (see [Reference Kunen25, sec. IV.6]). We have the following results.
-
Stratification of V: The universe of sets V can be stratified as the union, along the class of infinite cardinals $\kappa $ , of the sets $H_{\kappa ^+}$ .
-
Standard theory of $H_{\lambda }$ : [Reference Kunen25, theorem IV.6.5] For any uncountable cardinal $\lambda $ , $(H_{\lambda },\in _{\Delta _0}^V)$ is a model of $\mathsf {ZFC}^-_{\Delta _0}$ (recall Notation 6.1).
-
Strong Lévy absoluteness: [Reference Venturi and Viale38, lemma 5.3] For all $A_i\subseteq \mathcal {P}\left (\kappa \right )^{n_i}$ for $i=1,\dots ,k$ ,
$$\begin{align*}(H_{\kappa^+}^V,\in_{\Delta_0}^V,A_1,\dots,A_k)\prec_1(V,\in_{\Delta_0}^V,A_1,\dots,A_k). \end{align*}$$ -
Second order characterization of $H_{\kappa ^+}$ : Whenever $\mathcal {M}$ is an $\in _{\Delta _0}\cup \left \lbrace \kappa \right \rbrace $ -model of $\mathsf {ZFC}_{\Delta _0}+\kappa $ is an infinite cardinal $(H_{\kappa ^+}^{\mathcal {M}},\in _{\Delta _0}^{\mathcal {M}},\kappa ^{\mathcal {M}})$ is a model of the $\Pi _2$ -sentence for $\in _{\Delta _0}\cup \left \lbrace \kappa \right \rbrace $
(7) $$ \begin{align} \forall x\exists f\,(f:\kappa\to x\text{ is a surjection}). \end{align} $$Furthermore $H_{\kappa ^+}^V$ can be described as the unique set $M\in V$ such that:
-
• $\kappa \in M$ and M is transitive;
-
• $(M,\in _{\Delta _0}^V)$ satisfies (7) and $\mathsf {ZFC}^-_{\Delta _0}$ ;
-
• M has the (second order property) that for all $a\in M$ and $b\subseteq a$ , $b\in M$ as well.
-
In particular, the theory of each $H_{\kappa ^+}$ offers a $\Sigma _1$ -approximation of the theory of V with respect to a signatures $\tau _\kappa $ which extends $\in _{\Delta _0}\cup \left \lbrace \kappa \right \rbrace $ and contains predicates for subsets of $\mathcal {P}\left (\kappa \right )^n$ for some $n\in \mathbb {N}$ . Consequently, if we want to study the $\Sigma _1$ -properties of the reals (second order arithmetic), or the powerset of the reals (third order arithmetic) it is sufficient to study them within the theory of some $H_{\kappa ^+}$ for $\kappa $ a large enough regular cardinal (and for most purposes $H_{\aleph _1}$ suffices for second order arithmetic, and $H_{\aleph _2}$ for third-order arithmetic). Moreover, by Lévy absoluteness, $H_{\kappa ^+}$ and V agree on the computation of many simple set theoretic operations and relations: for example the operations and relations formalizable by means of $\Delta _0$ -formulae and $\Delta _0$ -definable Skolem functions.
The above results entail that whenever T is a $\tau $ -theory that extends $\mathsf {ZFC}_{\Delta _0}+\kappa $ is a cardinal, $\tau $ is a signature that contains $\in _{\Delta _0}\cup \left \lbrace \kappa \right \rbrace $ and is contained in
and $(V,\tau ^V)$ models T, then
Hence $(H_{\kappa ^+},\tau ^V)$ witnesses that (7) is consistent with the universal fragment of the $\tau $ -theory of V. In particular, if T has an AMC for $\tau $ , say S, then (7) must be in S, since it is a $\Pi _2$ -sentence for $\tau $ which is strongly $T^\tau _{\forall \vee \exists }$ -consistent.
We can now collect all these observations and conclude the following.
-
(1) The structure of V is uniquely determined by the structure of the various $H_{\kappa ^+}$ as $\kappa $ ranges among the infinite cardinals.
-
(2) Given an infinite cardinal $\kappa $ , consider a signature $\tau $ extending $\in _{\Delta _0}$ only with predicates for subsets of $\mathcal {P}\left (\kappa \right )$ and a constant symbol for $\kappa $ . Then, if some ${T\supseteq \mathsf {ZFC}_\tau }$ has an AMC, this AMC looks like a theory of $H_{\kappa ^+}$ and is axiomatized by the $\Pi _2$ -sentences for $\tau $ which are strongly $T^\tau _{\forall \vee \exists }$ -consistent.
Now assume T is some $\in $ -theory $T\supseteq \mathsf {ZFC}+$ large cardinals and $A\subseteq \mathsf {Form}_\in \times 2$ is in $\mathfrak {spec}_{\mathsf {AMC}}\left (T,\left \lbrace \in \right \rbrace \right )$ . Assume further that:
-
• For some constant symbol $\kappa $ of $\in _A$
$$\begin{align*}\mathsf{ZFC}^-+\forall X\exists f\,(f:\kappa\to X\text{ is a surjection}) \end{align*}$$is in $\mathsf {AMC}(T,A)$ . -
• Every atomic formula of $\in _A$ satisfies at least one of the simplicity criteria set forth in (a).
Then (on the basis of (IEC)) we should conclude that $\mathsf {AMC}(T,A)$ describes the correct theory of $H_{\kappa ^+}$ , since criterion (B) is satisfied by all axioms of $\mathsf {AMC}(T,A)$ .
6.3 Existentially closed fragments of the set theoretic universe versus AMC
We conclude this section by giving a non-exhaustive list of results showing that, for certain infinite cardinals $\kappa $ , the corresponding $H_{\kappa ^+}$ already witnesses some important existential closure properties (e.g., Lévy absoluteness, Shoenfield’s absoluteness, $\mathsf {BMM}$ , and $\mathsf {BMM}^{++}$ ) and its theory has some traits which are peculiar of model complete theories (e.g., Woodin’s absoluteness, ${\mathsf {MM}}^{+++}$ ). The reader does not need to be familiar with these results, as they only serve as motivation to bring our focus on existentially closed models for set theory.Footnote 34
-
Lévy absoluteness: [Reference Venturi and Viale38, lemma 5.3] Whenever $\lambda $ is a regular uncountable cardinal,
$$\begin{align*}(H_\lambda,\in_{\Delta_0}^V)\prec_1 (V,\in_{\Delta_0}^V). \end{align*}$$ -
Shoenfield’s absoluteness: (see [Reference Viale40, lemma 1.2] for the apparently weaker formulation we give here.) Whenever G is V-generic for some forcing notion in V,
-
Woodin’s absoluteness: (see [Reference Viale40, lemma 3.2] for the weak form of Woodin’s result we give here.) Whenever G is V-generic for some forcing notion in V (and there are class many Woodin cardinals in V),
-
Bounded Martin’s Maximum: ( $\mathsf {BMM}$ ) [Reference Bagaria6] Whenever G is V-generic for some stationary set preserving forcing notion in V,
-
$\mathsf {BMM}^{++}$ : [Reference Woodin47, definition 10.91] Whenever G is V-generic for some stationary set preserving forcing notion in V,
-
Bounded category forcing axioms, ${\mathsf {MM}}^{+++}$ , [Reference Asperó and Viale4, Reference Audrito and Viale5, Reference Viale39] Whenever V and $V[G]$ are models of ${\mathsf {MM}}^{+++}$ ( , $\mathsf {BCFA}({\mathsf {SSP}})$ ) and G is V-generic for some stationary set preserving forcing notion in V,
-
Woodin’s axiom: $(*)$ (see [Reference Larson26, definition 7.5], or [Reference Asperó and Schindler3] for the formulation given here.) Whenever there are class many Woodin cardinals, is a precipitous ideal, and G is V-generic for some stationary set preserving forcing notion in V
7 Main results
We can now present results culminating the discussion of the previous sections. The proofs can be found in [Reference Viale41].
Notation 7.1 For $A\subseteq \mathsf {Form}_\in \times 2$ , $\mathsf {ZFC}_A$ denotes the $\in _A$ -theory $\mathsf {ZFC}_{\in _A}$ defined in Notation 6.1. Similarly we define $\mathsf {ZFC}^-_A$ .
Definition 7.2 Let $T\supseteq \mathsf {ZFC}^-$ be an $\in $ -theory. $\kappa $ is a T-definable cardinal if for some $\in $ -formula $\phi _\kappa (x)$ , T proves:
-
• $\exists !x\,\phi _\kappa (x)$ and
$$\begin{align*}\forall x\,[\phi_\kappa(x)\rightarrow (x{\ is\ a\ cardinal})]. \end{align*}$$ -
• $\kappa $ is the constant $f_{\phi _\kappa }$ existing in the signature $\in _{\left \lbrace (\phi _\kappa ,1) \right \rbrace }$ .
7.1 The AMCs of set theory are theories of $H_{\kappa ^+}$
The first result shows that the AMC spectrum of set theory isolates a rich set of theories which produce models of $\mathsf {ZFC}^-$ , that is, structures which behave like an $H_{\lambda }$ for some regular uncountable cardinal $\lambda $ .
Theorem 7.3 Let R be an $\in $ -theory extending $\mathsf {ZFC}$ .
-
(i) Assume $A\in \mathfrak {spec}_{\mathsf {MC}}\left (R,\in \right )$ and $\in _A\supseteq \in _{\Delta _0}$ . Then any model of $\mathsf {MC}(R,A)$ models extensionality, foundation, choice, is closed under Gödel operations, and satisfies the replacement axiom for any $\Sigma _1$ -formula $\psi (x,y)$ of $\in _{\Delta _0}$ such that $R+T_{\in ,\mathsf {A}}$ models $\forall x\exists y\psi (x,y)$ .
-
(ii) Assume $A\in \mathfrak {spec}_{\mathsf {MC}}\left (R,\in \right )$ , $\in _A\supseteq \in _{\Delta _0}$ , and $\lambda $ is an R-definable cardinal represented by a constant symbol of $\in _A$ and such thatFootnote 35
$$\begin{align*}(H_{\lambda^+}^{\mathcal{M}},\in_A^{\mathcal{M}})\prec_1\mathcal{M}, \end{align*}$$whenever $\mathcal {M}$ models $R+T_{\in ,A}$ . Then $\forall x\exists f\,(f:\lambda \to x\text { is a surjection})$ is in $\mathsf {MC}(R,A)$ . -
(iii) Assume $\lambda $ is an R-definable cardinal.Footnote 36 Then there exists $A_\lambda \in \mathfrak {spec}_{\mathsf {AMC}}\left (R,\in \right )$ with $\in _{A_\lambda }$ containing $\in _{\Delta _0}$ such that $\mathsf {AMC}(R,A_\lambda )$ is given by the $\in _{A_\lambda }$ -theory common to the structures $H_{\lambda ^+}^{\mathcal {M}}$ as $\mathcal {M}$ ranges among the $\in _{A_\lambda }$ -models of $R+T_{\in ,A_\lambda }$ .
7.2 Forcibility versus absolute model companionship
The following is the major result relating AMC to forcibility and to forcing axioms:Footnote 37
Theorem 7.4 Let S be the $\in $ -theory
Then there is a recursive setFootnote 38 $B\in \mathfrak {spec}_{\mathsf {AMC}}\left (S,\in \right )$ with $\in _B$ containing $\in _{\Delta _0}$ , and such that for any $\Pi _2$ -sentence $\psi $ for $\in _B$ and any $\in $ -theory $R\supseteq S$ the following are equivalent:
-
(a) $\psi \in \mathsf {AMC}(R,B)$ ;
-
(b) $(R+T_{\in ,B})^{\in _B}_{\forall \vee \exists }+S+{\mathsf {MM}}^{++}+T_{\in ,B}$ provesFootnote 39 ;
-
(c) R proves that is forcibleFootnote 40 by a stationary set preserving forcing;
-
(d) R proves that is forcible by some forcing;
-
(e) For any $R'\supseteq R$ , $\psi +(R'+T_{\in ,B})^{\in _B}_{\forall \vee \exists }$ is consistent.
Furthermore for any $\theta $ which is a boolean combination of $\Pi _1$ -sentences for $\in _B$ and any $(V,\in )$ model of S, TFAE:
-
(A) $(V,\in _B^V)$ models $\theta $ ;
-
(B) $(V,\in _B^V)$ models that some forcing notion P forces $\theta $ ;
-
(C) $(V,\in _B^V)$ models that all forcing notions P force $\theta $ .
The equivalence of (a) and (b) shows that the $\in _B$ -theory of $H_{\aleph _2}$ is model complete in models of strong forcing axioms; we give a partial converse in Theorem 7.9.
The second part of the theorem shows that forcing cannot change the $\Pi _1$ -fragment of the theory of V in signature ${\in _B}\supseteq {\in _{\Delta _0}}$ . Moreover, a direct consequence of the above result is that if $(V,\in )$ is a model of S and $S^*$ is the $\in _B$ -theory of its unique extension to a model of $T_{\in ,B}$ , we get that a $\Pi _2$ -sentence $\psi $ for $\in _B$ is consistent with the universal fragment of $S^*$ if and only if is forcible over V.
7.3 The AMC-spectrum of set theory and the continuum problem
The study of the AMC spectrum of set theory places $\aleph _2$ in a very special position among the possible values of the continuum.
Theorem 7.5 The following holds:
-
(1) Let $R\supseteq \mathsf {ZFC}$ be an $\in $ -theory. Assume $A\in \mathfrak {spec}_{\mathsf {AMC}}\left (R,\in \right )$ is such that $\in _A$ contains $\in _{\Delta _0}$ and $\neg {\mathsf {CH}}+(R+T_{\in ,A})^{\in _A}_{\forall \vee \exists }$ is consistent. Then ${\mathsf {CH}}\not \in \mathsf {AMC}(R,A)$ .
-
(2) For the signature $\in _B$ and the the theory S of Theorem 7.4 $\neg {\mathsf {CH}}$ is in $\mathsf {AMC}(R,B)$ for any $\in $ -theory $R\supseteq S$ .
Let us briefly argue why ${\mathsf {CH}}$ cannot be in $\mathsf {AMC}(R,A)$ whenever $R+\neg {\mathsf {CH}}$ is consistent.
Proof We use the following peculiar property of AMC (which can fail for model companionshipFootnote 41 ):
Fact 7.6 [Reference Viale41, proposition 2.2.8, definition 2.2.4, lemma 2.3.4] Let $T,S$ be $\tau $ -theories with T the AMC of S, and $\psi $ be a $\Pi _2$ -sentence such that $S+\psi $ is consistent. Then there is a model of $T+\psi $ .
Recall that ${\mathsf {CH}}$ is the conjunction of two $\Sigma _2$ -sentences (for $\in _{\Delta _0}$ ): $\psi _0$ stating “the first uncountable cardinal exists” and $\psi _1$ stating “there is a surjective function with domain the first uncountable cardinal and range the reals,” while $\neg {\mathsf {CH}}$ is the conjunction of $\psi _0$ with the $\Pi _2$ -sentence $\neg \psi _1$ .
Now if $\psi _0\notin \mathsf {AMC}(R,A)$ , ${\mathsf {CH}}\not \in \mathsf {AMC}(R,A)$ as well. Otherwise $R+\neg {\mathsf {CH}}$ is consistent by assumption, hence so is $R+T_{\in ,A}+\psi _0\wedge \neg \psi _1$ , since the latter is a conservative extension of the former. By the above, we can find a model $\mathcal {M}$ of $\mathsf {AMC}(R,A)+\neg \psi _1$ . This model witnesses that $\mathsf {AMC}(R,A)$ cannot prove ${\mathsf {CH}}$ , even when $\psi _0\in \mathsf {AMC}(R,A)$ .
Let us also argue why $\neg {\mathsf {CH}}$ falls in the AMC of $R+T_{\in ,B}$ for the set $B\subseteq \mathsf {Form}_\in \times 2$ and the theory R of (2) above. Towards this aim we appeal to the equivalence between (A) and (C) of Theorem 7.4 (which gives a precise instantiation of simplicity criterion (iii)). First of all notice that an $\in $ -theory R is complete if and only if so is $R+T_{\in ,B}$ . Let $R_0$ be some $\in $ -completion of $R\supseteq S$ and $(V,\in )$ a model of $R_0$ . We now note that $\in _B$ has a constant $\kappa $ to denote
, $\mathsf {AMC}(S,B)$ satisfies the $\in _{\Delta _0}\cup \left \lbrace \kappa \right \rbrace $ -sentence “ $\kappa $ is the first uncountable cardinal”, $\mathsf {AMC}(R_0,B)=\mathsf {AMC}(S,B)+(R_0+T_{\in ,B})_{\forall \vee \exists }$ (by Lemma 5.4). Let $\mathsf {B}$ be a cba such that
holds in $(V,\in )$ .Footnote 42 Let G be any ultrafilter on $\mathsf {B}$ . Then
(by [Reference Jech22, lemma 14.14]) and $V^{\mathsf {B}}/_G$ also models the universal and existential $\in _B$ -fragment of $R+T_{\in ,B}$ (by (A) $\Leftrightarrow $ (C) of Theorem 7.4, and [Reference Jech22, lemma 14.14]). In particular, the $\Pi _2$ -conjunct of $\neg {\mathsf {CH}}$ is consistent with the universal and existential fragment of any completion of $S+T_{\in ,B}$ , hence it belongs to the AMC of $S+T_{\in ,B}$ , while the $\Sigma _2$ -conjunct of $\neg {\mathsf {CH}}$ is in the AMC of $S+T_{\in ,B}$ since the axiom $\mathsf {Ax}^1_\psi $ for $\psi $ defining the first uncountable cardinal is in $T_{\in ,B}$ .
In particular, we see that forcing becomes a powerful tool to prove that a $\Pi _2$ -sentence formalizable in $\in _B$ is in the AMC of $S+T_{\in ,B}$ . Indeed, it suffices to prove over S that this sentence is forcible. More generally minimal variants of the above argument give:
Remark 7.7 Let S be the theory of Theorem 7.4, and $R_0\supseteq \mathsf {ZFC}$ be such that $R_0\cup S$ is consistent. Consider an $\in $ -sentence $\psi $ such that $\psi $ as formulated in signature $\in _B$ is $R_0+T_{\in _B}$ -equivalent to a $\Pi _2$ -sentence for $\in _B$ such that $R_0$ proves that $\psi ^{H_{\aleph _2}}$ is forcible. Then:
-
• $\psi \in \mathsf {AMC}(R,B)$ for any $R\supseteq S\cup R_0$ ;
-
• $\neg \psi \not \in \mathsf {AMC}(T,A)$ for any A and $\in $ -theory $T\supseteq R_0$ such that $\in _A$ contains all the symbols of $\in _B$ appearing in $\psi $ , and $(T+T_{\in _A})_{\forall \vee \exists }+\psi $ is consistent.
This shows that the argument described for ${\mathsf {CH}}$ is far more general and can be used to single out many other set theoretic properties. We can apply it for example to $2^{\aleph _0}>\aleph _2$ : Moore introduced in [Reference Moore32] a $\Pi _2$ -sentence $\theta _{\mathrm {Moore}}$ for $\in _{\Delta _0}$ to show the existence of a definable well order of the reals in type in models of the bounded proper forcing axiom.Footnote 43
Theorem 7.8 There is a $\Pi _2$ -sentence $\theta _{\mathrm {Moore}}$ for $\in _{\Delta _0}$ such that the following holds:
-
(1) $\theta _{\mathrm {Moore}}$ is independent of $S+T_{\Delta _0}$ , where S is the $\in $ -theory of Theorem 7.4.
-
(2) $\mathsf {ZFC}^-_{\Delta _0}+\exists x\,(x\text {is} \aleph _1)+\theta _{\mathrm {Moore}}$ proves that there exists a well-ordering ofFootnote 44 in type .
-
(3) $\mathsf {ZFC}^-_{\Delta _0}+\exists x\,(x\text {is} \aleph _2)+\theta _{\mathrm {Moore}}$ provesFootnote 45 that $2^{\aleph _0}=\aleph _2$ .
-
(4) For S and $\in _B$ the theory and signature considered in Theorem 7.4, $\exists x\,(x$ is $\aleph _1),\theta _{\mathrm {Moore}}$ are both in $\mathsf {AMC}(R,B)$ for any $\in $ -theory R extending S.
-
(5) If R extends $\mathsf {ZFC}$ , $A\in \mathfrak {spec}_{\mathsf {AMC}}\left (R,\in \right )$ is such that $\in _A$ contains $\in _{\Delta _0}$ , $\exists x\, (x$ is $\aleph _2)\in \mathsf {AMC}(R,A)$ and
$$\begin{align*}\theta_{\mathrm{Moore}}+(R+T_{\in,A})^{\in_A}_{\forall\vee\exists}+\mathsf{ZFC} \end{align*}$$is consistent, then $2^{\aleph _0}>\aleph _2$ is not in $\mathsf {AMC}(R,A)$ .
The two theorems single out $2^{\aleph _0}=\aleph _2$ among all possible solutions of the continuum problem.Footnote 46 For any $\in $ -theory R extending $\mathsf {ZFC}+$ large cardinals there is at least one $B\in \mathfrak {spec}_{\mathsf {AMC}}\left (R,\in \right )$ with $\neg {\mathsf {CH}}$ (actually with a definable version of $2^{\aleph _0}=\aleph _2$ ) in $\mathsf {AMC}(R,B)$ , and this occurs even if $R\models {\mathsf {CH}}$ or $R\models 2^{\aleph _0}>\aleph _2$ . On the other hand for any $\in $ -theory R extending $\mathsf {ZFC}$ , if ${\mathsf {CH}}$ is independent of R, then ${\mathsf {CH}}$ is never in $\mathsf {AMC}(R,A)$ for any $A\in \mathfrak {spec}_{\mathsf {AMC}}\left (R,\in \right )$ (if $\in _A$ contains $\in _{\Delta _0}$ ) and similarly if $\theta _{\mathrm {Moore}}$ is independent of R, $2^{\aleph _0}>\aleph _2$ is never in $\mathsf {AMC}(R,A)$ for any $A\in \mathfrak {spec}_{\mathsf {AMC}}\left (R,\in \right )$ (if $\in _A$ contains $\in _{\Delta _0}$ ).Footnote 47
7.4 Model completeness for the theory of $H_{\aleph _2}$ versus bounded forcing axioms and Woodin’s axiom $(*)$
Finally we can show that for certain signatures $\tau $ and a model $(V,\in )$ of $\mathsf {ZFC}+$ large cardinals, a model completeness result for the $\tau $ -theory of $H_{\aleph _2}^V$ is equivalent to assert that V satisfies a strong form of Woodin’s axiom $(*)$ . The reader interested in the (so far) optimal result is referred to [Reference Viale41, theorem 5.6]. Recall the axioms ${\mathsf {BMM}}$ , ${\mathsf {BMM}}^{++}$ , $(*)$ presented in Section 6.3. An immediate consequence of the definition of $\in _B$ and of the invariance of the universal $\in _B$ -theory of V across forcing extensions is the following result.Footnote 48
Theorem 7.9 Assume S and B are as in Theorem 7.4 and $(V,\in )$ is a model of S such that the $\in _B$ -theory of $H_{\aleph _2}^V$ is model complete. Then $(V,\in )$ models ${\mathsf {BMM}}^{++}$ .
Assume furthermore that is precipitous in V. Then $(V,\in )$ models $(*)$ .
Note that
is a precipitous ideal in V, if V models Martin’s maximum (of which ${\mathsf {BMM}}$ is a weakening). The theorem follows almost immediately from Theorem 7.4 by the observation that if G is V-generic for a stationary set preserving forcing and $(V,\in )$ models S
8 On maximality and justification
In this last section we focus on two philosophical aspects of the present approach. First, we address the following question: which notion of maximality is displayed by the model companions of $\mathsf {ZFC} +$ large cardinals? Then, we study the role of large cardinals and strong forcing axioms in providing model completeness for the theory of $H_{\aleph _2}$ . The role these axioms play in obtaining these model companionship results suggests a second question: is there a form of justification of set-theoretical axioms that emerges from the present approach? In the remainder of this section we address these two questions.
8.1 Maximality
Maximality principles play an important role in the study of the axiomatic extensions of $\mathsf {ZFC}$ . Maximality conveys the idea that the universe of sets is as full as it could be, while maximality principles are (first order) statements which try to capture this idea axiomatically.Footnote 49
There is no unique way to realize maximality in set theory, and its various forms depend on what in fact is maximized. Indeed, the process of maximization can be applied to objects, or possibilities.Footnote 50
-
• With respect to objects we find two standard forms of maximization: height maximality (expressing maximality for the lengths of the series of ordinals) and width maximality (expressing the maximality of the power-set operation). The former is successfully exemplified by large cardinals axioms [Reference Koellner24], while the latter represents a cluster of notions often connected to the justification of forcing axioms, generic absoluteness principles [Reference Bagaria7], and forms of horizontal reflection like the Inner Model Hypothesis [Reference Antos, Barton and Friedman1].
-
• For what concerns the maximization of possibilities, we find modal principles of forcing like the so-called Maximality Principle [Reference Hamkins18] (expressing the idea that everything that is possibly necessary is true) or resurrection axioms [Reference Asperó and Viale4, Reference Audrito and Viale5, Reference Hamkins and Johnstone20] (expressing the idea that if something is true, then it is necessarily possible). This form of maximality is closely related to forms of generic absoluteness, but it differs from it conceptually. Indeed, the justification of these modal principles often relays on potentialist views of set theory [27, Reference Shapiro28] that are not always compatible with the actualist ones that motivate generic absoluteness principles.Footnote 51
Now, what form of maximality is displayed by the notion of model companionship? Can it be compared to any of the above forms of set-theoretical maximality?
A first step toward answering the above questions consists in analyzing whether the maximality provided by models companions concerns primarily the underlying ontology (thus suggesting a form of maximization of objects), or the class of sentences that are considered valid (as it seems the case when considering generic absoluteness principles or modal principles of forcing). In other words, does the maximality provided by model companions concerns primarily the ontology of sets, or the family of set-theoretic truths? This question does not have a clear-cut answer. As a matter of fact, we believe that model companionship results for set theory display a multifaceted character related to both forms of maximality.
For what concerns height maximality, this notion is successfully realized by large cardinal axioms. While the model companionship results for set theory do not seem to enforce this form of maximality, they can nonetheless be used to justify it (we argue on this point in the next section). On the other hand, for what concerns width maximality, notice that bounded forcing axioms (and similar principles meant to express forms of horizontal maximality) can be reformulated in terms of generic absoluteness properties for . Many of the results we presented in the previous section outlines that such generic absoluteness results come in pairs with the model completeness for the theory of $H_{\aleph _2}$ with respect to natural signatures for this structure (e.g., Theorem 7.9 or [Reference Viale41, theorem 5.6]). Note that the standard formulation of bounded forcing axioms, such as ${\mathsf {BMM}},{\mathsf {BMM}}^{++},(*)$ , yield forms of absoluteness for the theory of $H_{\aleph _2}$ , which are (apparently) weaker than the model completeness result given by Theorem 7.4: they only assert that (for the appropriate fragment $\tau $ of $\in _B$ ) $H_{\aleph _2}$ is $\Sigma _1$ -elementary for $\tau $ only with respect to certain forcing extensions of V (which are $\tau $ -models of the universal $\tau $ -theory of V by the second part of Theorem 7.4). On the contrary, model completeness of the $\in _B$ -theory of $H_{\aleph _2}$ yields $\Sigma _1$ -elementarity for the much larger class of all $\in _B$ -superstructures which satisfy the same universal $\in _B$ -theory.
For what concerns the modal principles of forcing, we can argue as for width maximality. As a matter of fact, the Maximality Principle and resurrection axioms, as exposed in [Reference Asperó and Viale4, Reference Audrito and Viale5, Reference Hamkins18, Reference Hamkins and Johnstone20] are, by all means, principles of generic absoluteness for the theory of $H_{\aleph _2}$ (at least in the presence of mild forms of bounded forcing axioms). Indeed, for many of these principles their natural formulation asserts that the theory of $H_{\aleph _2}$ is given by those $\Pi _2$ -sentences $\varphi $ (for an appropriate fragment of $\in _B$ ) for which $\varphi \to \square \Diamond \varphi $ holds in the (Kripke frame given by the) generic multiverse composed of initial segments of forcing extensions of V. Therefore, they describe the theory of $H_{\aleph _2}$ with respect to the generic multiverse much like infinitely generic structures (introduced by Robinson in connection with the notion of infinite forcing) describe the theory of the existentially closed models for a first order theory. Leveraging on the model completeness results for $\in _B$ we can actually identify the theory of $H_{\aleph _2}$ in models of strong forcing axioms with that given by the infinitely generic structures, in signature $\in _B$ , for the class of models of $\mathsf {ZFC}+$ large cardinals. Hence the absoluteness displayed by the model companions of set theory, relative to $\in _B$ , largely encompasses that of the modal principles of forcing, since the latter are infinitely generic (at least on the face of their definition) only with respect to the generic multiverse and not with respect to the elementary class of the $\in _B$ -theory of $\mathsf {ZFC}_B+$ large cardinals.Footnote 52
Finally, what is the connection between the maximization given by model companionship results and the (somewhat vague) form of completeness that is encoded in the principle that we called Hilbertian Completeness? Although our aim, similar to Hilbert’s, is to implement the idea that the universe $(V, \in )$ contains all possible sets, our formal way to make this idea concrete differs significantly from Hilbert’s approach. Instead of focusing directly on the objects, we maximize existential sentences using our principle of Informal Existential Completeness (IEC). Thus, maximality of the objects receives in our approach a syntactic twist, being formulated in terms of maximality of solutions to simple problems (according to a syntactic measure of simplicity given by the selection of a specific first order signature).
Summing up, it seems that the form of maximality provided by (absolute) model companionship does not precisely fit with any of the standard forms of set-theoretical maximality. Nonetheless there are very strong similarities with generic absoluteness principles and certain types of forcing axioms,Footnote 53 and conceptual similarities with Hilbert’s idea of completeness. We see the notion of model companionship as a useful tool towards a mathematical formalization of the somewhat vague ideas of width maximality and Hilbertian completeness. Our opinion is that the model companions of set theory yield a form of maximality that we may call algebraic maximality. The inspiration clearly comes from algebraically closed fields. We consider algebraic maximization of a theory T as a two steps process: first, one determines which are the basic, simple, concepts of the theory of interest; then, once the language is extended to a signature $\tau $ with symbols for these concepts, one analyzes the T-ec $\tau $ -models. T is algebraically maximal for $\tau $ in case the $\tau $ -theory of the T-ec $\tau $ -models is model complete and also when its T-ec $\tau $ -models can be seen as “parts of” the models of T. This can be formally implemented as follows:
Definition 8.1 Let T be a $\tau $ -theory and A be a subset of $\mathsf {Form}_\tau \times 2$ . T is algebraically maximal relative to A, if $A\in \mathfrak {spec}_{\mathsf {AMC}}\left (T,\tau \right )$ , and there is a $\tau $ -formula $\psi _A(x)$ such that for any $\mathcal {M}=(M,\tau ^M)$ which models T:Footnote 54
-
• $(\psi _A(\mathcal {M}),\tau _A^{M})$ realizes $\mathsf {AMC}(T,A)$ ,
-
• $(\psi _A(\mathcal {M}),\tau _A^{M})$ is a $\tau _A$ -substructure of $(M,\tau _A^M)$ .
In terms of algebraic maximality we briefly review the results of Section 7. First of all notice that Theorem 7.3 states that for any extension T of $\mathsf {ZFC}$ , algebraic maximality can be obtained for T by sets $A_\kappa $ such that the AMC of T in signature $\in _{A_\kappa }$ describes the theory of $H_{\kappa ^+}$ in models of T, as $\kappa $ ranges among the T-definable cardinals.Footnote 55 Moreover, the first part of Theorem 7.4 states that algebraic maximality, relative to B, holds for the theory $\mathsf {ZFC}+{large cardinals} +$ forcing axioms, while its second part states that $\in _B$ satisfies the simplicity criteria set forth in (IEC) of Section 6. Finally, we have that Theorem 7.9 (and its optimal strengthening given by [Reference Viale41, theorem 5.6]) shows that algebraic maximality for the $\in _B$ of Theorem 7.4 entails (or is equivalent to) certain strong forcing axioms.
In conclusion we can argue that Theorems 7.4 and 7.9 show that algebraic maximality for set theory, relative to $\in _B$ , describes the correct axioms for the theory of $H_{\aleph _2}$ , as algebraic maximality for $\in _B$ realizes (IEC) for this fragment of the universe. This naturally lead us to discuss the second topic of this section: justification.
8.2 Justification
Let us now turn to the topic of justification. When it comes to justifying new set-theoretical principles, the proposed reasons normally fall into two main categories: intrinsic or extrinsic. These forms of justification have been developed within the so-called Gödel’s program:Footnote 56 a step-by-step extension of $\mathsf {ZFC}$ , aimed to coherently complete our picture of the universe of sets. In this sense, the present approach is clearly in the wake of Gödel’s program. Indeed, the axiomatization of the model companions of set theory should, eventually, provide a description of the theory of V as the stratification of the theories of the various $H_{\kappa ^+}$ . Furthermore, the essential role that large cardinals and strong forms of forcing axioms play in the individuation of these model companionship results suggests a justification of the truth of these axioms in view of the nice model-theoretic properties for the theories of $H_{\aleph _1}$ or $H_{\aleph _2}$ they entail.
In order to better argue for this form of justification of large cardinals and forcing axioms, let us briefly revise what we mean by intrinsic and extrinsic reasons.
-
• A form of justification is considered intrinsic when it is based on intrinsic features of the concept of set or, derivatively (given the foundational role of set theory), on notions that are key to the whole edifice of mathematics and logic. In this sense, a new axiom is justified when it captures an essential aspect of the concept of set or when it formalizes notions that are fundamental for mathematics and logic. This form of justification is utterly conceptual, since it rests on the theoretical priority of the notion of set over its formalization. Consequently, an extension of $\mathsf {ZFC}$ is well justified, when it faithfully represents the correct concept of set (e.g., the iterative conception [Reference Boolos11] or the quasi-combinatorial one [Reference Bernays10]).
-
• A form of justification is considered extrinsic when we are forced to accept an axiom by the abundance of its desirable consequences; even in the absence of intrinsic reasons. In this case an axiom is justified even if it does not seem prima facie to capture any relevant aspect of the concept of set. This second form of justification is clearly meant to overcome the limits of intrinsic reasons and to account for an experimental methodology in the context of the foundations of mathematics. Extrinsic justifications are pragmatic in nature, since they rely on the fruitfulness of an axiom in solving new problems, shortening proofs of already known results, and unifying substantial bodies of theory. The form of justification put forward by extrinsic reasons is akin to an inference to the best explanation within mathematics. Therefore, an extrinsically justified axiom is judged by its consequence and not by its meaning.
Intrinsic and extrinsic reasons have been extensively studied in the philosophy of set theory [Reference Maddy29, Reference Maddy30] and they have been widely applied, in recent debates, for the justifications of competing programs [Reference Arrigoni and Friedman2, Reference Koellner23]. These two forms of justification have also been criticized for their opacity in offering clear criteria of application and for the lack of demarcation between intrinsic and extrinsic reasons [Reference Barton, Ternullo and Venturi9]. Moreover, as it happened in the debate on the analytic-synthetic distinction, there is also no shortage of contributions that reject the problem of justification at its very base. Following a naturalistic account of mathematics, authors like Hamkins (or before him Cohen [Reference Cohen14]) propose to dismiss the problem of justification, together with the problem of independence, by defending the liberty of mathematicians to study different universes of set theory and by declaring the study of the variation of truth values among the models of $\mathsf {ZFC}$ to be all that mathematically matters for the study of independence [Reference Hamkins19].
Now, in which sense the nice model-theoretic properties of model companions are able to justify large cardinals and strong forcing axioms? Is the role of the latter set-theoretical principles in the individuation of model companions of set theory able to provide an intrinsic or an extrinsic justification for them?
A first complication that we face in addressing these questions is that the present approach does not only deal with axiomatic extensions of $\mathsf {ZFC}$ , but also with linguistic ones: the introduction of new signatures for set theory. This peculiar aspect is responsible for a justification of large cardinals and strong forcing axioms that contains elements of both intrinsic and extrinsic arguments. Indeed, on the one hand strong forcing axioms provide a maximization of the $\Pi _2$ -statements realized over $H_{\aleph _2}$ and, thus, have tremendously abundant consequences on third order arithmetic (and on this ground they can be extrinsically justified). On the other hand, because of the possibility to provide (generic) absoluteness results, large cardinal axioms determine the “simple” concepts that need to be included in the new signatures for set theory.Footnote 57 Therefore, large cardinals and strong forcing axioms help us understanding what are the basic concepts on which third order arithmetic should be based (and for this reason they can be intrinsically justified).Footnote 58
Another aspect of the present approach (that places the justification of new axioms somewhat outside the standard practice) is the focus on models, instead of sets (like intrinsic reasons), or problems (like extrinsic ones). As a matter of fact, large cardinals and strong forcing axioms play an important role in the construction of model companions of set theory, by detecting the natural signatures for which one has nice model-theoretic properties for (the models of) set theory. Consequently, the explicit link these axioms establish between the choice of a signature and model companionship results for set theory can also be seen as an indirect effect of these axioms in capturing certain natural features of the universe of sets. In this sense, the present approach makes evident the role that certain additional axioms of set theory have in clarifying the model theory of set theory, but also, and derivatively, the role that model-theoretic properties have in selecting what could be the natural theories for initial segments of the universe of sets. It is with respect to this double role (selecting simple concepts and providing natural theories) that our approach rooted in model companionship argues for the justifications of new axioms of set theory.
Because of the limits of the standard intrinsic–extrinsic dichotomy to capture the form of justification of this model-theoretic approach, we may call this new form of justification algebraic, since it rests on the model-theoretic properties of algebraically maximal theories. It is a form of justification that has both intrinsic and extrinsic elements and that heavily relies on the model-theoretic form of maximality that we described in the previous section.
In order to exemplify this algebraic form of justification, let us see how the present approach advances Gödel’s program in providing well-justified extensions of $\mathsf {ZFC}$ . This particular implementation of Gödel’s program is devised to provide new well-justified axioms of larger and larger initial segments of the universe V, concretely, by providing axioms able to make the theories of $H_{\kappa ^+}$ (as $\kappa $ ranges among the infinite cardinal) the model companions of set theory for the “correct” signature aimed at describing the elementary properties of $\mathcal {P}\left (\kappa \right )$ . In this way the theory of V is gradually completed by adding new principles whose justification rest on the nice model-theoretic properties they provide for the theories of the different $H_{\kappa ^+}$ . Let us see how this justification process can be implemented.
We start, of course, from $\mathsf {ZFC}$ and notice that this theory is algebraically maximal relative to $\in _A$ , which is the signature obtained by adding to $\in _{\Delta _0}$ predicates for the lightface definable subsets of in the Chang model .Footnote 59 In this case its existentially complete models describe the theory of , and—because is a definable sub-model of V—we have that all condition of algebraic maximality are fulfilled. By [Reference Venturi and Viale38, theorem 5.2], this theory is the AMC of any extension of $\mathsf {ZFC}$ for $\in _A$ .
However the set theoretic concepts expressed by atomic $\in _A$ -formulae or by the universal $\in _A$ -theory of $\mathsf {ZFC}+T_{\in ,A}$ are not simple according to any of our criteria (put forward in (a) of Section 6). For example is a $\Pi _1$ -sentence for $\in _A$ which holds in the $\mathsf {ZFC}$ model given by the constructible universe L and fails in $L[x]$ where x is a Cohen real over L. On the other hand, assuming as our base set theory $\mathsf {ZFC}+$ there are class many Woodin cardinals which are a limit of Woodin cardinals, the $\in _A$ -axiomatization of this set theory satisfies our simplicity criteria. Henceforth, we should accept large cardinal axioms —among other reasons—also because they render simple (according to our criteria) the notions codified by $\in _A$ . Consequently, on the base of this argument one can claim that the AMC of set theory enriched with large cardinal axioms, in the signature $\in _A$ describes the correct theory of $H_{\aleph _1}$ , and thus realizes the first step of our proposed way to implement Gödel’s program.
A similar argument can be applied to the theory of $H_{\aleph _2}$ , by appealing to Theorem 7.4. In this case the extension of $\mathsf {ZFC}$ by large cardinal axioms is able to make the signature $\in _B$ , in which Theorem 7.4 is expressed, simple according to our criteria of simplicity. Furthermore the nice model-theoretic properties of the theory of $H_{\aleph _2}$ described by the AMC of $\mathsf {ZFC} +$ large cardinals relative to $\in _B$ renders $\mathsf {ZFC} + {large cardinal axioms} + $ strong forms of bounded forcing axioms algebraically maximal with respect to the signature $\in _B$ . We can thus justify forcing axioms and large cardinals as they provide (by means of algebraic maximality relative to $\in _B$ ) the correct picture of the theory of $H_{\aleph _2}$ .
Now, starting with base theory $\mathsf {ZFC}+{large cardinals}+$ strong forms of bounded forcing axioms, we should search for criteria to detect which definable subsets of describe simple set theoretic notions, and search for the AMC of $\mathsf {ZFC}+{large cardinals} +$ strong forcing axioms relative to the signature expanded by these concepts. If and when this task is achieved,Footnote 60 we can argue that the corresponding AMC provides the correct description of $H_{\aleph _3}$ and that the relevant axioms should be added to the axioms of set theory.
In a nutshell this is the general strategy: to climb the cardinal hierarchy in order to implement algebraic maximality for set theory with respect to signatures that describe the properties of $\mathcal {P}\left (\kappa \right )$ for larger and larger cardinals $\kappa $ . This will come in pairs with set-theoretic axioms that fulfill (possibly weaker and weaker) criteria of simplicity for these signaturesFootnote 61 (which are algebraically maximal for more and more of their initial segments). This strategy has been successful so far with respect to $H_{\aleph _1},H_{\aleph _2}$ . Theorem 7.3 shows that no obstruction to its implementation for other cardinals is in sight.Footnote 62
We conclude by noting that there is another sense in which these model companionship results, the justification they offer to new axioms, and the solution they yield for the continuum problem are utterly Hilbertian. This is the autonomy they provide for mathematics with respect to its foundations. As a matter of fact, the use of the notion of model companionship to fix the theory of second and third order arithmetic (and to enforce algebraic maximality) does not introduce any element that is foreign to mathematical practice. The sought solution to the continuum problem is a solution obtained with formal tools and that is justified on a purely mathematical ground. In this sense the results presented in this paper seem to realize in full the possibility of an autonomous foundation for mathematics, as the one sought by Hilbert [Reference Franks15].
Acknowledgements
We thank Gabriel Goldberg and the two anonymous referees of RSL for many useful comments on previous drafts of this paper. We also thank David Asperó, Ilijas Farah, Leon Horsten, Luca Motto Ros, and Boban Veličković for many interesting discussions on these topics.
Funding
The first author acknowledges support from the Jovem Pesquisador FAPESP project no. 2016/25891-3 and from the RyC grant USE-21881-P. The second author acknowledges support from the project: PRIN 2017-2017NWTM8R Mathematical Logic: models, sets, computability and GNSAGA.