SET-THEORETIC BICONTEXTUALISM

SIMON SCHMITT

doi:10.1017/S1755020325100774

SET-THEORETIC BICONTEXTUALISM

Part of: Set theory General logic Philosophical aspects of logic and foundations

Published online by Cambridge University Press: 22 August 2025

SIMON SCHMITT

Show author details

SIMON SCHMITT*: Affiliation:
DEPARTMENT OF PHILOSOPHY AND EDUCATION CENTER FOR LOGIC, LANGUAGE, AND COGNITION (LLC) UNIVERSITY OF TURIN 10124 TURIN ITALY
*: E-mail: schmitt.sim@gmail.com

Article contents

Abstract
Introduction
Relativism in set theory
Bicontextualism for set theory
Set-theoretic bicontextualist treatment of the paradoxes
Extracting core properties
Objections and replies
Conclusion
Funding
Footnotes
References

Rights & Permissions

Abstract

Can we quantify over absolutely every set? Absolutists typically affirm, while relativists typically deny, the possibility of unrestricted quantification (in set theory). In the first part of this article, I develop a novel and intermediate philosophical position in the absolutism versus relativism debate in set theory. In a nutshell, the idea is that problematic sentences related to paradoxes cannot be interpreted with unrestricted quantifier domains, while prima facie absolutist sentences (e.g., “no set is contained in the empty set”) are unproblematic in this respect and can be interpreted over a domain containing all sets. In the second part of the paper, I develop a semantic theory that can implement the intermediate position. The resulting framework allows us to distinguish between inherently absolutist and inherently relativist sentences of the language of set theory.

Keywords

philosophy of set theory absolute generality quantifiers higher-order logic

MSC classification

Primary: 03A05: Philosophical and critical 03E70: Nonclassical and second-order set theories 03B80: Other applications of logic

Information

Type: Research Article
Information: The Review of Symbolic Logic , Volume 18 , Issue 3 , September 2025 , pp. 826 - 858

DOI: https://doi.org/10.1017/S1755020325100774 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright: © The Author(s), 2025. Published by Cambridge University Press on behalf of The Association for Symbolic Logic

1 Introduction

Generality absolutism is the claim that it is possible to quantify over absolutely everything. Examples of absolutely general quantification are not hard to find and involve fundamental logical, mathematical, and metaphysical claims, such as ‘Everything is self-identical’ or ‘Nothing is a member of the empty set’. Intuitively, these claims are absolutely unrestricted and their interpretation over a less than all-inclusive domain would be a fundamental misinterpretation. Generality relativism is the denial of generality absolutism, i.e., the claim that quantification is always restricted, and that no quantifier ever achieves absolute generality.Footnote ¹

Despite its initial appeal, many have questioned absolutism. The strongest argument against it, the Argument from Paradox as it is often called, comes from the semantic and set-theoretic paradoxes. In a nutshell, the argument begins by assuming absolutism and, with it, the existence of a domain containing absolutely everything. Now, according to standard model-theoretic semantics, domains of quantification are sets. Yet, Russell’s paradox immediately implies that no set can be universal. The reasoning is well-known. Let M be a supposedly all-inclusive set, and consider M’s Russell set $R_M := \{x \in M ~ | ~ x \notin x \}$ . If $R_M$ is in M, which it should be if M is universal, then it follows paradoxically that $R_M \in R_M \leftrightarrow R_M \notin R_M$ . Thus, $R_M$ is not in M, whence M is not universal after all. Since this argument can be generalized to apply to any kind of collection-like entity, the relativistic Argument from Paradox not only seemingly disqualifies universal set domains, but also any universal domain based on any other collection-like entity. It follows, relativists claim, that no domain can comprise absolutely everything.

Russell’s paradox, and the set-theoretic paradoxes more generally, have led to far-reaching conceptual changes in the foundations of mathematics: the naïve conception of set implicitly at work in the early writings of Cantor and Frege, according to which every predicate has a set extension, has been replaced by the nowadays standard iterative conception of set. According to this, a set is anything obtainable from some previously given objects, the urelements, by iterated application of set construction mechanisms, such as the powerset and the union operation.Footnote ² This elegant and sophisticated conception has been enormously successful, and has lead to the nowadays standard picture of the set universe as a cumulative hierarchy of sets.Footnote ³

As Studd [Reference Studd46, chap. 2] among many others observes, though, that the cumulative-iterative account is inherently relativistic. Since each stage in the cumulative hierarchy has a successor stage that strictly extends all previous stages, no stage can be universal. Consequently, it would seem, the cumulative-iterative account is incompatible with absolutism. At the same time, the cumulative-iterative account provides an elegant and widely accepted solution to Russell’s paradox and to the set-theoretic paradoxes more generally: every stage has a Russell set, which however enters the cumulative hierarchy only at a later stage. This blocks the paradox outright. But it also confronts us with a seemingly unavoidable decision: we must either embrace absolutism, and forgo the commonly accepted, elegant, and sophisticated solution to the paradoxes, or accept the standard cumulative-iterative account, and give up absolutism.

In a semantic setting, the situation is analogous. The semantic paradoxes—the paradoxes of semantic notions such as truth, satisfaction, and denotation—reveal a tension between naïve semantic principles, such as the interderivability of $\varphi $ and “ $\varphi $ ’ is true”, and classical logic. Contextualist approaches to the semantic paradoxes propose to retain classical logic and interpret the paradoxes as evidence that semantic notions are non-naïve and hierarchical. Contextualism provides a solution to the semantic paradoxes that is philosophically well-motivated, technically elegant, and, importantly, fully in keeping with standard hierarchical treatments of the paradoxes of set theory. However, just like in the set-theoretic case, contextualism appears to be inherently relativistic.Footnote ⁴

In a recent publication, Rossi [Reference Rossi42] has shown that, in the truth-theoretical setting, this dilemma is by no means forced upon us, and that there is a way to combine a partial form of absolutism—the view that at least some sentences can be given an absolutely unrestricted interpretation—with contextualist, broadly hierarchical solutions to the paradoxes.Footnote ⁵ Following Cartwright [Reference Cartwright9], Rossi drops the assumption that domains have to be entities, and instead advocates an ontologically neutral reading of the notion of a ‘domain’.Footnote ⁶ Moreover, he proposes to split up the sentences of the underlying language in those that lead to paradoxes, such as the Liar sentence $\lambda $ , which says of itself that it isn’t true, and those that are unproblematic in this respect, such as, e.g., ‘5 + 7 = 12’ or ‘Everything is self-identical’. He then proposes a bipartite semantics, where the unproblematic sentences are interpreted over an all-inclusive (non-objectual) ‘domain’, while problematic sentences are interpreted over restricted, set-sized domains. Thus, Rossi shows, we can combine absolutism for unproblematic sentences with contextualist approaches to the semantic paradoxes.

In this paper, I put forward a bipartite semantics for set theory that allows us to combine set-theoretic absolutism with the cumulative-iterative conception of sets. As I show below, just like one can develop a semantic theory that combines partial absolutism with standard hierarchical treatments of the semantic paradoxes, one can also develop a semantics for set theory that combines partial absolutism with the cumulative-iterative conception of sets. According to that position, the paradoxes show that some sentences must be relativistically interpreted, whence it is not reasonable to demand that every sentence be possibly given an absolutely unrestricted interpretation. At the same time, it is reasonable to demand that some sentences be given an absolutely unrestricted interpretation. This account, then, provides all the unrestricted quantification one can reasonably ask for, or so I argue. The upshot is a novel conception of quantification in set theory that allows us to retain the universal character of set theory—a theory whose subject matter is the entire cumulative hierarchy, which comprises absolutely all sets—and the open-ended, indefinitely extensible character of this very same hierarchy.

The primary aim is to develop a new semantics for set theory along the lines just sketched. This allows for an elegant, if standard, solution to the set-theoretic paradoxes, while preserving the possibility of quantifying over absolutely all sets. Crucially, I propose to use the bipartite and partially absolutistic semantics to identify a class of inherently absolutistic sentences of the language of set theory. These sentences do not have countermodels, neither in the standard sense of the cumulative hierarchy, nor in the all-inclusive model containing all sets. They express, then, the most fundamental facts about the set-concept and thereby shed new light on the concept of set.

The paper is structured as follows. §2 sets the stage by outlining the philosophical debate between absolutism and relativism in set theory, with particular emphasis on the cumulative-iterative conception and its role in responding to the paradoxes. §3 introduces the bicontextualist framework in two stages: first through a heuristic explanation (§3.1), and then through a formal development (§3.2–§3.6). §4 applies the framework to the set-theoretic paradoxes, showing that the bicontextualist treatment of paradoxes is in line with the standard treatment in nowadays set theory. §5 explores how the framework allows us to isolate core structural features of the set concept. §6 addresses a number of potential objections and replies. §7 concludes. An appendix provides background material on infinitary logic (A) and its use in formalizing set theory (B).

2 Relativism in set theory

The most influential critique of absolutism—again, understood as the claim that absolutely unrestricted quantification is possible—is based on the semantic and set-theoretic paradoxes. The modern version of the argument goes back to Dummett [Reference Dummett and Dummett12], who took the set-theoretic paradoxes to show that the notion of set is indefinitely extensible. But the core of the argument can already be traced back to Russell, who similarly took the paradoxes to show that we can never quantify over absolutely all sets.Footnote ⁷

The argument runs as follows: The truth of absolutism requires a domain containing absolutely everything, where domains are standardly conceived as sets, or—more generally—collection-like entities. However, the notion of a set, and of a collection more generally, is indefinitely extensible, which is taken to imply that no set, or collection, can contain absolutely everything. Hence, the argument concludes, there cannot be a domain containing absolutely everything, and so absolutism must be false. This general version of the Argument from Paradox needs some unpacking, beginning with the somewhat mysterious notion of indefinite extensibility.

Following Incurvati, “a concept C is indefinitely extensible iff whenever we succeed in defining a set M of objects falling under C, there is an operation which, given M, produces an object falling under C but not belonging to M”.Footnote ⁸ It is easy to show that the concept set is indefinitely extensible in this sense. Let M be a set and consider the subset $R_M \subseteq M$ which contains all and only those sets in M which are not members of themselves, i.e., $R_M := \{x \in M ~ | ~ x \notin x \}$ . Now reason is that, if $R_M$ is an element of M, we can ask whether $R_M$ is contained in $R_M$ (since $R_M$ is defined to be a subset of M). If it is, it must satisfy the defining condition of $R_M$ , that it is not self-membered, and so $R_M$ cannot be an element of $R_M$ . If $R_M$ is not a member of itself, it satisfies the defining condition of $R_M$ , and therefore must be a member of itself. This implies that $R_M \in R_M \leftrightarrow R_M \notin R_M$ , which entails absurdity in any logic at least as strong as minimal logic. We can therefore reject that $R_M$ is an element of M, which implies that there is a set ( $R_M$ ) not contained in M. In Incurvati’s terminology, we started from an object M falling under the concept ‘set’, and produced an object, $R_M$ , also falling under the concept ‘set’, but not belonging to M. Since M was completely arbitrary, the reasoning applies to any set. Hence, the concept set is indefinitely extensible.

The formal derivation of Russell’s paradox uses the naïve comprehension schema, according to which every definable property $\varphi (x)$ determines a set. Formally:

Naïve Comprehension: For any formula $\varphi (x)$ of the language of set theory which lacks x free and contains no free occurrences of y, the following is an axiom:

$$\begin{align*}\exists y \forall x (x \in y \leftrightarrow \varphi(x)). \end{align*}$$

If we let $\varphi (x)$ be $x \notin x$ , we get that $x \in y \leftrightarrow x \notin x$ .

The indefinite extensibility of the set concept directly implies that there can be no universal set. To see this, assume that U is a universal set. Now consider its Russell set $R_U$ : on pain of contradiction, $R_U$ cannot be in U. So $R_U$ is a set not contained in the supposedly universal set U, whence U cannot be universal after all. The argument, which is just an instance of Russell’s paradox, is fully general, i.e., it makes no specific assumptions about U. It therefore disqualifies any set from being universal.

In what follows, I will exclusively focus on sets. However, I should stress that the argument from indefinite extensibility is not limited to sets: any collection-like entity is equipped with a notion of membership, whence any supposedly all-inclusive collection gives rise to a subcollection containing all and only the non-self-membered collections. At this point, the only way to avoid a paradox would seem to require abandoning the assumption of a universal collection.

Relativists claim that the non-existence of a universal set directly proves absolutism wrong. Since, they reason, the all-inclusive domain postulated by absolutists contains absolutely everything, and since sets are things, the all-inclusive domain of absolutism must contain absolutely every set—that is, the universal domain has to be a supercollection of the universal set. However, as the above reasoning shows, there can be no universal set, whence there can’t be an all-inclusive domain after all either: no domain can be all-inclusive, which is to say that all quantifier domains, and quantification more generally, are restricted.

The set-theoretic paradoxes have led to far-reaching conceptual changes that affect, to this day, our understanding of the foundations of mathematics. The iterative conception of set, and its accompanying intended model of set theory, the cumulative hierarchy, have both been developed in reaction to the discovery of the paradoxes. Their immediate aim was to avoid the paradoxes; according to many, however, they genuinely capture essential properties of our conception of sets, or of sets themselves.Footnote ⁹ According to the iterative conception, “a set is something obtainable from the integers (or some other well-defined objects) by iterated application of the operation ‘set of”’.Footnote ¹⁰ Instead of postulating sets to be extensions of predicates (as in the naïve, i.e., pre-iterative conception), the iterative conception considers sets to be all and only those objects that we obtain from given objects (the urelements) by applying accepted set-forming mechanisms. Today’s standard set theory $\mathsf {ZFC}$ is widely regarded as an axiomatization of the iterative conception.Footnote ¹¹

The cumulative hierarchy V organizes sets in an open-ended sequence of stages, beginning at stage $V_0$ with the empty set, and then iterating the powerset operation at successor stages $V_{\alpha +1}$ and taking unions at limit stages $V_\gamma $ . The resulting hierarchy forms the ontological counterpart of the iterative conception, in the sense that it provides structurally nice models of $\mathsf {ZFC}$ .

The cumulative-iterative conception immediately blocks Russell’s paradox. From an axiomatic point of view, $\mathsf {ZFC}$ replaces the naïve comprehension schema with the more restricted separation schema:

Separation: For any formula $\varphi (x)$ of the language of set theory which lacks x free and contains no free occurrences of y and z, the following is an axiom:

$$\begin{align*}\forall y \exists z \forall x (x \in z \leftrightarrow x \in y \land \varphi(x)). \end{align*}$$

The difference between naïve comprehension and separation is that naïve comprehension postulates the existence of a set y containing all objects with the property $\varphi (x)$ outright, whereas separation only postulates the existence of a set z of all objects satisfying $\varphi (x)$ which are members of a previously given set y. This blocks the formal derivation of Russell’s paradox, because when we substitute $x \notin x$ for $\varphi (x)$ , we get $\forall y \exists z \forall x (x \in z \leftrightarrow x \in y \land x \notin x)$ , which only gives us the set z of all non-self-membered sets in y.

From a conceptual or ontological point of view, the cumulative hierarchy avoids Russell’s paradox in virtue of its open-ended character. For every stage $V_\alpha $ in V, the Russell set of $V_\alpha $ is not in $V_\alpha $ , but enters the hierarchy only at the later stage $V_{\alpha +1} = \mathcal {P}(V_\alpha )$ —the powerset of $V_\alpha $ . By the open-endedness of the construction, each stage gets expanded in that way. As a result, we only get indefinitely many ‘Russell sets’, each of which can be harmlessly integrated in V.

As Studd [Reference Studd46, chap. 2] has observed, the cumulative-iterative account is inherently relativistic. Cantor’s theorem implies that $\mathcal {P}(V_\alpha )$ is strictly larger than $V_\alpha $ and contains objects which are not in the previous stages. Hence, the sequence of increasing stages never reaches a final stage containing all sets, and no stage achieves absolute generality.

The cumulative-iterative account of set offers an elegant resolution to Russell’s paradox by organizing sets in stages. However, as mentioned at the outset, this solution seemingly comes at the cost of having to abandon absolutism: it offers an understanding of sets in terms of an open-ended cumulative hierarchy—one that, by its very nature, would appear to preclude the existence of an all-encompassing absolute domain. In turn, this seemingly undermines the view of set theory as a comprehensive theory about all sets.

Relativists are of course aware of the problem and have sought to simulate absolutely general talk by means such as schemata (in the case of Russell) or modal operators (in Studd’s case). But, it seems fair to say, these surrogates are all arguably inadequate for a number of reasons. As is well-known, schemata don’t allow one to express unrestricted existential generalisations. And, as Studd himself acknowledges, primitive modal operators give rise to revenge paradoxes (see Studd [Reference Studd46, chap. 7]).

To summarize, the tension between absolutism and relativism arises from the fact that each view captures an essential aspect of our set-theoretic practice. Absolutism retains the intuitive idea that set theory is meant to be a theory of all sets. Relativism, in contrast, captures the insight that the set-concept is itself inherently hierarchical and open-ended. This latter feature is brought out with particular clarity in the cumulative-iterative conception of sets and its formal counterpart $\mathsf {ZFC}$ . However, one must not misunderstand $\mathsf {ZFC}$ as a technically convenient theory that is only designed to avoid paradoxes. Rather, it reflects a deeper philosophical understanding of set-concept as an inherently hierarchical concept, with sets being arranged stagewise in the cumulative hierarchy. As such, the cumulative-iterative conception—and with it, $\mathsf {ZFC}$ —provides the most compelling conceptual analysis of the set-concept currently available.

3 Bicontextualism for set theory

At this point, it seems as if we’ve reached a dilemma: we must choose between unrestricted quantification, on the one hand, and the elegant but relativistic solutions to the paradoxes, on the other hand—we cannot have both. In the truth-theoretic case, Rossi [Reference Rossi42] has recently shown that this trade-off can be overcome, and that it is possible to combine absolutism understood as the claim that absolutely unrestricted quantification is possible with the relativistic solutions to the paradoxes.

As Parsons [Reference Parsons37] long pointed out, there are obvious similarities between the set-theoretic and the semantic paradoxes. More specifically, in the truth-theoretic case, contextualists postulate an open-ended hierarchy of contexts accompanied by a sequence of growing domains; in the set-theoretic case, the cumulative-iterative conception postulates an open-ended hierarchy of growing ranks. In keeping with this observation, I propose to generalise Rossi’s bipartite approach to the semantic paradoxes to the set-theoretic paradoxes and to set theory more generally.

The first step is to notice that the relativistic Argument from Paradox and the prima facie examples for absolutism are, strictly speaking, not incompatible. Absolutists maintain that, e.g., $\forall x ~ x = x$ is unproblematic, and requires an absolutely unrestricted interpretation of its quantifier. Relativists, in contrast, argue that we cannot interpret the quantifiers in the Liar sentence or in Russell’s paradox over an unrestricted domain, on pain of contradiction, and conclude from this that no quantifier can have unrestricted range.

While the argument correctly rejects absolutism for sentences involved in paradoxical reasoning, it does not tell against an absolutely unrestricted interpretation of unproblematic sentences. The problem here is that the failure of absolutism for certain paradoxical sentences simply does not imply the failure of absolutism for all sentences.Footnote ¹² While the paradoxes may indeed undermine a strong version of absolutism, understood as the view that it must be possible to provide an absolutely unrestricted interpretation of all sentences, they do not tell against absolutism’s core tenet, that absolute generality is possible, and that it is possible to provide an absolutely unrestricted interpretation of sentences like $\forall x ~ x = x$ .

This position is a natural middle ground between strong absolutism and relativism—one that allows us to interpret unproblematic sentences like $\forall x ~ x = x$ over an all-inclusive, non-objectual ‘domain’, and problematic sentences like $\lambda $ over set-sized domain. In a truth-theoretic setting, Rossi [Reference Rossi42] calls this bicontextualism. Due to the similarities between the semantic and truth-theoretic paradoxes, I apply bicontextualism’s key ideas to set theory and the foundations of mathematics. As in the case of truth-theoretic bicontextualism, I allow the interpretation of unproblematic sentences from the language of set theory, such as $\neg \exists x ~ x \in \emptyset $ over a ‘domain’ containing absolutely all sets, and problematic sentences, such as $\forall x (x \in y \leftrightarrow x \notin x)$ over restricted domains only.

Let me now develop the bicontextualist approach to set theory, beginning with a first informal presentation of the main ideas, before providing the technical details.

3.1 Heuristics

The key idea of bicontextualism for sets is to decide sentence by sentence between an absolutistic and a relativistic interpretation: inherently relativistic sentences are interpreted relativistically, in order to avoid paradoxes; in contrast, inherently absolutistic sentences are allowed to be interpreted over an all-inclusive ‘domain’ containing absolutely every set.

Bicontextualism’s characteristic feauture is the abandonment of the principle of unified interpretation, according to which all sentences of the relevant object language receive the same interpretation.Footnote ¹³ In contrast, (set-theoretic) bicontextualism allows for a bipartite interpretation of the language (of set theory), on which some sentences can be interpreted over an all-inclusive ‘domain’, while others can only be interpreted over restricted domains. Ontologically speaking, bicontexualism postulates a non-objectual ‘domain’ containing absolutely everything, and several objectual restricted and hierarchically organized domains living inside the big ‘domain’. Unproblematic sentences can then be interpreted with the widest of all domains, while problematic and paradoxical sentences will always be restricted to one of the smaller domains inside the big ‘domain’.

I should stress that I don’t interpret talk of an all-inclusive ‘domain’ at face value; I rather understand it in a broadly Fregean sense (Williamson [Reference Williamson49]). More precisely, although I take the paradoxes to show that all-inclusive domains cannot exist as entities, I also maintain, in line with Rossi’s original presentation of bicontextualism, that we can nevertheless understand ‘domains’ through plurals: they are not reified objects, but rather pluralities of things.Footnote ¹⁴

As for the restricted domains, I take them to be the strongly inaccessible rank-initial segments of the cumulative hierarchy, that is, domains of the form $V_\kappa $ for inaccessible $\kappa $ .Footnote ¹⁵ This choice is not arbitrary. As argued above, the cumulative-iterative conception captures the hierarchical and open-ended character of the set-concept, and $\mathsf {ZFC}$ is widely understood as its formal counterpart. By working with $V_\kappa $ -domains, the framework retains the structural features that make the relativist approach to the paradoxes both philosophically attractive and technically robust. The restricted domains, then, are not just any set-sized models of set theory, but precisely those that arise from the iterative conception, as axiomatized by $\mathsf {ZFC}$ .Footnote ¹⁶

It is important not to presuppose the distinction between inherently absolutistic and relativistic sentences, but to construct a semantic theory for the language of set theory in such a way that the distinction actually drops out of the theory. In order to do so, I work with a generalization of standard model theory that is compatible with generality absolutism. According to this approach, there is a plural ‘domain’ containing absolutely all sets.Footnote ¹⁷ And since the ranks of the cumulative hierarchy are sets, they are all contained in the universal ‘domain’.Footnote ¹⁸

Formally, I use a language that has a name for every set—let $\mathcal {L}$ be such a language. When we interpret $\mathcal {L}$ over the ‘domain’ containing all sets, the resulting ‘interpretation’ is a total function. When we interpret $\mathcal {L}$ over a restricted domain, say some $V_\kappa $ in V, the resulting interpretation is a partial function instead. The reason is that, since $\mathcal {L}$ has a name for every set, there are many more names in $\mathcal {L}$ than sets in $V_\kappa $ (for any $\kappa $ ).Footnote ¹⁹ Names that do not correspond to sets in $V_\kappa $ simply do not denote.Footnote ²⁰

I use a three-valued semantics with truth values $\{0, {1}/{2}, 1\}$ . If the interpretation function is a partial function, the semantics will be such that every sentence with denotationless terms has value ${1}/{2}$ , and sentences which only contain denoting names have classical truth values. The main thought is that according to the restricted models, sentences referring objects outside the current domain are nonsensical or off-topic.Footnote ²¹

In the next step, I interpret all $\mathcal {L}$ -sentences over and over again, over all the restricted domains ( $V_\kappa $ ’s) and over the all-inclusive, maximal plural ‘domain’. This yields classes of sentences validated by all the different models, which allow me to define the inherently absolutistic and relativistic sentences of $\mathcal {L}$ . More specifically, notice that an inherently absolutistic sentence cannot just be taken to be a sentence that is true according to the universal ‘domain’.Footnote ²² Think of the sentence $\exists \kappa \mathsf {IC}(\kappa )$ , saying that there is an inaccessible cardinal. This sentence is false in all V-ranks below $\kappa $ (assuming $\kappa $ to be the first inaccessible). Hence, I will not count this sentence as inherently absolutistic, since it has at least one countermodel. I therefore take the inherently absolutistic sentences to be those that are true in the universal ‘domain’, and have no countermodel.

The inherently relativistic sentences are then defined via the inherently absolutistic sentences. To see this, consider, for some $V_\kappa $ , the collection of all and only those sentences true in $V_\kappa $ , and then subtract from it the sentences that are also true over the universal ‘domain’. The resulting collection contains all and only those sentences true in $V_\kappa $ due to their relativistic nature. My main claim, then, is that all the sentences of $\mathcal {L}$ that are classified as inherently absolutistic by the above semantics are to be interpreted over the all-inclusive ‘domain’, whereas all the sentences that are classified as inherently relativistic are to be interpreted over restricted domains. The upshot is a semantic theory that allows us to reconstruct the elegant relativistic approach to the set-theoretic paradoxes within a framework that also contains an all-inclusive ‘domain’, over which, in spite of the paradoxes, absolutistic sentences can be interpreted.

Using the three-valued semantics and the ability of $\mathcal {L}$ to name every set, we can also consider sentences such as the $\mathcal {L}$ -formalization of “for every $V_\alpha $ , there is a $V_\beta $ , such that $\alpha < \beta $ and $V_\alpha \in V_\beta $ ”. Such a sentence cannot be formalized in the standard language of set theory—at least not with its standard, absolutist meaning. The point is this: from within a given model $V_\kappa $ , one can define an internal version of the cumulative hierarchy, with internal ordinals $\alpha , \beta $ , and internal ranks $V_\alpha , V_\beta $ , satisfying $V_\alpha \in V_\beta $ .Footnote ²³ The sentence is thus expressible and even provable in the internal language of $V_\kappa $ . But this is a relativist interpretation: what the sentence expresses is a fact about how a given model internally simulates its own hierarchy. From an absolutist point of view, in contrast, the vocabulary of set theory refers not to what is definable inside some model, but to the actual cumulative hierarchy. In the bicontextualist semantics, the sentence is an example that requires the all-inclusive plural ‘domain’ containing all sets to express its intended meaning.

Both large cardinal axioms and Russell-style sentences are classified as inherently relativistic in the present framework. The reason is straightforward: their truth is not preserved across all domains. A sentence asserting the existence of a large cardinal $\kappa $ is false in all $V_\alpha $ with $\alpha < \kappa $ , and becomes true only in sufficiently large domains. Similarly, the sentence asserting the existence of the Russell set of $V_\alpha $ is false in $V_\alpha $ , but true in $V_{\alpha +1}$ . In both cases, the sentence fails in some domain and is verified in a strictly larger one. According to the classification criterion developed here, this makes them inherently relativistic.

Note that what I call inherently absolutistic and inherently relativistic sentences is inspired by, though not identical with, the distinction between problematic and unproblematic sentences in Rossi [Reference Rossi42]. While Rossi’s approach primarily focuses on the distinction between paradoxical and non-paradoxical sentences, the set-theoretic case takes into account that any sentence that transcends some ranks of the cumulative hierarchy due to size-restrictions gives rise to wider, more-encompassing ranks. This is particularly important for the case of large cardinal axioms mentioned above. It is important that, even though paradoxical sentences and large cardinal axioms are classified as inherently relativistic by the semantics to be developed below, this does not mean that they are equally problematic in all respects. Nevertheless, from the relativist point of view, they exhibit a crucial structural similarity: both require an act of domain expansion. Working within a given $V_\alpha $ , the sentence asserting the existence of the Russell set of $V_\alpha $ , and the sentence asserting the existence of a set of size $\kappa> \alpha $ , are both false in $V_\alpha $ and become true only in some $V_\beta $ with $\beta> \alpha $ . In this sense, both statements demand that we step outside the current set-theoretic universe in order to interpret them as true. The mechanism at work—failure at one level, truth at a higher one—is structurally the same.

Furthermore, what we classify as a paradox often depends on the background context. Russell’s paradox is typically regarded as a paradox, but in standard axiomatic contexts (such as $\mathsf {ZFC}$ ), it is treated as a theorem stating that a universal set cannot exist. The contradiction arises only when one assumes such a set exists. From this perspective, even paradigmatic paradoxes like Russell’s share more with large cardinal principles than might initially be assumed: both are sentences whose evaluation highlights the limits of a given domain and the need for extension beyond it. In that respect, the semantic framework developed here justifies grouping them under the same formal category of inherently relativistic sentences—while still allowing for important conceptual distinctions between them.

It is also important to clarify why the meta-theoretic assumption of large cardinals is unproblematic. Although sentences asserting the existence of large cardinals are classified as inherently relativistic within the object language, this only highlights that some models fail to accommodate them. It does not mean that such sentences are problematic in any stronger epistemic or metaphysical sense. From an absolutist standpoint, which bicontextualism partially retains, there is nothing puzzling about assuming the existence of inaccessible cardinals at the meta level, while simultaneously classifying them as non-absolutist truths about the set concept.Footnote ²⁴

3.2 The RU approach

Rayo and Uzquiano [Reference Rayo and Uzquiano40] develop a generalization of standard model theory based on higher-order logic that is compatible with generality absolutism.Footnote ²⁵ . The main idea of their proposal (the RU approach, as I call it) is to rephrase key principles from standard model theory (model, variable assignment, satisfaction) for a first-order object language in a second-order metalanguage. For that purpose, a free monadic second-order variable X is taken to code the notion of a model for a first-order object theory.

Rayo and Uzquiano [Reference Rayo and Uzquiano40] combine their higher-order semantics with the plural interpretation for higher-order expressions also developed by Boolos.Footnote ²⁶ On the plural interpretation, monadic second-order variables do not denote sets or set-like entities, as in standard semantics for second-order logic.Footnote ²⁷ Rather, second-order variables introduce a new kind of denotation, so that they plurally denote first-order entities.Footnote ²⁸ On the plural-interpreted RU approach, then, a ‘model’ for a first-order language is given by the values that the second-order variable X takes according to the second-order definition they give.

One key advantage of the plural-interpreted RU approach is that it allows us to dispense with what Cartwright [Reference Cartwright9] famously called the all-in-one principle. According to this principle, the domain of a model must always constitute a set or some other collection-like entity. In contrast, the RU approach avoids reifying ‘domains’ and ‘models’, as ‘domains’ are not treated as entities themselves but rather as the first-order entities plurally denoted by higher-order variables. As a result, the all-inclusive domain is not a ‘thing’ in the strict sense, making it immune to the relativistic argument from paradox, which only applies to reified domains, such as sets. This flexibility is pivotal: by rejecting the all-in-one principle and adopting a plural interpretation of higher-order variables, the RU approach circumvents set-theoretic size restrictions and aligns with generality absolutism.

One final note on the object theory: ontologically speaking, the relativistic solution to the paradoxes is based on the cumulative hierarchy. However, this is just a special class of models of set theory, since, after all, it doesn’t include the nonstandard models of (first-order) ZFC whose existence follows from the compactness and Löwenheim–Skolem theorems. Following Zermelo, I want to rule out all these nonstandard models and focus only on structurally nice models in the form of V. Consequently, I need to work with a theory strong enough to yield (quasi-)categorical axiomatizations. There are at least two options available: either work with a second-order version of ZFC, or with an infinitary one. Both give the desired results, and single out only the structurally nice models from the cumulative hierarchy. In what follows, I’ll use the infinitary approach.Footnote ²⁹ The choice made here is primarily technical, and $\mathsf {ZFC}_2$ would work just as well. Nevertheless, the infinitary approach allows us to work with a form of first-order logic which requires just a modicum of modifications to the RU approach.Footnote ³⁰ This means that I have to take an infinitary language as object languages and adjust the definition of satisfaction accordingly. This leads to the following definition.

Definition 3.1. Let $\mathcal {L}$ be a first-order language which satisfies the following requirements:

1. $\mathcal {L}$ has $\in $ as its only relation symbol, and has no function symbols;
2. $\mathcal {L}$ has a name for each set;
3. for every every ordinal $\alpha $ , $\mathcal {L}(\alpha )$ is the restriction of $\mathcal {L}$ to $V_\alpha $ ;
4. for ordinals $\kappa $ , $\lambda $ , $\mathcal {L}_{\kappa \lambda }$ is the infinitary language resulting from $\mathcal {L}$ by allowing disjunctions of length $\kappa $ and quantifier sequences of length $\lambda $ .

By requirement 1, the only relation constant is $\in $ , and the language has no function constants. This simplifies the definitions and comes at no cost since functions can always be defined by relations that are univocal to the right. By requirement 2, the object language has a name for every set, which makes the language huge.Footnote ³¹ By requirement 3, the restricted languages $\mathcal {L}(\alpha )$ have names for every set in $V_\alpha $ , where $\alpha $ is the $\alpha $ th inaccessible rank and $V_\alpha $ is the $\alpha $ th inaccessible rank. By regularity of $\alpha $ , $|\mathcal {L}(\alpha )|=|V_\alpha |=|H(\alpha )|=\alpha $ . By requirement 4, we have infinitary languages of the form $\mathcal {L}_{\kappa \lambda }$ , which allow disjunctions of length $< \kappa $ and quantifier sequences of length $< \lambda $ . Requirements 3 and 4 can be combined, so $\mathcal {L}_{\kappa \lambda }(\alpha )$ is the first-order language with a name for each set in $V_\alpha $ , allowing disjunctions of length $<\kappa $ and quantifier sequences of length $< \lambda $ . In practice, however, I will only consider cases where $\lambda = \kappa $ , so I will use $\mathcal {L}_\kappa $ as short for $\mathcal {L}_{\kappa \kappa }$ (and accordingly $\mathcal {L}_\kappa (\alpha )$ ) to simplify notation.

3.3 An absolutist-friend semantics for set theory

The definition of a higher-order model for $\mathcal {L}_{\kappa }(\kappa )$ is identical to the definition of a higher-order model for $\mathcal {L}$ , so the difference between standard and infinitary first-order languages will only be important for the definition of the satisfaction predicate. The model predicate will be defined in such a way that it captures the case for $\mathcal {L}$ and for $\mathcal {L}(\kappa )$ , the restriction of $\mathcal {L}$ to $V_\kappa $ . This will correspond to the intuition given in §3.1 that the interpretation function, when used to interpret $\mathcal {L}$ over all sets, is a total function, and when restricted to a smaller model, is only a partial function. However, this makes it necessary to adapt the classical higher-order semantics in some way: the definitions given in the literature don’t consider partial interpretation functions.Footnote ³² Formally, the restriction is implemented by requiring that only those constants in $\mathcal {L}(\kappa )$ have denotation, and those in $\mathcal {L} \setminus \mathcal {L}(\kappa )$ have no denotation. For the limit case, where $\mathcal {L}(\kappa ) = \mathcal {L}$ , $\mathcal {L} \setminus \mathcal {L}(\kappa ) = \emptyset $ .Footnote ³³ The definition of a model for a first-order language is given in monadic second-order logic.

Definition 3.2 (First-Order RU-model)

For every unary second-order variable X, the second-order formula “X is an RU-Model for $\mathcal {L}(\kappa )$ ”, in symbols $\mathbb {M}(X)$ , is defined as:

$$ \begin{align*} \mathbb{M}&(X) :\leftrightarrow \exists x X(\langle \ulcorner \forall \urcorner, x \rangle) \land \forall x \Big[X(x) \rightarrow \big( \exists y (x = \langle \ulcorner \forall \urcorner, y \rangle) \lor \\ & \hspace{0.5cm} \exists u \exists v ( x = \langle u, v \rangle ) \lor \exists w \exists z (x = \langle \ulcorner \in \urcorner, \langle w, z \rangle \rangle)\big) \Big] \land \\ & \forall x \Big[ \mathsf{Con}_\kappa (x) \lor \mathsf{Var}_\kappa (x) \rightarrow \exists ! y \big( X(\langle x,y \rangle) \land X( \langle \ulcorner \forall \urcorner, y \rangle ) \big) \Big] \land \\ & \forall x \Big[ \mathsf{Con}_{\mathcal{L} \setminus \kappa}(x) \lor \mathsf{Var}_{\mathcal{L} \setminus \kappa}(x) \rightarrow \neg \exists y \big( X(\langle x,y \rangle) \big) \Big] \land \\ & \forall w \forall z \Big[X (\langle \ulcorner \in \urcorner, \langle w, z \rangle \rangle) \rightarrow X(\langle \ulcorner \forall \urcorner, w \rangle) \land X(\langle \ulcorner \forall \urcorner, z \rangle )\Big]. \end{align*} $$

Remark 3.3.

• An RU-model for $\mathcal {L}(\kappa )$ contains ordered pairs of three different kinds. There are pairs of the form $\langle \ulcorner \forall \urcorner , x \rangle $ , encoding that x can be quantified on; $\langle \ulcorner \in \urcorner , \langle x, y \rangle \rangle $ , coding that the pair $\langle x, y \rangle $ is in the extension of the $\in $ -relation; and $\langle x, y \rangle $ , where x is either an $\mathcal {L}(\kappa )$ -constant or an $\mathcal {L}(\kappa )$ -variable, which encodes that y interprets x.
• According to the above definition, X is a ‘model’ for the language $\mathcal {L}(\kappa )$ if and only if X is non-empty, and X contains interpretations for all $\mathcal {L}(\kappa )$ -constants, -variables, and for all the n-ary $\mathcal {L}(\kappa )$ relation constants. Moreover, X does not interpret the constants (i.e., names) that are contained in the big language $\mathcal {L}$ but not in the smaller $\mathcal {L}(\kappa )$ . This is due to the already mentioned intuition that when we talk about objects outside $V_\kappa $ with $\mathcal {L}(\kappa )$ , this is off-topic.
• Recall that I use the plural interpretation for the higher-order variables, so when I speak about X as a model, this is actually misleading. Rather, the objects that collectively satisfy the above definition, when combined, interpret the object language $\mathcal {L}(\kappa )$ . I will nevertheless often fall back to standard model-talk for simplicity. However, the reader should bear in mind that this can always be translated back into more complicated plural-talk. Consequently, expressions such as the model should not falsely be read as reifications of pluralities into collection-like entities.Footnote ³⁴
• Note that unlike the standard approach (in model theory but also the approach presented in Rayo & Uzquiano [Reference Rayo and Uzquiano40]), in the above definition, the ‘model’ already contains the variable assignment. This follows the approach developed by Rossi [Reference Rossi42] and is only to simplify definitions. It is important for the definition of variations because I have to define them to be variations of ‘models’ and not just of assignments, which I do next.

3.3.1 RU-domains and RU-variations

Next I will define the domain component and the variations of variable assignments in the higher-order framework. These definitions will be crucial for the definition of the satisfaction predicate. I start with the domain component.

Definition 3.4 (RU-domain)

For all second-order variables Y and X, the second-order formula “Y is the RU-domain of the RU-model X”, in symbols $\mathbb {D}(X,Y)$ , is defined as:

$$ \begin{align*} & \mathbb{D}(X,Y) :\leftrightarrow \mathbb{M}(X) \land \forall x \Big(Y(x) \leftrightarrow \exists y (X (y) \land y = \langle \ulcorner \forall \urcorner, x \rangle) \Big) \end{align*} $$

The domain component Y of an RU-model X contains all and only those objects (sets), which the RU-model can quantify on.

Next I will formalize the notion of a variation of a variable assignment. As already anticipated in Remark 3.3, variable assignments are part of the definition of an RU-model, and thus a variation of an assignment is a variation of an RU-model. Since the focus is on infinitary object languages, a variation is always defined for a sequence of variables. Let the object language be $\mathcal {L}_\kappa $ . Here is the definition of a variation for a sequence of variables $\vec {x_\alpha }$ of length $\alpha $ for $\alpha <\kappa $ .

Definition 3.5 (RU-Variant)

For any sequence of first-order variable $\vec {x_\alpha }$ , and any second-order variables $X, Y$ , the second-order formula “the RU-model Y is an $\vec {x_\alpha }$ -variation of X”, where $\vec {x_\alpha }$ is a variable sequence of length $\alpha <\kappa $ , in symbols $\mathbb {V} (X, Y, \vec {x_\alpha })$ , is defined as:

$$ \begin{align*} & \mathbb{V}(X, W, \vec{x_\alpha}) :\leftrightarrow \mathbb{M}(X) \land \mathbb{M}(Y) \land \exists Z (\mathbb{D}(X, Z) \land \mathbb{D}(Y,Z)) \land \\ & \hspace{1.5cm} \forall y (y \neq x_1 \land \ldots \land y \neq x_\alpha \rightarrow \forall z (X(\langle y, z \rangle) \leftrightarrow Y (\langle y, z \rangle ))). \end{align*} $$

According to Definition 3.5, Y is an RU- $\vec {x_\alpha }$ -variant of X if and only if both X and Y are RU-models with the same domain Z, and if they both agree on the interpretation of all variables, expect possibly those in $\vec {x_\alpha }$ .

3.4 The satisfaction predicate

Recall that I want to have a semantics that allows me to assign every sentence that has denotationless terms the intermediate truth value ${1}/{2}$ . All other sentences, i.e., all those in which each term denotes, shall receive a classical truth value. For that purpose, I use a weak Kleene satisfaction predicate. In the literature on weak Kleene, the third truth value is sometimes called infectious, because as soon as a sentence $\varphi $ has this truth value, any complex sentence that you build with $\varphi $ also has this truth value. There might be different intuitions on that point. For instance, one might argue that even though $\varphi $ is meaningless in this sense, a sentence, such as $\varphi \lor 0 = 0,$ should still come out true. However, I stick with the off-topic interpretation. The resulting picture is that of a sequence of relativistic models $V_1, \ldots , V_\alpha , \ldots $ (where $V_1$ is the first inaccessible, $V_\alpha $ is the $\alpha $ th inaccessible and so on) and additionally a plural ‘domain’, to be denoted by $\mathsf {V}$ , which contains all sets. Moreover, I use the languages $\mathcal {L}_1, \ldots \mathcal {L}_\alpha , \ldots $ as well as the big language $\mathcal {L}$ containing names for every set. However, the language I’m ultimately interested in is the big language $\mathcal {L}$ . When we interpret $\mathcal {L}$ over the sequence of $V_\kappa $ ’s, i.e., as we move up in the cumulative hierarchy, more and more $\mathcal {L}$ -sentences receive a classical evaluation. The limit case is when we look at $\mathcal {L}$ and the all-inclusive ‘domain’ $\mathsf {V,}$ where all terms denote, and where satisfaction is fully classical. Here is the higher-order definition of the weak Kleene satisfaction predicate $\mathsf {Sat}_\kappa $ for the object language $\mathcal {L}_\kappa $ that allows for $\kappa $ -sequences of quantifiers and disjunctions.

Definition 3.6 (Weak Kleene Satisfaction)

Let X and Y be second-order variables. The second-order formula “the RU-model X with RU-domain Y satisfies the $\mathcal {L}_\kappa $ -sentence $\varphi $ ”, in symbols $\mathsf {Sat}_\kappa (\varphi ,X,Y)$ , is defined as follows: $\mathsf {Sat}_\kappa (\varphi ,X,Y)$ :iff

1. $\mathbb {M}(X)$ and $\mathbb {D}(X,Y)$ and
2. $\varphi \equiv v_i = v_j$ , there are $y_i, y_j$ s.t. $X(\langle v_i, y_i \rangle )$ , $X(\langle v_j, y_j \rangle )$ and $y_i =y_j$ , or
3. $\varphi \equiv v_i \neq v_j$ , there are $y_i, y_j$ s.t. $X(\langle v_i, y_i \rangle )$ , $X(\langle v_j, y_j \rangle )$ and $y_i \neq y_j$ , or
4. $\varphi \equiv v_i \in v_j$ , there are $y_i, y_j$ s.t. $X(\langle v_i, y_i \rangle )$ , $X(\langle v_j, y_j \rangle )$ and $X(\langle \ulcorner \in \urcorner , \langle v_i,v_j \rangle \rangle )$ , or
5. $\varphi \equiv v_i \notin v_j$ , there are $y_i, y_j$ s.t. $X(\langle v_i, y_i \rangle )$ , $X(\langle v_j, y_j \rangle )$ and $\neg X(\langle \ulcorner \in \urcorner , \langle v_i,v_j \rangle \rangle )$ , or
6. $\varphi \equiv \neg \neg \psi $ and $\mathsf {Sat}_\kappa (\psi , X, Y)$ , or
7. $\varphi \equiv \bigvee \Phi $ , $\Phi = \{ \varphi _\alpha : \alpha < \kappa \} \subseteq \mathsf {For}_\kappa $ , there is some $\varphi _\alpha \in \Phi $ s.t. $\mathsf {Sat}_\kappa (\varphi _\alpha , X,Y)$ , and for all $\varphi _\beta \in \Phi $ , $\mathsf {Sat}_\kappa (\varphi _\beta , X,Y)$ or $\mathsf {Sat}_\kappa (\neg \varphi _\beta , X,Y)$ , or
8. $\varphi \equiv \neg \bigvee \Phi $ , $\Phi = \{ \varphi _\alpha : \alpha < \kappa \} \subseteq \mathsf {For}_\kappa $ , and for all $\varphi _\alpha \in \Phi $ , $\mathsf {Sat}_\kappa (\neg \psi _\alpha , X, Y)$ , or
9. $\varphi \equiv \forall \vec {x_\kappa } \psi (Q)$ , and for every W s.t. $\mathbb {V}(X, W, \vec {x_\kappa })$ , $\mathsf {Sat}_\kappa (\psi (\vec {x_\kappa }), W, Y)$ , or
10. $\varphi \equiv \neg \forall \vec {x_\kappa } \psi (\vec {x_\kappa })$ , and for some W s.t. $\mathbb {V}(X, W, \vec {x_\kappa })$ , $\mathsf {Sat}_\kappa (\neg \psi (\vec {x_\kappa }), W, Y)$ ,

where $\mathsf {For}_\kappa $ is a set of $\mathcal {L}_\kappa $ -formulae.Footnote ³⁵

The second-order formula $\mathsf {Sat}_\kappa (\varphi , X, Y)$ generates the set of $\mathcal {L}_\kappa $ -sentences that are true in the RU-model X with RU-domain Y. Let us adopt the following convention.

Remark 3.7 (Convention on the enumeration of inaccessible ranks)

The enumeration of V-ranks is from now on restricted to inaccessible ranks, i.e., by $V_1,$ I mean the first inaccessible rank, by $V_2,$ I mean the second inaccessible rank, …, by $V_\alpha ,$ I mean the $\alpha $ th inaccessible rank of the cumulative hierarchy, and so on.

With Remark 3.7 in place, fix a standard model $V_\alpha $ , i.e., the $\alpha $ th inaccessible rank of the cumulative hierarchy. This generates the set of sentences that are true in $V_\alpha $ .

Definition 3.8. Let $V_\alpha $ be the $\alpha $ th inaccessible rank of the cumulative hierarchy. The set of $\mathcal {L}_\kappa (\alpha )$ -truths of $V_\alpha $ , in symbols $\mathsf {Sat}_\kappa ^\alpha $ , is the extension of $\mathsf {Sat}_\kappa $ over $V_\alpha $ , defined for the restricted language $\mathcal {L}_\kappa (\alpha )$ :

$$\begin{align*}\mathsf{Sat}_\kappa^\alpha := \{ \varphi \in \mathcal{L}_\kappa(\alpha) ~ | ~ \mathsf{Sat}_\kappa(\varphi, X, V_\alpha)\}. \end{align*}$$

The set of all $\mathcal {L}_\kappa $ -truths over the plurality of all sets is just the extension of $\mathsf {Sat}$ relative to the RU-model which has $\mathsf {V}$ as its domain, i.e., the extension of $\mathsf {Sat}_\kappa ^{\mathsf {V}}(\varphi , X, \mathsf {V})$ . I also use the overline notation for complements, i.e.,

$$\begin{align*}\overline{\mathsf{Sat}_\kappa^{\mathsf{V}}} = \{ \neg\varphi ~ | ~\varphi \in \mathsf{Sat}_\kappa^{\mathsf{V}} \}, \end{align*}$$

and $\overline {\mathsf {Sat}_\kappa ^\alpha }$ is the complement of $\mathsf {Sat}_\kappa ^\alpha $ , respectively.

Fixing $\mathcal {L}_\kappa $ and assuming an unbounded sequence of inaccessible models, the above definitions produce the following sequence:

$$\begin{align*}\mathsf{Sat}_\kappa^{\mathsf{V}}, \mathsf{Sat}_\kappa^1, \mathsf{Sat}_\kappa^2, \ldots , \mathsf{Sat}_\kappa^\alpha, \ldots \end{align*}$$

where $\mathsf {Sat}_\kappa ^{\mathsf {V}}$ corresponds to the model with domain $\mathsf {V}$ , and $\mathsf {Sat}_\kappa ^\alpha $ corresponds to the model with domain $V_\alpha $ .

3.5 Combining absolutism and realism in the RU approach

Armed with the higher-order semantics and the weak Kleene satisfaction predicate, let me now define the notions of inherently absolutistic and inherently relativistic sentences of the language $\mathcal {L}_\kappa $ , i.e., the language of set theory that has a name for every set and that allows for disjunctions and quantifier sequences of length $\kappa $ .

I start with the definition of the inherently absolutistic sentences. I first give the definition and unpack it afterwards.

Definition 3.9. The absolutely general truths of $\mathcal {L}_\kappa $ , in symbols $\mathsf {Abs}_\kappa $ , are defined as follows:

$$\begin{align*}\mathsf{Abs}_\kappa := \mathsf{Sat}_\kappa^{\mathsf{V}} \setminus ( \mathsf{Sat}_\kappa^{\mathsf{V}} \cap \bigcup_{\alpha \in \Omega} \overline{\mathsf{Sat}_\kappa^\alpha} ). \end{align*}$$

The absolutely general $\mathcal {L}_\kappa $ -truths are certainly evaluated by the ‘model’ with ‘domain’ $\mathsf {V}$ , as it can be seen in the first part of the definition. However, we cannot just take these $\mathcal {L}_\kappa $ -truths, because the plurality of all sets, under certain large cardinal assumptions, validates sentences, such as ‘There are 36 inaccessible cardinals’, which are not validated by all ranks smaller than the 37th inaccessible rank of V. We need to subtract all the $\mathcal {L}_\kappa $ -sentences on which the ‘model’ with ‘domain’ $\mathsf {V}$ and all the smaller models disagree. This is done in the second part of the definition: $\bigcup _{\alpha \in \Omega } \overline {\mathsf {Sat}_\kappa ^\alpha }$ is the union of all complements of collections of $\mathcal {L}_\kappa $ -sentences that are true in some standard model. In effect, I subtract all sentences that are true in $\mathsf {V}$ , but false in one of the $V_\alpha $ ’s.

One might ask why not just take as the absolutely general truths just those sentences on which all models agree, i.e., why not define them as follows:

$$\begin{align*}\mathsf{Abs}_\kappa^* := \mathsf{Sat}_\kappa^{\mathsf{V}} \cap \bigcap_{\alpha \in \Omega} \mathsf{Sat}_\kappa^\alpha. \end{align*}$$

The answer is that this leaves open the possibility that there are sentences which are true according to $\mathsf {V}$ , but which are left undecided by all the $V_\alpha $ ’s. One may have different intuitions about whether or not to count such cases as absolutely general truths. However, I take $\mathsf {Abs}_\kappa $ and not $\mathsf {Abs}_\kappa ^*$ as the official definition.

According to the bicontextualist semantics, sentences like the examples ‘Nothing is a member of the empty set’ or ‘Everything is self-identical’ are contained in the collection $\mathsf {Abs}_\kappa $ . These sentences are true according to a model that has $\mathsf {V}$ as its domain. Moreover, no standard model falsifies any of these sentences.

Let me now define the inherently relativistic truths. In this case, we cannot simply identify the inherently relativistic sentences with the relativistic truths of some standard model, since this would only give us a subcollection of the inherently relativistic sentences. Thus, I start with the relativistic truths of some $V_\alpha $ , i.e., the sentences that are true in a standard model by virtue of their inherently relativistic character, and give the generalization to the whole collection of inherently relativistic truths afterwards.

Definition 3.10. The relativistic $\mathcal {L}_\kappa (\alpha )$ -truths of $V_\alpha $ , in symbols $\mathsf {Rel}_\kappa ^\alpha $ , are defined as follows:

$$\begin{align*}\mathsf{Rel}_\kappa^\alpha := \mathsf{Sat}_\kappa^\alpha \setminus \mathsf{Abs}_\kappa. \end{align*}$$

$\mathsf {Rel}_\kappa ^\alpha $ contains sentences of $\mathcal {L}_\kappa $ which are true in the standard model $V_\alpha $ (the $\alpha $ th inaccessible rank of the cumulative hierarchy), but which are not inherently absolutistic. Consider, for example, the sentence ‘There are 36 inaccessible cardinals’. This sentence is true according to the ‘model’ that has $\mathsf {V}$ as its ‘domain’ (under appropriate large cardinal assumptions), and hence it is in $\mathsf {Sat}^{\mathsf {V}}_\kappa $ . However, it is not contained in any $\mathsf {Sat}^n_\kappa $ for $1 \leq n \leq 36$ , and so it is in, say, $\overline {\mathsf {Sat}_\kappa ^{15}}$ . Consequently, by the definition of inherently absolutistic truths, it is not in $\mathsf {Abs}_\kappa $ . But from $V_{37}$ on, the sentence becomes true, and so it is in $\mathsf {Sat}^\alpha _\kappa $ and also in $\mathsf {Rel}^\alpha _\kappa $ for $\alpha> 36$ . So we have, for every $V_\alpha $ , the inherently relativistic truths of $V_\alpha $ . To get the inherently relativistic truths simpliciter, we take the union of all the relativizations to standard models.

Definition 3.11. The inherently relativistic truths of $\mathcal {L}_\kappa $ , in symbols $\mathsf {Rel}_\kappa $ , are the union of all relativizations of inherently relativistic truths to standard models of the form $V_\alpha $ :

$$\begin{align*}\mathsf{Rel}_\kappa := \bigcup_{\alpha \in \Omega} \mathsf{Rel}^\alpha_\kappa. \end{align*}$$

For example, $\mathsf {Rel}_\kappa $ contains all large cardinal sentences of the form ‘There are $\alpha $ inaccessible cardinals’ as discussed above, since they all have countermodels and enter $\mathsf {Rel}^\beta _\kappa $ for $\beta> \alpha $ . On this approach, then, large cardinal sentences of this form are considered to be relativistic sentences.

Remark 3.12 (Bicontextualist classification of inherently absolutistic and inherently relativistic truths)

The bicontextualist semantics for the language of set theory classifies sentences as inherently absolutistic and inherently relativistic as follows:

• the inherently absolutistic truths are just the absolutely general truths of $\mathcal {L}_\kappa $ , given by $\mathsf {Abs}_\kappa $ ,
• the inherently relativistic truths of $\mathcal {L}_\kappa $ are defined as the union of all relativizations to standard models $V_\alpha $ , given by $\mathsf {Rel}_\kappa $ .

The above classification distinguishes two kinds of truths of the language of set theory: we have the inherently absolutistic truths, which express facts about sets that have no countermodels, neither in the cumulative hierarchy nor in the form of the plurality of all sets, and we have the inherently relativistic truths, which express facts about sets that can be true relative to some models and false relative to others.

My proposal is that we think of this distinction as giving each $\mathcal {L}_\kappa $ -sentence its intended interpretation. So, for example, the sentence $\forall x (x=x)$ contained in $\mathsf {Abs}_\kappa $ , which is taken to be inherently absolutistic, has as its intended interpretation the plurality of all sets. On the other hand, all sentences contained in one of the collections $\mathsf {Rel}^\alpha _\kappa $ always have only restricted quantifier ranges and are therefore to be inherently relativistic. We can approach inherently relativistic sentences in a coarse-grained way, and say that all sentences in $\mathsf {Rel}_\kappa $ are inherently relativistic, or we can approach them in a finer-grained way and consider the relativizations of relativistic truths to standard models as given by the $\mathsf {Rel}^\alpha _\kappa $ ’s. On this approach, the natural relativisation of a relativistic truth is given by the first model that makes it true. So the natural relativisation of ‘there are 36 inaccessible cardinals’ is given by the 37th standard model $V_{37}$ . In any case, the intended interpretation for inherently relativistic sentences is not given by a model whose domain is the plurality of all sets.

3.6 A bicontextualist notion of logical consequence

Having defined the sequence of $\mathsf {Rel}_\kappa ^\alpha $ and $\mathsf {Abs}_\kappa $ , we can define a bicontextualist notion of consequence that respects intended interpretations. Note that here, we focus on standard models and consider the sequence of $\mathsf {Rel}^\alpha _\kappa $ ’s rather than $\mathsf {Rel}_\kappa $ .

Definition 3.13. Let $V_\alpha $ and $\mathcal {L}_\kappa (\alpha )$ be as above (for $\alpha \leq \kappa $ ), and let $\{ \Gamma , \varphi \}$ be a set of $\mathcal {L}_\kappa (\kappa )$ -sentences. The fact that the RU-model with RU-domain $V_\alpha $ is a model of $\varphi $ , in symbols $V_\kappa \models _\kappa ^\alpha \varphi $ , is defined as follows:

$$\begin{align*}V_\alpha \models_\kappa^\alpha \varphi \text{ if and only if } \varphi \in \mathsf{Rel}_\kappa^\alpha \cup \mathsf{Abs}_\kappa. \end{align*}$$

The fact that the argument from $\Gamma $ to $\varphi $ is bicontextually valid, in symbols $\Gamma \models _\kappa ^\alpha \varphi $ , is defined as follows:

$$\begin{align*}\Gamma \models_\kappa^\alpha \varphi \text{ if and only if, if all sentences in } \Gamma \text{ are in } \mathsf{Rel}_\kappa^\alpha \cup \mathsf{Abs}_\kappa \text{, so is } \varphi. \end{align*}$$

It is important that the $\models _\kappa ^\alpha $ -relation is defined for $\mathcal {L}_\kappa (\alpha )$ , i.e., the restriction of $\mathcal {L}_\kappa $ to $V_\alpha $ . The reason for this is that otherwise, sentences from larger languages could get into $\mathsf {Rel}_\kappa ^\alpha \cup \mathsf {Abs}_\kappa $ via $\mathsf {Abs}_\kappa $ . For example, if $\varphi $ is a sentence in $\mathcal {L}_\kappa $ but not in $\mathcal {L}_\kappa (\alpha )$ that is in $\mathsf {Abs}_\kappa $ —i.e., an absolutely unrestricted truth that is validated by some $V_\beta $ for $\beta> \alpha $ , remains true from there on, but was undecided by all ranks below $V_\beta $ —we would end up in the absurd situation where $V_\alpha $ validates sentences about objects outside $V_\alpha $ . To get a bicontextualist notion of logical consequence that respects intended interpretations and is not restricted only to some standard model but considers the plurality of all sets, just replace $\mathsf {Rel}^\alpha _\kappa $ by $\mathsf {Rel}_\kappa $ in Definition 3.13.

4 Set-theoretic bicontextualist treatment of the paradoxes

Let me now explain how the bicontextualist semantics treats set-theoretic paradoxes. I will focus on Russell’s paradox because other paradoxes, such as the Burali-Forti paradox, are treated in a similar way. Consider some rank $V_\alpha $ of the cumulative hierarchy and the sentence

$$\begin{align*}\exists x \forall y (y \in x \leftrightarrow y \notin y). \end{align*}$$

Since there is no such set x in $V_\alpha $ , this sentence is false in the sense of both $\models $ and of $\models _\kappa ^\alpha $ . This is true for any model in the cumulative hierarchy, and also for the absolutist model. However, such a sentence corresponds to an application of the naive comprehension scheme, and not, as it is now standard, to an application of the separation scheme. Nevertheless, it shows that there is no universal Russell set, i.e., there is no set containing all and only the non-self-membered sets. This is also true according to set-theoretic bicontextualism, i.e., according to $\models _\kappa ^\alpha $ . So we get the following lemma.

Lemma 4.1. There is no universal Russell set, i.e., no set R such that

$$\begin{align*}\forall x (x \in R \leftrightarrow x \notin x). \end{align*}$$

Proof. If there were an R such that $\forall x (x \in R \leftrightarrow x \notin x)$ , then $R \in R \leftrightarrow R \notin R$ . Contradiction.

Now consider the case of separation. Instead of considering a sentence postulating the existence of a universal Russell set as above, let’s consider the case where the universal quantifier is syntactically restricted to the model $V_\alpha $ :Footnote ³⁶

$$\begin{align*}\rho_{V_\alpha} := \exists x \forall y (y \in x \leftrightarrow y \in V_\alpha \land y \notin y). \end{align*}$$

There is no set of all non-self-membered sets of $V_\alpha $ inside $V_\alpha $ , so in the classical sense $V_\alpha \not \models \rho _{V_\alpha }$ . However, since $R_\alpha $ is a subset of $V_\alpha $ , and thus an element of the powerset of $V_\alpha $ , $R_\alpha $ is in $V_{\alpha +1}$ and so $V_{\alpha +1} \models \rho _{V_\alpha }$ . Again, if we ask for a Russell set of $V_{\alpha +1}$ , we get similar results: $V_{\alpha +1} \not \models \rho _{V_{\alpha +1}}$ but $V_{\alpha +2} \models \rho _{V_{\alpha +1}}$ , and so on, all the way up the hierarchy.

Now consider the bicontextualist notion of logical consequence $\models _\kappa ^\alpha $ . Since $\mathcal {L}_\kappa (\alpha )$ has no term for $V_\alpha $ , the sentence $\rho _{V_\alpha }$ remains undecided over $V_\alpha $ . In the bicontextualist semantics, this is implemented by the fact that the truth value of $\rho _{V_\alpha }$ in $V_\alpha $ is ${1}/{2}$ . At the next stage, $V_{\alpha +1}$ , the sentence becomes true, and so $V_{\alpha +1} \models _\kappa ^\alpha \rho _{V_\alpha }$ . But again, $\rho _{V_{\alpha +1}}$ has truth value ${1}/{2}$ and becomes true only from $V_{\alpha +2}$ on, in the sense described in §2.

Note, however, that there is a slight difference between the relativistic treatment of the paradoxes in the standard setting and the one presented here: while according to $\models $ , the existence of a Russell set relative to $V_\alpha $ is strictly false in $V_\alpha $ , according to $\models _\kappa ^\alpha $ , the corresponding sentence is undecided in $V_\alpha $ . All Russell sentences of this form remain undecided until a point is reached where they are declared true, and then they remain true from that point on. As a consequence of that, the fact that every set has a Russell set is true in the absolute sense. This allows me to prove the following claim.

Lemma 4.2. It is an absolutely general truth that every set has a Russell set, i.e., for every set A, there is a Russell sentence

$$\begin{align*}\rho_A := \exists x \forall y (y \in x \leftrightarrow y \in A \land y \notin y) \end{align*}$$

such that $\rho _A \in \mathsf {Abs}_\kappa $ .

Proof. Let A be a set, $\alpha $ be the smallest ordinal such that $A \in V_\alpha $ , and $\kappa> \alpha $ be inaccessible. Then, $V_\kappa \models \mathsf {ZFC}$ , and so by the axiom of separation, there is a set ${R_A = \{ x \in R_A ~ | ~ x \notin x \}}$ . Since $R_A \subseteq A$ , and since $V_\alpha $ is transitive, $R_A \in V_\alpha $ , so $V_\kappa \models \rho _A$ . Since only languages of the form $\mathcal {L}(\kappa )$ have names for A, the truth value of $\rho _A$ is ${1}/{2}$ in all $V_\lambda $ for $\lambda $ inaccessible $< \kappa $ , and from $V_\kappa $ on, the truth value of $\rho _A$ is $1$ . Moreover, $R_A$ is a set, and hence it is among the sets, i.e., $R_A \preceq \mathsf {V}$ . Hence, $\rho _A \in \mathsf {Sat}^{\mathsf {V}}_\kappa $ . This implies that $\rho _A \in \mathsf {Abs}_\kappa $ .

Note that the lemma could also be formulated without making use of the constants A and quantifying over all sets. The sentence would then be

$$\begin{align*}\forall z \exists x \forall y (y \in x \leftrightarrow y \in z \land y \notin y). \end{align*}$$

This sentence also comes out true in all strongly inaccessible ranks, as well as in the universal ‘domain’. The reason to work with constants is that, from the absolutist point of view, this is a stronger claim. It says that for every set A, the sentence “The Russell set of A exists” is an inherently absolutistic truth of bicontextualism.Footnote ³⁷

What the treatment of the paradoxes in the bicontextualist semantics shows is that sentences of the form $\rho _{V_\alpha }$ must always carry the restriction to a previously given set. But this does not rule out absolutism in general. As the semantics of the previous section suggests, unproblematic or inherently absolutistic sentences can be interpreted over a domain containing all sets without any problems whatsoever. Moreover, the relativistic treatment of the paradox is compatible with absolutism.

5 Extracting core properties

Bicontextualism for sets addresses a fundamental shortcoming of the standard semantics for set theory, namely, it allows for a truly absolutistic interpretation of the inherently absolutistic sentences of the language of set theory. In my view, an adequate set theory must be about all sets—not just some. My framework captures this requirement through the absolutistic aspect of the semantics, which ensures that, under my interpretation, set theory is truly concerned with its subject matter—the entire universe of sets. As we’ve seen, this is achieved by allowing the quantifiers of inherently absolutistic sentences to range over absolutely all sets.

The distinction between absolutistic and relativistic sentences is crucial. The bicontextualist semantics allows for an absolutely unrestricted interpretation of those sentences that are intuitively about all sets. In particular, sentences like the axiom of extensionality, which provides an identity criterion for sets, are inherently absolutistic. Such sentences cannot have a countermodel, neither in the sense of an initial segment of the cumulative hierarchy, nor in the all-inclusive ‘model’. Consequently, I suggest, inherently absolutistic claims express fundamental core properties about the set concept. In contrast, the paradoxes can be seen as placing limitations on the possible interpretation of some of the sentences in the language of set theory. More precisely, the quantifiers of the sentences occurring in the paradoxes cannot have an unrestricted quantifier range, on pain of inconsistency. Set-theoretic bicontextualism respects these limitations by restricting the quantifiers of problematic sentences.

The class $\mathsf {Abs}_\kappa $ contains absolutely general truths of the language of set theory. These sentences are characterized by two features. First, every sentence $\varphi \in \mathsf {Abs}_\kappa $ is true according to the RU model that has all sets in its ‘domain’. Second, no sentence $\varphi \in \mathsf {Abs}_\kappa $ has a counter-model in the cumulative hierarchy. Examples are the axiom of extensionality, self-identity claims, and the like. A sentence $\varphi $ which has these two features expresses fundamental properties of the set concept. Not only is it true in the all-inclusive RU model—i.e., $\varphi \in \mathsf {Sat}_{\mathsf {V}}$ —but also any level in the cumulative hierarchy large enough to decide $\varphi $ makes it true—i.e., $\varphi \notin \overline {\mathsf {Sat}_\alpha }$ for each $\alpha $ .

The question which sentences go into $\mathsf {Abs}_\kappa $ can be answered, at least partly, via absoluteness results. More precisely, $\Sigma _1$ upwards absoluteness and $\Pi _1$ downwards absoluteness generalize to the absolutist case: whenever a $\Sigma _1$ -sentence is validated by some $V_\alpha $ , it remains true not only throughout the hierarchy of $V_\alpha $ ’s, but also in the limit case where the model has as its ‘domain’ the universe $\mathsf {V}$ . Similarly, whenever a $\Pi _1$ -sentence is validated by the absolutist model $\mathsf {V}$ , it remains true in every rank of the cumulative hierarchy. Note, however, that in the case of $\Pi _1$ downwards absoluteness, we cannot use the language $\mathcal {L}$ , which has a name for every set. The reason is that if $\varphi $ is a $\Pi _1$ -sentence of $\mathcal {L}$ of the form $\forall x \psi (x)$ , where $\psi $ contains a term that some smaller language $\mathcal {L}(\alpha )$ does not contain, then $\varphi $ has truth value ${1}/{2}$ in $V_\alpha $ even though it has truth value $1$ according to $\mathsf {V}$ .

The situation is somewhat different for $\Delta _0$ -sentences, since their absoluteness depends crucially on the transitivity of the sets, and the universe of sets, taken to be a plurality, is not transitive because pluralities are flat. To see this, consider $V_3 = \{\emptyset , \{\emptyset \}, \{ \{ \emptyset \} \} \{\emptyset , \{\emptyset \} \}\},$ which is transitive. However, if we only look at $\{ \{ \emptyset \} \}$ , it is not transitive because it has an element, $\{ \emptyset \}$ , which is not a subset of $\{ \{ \emptyset \} \}$ , because $\emptyset \notin \{ \{ \emptyset \} \}$ . But of course $\{ \{ \emptyset \} \}$ is a set, and so it is in the universe of sets. This means that we cannot argue downwards from $\mathsf {V}$ by $\Delta _0$ absoluteness. What we can do, however, is to argue upwards, because whenever a $\Delta _0$ -sentence is true in some $V_\alpha $ , the set to which the quantifiers are restricted is also in $\mathsf {V}$ , and so the sentence remains true. The other direction does not hold. All of the following theorems are generalization of the $\Delta _0$ -, $\Sigma _1$ -, and $\Pi _1$ -absoluteness theorems.Footnote ³⁸

Theorem 5.1. Let $\varphi $ be a $\Delta _0$ -formula of $\mathcal {L}$ . $\varphi \in \mathsf {Sat}_\kappa ^\alpha $ for some $\alpha $ if and only if $\varphi \in \mathsf {Sat}_{\mathsf {V}}$ .

Proof. If $\varphi $ contains no quantifiers (i.e., is atomic, or of $\varphi $ is of the form $\psi \land \chi $ , $\psi \lor \chi $ , $\neg \psi $ , or $\psi \rightarrow \chi $ where both $\psi $ and $\chi $ contain no quantifiers), then $\varphi \in \mathsf {Sat}_\kappa ^\alpha $ if and only if $\varphi \in \mathsf {Sat}_\kappa ^{\mathsf {V}}$ .

Let $\varphi $ be $(\exists x \in y) \psi (x)$ , and $y \in V_\alpha $ . Assume for induction that $\psi (x) \in \mathsf {Sat}_\kappa ^\alpha $ if and only if $\psi (x) \in \mathsf {Sat}_\kappa ^{\mathsf {V}}$ .

$\Rightarrow $ : If $\varphi \in \mathsf {Sat}_\kappa ^\alpha $ , then $(\exists x \in y) \psi (x) \in \mathsf {Sat}_\kappa ^\alpha $ for some $x \in V_\alpha $ such that $x \in y$ . But then, since $V_\alpha \preceq \mathsf {V}$ , x and y are in $\mathsf {V}$ . Hence, by induction hypothesis, $(\exists x \in y) \psi (x) \in \mathsf {Sat}_\kappa ^{\mathsf {V}}$ .

$\Leftarrow $ : Assume $(\exists x \in y) \psi (x) \in \mathsf {Sat}_\kappa ^{\mathsf {V}}$ for some y in $V_\alpha $ . Then, for some $x \in y$ , $\psi (x) \in \mathsf {Sat}_\kappa ^{\mathsf {V}}$ . Since $V_\alpha $ is transitive, $x \in V_\alpha $ , and so by induction hypothesis, we get $\psi (x) \in \mathsf {Sat}_\kappa ^\alpha $ for $x \in V_\alpha $ , and hence $(\exists x \in y) \psi (x) \in \mathsf {Sat}_\kappa ^\alpha $ .

Corollary 5.2. Let $\varphi $ be a $\Delta _0$ -formula of $\mathcal {L}$ . If $\varphi \in \mathsf {Sat}_\kappa ^\alpha $ for some $\alpha $ , then $\varphi \in \mathsf {Sat}_\kappa ^{\mathsf {V}}$ .

Proof. Immediate from Theorem 5.1.

Theorem 5.3. Let $\varphi $ be of the form $\exists x \psi (x)$ , where $\psi (x)$ is a $\Delta _0$ -formula of $\mathcal {L}$ with all free variables displayed. Then $\varphi \in \mathsf {Sat}_\kappa ^\alpha $ implies $\varphi \in \mathsf {Sat}_\kappa ^\beta $ for all $\beta> \alpha $ , and $\varphi \in \mathsf {Sat}_\kappa ^{\mathsf {V}}$ .

Proof. Let $\varphi $ be $\exists x \psi (x)$ , where $\psi (x)$ is a $\Delta _0$ -formula with all free variables displayed and assume that $\varphi \in \mathsf {Sat}_\kappa ^\alpha $ , i.e., $\mathsf {Sat}_\kappa (\exists x \psi (x),X,V_\alpha )$ . Then, there is some $y \in V_\alpha $ such that $\mathsf {Sat}_\kappa (\psi (y),X,V_\alpha )$ . Then, since $\psi (y)$ is $\Delta _0$ , by Theorem 5.1, $\mathsf {Sat}_\kappa (\psi (y),X,\mathsf {V})$ , and since y is a set and therefore in $\mathsf {V}$ , $\mathsf {Sat}_\kappa (\exists x \psi (x),X,\mathsf {V})$ , i.e., $\varphi \in \mathsf {Sat}_\kappa ^{\mathsf {V}}$ .

Theorem 5.4. Let $\varphi $ be of the form $\forall x \psi (x)$ , where $\psi (x)$ is a $\Delta _0$ -formula of $\mathcal {L}(\alpha )$ with all free variables displayed. Then $\varphi \in \mathsf {Sat}_\kappa ^\alpha $ implies $\varphi \in \mathsf {Sat}_\kappa ^\beta $ for all $\beta < \alpha $ , and $\varphi \in \mathsf {Sat}_\kappa ^{\mathsf {V}}$ implies $\varphi \in \mathsf {Sat}_\kappa ^\alpha $ for all $\alpha $ .

Proof. Let $\varphi $ be $\forall x \psi (x)$ , where $\psi (x)$ is a $\Delta _0$ -formula with all free variables displayed and assume that $\varphi \in \mathsf {Sat}_\kappa ^{\mathsf {V}}$ , i.e., $\mathsf {Sat}_\kappa (\forall x \psi (x),X,\mathsf {V})$ . Then, for each $y \preceq \mathsf {V}$ , $\mathsf {Sat}_\kappa (\psi (y),X,\mathsf {V})$ . Fix any $V_\alpha $ . Since $V_\alpha \preceq \mathsf {V}$ and since $\psi (y)$ is $\Delta _0$ , by Theorem 5.1, $\mathsf {Sat}_\kappa (\psi (y),X,V_\alpha )$ for all $y \in V_\alpha $ . But then, $\mathsf {Sat}_\kappa (\forall x \psi (x),X,V_\alpha )$ , i.e., $\varphi \in \mathsf {Sat}_\kappa ^\alpha $ .

The above results can be used to get a clearer picture of which sentences are in $\mathsf {Abs}_\kappa $ : since we have $\Sigma _1$ upwards absoluteness, all $\Sigma _1$ -sentences that are true in the first inaccessible segment will remain true throughout the whole bicontextualist framework. Consequently, all $\Sigma _1$ -truths of $V_1$ are in $\mathsf {Abs}_\kappa $ . Moreover, since all $\Pi _1$ -sentences that are true in the universe remain true no matter how far down the hierarchy we go, they also have no counter-models. Consequently, all $\Pi _1$ -truths of $\mathsf {V}$ are in $\mathsf {Abs}_\kappa $ . Finally, all $\Delta _0$ -sentences that are true in any $V_\alpha $ remain true in all other $V_\beta $ ’s and in $\mathsf {V}$ , so they’re also contained in $\mathsf {Abs}_\kappa $ . So, if

$$\begin{align*}A \subseteq_{\Sigma_1} \mathsf{Sat}_\kappa^\alpha, ~~~~~~~~~~ B \subseteq_{\Pi_1} \mathsf{Sat}_\kappa^{\mathsf{V}}, ~~~~~~~~~~ C \subseteq_{\Delta_0} \bigcup_{\alpha \in \Omega} \mathsf{Sat}_\kappa^\alpha \end{align*}$$

are the sets of $\Sigma _1$ -sentences in $\mathsf {Sat}_\kappa ^\alpha $ , of $\Pi _1$ -sentences in $\mathsf {Sat}_\kappa ^{\mathsf {V}}$ , and of $\Delta _0$ -sentences from all the $\mathsf {Sat}_\kappa ^\alpha $ ’s, then

$$\begin{align*}A \cup B \cup C \subseteq \mathsf{Abs}_\kappa. \end{align*}$$

Moreover, the $\mathsf {ZFC}$ -axioms are absolutely general truths of the language of set theory. This is a corollary of the following theorem.

Theorem 5.5. The universal domain satisfies the $\mathsf {ZFC}$ axioms, i.e., for all $\varphi \in \mathsf {ZFC}$ , $\mathsf {Sat}_\kappa (\varphi ,X,\mathsf {V})$ .

The proof consists of verifying that all $\mathsf {ZFC}$ -axioms hold in $\mathsf {V}$ . I write $\mathsf {V} \models \varphi $ for $\mathsf {Sat}_\kappa (\varphi ,X,\mathsf {V})$ . Most cases can be adopted from the proof of that inaccessible ranks are models of $\mathsf {ZFC}$ . However, it is crucial to know that $\mathsf {V}$ , the universe of sets, is closed under set-of operations as encoded by $\mathsf {ZFC}$ by assumption. This follows from the definition of the cumulative hierarchy $\mathsf {V}$ .

There is another important aspect: many steps in the proof that inaccessible ranks are models of $\mathsf {ZFC}$ start by showing that if a particular set or sets are contained in some $V_\kappa $ , then other sets are contained in $V_\kappa $ as well. For instance, if $x\in V_\kappa $ , then so is the powerset of x, its unionset, etc. Then, the proof proceeds by using $\Delta _0$ -absoluteness to show that not only is the particular set contained in $V_\kappa $ , but $V_\kappa $ also thinks that it contains the particular set, and therefore satisfies the axiom in question. This last step is crucial. It relies on the background assumption that the particular set we’re looking for is contained in the background universe which, by assumption, has all the sets, is a cumulative hierarchy, and therefore is always right about what sets there are and how they are structured. Then, since the definition of these sets is $\Delta _0$ , it follows that the truth of the particular axioms in the background universe carries over to the $V_\kappa $ . Theorem 5.5 operates directly on the background universe. This just makes explicit what is usually taken to be implicit in the proof. However, this has the direct implication that the second step—using $\Delta _0$ -absoluteness to show that the $V_\kappa $ satisfies the respective axiom—is not needed anymore. Therefore, most steps of the proof aim to show that if some set is given, then its powerset, unionset, etc. is also a set. This implies that they are contained in the background universe, and so the background universe, which is always right about what sets there are and how they are structured, satisfies the respective axioms. Here is the proof.

Proof.

Extensionality Let $x, y \preceq \mathsf {V}$ be distinct. Then, there is a set $z \in x \land z \notin y$ (or vice versa), and since z is a set, $z \preceq \mathsf {V}$ . Then,
$$\begin{align*}\mathsf{V} \models \forall x \forall y (x \neq y \rightarrow \neg \forall u (u \in x \leftrightarrow u \in y)), \end{align*}$$
which is equivalent to the axiom of extensionality.
Separation Let $\varphi (y)$ be a first-order formula such that $F = \{ y \mid \varphi (y) \}$ . Let x be a set. Then the intersection $F \cap x = \{ y \in x \mid \varphi (y) \}$ exists by first-order Separation. Hence, $F \cap x$ is a set and therefore $F \cap x \preceq \mathsf {V}$ .
Pairing Let $x,y$ be sets. Since $\mathsf {V}$ is closed under set-of operations, $a = \{x,y\}$ is a set and therefore in $\mathsf {V}$ .
Union Let x be a set. Then, all $y \in x$ are sets as well, and $z = \{y ~ | ~ y \in x\} = \cup x$ is also a set and therefore contained in $\mathsf {V}$ .
Powerset If x is a set, then each subset $y \subseteq x$ is a set, and so is their collection $\mathcal {P}(x)$ .
Infinity $V_\omega \preceq \mathsf {V}$ .
Foundation The set-theoretic universe is well-founded by assumption.
Replacement Let $\psi (u,v)$ be a first-order formula such that $G = \{ \langle u, v \rangle \mid \psi (u,v) \}$ and suppose that $\psi $ defines a functional relation on x, i.e., for every $u \in x,$ there exists a unique v such that $\psi (u,v)$ . Then, by the first-order Replacement schema, the image set $y = \{ v \mid \exists u \in x ~ \psi (u,v) \}$ exists and is therefore a set. Hence, $G[x] \preceq \mathsf {V}$ .
Choice The universe of sets is well-orderable by assumption.

Corollary 5.6. The $\mathsf {ZFC}$ axioms are absolutely general truths of the language of set theory.

Proof. Each inaccessible rank of $\mathsf {V}$ is a $\mathsf {ZFC}$ -model. Moreover, by Theorem 5.5, the $\mathsf {ZFC}$ -axioms are in $\mathsf {Sat}_\kappa ^{\mathsf {V}}$ . Consequently, by Definition 3.9, the $\mathsf {ZFC}$ -axioms are in $\mathsf {Abs}_\kappa $ .

6 Objections and replies

Let me now consider two potential objections against the semantics for set theory developed above. The first one is based on the observation that forms of absolutism can be recovered in $\mathsf {ZFC}$ via reflection principles or via inner models. The second is that set-theoretic bicontextualism is too revisionary and might therefore be anti-naturalistic.

6.1 Reflection principles and inner models

A first objection to bicontextualism for sets is that absolutism in set theory can be achieved via reflection principles or via inner models. Therefore, an extra absolutist interpretation is redundant, since truths about all sets can be extracted from reflection or inner models.

The main idea of reflection principles is the following: the cumulative hierarchy is so complex that it cannot be characterized by any formula $\varphi $ of the language of set theory. The reason is that any such formula that is true in the universe of all sets is already true in some initial segment $V_\alpha $ . And so any attempt to characterize the cumulative hierarchy as the collection of all things that satisfy $\varphi $ fails, because there is always a $V_\alpha $ such that $\varphi $ characterizes $V_\alpha $ .

For example, $\mathsf {V}$ is closed under replacement and powerset, but there is some $V_\kappa $ , for $\kappa $ an inaccessible cardinal, which is already closed under replacement and powerset, and hence, closure under replacement and powerset is not a unique characterization of $\mathsf {V}$ . Gödel therefore concludes that “[t]he universe of all sets is structurally indefinable”.Footnote ³⁹ The truth of any such description is already reflected by some rank.

Since reflection principles are usually presented as biconditionals, they work both ways:Footnote ⁴⁰ not only are truths about all sets reflected by initial segments, but also facts about the initial segments hold in the whole hierarchy. In this way, we can extract facts about the cumulative hierarchy, i.e., facts where the quantifiers can be taken to be unrestricted, from facts about initial segments.

Similarly, one could argue that absolutism can be mimicked by an inner model. Starting from the language of set theory $\mathsf {ZFC}$ , and a standard model $V_\kappa $ of $\mathsf {ZFC}$ , an inner model is a definable transitive class $\mathcal {N}$ in $V_\kappa $ , such that $\in ^{\mathcal {N}} = \in ^{V_\kappa } \restriction dom(\mathcal {N})^2$ , $\mathcal {N} \models \mathsf {ZFC}$ , and where $\mathcal {N}$ contains all the ordinals of $V_\kappa $ . This last fact is crucial: Since inner models contain all ordinals, it is—depending on which inner model exactly we consider—at least close to absolutism. But if this is the case, then inner models at least allow us to mimic absolutism.

However, reflection principles and inner models do not quite provide what the bicontextualist wants, i.e., a maximally general interpretation of the set-theoretic quantifiers in the case of unproblematic sentences. Moreover, neither reflection principles nor inner models show that paradoxes do not force us into relativism. So even if absolutism can be achieved by such techniques, this does not mean that bicontextualism can be achieved by reflection or inner models.Footnote ⁴¹

6.2 Anti-naturalism

Another line of criticism might arise from a naturalistically minded philosopher, and might look roughly as follows: Mathematicians work with $\mathsf {ZFC}$ (or some extensions thereof with large cardinal or forcing axioms), but they do not use bicontextualist semantics, and so neither should the philosopher.

I believe that, in the end, such criticism is misguided. There is no doubt that $\mathsf {ZFC}$ is the widely accepted canonical axiomatization of set theory, and I neither intend nor have the ability to change this. However, it is important to clarify what my proposal is, and what it is not.

Bicontextualism, at its core, classifies sentences in the language of set theory as either inherently absolutistic or inherently relativistic. The debate between absolutists and relativists is primarily a philosophical one, and my main goal is to contribute to this discussion. My use of languages with names for every set and the RU approach should be seen as methodological tools in pursuit of this philosophical aim.

Since the standard first-order language of set theory $\mathcal {L}_\in $ , which includes only $\in $ as a non-logical constant, is a sublanguage of my all-inclusive language $\mathcal {L}$ , which contains a name to every set, we can easily recover the inherently absolutistic $\mathcal {L}_\in $ -sentences from the inherently absolutistic $\mathcal {L}$ -sentences. In essence, I am using a distinct methodological tool to achieve a philosophical goal, rather than challenging the mathematical canon. There is nothing anti-naturalistic about this approach.

What I explicitly do not intend is to settle inherently mathematical and set-theoretical questions such as ‘What sets are there?’. This question can be reformulated as ‘What is the ultimate large cardinal axiom to adopt?’. Answering this question is a task for set theorists. If, however, set theorists settle on a particular axiomatization, say $\mathsf {ZFC+V=UltL}$ , then my approach can still be applied to identify the inherently absolutistic and the inherently relativistic sentences.

6.3 Isn’t that just generality absolutism?

The main claim of absolutism is that for a given language $\mathcal {L}$ , there is an interpretation according to which the $\mathcal {L}$ -quantifiers range over absolutely everything. Absolutists do not claim, however, that the $\mathcal {L}$ -quantifiers always range over absolutely everything, but only that they sometimes range over absolutely everything. Since bicontextualism also sometimes allows unrestricted quantification, a natural objection to bicontextualism is that it is, after all, just a version of generality absolutism. This, however, is not correct.

Even though one might paraphrase absolutism and bicontextualism as the claim that quantifiers sometimes have an all-inclusive quantifier domain, the interpretation of sometimes is a different one: according to absolutism, sometimes means, that there is one $\mathcal {L}$ -interpretation according to which all $\mathcal {L}$ -quantifiers range over an unrestricted domain. According to bicontextualism, sometimes means that only a special class of $\mathcal {L}$ -sentences is adequate for an absolutistic interpretation.

This explains why bicontextualism really is an intermediate position: according to relativism, no sentence can ever achieve absolute generality, and according to absolutism, all sentences can sometimes achieve absolute generality. According to bicontextualism, however, some sentences can never achieve absolute generality (relativistic aspect), while other sentences can sometimes achieve absolute generality (absolutistic aspect). In this sense, bicontextualism is neither fully absolutistic, nor fully relativistic.

7 Conclusion

As noted above, relativism is a strategy for avoiding set-theoretic paradoxes. Dummett has argued that paradoxes prove absolutism wrong. But the main lesson of this paper is that paradoxes do not force us into strict relativism, since absolutism can be combined with a relativistic treatment of paradoxes. A more cautious assessment shows that paradoxes only force us into relativism only with respect to sentences that are crucial to the paradoxes. Unproblematic sentences can be interpreted absolutistically even in the light of the paradoxes.

The development of a semantic framework that implements such an intermediate position between absolutism and relativism provides us with a semantic criterion for distinguishing between inherently absolutistic and inherently relativistic sentences. This shows that some sentences of the language of set theory are most naturally interpreted as expressing facts about only some sets, while other sentences are most naturally interpreted as expressing facts about all sets.

Moreover, a partly absolutist, partly relativist semantics such as bicontextualism for sets can be used to extract core properties of the set concept. The semantics can be used to isolate a class of sentences of the language of set theory with the special status of being true by the model containing all sets in its domain, and that are not satisfied by any standard model in the cumulative hierarchy.

A Infinitary logic

This brief summary of infinitary logic is not intended to be a complete presentation.Footnote ⁴² I start with a first-order language $\mathcal {L}$ and show how to extend its syntax and semantics to get an infinitary language $\mathcal {L}_{\kappa \lambda }$ , which allows disjunctions of length $< \kappa $ and quantifier sequences of length $< \lambda $ .

The first step is the syntax: we extend $\mathcal {L}$ by a set of variables, $\mathsf {Var}_{\kappa \lambda }$ of cardinality $\kappa $ , and a logical operator $\bigvee $ for infinite disjunction. The atomic formulae of $\mathcal {L}_{\kappa \lambda }$ are just the atomic formulae of $\mathcal {L}$ .

As for complex formulas, we add an extra clause to the recursive definition, saying that whenever $\Phi $ is a set of $\mathcal {L}_{\kappa \lambda }$ -formulas such that $|\Phi | < \kappa $ , then $\bigvee \Phi $ is a $\mathcal {L}_{\kappa \lambda }$ -formula. $\bigvee \Phi $ has the form $\varphi _1 \lor \varphi _2 \lor \ldots \lor \varphi _\alpha \lor \ldots $ for all $\varphi _\alpha \in \Phi $ , and we can also write $\bigvee _{\alpha \in \beta } \varphi _\alpha $ (assuming that $|\Phi | = \beta $ .)

We add a similar clause for the quantifier case: for X a set of $\mathcal {L}_{\kappa \lambda }$ -variables such that $|X| < \lambda $ , $\exists X \varphi $ is a formula. We can also write $(\exists x_\alpha )_{\alpha < \beta } \varphi $ , provided that $|X|=\beta $ . Infinite conjunctions and infinite universal quantification are defined according to the usual conventions:

• $\bigwedge \Phi := \neg \bigvee \{ \neg \varphi : \varphi \in \Phi \}$ ;
• $\forall X \varphi := \neg \exists X \neg \varphi $ .

In this characterization, standard first-order logic is just the language that allows only finite disjunctions and finite sequences of quantifiers. In the above terminology, we can write $\mathcal {L}$ as $\mathcal {L}_{\omega \omega }$ .

Now for the semantics. An $\mathcal {L}_{\kappa \lambda }$ -structure is just an $\mathcal {L}$ structure, and we just need to add the following clauses corresponding to infinite disjunction and infinite existential quantification to the standard first-order definition of satisfaction, where $\mathcal {M}$ is an $\mathcal {L}$ -structure and $\sigma $ is a variable assignment relative to $\mathcal {M}$ :

• $\mathcal {M},\sigma \models \bigvee \Phi $ iff $|\Phi | < \kappa $ and for some $\varphi _\alpha \in \Phi $ , $\mathcal {M},\sigma \models \varphi _\alpha $ ,
• $\mathcal {M},\sigma \models \exists X \varphi (X)$ iff there is an X-variant $\sigma _0$ such that $\mathcal {M},\sigma _0 \models \varphi (X)$ .

Validity and logical truths are defined as usual.

However, there are some restrictions on the cardinals $\kappa $ and $\lambda $ :

• $\lambda \leq \kappa $ : if $\lambda> \kappa $ , there would be $\mathcal {L}_{\kappa \lambda }$ -formulas with at most $\kappa $ -many free variables, but with $\lambda $ -many quantifiers, most of which would have no free variables to bind.
• $\kappa $ must be a regular cardinal. If $\kappa $ is singular, the languages $\mathcal {L}_{\kappa \lambda }$ and $\mathcal {L}_{\kappa ^{+} \lambda }$ have the same expressive power in the sense that an $\mathcal {L}_{\kappa ^{+} \lambda }$ -sentence $\varphi $ can always be converted into an $\mathcal {L}_{\kappa \lambda }$ -sentence $\varphi ^*$ with the same meaning. However, this is not a requirement for $\lambda $ because there is a difference in meaning between ‘there are at least $\lambda $ -many’, and ‘there are at least $\lambda ^{+}$ -many’, even if $\lambda $ is singular.Footnote ⁴³

To give just a very brief idea of the power of infinitary languages, consider how such languages can uniquely characterize important mathematical notions such as finiteness. This can be done by a single $\mathcal {L}_{\omega _1, \omega }$ -sentence, the language allowing countably infinite disjunctions and only finite sequences of quantifiers:

$$\begin{align*}\bigvee _{n < \omega} \exists x_1, \ldots , \exists x_n \forall y (y = x_1 \lor \ldots \lor y=x_n). \end{align*}$$

We can use them to uniquely characterize important mathematical structures such as the natural numbers. The first-order models isomorphic to $\mathbb {N}$ are just the class of models of the $\mathcal {L}_{\omega _1 \omega }$ -sentence

$$\begin{align*}\mathsf{PA}^- \land \forall x \bigvee_{n < \omega} x = succ^n(0), \end{align*}$$

where $\mathsf {PA}^-$ is $\mathsf {PA}$ without the induction scheme, and $succ^n(0)$ is the result of applying the successor operations n-times to $0$ .Footnote ⁴⁴

B Characterization of $\mathcal {H}(\kappa )$ in $\mathcal {L}_{\kappa ^{+} \kappa ^{+}}$

We can uniquely characterize the structure $\mathcal {H}(\kappa )$ , the class of sets hereditarily of power $\leq \kappa $ , in $\mathcal {L}_{\kappa ^{+} \kappa ^{+}}$ . To do this, I will first define the $\mathcal {H}(\kappa )$ ’s. Then, I will show how to properly characterise the notion of well-foundedness in $\mathcal {L}_{\omega _1 \omega _1}$ . Finally, I show that any structure $\mathcal {M}$ on which $\in $ is well-founded (i.e., which satisfies the $\mathcal {L}_{\omega _1 \omega _1}$ -characterization of well-foundedness) and which satisfies the $\mathcal {L}_{\kappa ^{+} \kappa ^{+}}$ -sentences, which intuitively says that all sets are at most of power $\kappa $ and that the domain consists of transitive sets only, is isomorphic to $\mathcal {H}(\kappa )$ .

Let’s start with the definition of the $\mathcal {H}(\kappa )$ ’s: recall that a set X is transitive if $x \in X$ implies $x \subseteq X$ . The transitive closure of a set X, $TC(X)$ , is defined as the intersection of the set of all transitive supersets of X, i.e.,

$$\begin{align*}TC(X) := \bigcap \{ Y : X \subseteq Y \text{ and } Y \text{ is transitive} \}. \end{align*}$$

Definition B.1 (Sets Hereditarily of Power $\leq \kappa $ )

For any cardinal $\kappa $ , the set of all sets hereditarily of cardinality less than $\kappa $ , $H(\kappa )$ , is defined as follows:

$$\begin{align*}H(\kappa) := \{ X : |TC(X)| < \kappa \}. \end{align*}$$

If $\mathsf {GCH}$ holds, then for all ordinals $\alpha $ , $H(\alpha ) = V_\alpha $ . However, if $\mathsf {GCH}$ does not hold, for $\kappa> \omega ,$ the cardinality of $H(\kappa )$ is $2^{<\kappa }$ , while the cardinality of $V_\kappa = \beth _\kappa $ . However, $H(\kappa )=V_\kappa $ whenever $\beth _\kappa = \kappa $ , i.e., for all fixed points of the $\beth $ -function.Footnote ⁴⁵ But the $\beth $ -function has arbitrarily large fixed points: if $\kappa $ is strongly inaccessible, $\kappa = \beth _\kappa $ .Footnote ⁴⁶ So, after all, $H(\kappa ) = V_\kappa $ whenever $\kappa $ is strongly inaccessible, independent of $\mathsf {GCH}$ .

The above definition gives the class of sets hereditarily of power $\leq \kappa $ . We can now define the corresponding structure.

Definition B.2. Let $H(\kappa )$ be the set of all sets hereditarily of cardinality less than $\kappa $ . $\mathcal {H}(\kappa )$ is defined as the structure having $H(\kappa )$ as its domain and restricting $\in $ to $H(\kappa )$ , formally

$$\begin{align*}\mathcal{H}(\kappa) := \langle H(\kappa), \in \cap (H(\kappa) \times H(\kappa)) \rangle. \end{align*}$$

Next we define the notion of well-foundedness in $\mathcal {L}_{\omega _1 \omega _1}$ . Recall that a relation E on W is well-founded iff every non-empty subset $W_0 \subseteq W$ has an E-least element. More precisely, we say that E is well-founded on E, i.e., for every $\emptyset \neq W_0 \subseteq W$ , there exists an $x \in W_0$ such that for all $y \in W_0 : \neg y E x$ . Alternatively, well-foundedness can be characterized by the absence of infinite descending chains.

Lemma B.3. Let W be a set and E be a relation of W. Then, E is well-founded on W if and only if there is no infinite descending E-chain of elements of W.

Proof. $\Rightarrow $ : Let E be well-founded on W and suppose, for reductio, that $\langle w_n \rangle _{n \in \omega }$ is an infinite descending E-chain in W, i.e., that all tuples of the form $w_n E w_{n+1}$ are in the extension of E. Let $S = \{w_0, w_1, w_2, \ldots \}$ and assume that $w_i$ is the E-least element in S, i.e., there are no $w_j \in S$ such that $w_j E w_i$ . This element exists because E is well-founded. But then, there is $w_{i+1}$ such that $w_{i+1} E w_i$ . Contradiction.

$\Leftarrow $ : We prove the contrapositive. Suppose E is not well-founded in W. Then, there exists $\emptyset \neq S \subseteq W$ with no E-minimal element. By a weak form of the axiom of choice, we can construct a sequence $\langle w_n \rangle _{n \in \omega }$ such that $\forall n \in \mathbb {N} : w_{n+1} E w_n$ . This is an infinitely descending E-chain.

In $\mathsf {ZFC}$ , the axiom of foundation expresses as much of the well-foundedness of $\in $ as is possible in first-order logic. However, due to the compactness of first-order logic, there are $\mathsf {ZFC}$ -models in which $\in $ is not well-founded. This is mainly due to the inability of first-order logic to exclude infinitely descending $\in $ -chains. All first-order logic can do is to rule out first-order definable infinite descending $\in $ -chains such as $x \in x$ (which gives rise to $x \in x \in x \in x \ldots $ ) and $x_n \in x_{n-1} \ldots \in x_0 \in x_n$ .Footnote ⁴⁷

By the above lemma, it suffices to exclude infinitely descending $\in $ -chains in order to make $\in $ well-founded. This can be done in $\mathcal {L}_{\omega _1 \omega _1}$ by the following sentence:

$$\begin{align*}\varphi_1 := \neg (\exists x_n)_{n < \omega} ( \bigwedge_{n < \omega} x_{n+1} \in x_n). \end{align*}$$

Lemma B.4. $\mathcal {M} \models \varphi _1$ iff $\in $ is well-founded on M.

Proof. If $\mathcal {M} \models \varphi _1$ , then it does not have infinitely descending $\in $ -chains. Thus, by Lemma B.3, $\in $ is well-founded on M. If $\in $ is well-founded on M, then every $\emptyset \neq R \subseteq M$ has a $\in $ -least element. Hence, by Lemma B.3, there are no infinitely descending $\in $ -chains on M, and so $\mathcal {M} \models \varphi _1$ .

Next we add $\mathcal {L}_{\kappa ^{+} \kappa ^{+}}$ -sentences which express that the elements of the domain are subsets of the domain and give an upper bound on the cardinality of the subsets of the domain, as well as the axiom of extensionality.

Let’s start with extensionality, which is a first-order sentence:

$$\begin{align*}\mathsf{Ext} := \forall x \forall y (\forall z (z \in x \leftrightarrow z \in y) \rightarrow x = y). \end{align*}$$

Next are the $\mathcal {L}_{\kappa ^{+} \kappa ^{+}}$ -sentences, first the sentence expressing that the elements of the domain are subsets of the domain:

$$\begin{align*}\varphi_2 := (\forall x_\alpha)_{\alpha < \kappa} \exists y \forall z (z \in y \leftrightarrow \bigvee_{\alpha < \kappa} (z = x_\alpha)), \end{align*}$$

and the upper bound on cardinality:

$$\begin{align*}\varphi_3 := \forall y [ \exists z (z \in y) \rightarrow (\exists x_\alpha)_{\alpha < \kappa} \forall z (z \in y \leftrightarrow \bigvee_{\alpha < \kappa} (z = x_\alpha))]. \end{align*}$$

Now I can give the axiomatization of set theory in infinitary logic.

Definition B.5 ( $\mathsf {ZFC}_\kappa $ )

The theory $\mathsf {ZFC}_\kappa $ consists of the axioms of extensionality, pairing, union, infinity, power set, and choice, the axiom schemata of separation and replacement, $\varphi _1$ (the infinitary version of foundation), as well as the axioms $\varphi _2$ and $\varphi _3$ characterizing the size of $\mathsf {ZFC}_\kappa $ -models.

Finally, here is the characterization of $H(\kappa )$ in $\mathcal {L}_{\kappa ^{+} \kappa ^{+}}$ .

Theorem B.6 Hanf & Scott [Reference Hanf and Scott20]

Let $\mathsf {Ext} \land \varphi _1 \land \varphi _2 \land \varphi _3$ be a $\mathcal {L}_{\kappa ^{+} \kappa ^{+}}$ -sentence, and $\mathcal {M} = \langle M, E \rangle $ be an $\mathcal {L}_{\kappa ^{+} \kappa ^{+}}$ -structure, where E interprets $\in $ in $\mathsf {Ext} \land \varphi _1 \land \varphi _2 \land \varphi _3$ . Then

$$\begin{align*}\mathcal{M} \models \mathsf{Ext} \land \varphi_1 \land \varphi_2 \land \varphi_3 \text{ if and only if } \mathcal{M} \cong \mathcal{H}(\kappa). \end{align*}$$

Proof. We follow Karp [Reference Karp25].

$\Leftarrow $ : It suffices to show that $\mathcal {H}(\kappa ) \models \mathsf {Ext} \land \varphi _1 \land \varphi _2 \land \varphi _3$ . $H(\kappa )$ is transitive, so $\mathsf {Ext}$ and $\varphi _2$ hold in $\mathcal {H}(\kappa )$ ; it is well-founded, so $\varphi _1$ holds in $\mathcal {H}(\kappa )$ ; and all elements are of power $< \kappa $ , so $\varphi _3$ holds in $\mathcal {H}(\kappa )$ .

$\Rightarrow $ : The proof is done in two steps.

1. If $\mathcal {M} \models \mathsf {Ext} \land \varphi _1$ , then $\mathcal {M}$ is isomorphic to a structure $\mathcal {N} = \langle N, \in \rangle $ , where N is a transitive set.Footnote ⁴⁸
2. If $\mathcal {N}$ is a transitive model in which $\varphi _2$ and $\varphi _3$ hold, then $N \subseteq H(\kappa )$ and ${H(\kappa ) \subseteq N}$ .

1.: Since $\mathcal {M} \models \varphi _1$ , E is well-founded on M, and so there exists an E-minimal element $m_0 \in M$ . Since $\mathcal {M} \models \mathsf {Ext}$ , $m_0$ is unique. Call it $0^E$ . We need the following lemma.

Lemma B.7. Induction holds for E in $\mathcal {M} = \langle M, E \rangle $ , i.e., for any property $\psi $ , if

1. $\psi (0^E)$ and,
2. if $x \in M$ and $\psi (y)$ for all $y E x$ , then $\psi (x)$ ,

then $\forall x \in M$ , $\psi (x)$ .

Proof of Lemma B.7

Let $S := \{ x \in M : \neg \psi (x) \} \subseteq M$ , and suppose $S \neq \emptyset $ . By $\varphi _1$ , S has an E-minimal element $s_0$ . By (1), $x_0 \neq 0^E$ . Then, $x_0$ is the E-least element in M such that $\neg \psi (x_0)$ . Then, $\psi (y)$ for all $y E x_0$ , and hence, by (2), $\psi (x_0)$ , which contradicts $\neg \psi (x_0)$ . So $S = \emptyset $ . This concludes the proof of Lemma B.7.

Now we can use E-induction to define the transitive collapse $\psi $ of E. For $x \in M$ , let

(1)

$$ \begin{align} \pi (x) := \{ \pi (z) : z E x \}. \end{align} $$

Then, $\pi : M \to \pi (M)$ . Obviously, $\pi (M)$ is transitive and $\pi $ is surjective. Now we show that $\pi $ is injective.

Define a rank function relative to E and M:

$$\begin{align*}\mathsf{rank}^E(x) = \begin{cases} 0, & \text{if } \forall y (\neg y E x),\\ \mathsf{sup}( \{ \mathsf{rank}^E(y) + 1 : y E x \} ), & \text{otherwise.} \end{cases} \end{align*}$$

Injectivity follows by induction on $\mathsf {rank}^E(x)$ : for the base case, let $x, y \in M$ be such that $\mathsf {rank}^E(x) = \mathsf {rank}^E(y) = 0$ . Then, $\pi (x) = \{ x' \in M : x' E x \} = \emptyset = \{ y' \in M : y' E y \} = \pi (y)$ . Now assume that if $\mathsf {max}(\mathsf {rank}^E(x),\mathsf {rank}^E(y)) < \alpha $ , then $\pi (x) = \pi (y)$ . Let $a, b \in M$ be such that $\mathsf {rank}^E(a) = \mathsf {rank}^E(b) = \alpha $ and assume $\pi (a) = \pi (b)$ . If $d E a$ , then $\pi (d) \in \pi (a) = \pi (b)$ , so for some $e \in M$ , $e E b$ and $\pi (d) = \pi (e)$ , and by IH, $d = e$ , so $d E b$ . The same holds for the other direction. So $\forall x \in M (x E a \leftrightarrow x E b)$ . Since E is extensional on M, $a = b$ .

Finally, $\pi $ preserves structure, i.e.,

$$\begin{align*}a E b \leftrightarrow \pi (a) \in \pi (b). \end{align*}$$

For if $b E a$ , then, by 1, $\pi (a) \in \pi (b)$ . On the other hand, if $\pi (a) \in \pi (b)$ , again by 1, $\pi (c) = \pi (b)$ for some $c \in M$ such that $c E a$ . But then $c = b$ since $\pi $ is injective, and so $b E a$ .

This shows that if $\mathcal {M} \models \mathsf {Ext} \land \varphi _1$ , then $\mathcal {M}$ is isomorphic to the structure

$$\begin{align*}\langle \pi (M), \{ \langle \pi (a), \pi (b) \rangle \in \pi (M) \times \pi (M) : a E b \} \rangle, \end{align*}$$

the transitive collapse (or Mostowski collapse) of E and M. Call this structure ${\mathcal {N} = \langle N, \in \rangle }$ .

2.: Now suppose that $\mathcal {N} \models \varphi _2 \land \varphi _3$ . We show that $N = H(\kappa )$ : since $\mathcal {N} \models \varphi _2 \land \varphi _3$ , the elements of N are transitive subsets of N with power at most $< \kappa $ , so $N \subseteq H(\kappa )$ . We now show $H(\alpha ) \subseteq N$ by induction on $\alpha < \kappa $ . For $\alpha = 0$ , $H(\alpha ) = \emptyset \subseteq S$ . Assume that for all $0 < \beta < \kappa $ , $H(\beta ) \subseteq S$ . Let $x \in H(\beta ^{+})$ . Then $|TC(x)| < \beta ^{+} \leq \kappa $ . Then, since $H(\beta ^{+})$ is transitive, so is x. So $x = TC(x)$ , and so $|x|<\beta ^{+}$ . Since $\mathcal {N} \models \varphi _2$ , there is some $y \in N$ such that for all $x_\gamma \in x$ (for all $\gamma \leq \beta ^{+}$ ), $x_\gamma \in y$ , and nothing else is in y. So by $\mathsf {Ext}$ , $x = y \in N$ . Since x was arbitrary, $H(\beta ^{+}) \subseteq N$ . This concludes the proof of Theorem B.6.

Acknowledgments

I would like to thank Neil Barton, Matteo de Ceglie, Ahmed Çevik, Laura Crosilla, Leon Horsten, Deborah Kant, Matteo Plebani, Sam Roberts, Lucas Rosenblatt, Chris Scambler, Jan Sprenger, Davide Sutto, Giorgio Venturi and the audiences in Oslo, London, Warsaw, Pavia, Munich, Konstanz, and Turin for helpful discussions. I am especially grateful to Francesca Boccuni, Andrea Iacona, Hannes Leitgeb, Øystein Linnebo, Julien Murzi and Lorenzo Rossi for their insightful comments and continuous support throughout the development of this work. I also thank an anonymous referee for valuable suggestions that helped improve this paper.

Funding

This research was supported by the Northwest Italy Philosophy PhD Program (FINO).

Footnotes

1 Proponents of absolutism include Lewis [Reference Lewis29], McGee [Reference McGee, Sher and Tieszen33, Reference McGee34], Rayo & Uzquiano [Reference Rayo and Uzquiano40], and Williamson [Reference Williamson49]. Proponents of relativism include Glanzberg [Reference Glanzberg18], Lavine [Reference Lavine28], and Studd [Reference Studd46].

2 One of the notable achievements of modern set theory is that urelements are dispensable for mathematical concerns. The cumulative hierarchy constructed without urelements is sometimes referred to as the hierarchy of pure sets. In what follows, I focus exclusively on sets that are pure in this sense.

3 See Boolos [Reference Boolos5], Incurvati [Reference Incurvati21], Jech [Reference Jech22], Kanamori [Reference Kanamori23, Reference Kanamori24], Kunen [Reference Kunen27], Maddy [Reference Maddy30, Reference Maddy31], and Potter [Reference Potter38].

4 For general background on contextualist approaches to the semantic paradoxes, see Glanzberg [Reference Glanzberg16, Reference Glanzberg17], Murzi & Rossi [Reference Murzi and Rossi35], and Parsons [Reference Parsons37].

5 This view is further expanded, and defended, in Murzi & Rossi [Reference Murzi and Rossi35].

6 Within this work, ‘domains’ (with quotation marks) indicates an ontologically neutral, non-standard reading of domains, according to which ‘domains’ are not entities, whereas domains (without quotation marks) simply refer to standard set-theoretic, entity-based domains.

7 See Russell [Reference Russell43].

8 See Incurvati [Reference Incurvati21, p. 27, notation adjusted].

9 See Boolos [Reference Boolos5], Dummett [Reference Dummett and Dummett12], Gödel [Reference Gödel, Benacerraf and Putnam19], Incurvati [Reference Incurvati21], Kanamori [Reference Kanamori23], Parsons [Reference Parsons37], Shapiro [Reference Shapiro45], and Studd [Reference Studd46].

10 Gödel [Reference Gödel, Benacerraf and Putnam19, pp. 474–475].

11 See Boolos [Reference Boolos5], Kanamori [Reference Kanamori23], Maddy [Reference Maddy30, Reference Maddy31], Potter [Reference Potter38], and Zermelo [Reference Zermelo50].

12 This rejection of the Argument from Paradox is implicit in Rossi [Reference Rossi42] and is further developed in Murzi & Rossi [Reference Murzi and Rossi35].

13 See Rossi [Reference Rossi42, sec. 4].

14 For general background on plural logic, see Florio & Linnebo [Reference Florio and Linnebo15] and Oliver & Smiley [Reference Oliver and Smiley36].

15 When I say that $\kappa $ must be inaccessible, this also covers stronger notions of infinity. So if $\kappa $ is Mahlo, measurable, or supercompact, then $\kappa $ is also inaccessible, and so these ranks are taken into account.

16 In addition to its conceptual and axiomatic virtues, $\mathsf {ZFC}$ is also the most widely accepted foundational theory among working set theorists. As such, focusing on $\mathsf {ZFC}$ aligns not only with philosophical considerations about the nature of sets, but also with the naturalistic observation that it serves as the de facto standard in contemporary set-theoretic research. Thanks to an anonymous referee for pressing me to clarify this point.

17 More on the generalization of standard model theory and plural ‘domains’ in §3.2.

18 Depending on the ambient set theory T, there are more or less V-ranks that are set-sized (i.e., T-provably exist). For instance, if $T = \mathsf {ZFC} - \mathsf {Inf}$ (the axiom of infinity), then only finite ranks exist. If, alternatively, $T = \mathsf {ZFC} + \exists !\kappa ~ \mathsf {MC}(\kappa )$ (there exists exactly one measurable cardinal $\kappa $ ), than all V-ranks below $\kappa $ exist. Either way, whatever provably exists according to the ambient theory T is contained as a set in the absolute ‘domain’.

19 The use of large languages can be understood as a practical heuristic which does not require deep philosophical justification. If we extend the language of set theory $\mathcal {L}_\in $ to a larger language $\mathcal {L}_\in ^{\prime }$ by adding $\kappa $ -many names, every $\mathcal {L}_\in ^{\prime }$ -structure has a unique $\mathcal {L}_\in $ -reduct. This means the semantic framework for such huge languages can always recover the $\mathcal {L}_\in $ -fragment if required. However, the names serve a technical purpose, and so I include them in the semantics.

20 Note the slight abuse of notation here: since $\mathcal {L}$ has a name for every set, it cannot be a set itself. Hence, an interpretation of $\mathcal {L}$ can also not be a set. The reader should bear in mind that all these notions are pluralities. For this reason, I use ‘interpretation’ (with quotation marks), and interpretation (without quotation marks), as in the case of ‘domain’ and domain. See fn. 6.

21 See Beall [Reference Beall2].

22 For simplicity, I often identify domains with models.

23 As emphasized by an anonymous referee, this would just be formalized in the standard way as $\forall \alpha (Ord(\alpha ) \to \exists \beta (Ord(\beta ) \land \alpha < \beta \land V_\alpha \in V_\beta ))$ .

24 I am especially grateful to Deborah Kant and an anonymous referee whose constructive suggestions have significantly improved this section.

25 Their paper is based on ideas developed by George Boolos in the 1980s (see Boolos [Reference Boolos6, Reference Boolos7])

26 Ibid.

27 See Bacon [Reference Bacon1], Bell [Reference Bell3], and Shapiro [Reference Shapiro44].

28 See Florio & Linnebo [Reference Florio and Linnebo13–Reference Florio and Linnebo15] for general plural logic, and Rayo [Reference Rayo39] for plural denotation.

29 For the second-order version $\mathsf {ZFC}_2$ , the important theorem is the famous quasi-categoricity theorem from Zermelo [Reference Zermelo50] (see also Button & Walsh [Reference Button and Walsh8, chap. 8.A]), according to which models of $\mathsf {ZFC}_2$ are exactly the strongly inaccessible rank-initials segments of the cumulative hierarchy. Regarding the infinitary version, a version of $\mathsf {ZFC}$ developed in an infinitary language is strong enough to characterize the sets, hereditarily of cardinality less than $\kappa $ , $\mathcal {H}(\kappa )$ (see Karp [Reference Karp25]). Moreover, whenever $\kappa $ is inaccessible, $V_\kappa = \mathcal {H}(\kappa )$ , which yields the desired result. See Appendix B for a detailed presentation of the object theory together with the characterization theorem.

30 Using a second-order object theory instead, the definition of the RU model for the second-order language would have to be given in a third-order metalanguage. Then, I’d have to give an interpretation of the third-order variables used in the definitions. And, if I stick with the use of plurals in the metatheory, this would then require superplurals in the metametatheory. However, superplural resources are controversial (see Florio & Linnebo [Reference Florio and Linnebo15, chap. 9]).

31 Note that we cannot talk about the cardinality of $\mathcal {L}$ since $\mathcal {L}$ is not a set-theoretic object.

32 See, e.g., Rayo & Uzquiano [Reference Rayo and Uzquiano40], Rayo & Williamson [Reference Rayo, Williamson and Beall41], Button & Walsh [Reference Button and Walsh8, sec. 12.A], Trueman [Reference Trueman47, sec. 7.A], and Rossi [Reference Rossi42].

33 This is again a slight abuse of notation, since $\mathcal {L}$ is not a set-theoretic object, and so set-theoretic operations are not defined for $\mathcal {L}$ . However, the intuition should be clear enough.

34 I will, again, use ‘model’ and model (with and without quotation marks), as in the case of ‘domain’ and domain. See fn. 6.

35 To adapt Definition 3.6 to the strong Kleene satisfaction scheme, replace clause 7 by

$7^{*}$ . $\varphi \equiv \bigvee \Phi $ , $\Phi = \{ \varphi _\alpha : \alpha < \kappa \} \subseteq \mathsf {For}_\kappa $ and for some $\varphi _\alpha \in \Phi $ , $\mathsf {Sat}_\kappa (\psi _\alpha , X,Y).$

The requirement of 7 $^*$ for a disjunction to be satisfied by a model is that one disjunct is satisfied regardless of whether the other disjuncts have a classical truth value. Clause $7$ requires instead that all disjuncts have a classical truth value.

36 Note that this is not the standard notion of a syntactic restriction of the universal quantifier that is in place here, as this would be of the form $\forall y (y \in V_\alpha \rightarrow \ldots )$ . However, the right-hand side of the biconditional ensures that $y \in V_\alpha ,$ which is enough.

37 Thanks to an anonymous referee for highlighting this point.

38 See Devlin [Reference Devlin10, sec. I.8].

39 Wang [Reference Wang, Haller and Brandl48, p. 280].

40 See, e.g., Devlin [Reference Devlin10, pp. 25–26].

41 Thanks to Leon Horsten and Giorgio Venturi for emphasizing this point.

42 For a more detailed presentation, the reader is referred to Bell [Reference Bell, Zalta and Nodelman4], Karp [Reference Karp25], and Marker [Reference Marker32].

43 See Dickmann [Reference Dickmann11, p. 85 and p. 139] for details.

44 For more on both, see Keisler [Reference Keisler26] or Marker [Reference Marker32].

45 See Kunen [Reference Kunen27, p. 78, lemma I.13.29].

46 Ibid. Lemma I.13.31.

47 See Shapiro [Reference Shapiro44, p. 113 ff.] for more details.

48 This is essentially an application of Mostowski’s collapsing theorem (see Jech [Reference Jech22, theorem 6.15, p. 69].

References

BIBLIOGRAPHY

Bacon, A. (2023). A Philosophical Introduction to Higher-order Logics. Cambridge, UK: Routledge; Chapman & Hall, Incorporated. https://doi.org/10.4324/9781003039181.CrossRef Google Scholar

Beall, J. C. (2016). Off-topic: A new interpretation of Weak-Kleene logic. Australasian Journal of Logic, 13(6), 136–142. https://doi.org/10.26686/ajl.v13i6.3976.CrossRef Google Scholar

Bell, J. L. (2022). Higher-Order Logic and Type Theory, Elements in Philosophy and Logic. Cambridge, UK: Cambridge University Press. https://doi.org/10.1017/9781108981804.CrossRef Google Scholar

Bell, J. L. (2023). Infinitary logic. In Zalta, E. N., and Nodelman, U., editors. The Stanford Encyclopedia of Philosophy. Standford, CA: Metaphysics Research Lab; Stanford University.Google Scholar

Boolos, G. (1971). The iterative conception of set. The Journal of Philosophy, 68(8), 215–231. https://doi.org/10.2307/2025204.CrossRef Google Scholar

Boolos, G. (1984). To be is to be a value of a variable (or to be some values of some variables). The Journal of Philosophy, 81(8), 430–449. https://doi.org/10.2307/2026308.CrossRef Google Scholar

Boolos, G. (1985). Nominalist platonism. The Philosophical Review, 94(3), 327–344. https://doi.org/10.2307/2185003.CrossRef Google Scholar

Button, T., & Walsh, S. (2018). Philosophy and Model Theory (first edition). Oxford: Oxford University Press. https://doi.org/10.1093/oso/9780198790396.001.0001.CrossRef Google Scholar

Cartwright, R. L. (1994). Speaking of everything. Noûs, 28(1), 1–20. https://doi.org/10.2307/2215917.CrossRef Google Scholar

Devlin, K. J. (2017). Constructibility, Perspectives in Logic. Cambridge, UK: Cambridge University Press. https://doi.org/10.1017/9781316717219.CrossRef Google Scholar

Dickmann, M. (1975). Large Infinitary Languages. Model Theory. Amsterdam: North-Holland.Google Scholar

Dummett, M. (1973). In Dummett, M., editor. Philosophy of Language. London: Duckworth, XXV, p. 698 S. Google Scholar

Florio, S., & Linnebo, Ø. (2016). On the innocence and determinacy of plural quantification. Noûs, 50(3), 565–583. https://doi.org/10.1111/nous.12091.Google Scholar

Florio, S., & Linnebo, Ø. (2020). Critical plural logic. Philosophia Mathematica, 28(2), 172–203. https://doi.org/10.1093/philmat/nkaa020.CrossRef Google Scholar

Florio, S., & Linnebo, Ø. (2021). The Many and the One: A Philosophical Study of Plural Logic. Oxford: Oxford University Press. https://doi.org/10.1093/oso/9780198791522.001.0001.CrossRef Google Scholar

Glanzberg, M. (2001). The liar in context. Philosophical Studies, 103(3), 217–251. http://www.jstor.org/stable/4321137.CrossRef Google Scholar

Glanzberg, M. (2004a). A contextual-hierarchical approach to truth and the liar paradox. Journal of Philosophical Logic, 33(1), 27–88. https://www.jstor.org/stable/30226945.CrossRef Google Scholar

Glanzberg, M. (2004b). Quantification and realism. Philosophy and Phenomenological Research, 69(3), 541–572. http://www.jstor.org/stable/40040767.CrossRef Google Scholar

Gödel, K. (1984). What is Cantor’s continuum problem? In Benacerraf, P., and Putnam, H., editors. Philosophy of Mathematics: Selected Readings. Cambridge, UK: Cambridge University Press, pp. 470–485. https://doi.org/10.1017/CBO9781139171519.025.CrossRef Google Scholar

Hanf, W. P., & Scott, D. (1961). Classifying inaccessible cardinals. Notices of the American Mathematical Society, 8, 445.Google Scholar

Incurvati, L. (2020). Conceptions of Set and the Foundations of Mathematics. Cambridge: Cambridge University Press. https://doi.org/10.1017/9781108596961.CrossRef Google Scholar

Jech, T. (2006). Set Theory. The third millennium edition, revised and expanded, corrected 4th printing, Springer Monographs in Mathematics. Berlin: Springer. https://doi.org/10.1007/3-540-44761-X.Google Scholar

Kanamori, A. (2004). Zermelo and set theory. The Bulletin of Symbolic Logic, 10(4), 487–553. http://www.jstor.org/stable/3216738.CrossRef Google Scholar

Kanamori, A. (2008). The higher infinite. Large cardinals in Set Theory From Their Beginnings (second edition), Springer Monographs in Mathematics. Berlin: Springer. https://doi.org/10.1007/978-3-540-88867-3.Google Scholar

Karp, C. (1964). Languages with Expressions of Infinite Length, Studies in Logic and the Foundations of Mathematics. Amsterdam: North-Holland.Google Scholar

Keisler, H. J. (2010). Model Theory for Infinitary Logic. Logic with Countable Conjunctions and Finite Quantifiers, Studies in Logic and the Foundations of Mathematics. Amsterdam: North-Holland Pub. Co. http://www.sciencedirect.com/science/bookseries/0049237X/62.Google Scholar

Kunen, K. (1980). Set Theory. An Introduction to Independence Proofs, Studies in Logic and the Foundations of Mathematics. Amsterdam: North-Holland.Google Scholar

Lavine, S. (2006). Something about everything: Universal quantification in the universal sense of universal quantification. In Absolute Generality, pp. 98–148. https://doi.org/10.1093/oso/9780199276424.003.0005.CrossRef Google Scholar

Lewis, D. K. (1991). Parts of Classes. Oxford, UK: Blackwell, p. 155 S. Google Scholar

Maddy, P. (1988a). Believing the axioms. I. The Journal of Symbolic Logic, 53(2), 481–511. https://doi.org/10.2307/2274520.CrossRef Google Scholar

Maddy, P. (1988b). Believing the axioms. II. Journal of Symbolic Logic, 53(3), pp. 736–764. https://doi.org/10.2307/2274569.CrossRef Google Scholar

Marker, D. (2016). Lectures on Infinitary Model Theory, Lecture Notes in Logic. Cambridge: Cambridge University Press. https://doi.org/10.1017/9781316855560.CrossRef Google Scholar

McGee, V. (2000). ‘Everything’. In Sher, G., and Tieszen, R., editors. Between Logic and Intuition: Essays in Honor of Charles Parsons. Cambridge, UK: Cambridge University Press, pp. 54–78. https://doi.org/10.1017/CBO9780511570681.005.CrossRef Google Scholar

McGee, V. (2006). There’s a rule for everything. In Rayo, A. & Uzquiano, G., editors. Absolute Generality. Oxford, UK: Oxford University Press, pp. 179–202. https://doi.org/10.1093/oso/9780199276424.003.0007.CrossRef Google Scholar

Murzi, J., & Rossi, L. Truth and Paradox in Context. Oxford. (in preparation).Google Scholar

Oliver, A., & Smiley, T. (2016). Plural Logic (second edition). Revised and Enlarged. Oxford, UK: Oxford University Press. https://doi.org/10.1093/acprof:oso/9780198744382.001.0001.CrossRef Google Scholar

Parsons, C. (1974). The liar paradox. Journal of Philosophical Logic, 3(4), 381–412. https://www.jstor.org/stable/30226094.CrossRef Google Scholar

Potter, M. D. (2004). Set Theory and its Philosophy. A Critical Introduction. Oxford: Oxford University Press. https://doi.org/10.1093/acprof:oso/9780199269730.001.0001.CrossRef Google Scholar

Rayo, A. (2006). Beyond plurals. In Rayo, A. & Uzquiano, G., editors. Absolute Generality. Oxford, UK: Oxford University Press, pp. 220–254. https://doi.org/10.1093/oso/9780199276424.003.0009.CrossRef Google Scholar

Rayo, A., & Uzquiano, G. (1999). Toward a theory of second-order consequence. Notre Dame Journal of Formal Logic, 40(3), 315–325. https://doi.org/10.1305/ndjfl/1022615612.CrossRef Google Scholar

Rayo, A., & Williamson, T. (2003). A completeness theorem for unrestricted first-order languages. In Beall, J. C., editor. Liars and Heaps—New Essays on Paradox. Oxford: Oxford University Press, pp. 331–356. https://doi.org/10.1093/oso/9780199264803.003.0016.Google Scholar

Rossi, L. (2023). Bicontextualism. Notre Dame Journal of Formal Logic, 64(1), 95–127. https://doi.org/10.1215/00294527-2023-0004.CrossRef Google Scholar

Russell, B. (1907). On some difficulties in the theory of transfinite numbers and order types. In Proceedings of the London Mathematical Society s2-4.1. London, UK: London Mathematical Society, pp. 29–53. https://doi.org/10.1112/plms/s2-4.1.29.CrossRef Google Scholar

Shapiro, S. (1991). Foundations Without Foundationalism: A Case for Second-Order Logic. Oxford: Oxford University Press. https://doi.org/10.1093/0198250290.001.0001.Google Scholar

Shapiro, S. (2003). All sets great and small: and I do mean ALL. Philosophical Perspectives, 17(1), 467–490. https://doi.org/10.1111/j.1520-8583.2003.00014.x.CrossRef Google Scholar

Studd, J. (2019). Everything, More or Less. A Defence of Generality Relativism. Oxford: Oxford University Press. https://doi.org/10.1093/oso/9780198719649.001.0001.CrossRef Google Scholar

Trueman, R. (2021). Properties and Propositions: The Metaphysics of Higher-Order Logic. Cambridge, UK: Cambridge University Press. https://doi.org/10.1017/9781108886123.Google Scholar

Wang, H. (1990). Philosophy through mathematics and logic. In Haller, R., and Brandl, J., editors. Wittgenstein—Eine Neubewertung / Wittgenstein—Towards a Re-Evaluation. Munich: J.F. Bergmann-Verlag, pp. 142–154. https://doi.org/10.1007/978-3-662-30086-2_13.CrossRef Google Scholar

Williamson, T. (2003). Everything. Philosophical Perspectives, 17, 415–465. https://doi.org/10.1111/j.1520-8583.2003.00017.x.CrossRef Google Scholar

Zermelo, E. (1930). Über grenzzahlen und mengenbereiche: Neue untersuchungen über die grundlagen der mengenlehre. Fundamenta Mathematicae, 16, 29–47.CrossRef Google Scholar

Article contents

SET-THEORETIC BICONTEXTUALISM

Abstract

Keywords

MSC classification

Information

1 Introduction

2 Relativism in set theory

3 Bicontextualism for set theory

3.1 Heuristics

3.2 The RU approach

3.3 An absolutist-friend semantics for set theory

Definition 3.2 (First-Order RU-model)

3.3.1 RU-domains and RU-variations

Definition 3.4 (RU-domain)

Definition 3.5 (RU-Variant)

3.4 The satisfaction predicate

Definition 3.6 (Weak Kleene Satisfaction)

Remark 3.7 (Convention on the enumeration of inaccessible ranks)

3.5 Combining absolutism and realism in the RU approach

Remark 3.12 (Bicontextualist classification of inherently absolutistic and inherently relativistic truths)

3.6 A bicontextualist notion of logical consequence

4 Set-theoretic bicontextualist treatment of the paradoxes

5 Extracting core properties

6 Objections and replies

6.1 Reflection principles and inner models

6.2 Anti-naturalism

6.3 Isn’t that just generality absolutism?

7 Conclusion

A Infinitary logic

B Characterization of $\mathcal {H}(\kappa )$ in $\mathcal {L}_{\kappa ^{+} \kappa ^{+}}$

Definition B.1 (Sets Hereditarily of Power $\leq \kappa $ )

Definition B.5 ( $\mathsf {ZFC}_\kappa $ )

Theorem B.6 Hanf & Scott [Reference Hanf and Scott20]

Proof of Lemma B.7

Acknowledgments

Funding

Footnotes

References

BIBLIOGRAPHY

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests