1 Introduction
This year, 2022, the mathematical world is celebrating the 100th anniversary of Stefan Banach’s fixed point theorem, typically referred to as the Banach contraction principle [Reference Banach1]. The standard formulation of the principle says that if $T: M \rightarrow M$ is a contraction, where M is a complete metric space, then T has a unique fixed point $z \in M$ and for every $x \in M$ , $T^n(x) \rightarrow z$ . Since 1922, thousands of variants and generalisations of the principle have been produced, providing new views on the main idea of the principle and enlarging its application domain. We can easily identify the four main ingredients of the Banach principle and all of them are usually present in such generalisations: (1) a set M equipped with a mathematical device to measure how distant from each other any two elements of the space are; (2) a contracting operation (or a family of such operations) acting on elements of M; (3) existence of a unique fixed point (or of a unique common fixed point) in M for this operation (or a family of operations); (4) a constructive method of approximating such a unique fixed point by the use of Picard’s iterates. Any fixed point results that contain these four elements are definitely children or grand-children of Banach’s principle.
In this paper, we present fixed point results that contain the four ingredients. As the set, we take a subset of a topological vector space equipped with a (not necessarily convex) modular as the device measuring the distance. We will prove the existence and uniqueness of a common fixed point for a contractive (in this modular sense) semigroup of nonlinear mappings acting in this set. And, yes, we will show that this fixed point is a limit of orbits being a semigroup equivalent of Picard’s iterates. The paper uses the framework of modulated topological vector spaces (MTVSs) introduced in [Reference Kozlowski10, Reference Kozlowski11]. Our fixed point theorems, besides being grand-children of Banach’s theorem, generalise known results from the fixed point theory in Banach spaces (for example, Theorem 3.1 in the 2008 paper by Kirk and Xu [Reference Kirk and Xu7]), and from the fixed point theory in modular function spaces (for example, Theorem 3.6 in Kozlowski’s paper [Reference Kozlowski9], see also [Reference Khamsi and Kozlowski6, Theorem 7.1]). We need to emphasise that the results of the current paper (Theorems 3.5 and 3.9) do not assume convexity of a modular, nor do we assume any particular underlying measure theory structures as in [Reference Khamsi and Kozlowski6, Reference Kozlowski9], which simplifies the exposition and widens the application domain. As a new technique, we introduce a concept of Opial sets, as a generalisation of the weak Opial property from Banach spaces. Interesting examples, including spaces as classical as $L^p$ , demonstrate the advantages of the modular approach, since we know that many such spaces (like $L^p$ spaces for $1 \leq p \ne 2$ ) do not satisfy the weak Opial condition.
2 Preliminaries
We use the framework of modulated topological vector spaces introduced by the author [Reference Kozlowski10, Reference Kozlowski11]. We refer the reader to these papers for the detailed exposition of the theory and a list of examples. We need to note though that, in contrast with the cited papers, we do not assume here the convexity of the modular. This does not have any impact on the results of this theory that we need in the current work.
First, let us recall the definition of a modular, a modular space and $\rho $ -convergence together with associated notions (see papers [Reference Kozlowski10, Reference Kozlowski11] and also [Reference Khamsi and Kozlowski6, Reference Kozlowski8] for further details).
Definition 2.1. A functional $\rho : X \rightarrow [0,\infty ]$ is called a modular if:
-
(1) $\rho (x) = 0$ if and only if $x =0$ ;
-
(2) $\rho (-x) = \rho (x)$ ;
-
(3) $\rho (\alpha x + \beta y) \leq \rho (x) + \rho (\kern1.1pt y)$ for all $x,y \in X$ , $\alpha , \beta \geq 0$ with $\alpha +\beta = 1$ .
A modular is called a convex modular if it is a convex function. The vector space $X_{\rho } = \{ x\in X: \rho (\lambda x) \rightarrow 0,\mbox { as } \lambda \rightarrow 0 \}$ is called a modular space.
Definition 2.2. Let $\rho $ be a modular defined on a vector space X.
-
(a) We say that $\{x_{n}\}$ , a sequence of elements of $X_{\rho }$ , is $\rho $ -convergent to x and write $x_{n} \stackrel {\rho }{\rightarrow } x$ if $\rho (x_{n}-x)\rightarrow 0$ .
-
(b) A sequence $\{x_{n}\}$ , where $x_{n} \in X_{\rho }$ , is called $\rho $ -Cauchy if $\rho (x_{n}{\kern-1.2pt}-{\kern-1.2pt}x_{m}){\kern-1.2pt}\rightarrow{\kern-1.2pt} 0$ as $n,m {\kern-1.2pt}\rightarrow{\kern-1.2pt} \infty $ .
-
(c) $X_{\rho }$ is called $\rho $ -complete if every $\rho $ -Cauchy sequence is $\rho $ -convergent to an $x \in X_{\rho }$ .
-
(d) A set $B\subset X_{\rho }$ is called $\rho $ -closed if for any sequence of $x_{n} \in B$ , the convergence $x_{n} \stackrel {\rho }{\rightarrow } x$ implies that x belongs to B.
-
(e) A set $B\subset X_{\rho }$ is called $\rho $ -bounded if its $\rho $ -diameter $\delta _\rho (B)$ , which is defined by $\delta _\rho (B) = \sup \{\,\rho (x-y): x \in B, y \in B \}$ , is finite.
-
(f) A set $K\subset X_{\rho }$ is called $\rho $ -compact if for any $\{x_n\}$ in K, there exists a subsequence $\{x_{n_k}\}$ and an $x \in K$ such that $\rho (x_{n_k}-x)\rightarrow 0$ .
-
(g) A $\rho $ -ball $B_{\rho }(x,r)$ is defined by $B_{\rho }(x,r) = \{ y \in X_{\rho }: \rho (x-y) \leq r\}$ .
We are now ready to introduce the main concept of a modulated topological vector space.
Definition 2.3. Let $\rho $ be a modular defined on a real vector space X and let $\tau $ be a linear, Hausdorff topology on $X_{\rho }$ . The triplet $(X_{\rho }, \rho , \tau )$ is called a modulated topological vector space if the following two conditions are satisfied:
-
(i) $\rho $ is sequentially $\tau $ -lower semi-continuous on X, that is, $\rho (x) \leq \liminf _{n \rightarrow \infty } \rho (x_n),$ provided $x_{n} \stackrel {\tau }{\rightarrow } x;$
-
(ii) if $x_{n} \stackrel {\rho }{\rightarrow } x$ , then there exists a sub-sequence $\{x_{n_k}\}$ of $\{x_{n}\}$ such that $x_{n_k} \stackrel {\tau }{\rightarrow } x$ , where $x,\:x_n \in X$ .
Proposition 2.4 [Reference Kozlowski10, Proposition 2.5].
Let $(X_{\rho }, \rho , \tau )$ be a $\rho $ -complete modulated topological vector space. The following assertions follow immediately from the above definitions.
-
(i) Every $\tau $ -closed set is also $\rho $ -closed.
-
(ii) Every $\rho $ -compact set is also sequentially $\tau $ -compact.
-
(iii) Every $\rho $ -ball $B_{\rho }(x,r)$ is $\tau $ -closed (and hence also $\rho $ -closed).
As shown in the cited papers, the typical examples of modulated topological vector space are: Banach spaces with $\rho $ being the norm and $\tau $ the weak topology; modular function spaces with $\tau $ being the topology of the convergence in sub-measure, and in particular $L^p$ for $p> 0$ ; variable Lebesgue spaces $L^{p(\cdot )}$ where $1 \leq p(t) < + \infty $ ; Orlicz and Musielak–Orlicz spaces, where $\tau $ denotes the topology of convergence in a finite measure m.
3 Fixed point theorems for contractive semigroups
Let us start with the definition of a contractive semigroup in a modulated topological vector space.
Definition 3.1. Let C be a nonempty subset of a modulated topological vector space $(X_{\rho }, \rho , \tau )$ and let $\mathcal {T} = \{T_t: t \geq 0 \}$ be a one-parameter family of mappings from C into itself. Then $\mathcal {T}$ is called a contractive semigroup on C if:
-
(i) $T_0(x) = x$ for any $x \in C$ ;
-
(ii) $T_{t+s}(x) = T_t(T_s(x))$ for any $x \in C$ and any $s,t \geq 0$ ;
-
(iii) for each $t \geq 0$ , $T_t$ is a contraction with a constant $L_t$ , $0 < L_t < 1$ , that is, we have ${\rho (T_t(x) - T_t(\kern1.1pt y)) \leq L_t \rho (x - y) }$ for all $x,y \in C$ ;
-
(iv) $\limsup _{t \rightarrow + \infty } L_t < 1.$
An element $x_0 \in C$ is called a common fixed point for $\mathcal {T}$ if $T_t(x_0) = x_0$ for every $t \geq 0$ . The set (possibly empty) of all common fixed points for $\mathcal {T}$ will be denoted by $F(\mathcal {T})$ .
The next result follows immediately from this definition.
Proposition 3.2. If $\mathcal {T}$ is a contractive semigroup, then $F(\mathcal {T})$ consists at most of one element.
The following ‘convergence of orbits’ lemma provides a constructive method of approximating a common fixed point knowing its existence.
Lemma 3.3. Let $z \in F(\mathcal {T})$ , where $\mathcal {T}= \{T_t: t \geq 0 \}$ is a contractive semigroup on $C \subset ~X_{\rho }$ . Then $T_t (u) \stackrel {\rho }{\rightarrow } z$ for every $u \in C$ .
Proof. Let us fix $u \in C$ . Note that for any $s, t \geq 0$ ,
Hence,
for any $t \geq 0$ , which implies that $\limsup _{s \rightarrow \infty } \rho (T_s(u) - z ) = 0$ as $\limsup _{t \rightarrow + \infty } L_t < 1$ . The same reasoning can be applied to $\liminf $ giving the required convergence.
Let us recall the concept of $\rho $ -types.
Definition 3.4. Given a nonempty subset C of a modulated topological vector space $(X_{\rho }, \rho , \tau )$ , and a sequence $\{x_n\}$ of elements from C (alternatively, a net $\{x_t\}_{t \in [0,+ \infty )}$ ), a function $\Phi : C \rightarrow [0,+ \infty ]$ defined for any $z \in X$ by
alternatively by
is called a $\rho $ -type on C.
We are now ready to prove our first fixed point result.
Theorem 3.5. Let C be a nonempty $\rho $ -bounded subset of a $\rho $ -complete modulated topological vector space $(X_{\rho }, \rho , \tau )$ and let $\mathcal {T}= \{T_t: t \geq 0 \}$ be a contractive semigroup on C. Assume that there exists an $x \in C$ such that the $\rho $ -type $\Phi $ defined for $u \in C$ by $\Phi (u) = \limsup _{t \rightarrow \infty } \rho (T_t(x) - u)$ attains its minimum in C. Then there exists a unique common fixed point $z \in F(\mathcal {T})$ . Moreover, $\rho (T_t(u) - z ) \rightarrow 0$ for every $u \in C$ .
Proof. In view of Proposition 3.2 and Lemma 3.3, we need only to prove the existence of a common fixed point. By our assumption, there exists $z \in C$ such that $\Phi (z) = \inf \{\Phi (\kern1.1pt y): y \in C\}$ . We shall prove now that $\Phi (z) = 0$ . To this end, we note first that for any $s,t \geq 0$ ,
and that by letting $s \rightarrow + \infty $ , we get $\Phi (T_t(z)) \leq L_t \Phi (z)$ , which, by letting t tend to infinity, implies that
Since $\limsup _{t \rightarrow + \infty } L_t < 1$ , we conclude that $\Phi (z) = 0$ , and hence, by the definition of $\Phi $ , for any $s \geq 0$ ,
Hence, $T_t(x) \stackrel {\rho }{\rightarrow } z$ and $T_t(x) \stackrel {\rho }{\rightarrow } T_s(z)$ for any $s \geq 0$ , which, by the uniqueness of the $\rho $ -limit, implies that $T_s(z) = z$ for any $s \geq 0$ , that is, $z \in F(\mathcal {T})$ , as claimed.
To be able to effectively use Theorem 3.5, we need to have practical ways of assessing for which sets C the $\rho $ -types attain their minimum. In the Banach space case, as is well known, this will be true if C is a nonempty, convex, bounded and weakly compact set (see, for example, [Reference Goebel and Kirk3, Reference Lim12]). However, this result depends on some specific Banach space characteristics, like a triangle property of norms, and on the fact that a closed convex subset of a weakly compact set is weakly compact itself. Since in the setting of modulated topological vector spaces these properties are generally not available, we are going to introduce a powerful technique of $\tau $ -Opial sets, which will allow us to prove our second fixed point result, Theorem 3.9, and to provide a list of examples and applications. We shall begin by recalling a standard result, written below in the language of modulated topological vector spaces. We write l.s.c. to mean lower semi-continuous.
Lemma 3.6. Let C be a nonempty, sequentially $\tau $ -compact subset of a $\rho $ -complete modulated topological vector space $(X_{\rho }, \rho , \tau )$ . Let $\Psi : C \rightarrow [0,+ \infty )$ . If $\Psi $ is sequentially $\tau $ -l.s.c., then $\Psi $ attains its minimum in C.
Proof. Denote $\Psi _0 = \inf \{ \Psi (\kern1.1pt y): y \in C\}$ . Let $\Psi _0 = \lim _{n \rightarrow \infty } \Psi (\kern1.1pt y_n)$ for some sequence $\{y_n\}$ of elements of C. By the sequential $\tau $ -compactness of C, we can choose a subsequence $\{y_{n_{k}} \}$ such that $ y_{n_{k}} \stackrel {\tau }{\rightarrow } y$ for a $y \in C$ . Using the assumption that $\Psi $ is sequentially $\tau $ -l.s.c.,
Definition 3.7. Let C be a nonempty subset of a modulated topological vector space $(X_{\rho }, \rho , \tau )$ . We say that C is a $\tau $ -Opial set (or, in short, a ( $\tau $ -O) set) if for every $y \in C$ and every sequence $\{x_n\}$ of elements of C with $x_n \stackrel {\tau } {\rightarrow } x$ for an $x \in C,$
Theorem 3.8. Let C be a nonempty, sequentially $\tau $ -compact, $\rho $ -bounded subset of a $\rho $ -complete modulated topological vector space $(X_{\rho }, \rho , \tau )$ . If C is a $\tau $ -Opial set, then every $\rho $ -type $\Phi $ on C is sequentially $\tau $ -l.s.c. and attains its minimum in C. Moreover, if $\{y_n\}$ is a sequence of elements of C such that $ y_{n} \stackrel {\tau }{\rightarrow } y$ , then
Proof. Observe first that $\Phi (\kern1.1pt y) < + \infty $ for every $y \in C$ because C is $\rho $ -bounded. It is obvious that to show that $\Phi $ is sequentially $\tau $ -l.s.c. (and hence that it attains its minimum, by Lemma 3.6), it is enough to prove inequality (3.1). To this end, let $\Phi $ be defined by a sequence $\{x_n\}$ of elements of C, that is, $\Phi (z) = \limsup _{n \rightarrow \infty } \rho (x_n - z)$ for any $z \in C$ . Let us fix $y \in C$ . Let $\{x_{p(n)} \}$ be a subsequence of $\{x_n\}$ such that $x_{p(n)} \stackrel {\tau }{\rightarrow } x$ for some $x \in C$ (recall that C is sequentially $\tau $ -compact) and that $\Phi (\kern1.1pt y) = \lim _{n \rightarrow \infty } \rho (x_{p(n )}- y)$ . Let us fix temporarily $m \in \mathbb {N}$ and observe that
Since C is a ( $\tau$ -O) set, it follows that
and hence by (3.2),
By taking $m \rightarrow \infty $ , we get
Using ( $\tau $ -O) again, this time with $\liminf _{m \rightarrow \infty } \rho (x - y_m)$ , and substituting into (3.3),
However,
and using ( $\tau $ -O) on the right-hand side of (3.5),
which, combined with (3.4), gives us finally
as claimed.
Combining Theorems 3.5 and 3.8, we immediately obtain the following fixed point result.
Theorem 3.9. Let C be a $\rho $ -bounded and sequentially $\tau $ -compact subset of a $\rho $ -complete modulated topological vector space $(X_{\rho }, \rho , \tau )$ . Let $\mathcal {T}= \{T_t: t \geq 0 \}$ be a contractive semigroup on C. If C is a $\tau $ -Opial set, then there exists a unique common fixed point $z \in F(\mathcal {T})$ . Moreover, $\rho (T_t(u) - z ) \rightarrow 0$ for every $u \in C$ .
While the notion of an Opial set as defined in Definition 3.7, alluding to the celebrated weak Opial property [Reference Opial13], is actually a novel concept, it has been known since the 1996 work by Khamsi [Reference Khamsi4] (see also [Reference Khamsi and Kozlowski6, Theorem 4.7]) that in every $\Delta _2$ modular function space defined by a convex, orthogonally additive modular $\rho $ , every $\rho $ -bounded set is an Opial set. Subject to some technicalities, the same also holds without $\Delta _2$ . This fact gives a wide range of function spaces where bounded sets are Opial sets, including $L^p$ for $p\geq 1$ , variable Lebesgue spaces $L^{p(\cdot )}$ where $1 \leq p(t) < + \infty $ , Orlicz and Musielak–Orlicz spaces. Let us recall the fact observed already in [Reference Opial13] that the weak Opial property does not hold in $L^p$ spaces for $1 \leq p \ne 2$ , and hence there are bounded sets which are not Opial sets with $\rho = \| \cdot \|_p$ , but, as seen above, they are Opial sets with $ \rho (x) = \int _{[0,1]} |x(t)|^p \,dm(t)$ . It is also important to notice the nonconvex modular case for $0 < p < 1$ where all sets are Opial sets, due to the fact that $\lim _{n \rightarrow + \infty } \{ \|x_n\|_{p}^{p} - \|x_n - x\|_{p}^{p} \} = \|x\|_{p}^{p}$ , whenever $x_n \rightarrow x$ almost everywhere, as shown already in [Reference Brezis and Lieb2].
In view of these remarks, Theorem 3.9 is a generalisation of the results proven in the context of regular convex modular function spaces (see, for example, [Reference Khamsi and Kozlowski5, Theorem 4.2], [Reference Kozlowski9, Theorem 3.6] and [Reference Khamsi and Kozlowski6, Theorem 7.1]) that can be applied to spaces like $L^p$ for $p> 0 $ , variable Lebesgue spaces $L^{p(\cdot )}$ where $1 \leq p(t) < + \infty $ , Orlicz and Musielak–Orlicz spaces.
Acknowledgement
The author would like to thank the anonymous referee for valuable suggestions to improve the presentation of the paper.