1 Introduction
In [Reference Tatarko and Werner34], an $L_p$ Steiner formula was proved for the $L_p$ affine surface area, namely, if a convex body K is $C^2_+$ , then we have, for all suitable t and for all $p \in \mathbb {R}$ , $p \neq -n$ , that
where
is the $L_p$ affine surface area of a convex body K, $N(x)$ is the outer normal to K in $x \in \partial K$ , the boundary of K, $H_{n-1}(x)$ is the Gauss curvature in x, $\mathcal {H}^{n-1}$ is the usual surface area measure on $\partial K$ , and ${\alpha \choose k}$ are binomial coefficients (see (2.3)). The Euclidean unit ball centered at $0$ is denoted by $B^n_2$ .
Identity (1.1) is the analog of the classical Steiner formula (e.g., [Reference Gardner13, Reference Schneider29]) of the Brunn–Minkowski theory in the more recent $L_p$ Brunn–Minkowski theory. This theory has as its starting point Lutwak’s seminal paper [Reference Lutwak23] and it has been developed immensely (e.g., [Reference Besau, Ludwig and Werner3–Reference Böröczky, Lutwak, Yang and Zhang6, Reference Gardner, Hug, Weil and Ye14, Reference Huang, Lutwak, Yang and Zhang17, Reference Ludwig and Reitzner21, Reference Meyer and Werner24, Reference Ye41–Reference Zhao44]). In analogy to the classical theory, the coefficients ${\mathcal {W}}^p_{{m}, k}(K) $ are called $L_p$ Steiner coefficients and they are defined in [Reference Tatarko and Werner34] for a (general) convex body K in $\mathbb {R}^n$ , for all $k, m \in \mathbb {N} \cup \{0\}$ as
where the $H_j$ are the jth normalized elementary symmetric functions of the principal curvatures. The $c(n, p,m)$ are certain binomial coefficients (see [Reference Tatarko and Werner34, Reference Tatarko and Werner35] for the details). The $L_p$ Steiner coefficients were studied in [Reference Tatarko and Werner35], where it was proved, among other results, that they are valuations on the set of convex bodies.
1.1 Main results
In this paper, we look at the $L_p$ Steiner formula with a different focus. Expressions also appearing naturally in formula (1.1) are weighted $L_p$ affine surface areas, $ \mu _{\vec {i}}-as_p(K)$ , which we define in Section 2.2.
We investigate in detail those weighted $L_p$ affine surface areas in Section 3. We show that they are homogeneous of a certain degree and are invariant under rotations and reflections. We show that they are valuations on the set of convex bodies. Valuations have become a vitally import subject of study in convexity and affine and differential geometry (e.g., [Reference Alesker1, Reference Colesanti, Ludwig and Mussnig10, Reference Haberl15, Reference Haberl and Parapatits16, Reference Ludwig22, Reference Schuster30]). The weighted $L_p$ affine surface areas satisfy isoperimetric inequalities which generalize the $L_p$ affine isoperimetric inequalities of [Reference Lutwak23, Reference Werner and Ye39]. This is shown in Theorem 3.2.
Theorem 3.2 Let $s\neq -n, r \neq -n, t \neq -n$ be real numbers. Let K be a $C^2_+$ convex body in $\mathbb R^n$ with centroid at the origin.
(i) If $\frac {(n+r)(t-s)}{(n+t)(r-s)}>1$ , then
(ii) If $\frac {(n+r)t}{(n+t)r}>1$ , then
Equality holds in the above inequalities, if and only if K is an ellipsoid.
We show in Section 3.3, that the weighted $L_p$ affine surface areas have natural geometric interpretations in terms of certain convex bodies associated with the given convex body K.
We prove a monotonicity behavior in the parameter p for the weighted $L_p$ affine surface areas which allows to establish asymptotics for the weighted $L_p$ affine surface areas. These asymptotics connects them to entropy powers, namely to the Kullbak–Leibler divergence $D_{KL}$ of the cone measures of K and its polar $K^\circ $ , $Q_K,$ and $P_K$ . We quote the relevant Theorem 3.8 and refer to Section 3.4 for the details. We put
and then the following theorem holds.
Theorem 3.8 Let K be a $C^2_+$ convex body in $\mathbb R^n$ with centroid at the origin. Then,
-
(i)
$$ \begin{align*} \lim\limits_{p \rightarrow \infty} \left(\frac{\omega_{m, k, \vec{i}}^p (K)}{\omega^\infty_{m, k, \vec{i}}(K)}\right)^{n+p} = \mathrm{exp} \left(-\frac{n \, D_{KL} (P_K||Q_K)}{\mu_{\vec{i}} - {\mathrm{vol}}_n(K^\circ)}\right). \end{align*} $$ -
(ii)
$$ \begin{align*} \lim\limits_{p \rightarrow 0} \left(\frac{\omega_{m, k, \vec{i}}^p (K^\circ)}{\omega^0_{m, k, \vec{i}}(K^\circ)}\right)^{\frac{n(n+p)}p} = \mathrm{exp} \left(-\frac{n \, D_{KL} (P_{K^\circ}||Q_{K^\circ})}{\mu_{\vec{i}} - {\mathrm{vol}}_n(K^\circ)}\right). \end{align*} $$
This leads naturally to consider more general f-divergences than just the Kullbak–Leibler divergence. We treat that in Section 4, where we also observe that the weighted $L_p$ affine surface areas themselves are special f-divergences.
Throughout the paper, we assume that the convex bodies K are $C^2_+$ , i.e., K has twice continuously differentiable boundary with strictly positive Gauss curvature everywhere and such that $0$ is the centroid of K, $0 = \frac {1}{{\mathrm {vol}}_n(K)} \int _{K} xdx$ .
2 Weighted $L_p$ -affine surface areas
2.1 Background from differential geometry
For more information and the details in this section, we refer to, e.g., [Reference Gardner13, Reference Schneider29].
Let K be a convex body of class $C^2.$ For a point x on the boundary $\partial K$ of $K,$ we denote by $N(x)$ the unique outward unit normal vector of K at $x.$ The map $N_K: \partial K \to S^{n - 1}$ is called the spherical image map or Gauss map of K and is of class $C^1.$ Its differential is called the Weingarten map. The eigenvalues of the Weingarten map are the principal curvatures $k_i(x)$ of K at $x.$
The jth normalized elementary symmetric functions of the principal curvatures are denoted by $H_j$ . They are defined as follows:
for $j = 1, \dots , n-1$ and $H_0 = 1.$ Note that
is the mean curvature, that is, the average of principal curvatures, and
is the Gauss curvature.
We say that K is of class $C^2_+$ if K is of class $C^2$ and the Gauss map $\nu $ is a diffeomorphism. This means in particular that $N_K$ has a smooth inverse. This assumption is stronger than just $C^2,$ and is equivalent to the assumption that all principal curvatures are strictly positive, or that the Gauss curvature $H_{n-1} \ne 0.$ It also means that the differential of $N_K$ , i.e., the Weingarten map, is of maximal rank everywhere.
Let K be of class $C^2_+$ . For $u \in {\mathbb {R}}^n \setminus \{0\}$ , let $\xi _K(u)$ be the unique point on the boundary of K at which u is an outward normal vector. The map $\xi _K$ is defined on ${\mathbb {R}}^n \setminus \{0\}$ . Its restriction to the sphere $S^{n - 1},$ the map $\bar {\xi }_K: S^{n - 1} \to \partial K$ , is called the reverse spherical image map, or reverse Gauss map. The differential of $\bar {\xi }_K$ is called the reverse Weingarten map. The eigenvalues of the reverse Weingarten map are called the principal radii of curvature $r_1, \dots , r_{n - 1}$ of K at $u\in S^{n - 1}.$
The jth normalized elementary symmetric functions of the principal radii of curvature are denoted by $s_j$ . In particular, $s_0=1$ , and for $1 \leq j \leq n-1,$ they are defined by
Note that the principal curvatures are functions on the boundary of K and the principal radii of curvature are functions on the sphere.
Now, we describe the connection between $H_j$ and $s_j.$ For a body K of class $C^2_+$ , we have for $u\in S^{n - 1}$ that $\bar {\xi }_K(u) = N_K^{-1}(u)$ . In particular, the principal radii of curvature are reciprocals of the principal curvatures, that is,
This implies that for $x \in \partial K$ with $N_K(x)=u$ ,
and
for $j = 1, \dots , n-1.$
2.2 Definitions
For $\alpha \in \mathbb {R}$ and $k \in \mathbb {N}$ , the generalized binomial coefficients are defined as
For fixed $k \in \mathbb {N}$ , $m\in \mathbb {N} \cup \{0\}$ and fixed sequence $\vec {i}=\{i_j\}_{j=0}^{n-1}$ such that $i_1+2 i_2+ \cdots (n-1) i_{n-1} =m$ and all $p \in \mathbb {R}$ , $p \neq -n$ , we define
and
Remark 2.1 Note that the above quantities are vanishing for polytopes. Therefore, we will treat only $C^2_+$ convex bodies throughout the text and then
Denote
Let $\mu _{m,k,\vec {i}}$ be the measure on $\partial K$ with density
with respect to the surface measure $\mathcal {H}^{n-1}$ on $\partial K,$ and let $\nu _{m,k,\vec {i}}$ be the measure on $S^{n-1}$ with density
with respect to the surface measure $\mathcal {H}^{n-1}$ on $S^{n-1}$ . To keep notations simple, we mostly write $\mu _{\vec {i}}$ and $\nu _{\vec {i}}$ instead of $\mu _{m,k,\vec {i}}$ and $\nu _{m,k,\vec {i}}$ .
We then define the weighted $L_p$ -affine surface areas by
Definition (1.2) explains that those can be considered as $L_p$ -affine surface area weighted by the measure $\mu _{\vec {i}}$ . In particular,
is a weighted volume of K, weighted by the measure $\mu _{\vec {i}}$ and
is a weighted volume of $K^\circ $ , weighted by the measure $\mu _{\vec {i}}$ , where
is the polar body of K.
We can take another point of view for the measure $d\mathcal {H}^{n-1}$ on $S^{n-1}$ via the density $f_p(K,u) = \left (\frac {f_K(u)}{h_K^{p-1}(u)}\right )^{\frac n{n+p}}$ . This density was introduced by Lutwak [Reference Lutwak23]. Then
2.3 Special cases
Note that
where we denote the sequence $\vec {i} = \{0, \dots , 0\}$ as $0$ .
1. When $k = m$ , we get
2. $m = 0$ implies that $\vec {i} = 0$ and (2.4) simplifies to
(see [Reference Tatarko and Werner34, equation (29)]).
3. For the Euclidean unit ball $B^n_2$ , we get
which does not depend on k. Note that if $i_0 \ne 0$ and $i_1 = \dots = i_{n-1} = 0$ , that is, ${\vec {i} = \{i_0, 0, \dots , 0\}}$ , then $\omega _{m,k,\vec {i}}^p (B^n_2) = {\mathrm {vol}}_{n-1}(\partial B^n_2)$ .
4. If $p = 0$ , we have
When in addition $k = m$ , we get
5. If $p = \infty $ , we get that
3 Properties of the weighted $L_p$ -affine surface areas
3.1 Valuation, invariance, and homogeneity
Proposition 3.1 Let K be a $C^2_+$ convex body in $\mathbb R^n$ with centroid at the origin. Let $p \in \mathbb {R}$ , $p\neq -n$ . Then $\omega _{m,k,\vec {i}}^p (K)$ is an $\left (n\frac {n-p}{n+p}-k\right )$ -homogeneous valuation that is invariant under rotations and reflections.
Proof The proof follows immediately from results in [Reference Tatarko and Werner35]. We present an outline of the proof for completeness.
1. Valuation. As was shown in [Reference Tatarko and Werner35, Theorem 5.9], for all $1 \leq i_1, \dots , i_{n-1} \leq n-1$ , $1\leq j \leq n-1$ , and $\alpha _1, \dots , \alpha _{j} \geq 0$ ,
is a valuation. It immediately follows that $ \omega _{m,k,\vec {i}}^p (K) $ are valuations as the linear combination of valuations is again a valuation.
2. Homogeneity. Similarly to [Reference Tatarko and Werner35, Theorem 5.1] we can show that $\omega _{m, k, \vec {i}}^p (K)$ are homogeneous of order $n\frac {n-p}{n+p} - k$ . Applying [Reference Tatarko and Werner35, Proposition 5.4] with $T = a\, Id$ , we get that $\omega _{m, k, \vec {i}}^p (K) = a^{k - n\frac {n-p}{n+p}} \omega _{m, k, \vec {i}}^p (aK)$ .
3. Invariance. If T is a rotation or a reflection, then $|\det T|=1$ , $\|T^{-1t}(N_{ K}(T^{-1}(y)))\| = \| N_{ K}(T^{-1}(y))\| = 1$ and for all $1 \leq j \leq n-1$ ,
Thus, using these observations and [Reference Tatarko and Werner35, Proposition 5.4], we get
3.2 Inequalities
Theorem 3.2 Let $s\neq -n, r \neq -n, t \neq -n$ be real numbers. Let K be a $C^2_+$ convex body in $\mathbb R^n$ with centroid at the origin.
-
(i) If $\frac {(n+r)(t-s)}{(n+t)(r-s)}>1$ , then
$$ \begin{align*} \mu_{\vec{i}} - as_r(K) \leq \big(\mu_{\vec{i}} - as_t(K) \big)^{\frac{(r-s)(n+t)}{(t-s)(n+r)}} \big(\mu_{\vec{i}} - as_s(K) \big)^{\frac{(t-r)(n+s)}{(t-s)(n+r)}}. \end{align*} $$ -
(ii) If $\frac {(n+r)t}{(n+t)r}>1$ , then
$$ \begin{align*}\frac{\mu_{\vec{i}} - as_r(K)}{\mu_{\vec{i}}-{\mathrm{vol}}_n(K)}\leq n^{\frac{n(t-r)}{t(n+r)}} \bigg(\frac{\mu_{\vec{i}} - as_t(K)}{\mu_{\vec{i}}-{\mathrm{vol}}_n(K)}\bigg)^{\frac{r(n+t)}{{t(n+r)}}}. \end{align*} $$Equality holds in the above inequalities, if and only if K is an ellipsoid.
Proof (i) By Hölder’s inequality—which enforces the condition $\frac {(n+r)(s-t)}{(n+t)(s-r)}>1$ , we then get
(ii) Similarly, again using Hölder’s inequality—which now enforces the condition $~\frac {(n+r)t}{(n+t)r}>1,~$
The equality characterizations follow from the equality characterization of Hölder’s inequality.
Equality holds in (i) and (ii) if and only if equality holds in Hölder’s inequality which happens if and only if
$\mu _{\vec {i}}$ -almost everywhere on $\partial K$ . As $\partial K$ is $C^2_+$ , $\{ x \in \partial K: \mu _{\vec {i}}(x) =0\} = \emptyset $ and therefore
holds for all $x \in \partial K$ . Thus, we can use the following theorem by Petty [Reference Petty27], which then finishes the proof.
Theorem 3.3 [Reference Petty27]
Let K be a convex body in ${\mathbb {R}}^n$ that is $C^2_+$ . K is an ellipsoid if and only if, for all x in $\partial K,$
where $c>0$ is a constant.
Theorem 3.4 Let K be a $C^2_+$ convex body in $\mathbb R^n$ with centroid at the origin. If $r<s<k,$ then
Equality holds if and only if K is a ball.
Proof
Equality holds if and only if equality holds in Hölder’s inequality which happens if and only if $\langle x, N(x)\rangle = \mathrm {constant}$ if and only if K is a ball.
Remark 3.5 When $\vec {i} = 0$ and $k = 0$ , we recover $L_p$ affine isoperimetric inequalities of [Reference Werner and Ye39].
We also obtain monotonicity behaviors.
Corollary 3.6 Let K be a $C^2_+$ convex body in ${\mathbb {R}}^n$ with centroid at the origin. Let $p \ne -n$ be a real number.
(i) The function $p \rightarrow \left (\frac {\omega _{m, k, \vec {i}}^p (K)}{\omega ^0_{m, k, \vec {i}}(K)}\right )^{\frac {n+p}{p}}$ is increasing in $p \in (-n, \infty )$ and $p \in (-\infty , -n)$ .
(ii) The function $p \rightarrow \left (\frac {\omega _{m, k, \vec {i}}^p (K)}{\omega ^0_{m, k, \vec {i}}(K)}\right )^{n+p}$ is decreasing in $p \in (-n, \infty )$ and $p \in (-\infty , -n)$ .
(iii) The inequalities are strict unless K is an ellipsoid.
3.3 Geometric interpretations
We recall several constructions of convex bodies associated with a given convex body K. Namely:
1. Weighted floating bodies [Reference Werner36]
Let K be a convex body in $ {\mathbb R}^n$ and denote by $\lambda $ the Lebesgue measure on $\mathbb {R}^n$ . Let $s \geq 0,$ and let $f:K\rightarrow \mathbb R$ be an integrable function such that $f>0\ \lambda $ -a.e. The weighted floating body $F(K,f,s)$ was defined in [Reference Werner36] (see also [Reference Besau, Ludwig and Werner3–Reference Besau and Werner4]) as the intersection of all closed half-spaces $H^{+}$ whose defining hyperplanes H cut off a set of $(f\, \lambda )$ -measure less than or equal to s from K,
It was shown in [Reference Werner36] that
2. Surface bodies [Reference Schütt and Werner33]
Let K be a convex body, and let $f:\partial K\rightarrow \mathbb R$ be a nonnegative, integrable function with $\int _{\partial K}f \, \mathcal {H}^{n-1}=1$ . The probability measure $\mathbb P_{f}$ is the measure on $\partial K$ with density f. Let $s \geq 0$ . The surface body $S(K, f,s)$ was defined in [Reference Schütt and Werner33] as the intersection of all the closed half-spaces $H^{+}$ whose defining hyperplanes H cut off a set of $\mathbb P_{f}$ -measure less than or equal to s from $\partial K$ ,
It was shown in [Reference Schütt and Werner33] that
3. Random polytopes [Reference Schütt and Werner32]
A random polytope is the convex hull of finitely many points that are chosen with respect to a probability measure. In [Reference Schütt and Werner32], random polytopes are considered where the points are chosen from $\partial K$ with respect to $\mathbb P_{f}$ , where $f:\partial K\rightarrow \mathbb R$ is an integrable, nonnegative function with $\int _{\partial K} f \, d\mathcal {H}^{n-1} =1$ and $ d \,\mathbb {P}_{f} = f \,d\mathcal {H}^{n-1}$ . Then the expected volume of such a random polytope is
where $[x_{1},\dots ,x_{N}]$ is the convex hull of the points $x_{1},\dots ,x_{N}$ . It was shown in [Reference Schütt and Werner32] that under mild smoothness assumptions on K,
where $c_n=\frac {(n-1)^{\frac {n+1}{n-1}}\Gamma \left (n+1+\tfrac {2}{n-1}\right )} {2(n+1)!(\mathrm {vol}_{n-2}(\partial B_{2}^{n-1}))^{\frac {2}{n-1}}}.$
4. Ulam floating bodies [Reference Huang, Slomka and Werner19]
Given a Borel measure $\mu $ on ${\mathbb {R}}^{n}$ , the metronoid associated with $\mu $ was introduced by Huang and Slomka [Reference Huang and Slomka18] and is the convex set defined by
where the union is taken over all functions $0\le f\le 1$ for which $\int _{{\mathbb {R}}^{n}}f d{\mu =1}$ and $\int _{{\mathbb {R}}^{n}}yf\left (y\right )d{\mu }\left (y\right )$ exists. The metronoid $\mathcal {M}_{\delta }(K)$ is generated by the uniform measure on K with total mass $\delta ^{-1}\left |K\right |$ . Namely, let be the characteristic function of K, and let $\mu $ be the measure whose density with respect to Lebesgue measure is . Then $\mathcal {M}_{\delta }(K):=\mathcal {M}(\mu )$ . In [Reference Huang, Slomka and Werner19] weighted variations of $\mathcal {M}_{\delta }(K)$ were defined where the weight is given by a positive continuous function $f:K\to {\mathbb {R}}$ ,
It was shown in [Reference Huang, Slomka and Werner19] that
This leads to geometric interpretations of the weighted $L_p$ -affine surface areas in terms of these associated bodies. That is, if we let
in the case of weighted floating bodies and Ulam floating bodies, and
in the case of surface bodies and random polytopes, respectively, and denote by $K_{f,s}$ the corresponding associated body, i.e.,
then, with the properly adjusted constant $c_n$ , we get the following proposition.
Proposition 3.7 Let K be a $C^2_+$ convex body in ${\mathbb {R}}^n$ with centroid at the origin. Let $p \ne -n$ be a real number.
3.4 Asymptotics for the weighted $L_p$ -affine surface areas
Let $(X, \mu )$ be a measure space, and let $dP = p d\mu $ and $Q = q d\mu $ be measures in X that are absolutely continuous with respect to the measure $\mu $ . The Kullback–Leibler divergence or relative entropy from P to Q is defined as in [Reference Cover and Thomas11]
The information inequality (also called Gibb’s inequality) [Reference Cover and Thomas11] holds for the Kullback–Leibler divergence. Let P and Q be as above, then
with equality if and only if $P=Q$ .
We will apply this when $(X, \mu ) = (\partial K, \mu _{\vec {i}})$ , where K is $C^2_+$ and densities p and q with respect to $\mu _{\vec {i}}$ given by
We let
Recall that classical cone measure $cm_K$ on $\partial K$ is defined as follows: for every measurable set $A \subseteq \partial K,$
It is wellknown (see, e.g., [Reference Paouris and Werner26] for a proof) that the measures $\frac {H_{n-1}}{\langle x,N(x)\rangle ^n}\mathcal {H}^{n-1}$ and $\langle x,N(x)\rangle \mathcal {H}^{n-1}$ are the cone measures of K and $K^\circ $ ,
The interpretation of $cm_{K^\circ }$ as the “cone measure of $K^\circ $ ” is via the Gauss map on $K^\circ $ , $N_{K^\circ }: \partial K^\circ \rightarrow S^{n-1}, \ x \longmapsto N_{K^\circ }(x)$ and the inverse of the Gauss map on $K N_{K}: \partial K \rightarrow S^{n-1}, \ x \longmapsto N_{K}(x)$ ,
where we use $N_K(x)$ to emphasize that it is the normal vector of a body K at $x \in \partial K$ .
We now define the weighted cone measures $\mu _{\vec {i}} -cm_K$ of K and $\mu _{\vec {i}} -cm_{K^\circ }$ of $K^\circ $ by
We show that the limits of the weighted $L_p$ affine surface areas are entropy powers.
Theorem 3.8 Let K be a $C^2_+$ convex body in $\mathbb R^n$ with centroid at the origin.
(i)
(ii)
Proof (i) We use L’Hospital’s rule
(ii) We use L’Hospital’s rule
4 f-divergence
The results in the previous section lead naturally to consider more general f-divergences than just the Kullback–Leibler divergence.
4.1 Background on f-divergence
In information theory, probability theory and statistics, an f-divergence is a functional that measures the difference between two (probability) distributions. This notion was introduced by Csiszár [Reference Csiszár12], and independently Morimoto [Reference Morimoto25] and Ali and Silvery [Reference Ali and Silvey2].
Let $(X, \mu )$ be a measure space, and let $P=p \mu $ and $Q=q \mu $ be (probability) measures on X that are absolutely continuous with respect to the measure $\mu $ . Let $f: (0, \infty ) \rightarrow \mathbb {R}$ be a convex or a concave function. The $*$ -adjoint function $f^*:(0, \infty ) \rightarrow \mathbb {R}$ of f is defined by
It is obvious that $(f^*)^*=f$ and that $f^*$ is again convex, if f is convex, respectively, concave, if f is concave. Then the f-divergence $D_f(P,Q)$ of the measures P and Q is defined by
provided the expressions exist. Here,
We make the convention that $0 \cdot \infty =0$ .
Please note that
With (4.3) and as
we can write in short
Examples of f-divergences are as follows:
1. For $f(t) = t \ln t$ (with $*$ -adjoint function $f^*(t) = - \ln t$ ), the f-divergence is Kullback–Leibler divergence or relative entropy from P to Q (see [Reference Cover and Thomas11])
2. For the convex or concave functions $f(t) = t^\alpha ,$ we obtain the Hellinger integrals (e.g., [Reference Liese and Vajda20])
Those are related to the Rényi divergence of order $\alpha $ , $\alpha \neq 1$ , introduced by Rényi [Reference Rényi28] (for $\alpha>0$ ) as
The case $\alpha =1$ is the relative entropy $ D_{KL}(P\|Q)$ .
More on f-divergence can be found in, e.g., [Reference Caglar, Kolesnikov and Werner7–Reference Caglar and Werner9, Reference Liese and Vajda20, Reference Werner37, Reference Werner38, Reference Werner and Ye40].
4.2 f-divergence for the $\mu _{\vec {i}}$ -measure
In [Reference Werner38], f-divergence with respect to the surface area measure $\mu _K$ was introduced for a convex body K in $\mathbb {R}^n$ with $0$ in its interior. We now introduce similarly f-divergence with respect to the measure $\mu _{\vec {i}}$ .
Definition 4.1 Let $f: (0, \infty ) \rightarrow \mathbb {R}$ be a convex or concave function. Let $p_K$ and $q_K$ be as in (3.5). Then the f-divergence $D_f \left (P_K, Q_K \right )$ of a convex body K in $\mathbb {R}^n$ with respect to the $\mu _{\vec {i}}$ -measure is
In particular, for $f(t)=t \log t,$ we recover the above Kullback–Leibler divergences of Section 3.4 and for $f(t) = t^{\frac {p}{n+p}}$ , we obtain the weighted $L_p$ -affine surface areas.
Remarks (i) By (4.4),
(ii) f-divergences can also be expressed as integrals over $S^{n-1}$ ,
and
(iii) Similar to (4.10) and (4.11), one can define mixed f-divergences for n convex bodies in $\mathbb {R}^n$ . We will not treat those here but will concentrate on f-divergence for one convex body. We also refer to [Reference Ye41], where they have been investigated for functions in $Conv(0,\infty )$ .
Proposition 4.1 Let $f: (0, \infty ) \rightarrow \mathbb {R}$ be a concave function. Then
If f is convex, the inequality is reversed. If f is linear, equality holds in Proposition 4.1. If f is not linear, equality holds iff K is an ellipsoid.
Proof Let $f: (0, \infty ) \rightarrow \mathbb {R}$ be a concave function. By Jensen’s inequality,
Equality holds in Jensen’s inequality iff either f is linear or $\frac {p_K}{q_K}$ is constant. Indeed, if $f(t) = at +b$ , then
If f is not linear, equality holds iff $\frac {p_K}{q_K}=c$ , where c is a constant. By the above quoted Theorem 3.3 of Petty, this holds iff K is an ellipsoid. Note that the constant c is different from $0$ , as we assume that K is $C^2_+$ .
Now we show that $D_f({P}_K,{Q}_K)$ are valuations, i.e., for convex bodies K and L such that $K \cup L $ is convex,
Proposition 4.2 Let K be a convex body in ${\mathbb {R}}^n$ with the origin in its interior, and let $f:~(0, \infty ) \rightarrow \mathbb {R}$ be a convex function. Then $D_f(P_K,Q_K)$ are valuations.
Proof To prove (4.12), we proceed as in Schütt [Reference Schütt31]. For completeness, we include the argument. We decompose
where all unions on the right-hand side are disjoint. Note that for x such that the curvatures $\kappa _K(x)$ , $\kappa _L(x)$ , $\kappa _{K\cup L} (x),$ and $\kappa _{K\cap L} (x)$ exist,
and
To prove (4.12), we split the involved integral using the above decompositions (4.13) and (4.14).