Jensen–Steffensen inequality for strongly convex functions

M Klaričić Bakula

doi:10.1186/s13660-018-1897-2

. 2018 Nov 12;2018(1):306. doi: 10.1186/s13660-018-1897-2

Jensen–Steffensen inequality for strongly convex functions

M Klaričić Bakula ^1,^✉

PMCID: PMC6244717 PMID: 30839851

Abstract

The Jensen inequality for convex functions holds under the assumption that all of the included weights are nonnegative. If we allow some of the weights to be negative, such an inequality is called the Jensen–Steffensen inequality for convex functions. In this paper we prove the Jensen–Steffensen inequality for strongly convex functions.

Keywords: Strongly convex functions, Jensen inequality, Jensen–Steffensen inequality

Introduction

Let $I \subset R$ be an interval. It is well known that if a function $f : I \to R$ is convex, then

f (\sum_{i = 1}^{n} p_{i} x_{i}) \leq \sum_{i = 1}^{n} p_{i} f (x_{i})

1.1

for all $n \in N$ , $x_{1}, \dots, x_{n} \in I$ , and $p_{1}, \dots, p_{n} > 0$ with $p_{1} + \dots + p_{n} = 1$ . If f is strictly convex, then (1.1) is strict unless all $x_{i}$ are equal [7, p. 43]. This classical Jensen inequality is one of the most important inequalities in convex analysis, and it has various applications in mathematics, statistics, economics, and engineering sciences.

It is also known that the assumption $p_{1}, \dots, p_{n} > 0$ can be relaxed at the expense of restricting $x_{1}, \dots, x_{n}$ more severely [9]. Namely, if $p = (p_{1}, \dots, p_{n})$ is a real n-tuple such that for every $k \in {1, \dots, n}$

0 \leq p_{1} + \dots + p_{k} \leq p_{1} + \dots + p_{n} = 1,

1.2

then for any monotonic n-tuple $x = (x_{1}, \dots, x_{n}) \in I^{n}$ (increasing or decreasing) we get

\overline{x} = p_{1} x_{1} + \dots + p_{n} x_{n} \in I,

and for any function f convex on I (1.1) still holds. Under such assumptions (1.1) is called the Jensen–Steffensen inequality for convex functions and (1.2) are called Steffensen’s conditions due to J. F. Steffensen. Again, for a strictly convex function f, (1.1) remains strict under certain additional assumptions on x and p [1]. It is needless to say that a mathematical mind has to question the limitation $p_{1}, \dots, p_{n} > 0$ even if in the usual practice we can cope with it.

Variants of the Jensen inequality are proved for various classes of generalized convex functions, and the class of strongly convex functions is among them. Recall that a function $f : I \to R$ is called strongly convex with modulus $c > 0$ if

f (t x + (1 - t) y) \leq t f (x) + (1 - t) f (y) - c t (1 - t) {(x - y)}^{2}

1.3

for all $x, y \in I$ and $t \in [0, 1]$ [8]. Strongly convex functions are useful in optimization theory, mathematical economics and approximation theory, and an interested reader can find more about them in an excellent survey paper [5].

As we can easily see, strong convexity is a strengthening of the notion of convexity, and some properties of strongly convex functions are just “stronger versions” of analogous properties of convex functions (for more details, see [5]). One example of such a stronger version is the Jensen inequality for strongly convex functions (see [4] or [5]). If $f : I \to R$ , $I \subset R$ , is strongly convex with modulus c, then

f (\sum_{i = 1}^{n} p_{i} x_{i}) \leq \sum_{i = 1}^{n} p_{i} f (x_{i}) - c \sum_{i = 1}^{n} p_{i} {(x_{i} - \bar{x})}^{2}

1.4

for all $x_{1}, \dots, x_{n} \in I$ and all $p_{1}, \dots, p_{n} > 0$ such that $p_{1} + \dots + p_{n} = 1$ . If we compare (1.4) with (1.1), we see that (1.4) provides a better upper bound for $f (\bar{x})$ since the term $c \sum_{i = 1}^{n} p_{i} {(x_{i} - \bar{x})}^{2}$ is always nonnegative. Of course, if $c = 0$ , we go right back to convex functions and (1.1).

We must emphasize here that proving a Jensen type inequality for some class of generalized convex functions does not necessarily mean that such inequality holds under Steffensen’s conditions. The goal of this paper is to prove that for the class of strongly convex functions this is not the case.

Main result

Strongly convex functions have a very useful characterization: they always have a specific convex representation. This is stated in the following theorem (see [3] or [6]).

Theorem 1

Let I be an interval in $R$ . A function $f : I \to R$ is strongly convex with modulus c if and only if the function $g = f - c {(\cdot)}^{2}$ is convex.

The Jensen inequality for strongly convex functions can be proved either using Theorem 1 and the Jensen inequality for convex functions or (for I open) directly, using the “support parabola” property [5, Theorem 1]. In this section we prove the Jensen–Steffensen inequality for strongly convex functions using Theorem 1.

In the rest of the paper we use the following notation related to the n-tuples $x = (x_{1}, \dots, x_{n})$ and $p = (p_{1}, \dots, p_{n}), n \in N$ :

\begin{aligned} \bar{x} = p_{1} x_{1} + \dots + p_{n} x_{n}, \\ P_{k} = p_{1} + \dots + p_{k}, k \in {1, 2, \dots, n}, \\ {\overline{P}}_{k} = p_{k} + \dots + p_{n}, k \in {1, 2, \dots, n} . \end{aligned}

Theorem 2

Let I be an interval in $R$ . If $f : I \to R$ is a strongly convex function with modulus c, then for every monotonic n-tuple $x = (x_{1}, \dots, x_{n}) \in I^{n}$ and every real n-tuple $p = (p_{1}, \dots, p_{n})$ such that, for every $i \in {1, 2, \dots, n}$ ,

0 \leq P_{i} \leq P_{n} = 1

the following inequality holds:

f (\sum_{i = 1}^{n} p_{i} x_{i}) \leq \sum_{i = 1}^{n} p_{i} f (x_{i}) - c \sum_{i = 1}^{n} p_{i} {(x_{i} - \bar{x})}^{2} .

Proof

Suppose that x is increasing (for x decreasing the proof is analogous). It can be easily seen that Steffensen’s conditions yield

{\overline{P}}_{k} \geq 0, k \in {1, 2, \dots, n},

and

x_{n} - \bar{x} = P_{n} (x_{n} - \bar{x}) = \sum_{i = 1}^{n - 1} P_{i} (x_{i + 1} - x_{i}) \geq 0,

hence we obtain $\bar{x} \leq x_{n}$ . Analogously,

\bar{x} - x_{1} = P_{n} (\bar{x} - x_{1}) = \sum_{i = 2}^{n} {\overline{P}}_{i} (x_{i} - x_{i - 1}) \geq 0,

and $x_{1} \leq \bar{x}$ . From that we may conclude $\bar{x} \in [x_{1}, x_{n}] \subset I$ , which means that $g (\bar{x}) = g (\sum_{i = 1}^{n} p_{i} x_{i})$ is defined.

Using the convex representation $g = f - c {(\cdot)}^{2}$ as in Theorem 1 and applying the Jensen–Steffensen inequality for convex functions, we obtain

g (\sum_{i = 1}^{n} p_{i} x_{i}) \leq \sum_{i = 1}^{n} p_{i} g (x_{i}) .

Returning back to f, we get

\begin{aligned} f (\sum_{i = 1}^{n} p_{i} x_{i}) - c {(\sum_{i = 1}^{n} p_{i} x_{i})}^{2} & \leq \sum_{i = 1}^{n} p_{i} (f (x_{i}) - c x_{i}^{2}) \\ = \sum_{i = 1}^{n} p_{i} f (x_{i}) - c \sum_{i = 1}^{n} p_{i} x_{i}^{2}, \end{aligned}

or written differently

\begin{aligned} f (\sum_{i = 1}^{n} p_{i} x_{i}) & \leq \sum_{i = 1}^{n} p_{i} f (x_{i}) - c [\sum_{i = 1}^{n} p_{i} x_{i}^{2} - {(\sum_{i = 1}^{n} p_{i} x_{i})}^{2}] \\ = \sum_{i = 1}^{n} p_{i} f (x_{i}) - c [\sum_{i = 1}^{n} p_{i} x_{i}^{2} - {\bar{x}}^{2}] \\ = \sum_{i = 1}^{n} p_{i} f (x_{i}) - c [\sum_{i = 1}^{n} p_{i} x_{i}^{2} - 2 {\bar{x}}^{2} + {\bar{x}}^{2}] \\ = \sum_{i = 1}^{n} p_{i} f (x_{i}) - c [\sum_{i = 1}^{n} p_{i} x_{i}^{2} - 2 \bar{x} \sum_{i = 1}^{n} p_{i} x_{i} + {\bar{x}}^{2} \sum_{i = 1}^{n} p_{i}] \\ = \sum_{i = 1}^{n} p_{i} f (x_{i}) - c \sum_{i = 1}^{n} p_{i} {(x_{i} - \bar{x})}^{2} . \end{aligned}

□

Alternative reproach

What would happen if we try to prove (1.4) under Steffensen’s conditions directly using the support parabola property? The question is not without sense since in the case of the Jensen inequality for strongly convex functions both ways produce the same inequality as in (1.4) but, generally speaking, any negative weights in p can at some place interrupt the chain of conclusions in a proof. This is exactly the reason why it is considerably more difficult to prove (1.1) under Steffensen’s conditions. We will see what happens in this case in the next theorem, but first we need the following lemma which basically says that the support parabola in $x_{0}$ can be “shifted up” from $x_{0}$ to y and still remain “under” $f (x)$ if $x \leq y \leq x_{0}$ .

Lemma 1

Let $I \subset R$ be an open interval, let $f : I \to R$ be a strongly convex function with modulus c, and for $x_{0} \in I$ let

y = f (x_{0}) + λ (x - x_{0}) + c {(x - x_{0})}^{2}

3.1

be the support parabola for f in $x_{0}$ . Then for every $x, y \in I$ such that $x \leq y \leq x_{0}$

f (x) - f (y) \geq λ (x - y) + c {(x - y)}^{2},

3.2

and for $x, y \in I$ such that $x_{0} \leq x \leq y$

f (y) - f (x) \geq λ (y - x) + c {(y - x)}^{2} .

3.3

Proof

Since (3.1) is a support parabola for f in $x_{0}$ , it follows that for every $x \in I$

f (x) - f (x_{0}) \geq λ (x - x_{0}) + c {(x - x_{0})}^{2} .

3.4

Let $x, y \in I$ be such that $x < y < x_{0}$ . The middle element y can be represented as a convex combination of x and z in the following way:

y = \frac{x_{0} - y}{x_{0} - x} x + \frac{y - x}{x_{0} - x} x_{0} .

From the definition of strong convexity we have

f (y) \leq \frac{x_{0} - y}{x_{0} - x} f (x) + \frac{y - x}{x_{0} - x} f (x_{0}) - c \frac{x_{0} - y}{x_{0} - x} \frac{y - x}{x_{0} - x} {(x - x_{0})}^{2},

and since

\frac{x_{0} - y}{x_{0} - x} + \frac{y - x}{x_{0} - x} = 1,

we can write

\begin{aligned} f (y) & = \frac{x_{0} - y}{x_{0} - x} f (y) + \frac{y - x}{x_{0} - x} f (y) \\ \leq \frac{x_{0} - y}{x_{0} - x} f (x) + \frac{y - x}{x_{0} - x} f (x_{0}) - c \frac{x_{0} - y}{x_{0} - x} \frac{y - x}{x_{0} - x} {(x - x_{0})}^{2} . \end{aligned}

After a simple calculation we obtain

(x_{0} - y) (f (x) - f (y)) \geq (x - y) (f (x_{0}) - f (y)) + c \frac{(x_{0} - y) (y - x)}{x_{0} - x} {(x - x_{0})}^{2}

and

\frac{f (x) - f (y)}{x - y} \leq \frac{f (x_{0}) - f (y)}{x_{0} - y} - c (x_{0} - x) .

3.5

The support parabola property (3.4) gives

f (y) - f (x_{0}) \geq λ (y - x_{0}) + c {(y - x_{0})}^{2},

and since $y - x_{0} < 0$

\frac{f (x_{0}) - f (y)}{x_{0} - y} \leq λ - c (x_{0} - y) .

Using the above inequality and (3.5), we obtain

\begin{aligned} \frac{f (x) - f (y)}{x - y} & \leq \frac{f (x_{0}) - f (y)}{x_{0} - y} - c (x_{0} - x) \\ \leq λ - c (x_{0} - y) - c (x_{0} - x) = λ + c (x + y - 2 x_{0}) . \end{aligned}

Since $x - y < 0$ we get

f (x) - f (y) \geq λ (x - y) + c (x - y) (x + y - 2 x_{0}),

and because of $x + y - 2 x_{0} < x - y$ , we end up with

f (x) - f (y) \geq λ (x - y) + c {(x - y)}^{2} .

If $x_{0} < x < y$ , in an analogous way we can prove

f (y) - f (x) \geq λ (y - x) + c {(y - x)}^{2} .

Note that the above inequalities still hold in the trivial way if $x = y$ . □

Remark 1

(3.2) and (3.3) can be also proved using the convex representation $g = f - c {(\cdot)}^{2}$ . We start from the support parabola property in $x_{0} \in I$

f (x) - f (x_{0}) \geq λ (x - x_{0}) + c {(x - x_{0})}^{2} .

Then

g (x) - g (x_{0}) + c x^{2} - c x_{0}^{2} \geq λ (x - x_{0}) + c {(x - x_{0})}^{2},

that is,

\begin{aligned} g (x) - g (x_{0}) & \geq λ (x - x_{0}) + c {(x - x_{0})}^{2} - c x^{2} + c x_{0}^{2} \\ = (λ - 2 c x_{0}) (x - x_{0}) = λ^{'} (x - x_{0}), \end{aligned}

hence g has a support line in $x_{0}$ for $λ^{'} = λ - 2 c x_{0}$ . Since g is convex, we know that for every $x_{0} \leq x \leq y$ [7]

g (y) - g (x) \geq λ^{'} (y - x) = (λ - 2 c x_{0}) (y - x) .

Returning to f, we obtain

f (y) - c y^{2} - f (x) + c x^{2} \geq (λ - 2 c x_{0}) (y - x),

hence

\begin{aligned} f (y) - f (x) & \geq (λ - 2 c x_{0}) (y - x) + c y^{2} - c x^{2} \\ = λ (y - x) + c (y - x) (x + y - 2 x_{0}) \\ \geq λ (y - x) + c (y - x) (x + y - 2 x) \\ = λ (y - x) + c {(y - x)}^{2} . \end{aligned}

Consequently,

f (y) - f (x) \geq λ (y - x) + c {(y - x)}^{2}, x_{0} \leq x \leq y .

Analogously, we can prove

f (x) - f (y) \geq λ (x - y) + c {(x - y)}^{2}, x \leq y \leq x_{0} .

Theorem 3

Let $I \subset R$ be an open interval. If $f : I \to R$ is a strongly convex function with modulus c, then for every monotonic n-tuple $x = (x_{1}, \dots, x_{n}) \in I^{n}$ and every real n-tuple $p = (p_{1}, \dots, p_{n})$ such that for every $i \in {1, 2, \dots, n}$

0 \leq P_{i} \leq P_{n} = 1,

there exists $k \in {1, \dots, n - 1}$ such that $\bar{x} \in [x_{k}, x_{k + 1}]$ for x increasing or $\bar{x} \in [x_{k + 1}, x_{k}]$ for x decreasing, and

\begin{aligned} \sum_{i = 1}^{n} p_{i} f (x_{i}) - f (\sum_{i = 1}^{n} p_{i} x_{i}) \\ \geq c [\sum_{i = 1}^{k - 1} P_{i} {(x_{i} - x_{i + 1})}^{2} + P_{k} {(x_{k} - \bar{x})}^{2} + {\overline{P}}_{k + 1} {(x_{k + 1} - \bar{x})}^{2} + \sum_{i = k + 2}^{n} {\overline{P}}_{i} {(x_{i} - x_{i - 1})}^{2}] \\ \geq 0 . \end{aligned}

Proof

Suppose that x is increasing (for x decreasing the proof is analogous).

First observe that as in Theorem 2 we know that $\bar{x} \in [x_{1}, x_{n}] \subset I$ , and we may conclude that there exists some $k \in {1, \dots, n - 1}$ such that $\bar{x} \in [x_{k}, x_{k + 1}]$ .

From (3.4), choosing $x_{0} = \bar{x}$ , we get

f (x) - f (\bar{x}) \geq λ (x - \bar{x}) + c {(x - \bar{x})}^{2}

for some $λ \in R$ and every $x \in I$ .

Next we use the Abel transformation to obtain the identities (similar can be found in [1])

\begin{aligned} 0 = & \sum_{i = 1}^{n} p_{i} x_{i} - \bar{x} \\ = & \sum_{i = 1}^{k - 1} P_{i} (x_{i} - x_{i + 1}) + P_{k} (x_{k} - \bar{x}) \\ + {\overline{P}}_{k + 1} (x_{k + 1} - \bar{x}) + \sum_{i = k + 2}^{n} {\overline{P}}_{i} (x_{i} - x_{i - 1}) \end{aligned}

3.6

and

\begin{aligned} \sum_{i = 1}^{n} p_{i} f (x_{i}) - f (\bar{x}) \\ = \sum_{i = 1}^{k - 1} P_{i} (f (x_{i}) - f (x_{i + 1})) + P_{k} (f (x_{k}) - f (\bar{x})) \\ + {\overline{P}}_{k + 1} (f (x_{k + 1}) - f (\bar{x})) + \sum_{i = k + 2}^{n} {\overline{P}}_{i} (f (x_{i}) - f (x_{i - 1})), \end{aligned}

3.7

where in the case $k = 1$ we assume $\sum_{i = 1}^{k - 1}$ to be 0, while in the case $k = n - 1$ we assume $\sum_{i = k + 2}^{n}$ to be 0.

From (3.7), using (3.2), (3.3), and then (3.6), we get

\begin{aligned} \sum_{i = 1}^{n} p_{i} f (x_{i}) - f (\bar{x}) \\ \geq \sum_{i = 1}^{k - 1} P_{i} (λ (x_{i} - x_{i + 1}) + c {(x_{i} - x_{i + 1})}^{2}) + P_{k} (λ (x_{k} - \bar{x}) + c {(x_{k} - \bar{x})}^{2}) \\ + {\overline{P}}_{k + 1} (λ (x_{k + 1} - \bar{x}) + c {(x_{k + 1} - \bar{x})}^{2}) + \sum_{i = k + 2}^{n} {\overline{P}}_{i} (λ (x_{i} - x_{i - 1}) + c {(x_{i} - x_{i - 1})}^{2}) \\ = λ [\sum_{i = 1}^{k - 1} P_{i} (x_{i} - x_{i + 1}) + P_{k} (x_{k} - \bar{x}) + {\overline{P}}_{k + 1} (x_{k + 1} - \bar{x}) + \sum_{i = k + 2}^{n} {\overline{P}}_{i} (x_{i} - x_{i - 1})] \\ + c [\sum_{i = 1}^{k - 1} P_{i} {(x_{i} - x_{i + 1})}^{2} + P_{k} {(x_{k} - \bar{x})}^{2} + {\overline{P}}_{k + 1} {(x_{k + 1} - \bar{x})}^{2} + \sum_{i = k + 2}^{n} {\overline{P}}_{i} {(x_{i} - x_{i - 1})}^{2}] \\ = c [\sum_{i = 1}^{k - 1} P_{i} {(x_{i} - x_{i + 1})}^{2} + P_{k} {(x_{k} - \bar{x})}^{2} + {\overline{P}}_{k + 1} {(x_{k + 1} - \bar{x})}^{2} + \sum_{i = k + 2}^{n} {\overline{P}}_{i} {(x_{i} - x_{i - 1})}^{2}] . \end{aligned}

□

It was hopeful to think that this way we can end up with

\sum_{i = 1}^{n} p_{i} f (x_{i}) - f (\bar{x}) \geq c \sum_{i = 1}^{n} p_{i} {(x_{i} - \bar{x})}^{2}

since this is exactly what happens in the analogous proofs (direct and indirect) for convex functions. It would be possible if

\begin{aligned} \sum_{i = 1}^{k - 1} P_{i} {(x_{i} - x_{i + 1})}^{2} + P_{k} {(x_{k} - \bar{x})}^{2} + {\overline{P}}_{k + 1} {(x_{k + 1} - \bar{x})}^{2} + \sum_{i = k + 2}^{n} {\overline{P}}_{i} {(x_{i} - x_{i - 1})}^{2} \\ \geq \sum_{i = 1}^{n} p_{i} {(x_{i} - \bar{x})}^{2}, \end{aligned}

3.8

but sadly this is not generally true.

Example 1

Let $x = (1, 2, 3, 4)$ , $p = (1, - 1, 0, 1)$ . Then

\begin{aligned} P_{1} = 1, P_{2} = 0, P_{3} = 0, P_{4} = 1, \\ {\overline{P}}_{1} = 1, {\overline{P}}_{2} = 0, {\overline{P}}_{3} = 1, {\overline{P}}_{4} = 1, \\ \bar{x} = 3 \in [2, 3], k = 2 (or k = 3), \\ \sum_{i = 1}^{1} P_{i} {(x_{i} - x_{i + 1})}^{2} + P_{2} {(x_{2} - \bar{x})}^{2} + {\overline{P}}_{3} {(x_{3} - \bar{x})}^{2} + \sum_{i = 4}^{4} {\overline{P}}_{i} {(x_{i} - x_{i - 1})}^{2} \\ = {(1 - 2)}^{2} + 0 + {(3 - 3)}^{2} + {(4 - 3)}^{2} = 2, \\ \sum_{i = 1}^{4} p_{i} {(x_{i} - \bar{x})}^{2} = {(1 - 3)}^{2} - {(2 - 3)}^{2} + 0 + {(4 - 3)}^{2} = 4 > 2 . \end{aligned}

In fact, the following theorem holds.

Theorem 4

Let $f, p, x$ , and k be as in Theorem 3. Then

\begin{aligned} \sum_{i = 1}^{k - 1} P_{i} {(x_{i} - x_{i + 1})}^{2} + P_{k} {(x_{k} - \bar{x})}^{2} + {\overline{P}}_{k + 1} {(x_{k + 1} - \bar{x})}^{2} + \sum_{i = k + 2}^{n} {\overline{P}}_{i} {(x_{i} - x_{i - 1})}^{2} \\ \leq \sum_{i = 1}^{n} p_{i} {(x_{i} - \bar{x})}^{2} . \end{aligned}

Proof

For the sake of simplicity, we introduce the following notation:

\begin{aligned} I_{k} = \sum_{i = 1}^{k - 1} P_{i} {(x_{i} - x_{i + 1})}^{2} + P_{k} {(x_{k} - \bar{x})}^{2} + {\overline{P}}_{k + 1} {(x_{k + 1} - \bar{x})}^{2} + \sum_{i = k + 2}^{n} {\overline{P}}_{i} {(x_{i} - x_{i - 1})}^{2}, \\ \overline{x^{2}} = \sum_{i = 1}^{n} p_{i} x_{i}^{2} . \end{aligned}

Suppose that x is increasing (for x decreasing the proof is analogous). First note that for k as in Theorem 3 we have

\begin{aligned} x_{i} \leq \bar{x}, i = 1, 2, \dots, k, \\ \bar{x} \leq x_{i}, i = k + 1, \dots, n . \end{aligned}

Using this notation, we get

\begin{aligned} I_{k} = & \sum_{i = 1}^{k - 1} P_{i} (x_{i}^{2} - x_{i + 1}^{2}) + P_{k} (x_{k}^{2} - \overline{x^{2}}) + {\overline{P}}_{k + 1} (x_{k + 1}^{2} - \overline{x^{2}}) + \sum_{i = k + 2}^{n} {\overline{P}}_{i} (x_{i}^{2} - x_{i - 1}^{2}) \\ - 2 \sum_{i = 1}^{k - 1} P_{i} x_{i} x_{i + 1} + 2 \sum_{i = 1}^{k - 1} P_{i} x_{i + 1}^{2} + P_{k} (- 2 x_{k} \bar{x} + {\bar{x}}^{2}) + P_{k} \overline{x^{2}} + {\overline{P}}_{k + 1} \overline{x^{2}} \\ + {\overline{P}}_{k + 1} (- 2 x_{k + 1} \bar{x} + {\bar{x}}^{2}) - 2 \sum_{i = k + 2}^{n} {\overline{P}}_{i} x_{i} x_{i - 1} + 2 \sum_{i = k + 2}^{n} {\overline{P}}_{i} x_{i - 1}^{2} . \end{aligned}

Applying (3.6) on p and $x^{2} = (x_{1}^{2}, \dots, x_{n}^{2})$ , we obtain

\sum_{i = 1}^{k - 1} P_{i} (x_{i}^{2} - x_{i + 1}^{2}) + P_{k} (x_{k}^{2} - \overline{x^{2}}) + {\overline{P}}_{k + 1} (x_{k + 1}^{2} - \overline{x^{2}}) + \sum_{i = k + 2}^{n} {\overline{P}}_{i} (x_{i}^{2} - x_{i - 1}^{2}) = 0,

hence

\begin{aligned} I_{k} = & 2 \sum_{i = 1}^{k - 1} P_{i} x_{i + 1} (x_{i + 1} - x_{i}) + 2 \sum_{i = k + 2}^{n} {\overline{P}}_{i} x_{i - 1} (x_{i - 1} - x_{i}) + P_{k} \overline{x^{2}} + {\overline{P}}_{k + 1} \overline{x^{2}} \\ + P_{k} (- 2 x_{k} \bar{x} + {\bar{x}}^{2}) + {\overline{P}}_{k + 1} (- 2 x_{k + 1} \bar{x} + {\bar{x}}^{2}) \\ = & 2 \sum_{i = 1}^{k - 1} P_{i} x_{i + 1} (x_{i + 1} - x_{i}) + 2 \sum_{i = k + 2}^{n} {\overline{P}}_{i} x_{i - 1} (x_{i - 1} - x_{i}) + \overline{x^{2}} \\ + P_{k} (- 2 x_{k} \bar{x} + {\bar{x}}^{2}) + {\overline{P}}_{k + 1} (- 2 x_{k + 1} \bar{x} + {\bar{x}}^{2}) . \end{aligned}

Taking into account that x is increasing and

\begin{aligned} P_{i}, {\overline{P}}_{i} \geq 0, i = 1, 2, \dots, n, \\ x_{i} \leq \bar{x}, i = 1, 2, \dots, k, \\ \bar{x} \leq x_{i}, i = k + 1, \dots, n, \end{aligned}

we obtain

\begin{aligned} I_{k} \leq & 2 \bar{x} \sum_{i = 1}^{k - 1} P_{i} (x_{i + 1} - x_{i}) + 2 \bar{x} \sum_{i = k + 2}^{n} {\overline{P}}_{i} (x_{i - 1} - x_{i}) + \overline{x^{2}} \\ - 2 P_{k} x_{k} \bar{x} - 2 {\overline{P}}_{k + 1} x_{k + 1} \bar{x} + {\bar{x}}^{2} . \end{aligned}

Applying again (3.6) on p and x, we get

\sum_{i = 1}^{k - 1} P_{i} (x_{i + 1} - x_{i}) + \sum_{i = k + 2}^{n} {\overline{P}}_{i} (x_{i - 1} - x_{i}) = P_{k} (x_{k} - \bar{x}) + {\overline{P}}_{k + 1} (x_{k + 1} - \bar{x}),

hence

\begin{aligned} I_{k} & \leq 2 \bar{x} [P_{k} (x_{k} - \bar{x}) + {\overline{P}}_{k + 1} (x_{k + 1} - \bar{x})] + \overline{x^{2}} - 2 P_{k} x_{k} \bar{x} - 2 {\overline{P}}_{k + 1} x_{k + 1} \bar{x} + {\bar{x}}^{2} \\ = - 2 P_{k} {\bar{x}}^{2} - 2 {\overline{P}}_{k + 1} {\bar{x}}^{2} + {\bar{x}}^{2} + \overline{x^{2}} = - 2 {\bar{x}}^{2} + {\bar{x}}^{2} + \overline{x^{2}} = \overline{x^{2}} - {\bar{x}}^{2}, \end{aligned}

or written differently

I_{k} \leq \sum_{i = 1}^{n} p_{i} x_{i}^{2} - {\bar{x}}^{2} = \sum_{i = 1}^{n} p_{i} {(x_{i} - \bar{x})}^{2} .

□

We have just proven that the Jensen–Steffensen inequality for strongly convex functions behaves differently than the Jensen inequality for strongly convex functions: applying the same proof techniques, we end up with two different bounds, and surprisingly the indirect proof gives the better one.

Integral version

The integral version of the Jensen–Steffensen inequality for convex functions was proved by Boas in 1970 [2].

Theorem 5

Let $x : [α, β] \to (a, b)$ be a continuous and monotonic function, where $- \infty < α < β < + \infty$ and $- \infty \leq a < b \leq + \infty$ , and let $f : (a, b) \to R$ be a convex function. If $λ : [α, β] \to R$ is either continuous or of bounded variation satisfying

\begin{aligned} (\forall t \in [α, β]) λ (α) \leq λ (t) \leq λ (β), \\ λ (β) - λ (α) > 0, \end{aligned}

then

f (\frac{\int_{α}^{β} x (t) d λ (t)}{\int_{α}^{β} d λ (t)}) \leq \frac{\int_{α}^{β} f (x (t)) d λ (t)}{\int_{α}^{β} d λ (t)} .

Since the indirect proof as in Theorem 2 produced a better bound, we will use the same technique to prove the integral version of the Jensen–Steffensen inequality for strongly convex functions.

Theorem 6

Let $x : [α, β] \to (a, b)$ be a continuous and monotonic function, where $- \infty < α < β < + \infty$ and $- \infty \leq a < b \leq + \infty$ , and let $f : (a, b) \to R$ be a strongly convex function with modulus c. If $λ : [α, β] \to R$ is either continuous or of bounded variation satisfying

\begin{aligned} (\forall t \in [α, β]) λ (α) \leq λ (t) \leq λ (β), \\ λ (β) - λ (α) > 0, \end{aligned}

then

f (μ) \leq \frac{\int_{α}^{β} f (x (t)) d λ (t)}{\int_{α}^{β} d λ (t)} - c \frac{\int_{α}^{β} {(x (t) - μ)}^{2} d λ (t)}{\int_{α}^{β} d λ (t)},

where

μ = \frac{\int_{α}^{β} x (t) d λ (t)}{\int_{α}^{β} d λ (t)} .

Proof

Using the convex representation $g = f - c {(\cdot)}^{2}$ as in Theorem 1 and applying the integral Jensen–Steffensen inequality for convex functions, we obtain

g (μ) = g (\frac{\int_{α}^{β} x (t) d λ (t)}{\int_{α}^{β} d λ (t)}) \leq \frac{\int_{α}^{β} g (x (t)) d λ (t)}{\int_{α}^{β} d λ (t)} .

Going back to f we get

f (μ) - c μ^{2} \leq \frac{\int_{α}^{β} (f (x (t)) - c x {(t)}^{2}) d λ (t)}{\int_{α}^{β} d λ (t)} = \frac{\int_{α}^{β} f (x (t)) d λ (t)}{\int_{α}^{β} d λ (t)} - c \frac{\int_{α}^{β} x {(t)}^{2} d λ (t)}{\int_{α}^{β} d λ (t)},

or written differently

\begin{aligned} f (μ) & \leq \frac{\int_{α}^{β} f (x (t)) d λ (t)}{\int_{α}^{β} d λ (t)} - c \frac{\int_{α}^{β} x {(t)}^{2} d λ (t)}{\int_{α}^{β} d λ (t)} + c μ^{2} \\ = \frac{\int_{α}^{β} f (x (t)) d λ (t)}{\int_{α}^{β} d λ (t)} - c [\frac{\int_{α}^{β} x {(t)}^{2} d λ (t)}{\int_{α}^{β} d λ (t)} - μ^{2}] \\ = \frac{\int_{α}^{β} f (x (t)) d λ (t)}{\int_{α}^{β} d λ (t)} - c [\frac{\int_{α}^{β} x {(t)}^{2} d λ (t)}{\int_{α}^{β} d λ (t)} - 2 μ^{2} + μ^{2}] \\ = \frac{\int_{α}^{β} f (x (t)) d λ (t)}{\int_{α}^{β} d λ (t)} - c [\frac{\int_{α}^{β} x {(t)}^{2} d λ (t)}{\int_{α}^{β} d λ (t)} - 2 μ \frac{\int_{α}^{β} x (t) d λ (t)}{\int_{α}^{β} d λ (t)} + μ^{2} \frac{\int_{α}^{β} d λ (t)}{\int_{α}^{β} d λ (t)}] \\ = \frac{\int_{α}^{β} f (x (t)) d λ (t)}{\int_{α}^{β} d λ (t)} - c \frac{\int_{α}^{β} {(x (t) - μ)}^{2} d λ (t)}{\int_{α}^{β} d λ (t)} . \end{aligned}

□

Availability of data and materials

Not applicable.

Authors’ contributions

Author read and approved the final manuscript.

Funding

University of Split, Faculty of Science, Split, Croatia.

Competing interests

The author declares that there are no competing interests.

Footnotes

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1.Abramovich S., Klaričić Bakula M., Matić M., Pečarić J. A variant of Jensen–Steffensen’s inequality and quasi-arithmetic means. J. Math. Anal. Appl. 2005;307(1):370–386. doi: 10.1016/j.jmaa.2004.10.027. [DOI] [Google Scholar]
2.Boas R.P., Jr. The Jensen–Steffensen inequality. Publ. Elektroteh. Fak. Univ. Beogr., Ser. Mat. Fiz. 1970;302–319:1–8. [Google Scholar]
3.Hiriart-Urruty J.-B., Lemaréchal C. Fundamentals of Convex Analysis. Abridged Version of Convex Analysis and Minimization Algorithms I and II. Berlin: Springer; 2001. [Google Scholar]
4.Merentes N., Nikodem K. Remarks on strongly convex functions. Aequ. Math. 2010;80(1–2):193–199. doi: 10.1007/s00010-010-0043-0. [DOI] [Google Scholar]
5.Nikodem K. Handbook of Functional Equations. New York: Springer; 2014. On strongly convex functions and related classes of functions; pp. 365–405. [Google Scholar]
6.Nikodem K., Páles Z. Characterizations of inner product spaces by strongly convex functions. Banach J. Math. Anal. 2011;5(1):83–87. doi: 10.15352/bjma/1313362982. [DOI] [Google Scholar]
7.Pečarić J.E., Proschan F., Tong Y.L. Convex Functions, Partial Orderings, and Statistical Applications. Boston: Academic Press; 1992. [Google Scholar]
8.Polyak B.T. Existence theorems and convergence of minimizing sequences in extremum problems with restrictions. Sov. Math. Dokl. 1966;7:72–75. [Google Scholar]
9.Steffensen J.F. On certain inequalities and methods of approximation. J. Inst. Actuar. 1919;51:274–297. [Google Scholar]

[CR1] 1.Abramovich S., Klaričić Bakula M., Matić M., Pečarić J. A variant of Jensen–Steffensen’s inequality and quasi-arithmetic means. J. Math. Anal. Appl. 2005;307(1):370–386. doi: 10.1016/j.jmaa.2004.10.027. [DOI] [Google Scholar]

[CR2] 2.Boas R.P., Jr. The Jensen–Steffensen inequality. Publ. Elektroteh. Fak. Univ. Beogr., Ser. Mat. Fiz. 1970;302–319:1–8. [Google Scholar]

[CR3] 3.Hiriart-Urruty J.-B., Lemaréchal C. Fundamentals of Convex Analysis. Abridged Version of Convex Analysis and Minimization Algorithms I and II. Berlin: Springer; 2001. [Google Scholar]

[CR4] 4.Merentes N., Nikodem K. Remarks on strongly convex functions. Aequ. Math. 2010;80(1–2):193–199. doi: 10.1007/s00010-010-0043-0. [DOI] [Google Scholar]

[CR5] 5.Nikodem K. Handbook of Functional Equations. New York: Springer; 2014. On strongly convex functions and related classes of functions; pp. 365–405. [Google Scholar]

[CR6] 6.Nikodem K., Páles Z. Characterizations of inner product spaces by strongly convex functions. Banach J. Math. Anal. 2011;5(1):83–87. doi: 10.15352/bjma/1313362982. [DOI] [Google Scholar]

[CR7] 7.Pečarić J.E., Proschan F., Tong Y.L. Convex Functions, Partial Orderings, and Statistical Applications. Boston: Academic Press; 1992. [Google Scholar]

[CR8] 8.Polyak B.T. Existence theorems and convergence of minimizing sequences in extremum problems with restrictions. Sov. Math. Dokl. 1966;7:72–75. [Google Scholar]

[CR9] 9.Steffensen J.F. On certain inequalities and methods of approximation. J. Inst. Actuar. 1919;51:274–297. [Google Scholar]

PERMALINK

Jensen–Steffensen inequality for strongly convex functions

M Klaričić Bakula

Abstract

Introduction

Main result

Theorem 1

Theorem 2

Proof

Alternative reproach

Lemma 1

Proof

Remark 1

Theorem 3

Proof

Example 1

Theorem 4

Proof

Integral version

Theorem 5

Theorem 6

Proof

Availability of data and materials

Authors’ contributions

Funding

Competing interests

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Jensen–Steffensen inequality for strongly convex functions

M Klaričić Bakula

Abstract

Introduction

Main result

Theorem 1

Theorem 2

Proof

Alternative reproach

Lemma 1

Proof

Remark 1

Theorem 3

Proof

Example 1

Theorem 4

Proof

Integral version

Theorem 5

Theorem 6

Proof

Availability of data and materials

Authors’ contributions

Funding

Competing interests

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases