Skip to main content
Springer logoLink to Springer
. 2017 Apr 4;2017(1):66. doi: 10.1186/s13660-017-1341-z

Dai-Kou type conjugate gradient methods with a line search only using gradient

Yuanyuan Huang 1,, Changhe Liu 1
PMCID: PMC5380739  PMID: 28435203

Abstract

In this paper, the Dai-Kou type conjugate gradient methods are developed to solve the optimality condition of an unconstrained optimization, they only utilize gradient information and have broader application scope. Under suitable conditions, the developed methods are globally convergent. Numerical tests and comparisons with the PRP+ conjugate gradient method only using gradient show that the methods are efficient.

Keywords: conjugate gradient, optimality condition, line search, sufficient descent condition, global convergence

Introduction

Consider the following problem of finding xRn such that

g(x)=0, 1

where g:RnRn is continuous. Throughout this paper, problem (1) corresponds to the first-order optimality condition of the unconstrained optimization

minf(x), 2

where f:RnR is the function whose gradient is g.

Conjugate gradient methods are very efficient in solving large scale problem (2), if f is known, due to their simple iteration and their low memory requirements. For any given starting point x0Rn, an iterative sequence {xk} is generated by the following form:

xk+1=xk+αkdk, 3

where αk is a step-length obtained by some line search, and dk is a search direction generated by

dk={gk,if k=0,gk+βkdk1,if k1, 4

where gk=g(xk). Different choices of the parameter βk in (4) lead to different nonlinear conjugate gradient methods. The Fletcher-Reeves [1], Hestenes-Stiefel [2], Polak-Ribiére-Polyak [3, 4], Dai-Yuan [5] and Liu-Storey [6] formulas, and so on, are well-known formulas for βk. Particularly, conjugate gradient methods with the following (sufficient) descent condition

gkTdkcgk2,k0,c>0, 5

are very important and are always more efficient.

Recently, Dai and Kou [7] designed a family of conjugate gradient methods for the unconstrained nonlinear problems, the corresponding search direction is close to the direction of the scaled memoryless BFGS method. More importantly, they satisfied the sufficient descent condition (5). Numerical experiments illustrated that the Dai-Kou type conjugate gradient methods are more efficient than the Hager-Zhang type methods [8] presented by Hager and Zhang [8, 9]. For other descent conjugate gradient methods proposed by researchers, please see [7, 911] and the references therein.

For conjugate gradient methods, line search plays an important role for the global convergence. In general, the weak Wolfe line search,

f(xk+αkdk)f(xk)+δαkgkTdk, 6
σgkTdkg(xk+αkdk)Tdk, 7

where 0<δ<σ<1, was used to obtain the step-length αk. Hager and Zhang [9] showed that the first condition (6) may never be satisfied due to the existence of the numerical errors (see also [7]). Thus, in order to avoid the numerical drawback of the weak Wolfe line search, they proposed approximate Wolfe conditions [8, 9], which was a combination of the weak Wolfe line search and

σgkTdkg(xk+αkdk)Tdk(2δ1)gkTdk, 8

where 0<δ<1/2 and δ<σ<1. Numerical tests showed that the combined line search performed well, but there is no theory to guarantee the global convergence. Then Dai and Kou proposed an improved Wolfe line search, that is, the step-length αk satisfied (7) and

f(xk+αkdk)f(xk)+min{ϵ|gkTdk|,δαk|gkTdk|+ηk}, 9

where 0<δ<σ<1, ϵ>0 is a constant parameter and {ηk} is a positive sequence satisfying k0ηk<+. With the improved Wolfe line search, the global convergence of Dai-Kou type conjugate gradient methods was guaranteed.

Although the Hager-Zhang type and Dai-Kou type conjugate gradient methods are efficient in solving problem (2), during the implementation of the methods, function evaluations are required. The goal of this paper is to solve problem (1) which is more general and includes some nonlinear equations, such as boundary value problems [12]. So, we hope to improve the Dai-Kou type conjugate gradient methods to directly solve problem (1) and retain their high numerical efficiency. More recently, Dong [13] embedded an Armijo-type line search only using gradient into the PRP+ conjugate gradient method [14] to solve problem (1), the step-length αk satisfied

g(xk+αkdk)Tdk+12max{μk,0}αkdk2σgkTdk, 10

where μk is a determined real number and 0<σ<1. The line search allowed small choices of αk. In order to avoid this drawback, Dong [15] considered the following line search:

σgkTdkg(xk+αkdk)TdkδgkTdk, 11

where 0<δ<σ<1. Motivated by the work of [15], we embed the line search (11) into the Dai-Kou type conjugate gradient methods, then the improved methods of this paper have several advantages. They have the positive features of the Dai-Kou type methods for problem (2), they can be used to solve the nonlinear optimization (2) only requiring gradient information, and they can be used to solve some systems of nonlinear equations, such as those arising in boundary value problems and others.

The rest of this paper is organized as follows. In the next section, we simply review the Dai-Kou type conjugate gradient methods for unconstrained minimization and develop them to solve problem (1). In Section 3, we prove the global convergence of the improved methods under some suitable conditions. In Section 4, we select two classes of test problems to test the improved methods. One class is composed of test problems from the CUTEst test environment, and the other class is composed of some boundary value problems. The numerical performance is used to confirm their broader application and to compare with that of the PRP+ conjugate gradient method in [13]. Finally, some conclusions are given in Section 5.

Algorithm

In this section, we describe the details of the proposed methods. First, we briefly review the Dai-Kou type conjugate gradient methods in the setting of unconstrained minimization (2). We have mentioned above that nonlinear conjugate gradient methods are identified by the definitions of the parameter βk in (4). For the family of Dai-Kou type conjugate gradient methods, the parameter βk is defined as

βkN(τk1)=max{βk(τk1),ηgkTdk1dk12}. 12

Here,

βk(τk1)=gkTyk1dk1Tyk1(τk1+yk12sk1Tyk1sk1Tyk1sk12)gkTsk1dk1Tyk1, 13

where yk1=gkgk1, sk1=αk1dk1=xkxk1, τk1 is a parameter corresponding to the scaling parameter in the scaled memoryless BFGS method, and η[0,1). The parameters βk in the Dai-Liao type methods [16] and the Hager-Zhang type methods [9] are special cases of formula (13). If τk1 is specially defined as

τk1=λτk1A+(1λ)τk1B 14

with λ[0,1] and

τk1A=yk12sk1Tyk1, 15
τk1B=sk1Tyk1sk12, 16

then the Dai-Kou type conjugate gradient methods satisfy the sufficient descent condition (5).

The Dai-Kou type methods are very efficient in solving the unconstrained minimization, so we hope they can be used to solve problem (1) only requiring gradient information. Now we describe the improved methods in detail.

Algorithm 2.1

Step 0.

Choose x0Rn, constants σ(0,1), δ(0,σ), λ[0,1], η[0,1), ε>0. Set g0:=g(x0) and k:=0.

Step 1.

If gkε, then stop.

Step 2.

Generate the search direction dk by (4) with βk from (12), where τk1 is defined by (14).

Step 3.

Find αk such that condition (11) holds, then compute the new iterate xk+1=xk+αkdk. Set k:=k+1 and go to Step 1.

In Step 3, the step-length αk is determined following the inexact line search strategies of Algorithm 2.6 in [17]. Detailed steps are described in the following line search algorithm.

Algorithm 2.2

Step 0.

Set u=0 and v=+. Choose α>0. Set j:=0.

Step 1.
If α does not satisfy
g(xk+αdk)TdkδgkTdk,
then set j:=j+1, and go to Step 2. If α does not satisfy
σgkTdkg(xk+αdk)Tdk,
then set j:=j+1, and go to Step 3. Otherwise, set αk:=α, and return.
Step 2.

Set v=α, α=(u+v)/2. Then go to Step 1.

Step 3.

Set u=α, α=2u. Then go to Step 1.

The choice of the initial step-length is important for a line search. For conjugate gradient methods, it is important to make an initial guess of the step-length by utilizing the current iterative information about the problem. So, in Algorithm 2.2, we choose the initial step-length α=1/g0 if k=0, and α=αk1gk1Tdk1/yk1Tdk1 if k1.

Convergence analysis

Assumption 1

Assume that f:RnR is bounded below, that is, f(x)> for all xRn, and f is continuously differentiable. Its gradient g:RnRn is L-Lipschitz continuous, that is, there exists a constant L>0 such that

g(x)g(y)Lxy,x,yRn. 17

Assumption 1 implies that there exists a positive constant γ̂ such that

g(x)γˆ,xRn. 18

Lemma 3.1

Assume that g:RnRn satisfies Assumption  1. If d0=g0 and dk1Tyk10 for all k1, then

gkTdkmin{34,1η}gk2. 19

Proof

Since d0=g0, we have g0Td0=g02, which satisfies (19). If

βkN(τk1)=gkTyk1dk1Tyk1(τk1+yk12sk1Tyk1sk1Tyk1sk12)gkTsk1dk1Tyk1,

from Lemma 2.3 in [5], we have the result that

gkTdk34gk2.

And if

βkN(τk1)=ηgkTdk1dk12,

it is easy to know that

gkTdk(1η)gk2.

The proof is complete. □

Lemma 3.2

Suppose that f:RnR is bounded below along the ray {xk+αdk|α>0}, its gradient g is continuous, dk is a search direction at xk, and gkTdk<0. Then if 0<δ<σ<1, there exists αk>0 satisfying the line search (11).

Proof

Define ϕ(α)=f(xk+αdk) and ψ(α)=f(xk)+αδgkTdk. Since ϕ(α) is bounded below for all α>0, 0<δ<1 and gkTdk<0, the functions ϕ(α) and ψ(α) must intersect at at least one point. Let αk>0 be the smallest intersecting value of α, i.e.,

f(xk+αkdk)=f(xk)+αkδgkTdk. 20

Since f is continuously differentiable, by the mean value theorem, there exists αk(0,αk) such that

f(xk+αkdk)f(xk)=αkg(xk+αkdk)Tdk. 21

By combining (20) and (21), we obtain

δgkTdk=g(xk+αkdk)Tdk. 22

Furthermore,

σgkTdkg(xk+αkdk)Tdk=δgkTdk, 23

since 0<δ<σ<1 and gkTdk<0. □

Lemma 3.3

Assume that g:RnRn is monotone on the interval {xk+αdk:0ααk}, where αk satisfies the line search (11), then the following inequality holds:

f(xk+αkdk)f(xk)+δαkgkTdk, 24

where f:RnR is the function whose gradient is g.

Proof

Since g is monotone on the interval {xk+αdk:0ααk}, then

(g(xk+αkdk)g(xk+αdk))T((xk+αkdk)(xk+αdk))0.

Since ααk, it is not difficult to get that

g(xk+αdk)Tdkg(xk+αkdk)TdkδgkTdk.

Applying this inequality to the following relation

f(xk+αkdk)=f(xk)+0αkg(xk+αdk)Tdkdα

yields inequality (24). □

Now, we state the Zoutendijk condition [18] for the line search (11).

Lemma 3.4

Assume that g:RnRn satisfies Assumption  1. Consider any iterative method in the form (3), where dk is a descent direction and αk satisfies the line search (11), then

k0(gkTdk)2dk2<+. 25

Proof

It follows from the Cauchy-Schwarz inequality, the Lipschitz condition (17) and the line search (11) that

(σ1)gkTdk(gk+1gk)TdkαkLdk2. 26

Then we have

αk1σLgkTdkdk2. 27

The formula with (24) implies that

(gkTdk)2dk2L(1σ)δ(f(xk)f(xk+1)). 28

Summing (28) over k and noting that f is bounded below, we have that the desired result holds. □

Now we discus the convergence properties of Algorithm 2.1. In the following, we will prove that if the gradient g:RnRn is μ-strongly monotone, that is, there exists a constant μ>0 such that

(g(x)g(y))T(xy)μxy2,x,yRn, 29

Algorithm 2.1 is globally convergent with limkgk=0, and for more general gradient g:RnRn, Algorithm 2.1 is convergent in the sense that lim infkgk=0.

Theorem 3.1

Assume that g:RnRn satisfies Assumption  1 and is μ-strongly monotone. The sequence {xk} is generated by Algorithm 2.1, then

limkgk=0. 30

Proof

It follows from (17) and (29) that

sk1Tyk1sk1yk1Lsk12, 31
μsk12sk1Tyk1. 32

By (31) and (32), it is easy to see that

sk1Tyk1sk12L, 33
yk12sk1Tyk1L2μ. 34

Then we have that

|τk1|(1λ)L2μ+λL.

Consequently, we have that

|βk(τk1)|=|gkTyk1dk1Tyk1(τk1+yk12sk1Tyk1sk1Tyk1sk12)gkTsk1dk1Tyk1|[(2λ)L2μ2+(2+λ)Lμ]gkdk1.

Furthermore,

|βkN(τk1)|max{(2λ)L2μ2+(2+λ)Lμ,η}gkdk1.

Then

dk=gk+βkN(τk1)dk1gk+|βkN(τk1)|dk1ζgk, 35

where ζ=1+max{(2λ)L2μ2+(2+λ)Lμ,η}.

By Lemmas 3.1 and 3.4, we have that

k0gk4dk2<.

It follows from this and (35) that

k0gk2<,

which implies the desired result. □

Theorem 3.2

Assume that g:RnRn satisfies Assumption  1. Then Algorithm 2.1 is convergent in the sense that

lim infkgk=0. 36

Proof

We prove the theorem by contradiction. Assume that both gk0 for all k and lim infkgk>0, then there must exist some γ>0 such that

gkγ,k0, 37

then dk0, otherwise Lemma 3.1 would imply gk=0.

It follows from (37), Lemma 3.1 and Lemma 3.4 that

γ2k01dk2k0gk2dk2

and

k0gk2dk2k01γ2gk4dk21γ2c¯k0(gkTdk)2dk2<, 38

where c¯=min{34,1η}, then we have that

dk+. 39

This means that there exists a positive integer N, for all kN,

βkN(τk1)=βk(τk1)=gkTyk1dk1Tyk1(τk1+yk12sk1Tyk1sk1Tyk1sk12)gkTsk1dk1Tyk1=gkTyk1dk1Tyk1((1+λ)yk12sk1Tyk1λsk1Tyk1sk12)gkTsk1dk1Tyk1. 40

It follows from Lemma 3.1, (11) and (37) that

dk1Tyk1(1σ)gk1Tdk1c¯(1σ)γ2. 41

It follows from (18), (33), (34), (40), (41) and the L-Lipschitz continuity of g that, for all kN,

|βkN(τk1)|γˆ(1+λ)c¯(1σ)γ2(L+L2μ)sk1. 42

Define uk=dk/dk, then similarly to the proof of Lemma 4.3 in [7], we can get the result that

ukuk12(1+η)gkdk. 43

Then it follows from (38) and (43) that

k1ukuk12<. 44

From Assumption 1 and Lemma 3.3, we know that the generated sequence {xk} is bounded, then there exists some positive constant γ̄ such that

xkγ¯,k0. 45

By using inequalities (42), (44) and (45), we can get the desired result similarly to the proof of items II and III of Theorem 3.2 in [9]. □

Numerical experiments

In this section, we did some numerical experiments to test the performance of the proposed method and compared it with the PRP+ conjugate gradient method in [13]. All codes were written in Matlab and run on a notebook computer with an Intel(R) Core(TM) i5-5200U 2.20 GHz CPU, 8.00 GB of RAM and Linux operation system Ubuntu 12.04. All test problems were drawn from the CUTEst test library [19, 20] and the literature [12]. For the test problems from the CUTEst test library, we particularly chose the unconstrained optimization problems whose dimensions were at least 50. Different from the work in the literature such as [5, 7], we solved them only using gradient information. In order to confirm the broader application scope of the proposed method, some boundary value problems were selected from [12]. See Chapter 1 in [21] for the background of the boundary value problems.

In practical implementations, the stopping criterion used was gk103. For the proposed method in this paper, the values of σ and δ in the line search (11) were taken to be 0.9 and 0.0001, respectively, λ=0.5, and η=0.5. For the PRP+ conjugate gradient, all the initial values came from the reference [13].

The numerical results are reported in Tables 1 and 2, where Name, Dim, Iter, Ng and CPU represent the name of the test problem, the dimension, the number of iterations, the number of gradient evaluations and the CPU time elapsed in seconds, respectively. ‘-’ means the method failed to achieve the prescribed accuracy when the number of iterations exceeded 50,000 or the gradient function generated ‘NaN’. The performances of the two methods were evaluated using the profiles of Dolan and Morè [22]. That is, we plotted the fraction P of the test problems for which each of the two methods was within a factor τ. In the performance profiles, the top curve represents the most robust one within the same factor τ, and the left curve represents the fastest one to solve the same percentage of test problems. Figures 1-3 show the performance profiles for test problems from the CUTEst library relating to the number of iterations, the number of gradient evaluations and the CPU time, respectively. Figures 4-6 show the performance profiles for some boundary value problems. These figures reveal that, for the test problems, the proposed method is more efficient and robust than the PRP+ conjugate gradient method. Consequently, the improved method not only can solve problems only referring to gradient information but also inherits the good numerical performance of the Dai-Kou type conjugate gradient methods.

Figure 2.

Figure 2

Performance profile for the test problems from the CUTEst library based on the number of gradient evaluations.

Figure 5.

Figure 5

Performance profile for some boundary value problems based on the number of gradient evaluations.

Table 1.

Numerical results for test problems from the CUTEst library

Name (Dim) Method Iter/Ng/CPU
ARGLINA (200) Dai_Kou 14/28/1.673e − 02
PRP+ 13/25/2.309e − 02
ARGLINB (200) Dai_Kou 22 /43/2.577e − 02
PRP+ 47/93/6.121e − 02
ARGLINC (200) Dai_Kou 22/43/2.420e − 02
PRP+ 47/92/6.144e − 02
BDQRTIC (500) Dai_Kou 118/264/3.731e − 02
PRP+ 181/317/6.208e − 02
BOX (10,000) Dai_Kou 30/100/1.662e − 01
PRP+ 56/104/2.615e − 01
BROWNAL (200) Dai_Kou 22/42/1.004e − 02
PRP+ -/-/-
BROWNALE (200) Dai_Kou 1/1/9.500e − 05
PRP+ 1/1/1.070e − 04
BRYBND (5,000) Dai_Kou 24/34/3.827e − 02
PRP+ 32/62/9.025e − 02
CHAINWOO (4,000) Dai_Kou 223/361/2.337e − 01
PRP+ 271/480/4.458e − 01
CHNROSNB (50) Dai_Kou 344/548/3.404e − 02
PRP+ 564/952/8.028e − 02
CRAGGLVY (5,000) Dai_Kou 142/273/2.638e − 01
PRP+ -/-/-
COSINE (1,000) Dai_Kou 9/22/6.495e − 03
PRP+ 14/25/1.433e − 02
CURLY10 (10,000) Dai_Kou -/-/-
PRP+ 20,040/39,984/6.169e + 01
CURLY20 (10,000) Dai_Kou -/-/-
PRP+ 27,216/54,259/1.278e + 02
DIXMAANA (3,000) Dai_Kou 10/12/5.625e − 03
PRP+ 16/27/2.274e − 02
DIXMAANB (3,000) Dai_Kou 10/12/5.704e − 03
PRP+ 11/15/1.145e − 02
DIXMAANC (3,000) Dai_Kou 12/15/6.271e − 03
PRP+ 14/21/1.697e − 02
DIXMAAND (3,000) Dai_Kou 14/17/1.011e − 02
PRP+ 16/24/1.547e − 02
DIXMAANE (3,000) Dai_Kou 85/123/4.520e − 02
PRP+ 80/152/8.792e − 02
DIXMAANF (3,000) Dai_Kou 31/42/2.522e − 02
PRP+ 30/41/4.214e − 02
DIXMAANG (3,000) Dai_Kou 29/40/2.873e − 02
PRP+ 27/35/2.557e − 02
DIXMAANH (3,000) Dai_Kou 28/37/1.468e − 02
PRP+ 26/34/2.635e − 02
DIXMAANI (3,000) Dai_Kou 124/186/6.319e − 02
PRP+ 124/239/1.124e − 01
DIXMAANJ (3,000) Dai_Kou 36/52/2.502e − 02
PRP+ 31/43/3.019e − 02
DIXMAANK (3,000) Dai_Kou 34/48/2.063e − 02
PRP+ 28/37/2.864e − 02
DIXMAANL (3,000) Dai_Kou 29/40/1.661e − 02
PRP+ 30/40/3.369e − 02
DIXMAANM (3,000) Dai_Kou 104/154/6.135e − 02
PRP+ 157/305/1.407e − 01
DIXMAANN (3,000) Dai_Kou 63/93/3.813e − 02
PRP+ 98/164/8.303e − 02
DIXMAANO (3,000) Dai_Kou 59/86/2.737e − 02
PRP+ 80/130/7.730e − 02
DIXMAANP (3,000) Dai_Kou 56/77/3.176e − 02
PRP+ 72/111/6.704e − 02
DIXON3DQ (10,000) Dai_Kou 620/945/5.557e − 01
PRP+ 1,467/2,933/2.524e + 00
DMN15103LS (99) Dai_Kou 119/206/1.417e + 00
PRP+ 39/106/1.053e + 00
DMN15333LS (99) Dai_Kou 80/171/1.143e + 00
PRP+ -/-/-
DQDRTIC (5,000) Dai_Kou 53/100/6.594e − 02
PRP+ 76/151/1.327e − 01
DQRTIC (5,000) Dai_Kou 18/31/1.109e − 02
PRP+ 25/25/2.123e − 02
EDENSCH (1,000) Dai_Kou 28/43/1.159e − 02
PRP+ 31/51/1.590e − 02
EG2 (1,000) Dai_Kou 19/37/9.933e − 03
PRP+ 32/58/2.803e − 02
EIGENALS (2,550) Dai_Kou 24,758/37,853/2.181e + 02
PRP+ 21,640/41,892/3.618e + 02
ENGVAL1 (1,000) Dai_Kou 25/35/6.147e − 03
PRP+ 20/28/1.253e − 02
ERRINROS (50) Dai_Kou 111/171 /1.860e − 02
PRP+ 25,995/48,312/3.756e + 00
ERRINRSM (50) Dai_Kou 419/805/4.634e − 02
PRP+ -/-/-
EXTROSNB (1,000) Dai_Kou 652/1,063/1.300e − 01
PRP+ 906/1,611/2.639e − 01
FLETBV3M (5,000) Dai_Kou 115/263/4.331e − 01
PRP+ 33/61/1.482e − 01
FLETCBV2 (5,000) Dai_Kou 1/1/1.099e − 03
PRP+ 1/1/1.283e − 03
FMINSRF2 (5,625) Dai_Kou 251/386/2.966e − 01
PRP+ 338/567/6.821e − 01
FREUROTH (5,000) Dai_Kou 191/331 /2.437e − 01
PRP+ 75/133/1.523e − 01
GENHUMPS (5,000) Dai_Kou 9,378/20,870/3.155e + 01
PRP+ 10,235/17,320/3.504e + 01
GENROSE (1,000) Dai_Kou 3,054/4,706/7.083e − 01
PRP+ 4,947/8,388/1.792e + 00
HYDC20LS (99) Dai_Kou 2,541/3,952/4.016e − 01
PRP+ -/-/-
INDEF (5,000) Dai_Kou -/-/-
PRP+ -/-/-
INDEFM (1,000) Dai_Kou -/-/-
PRP+ 628/1,271/5.722e − 01
JIMACK (3,549) Dai_Kou 716/1,098/4.231e + 01
PRP+ 401/725/4.284e + 01
LIARWHD (5,000) Dai_Kou 50/150/8.031e − 02
PRP+ 124/223/1.945e − 01
MANCINO (100) Dai_Kou 8/17/5.880e − 02
PRP+ 31/59/2.788e − 01
MODBEALE (10,000) Dai_Kou 371/738/1.879e + 00
PRP+ -/-/-
MOREBV (5,000) Dai_Kou 1/1/5.170e − 04
PRP+ 1/1/7.230e − 04
MSQRTALS (1,024) Dai_Kou 749/1,148/1.534e + 00
PRP+ 520/969/1.854e + 00
MSQRTBLS (1,024) Dai_Kou 783/1,196/1.639e + 00
PRP+ 681/1279/2.391e + 00
NCB20 (5,010) Dai_Kou 365/688/1.466e + 00
PRP+ 148/248/8.941e − 01
NCB20B (5,000) Dai_Kou 98/172/3.661e − 01
PRP+ 77/131/4.434e − 01
NONCVXU2 (5,000) Dai_Kou 1,159/1,751/1.945e + 00
PRP+ 4,582/8,610/1.396e + 01
NONCVXUN (5,000) Dai_Kou 1,247/1,887/2.110e + 00
PRP+ 9,929/18,942/3.063e + 01
NONDIA (5,000) Dai_Kou 13/23/1.189e − 02
PRP+ 54/103/8.099e − 02
NONDQUAR (5,000) Dai_Kou 66/129/5.082e − 02
PRP+ 139/202/1.238e − 01
OSCIGRAD (10,000) Dai_Kou 31/44/5.616e − 02
PRP+ -/-/-
OSCIPATH (500) Dai_Kou 30/78/6.678e − 03
PRP+ -/-/-
PENALTY1 (1,000) Dai_Kou 18/28/4.520e − 03
PRP+ -/-/-
PENALTY2 (200) Dai_Kou 112/164 /2.145e − 02
PRP+ 173/304/5.560e − 02
PENALTY3 (200) Dai_Kou -/-/-
PRP+ -/-/-
POWELLSG (5,000) Dai_Kou 118/225/7.709e − 02
PRP+ 147/260/1.233e − 01
POWER (10,000) Dai_Kou 22/25/1.965e − 02
PRP+ -/-/-
QUARTC (5,000) Dai_Kou 18/31/9.852e − 03
PRP+ 25/25/2.080e − 02
SCHMVETT (5,000) Dai_Kou 38/68/1.145e − 01
PRP+ 33/63/1.478e − 01
SENSORS (100) Dai_Kou -/-/-
PRP+ 32/65/4.099e − 01
SINQUAD (5,000) Dai_Kou 117/270/2.988e − 01
PRP+ 182/342/5.408e − 01
SPARSINE (5,000) Dai_Kou 875/1348/1.708e + 00
PRP+ -/-/-
SPARSQUR (10,000) Dai_Kou 21/22/4.845e − 02
PRP+ 16/16/6.262e − 02
SPMSRTLS (4,999) Dai_Kou 136/219/1.742e − 01
PRP+ 161/278/3.338e − 01
SROSENBR (5,000) Dai_Kou 26/63/2.904e − 02
PRP+ 33/57/4.532e − 02
SSBRYBND (5,000) Dai_Kou 6,337/9,751/9.184e + 00
PRP+ -/-/-
SSCOSINE (5,000) Dai_Kou -/-/-
PRP+ -/-/-
TESTQUAD (5,000) Dai_Kou 5,068/7,734/1.948e + 00
PRP+ 1,624/3,247/9.661e − 01
TOINTGOR (50) Dai_Kou 131/195/1.998e − 02
PRP+ 105/180/2.060e − 02
TOINTGSS (5,000) Dai_Kou 18/37/2.997e − 02
PRP+ 14/27/2.830e − 02
TOINTPSP (50) Dai_Kou 142/268/2.158e − 02
PRP+ 115/194/2.190e − 02
TOINTQOR (50) Dai_Kou 43/64/7.463e − 03
PRP+ 41/81/9.627e − 03
TQUARTIC (5,000) Dai_Kou 35/103/4.848e − 02
PRP+ 68/120/7.646e − 02
TRIDIA (5,000) Dai_Kou 1,633/2,491/7.701e − 01
PRP+ 628/1,255/5.693e − 01
VARDIM (200) Dai_Kou 18/18/1.765e − 03
PRP+ -/-/-
VAREIGVL (50) Dai_Kou 19/29/4.227e − 03
PRP+ 23/39/6.727e − 03
WOODS (4,000) Dai_Kou 36/67/3.083e − 02
PRP+ 22/28/2.143e − 02

Table 2.

Numerical results for some boundary value problems

Name (Dim) Method Iter/Ng/CPU
Function2 (10,000) Dai_Kou 12/27/1.266e − 02
PRP+ 12/23/1.529e − 02
Function6 (10,000) Dai_Kou 1/1/5.010e − 04
PRP+ 1/1/4.399e − 04
Function8 (10,000) Dai_Kou 12/16/4.678e − 02
PRP+ 10/17/7.151e − 02
Function12 (10,000) Dai_Kou 10/21/1.206e − 02
PRP+ 10/19/1.227e − 02
Function13 (10,000) Dai_Kou 222/330/2.044e − 01
PRP+ 346/691/5.704e − 01
Function14 (10,000) Dai_Kou 12/17/4.554e − 02
PRP+ 9/11/4.912e − 02
Function18 (10,000) Dai_Kou 1/1/8.588e − 04
PRP+ 1/1/7.632e − 04
Function19 (10,000) Dai_Kou 9/14/1.084e − 02
PRP+ 8/12/1.551e − 02
Function20 (10,000) Dai_Kou 1/1/7.464e − 04
PRP+ 1/1/9.391e − 04
Function21 (10,000) Dai_Kou 75/81/5.441e − 02
PRP+ -/-/-
Function22 (10,000) Dai_Kou 13/21/1.300e − 02
PRP+ 12/21/1.580e − 02
Function24 (10,000) Dai_Kou 5/7/7.387e + 00
PRP+ 6/10/1.609e + 01
Function25 (10,000) Dai_Kou 12/22/2.008e − 02
PRP+ 16/26/4.658e − 02
Function26 (10,000) Dai_Kou 258/387/1.890e − 01
PRP+ 345/689/4.391e − 01
Function27 (10,000) Dai_Kou 143/212/1.285e − 01
PRP+ 171/341/2.837e − 01
Function29 (10,000) Dai_Kou 2,211/3,355/6.638e + 00
PRP+ 8,150/16,299/4.633e + 01
Function31 (10,000) Dai_Kou 1/1/5.388e − 04
PRP+ 1/1/9.083e − 04

Figure 1.

Figure 1

Performance profile for the test problems from the CUTEst library based on the number of iterations.

Figure 3.

Figure 3

Performance profile for the test problems from the CUTEst library based on the CPU time.

Figure 4.

Figure 4

Performance profile for some boundary value problems based on the number of iterations.

Figure 6.

Figure 6

Performance profile for some boundary value problems based on the CPU time.

Conclusions

In this paper, we discussed the improved Dai-Kou type conjugate gradient methods only using gradient information. They inherited the advantages of the Dai-Kou type conjugate gradient methods for solving the unconstrained minimization problems, but had broader application scope. Moreover, the problem considered in this paper can be viewed as the nonlinear equation

F(x)=0 46

with F=g. While the convergence analysis of this paper needed some assumptions of the function f whose gradient is g, our further investigation is to avoid the function f and to solve general nonlinear equation (46) using different strategies from those of this paper and literature [2325].

Acknowledgements

The authors are very grateful to the associate editor and reviewers for their valuable suggestions which have greatly improved the paper. This work was partially supported by the National Natural Science Foundation of China (No. 11471102) and the Key Basic Research Foundation of the Higher Education Institutions of Henan Province (No. 16A110012).

Footnotes

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

All authors read and approved the final version of this paper.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Yuanyuan Huang, Email: yyuanhuang@126.com.

Changhe Liu, Email: changheliu@126.com.

References

  • 1.Fletcher R, Reeves C. Function minimization by conjugate gradients. Comput. J. 1964;7:149–154. doi: 10.1093/comjnl/7.2.149. [DOI] [Google Scholar]
  • 2.Hestenes MR, Stiefel EL. Methods of conjugate gradients for solving linear systems. J. Res. Natl. Bur. Stand. 1952;49(5):409–436. doi: 10.6028/jres.049.044. [DOI] [Google Scholar]
  • 3.Polak E, Ribière G. Note sur la méthodes de directions conjuguées. Rev. Fr. Inform. Rech. Opér., Sér Rouge. 1969;16:35–43. [Google Scholar]
  • 4.Polyak BT. The conjugate gradient methods in extreme problems. USSR Comput. Math. Math. Phys. 1969;9:94–112. doi: 10.1016/0041-5553(69)90035-4. [DOI] [Google Scholar]
  • 5.Dai YH, Yuan Y. A nonlinear conjugate gradient method with a strong global convergence property. SIAM J. Optim. 1999;10(1):177–182. doi: 10.1137/S1052623497318992. [DOI] [Google Scholar]
  • 6.Liu Y, Storey C. Efficient generalized conjugate gradient algorithms. Part 1: theory. J. Optim. Theory Appl. 1991;69:129–137. doi: 10.1007/BF00940464. [DOI] [Google Scholar]
  • 7.Dai YH, Kou CX. A nonlinear conjugate gradient algorithm with an optimal property and an improved Wolfe line search. SIAM J. Optim. 2013;23:296–320. doi: 10.1137/100813026. [DOI] [Google Scholar]
  • 8.Hager WW, Zhang HC. Algorithm 851: CG__DESCENT, a conjugate gradient method with guaranteed descent. ACM Trans. Math. Softw. 2006;32:113–137. doi: 10.1145/1132973.1132979. [DOI] [Google Scholar]
  • 9.Hager WW, Zhang HC. A new conjugate gradient method with guaranteed descent and an efficient line search. SIAM J. Optim. 2005;16:170–192. doi: 10.1137/030601880. [DOI] [Google Scholar]
  • 10.Zhang L, Li JL. A new globalization technique for nonlinear conjugate gradient methods for nonconvex minimization. Appl. Math. Comput. 2011;217:10295–10304. [Google Scholar]
  • 11.Nakamura W, Narushima Y, Yabe H. Nonlinear conjugate gradient methods with sufficient descent properties for unconstrained optimization. J. Ind. Manag. Optim. 2013;9:595–619. doi: 10.3934/jimo.2013.9.595. [DOI] [Google Scholar]
  • 12.Spedicato E, Huang Z. Numerical experience with Newton-like methods for nonlinear algebraic systems. Computing. 1997;58:69–89. doi: 10.1007/BF02684472. [DOI] [Google Scholar]
  • 13.Dong YD. A practical PR+ conjugate gradient method only using gradient. Appl. Math. Comput. 2012;219:2041–2052. [Google Scholar]
  • 14.Powell MJD. Convergence properties of algorithms for nonlinear optimization. SIAM Rev. 1986;28:487–500. doi: 10.1137/1028154. [DOI] [Google Scholar]
  • 15. Dong, YD: The BFGS method only using gradient. Preprint. http://www.optimization-online.org/DB_FILE/2016/07/5560.pdf
  • 16.Dai YH, Liao LZ. New conjugacy conditions and related nonlinear conjugate gradient methods. Appl. Math. Optim. 2001;43:87–101. doi: 10.1007/s002450010019. [DOI] [Google Scholar]
  • 17.Lewis AS, Overton ML. Nonsmooth optimization via quasi-Newton methods. Math. Program. 2013;141(1-2):135–163. doi: 10.1007/s10107-012-0514-2. [DOI] [Google Scholar]
  • 18.Zoutendijk G. Nonlinear programming, computational methods. In: Abadie J, editor. Integer and Nonlinear Programming. Amsterdam: North-Holland; 1970. pp. 37–86. [Google Scholar]
  • 19.Bongartz I, Conn AR, Gould N, Toint PL. CUTE: constrained and unconstrained testing environment. ACM Trans. Math. Softw. 1995;21:123–160. doi: 10.1145/200979.201043. [DOI] [Google Scholar]
  • 20.Gould NIM, Orban D, Toint PL. CUTEr and SifDec: a constrained and unconstrained testing environment, revisited. ACM Trans. Math. Softw. 2003;29:373–394. doi: 10.1145/962437.962439. [DOI] [Google Scholar]
  • 21.Ortega JM, Rheinboldt WC. Iterative Solution of Nonlinear Equations in Several Variables. New York: Academic Press; 1970. [Google Scholar]
  • 22.Dolan ED, Morè JJ. Benchmarking optimization software with performance profiles. Math. Program. 2002;91:201–213. doi: 10.1007/s101070100263. [DOI] [Google Scholar]
  • 23.Solodov MV, Svaiter BF. A globally convergent inexact Newton method for systems of monotone equations. In: Fukushima M, Qi L, editors. Reformulation: Nonsmooth, Piecewise Smooth, Semismooth and Smoothing Methods. Dordrecht: Kluwer Academic; 1999. pp. 355–369. [Google Scholar]
  • 24.Yan QR, Peng XZ, Li DH. A globally convergent derivative-free method for solving large-scale nonlinear monotone equations. J. Comput. Appl. Math. 2010;234:649–657. doi: 10.1016/j.cam.2010.01.001. [DOI] [Google Scholar]
  • 25.Li QN, Li DH. A class of derivative-free methods for large-scale nonlinear monotone equations. IMA J. Numer. Anal. 2011;31:1625–1635. doi: 10.1093/imanum/drq015. [DOI] [Google Scholar]

Articles from Journal of Inequalities and Applications are provided here courtesy of Springer

RESOURCES