PLoS ONE. 2015 Oct 26;10(10):e0140071. doi: 10.1371/journal.pone.0140071

Two New PRP Conjugate Gradient Algorithms for Minimization Optimization Models

Gonglin Yuan 1,2, Xiabin Duan 1,*, Wenjie Liu 2,3, Xiaoliang Wang 1, Zengru Cui 1, Zhou Sheng 1
Editor: Yongtang Shi
PMCID: PMC4621041  PMID: 26502409

Abstract

Two new PRP conjugate gradient algorithms are proposed in this paper, based on two modified PRP conjugate gradient methods: the first algorithm is proposed for solving unconstrained optimization problems, and the second algorithm is proposed for solving nonlinear equations. The first method uses two kinds of information: function values and gradient values. Both methods possess the following good properties: 1) $\beta_k \geq 0$; 2) the search direction has the trust region property without the use of any line search method; 3) the search direction has the sufficient descent property without the use of any line search method. Under some suitable conditions, we establish the global convergence of the two algorithms. We conduct numerical experiments to evaluate our algorithms. The numerical results indicate that the first algorithm is effective and competitive for solving unconstrained optimization problems and that the second algorithm is effective for solving large-scale nonlinear equations.

Introduction

As we know, the conjugate gradient method is very popular and effective for solving the following unconstrained optimization problem

$\min_{x \in \Re^n} f(x)$ (1)

where $f : \Re^n \to \Re$ is continuously differentiable and $g(x)$ denotes the gradient of $f(x)$ at $x$; problem Eq (1) can also be used to model various other problems [1–5]. The iterative formula used in the conjugate gradient method is usually given by

$x_{k+1} = x_k + \alpha_k d_k$ (2)

and

$d_k = \begin{cases} -g_k & \text{if } k = 1 \\ -g_k + \beta_k d_{k-1} & \text{if } k \geq 2 \end{cases}$ (3)

where $g_k = g(x_k)$, $\beta_k \in \Re$ is a scalar, $\alpha_k > 0$ is a step length determined by some line search, and $d_k$ denotes the search direction. Different conjugate gradient methods correspond to different choices of $\beta_k$. Some of the popular methods [6–12] used to compute $\beta_k$ are the DY conjugate gradient method [6], the FR conjugate gradient method [7], the PRP conjugate gradient method [8, 9], the HS conjugate gradient method [10], the LS conjugate gradient method [11], and the CD conjugate gradient method [12]. The PRP parameter $\beta_k$ [8, 9] is defined by

$\beta_k^{PRP} = \frac{g_k^T y_{k-1}}{\|g_{k-1}\|^2}$ (4)

where $\|\cdot\|$ denotes the Euclidean norm and $y_{k-1} = g_k - g_{k-1}$. The PRP conjugate gradient method is currently considered to have the best numerical performance, but its convergence properties are not as good. With an exact line search, the global convergence of the PRP conjugate gradient method was established by Polak and Ribière [8] for convex objective functions. However, Powell [13] gave a counterexample proving that there exist nonconvex functions on which the PRP conjugate gradient method does not converge globally, even with an exact line search. With the weak Wolfe-Powell line search, Gilbert and Nocedal [14] proposed a modified PRP conjugate gradient method obtained by restricting $\beta_k$ to be nonnegative and proved its global convergence under the hypothesis that the sufficient descent condition holds. Gilbert and Nocedal [14] also gave an example showing that $\beta_k$ may be negative even when the objective function is uniformly convex. When the strong Wolfe-Powell line search is used, Dai [15] gave an example showing that the PRP method cannot guarantee that the search direction at every step is a descent direction, even if the objective function is uniformly convex.
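To make the notation concrete, the following minimal Python/NumPy sketch (our illustration, not part of the original paper) computes Eq (4) and the nonnegative restriction of the PRP parameter used by Gilbert and Nocedal [14]; the function names are ours.

```python
import numpy as np

def beta_prp(gk, gk1):
    """Eq (4): beta_k^PRP = g_k^T y_{k-1} / ||g_{k-1}||^2, with y_{k-1} = g_k - g_{k-1}."""
    return np.dot(gk, gk - gk1) / np.dot(gk1, gk1)

def beta_prp_plus(gk, gk1):
    """The nonnegative restriction of Gilbert and Nocedal [14]: max{beta_k^PRP, 0}."""
    return max(beta_prp(gk, gk1), 0.0)
```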

Through the above observations and [13, 14, 16–18], we know that the following sufficient descent condition

$g_k^T d_k \leq -b\|g_k\|^2, \quad b > 0$ (5)

and the condition that $\beta_k$ is nonnegative are very important for establishing the global convergence of the conjugate gradient method.

The weak Wolfe-Powell (WWP) line search is designed to compute α k and is usually used for the global convergence analysis. The WWP line search is as follows

$f(x_k + \alpha_k d_k) \leq f(x_k) + \delta_1\alpha_k g_k^T d_k$ (6)

and

$g(x_k + \alpha_k d_k)^T d_k \geq \delta_2 g_k^T d_k$ (7)

where $\delta_1 \in (0, \frac{1}{2})$ and $\delta_2 \in (\delta_1, 1)$.
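For readers who wish to experiment with the methods below, the following Python/NumPy routine is a minimal sketch (our illustration, not the authors' code) of a standard bracketing/bisection scheme that returns a step length satisfying the WWP conditions Eqs (6) and (7); `f` and `g` are assumed to be callables returning the objective value and the gradient.

```python
import numpy as np

def wwp_line_search(f, g, x, d, delta1=0.2, delta2=0.8, alpha0=1.0, max_iter=50):
    """Return a step length satisfying the weak Wolfe-Powell conditions
    Eqs (6)-(7), using a simple bracketing/bisection strategy."""
    fx, gTd = f(x), np.dot(g(x), d)      # f(x_k) and g_k^T d_k
    lo, hi = 0.0, np.inf                 # current bracket for alpha
    alpha = alpha0
    for _ in range(max_iter):
        if f(x + alpha * d) > fx + delta1 * alpha * gTd:
            hi = alpha                   # Eq (6) violated: step too long
        elif np.dot(g(x + alpha * d), d) < delta2 * gTd:
            lo = alpha                   # Eq (7) violated: step too short
        else:
            return alpha                 # both WWP conditions hold
        alpha = 2.0 * lo if np.isinf(hi) else 0.5 * (lo + hi)
    return alpha                         # fallback after max_iter trials
```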

Recently, many new conjugate gradient methods ([19–28], etc.) that possess good properties have been proposed for solving unconstrained optimization problems.

In Section 2, we state the motivation behind our approach and give a new modified PRP conjugate gradient method and new algorithm for solving problem Eq (1). In Section 3, we prove that the search direction of our new algorithm satisfies the sufficient descent property and trust region property; moreover, we establish the global convergence of the new algorithm with the WWP line search. In Section 4, we provide numerical experiment results for some test problems.

New algorithm for unconstrained optimization

Wei et al. [29] gave a new PRP conjugate gradient method, usually called the WYL method. When the WWP line search is used, the WYL method converges globally under the sufficient descent condition. Zhang [30] gave a modified WYL method, called the NPRP method, as follows

$\beta_k^{NPRP} = \frac{\|g_k\|^2 - \frac{\|g_k\|}{\|g_{k-1}\|}|g_k^T g_{k-1}|}{\|g_{k-1}\|^2}$

The NPRP method possesses better convergence properties. The above formula for $y_{k-1}$ contains only gradient information, but some new $y_{k-1}$ formulas [31, 32] contain both gradient and function value information. Yuan et al. [32] proposed the following $y_{k-1}$ formula

$y_{k-1}^m = y_{k-1} + \frac{\max\{\rho_{k-1}, 0\}}{\|s_{k-1}\|^2}s_{k-1},$

and

$\rho_{k-1} = 2[f(x_{k-1}) - f(x_k)] + (g(x_k) + g(x_{k-1}))^T s_{k-1},$

where $s_{k-1} = x_k - x_{k-1}$.

Li and Qu [33] gave a modified PRP conjugate gradient method as follows

$\beta_k = \frac{g_k^T y_{k-1}}{\max\{t\|d_{k-1}\|, \|g_{k-1}\|^2\}}, \quad t > 0$

and

$d_k = -g_k - \beta_k\frac{g_k^T d_{k-1}}{\|g_k\|^2}g_k + \beta_k d_{k-1}, \quad d_0 = -g_0.$

Under suitable conditions, Li and Qu [33] proved that this modified PRP conjugate gradient method converges globally.

Motivated by the above discussions, we propose a new modified PRP conjugate gradient method as follows

$\beta_k^{BPRP} = \frac{\min\left\{|g_k^T y_{k-1}^m|, \; u_1\left(\|g_k\|^2 - \frac{\|g_k\|}{\|g_{k-1}\|}|g_k^T g_{k-1}|\right)\right\}}{u_2\|d_{k-1}\|\|y_{k-1}\| + \|g_{k-1}\|^2}$ (8)

and

$d_k = \begin{cases} -g_k & \text{if } k = 1 \\ -g_k - \beta_k^{BPRP}\frac{g_k^T d_{k-1}}{\|g_k\|^2}g_k + \beta_k^{BPRP} d_{k-1} & \text{if } k \geq 2 \end{cases}$ (9)

where $u_1 > 0$, $u_2 > 0$, and $y_{k-1}^m$ is defined as in [32].

Since $\|g_k\|^2 - \frac{\|g_k\|}{\|g_{k-1}\|}|g_k^T g_{k-1}| \geq 0$, it follows directly from the above formula that $\beta_k^{BPRP} \geq 0$. Next, we present the new algorithm and its flow diagram (Fig 1).

Fig 1. Flow diagram of Algorithm 2.1.

Algorithm 2.1

Step 0: Given the initial point $x_1 \in \Re^n$, $u_1 > 0$, $u_2 > 0$, $\varepsilon_1 \geq 0$, $0 < \delta_1 < \frac{1}{2}$, $\delta_1 < \delta_2 < 1$, set $d_1 = -\nabla f(x_1) = -g_1$ and $k := 1$.

Step 1: Calculate $g_k$; if $\|g_k\| \leq \varepsilon_1$, stop; otherwise, go to step 2.

Step 2: Calculate step length α k by the WWP line search.

Step 3: Set $x_{k+1} = x_k + \alpha_k d_k$, then calculate $g_{k+1}$; if $\|g_{k+1}\| \leq \varepsilon_1$, stop; otherwise, go to step 4.

Step 4: Calculate the scalar β k+1 by Eq (8) and calculate the search direction d k+1 by Eq (9).

Step 5: Set k: = k + 1; go to step 2.
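Assuming a WWP step-length routine is available (for instance, the `wwp_line_search` sketch given after Eq (7)), the following Python/NumPy fragment is a bare-bones rendering of Eqs (8)-(9) and of the main loop of Algorithm 2.1; it illustrates the formulas rather than reproducing the authors' implementation, and the helper names are ours.

```python
import numpy as np

def beta_bprp(gk, gk1, dk1, sk1, fk, fk1, u1=1.0, u2=2.0):
    """Eq (8).  gk, gk1 = g_k, g_{k-1}; dk1 = d_{k-1}; sk1 = x_k - x_{k-1};
    fk, fk1 = f(x_k), f(x_{k-1})."""
    yk1 = gk - gk1
    rho = 2.0 * (fk1 - fk) + np.dot(gk + gk1, sk1)            # rho_{k-1}
    ym = yk1 + max(rho, 0.0) / np.dot(sk1, sk1) * sk1          # y_{k-1}^m of [32]
    num = min(abs(np.dot(gk, ym)),
              u1 * (np.dot(gk, gk)
                    - np.linalg.norm(gk) / np.linalg.norm(gk1) * abs(np.dot(gk, gk1))))
    den = u2 * np.linalg.norm(dk1) * np.linalg.norm(yk1) + np.dot(gk1, gk1)
    return num / den

def direction_bprp(gk, beta, dk1):
    """Eq (9) for k >= 2."""
    return -gk - beta * np.dot(gk, dk1) / np.dot(gk, gk) * gk + beta * dk1

def algorithm_21(f, g, x1, eps1=1e-6, max_iter=1000):
    """Steps 0-5 of Algorithm 2.1 (requires a WWP line search routine)."""
    x = np.asarray(x1, dtype=float)
    fx, gx = f(x), g(x)
    d = -gx                                                    # Step 0
    for _ in range(max_iter):
        if np.linalg.norm(gx) <= eps1:                         # Steps 1 and 3
            break
        alpha = wwp_line_search(f, g, x, d)                    # Step 2
        x_new = x + alpha * d                                  # Step 3
        f_new, g_new = f(x_new), g(x_new)
        beta = beta_bprp(g_new, gx, d, x_new - x, f_new, fx)   # Step 4, Eq (8)
        d = direction_bprp(g_new, beta, d)                     # Step 4, Eq (9)
        x, fx, gx = x_new, f_new, g_new                        # Step 5
    return x
```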

Global convergence analysis

Some suitable assumptions are often used to analyze the global convergence of the conjugate gradient method. Here, we state them as follows.

Assumption 3.1

  1. The level set $\Omega = \{x \in \Re^n \mid f(x) \leq f(x_1)\}$ is bounded.

  2. In some neighborhood H of Ω, f is a continuously differentiable function, and the gradient function g of f is Lipschitz continuous, namely, there exists a constant L > 0 such that
    $\|g(x) - g(y)\| \leq L\|x - y\|, \quad \forall x, y \in H$ (10)

By Assumption 3.1, it is easy to obtain that there exist two constants A > 0 and η 1 > 0 satisfying

$\|x\| \leq A, \quad \|g(x)\| \leq \eta_1, \quad \forall x \in \Omega$ (11)

Lemma 0.1 Let the sequence {d k} be generated by Eq (9); then, we have

$g_k^T d_k = -\|g_k\|^2, \quad \forall k \geq 1$ (12)

Proof When $k = 1$, we obtain $g_1^T d_1 = -\|g_1\|^2$ from Eq (9), so Eq (12) holds. When $k \geq 2$, we obtain

$g_k^T d_k = g_k^T\left(-g_k - \beta_k^{BPRP}\frac{g_k^T d_{k-1}}{\|g_k\|^2}g_k + \beta_k^{BPRP}d_{k-1}\right) = -\|g_k\|^2 - \beta_k^{BPRP}g_k^T d_{k-1} + \beta_k^{BPRP}g_k^T d_{k-1} = -\|g_k\|^2,$

since the second and third terms cancel. The proof is complete.

We know directly from the above lemma that our new method has the sufficient descent property.

Lemma 0.2 Let the sequence {x k} and {d k, g k} be generated by Algorithm 2.1, and suppose that Assumption 3.1 holds; then, we can obtain

$\sum_{k=1}^{\infty}\frac{(g_k^T d_k)^2}{\|d_k\|^2} < +\infty$ (13)

Proof By Eq (7) and the Cauchy-Schwarz inequality, we have

$-(1 - \delta_2)g_k^T d_k \leq (g_{k+1} - g_k)^T d_k \leq \|g_{k+1} - g_k\|\|d_k\|.$

Combining the above inequality with Assumption 3.1 (2) generates

$-(1 - \delta_2)g_k^T d_k \leq L\alpha_k\|d_k\|^2.$

It is easy to see that $g_k^T d_k \leq 0$ by Lemma 0.1, so the above inequality provides a lower bound on $\alpha_k$. By combining this bound with Eq (6), we obtain

$f_k - f_{k+1} \geq \frac{\delta_1(1 - \delta_2)}{L}\frac{(g_k^T d_k)^2}{\|d_k\|^2}.$

Summing up the above inequalities from k = 1 to k = ∞, we can deduce that

$\frac{\delta_1(1 - \delta_2)}{L}\sum_{k=1}^{\infty}\frac{(g_k^T d_k)^2}{\|d_k\|^2} \leq f_1 - f_{\infty}.$

By Eq (6), Assumption 3.1 and Lemma 0.1, we know that $\{f_k\}$ is bounded below, so we obtain

$\sum_{k=1}^{\infty}\frac{(g_k^T d_k)^2}{\|d_k\|^2} < +\infty.$

This finishes the proof.

Eq (13) is usually called the Zoutendijk condition [34], and it is very important for establishing global convergence.

Lemma 0.3 Let the sequence $\{\beta_k, d_k\}$ be generated by Algorithm 2.1; then, we have

$\|d_k\| \leq N\|g_k\|$ (14)

where $N = 1 + \frac{4u_1}{u_2}$.

Proof When $d_k = 0$, we directly get $g_k = 0$ from Eq (12). When $d_k \neq 0$, by the Cauchy-Schwarz inequality, we can easily obtain

$\|g_k\|^2 - \frac{\|g_k\|}{\|g_{k-1}\|}|g_k^T g_{k-1}| \leq g_k^T\left(g_k - \frac{\|g_k\|}{\|g_{k-1}\|}g_{k-1}\right)$

and

$g_k^T\left(g_k - \frac{\|g_k\|}{\|g_{k-1}\|}g_{k-1}\right) \leq \|g_k\|\left\|(g_k - g_{k-1}) + \left(g_{k-1} - \frac{\|g_k\|}{\|g_{k-1}\|}g_{k-1}\right)\right\| \leq 2\|g_k\|\|g_k - g_{k-1}\|.$

We can obtain

$\|g_k\|^2 - \frac{\|g_k\|}{\|g_{k-1}\|}|g_k^T g_{k-1}| \leq 2\|g_k\|\|y_{k-1}\|.$

Using Eq (8), we have

$|\beta_k^{BPRP}| \leq \frac{u_1\left(\|g_k\|^2 - \frac{\|g_k\|}{\|g_{k-1}\|}|g_k^T g_{k-1}|\right)}{u_2\|d_{k-1}\|\|y_{k-1}\|} \leq \frac{2u_1\|g_k\|}{u_2\|d_{k-1}\|}.$

Finally, when k ≥ 2 by Eq (9), we have

$\|d_k\| \leq \|g_k\| + |\beta_k^{BPRP}|\frac{|g_k^T d_{k-1}|}{\|g_k\|^2}\|g_k\| + |\beta_k^{BPRP}|\|d_{k-1}\| \leq \|g_k\| + \frac{2u_1}{u_2}\|g_k\| + \frac{2u_1}{u_2}\|g_k\| = \left(1 + \frac{4u_1}{u_2}\right)\|g_k\|.$

Let $N = 1 + \frac{4u_1}{u_2}$; we obtain $\|d_k\| \leq N\|g_k\|$. This finishes the proof.

This lemma also shows that the search direction of our algorithm has the trust region property.

Theorem 0.1 Let the sequence {d k, g k, β k} and {x k} be generated by Algorithm 2.1. Suppose that Assumption 3.1 holds; then

$\lim_{k\to\infty}\|g_k\| = 0$ (15)

Proof By Eqs (12) and (13), we obtain

$\sum_{k=1}^{\infty}\frac{\|g_k\|^4}{\|d_k\|^2} < +\infty$ (16)

By Eq (14), we have $\|d_k\|^2 \leq N^2\|g_k\|^2$; then, we obtain

$\frac{\|g_k\|^2}{N^2} \leq \frac{\|g_k\|^4}{\|d_k\|^2},$

which together with Eq (16) can yield

$\sum_{k=1}^{\infty}\frac{\|g_k\|^2}{N^2} \leq \sum_{k=1}^{\infty}\frac{\|g_k\|^4}{\|d_k\|^2} < +\infty.$

From the above inequality, we can obtain $\lim_{k\to\infty}\|g_k\| = 0$. The proof is finished.

Numerical Results

When β k+1 and d k+1 are calculated by Eqs (4) and (3), respectively, in step 4 of Algorithm 2.1, we call it the PRP conjugate gradient algorithm. We test Algorithm 2.1 and the PRP conjugate gradient algorithm using some benchmark problems. The test environment is MATLAB 7.0, on a Windows 7 system. The initial parameters are given by

$u_1 = 1$, $u_2 = 2$, $\delta_1 = 0.2$, $\delta_2 = 0.8$, $\varepsilon_1 = 10^{-6}$.

We use the following Himmelblau stopping rule: if $|f(x_k)| \leq \varepsilon_2$, let $stop1 = |f(x_k) - f(x_{k+1})|$; otherwise, let $stop1 = \frac{|f(x_k) - f(x_{k+1})|}{|f(x_k)|}$. The test program is stopped if $stop1 < \varepsilon_3$ or $\|g(x_k)\| < \varepsilon_1$ is satisfied, where $\varepsilon_2 = \varepsilon_3 = 10^{-6}$. The program is also stopped when the total number of iterations is greater than one thousand. The test results are given in Tables 1 and 2: $x_1$ denotes the initial point, Dim denotes the dimension of the test function, NI denotes the total number of iterations, and NFG = NF + NG (NF and NG denote the number of function evaluations and the number of gradient evaluations, respectively). $f'$ denotes the function value when the program is stopped. The test problems are defined as follows (a code sketch of the stopping rule and of one test problem is given after the list).

  1. Schwefel function:
    $f_{Sch}(x) = 418.9829n + \sum_{i=1}^n x_i\sin\sqrt{|x_i|}, \quad x_i \in [-512.03, 511.97],$
    $x^* = (-420.9687, -420.9687, \ldots, -420.9687), \quad f_{Sch}(x^*) = 0.$
  2. Langerman function:
    $f_{Lan}(x) = -\sum_{i=1}^m c_i e^{-\frac{1}{\pi}\sum_{j=1}^n(x_j - a_{ij})^2}\cos\left(\pi\sum_{j=1}^n(x_j - a_{ij})^2\right), \quad x_i \in [0, 10], \quad m = n,$
    $x^* = \text{random}, \quad f_{Lan}(x^*) = \text{random}.$
  3. Schwefel's function:
    $f_{SchDS}(x) = \sum_{i=1}^n\left(\sum_{j=1}^i x_j\right)^2, \quad x_i \in [-65.536, 65.536],$
    $x^* = (0, 0, \ldots, 0), \quad f_{SchDS}(x^*) = 0.$
  4. Sphere function:
    $f_{Sph}(x) = \sum_{i=1}^n x_i^2, \quad x_i \in [-5.12, 5.12],$
    $x^* = (0, 0, \ldots, 0), \quad f_{Sph}(x^*) = 0.$
  5. Griewangk function:
    $f_{Gri}(x) = 1 + \sum_{i=1}^n\frac{x_i^2}{4000} - \prod_{i=1}^n\cos\left(\frac{x_i}{\sqrt{i}}\right), \quad x_i \in [-600, 600],$
    $x^* = (0, 0, \ldots, 0), \quad f_{Gri}(x^*) = 0.$
  6. Rosenbrock function:
    $f_{Ros}(x) = \sum_{i=1}^{n-1}\left[100(x_{i+1} - x_i^2)^2 + (x_i - 1)^2\right], \quad x_i \in [-2.048, 2.048],$
    $x^* = (1, \ldots, 1), \quad f_{Ros}(x^*) = 0.$
  7. Ackley function:
    $f_{Ack}(x) = 20 + e - 20e^{-0.2\sqrt{\frac{1}{n}\sum_{i=1}^n x_i^2}} - e^{\frac{1}{n}\sum_{i=1}^n\cos(2\pi x_i)}, \quad x_i \in [-30, 30],$
    $x^* = (0, 0, \ldots, 0), \quad f_{Ack}(x^*) = 0.$
  8. Rastrigin function:
    $f_{Ras}(x) = 10n + \sum_{i=1}^n\left(x_i^2 - 10\cos(2\pi x_i)\right), \quad x_i \in [-5.12, 5.12],$
    $x^* = (0, 0, \ldots, 0), \quad f_{Ras}(x^*) = 0.$
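As a concrete illustration, the Himmelblau-type stopping test described above and one of the benchmark problems (the Rosenbrock function, problem 6) can be coded in Python/NumPy as follows; the gradient formula is our own derivation and is not stated in the paper.

```python
import numpy as np

def himmelblau_stop(f_prev, f_curr, eps2=1e-6, eps3=1e-6):
    """Stop when the (relative) decrease of f between iterations is tiny."""
    if abs(f_prev) <= eps2:
        stop1 = abs(f_prev - f_curr)
    else:
        stop1 = abs(f_prev - f_curr) / abs(f_prev)
    return stop1 < eps3

def rosenbrock(x):
    """Problem 6: f_Ros(x) = sum_i [100(x_{i+1} - x_i^2)^2 + (x_i - 1)^2]."""
    return np.sum(100.0 * (x[1:] - x[:-1] ** 2) ** 2 + (x[:-1] - 1.0) ** 2)

def rosenbrock_grad(x):
    """Analytic gradient of the Rosenbrock function (our derivation)."""
    grad = np.zeros_like(x)
    grad[:-1] = -400.0 * x[:-1] * (x[1:] - x[:-1] ** 2) + 2.0 * (x[:-1] - 1.0)
    grad[1:] += 200.0 * (x[1:] - x[:-1] ** 2)
    return grad

# Example usage with the algorithm_21 sketch shown earlier:
#   x = algorithm_21(rosenbrock, rosenbrock_grad, 1.001 * np.ones(50))
```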

Table 1. Test results for Algorithm 2.1.

Problems Dim x 1 NI/NFG f′
1 50 (-426,-426,…,-426) 2/9 6.363783e-004
120 (-426,-426,…,-426) 2/9 1.527308e-003
200 (-426,-426,…,-426) 2/9 2.545514e-003
1000 (-410,-410,…,-410) 3/12 1.272757e-002
2 50 (3,3,…,3) 0/2 -1.520789e-060
120 (5,5,…,5) 0/2 0.000000e+000
200 (6,6,…,6) 0/2 0.000000e+000
1000 (1,1,…,1) 0/2 -7.907025e-136
3 50 (-0.00001,0,-0.00001,0,…) 2/8 1.561447e-009
120 (-0.00001,0,-0.00001,0,…) 2/8 1.769900e-008
200 (-0.00001,0,-0.00001,0,…) 2/8 7.906818e-008
1000 (0.000001,0,0.000001,0,…) 2/8 9.619586e-008
4 50 (-4,-4,…,-4) 1/6 1.577722e-028
120 (-2,-2,…,-2) 1/6 3.786532e-028
200 (1,1,…,1) 1/6 7.730837e-027
1000 (3,3,…,3) 1/6 1.079951e-024
5 50 (-7,0,-7,0,…) 2/10 0.000000e+000
120 (0.592,0,0.592,0,…) 4/14 3.183458e-007
200 (0.451,0,0.451,0,…) 4/14 3.476453e-007
1000 (0.38,0,0.38,0,…) 1/6 0.000000e+000
6 50 (1.001,1.001,…,1.001) 2/36 4.925508e-003
120 (1.001,1.001,…,1.001) 2/36 1.198551e-002
200 (1.001,1.001,…,1.001) 2/36 2.006158e-002
1000 (1.001,1.001,…,1.001) 2/36 1.009107e-001
7 50 (0.01,0,0.01,0,…) 0/2 3.094491e-002
120 (-0.05,0,-0.05,0,…) 0/2 2.066363e-001
200 (0.01,0,0.01,0,…) 0/2 3.094491e-002
1000 (0.07,0,0.07,0,…) 0/2 3.233371e-001
8 50 (0.003,0.003,…,0.003) 3/26 0.000000e+000
120 (0.005,0.005,…,0.005) 2/9 0.000000e+000
200 (0.006,0,0.006,0,…) 2/9 0.000000e+000
1000 (0.015,0.015,…,0.015) 2/8 0.000000e+000

Table 2. Test results for the PRP conjugate gradient algorithm.

Problems Dim x 1 NI/NFG f′
1 50 (-426,-426,…,-426) 2/24 6.363783e-004
120 (-426,-426,…,-426) 2/11 1.527308e-003
200 (-426,-426,…,-426) 3/41 2.545514e-003
1000 (-410,-410,…,-410) 3/41 1.272757e-002
2 50 (3,3,…,3) 0/2 -1.520789e-060
120 (5,5,…,5) 0/2 0.000000e+000
200 (6,6,…,6) 0/2 0.000000e+000
1000 (1,1,…,1) 0/2 -7.907025e-136
3 50 (-0.00001,0,-0.00001,0,…) 2/8 1.516186e-009
120 (-0.00001,0,-0.00001,0,…) 2/8 1.701075e-008
200 (-0.00001,0,-0.00001,0,…) 2/8 7.579825e-008
1000 (0.000001,0,0.000001,0,…) 2/8 9.198262e-008
4 50 (-4,-4,…,-4) 1/6 1.577722e-028
120 (-2,-2,…,-2) 1/6 3.786532e-028
200 (1,1,…,1) 1/6 7.730837e-027
1000 (3,3,…,3) 1/6 1.079951e-024
5 50 (-7,0,-7,0,…) 4/16 3.597123e-013
120 (0.592,0,0.592,0,…) 5/17 3.401145e-007
200 (0.451,0,0.451,0,…) 5/17 4.566281e-007
1000 (0.38,0,0.38,0,…) 1/6 0.000000e+000
6 50 (1.001,1.001,…,1.001) 2/36 4.925508e-003
120 (1.001,1.001,…,1.001) 2/36 1.198551e-002
200 (1.001,1.001,…,1.001) 2/36 2.006158e-002
1000 (1.001,1.001,…,1.001) 2/36 1.009107e-001
7 50 (0.01,0,0.01,0,…) 0/2 3.094491e-002
120 (-0.05,0,-0.05,0,…) 0/2 2.066363e-001
200 (0.01,0,0.01,0,…) 0/2 3.094491e-002
1000 (0.07,0,0.07,0,…) 0/2 3.233371e-001
8 50 (0.003,0.003,…,0.003) 2/10 0.000000e+000
120 (0.005,0.005,…,0.005) 2/10 0.000000e+000
200 (0.006,0,0.006,0,…) 2/10 0.000000e+000
1000 (0.015,0.015,…,0.015) 2/22 3.636160e-009

It is easy to see that the two algorithms are effective for the eight test problems listed in Tables 1 and 2. We use the performance profile tool of Dolan and Moré [35] to analyze the numerical performance of the two algorithms.

For these eight test problems, Fig 2 shows the numerical performance of the two algorithms when NI is considered, and Fig 3 shows the numerical performance when NFG is considered. From these two figures, it is easy to see that Algorithm 2.1 yields a better numerical performance than the PRP conjugate gradient algorithm on the whole. From Tables 1 and 2 and the two figures, we can conclude that Algorithm 2.1 is effective and competitive for solving unconstrained optimization problems.

Fig 2. Performance profiles of the two algorithms (NI).


Fig 3. Performance profiles of the two algorithms (NFG).


A new algorithm is given for solving nonlinear equations in the next section. The sufficient descent property and the trust region property of the new algorithm are proved in Section 6; moreover, we establish the global convergence of the new algorithm. In Section 7, the numerical results are presented.

New algorithm for nonlinear equations

We consider the system of nonlinear equations

$q(x) = 0, \quad x \in \Re^n,$ (17)

where $q : \Re^n \to \Re^n$ is a continuously differentiable and monotonic function. $\nabla q(x)$ denotes the Jacobian matrix of $q(x)$; if $\nabla q(x)$ is symmetric, we call Eq (17) a system of symmetric nonlinear equations. As $q(x)$ is monotonic, the following inequality

$(q(x) - q(y))^T(x - y) \geq 0, \quad \forall x, y \in \Re^n$

holds. If a norm function is defined as follows

$h(x) = \frac{1}{2}\|q(x)\|^2$

and we define the unconstrained optimization problem as follows,

$\min h(x), \quad x \in \Re^n,$ (18)

then we know directly that problem Eq (17) is equivalent to problem Eq (18).

The iterative formula Eq (2) is also used in many algorithms for solving problem Eq (17). Many algorithms ([36–41], etc.) have been proposed for solving special classes of nonlinear equations. We are more interested in handling large-scale nonlinear equations. By Eq (2), it is easy to see that the step size $\alpha_k$ and the search direction $d_k$ are the two factors that matter most when dealing with large-scale problems. For large-scale nonlinear equations and unconstrained optimization problems, there are many popular methods ([38, 42–46], etc.) for computing $d_k$, such as conjugate gradient methods, spectral gradient methods, and limited-memory quasi-Newton approaches. Some new line search methods [37, 47] have been proposed for calculating $\alpha_k$. Li and Li [48] provide the following derivative-free line search method

$-q(x_k + \alpha_k d_k)^T d_k \geq \sigma_3\alpha_k\|q(x_k + \alpha_k d_k)\|\|d_k\|^2,$ (19)

where $\alpha_k = \max\{\gamma, \rho\gamma, \rho^2\gamma, \ldots\}$ is the largest such trial value satisfying Eq (19), $\rho \in (0, 1)$, $\sigma_3 > 0$ and $\gamma > 0$. This line search method is very effective for solving large-scale nonlinear monotonic equations.
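The following Python/NumPy routine is a minimal sketch of this backtracking rule as we read Eq (19); the default parameter values match the ones used later in the numerical section, and the safeguard of accepting the last trial step after a fixed number of backtracks mirrors the one described there.

```python
import numpy as np

def derivative_free_line_search(q, x, d, gamma=1.0, rho=0.1, sigma3=0.02,
                                max_backtracks=15):
    """Find alpha_k = max{gamma, rho*gamma, rho^2*gamma, ...} satisfying Eq (19):
    -q(x + alpha d)^T d >= sigma3 * alpha * ||q(x + alpha d)|| * ||d||^2."""
    alpha = gamma
    d_norm2 = np.dot(d, d)
    for _ in range(max_backtracks):
        qw = q(x + alpha * d)
        if -np.dot(qw, d) >= sigma3 * alpha * np.linalg.norm(qw) * d_norm2:
            return alpha
        alpha *= rho                     # shrink the trial step
    return alpha                         # safeguard: accept the last trial step
```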

Solodov and Svaiter [49] presented a hybrid projection-proximal point algorithm that overcomes some drawbacks of applying the form Eq (18) to nonlinear equations. Yuan et al. [50] proposed a three-term PRP conjugate gradient algorithm using the projection technique introduced by Solodov and Svaiter [51]. The projection technique is very effective for solving nonlinear equations. It combines a method for computing the search direction $d_k$ with a line search method for calculating $\alpha_k$ that satisfies

$q(w_k)^T(x_k - w_k) > 0,$

where $w_k = x_k + \alpha_k d_k$. For any $x^*$ that satisfies $q(x^*) = 0$, since $q(x)$ is monotonic, we can obtain

$q(w_k)^T(x^* - w_k) \leq 0.$

Thus, the current iterate $x_k$ is strictly separated from the zeros of the system of equations Eq (17) by the following hyperplane

$T_k = \{x \in \Re^n \mid q(w_k)^T(x - w_k) = 0\}.$

Then, the iterate x k+1 can be obtained by projecting x k onto the above hyperplane. The projection formula can be set as follows

$x_{k+1} = x_k - \frac{q(w_k)^T(x_k - w_k)}{\|q(w_k)\|^2}q(w_k)$ (20)
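For completeness, the projection step Eq (20) amounts to the following small Python/NumPy helper (an illustration, not the authors' code):

```python
import numpy as np

def project_iterate(xk, wk, q_wk):
    """Eq (20): project x_k onto the hyperplane T_k; q_wk = q(w_k)."""
    return xk - np.dot(q_wk, xk - wk) / np.dot(q_wk, q_wk) * q_wk
```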

Yuan et al. [50] present a three-term Polak-Ribière-Polyak conjugate gradient algorithm in which the search direction d k is defined as follows

$d_k = \begin{cases} -q_k & \text{if } k = 0 \\ -q_k + \frac{q_k^T y_{k-1}d_{k-1} - q_k^T d_{k-1}y_{k-1}}{\max\{\mu\|d_{k-1}\|\|y_{k-1}\|, \|q_{k-1}\|^2\}} & \text{if } k \geq 1 \end{cases}$

where $y_{k-1} = q_k - q_{k-1}$. The derivative-free line search method [48] and the projection technique are used by the algorithm in [50], which has proved to be very suitable for solving large-scale nonlinear equations. The most attractive property of the algorithm in [50] is the trust region property of $d_k$.

Motivated by our new modified PRP conjugate gradient formula proposed in Section 2, we propose the following modified PRP conjugate gradient formula

$\beta_k^* = \frac{\min\left\{|q_k^T(q_k - q_{k-1})|, \; u_3\left(\|q_k\|^2 - \frac{\|q_k\|}{\|q_{k-1}\|}|q_k^T q_{k-1}|\right)\right\}}{u_4\|d_{k-1}\|\|q_k - q_{k-1}\| + \|q_{k-1}\|^2}$ (21)

and

$d_k = \begin{cases} -q_k & \text{if } k = 1 \\ -q_k - \beta_k^*\frac{q_k^T d_{k-1}}{\|q_k\|^2}q_k + \beta_k^* d_{k-1} & \text{if } k \geq 2 \end{cases}$ (22)

where $u_3 > 0$ and $u_4 > 0$. It is easy to see that $\beta_k^* \geq 0$. Motivated by the above observations and [50], we present a new algorithm for solving problem Eq (17) that uses the modified PRP conjugate gradient formulas Eqs (21) and (22). Here, we list the new algorithm and its flow diagram (Fig 4).

Fig 4. Flow diagram of Algorithm 5.1.

Algorithm 5.1

Step 1: Given the initial point $x_1 \in \Re^n$, $\varepsilon_4 > 0$, $\rho \in (0, 1)$, $\sigma_3 > 0$, $\gamma > 0$, $u_3 > 0$, $u_4 > 0$, set $k := 1$.

Step 2: If $\|q(x_k)\| \leq \varepsilon_4$, stop; otherwise, go to step 3.

Step 3: Compute d k by Eq (22) and calculate α k by Eq (19)

Step 4: Set the next iterate to be w k = x k + α k d k;

Step 5: If $\|q(w_k)\| \leq \varepsilon_4$, stop and set $x_{k+1} = w_k$; otherwise, calculate $x_{k+1}$ by Eq (20).

Step 6: Set k: = k + 1; go to step 2.
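Combining Eqs (21), (22), the line search Eq (19) and the projection Eq (20), a minimal Python/NumPy sketch of Algorithm 5.1 might look as follows; it reuses the `derivative_free_line_search` and `project_iterate` helpers sketched earlier and is an illustration of the steps rather than the authors' implementation.

```python
import numpy as np

def beta_star(qk, qk1, dk1, u3=1.0, u4=0.02):
    """Eq (21): modified PRP parameter for nonlinear equations."""
    yk1 = qk - qk1
    num = min(abs(np.dot(qk, yk1)),
              u3 * (np.dot(qk, qk)
                    - np.linalg.norm(qk) / np.linalg.norm(qk1) * abs(np.dot(qk, qk1))))
    den = u4 * np.linalg.norm(dk1) * np.linalg.norm(yk1) + np.dot(qk1, qk1)
    return num / den

def algorithm_51(q, x1, eps4=1e-5, max_iter=1500):
    """Steps 1-6 of Algorithm 5.1 (requires derivative_free_line_search
    and project_iterate from the earlier sketches)."""
    x = np.asarray(x1, dtype=float)
    qx = q(x)
    d = -qx                                               # Eq (22), k = 1
    for _ in range(max_iter):
        if np.linalg.norm(qx) <= eps4:                    # Step 2
            break
        alpha = derivative_free_line_search(q, x, d)      # Step 3, Eq (19)
        w = x + alpha * d                                 # Step 4
        qw = q(w)
        if np.linalg.norm(qw) <= eps4:                    # Step 5, first branch
            x, qx = w, qw
            break
        x_new = project_iterate(x, w, qw)                 # Step 5, Eq (20)
        q_new = q(x_new)
        beta = beta_star(q_new, qx, d)                    # Eq (21)
        d = (-q_new
             - beta * np.dot(q_new, d) / np.dot(q_new, q_new) * q_new
             + beta * d)                                  # Eq (22), k >= 2
        x, qx = x_new, q_new                              # Step 6
    return x
```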

Convergence Analysis

When we analyze the global convergence of Algorithm 5.1, we require the following suitable assumptions.

Assumption 6.1

  1. The solution set of the problem Eq (17) is nonempty.

  2. q(x) is Lipschitz continuous, namely, there exists a constant E > 0 such that
    $\|q(x) - q(y)\| \leq E\|x - y\|, \quad \forall x, y \in \Re^n.$

By Assumption 6.1, it is easy to obtain that there exists a positive constant ζ that satisfies

$\|q(x_k)\| \leq \zeta$ (23)

Lemma 0.4 Let the sequence $\{d_k\}$ be generated by Eq (22); then, we can obtain

$q_k^T d_k = -\|q_k\|^2$ (24)

and

$\|q_k\| \leq \|d_k\| \leq \left(1 + \frac{4u_3}{u_4}\right)\|q_k\|$ (25)

Proof As the proof is similar to Lemma 0.1 and Lemma 0.3 of this paper, we omit it here.

Similar to Lemma 3.1 of [50] and Theorem 2.1 of [51], it is easy to obtain the following lemma. Here, we omit the proof and only state the result.

Lemma 0.5 Suppose that Assumption 6.1 holds and $x^*$ is a solution of problem Eq (17), i.e., $q(x^*) = 0$. Let the sequence $\{x_k\}$ be generated by Algorithm 5.1; then, $\{x_k\}$ is a bounded sequence and

$\|x_{k+1} - x^*\|^2 \leq \|x_k - x^*\|^2 - \|x_{k+1} - x_k\|^2$

holds. Moreover, either $\{x_k\}$ is an infinite sequence and

$\sum_{k=1}^{\infty}\|x_{k+1} - x_k\|^2 < \infty,$

or $\{x_k\}$ is a finite sequence and the last iterate is a solution of problem Eq (17).

Lemma 0.6 Suppose that Assumption 6.1 holds; then, an iterate $x_{k+1} = x_k + \alpha_k d_k$ will be generated by Algorithm 5.1 in a finite number of backtracking steps.

Proof We will obtain this conclusion by contradiction. Suppose that $\|q_k\| \to 0$ does not hold; then, there exists a positive constant $\varepsilon_5$ that satisfies

$\|q_k\| \geq \varepsilon_5, \quad \forall k \geq 1.$ (26)

Suppose that there exists some iterate index $k$ for which the condition Eq (19) cannot be satisfied. Let $\alpha_k^{(c)} = \rho^c\gamma$; then, we obtain

$-q(x_k + \alpha_k^{(c)}d_k)^T d_k < \sigma_3\alpha_k^{(c)}\|q(x_k + \alpha_k^{(c)}d_k)\|\|d_k\|^2, \quad \forall c \in N^*\cup\{0\}.$

By Assumption 6.1 (2) and Eq (24), we find

$\|q_k\|^2 = -q_k^T d_k = [q(x_k + \alpha_k^{(c)}d_k) - q(x_k)]^T d_k - q(x_k + \alpha_k^{(c)}d_k)^T d_k < \left[E + \sigma_3\|q(x_k + \alpha_k^{(c)}d_k)\|\right]\alpha_k^{(c)}\|d_k\|^2.$

By Eqs (23) and (25), we can obtain

$\|q(x_k + \alpha_k^{(c)}d_k)\| \leq \|q(x_k + \alpha_k^{(c)}d_k) - q_k\| + \|q_k\| \leq E\alpha_k^{(c)}\|d_k\| + \zeta \leq E\gamma\zeta\left(1 + \frac{4u_3}{u_4}\right) + \zeta.$

Thus, we obtain

$\alpha_k^{(c)} > \frac{\varepsilon_5^2 u_4^2}{\left[E + \sigma_3\left(E\gamma\zeta\left(1 + \frac{4u_3}{u_4}\right) + \zeta\right)\right](u_4 + 4u_3)^2\zeta^2}, \quad \forall c \in N^*\cup\{0\},$

which shows that $\alpha_k^{(c)}$ is bounded below. This contradicts the definition of $\alpha_k^{(c)}$; so, the lemma holds.

Similar to Theorem 3.1 of [50], we list the following theorem but omit its proof.

Theorem 0.2 Let the sequence {x k+1, q k+1} and {α k, d k} be generated by Algorithm 5.1. Suppose that Assumption 6.1 holds; then, we have

$\liminf_{k\to\infty}\|q_k\| = 0.$ (27)

Numerical results

When the following d k formula of the famous PRP conjugate gradient method [8, 9]

$d_k = \begin{cases} -q_k & \text{if } k = 1 \\ -q_k + \frac{q_k^T(q_k - q_{k-1})}{\|q_{k-1}\|^2}d_{k-1} & \text{if } k \geq 2 \end{cases}$

is used to compute $d_k$ in step 3 of Algorithm 5.1, the resulting method is called the PRP algorithm. We test Algorithm 5.1 and the PRP algorithm on some problems in this section. The test environment is MATLAB 7.0 on a Windows 7 system. The initial parameters are given by

$\sigma_3 = u_4 = 0.02$, $\gamma = 1$, $\rho = 0.1$, $u_3 = 1$, $\varepsilon_4 = 10^{-5}$.

The test program is also stopped when the number of iterations reaches one thousand five hundred. The test results are given in Tables 3 and 4. As we know, when the line search cannot guarantee that $d_k$ satisfies $q_k^T d_k < 0$, an uphill search direction may be produced, and the line search method may fail in this case. To prevent this situation, when the number of trials in the inner cycle of our program reaches fifteen, we accept the current $\alpha_k$. NG and NI stand for the number of gradient evaluations and the number of iterations, respectively. Dim denotes the dimension of the test function, and cputime denotes the CPU time in seconds. GF denotes the final function norm when the program terminates. The test functions all have the following form

$q(x) = (f_1(x), f_2(x), \ldots, f_n(x))^T;$

the concrete function definitions are given as follows (a code sketch of one of them appears after the list).

  • Function 1. Exponential function 2:
    $f_1(x) = e^{x_1} - 1, \quad f_i(x) = \frac{i}{10}\left(e^{x_i} + x_{i-1} - 1\right), \quad i = 2, 3, \ldots, n.$
    Initial guess: $x_0 = \left(\frac{1}{n^2}, \frac{1}{n^2}, \ldots, \frac{1}{n^2}\right)^T$.
  • Function 2. Trigonometric function:
    $f_i(x) = 2\left(n + i(1 - \cos x_i) - \sin x_i - \sum_{k=1}^n\cos x_k\right)(2\sin x_i - \cos x_i), \quad i = 1, 2, \ldots, n.$
    Initial guess: $x_0 = \left(\frac{101}{100n}, \frac{101}{100n}, \ldots, \frac{101}{100n}\right)^T$.
  • Function 3. Logarithmic function:
    $f_i(x) = \ln(x_i + 1) - \frac{x_i}{n}, \quad i = 1, 2, 3, \ldots, n.$
    Initial guess: $x_0 = (1, 1, \ldots, 1)^T$.
  • Function 4. Broyden Tridiagonal function [[52], pp. 471–472]:
    $f_1(x) = (3 - 0.5x_1)x_1 - 2x_2 + 1,$
    $f_i(x) = (3 - 0.5x_i)x_i - x_{i-1} + 2x_{i+1} + 1, \quad i = 2, 3, \ldots, n-1,$
    $f_n(x) = (3 - 0.5x_n)x_n - x_{n-1} + 1.$
    Initial guess: $x_0 = (-1, -1, \ldots, -1)^T$.
  • Function 5. Strictly convex function 1 [[44], p. 29]:
    $q(x)$ is the gradient of $h(x) = \sum_{i=1}^n(e^{x_i} - x_i)$, i.e.,
    $f_i(x) = e^{x_i} - 1, \quad i = 1, 2, 3, \ldots, n.$
    Initial guess: $x_0 = \left(\frac{1}{n}, \frac{2}{n}, \ldots, 1\right)^T$.
  • Function 6. Variable dimensioned function:
    $f_i(x) = x_i - 1, \quad i = 1, 2, 3, \ldots, n-2,$
    $f_{n-1}(x) = \sum_{j=1}^{n-2}j(x_j - 1), \quad f_n(x) = \left(\sum_{j=1}^{n-2}j(x_j - 1)\right)^2.$
    Initial guess: $x_0 = \left(1 - \frac{1}{n}, 1 - \frac{2}{n}, \ldots, 0\right)^T$.
  • Function 7. Discrete boundary value problem [53]:
    $f_1(x) = 2x_1 + 0.5h^2(x_1 + h)^3 - x_2,$
    $f_i(x) = 2x_i + 0.5h^2(x_i + hi)^3 - x_{i-1} + x_{i+1}, \quad i = 2, 3, \ldots, n-1,$
    $f_n(x) = 2x_n + 0.5h^2(x_n + hn)^3 - x_{n-1}, \quad h = \frac{1}{n+1}.$
    Initial guess: $x_0 = (h(h-1), h(2h-1), \ldots, h(nh-1))^T$.
  • Function 8. Troesch problem [54]:
    $f_1(x) = 2x_1 + \varrho h^2\sinh(\varrho x_1) - x_2,$
    $f_i(x) = 2x_i + \varrho h^2\sinh(\varrho x_i) - x_{i-1} - x_{i+1}, \quad i = 2, 3, \ldots, n-1,$
    $f_n(x) = 2x_n + \varrho h^2\sinh(\varrho x_n) - x_{n-1}, \quad h = \frac{1}{n+1}, \quad \varrho = 10.$
    Initial guess: $x_0 = (0, 0, \ldots, 0)^T$.

Table 3. Test results for Algorithm 5.1.

Function Dim NI/NG cputime GF
1 3000 55/209 2.043613 9.850811e-006
5000 8/33 0.858005 6.116936e-006
30000 26/127 100.792246 8.983556e-006
45000 7/36 62.681202 7.863794e-006
50000 5/26 56.659563 5.807294e-006
2 3000 43/86 1.076407 8.532827e-006
5000 42/84 2.745618 8.256326e-006
30000 38/76 73.039668 8.065468e-006
45000 37/74 164.284653 8.064230e-006
50000 36/72 201.288090 9.519786e-006
3 3000 5/6 0.093601 1.009984e-008
5000 5/6 0.249602 6.263918e-009
30000 18/33 32.775810 2.472117e-009
45000 21/39 91.229385 2.840234e-010
50000 21/39 108.202294 2.661223e-010
4 3000 95/190 2.137214 9.497689e-006
5000 97/194 5.834437 9.048858e-006
30000 103/206 194.954450 8.891642e-006
45000 104/208 446.568463 9.350859e-006
50000 104/208 549.529123 9.856874e-006
5 3000 64/128 1.497610 9.111464e-006
5000 65/130 4.102826 9.525878e-006
30000 70/140 132.117247 8.131796e-006
45000 70/140 297.868309 9.959279e-006
50000 71/142 374.964004 8.502923e-006
6 3000 1/2 0.031200 0.000000e+000
5000 1/2 0.062400 0.000000e+000
30000 1/2 1.918812 0.000000e+000
45000 1/2 4.258827 0.000000e+000
50000 1/2 5.194833 0.000000e+000
7 3000 35/71 0.842405 9.291878e-006
5000 34/69 2.121614 8.658237e-006
30000 30/61 58.391174 8.288490e-006
45000 29/59 135.627269 8.443996e-006
50000 29/58 153.801386 9.993530e-006
8 3000 0/1 0.015600 0.000000e+000
5000 0/1 0.046800 0.000000e+000
30000 0/1 1.326008 0.000000e+000
45000 0/1 2.917219 0.000000e+000
50000 0/1 3.510022 0.000000e+000

Table 4. Test results for PRP algorithm.

Function Dim NI/NG cputime GF
1 3000 58/220 2.043613 9.947840e-006
5000 24/97 2.496016 9.754454e-006
30000 29/141 109.668703 9.705424e-006
45000 13/66 118.108357 9.450575e-006
50000 10/51 112.383120 9.221806e-006
2 3000 48/95 1.138807 8.647042e-006
5000 46/91 2.932819 9.736889e-006
30000 41/81 78.733705 9.983531e-006
45000 40/79 181.709965 9.632281e-006
50000 40/79 212.832164 9.121412e-006
3 3000 11/12 0.171601 1.012266e-008
5000 11/12 0.530403 8.539532e-009
30000 23/38 39.749055 2.574915e-009
45000 26/44 100.542645 2.931611e-010
50000 26/44 123.864794 2.838473e-010
4 3000 104/208 2.246414 9.243312e-006
5000 106/212 6.193240 9.130520e-006
30000 113/226 219.821009 8.747379e-006
45000 114/228 487.908728 9.368026e-006
50000 114/228 611.976323 9.874918e-006
5 3000 35/53 0.561604 2.164559e-006
5000 35/53 1.716011 1.291210e-006
30000 35/53 55.926358 1.336971e-006
45000 33/49 116.361146 2.109293e-006
50000 33/49 147.452145 2.225071e-006
6 3000 1/2 0.031200 0.000000e+000
5000 1/2 0.062400 0.000000e+000
30000 1/2 1.965613 0.000000e+000
45000 1/2 4.290028 0.000000e+000
50000 1/2 5.257234 0.000000e+000
7 3000 40/80 0.904806 9.908999e-006
5000 39/78 2.386815 9.198351e-006
30000 34/68 66.440826 9.515010e-006
45000 33/66 140.026498 9.366998e-006
50000 33/66 173.597913 8.886013e-006
8 3000 0/1 0.015600 0.000000e+000
5000 0/1 0.031200 0.000000e+000
30000 0/1 1.279208 0.000000e+000
45000 0/1 2.808018 0.000000e+000
50000 0/1 3.432022 0.000000e+000

By Tables 3 and 4, we see that Algorithm 5.1 and the PRP algorithm are effective for solving the above eight problems.

We use the performance profile tool of Dolan and Moré [35] to analyze the numerical performance of the two algorithms when NI, NG and cputime are considered, which generates three figures.

Fig 5 shows that the numerical performance of Algorithm 5.1 is slightly better than that of the PRP algorithm when NI is considered. From Figs 6 and 7, it is easy to see that the numerical performance of Algorithm 5.1 is better than that of the PRP algorithm, because the PRP algorithm needs a larger value on the horizontal axis before all the problems are solved.

Fig 5. Performance profiles of the two algorithms (NI).


Fig 6. Performance profiles of the two algorithms (NG).


Fig 7. Performance profiles of the two algorithms (cputime).


From the above two tables and three figures, we see that Algorithm 5.1 is effective and competitive for solving large-scale nonlinear equations.

Conclusion

(i) The first new algorithm, based on the first modified PRP conjugate gradient method, is presented in Sections 1–4. The $\beta_k$ formula of this method uses both gradient and function value information. The global convergence of the algorithm is established under some suitable conditions. The trust region property and the sufficient descent property of the method are proved without the use of any line search method. For some test functions, the numerical results indicate that the first algorithm is effective and competitive for solving unconstrained optimization problems.

(ii) The second new algorithm, based on the second modified PRP conjugate gradient method, is presented in Sections 5–7. The new algorithm has global convergence under suitable conditions. The trust region property and the sufficient descent property of the method are proved without the use of any line search method. Numerical results for some test functions are presented; they show that the second algorithm is very effective for solving large-scale nonlinear equations.

Acknowledgments

This work is supported by China NSF (Grant No. 11261006 and 11161003), NSFC No. 61232016, NSFC No. U1405254, the Guangxi Science Fund for Distinguished Young Scholars (No. 2015GXNSFGA139001) and PAPD issue of Jiangsu advantages discipline. The authors wish to thank the editor and the referees for their useful suggestions and comments, which greatly improved this paper.

Data Availability

All data are available in the paper.

Funding Statement

This work is supported by China NSF (Grant No. 11261006 and 11161003), NSFC No. 61232016, NSFC No. U1405254, the Guangxi Science Fund for Distinguished Young Scholars (No. 2015GXNSFGA139001) and PAPD issue of Jiangsu advantages discipline.

References

  • 1. Gu B, Sheng V, Feasibility and finite convergence analysis for accurate on-line v-support vector learning, IEEE Transactions on Neural Networks and Learning Systems. 24 (2013) 1304–1315. 10.1109/TNNLS.2013.2250300 [DOI] [PubMed] [Google Scholar]
  • 2. Li J, Li X, Yang B, Sun X, Segmentation-based Image Copy-move Forgery Detection Scheme, IEEE Transactions on Information Forensics and Security. 10 (2015) 507–518. 10.1109/TIFS.2014.2381872 [DOI] [Google Scholar]
  • 3. Wen X, Shao L, Fang W, Xue Y, Efficient Feature Selection and Classification for Vehicle Detection, IEEE Transactions on Circuits and Systems for Video Technology. 2015, 10.1109/TCSVT.2014.2358031 [DOI] [Google Scholar]
  • 4. Zhang H, Wu J, Nguyen TM, Sun M, Synthetic Aperture Radar Image Segmentation by Modified Student’s t-Mixture Model, IEEE Transaction on Geoscience and Remote Sensing. 52 (2014) 4391–4403. 10.1109/TGRS.2013.2281854 [DOI] [Google Scholar]
  • 5. Fu Z, Achieving Efficient Cloud Search Services: Multi-keyword Ranked Search over Encrypted Cloud Data Supporting Parallel Computing, IEICE Transactions on Communications. E98-B (2015) 190–200. 10.1587/transcom.E98.B.190 [DOI] [Google Scholar]
  • 6. Dai Y, Yuan Y, A nonlinear conjugate gradient with a strong global convergence properties, SIAM J. Optim. 10 (2000) 177–182. 10.1137/S1052623497318992 [DOI] [Google Scholar]
  • 7. Fletcher R, Reeves C, Function minimization by conjugate gradients, Comput. J. 7 (1964) 149–154. 10.1093/comjnl/7.2.149 [DOI] [Google Scholar]
  • 8. Polak E, Ribière G, Note sur la convergence de directions conjugees, Rev. Fran. Inf. Rech. Opérat. 3 (1969) 35–43. [Google Scholar]
  • 9. Polyak BT, The conjugate gradient method in extreme problems, USSR Comput. Math. Math.Phys. 9 (1969) 94–112. 10.1016/0041-5553(69)90035-4 [DOI] [Google Scholar]
  • 10. Hestenes MR, Stiefel EL, Methods of conjugate gradients for solving linear systems, J. Res. Natl.Bur. Stand. Sect. B. 49 (1952) 409–432. 10.6028/jres.049.044 [DOI] [Google Scholar]
  • 11. Liu Y, Storey C, Efficient generalized conjugate gradient algorithms part 1: theory. J. Comput. Appl.Math. 69 (1992) 17–41. [Google Scholar]
  • 12. Fletcher R, Practical method of optimization, vol I: unconstrained optimization, 2nd edn Wiley,New York, 1997. [Google Scholar]
  • 13. Powell MJD, Nonconvex minimization calculations and the conjugate gradient method, Lecture Notes in Mathematics, vol. 1066, Spinger, Berlin, 1984, pp. 122–141. [Google Scholar]
  • 14. Gilbert JC, Nocedal J, Global convergence properties of conjugate gradient methods for optimization. SIAM J. Optim. 2 (1992) 21–42. 10.1137/0802003 [DOI] [Google Scholar]
  • 15.Dai Y, Analyses of conjugate gradient methods, PH.D.thesis, Institute of Computational Mathematics and Scientific/Engineering Computing, Chinese Academy of Sciences (in Chinese), 1997.
  • 16. Ahmed T, Storey D, Efficient hybrid conjugate gradient techniques, Journal of Optimization Theory and Applications. 64 (1990) 379–394. 10.1007/BF00939455 [DOI] [Google Scholar]
  • 17. Al-Baali A, Descent property and global convergence of the Flecher-Reeves method with inexact line search, IMA Journal of Numerical Analysis. 5 (1985) 121–124. 10.1093/imanum/5.1.121 [DOI] [Google Scholar]
  • 18. Hu Y F, Storey C, Global convergence result for conjugate gradient methods, Journal of Optimization Theory and Applications. 71 (1991) 399–405. 10.1007/BF00939927 [DOI] [Google Scholar]
  • 19. Yuan G, Wei Z, Zhao Q, A modified Polak-Ribière-Polyak conjugate gradient algorithm for large-scale optimization problems, IIE Transactions. 46 (2014), 397–413. 10.1080/0740817X.2012.726757 [DOI] [Google Scholar]
  • 20. Yuan G, Zhang M, A modified Hestenes-Stiefel conjugate gradient algorithm for large-scale optimization, Numerical Functional Analysis and Optimization. 34 (2013), 914–937. 10.1080/01630563.2013.777350 [DOI] [Google Scholar]
  • 21. Yuan G, Lu X, Wei Z, A conjugate gradient method with descent direction for unconstrained optimization, Journal of Computational and Applied Mathematics, 233 (2009), 519–530. 10.1016/j.cam.2009.08.001 [DOI] [Google Scholar]
  • 22. Yuan G, Lu X, A modified PRP conjugate gradient method, Annals of Operations Research. 166 (2009),73–90. 10.1007/s10479-008-0420-4 [DOI] [Google Scholar]
  • 23. Li X, Zhao X, A hybrid conjugate gradient method for optimization problems, Natural Science. 3(2011): 85–90. 10.4236/ns.2011.31012 [DOI] [Google Scholar]
  • 24. Yuan G, Modified nonlinear conjugate gradient methods with sufficient descent property for large-scale optimization problems, Optimization Letters. 3 (2009) 11–21. 10.1007/s11590-008-0086-5 [DOI] [Google Scholar]
  • 25. Huang H, Lin S, A modified Wei-Yao-Liu conjugate gradient method for unconstrained optimization, Applied Mathematics and Computation. 231 (2014) 179–186. 10.1016/j.amc.2014.01.012 [DOI] [Google Scholar]
  • 26. Yu G, Zhao Y, Wei Z, A descent nonlinear conjugate gradient method for large-scale unconstrained optimization, Applied mathematics and computation. 187 (2) (2007) 636–643. 10.1016/j.amc.2006.08.087 [DOI] [Google Scholar]
  • 27. Yao S, Wei Z, Huang H, A note about WYL’s conjugate gradient method and its applications, Applied Mathematics and computation. 191 (2) (2007) 381–388. 10.1016/j.amc.2007.02.094 [DOI] [Google Scholar]
  • 28. Hager WW, Zhang H, A new conjugate gradient method with guaranteed descent and an efficient line search, SIAM Journal on Optimization. 16(1) (2005) 170–192. 10.1137/030601880 [DOI] [Google Scholar]
  • 29. Wei Z, Yao S, Liu L, The convergence properties of some new conjugate gradient methods, Applied Mathematics and Computation. 183 (2006) 1341–1350. 10.1016/j.amc.2006.05.150 [DOI] [Google Scholar]
  • 30. Zhang L, An improved Wei-Yao-Liu nonlinear conjugate gradient method for optimization computation, Applied Mathematics and computation. 215 (6) (2009) 2269–2274. 10.1016/j.amc.2009.08.016 [DOI] [Google Scholar]
  • 31. Wei Z, Yu G, Yuan G, Lian Z, The superlinear convergence of a modified BFGS-type method for unconstrained optimization, Comput. Optim. Appl. 29 (2004) 315–332. 10.1023/B:COAP.0000044184.25410.39 [DOI] [Google Scholar]
  • 32. Yuan G, Wei Z, Convergence analysis of a modified BFGS method on convex minimizations, Computational Optimization and Applications. 47 (2) (2010) 237–255. 10.1007/s10589-008-9219-0 [DOI] [Google Scholar]
  • 33. Li M, Qu A, Some sufficient descent conjugate gradient methods and their global convergence, Computational and Applied Mathematics. 33 (2) (2014) 333–347. 10.1007/s40314-013-0064-0 [DOI] [Google Scholar]
  • 34. Zoutendijk Z, Nonlinear programming computational methods In: Abadie J. (ed.) Integer and nonlinear programming, North-Holland, Amsterdam, 1970, 37–86. [Google Scholar]
  • 35. Dolan ED, Moré JJ, Benchmarking optimization software with performance profiles, Math.Program. 91 (2002) 201–213. 10.1007/s101070100263 [DOI] [Google Scholar]
  • 36. Buhmiler S, Krejic´ N, Luzanin Z, Practical quasi-Newton algorithms for singular nonlinear systems, Numer. Algorithms 55 (2010) 481–502. 10.1007/s11075-010-9367-z [DOI] [Google Scholar]
  • 37. Gu G, Li D, Qi L, Zhou S, Descent directions of quasi-Newton methods for symmetric nonlinear equations, SIAMJ.Numer.Anal. 40 (2002) 1763–1774. 10.1137/S0036142901397423 [DOI] [Google Scholar]
  • 38. La Cruz W, Martínez. M, Raydan M. Spectral residual method without gradient information for solving large-scale nonlinear systems of equations, Math. Comp. 75 (2006) 1429–1448. 10.1090/S0025-5718-06-01840-0 [DOI] [Google Scholar]
  • 39. Kanzow C, Yamashita N, Fukushima M, Levenberg-Marquardt methods for constrained nonlinear equations with strong local convergence properties, J. Comput. Appl. Math. 172 (2004) 375–397. 10.1016/j.cam.2004.02.013 [DOI] [Google Scholar]
  • 40. Zhang J, Wang Y, A new trust region method for nonlinear equations, Math. Methods Oper. Res,58 (2003) 283–298. 10.1007/s001860300302 [DOI] [Google Scholar]
  • 41. Grippo L, Sciandrone M, Nonmonotone derivative-free methods for nonlinear equations, Comput. Optim. Appl. 37 (2007) 297–328. 10.1007/s10589-007-9028-x [DOI] [Google Scholar]
  • 42. Cheng W, A PRP type method for systems of monotone equations, Math. Comput. Modelling. 50 (2009) 15–20. 10.1016/j.mcm.2009.04.007 [DOI] [Google Scholar]
  • 43. La Cruz W, Raydan M, Nonmonotone spectral methods for large-scale nonlinear systems, Optim. Methods Softw. 18 (2003) 583–599. 10.1080/10556780310001610493 [DOI] [Google Scholar]
  • 44. Raydan M, The Barzilai and Borwein gradient method for the large scale unconstrained minimization problem, SIAM J. Optim. 7 (1997) 26–33. 10.1137/S1052623494266365 [DOI] [Google Scholar]
  • 45. Yu G, Guan L, Chen W, Spectral conjugate gradient methods with sufficient descent property for large-scale unconstraned optimization, Optim.Methods Softw. 23 (2) (2008) 275–293. 10.1080/10556780701661344 [DOI] [Google Scholar]
  • 46. Yuan G, Wei Z, Lu S, Limited memory BFGS method with backtracking for symmetric nonlinear equations, Math. Comput. Modelling. 54 (2011) 367–377. 10.1016/j.mcm.2011.02.021 [DOI] [Google Scholar]
  • 47. Li D, Fukushima M, A global and superlinear convergent Gauss-Newton -based BFGS method for symmetric nonlinear equations, SIAMJ.Numer.Anal. 37 (1999) 152–172. 10.1137/S0036142998335704 [DOI] [Google Scholar]
  • 48. Li Q, Li D, A class of derivative-free methods for large-scale nonlinear monotone equations, IMA J. Numer. Anal. 31 (2011) 1625–1635. 10.1093/imanum/drq015 [DOI] [Google Scholar]
  • 49. Solodov MV, Svaiter BF, A hybrid projection-proximal point algorithm, J. Convex Anal.6 (1999) 59–70. [Google Scholar]
  • 50. Yuan G, Zhang M, A three-terms Polak-Ribière-Polyak conjugate gradient algorithm for large-scale nonlinear equations, Journal of Computational and Applied Mathematics. 286 (2015) 186–195. 10.1016/j.cam.2015.03.014 [DOI] [Google Scholar]
  • 51. Solodov MV, Svaiter BF, A globally convergent inexact Newton method for systems of monotone equations, in: Fukushima M., Qi L. (Eds.),Reformulation:Nonsmooth, Piecewise Smooth, Semismooth and Smoothing Methods, Kluwer Academic Publishers; 1998, pp. 355–369. [Google Scholar]
  • 52. Gomez-Ruggiero M, Martinez J, Moretti A, Comparing algorithms for solving sparse nonlinear systems of equations, SIAM J. Sci. Comput. 23 (1992)459–483. 10.1137/0913025 [DOI] [Google Scholar]
  • 53. Moré J, Garbow B, Hillström K, Testing unconstrained optimization software, ACM Trans. Math. Softw. 7 (1981) 17–41. [Google Scholar]
  • 54. Roberts SM, Shipman JJ, On the closed form solution of Troesch’s problem, J. Comput. Phys. 21 (1976) 291–304. 10.1016/0021-9991(76)90026-7 [DOI] [Google Scholar]


