• Nenhum resultado encontrado

A Jacobi-type Newton method for Nash equilibrium problems with descent guarantees

N/A
N/A
Protected

Academic year: 2022

Share "A Jacobi-type Newton method for Nash equilibrium problems with descent guarantees"

Copied!
24
0
0

Texto

(1)

A Jacobi-type Newton method for Nash equilibrium problems with descent guarantees

L. F. Bueno

G. Haeser

O. Kolossoski

September 22, 2022

Abstract

A common strategy for solving an unconstrained two-player Nash equilibrium problem with continuous variables is applying Newton’s method to the system of nonlinear equations obtained by the corresponding first-order necessary optimality conditions. However, when taking into account the game dynamics, it is not clear what is the goal of each player when considering that they are taking their current decision following Newton’s iterates. In this paper we provide an interpretation for Newton’s iterate in view of the game dynamics as follows: instead of minimizing the quadratic approximation of their objective function param- eterized by the other player current decision (as a typical Jacobi-type strategy), we show that the Newton iterate follows this approach but with the objective function parameterized by a prediction of the other player action, considering that they are following the same New- tonian strategy. This interpretation allows us to present a new Newtonian algorithm where a backtracking procedure is introduced in order to guarantee that the computed Newtonian directions, for each player, are descent directions for their corresponding parameterized func- tions. Thus, besides favoring global convergence, our algorithm also favors true minimizers instead of maximizers or saddle points, differently from the standard Newton method, which does not consider the minimization structure of the problem in the non-convex case. Thus, our method is more robust in comparison with other Jacobi-type strategies or the pure Newtonian approach, which is corroborated by our illustrative numerical experiments. We also present a proof of the well-definiteness of the algorithm under some standard assumptions, together with a thorough analysis of its convergence properties taking into account the game dynamics.

Keywords: Nash equilibrium, Newtonian method, Jacobi-type methods, Non-convex games, Game dynamics.

1 Introduction

The Nash Equilibrium Problem (NEP), introduced in [Nash (1951)], models an economic game where each player aims at minimizing their own objective function, but the decision taken by each player influences the payoff of the other players. This is a fundamental problem in economics and so- cial behavior theory that has been studied extensively. See, for instance, [Arrow and Debreu (1954)];

[Myerson (1991)]; [Aubin (1993)]. For surveys on the subject, see [Facchinei and Kanzow (2007)];

[Fischer et.al. (2014)]. In this paper we consider the unconstrained two-player NEP given by

Institute of Science and Technology, Federal University of S˜ao Paulo, S˜ao Jos´e dos Campos, SP, Brazil.

([email protected])

Department of Applied Mathematics, University of S˜ao Paulo, S˜ao Paulo, SP, Brazil. ([email protected])

Department of Applied Mathematics, University of ao Paulo, ao Paulo, SP, Brazil.

([email protected])

(2)

Minimize f1(x1, x2),

x1∈Rn1 (1)

and

Minimize f2(x1, x2),

x2∈Rn2 (2)

wheref1, f2:Rn1×Rn2 →Rare (possibly non-convex) twice continuously differentiable functions that describe each player’s objective to be minimized, parameterized by the other player’s decision.

A (local) solution to the NEP (1-2) is a point (x1, x2)∈Rn1×Rn2such thatx1is a (local) minimizer for (1) withx2=x2, andx2is a (local) minimizer for (2) with x1=x1.

In order to solve the NEP (1-2), some studies deal with a reformulation of the problem, such as a

variational inequality problem or as a complementarity problem, see for instance [Majig and Fukushima (2008)];

[Facchinei and Pang (2003)]. One problem with this approach is that several solutions of the re- formulated problem may not be solutions of the associated equilibrium problem. Another possi- bility is to reformulate the NEP as an optimization problem via the Nikaido-Isoda function (see [Nikaido and Isoda (1955)]), which transforms the NEP into a minimax problem. A penalty update scheme was also proposed in the works of [Fukushima and Pang (1998)]; [Facchinei and Kanzow (2009)].

Those indirect approaches are somewhat computationally demanding, as they require solving an optimization problem at each step of the algorithm. In order to avoid such heavy computations, the method proposed in this paper aims at solving the system (1-2) without rewriting it as an indirect problem.

One may also consider partitioning methods, namely Jacobi- and Gauss-Seidel-type splitting methods (see [Facchinei and Kanzow (2007)]; [Yuan (2011)]) that deal with problem (1-2) by it- erating the idea of fixing the variables of one player and taking a step towards minimizing the problem of the other player. In particular, a standard Jacobi strategy would fix preliminary de- cisions for each player, and then iteratively update them simultaneously by solving a model for the corresponding optimization problem parameterized by the fixed decision of the other player.

Alternatively, so-called best response methods fix the decision of each player as the best possible response, see [Caruso et.al. (2018)] and [Caruso et.al. (2020)]. Those methods also require solving an optimization problem at each step for finding the best response for the other player before solving the corresponding optimization problem for one player, which may also be computationally expensive. However, the advantage of theses approaches is that they are able to treat problem (1-2) in its original form, without losing information due to reformulations.

The game structure plays a key role when the optimization problems involved are non-convex.

Several important applications result in such problems, including NEPs (see [Chui and Pang (2021)]).

Moreover, the dynamics of the algorithms, namely, the way that the iterates are computed, may capture the true dynamics of the game, reflecting the way a competitive game actually unfolds in practice. That is, we suspect that economic systems behave in practice as a dynamical system, where each player considers the current decision of the other players and a model for predicting their future behavior, before optimizing their strategy. We expect that Jacobi-type methods should capture this behavior, predicting accurately the final outcome of a game, even if no equilibrium is reached. In fact, differently from what is usual in an algorithm for solving a single optimization problem, where we expect and hope to build a sequence converging to a solution of the problem, in a NEP viewed as such dynamical system, the dynamics may converge to an equilibrium state, but other interesting phenomena may be observed such as orbiting, cycling or divergence. The reader should have this observation in mind when considering what type of theoretical property should be expected by an algorithm aiming to capture such dynamics.

In this paper we introduce a method based on a Jacobi-type approach within a Newtonian framework. The main novelty of our approach is that instead of taking a decision based on the

(3)

current decision of the opponent, our Newton model is built around a prediction of the opponent step. This should reflect the reality of equilibrium games in practice, where each player tries to make the best decision based on what is the expected response of the other player. The algorithm proposed in this paper is posed in such a way that each player aims at decreasing their objective function with respect to the predicted decision of the other player. This avoids the heavy com- putations of best response methods, while still reflecting the dynamics of equilibrium games in practice.

Additionally, we note that our method considers the true minimization structure of the system (1-2) in a Newtonian framework, instead of considering only the system of equations given by the necessary optimality conditions associated with each problem. The idea is that one should favor minimizers in place of maximizers or saddle points when considering the solutions of this system of equations, which is usually not considered in other Newtonian algorithms for solving non-convex instances of (1-2).

In a related work, a Jacobi-type approach using quadratic approximations of players’ objective functions was considered in [Yuan (2011)]. The proposed algorithm consists in a trust region method which does not directly use the Hessian information on its iterative step; however, it relies on some strong assumptions on the true Hessians that we do not rely on in our work. Additionally, for each player, the method in [Yuan (2011)] is based on the current response of the other players, rather than on a predicted one.

The paper is organized as follows: In Section 2 we introduce our Jacobi-type algorithm in a Newtonian framework. In Section 3 we establish some notations and summarize the overall assumptions to be considered throughout the paper, together with some preliminary results. In Section 4 we establish the well-definiteness of our algorithm. In Section 5 we state some theoretical properties of the algorithm, establishing convergence to stationarity under certain conditions. In section 6 we illustrate the behavior and effectiveness of the proposed method with some numerical experiments. Finally, some concluding remarks and future prospective works are given in Section 7.

Notation: Given a twice continuously differentiable function f : Rn1+n2 → R, (x1, x2)7→f(x1, x2), fori= 1,2 we denote by∇xif(x) its partial gradient with respect to variable xievaluated atx. The partial Hessian with respect to variablesxiandxj evaluated atxis denoted by∇2xixjf(x), fori, j= 1,2. We use∥ · ∥to denote the 2-norm of vectors and matrices, whileuTv denotes the canonical inner product of vectorsuand v of the same dimensions. We denote byN the set of positive integers.

2 A Descent Newton Algorithm for the Two-Player Nash Equilibrium Problem

Recall that a solution (x1, x2) to the equilibrium problem (1-2) satisfies the following first-order necessary optimality condition:

x1f1(x1, x2)

x2f2(x1, x2)

!

= 0

0

!

. (3)

Most algorithms for solving the NEP (1-2) rely on approaching directly the nonlinear system of equations (3), disregarding the minimization structure of problem (1-2). This poses no issues whenf1andf2are convex with respect to their decision variables, however, in the presence of non- convexities, the algorithm may find local maximizers or saddle points as often as local minimizers.

Our goal in this paper is to propose an algorithm that takes into account the minimization structure

(4)

of the problem in order to find local minimizers more often, thus better reflecting the behavior of two-player games in practice.

Given a current iterate (xk1, xk2), the classical Newtonian step (dk1, dk2) for solving (3) is given by solving the following linear system:

2x1x1f1(xk1, xk2) ∇2x1x2f1(xk1, xk2)

2x

1x2f2(xk1, xk2) ∇2x

2x2f2(xk1, xk2)

! dk1 dk2

=− ∇x1f1(xk1, xk2)

x2f2(xk1, xk2)

!

. (4)

It is well known that, for optimization problems, applying Newton’s method to the correspond- ing first-order necessary optimality conditions can be better interpreted as minimizing a quadratic approximation of the objective function. With this interpretation, we can extend Newton’s method to non-convex problems, and we can also provide elements for globalization schemes. Two well- stablished strategies for doing this are the use of trust regions and line search techniques, ensuring progress near the current point. The first one guarantees the existence of a minimizer for the quadratic approximation of the objective function by establishing a compact domain for the sub- problem. In line search approaches, one should use some positive definite approximation for the Hessian of the objective function to obtain a coercive model and descent directions. In this case, one must carefully choose a convenient step-size along the Newton direction.

In order to exploit the minimization structure of the NEP (1-2), we shall consider an idea which resembles a standard line-search globalization described above. Given an approximate solution (xk1, xk2) at some iterationk, the most natural step to be made by the first player would be associated with minimizing, with respect to variable d1, a simplified model for the functionf1(xk1+d1, xk2).

Namely, the first player would select its response by computing Minimize ϕ1(xk1+d1, xk2),

d1

whereϕ1 is an approximation of f1.

Similarly, the second player would seek to minimize at the same time a model for the function f2(xk1, xk2+d2) with respect to d2, leading to

Minimize ϕ2(xk1, xk2+d2).

d2

This approach is related to Jacobi’s method and its theoretical investigation has been conducted in [Yuan (2011)] under a trust-region globalization. The main concern that we want to address is to base the minimization model for each player on a prediction of the other player’s action, rather than on its actual decision. Thus, the main contribution of this work is considering the case when the direction dk1, the decision of the first player, is obtained by a Newton step with respect to a predicted behavior of the second player. Namely, given xk2 +dk2, a predicted decision for player two, the problem we aim to solve for player one is:

Minimize ϕ1(xk1+d1, xk2+dk2).

d1 (5)

Inspired by a Newtonian appeal, let us consider the second-order Taylor approximation off1, which leads to the following convex quadratic function:

f1(xk1+d1, xk2+dk2)≈f1(xk1, xk2+dk2) +∇x1f1(xk1, xk2+dk2)Td1+1

2dT1H1kd1, (6) where H1k is a positive definite approximation for the Hessian ∇2x1x1f1(xk1, xk2 +dk2). Since dk2 needs to be found together withdk1, suppose that the same approach is done simultaneously by the

(5)

other player in order to make their decisiondk2. Then, given (xk1, xk2), we would still have a Nash equilibrium problem in the variables (d1, d2), however, the resulting NEP is simpler to solve, since the objective functions are parameterized convex quadratics in the corresponding decision variables.

But, since the term ∇x1f1(xk1, xk2+dk2)Td1 might still combine the variables in a non-linear way, the approximated problem might still be hard to solve. Thinking in eliminating this inconvenient with a typical idea from Newton’s method, we use linear approximations of the gradients in the following way:

x1f1(xk1, xk2+dk2)≈ ∇x1f1(xk1, xk2) +∇2x1x2f1(xk1, xk2)dk2. (7) In this way, our approximated problem (5) is now clearly defined by combining approximations (6) and (7), which has a simple structure and its solution is readily given by solving the linear system:

H1kd1+∇2x1x2f1(xk1, xk2)dk2+∇x1f1(xk1, xk2) = 0.

Repeating the approach for the second player, with a positive definite H2k ≈ ∇2x2x2f2(xk1+ dk1, xk2), our iteration is based on the solution of the following linear system:

H1k2x

1x2f1(xk1, xk2)

2x1x2f2(xk1, xk2) H2k

! d1 d2

=− ∇x1f1(xk1, xk2)

x2f2(xk1, xk2)

!

. (8)

Given the equivalence with the standard Newtonian system (4), it is natural to chooseH1k

2x1x1f1(xk1, xk2) and H2k ≈ ∇2x2x2f2(xk1, xk2). Thus, we have associated the standard Newtonian system (4) with the original NEP, where every player makes its decision considering the predicted decision of the other player. It now seems natural to consider descent conditions associated with the predicted functionsf1(x1, xk2+dk2) andf2(xk1+dk1, x2). However, we cannot guarantee that the resulting directions from (8) are descent directions for these functions, as is shown in the following example.

Example 2.1 Consider the equilibrium problem where f1(x1, x2) := 12x21+ (x22+ 2x2+ 1)x1 and f2(x1, x2) := 12(x2+ 2)2. In this case, we have

• ∇x1f1(x1, x2) =x1+x22+ 2x2+ 1,∇2x1x1f1(x1, x2) = 1,∇2x1x2f1(x1, x2) = 2x2+ 2

• ∇x2f2(x1, x2) =x2+ 2,∇2x2x2f2(x2, x2) = 1 and∇2x1x2f2(x2, x2) = 0.

Taking(xk1, xk2) := (0,0), the Newton direction (dk1, dk2)obtained from (8)is the solution of:

1 2 0 1

! d1 d2

=− 1

2

! .

Sodk1= 3 anddk2=−2. Note that∇x1f1(xk1, xk2) =∇x1f1(xk1, xk2+dk2) = 1 and thereforedk1 is not a descent direction forf1(x1, xk2+dk2)nor for f1(x1, xk2) atxk1.

This means that we do not expect the descent conditionf1(xk1+tdk1, xk2+dk2)< f1(xk1, xk2+dk2) to hold even for arbitrarily small t. Therefore, an algorithm that requires an Armijo-like line search could be not well-defined. The reason for not obtaining a descent direction when minimizing the convex quadratic approximation of the function comes from the fact that we did not use its true gradient. That is, the approximation of ∇x1f1(xk1, xk2+dk2) in (7) might not be accurate enough when ∥dk2∥ is large. To improve this approximation, we should consider also taking small steps along the direction dk2. In this way, the line search should be done simultaneously forf1 and f2, considering a simultaneous backtracking in dk1 and dk2. Thus, replacingdk2 by tdk2 in the previous

(6)

derivation of (8), we arrive at the following system of equations for computing (d1, d2), given a tentative step lengtht:

H1k t∇2x1x2f1(xk1, xk2) t∇2x1x2f2(xk1, xk2) H2k

! d1 d2

=− ∇x1f1(xk1, xk2)

x2f2(xk1, xk2)

! .

Note that for t >0 sufficiently small, we expect the solution (d1, d2) to be such thatd1 is a descent direction for problem (1) while d2 is a descent direction for problem (2) considering the predicted decision for the other player. Hence, our method favors local minimizers for solving problem (1-2). Notice, however, that the solution (d1, d2) depends on the choice of the step length t, thus, once a step length is rejected by the descent condition, a new direction (d1, d2) must be recomputed, similarly to a trust region approach. Finally, once the descent condition is met for some t, the new iterate is defined as (xk+11 , xk+12 ) := (xk1, xk2) +t(d1, d2).

Our algorithm selects the stepsizes based on a standard Armijo-type condition, namely, checking whether the decrease along a component (with the other component fixed at the predicted point) is proportional to what is predicted by the first order approximation of the function. In mathematical terms, for someα∈(0,1), we perform the check

f1(xk1+td1, xk2+td2)≤f1(xk1, xk2+td2) +αt∇x1f1(xk1, xk2+td2)Td1, f2(xk1+td1, xk2+td2)≤f2(xk1+td1, xk2) +αt∇x2f2(xk1+td1, xk2)Td2,

decreasing tand recomputing the directions whenever one of these inequalities do not hold.

Before formally presenting the algorithm, let us discuss some other conditions needed in the line search procedure. Differently from optimization problems, since the directions are not necessarily descent directions whentis large, we must ensure a negative slope over the search direction when computing the step-size t. In doing so, we are lead to also avoiding directions that are near orthogonal to the gradient of the predicted function. Additionally, in order to avoid stagnation when far from stationary points, we also demand that the direction is not too small compared with the corresponding gradient of the predicted function.

Mathematically, forθ∈(0,1) andγ >0, our line search checks also if

x1f1(xk1, xk2+td2)Td1≤ −θ∥∇x1f1(xk1, xk2+td2)∥ · ∥d1∥,

∥d1∥ ≥γ∥∇x1f1(xk1, xk2+td2)∥, (9) with a similar requirement for the second player. Notice that, in optimization, similar inequalities are standard requirements on the direction for guaranteeing global convergence of a descent method using an Armijo line search.

Guided by all these ideas, we formalize below the algorithm for computing a sequence{(xk1, xk2)}k∈N

for solving the NEP (1-2), where we note that the algorithm includes a safeguarding strategy that replaces the mixed hessians with zeroes whenever stationarity for the corresponding player has been reached andtis small enough, which will be explained later.

Algorithm 1Jacobi-type descent Newton algorithm

Step 0. Given (x01, x02)∈Rn1×Rn2, letα∈(0,1),τ ∈(0,1],θ >0, γ >0 and setk:= 0.

Step 1. Compute gk1 :=∇x1f1(xk1, xk2) and g2k :=∇x2f2(xk1, xk2), choose symmetric positive defi- nite matricesH1k andH2k, and sett:= 1.

(7)

Step 2. Set

M1:=

2x1x2f1(xk1, xk2), if∥g1k∥>0 ort > τ,

0, otherwise;

M2:=

2x1x2f2(xk1, xk2), if∥g2k∥>0 ort > τ,

0, otherwise.

(10)

Step 3. Check if the matrix

"

H1k tM1

tM2 H2k

#

is non-singular. If not, sett:=t/2 and repeat Step 3.

Step 4. Find (d1, d2) by solving the linear system H1k tM1

tM2 H2k

! d1 d2

=− g1k g2k

!

. (11)

Step 5. Check the inequalities

f1(xk1+td1, xk2+td2)≤f1(xk1, xk2+td2) +αt∇x1f1(xk1, xk2+td2)Td1, (12)

x1f1(xk1, xk2+td2)Td1≤ −θ∥∇x1f1(xk1, xk2+td2)∥ · ∥d1∥, (13) γ∥∇x1f1(xk1, xk2+td2)∥ · ∥gk1∥ ≤ ∥d1∥ · ∥gk1∥; (14) f2(xk1+td1, xk2+td2)≤f2(xk1+td1, xk2) +αt∇x2f2(xk1+td1, xk2)Td2, (15)

x2f2(xk1+td1, xk2)Td2≤ −θ∥∇x2f2(xk1+td1, xk2)∥ · ∥d2∥, (16) γ∥∇x2f2(xk1+td1, xk2)∥ · ∥gk2∥ ≤ ∥d2∥ · ∥gk2∥. (17) If one of the inequalities (12)-(17) do not hold, sett:=t/2 and go to Step 2.

Step 6. Set tk := t, dk1 := d1, dk2 := d2 and update xk+11 :=xk1+tkdk1, xk+12 := xk2+tkdk2, and k:=k+ 1. Go to Step 1.

Notice that for finding the direction (d1, d2), we use the linear system (11), which fori= 1,2 replaces the mixed Hessian matrixt∇2x1x2fi(xk1, xk2), used in the previous discussion when deducing (8), with zeroes whenevergki = 0 andt≤τ. This is done in order to forcedito be zero if stationarity is already reached for playeriand the step size is sufficiently small. As a consequence, a standard Newton step is set for the other player. The reason for doing this is that in these situations it may not be possible to obtain a function decrease for player i. We also replaced the aforementioned condition (9), which controls the ratio between gradient and direction, by (14). Both conditions are the same when g1k ̸= 0, but using (14) ensures that when stationarity is reached for the first player, inequality (14) is automatically satisfied. A similar situation occurs for the second player with respect to condition (17).

However, takingdki = 0 may not be the most adequate strategy, even ifgik= 0, since stationarity for player i may be lost after a step is made for the other player. Thus, this is done only as a safeguarding procedure after the stepsize is at least as small as a threshold valueτ ∈(0,1]. Notice that when τ := 1, the safeguarding procedure is always activated whenever gik = 0, while when τ <1, it is activated only after rejecting the Newtonian step at least once. This is done in order to try using the Newtonian direction as much as possible due to its fast local properties.

(8)

3 General Assumptions

The assumptions stated in this section are supposed to hold throughout the paper without specific mentioning. Before stating them, we establish some notation that shall be used in the sequel. We denote

gk :=

g1k g2k

=

x1f1(xk1, xk2)

x2f2(xk1, xk2)

, Ht,k :=

H1k tM1 tM2 H2k

,

dk :=

dk1 dk2

, d:=

d1 d2

, and xk :=

xk1 xk2

.

Additionally, for each player i = 1,2, we denote the index of the other player by ¬i. The general assumptions that we use in all our analysis are the following:

Assumption A There exists CG > 0 such that max{∥∇xifi(x)∥,∥∇x¬ifi(x)∥} ≤ CG for all x∈Rn1+n2 andi∈ {1,2}.

Assumption B There existsL >0 such that ∇xifi(·, x¬i) areL-Lipschitz continuous for every fixedx¬i andi∈ {1,2}. As a consequence, we have that

fi(xi+tdi, x¬i)≤fi(x) +t∇xifi(x)Tdi+t2L 2 ∥di2 for every x= (xi, x¬i)∈Rni+n¬i, di∈Rni, andt >0.

Assumption C There exists CH > 0 such that ∥∇2x1x2fi(x)∥ ≤ CH for all x ∈ Rn1+n2 and i∈ {1,2}.

Assumption D There existsCR>0 and functionsr1 andr2such that fori∈ {1,2}and for all x= (xi, x¬i)∈Rni+n¬i, alld¬i∈Rn¬i andt >0 it holds

xifi(xi, x¬i+td¬i) =∇xifi(x) +t∇2x1x2fi(x)d¬i+ri(t, x, d¬i), (18) and

∥ri(t, x, d¬i)∥ ≤CRt2∥d¬i2.

Assumption E There existλmax≥λmin>0 such that for alli∈ {1,2}andk∈Nthe eigenvalues ofHik lie in [λmin, λmax]. As a consequence, for allxi∈Rni, we have that

∥Hik∥ ≤λmax, ∥(Hik)−1∥ ≤ 1 λmin

, and λmin∥xi2≤xTiHikxi≤λmax∥xi2, i∈ {1,2}.

We highlight that all Assumptions A to D are simple assumptions on the smoothness and bound- edness of the derivatives of the functionsf1andf2, while Assumption E can be easily guaranteed when choosing the matrices Hik.

We end this section with a few results that arise from Assumptions A-E. The first result guarantees uniform boundedness of the matricesHt,k andH−1t,k for small enough values oft.

Lemma 3.1 For everyk∈Nandt∈(0,1], the matrix Ht,k satisfies

∥Ht,k∥ ≤µmax:=

q

λ2max+ 4λmaxCH+CH2. (19) Moreover, if t≤ λ2min

maxCH, thenHt,k is non-singular and∥Ht,k−1∥ ≤µmin, where the constantµmin

is given by

µmin :=

√2 λmin

. (20)

(9)

Proof:

By the definition ofHt,k, for a vector z= (z1, z2)∈Rn1×Rn2, we have that Ht,kz=

H1k tM1

tM2 H2k

· z1

z2

=

H1kz1+tM1z2

tM2z1+H2kz2

.

Therefore, we can write

∥Ht,kz∥2= (H1kz1+tM1z2)T(H1kz1+tM1z2) + (tM2z1+H2kz2)T(tM2z1+H2kz2)

=∥H1kz12+ 2tz1TH1kM1z2+∥tM1z22+∥H2kz22+ 2tz2TH2kM2z1+∥tM2z12. (21) Now, by the definition ofMi on (10), we see that∥Mi∥ ≤ ∥∇2xix¬ifi(xk)∥, i= 1,2. Hence, by the Cauchy-Schwarz inequality, we have that

∥Ht,kz∥2≤ ∥H1kz12+ 2t∥H1k∥ · ∥∇2x1x2f1(xk)∥ · ∥z1∥ · ∥z2∥+t2∥∇2x1x2f1(xk)∥2· ∥z22 +∥H2kz22+ 2t∥H2k∥ · ∥∇2x

2x1f2(xk)∥ · ∥z1∥ · ∥z2∥+t2∥∇2x

2x1f2(xk)∥2· ∥z12. Using Assumptions C and E, and the fact thatt≤1, we have that

∥Ht,kz∥2≤λ2max∥z12+ 2tλmaxCH· ∥z1∥ · ∥z2∥+t2CH2 · ∥z222max∥z22+ 2tλmaxCH· ∥z1∥ · ∥z2∥+t2CH2∥z12

≤λ2max ∥z12+∥z22

+ 4λmaxCH· ∥z1∥ · ∥z2∥+CH2 ∥z12+∥z22

= λ2max+CH2

∥z12+∥z22

+ 4λmaxCH∥z1∥ · ∥z2∥.

Hence, observing that∥zi∥ ≤ ∥z∥ fori= 1,2, we have that

∥Ht,kz∥2≤ λ2max+ 4λmaxCH+CH2

∥z∥2,

and thus (19) holds. To prove the bound on ∥Ht,k∥, recalling (21), we have that

∥Ht,kz∥2=∥H1kz12+ 2tzT1H1kM1z2+∥tM1z22+∥H2kz22+ 2tz2TH2kM2z1+∥tM2z12

≥λ2min∥z12−2tλmax∥M1∥ · ∥z1∥ · ∥z2∥+∥tM1z222min∥z22

−2tλmax∥M2∥ · ∥z1∥ · ∥z2∥+∥tM2z12.

Since∥tMiz¬i2≥0,∥Mi∥ ≤CH,∥z∥2=∥z12+∥z22, and∥zi∥ ≤ ∥z∥, we have

∥Ht,kz∥2≥ λ2min−4tλmaxCH

∥z2∥.

Thus, fort≤ λ2min

maxCH, we have∥Ht,kz∥2λ2min2 ∥z∥2, soHt,k is non-singular and∥H−1t,k∥ ≤

2 λmin. 2

The next result guarantees boundedness of the computed directionsdfor sufficiently smallt.

Lemma 3.2 For everyk ∈N, if t ≤ λ2min

maxCH then∥d∥ ≤ 2CGµmin =:Cd, where µmin is given by (20).

Proof: By the system equation (11) and using Lemma 3.1, we have that

∥d∥=∥H−1t,kgk∥ ≤ ∥H−1t,k∥ ·(∥gk1∥+∥g2k∥)≤2CGµmin.

2 The next result shows that for each component i= 1,2, the gradientsgik are comparable withdi provided thattis bounded by a fraction of∥gki∥.

(10)

Lemma 3.3 For everyk∈N,i= 1,2, ift satisfies t≤min

λ2min

maxCH, ∥gik∥ 2CHCd

, (22)

whereCd is given as in Lemma 3.2, then 1

2∥gik∥ ≤ ∥Hikdi∥ ≤ 3

2∥gik∥. (23)

Proof: We shall prove the result for i = 1, being the result analogous for i = 2. By the first equation of system (11), the definition of M1, and Assumption C, together with Lemma 3.2, we have that

∥H1kd1∥=∥ −tM1d2−g1k∥ ≤tCHCd+∥g1k∥.

Thus, from (22), we conclude the right-hand side inequality of (23). Similarly, to prove the left-hand side inequality in (23), considering again the first equation of system (11), we have

∥gk1∥=∥ −H1kd1−tM1d2∥ ≤ ∥H1kd1∥+tCHCd.Thus, the result follows from (22). 2

4 Well Definiteness of the Algorithm

This Section is devoted to proving well-definiteness of Algorithm 1. As previously mentioned, by the way Algorithm 1 is built, in cases where one of the gradients is zero, say ∥g2k∥ = 0, and the step-sizet has sufficiently decreased (t≤τ), the direction for the first player becomes a standard Newton direction for optimization with Armijo line search, which is known to be well-defined and satisfy conditions (12)-(14) with t ≤ 2(1−α)λ 2min

max , θ = λλmin

max and γ = λ1

max (see for instance the discussion in [Bertsekas (2016)], pages 29-36, and Lemma 2.20 in [Izmailov and Solodov (2014)]).

Thus, we focus on the case where ∥gik∥ ̸= 0, i= 1,2, and we show that the inequalities (12)-(17) are satisfied for sufficiently smallt.

In order to prove that Algorithm 1 is well-defined, we are going to prove that each pair of inequalities in (12)-(17) hold for t sufficiently small, meaning that the stepsize does not decrease indefinitely. We begin with the gradient/direction ratio inequalities (14) and (17). However, we show a stronger version bounding the directions by the gradient on the points (xki, xk¬i+td¬i) from both above and below.

Lemma 4.1 For everyk∈Nandi= 1,2, if t satisfies t≤min

∥gik∥ 8CHCd

, λ2minmaxCH

, (24)

whereCd is given as in Lemma 3.2, then it holds γ

xifi(xki, xk¬i+td¬i)

≤ ∥di∥ ≤β

xifi(xki, xk¬i+td¬i)

, (25) for every γ andβ such that

γ≤ 2 3λmax

, β ≥ 2 λmin

. (26)

Proof: We prove the result only fori= 1, as the proof fori= 2 is similar. By (11), we have that

−g1k=H1kd1+tM1d2. (27)

(11)

Also, we have that

x1f1(xk1, xk2+td2) =gk1+t Z 1

0

2x1x2f1(xk1, xk2+ξtd2)dξ

d2. (28)

Combining (27) and (28) we obtain

x1f1(xk1, xk2+td2) =−H1kd1+t

−M1+ Z 1

0

2x

1x2f1(xk1, xk2+ξtd2)dξ

d2. (29) By the triangular inequality, we have that

∥∇x1f1(xk1, xk2+td2)∥ ≥

∥H1kd1∥ −t

−M1+ Z 1

0

2x1x2f1(xk1, xk2+ξtd2)dξ

d2

. (30) and

∥∇x1f1(xk1, xk2+td2)∥ ≤ ∥H1kd1∥+t

−M1+ Z 1

0

2x1x2f1(xk1, xk2+ξtd2)dξ

d2

. (31) Next, we prove that for tsufficiently small it holds

1

2∥H1kd1∥ ≤ ∥∇x1f1(xk1, xk2+td2)∥ ≤ 3

2∥H1kd1∥. (32)

Let us suppose that

−M1+ Z 1

0

2x1x2f1(xk1, xk2+ξtd2)dξ

d2̸= 0, since otherwise this is trivial from (29). Hence, if we selecttsatisfying

t≤ ∥H1kd1

2

−M1+ Z 1

0

2x1x2f1(xk1, xk2+ξtd2)dξ

d2

, (33)

we see from (30) that the left-hand side inequality in (32) holds and, by (31), we obtain the right-hand side inequality. Now, using thatλmin∥d1∥ ≤ ∥H1kd1∥ ≤λmax∥d1∥, (32) implies that

λmin

2 ∥d1∥ ≤ ∥∇x1f1(xk1, xk2+td2)∥ ≤ 3λmax 2 ∥d1∥, which implies the desired bounds in (25).

Finally, let us show thatt satisfying (24) implies the bound (33). By Assumption C and by Lemma 3.2, we see that

−M1+ Z 1

0

2x1x2f1(xk1, xk2+ξtd2)dξ

d2

∥M1∥+

Z 1

0

2x1x2f1(xk1, xk2+ξtd2)dξ

· ∥d2

∥M1∥+ Z 1

0

∥∇2x1x2f1(xk1, xk2+ξtd2)∥dξ

· ∥d2∥ ≤2CHCd, therefore, we have that

∥H1kd1

4CHCd ≤ ∥H1kd1

2

−M1+ Z 1

0

2x1x2f1(xk1, xk2+ξtd2)dξ

d2 ,

Referências

Documentos relacionados

Abstract: As in ancient architecture of Greece and Rome there was an interconnection between picturesque and monumental forms of arts, in antique period in the architecture

Ousasse apontar algumas hipóteses para a solução desse problema público a partir do exposto dos autores usados como base para fundamentação teórica, da análise dos dados

A necessidade de associada à formação haver também maior partilha de informação no seio da equipa foi referida ainda por dois enfermeiros, verbalizando que “É necessária

Immobilization has long been used to improve enzyme activity and stability in aqueous media, and, more recently, also in nonaqueous media (Bickerstaff, 1997; Bornscheuer, 2003),

Os objetivos específicos são divididos em cinco, sendo: estudar os conceitos de cultura da convergência, da conexão e do audiovisual e sua relação com o objeto de estudo;

The probability of attending school four our group of interest in this region increased by 6.5 percentage points after the expansion of the Bolsa Família program in 2007 and

Ao longo dos anos, a degradação ambiental vem sendo acelerada por diversas atividades antrópicas, que além de afetar a conservação da fauna, flora e solo, a

Correlation between the free gossypol content of the international cottonseed meal samples determined by the American Oil Chemists’ Society (AOCS) [19] official method and the