Universidade de Aveiro
2010
Departamento de Matemática
Cristiana João
Soares da Silva
Regularização e Pontos Conjugados Bang-bang no
Controlo Óptimo
Regularization and Bang-bang Conjugate Times in
Optimal Control
Universidade de Aveiro
2010
Departamento de Matemática
Cristiana João
Soares da Silva
Regularização e Pontos Conjugados Bang-bang no
Controlo Óptimo
Regularization and Bang-bang Conjugate Times in
Optimal Control
Tese apresentada à Universidade de Aveiro para cumprimento dos requisitos
necessários à obtenção do grau de Doutor em Matemática, realizada sob a
orientação científica do Doutor Delfim Fernando Marado Torres, Professor
Associado do Departamento de Matemática da Universidade de Aveiro, e sob a
co-orientação científica do Doutor Emmanuel Trélat, Professor no Laboratório
MAPMO (Matemática e Aplicações, Física Matemática de Orléans) da
Universidade de Orléans, França.
Thesis submitted to the University of Aveiro in fulfilment of the requirements for
the degree of Doctor of Philosophy in Mathematics, under the supervision of Dr.
Delfim Fernando Marado Torres, Associate Professor at the Department of
Mathematics of the University of Aveiro, and co-supervision of Dr. Emmanuel
Trélat, Professor at the Laboratory MAPMO (Mathematic and Applications,
Mathematical Physics of Orléans) of the University of Orléans, France.
Apoio financeiro da FTC (Fundação
para a Ciência e Tecnologia) através
da
bolsa
de
doutoramento
com
referência SFRH/BD/27272/2006.
o júri
presidente
Prof. Doutor Manuel João Senos Matias
Professor Catedrático da Universidade de Aveiro
Prof. Doutor Bernard Bonnard
Professor Catedrático do Institut de Mathématiques de Bourgogne da Université de Bourgogne,
França
Prof. Doutor Emmanuel Trélat
Professor Catedrático do Laboratoire MAPMO da Université d’Orléans, França (co-orientador)
Prof. Doutor Vasile Staicu
Professor Catedrático da Universidade de Aveiro
Prof. Doutora Maria do Rosário Marques Fernandes Teixeira de Pinho
Professora Associada da Faculdade de Engenharia da Universidade do Porto
Prof. Doutora Laura Poggiolini
Investigadora Universitária do Dipartimento di Matemática Applicata da Universititá di Firenze,
Itália
Prof. Doutor Delfim Fernando Marado Torres
Professor Associado da Universidade de Aveiro (orientador)
Prof. Doutor Thomas Haberkorn
agradecimentos
acknowledgements
I would like to acknowledge my two supervisors Dr. Delfim Torres and Dr.
Emmanuel Trélat for accepting me as their student. Their constant support,
their trust and confidence made me fell lucky for having them as supervisors.
Their enthusiastic way of being and scientific skills were the source of my
inspiration. I am very grateful for the interesting young and senior researchers
whom I met during my PhD. I thank both my supervisors for always welcome
me with a smile, they are undoubtedly the persons that I will remember with
more affection of the years of my thesis.
I thank
The Portuguese Foundation for Science and Technology
(FCT) for the
financial support through the PhD fellowship SFRH/BD/27272/2006,
without
which this work wouldn't be possible.
I am grateful to the director of the Department of Mathematic of the University
of Aveiro Dr. João Santos, and the Director of MAPMO of the University of
Orléans, Dr. Stéphane Cordier for the excellent conditions in both laboratories.
A special acknowledge to the secretaries of MAPMO for their kindness.
I acknowledge Dr. Laura Poggiolini and Dr. Gianna Stefani for the important
remarks and interesting ideas, and I thank especially Laura for her attention
and support.
Je remercie mes collègues de bureau du MAPMO, en particulier à Chi,
Guillaume et Roland. Je remercie la patience et la
compréhension qu’ils ont
toujours eu avec moi, les moments de detente pendant les pauses café et les
soirées amusantes. Je remercie Bassirou, Jradeh et Radouen pour la
sympathie et bonne humeur.
Agradeço a todas as pessoas que tão bem me acolheram no Departamento de
Matemática da Universidade de Aveiro durante os meses da tese que passei
em Portugal.
Agradeço aos meus amigos. Em especial, ao Cromo (João Pedro) por nunca
ter deixado de me ir telefonando, por ter continuado a ser meu amigo como se
estivéssemos a apenas alguns quilómetros de distância. Agradeço à Inês, à
Silvia e à Zélia pela amizade e atenção. Agradeço a todos aqueles que se
ligaram ao messenger para falarem comigo e me fazerem um pouco de
companhia!
Agradeço ao Daniel pelo carinho e apoio incondicional, pela atenção diária e
infinita paciência. Muito obrigada por nunca teres deixado que me sentisse
sozinha.
Agradeço à minha família. Aos meus pais, pela preocupação que sempre
tiveram para que tudo corresse bem. Aos meus irmãos e às suas famílias, por
serem a minha família.
palavras-chave
Controlo óptimo, princípio do máximo de Pontryagin, problemas de tempo
mínimo, controlos bang-bang, procedimentos de regularização, método de tiro
simples, tempos conjugados.
resumo
Consideramos o problema de controlo óptimo de tempo mínimo para sistemas
de controlo mono-entrada e controlo afim num espaço de dimensão finita com
condições inicial e final fixas, onde o controlo escalar toma valores num
intervalo fechado. Quando aplicamos o método de tiro a este problema, vários
obstáculos podem surgir uma vez que a função de tiro não é diferenciável
quando o controlo é
bang-bang
. No caso
bang-bang
os tempos conjugados
são teoricamente bem definidos para este tipo de sistemas de controlo,
contudo os algoritmos computacionais directos disponíveis são de difícil
aplicação. Por outro lado, no caso suave o conceito teórico e prático de
tempos conjugados é bem conhecido, e ferramentas computacionais eficazes
estão disponíveis.
Propomos um procedimento de regularização para o qual as soluções do
problema de tempo mínimo correspondente dependem de um parâmetro real
positivo suficientemente pequeno e são definidas por funções suaves em
relação à variável tempo, facilitando a aplicação do método de tiro simples.
Provamos, sob hipóteses convenientes, a convergência forte das soluções do
problema regularizado para a solução do problema inicial, quando o parâmetro
real tende para zero. A determinação de tempos conjugados das trajectórias
localmente óptimas do problema regularizado enquadra-se na teoria suave
conhecida. Provamos, sob hipóteses adequadas, a convergência do primeiro
tempo conjugado do problema regularizado para o primeiro tempo conjugado
do problema inicial
bang-bang
, quando o parâmetro real tende para zero.
Consequentemente, obtemos um algoritmo eficiente para a computação de
tempos conjugados no caso
bang-bang
.
keywords
Optimal control, Pontryagin maximum principle, minimal time problems,
bang-bang controls, regularization procedures, single shooting methods, conjugate
times.
abstract
In this thesis we consider a minimal time control problem for single-input
control-affine systems in finite dimension with fixed initial and final conditions,
where the scalar control take values on a closed interval. When applying a
shooting method for solving this problem, one may encounter numerical
obstacles due to the fact that the shooting function is non smooth whenever the
control is bang-bang. For these systems a theoretical concept of conjugate time
has been defined in the bang-bang case, however direct algorithms of
computation are difficult to apply. Besides, theoretical and practical issues for
conjugate time theory are well known in the smooth case, and efficient
implementation tools are available.
We propose a regularization procedure for which the solutions of the
minimal time problem depend on a small enough real positive parameter and
are defined by smooth functions with respect to the time variable, facilitating the
application of a single shooting method. Under appropriate assumptions, we
prove a strong convergence result of the solutions of the regularized problem
towards the solution of the initial problem, when the real parameter tends to
zero. The conjugate times computation of the locally optimal trajectories for the
regularized problem falls into the standard theory. We prove, under appropriate
assumptions, the convergence of the first conjugate time of the regularized
problem towards the first conjugate time of the initial bang-bang control
problem, when the real parameter tends to zero. As a byproduct, we obtain an
efficient algorithmic way to compute conjugate times in the bang-bang case.
mots-clés
Contrôle optimal, principe du maximum de Pontryagin, problème de temps
minimal, contrôle bang-bang, procédures de régularisation, méthode de tir
simple, temps conjugué.
résumé
On considère le problème de contrôle optimal de temps minimal pour des
systèmes affine et mono-entrée en dimension finie, avec conditions initiales et
finales fixées, où le contrôle scalaire prend ses valeurs dans un intervalle
fermé. Lors de l'application d'une méthode de tir pour résoudre ce problème,
on peut rencontrer des obstacles numériques car la fonction de tir n'est pas
lisse lorsque le contrôle est
bang-bang
. Pour ces systèmes, dans le cas
bang-bang, un concept théorique de temps conjugué a été défini, toutefois les
algorithmes de calcul direct sont difficiles à appliquer. En outre, les questions
théoriques et pratiques de la théorie du temps conjugué sont bien connues
dans le cas lisse, et des outils efficaces de mise en œuvre sont disponibles.
On propose une procédure de régularisation pour laquelle les solutions du
problème
de temps minimal dépendent d’un paramètre réel positif
suffisamment petit et sont définis par des fonctions lisses en temps, ce qui
facilite l’application de la méthode de tir simple. Sous des hypothèses
convenables, nous prouvons un résultat de convergence forte des solutions du
problème régularisé vers la solution du problème initial, lorsque le paramètre
réel tend vers zéro. Le calcul des temps conjugués pour les trajectoires
localement optimales du problème régularisé est standard. Nous prouvons,
sous des hypothèses appropriées, la convergence du premier temps conjugué
du problème régularisé vers le premier temps conjugué du problème de
contrôle
bang-bang
initial, quand le paramètre réel tend vers zéro. Ainsi, on
obtient une procédure algorithmique efficace pour calculer les temps conjugués
dans le cas
bang-bang
.
Introdu tion 1
1 Preliminarieson OptimalControl Theory 5
1.1 Introdu tion . . . 5
1.2 Short histori aloverview . . . 7
1.3 Linear optimal ontrol . . . 9
1.3.1 Statement ofthe problem . . . 9
1.3.2 Controllability: denitionand a essibleset . . . 11
1.3.3 Minimal time problem . . . 13
1.3.4 Example: optimal ontrol ofan harmoni os illator (linear ase). . . 15
1.4 Nonlinear optimal ontrol . . . 18
1.4.1 Statement ofthe problem . . . 18
1.4.2 End-point mapping . . . 19
1.4.3 A essibleset and ontrollability . . . 21
1.4.4 Singular ontrols . . . 23
1.4.5 Existen e ofoptimal traje tories . . . 23
1.4.6 Pontryagin maximum prin iple . . . 25
1.4.7 Example: optimal ontrol ofan harmoni os illator (nonlinear ase) . . 31
1.4.8 Example: Newton'sproblem ofminimalresistan e . . . 32
1.5 Proof ofthe Pontryagin maximum prin iple fora general minimaltime problem 44 1.5.1 Needle-like variations. . . 45
1.5.2 Coni Impli it Fun tion Theorem . . . 49
1.5.3 Lagrange multipliers and Pontryagin maximum prin iple . . . 50
1.6 Generalized ontrols . . . 51
1.6.1 Generalized ontrol denition . . . 52
1.6.2 Minimal time problem . . . 53
1.6.3 Variation ofgeneralized ontrols andPontryagin maximization ondition 53 2 Smooth regularization of bang-bang optimal ontrol problems 57 2.1 Introdu tion . . . 57
2.4 Regularization pro edure. . . 59
2.5 Convergen e results. . . 61
2.6 Examples . . . 71
2.6.1 The Rayleighminimaltime ontrol problem . . . 71
2.6.2 Minimal time optimal ontrol problem involvinga singular ar . . . 73
2.6.3 The harmoni os illator problem(linear ase) . . . 76
2.6.4 Minimal time ontrol ofa Vander Polos illator. . . 79
3 Asymptoti approa h on onjugate points for bang-bang ontrol problems 83 3.1 Introdu tion . . . 83
3.1.1 Statement ofthe problem . . . 84
3.1.2 Bang-bang Pontryagin extremals . . . 84
3.1.3 Se ond orderoptimality onditions andbang-bang onjugatetimes . . . 86
3.1.4 Regularization pro edure . . . 92
3.1.5 Conjugate timesin the smooth ase . . . 93
3.2 Convergen e results. . . 95
3.3 Examples . . . 101
3.3.1 First example: Rayleighminimal time ontrol problem . . . 101
3.3.2 Se ond example. . . 107
Con lusion and open problems 111 Appendix 113 A First and se ond order su ient optimality onditions in the normal ase 113 A.1 Example: Rayleighminimal time ontrol problem . . . 118
Referen es 121
In this thesis, we investigate the minimal time Optimal Control Problem (OCP) for
single-input ontrol-ane systems in IR
n
˙x = X(x) + u
1
Y
1
(x),
with xedinitial andnal times onditions
x(0) = ˆ
x
0
, x(t
f
) = ˆ
x
1
,
where
X
andY
1
aresmooth ve tor elds, and the ontrolu
1
is a measurable s alar fun tionsatisfying the onstraint
|u
1
(t)| ≤ 1 , ∀t ∈ [0, t
f
]
with
t
f
the nal time. We develop regularization pro edures in order to ompute smoothapproximations of the above bang-bang ontrol problem, and to ompute onjugatetimes.
Therst onjugatetimeofatraje tory
x(·)
isthetimeatwhi hitlosesitslo aloptimality.The denition and omputation of onjugate points are an important topi in the theory of
al ulusofvariations(seee.g.[13 ℄). In[99℄theinvestigationofthedenitionand omputation
of onjugate points for minimal time ontrol problems is based on the study of ne essary
and/or su ient se ond order onditions. In [110℄, the theory of envelopes and onjugate
points isusedfor the study ofthe stru ture oflo ally optimal bang-bang traje tories for the
problem (OCP) in IR
2
and IR
3
; these results were generalized in [60 ℄. In [81 ,100℄ rst and
se ond order su ient optimality onditions arederivedin termsof a quadrati form
Q
t
, fora minimal time ontrol problem with ontrol-ane systems. In [100 ℄
L
1
-lo al optimality is
onsidered andin[81℄strong lo aloptimality. In[5℄theauthors derivese ondordersu ient
onditions, under the same regularity assumptions as [81℄, for an optimal ontrol problem
in the Mayer form with xed nal time, with ontrol-ane systems and bang-bang optimal
ontrols. In [90 ℄ the authors proved the equivalen e of the se ond order su ient onditions
given in [81 ℄ with the onesgiven in [5℄. In [95 ℄ ananalogous quadrati formto the one in [5℄
is dened, but the su ient optimality onditions derived are valid for a stronger kind of
optimality(statelo aloptimality).
The ombination of ne essaryand su ient onditions for bang-bang extremals provided
forabang-bangextremaltraje tory
x(·)
andthequadrati formQ
t
ispositivedeniteon[0, t]
,then
x(·)
is lo ally optimal for problem (OCP)in theC
0
topology on
[0, t]
( [5 ,81,87,95 ℄).If we assume moreover, that
x(·)
has a unique extremal lift (up to a multipli ative s alar)(x(·), p(·), p
0
, u
1
(·))
, whi h is moreover normal(p
0
= −1)
and
x(·)
is lo ally optimal inC
0
topology forproblem(OCP)on
[0, t]
thenQ
t
isnonnegative([3 ℄). Undertheseassumptions,the times
t
,t > 0
, su h that the quadrati formQ
t
has a trivial kernel are isolated and anonly onsist of some swit hing times of the bang-bang extremal ontrol (see [5℄); the rst
onjugate time
t
c
of a bang-bang strong lo ally optimal traje toryx(·)
(starting fromx
ˆ
0
) isthen dened by
t
c
= sup{t | Q
t
is positive denite} = inf{t | Q
t
is indenite} .
The point
x(t
c
)
is alled the rst onjugate point ofthe traje toryx(·)
.Su ient optimality onditionsaredeveloped in[87 ℄ (seealso[113 ℄)basedonthe method
of hara teristi sand the theoryof extremalelds. Su ient optimality onditions aregiven
for embeddingareferen etraje toryinto alo aleldofbrokenextremals. In[1,4,5,95 ℄,using
Hamiltonian methods and the extremal eld theory, the authors onstru t, under ertain
onditions, a non-interse ting eldof state extremals that overs agiven extremaltraje tory
x(·)
. In[5,61,87℄the authors asso iate the o urren e ofa onjugate point with a fold pointof the ow of the extremal eld, that is, a so- alled overlap of the ow near the swit hing
surfa e.
The omputationof onjugatetimesinthebang-bang aseisdi ultinpra ti e. Inthelast
years workshave been developed on the numeri al implementation ofse ond order su ient
optimality onditions (see, e.g., [61,78,81 ℄ and referen es ited therein). These pro edures
allowthe hara terization ofthe rst onjugatetime,forbang-bangoptimal ontrol problems
with ontrol-anesystems,wheneveritexistsandisattainedata
j
th
swit hingtime. However,
inpra ti e,if
j
istoolargethenthenumeri al omputationmaybe omeverydi ult. Besides,theoreti al and pra ti al issuesfor onjugate time theory arewell knownin the smooth ase
(see e.g. [2,86 ℄),and e ient implementation tools areavailable (see[15 ℄).
The ontributions of this thesisarethe following.
We propose aregularization pro edure whi h permits to usethe e ient tools of
ompu-tation of onjugate timesin the smooth ase provided in [15℄ forthe omputation of the rst
onjugate time of the problem (OCP) . The regularization pro edure isthe following. Let
ε
beapositiverealparameterandlet
Y
2
, . . . , Y
m
bem − 1
arbitrarysmoothve toreldsonIRn
,
where
m ≥ 2
is an integer. We onsider the minimal time problem (OCP)ε
for the ontrolsystem
˙x
ε
(t) = X (x
ε
(t)) + u
ε
1
(t)Y
1
(x
ε
(t)) + ε
m
X
i=2
u
ε
i
(t)Y
i
(x
ε
(t)) ,
under the onstraint
m
X
i=1
(u
ε
i
(t))
2
≤ 1 ,
with the xedboundary onditions
x
ε
(0) = ˆ
x
0
,x
ε
(t
f
) = ˆ
x
1
of the initial problem(OCP).Inthe next theoremwe derive ni e onvergen e properties.
Theorem 0.0.1 ( f. Chapter 2 , p. 61). Assume that the problem (OCP)
1
has a unique
solution
x(·)
, dened on[0, t
f
]
, asso iated with a ontrolu
1
(·)
on[0, t
f
]
. Moreover, assumethat
x(·)
has a unique extremal lift (up to a multipli ative s alar), that is moreover normal,and denoted
(x(·), p(·), −1, u
1
(·))
.Then,under the assumption
Span{Y
i
| i = 1, . . . , m} =
IRn
, there exists
ε
0
> 0
su h that,for every
ε ∈ (0, ε
0
)
, the problem (OCP)ε
has at least one solutionx
ε
(·)
, dened on[0, t
ε
f
]
witht
ε
f
≤ t
f
, asso iated with a smooth ontrolu
ε
= (u
ε
1
, . . . , u
ε
m
)
satisfying the onstraintm
X
i=1
(u
ε
i
(t))
2
≤ 1
, every extremal lift of whi h is normal. Let(x
ε
(·), p
ε
(·), −1, u
ε
(·))
be su h a
normal extremal lift. Then,as
ε
tends to0
,• t
ε
f
onverges tot
f
;• x
ε
(·)
onverges uniformly tox(·)
, andp
ε
(·)
onverges uniformly to
p(·)
on[0, t
f
]
;• u
ε
1
(·)
onverges weaklytou
1
(·)
for the weakL
1
(0, t
f
)
topology.If the ontrol
u
1
(·)
is moreover bang-bang, i.e., if the ( ontinuous) swit hing fun tionϕ(t) =
hp(t), Y
1
(x(t))i
does not vanishon any subinterval of[0, t
f
]
, thenu
ε
1
(·)
onverges tou
1
(·)
andu
ε
i
(·)
,i = 2, . . . , m
, onverge to0
almost everywhere on[0, t
f
]
, and thus in parti ular for thestrong
L
1
(0, t
f
)
topology.We provide an example where the optimal ontrol of the initial system isnot bang-bang
(it hasa singularar ) andfor whi h the almosteverywhere onvergen e fails.
Amongthenumerousnumeri almethodsthatexisttosolveoptimal ontrol problems,the
shooting methods onsist in solving, via Newton-like methods, the two-point or multi-point
boundary value problem arising from the appli ation of the Pontryagin maximum prin iple.
Forthe minimaltime problem(OCP),optimal ontrolsmaybedis ontinuous, anditfollows
that the shooting fun tion is not smooth on IR
n
in general. A tually it maybe non
dieren-tiableonswit hingsurfa es. Thisimpliestwodi ultieswhenusingashootingmethod. First,
if one doesnot know a priori the stru ture of the optimal ontrol, then it may be very
di- ult to initialize properly the shooting method, and in general the iterates of the underlying
Newtonmethodwillbeunableto rossbarriersgenerated byswit hingsurfa es(seee.g.[71 ℄).
Se ond, the numeri al omputation of the shooting fun tion and of its dierential may be
1
ThisTheoremremainsvalidifwe onsider
x(0) ∈ M
0
andx(1) ∈ M
1
whereM
0
andM
1
aretwo ompa tsetsofIR
n
of the possible motivations of the regularization pro edure onsidered in this thesis. Indeed,
the shootingfun tionsrelatedto the smooth optimal ontrol problems (OCP)
ε
aresmooth.From Theorem 0.0.1 , under appropriate assumptions, the optimal ontrols of problem
(OCP)
ε
are smooth, therefore the omputation of asso iated onjugate pointsx
ε
(t
ε
c
)
fallsinto the standard smooth theory. Our next resultasserts the onvergen e,as
ε
tends to 0,oft
ε
c
towardsthe onjugate timet
c
of the initial bang-bang optimal ontrol problem.Theorem0.0.2( f.Chapter3,p.95 ). Assumethattheproblem
(
OCP)
hasauniquesolutionx(·)
, asso iated with a bang-bang ontrolu
1
(·)
, on a maximal intervalI
. Moreover, assumethat
x(·)
has a unique extremal lift(up toa multipli ative s alar), whi his moreover normal,and denoted by
(x(·), p(·), −1, u
1
(·))
. If the extremal(x(·), p(·), −1, u
1
(·))
satises, moreover,the stri t bang-bang Legendre ondition on
[0, t
c
]
, then the rst geometri onjugate timet
ε
c
onverges tothe rst onjugatetime
t
c
asε
tends to 0.Thisresultpermitstousetheavailablee ientimplementationpro eduresforthesmooth
ase, like for instan e the free pa kage COTCOT
2
(see [15℄), to ompute onjugate times in
the bang-bang ase. We laimthat when applyingthe smooth pro edures to the regularized
pro edure,itisnotneededto onsiderverysmallvaluesof
ε
toestimatetherst onjugatetimet
c
. Indeed, a onjugate time of a lo ally bang-bang traje tory an only o urat aswit hingtime and,under our assumptions,swit hingtimesareisolated. From Theorem0.0.2 ,the rst
geometri onjugatetime
t
ε
c
onvergestot
c
, whenε
tendto 0. Therefore,assoonasε
issmallenough so that
t
ε
c
is in a (not ne essarily so small) neighborhood of some swit hing timeτ
s
of the bang-bangtraje tory
x(·)
, thismeans thatthe bang-bang onjugate timet
c
isequal tothat swit hing time
τ
s
.Thisthesisis organized in the following way.
Intherst hapterwere allsomeimportantdenitionsandtheoremsoflinearand
nonlin-earoptimal ontroltheory. InChapter2weproposearegularizationpro edureforbang-bang
optimal ontrol problems with single-input ontrol-ane systems andprove, under
appropri-ate assumptions, onvergen e properties of the optimal solutions of the regularized problem
towardsthe solutions of the initial problem. These onvergen e results areillustrated in
sev-eral examples. InChapter3the regularizationpro edureintrodu edin Chapter2isusedand
we prove the onvergen e of the rst geometri onjugate time
t
ε
c
of the regularized problemto the rst onjugate time of
t
c
of the bang-bang optimaltraje tory, asε
tendsto 0. Severalexamples are provided where the onvergen e properties proved in Theorems 0.0.1 and 0.0.2
areillustrated. InAppendixAwere allrstandse ondordersu ient optimality onditions
proved in [78 81℄ andapply themto one of the examples onsidered in Chapter3.
2
Preliminaries on Optimal Control
Theory
1.1 Introdu tion
Inthis haptersomeimportantdenitionsandresultsofthe optimal ontrol theoryaregiven.
We start with general explanations of the main elements of an optimal ontrol problem and
give some motivations for the study of these problems. Se tion 1.2 gives a brief histori al
overviewoftheoptimal ontroltheory. In1.3wepresentsomeimportantresultsofthelinear
optimal ontrol theory and an example of a linear optimal ontrol problem. Some results
of the nonlinear optimal ontrol theory are presented in 1.4 together with two examples.
For both linear and nonlinear general optimal ontrol problems the Pontryagin maximum
prin iple is formulated and in 1.5 a proof of this theorem is given for a general nonlinear
minimaltimeoptimal ontrol problem,usingneedle-likevariationswhi hareneededto derive
themainresultofChapter2(Theorem2.5.1 ). In1.6wederivethemaximization onditionof
Pontryagin maximum prin iple for a minimal time problem using Gamkrelidze's generalized
ontrols.
All of us already tried, in some o asion, to keep in balan e a ball on a nger (i.e., solve
the problemofthe invertedpendulum). However itis mu h more di ult to keep in balan e
a double inverted pendulum, that is, a system omposed by two balls one over the other,
spe iallyif we lose our eyes. The ontrol theory allows to do it, ifwe dispose of a suitable
mathemati al modelthatdes ribes the physi al pro ess.
The main elements of an optimal ontrol problem are: the mathemati al model whi h
relates the state
x
to the input or ontrolu
by a dierential system; the initial point orstate
x
0
and a nal pointx
1
or targetS
; the output of the system whi h hara terize thepro ess, i.e., the state of the ontrolled obje t at ea h instant of time; a set of admissible
for the e ien y of ea h admissible ontrol; and the length of time
t
f
required to rea h theterminal state.
A ontrol system is a dynami alsystem, whi h evolves over time, on whi h we an work
through a ommand fun tion or a ontrol, and their origin is vast (me hani s, ele troni ,
biology, e onomy, et .). Someexamplesof ontrolsystemswhi h anbemodeledandtreated
by the theory of ontrol systems are: a omputer that allows the user to perform a series of
basi ommands;ane osystemonwhi hwe ana tpromotingaparti ularsituationtoa hieve
abalan e;nervetissuesforminganetwork ontrolledbythe brainpro essingthe stimulifrom
outside and having an ee t on the body; a robot performing a spe i task; a ar that we
an ommand with the a elerator,brake andwheel; a satelliteor a spa e raft.
The ontrol theory analyzes the properties of su h systems, with the aim of steering an
initial state to a ertain nal state, eventually respe ting ertain restri tions. The obje tive
an be also to stabilize the systemmaking it insensitive to some perturbations (stabilization
problem), orevento omputetheoptimalsolutionsfora ertainoptimization riteria(optimal
ontrol problem). For the onstru tion of the ontrol system model, we an make use of
dierential equations, fun tional integrals, nite dieren es, partialderivatives, et . For this
reason the ontroltheoryisthe inter onne tion ofmanymathemati al areas(see,e.g.,[21 ,38,
39,65 ,85,106℄).
The dynami s of a system dene the system possible transformations, o urring in time
in a deterministi or random way. An equation is given, or typi ally a systemof dierential
equations, relating the variables and modeling the dynami s of the system. The examples
alreadygivenshowthatthestru tureanddynami sofa ontrolsystemmayhaveverydierent
meanings. Inparti ular,the ontrolsystem anbedes ribedbydis rete, ontinuous,orhybrid
transformations or,more generally, on atime s ale or measure hain [43 ,45,72℄.
Consider a ontrolsystemwhose state ata givenmoment isrepresented byave tor. The
ontrolsarefun tionsorparameters,usuallysubje ttorestri tions,whi ha tonthesystemin
the formofoutsidefor es thatae t thedynami s. Giventhesystemofdierentialequations
whi hmodelsthedynami softhesystem,itisthenne essarytousetheavailableinformation
andfeaturesoftheproblemto onstru tthe appropriate ontrolsthatwillenableusto attain
our obje tive. For example, when we travel in our ar a ting a ordingly to the ode of the
road (at least this is advisable) and we onstru t the travel plan to rea h our destination,
therearesomerestri tionsonthe traje toryand/oronthe ontrols,whi hmustbetakeninto
onsideration.
A ontrol system is alled ontrollable if we an steer it (in a nite time) from a given
initial state to anynalstate. Kalman proved in 1949 an important resulton ontrollability
whi h hara terizes ontrollable linear ontrol systems of nite dimension (Theorem 1.3.9 ).
Fornonlinearsystemsthe ontrollabilityproblemismu hmoredi ultandremainsana tive
nal state minimizing or maximizing a spe i riteria. In this ase we are speaking about
an optimal ontrol problem. For example, a driver going from Lisbon to Porto may wish to
travel in minimal time, and in that ase he will take the highway and spend more money
and fuel. Another optimal ontrol problem is obtained if the driver hooses as a
rite-ria spend less money as possible. The solution to this problem implies to hose se ondary
roads, forfree, and hewill takea lotmore time to hisdestination (following the internet site
http://www.google.pt/maps hoosing the highway thedriver takes3h fromLisbon toPorto
and bythe se ondaryroads 6h45m).
Thetheoryofoptimal ontrolisofgreatimportan einaerospa eengineering,inparti ular
for ondu tion problems, aero-assisted transfer orbits, development of re overable laun hers
(thenan ial aspe thereisveryimportant)andproblemsofatmospheri reentry, su hasthe
famous proje t Mars Sample Return fromthe European Spa e Agen y (ESA)whi h onsists
in sending a spa e raft to Mars with the obje tive of bringing to Earth martian samples
(Figure 1.1 ).
Figure1.1: Optimal ontrol theoryhasanimportant role in the aeroespa ial engineering.
1.2 Short histori al overview
The al ulusofvariationswasbornintheseventeen enturywiththe ontributionofBernoulli,
Fermat, Leibniz and Newton. Somemathemati ians asH.J.Sussmann andJ.C. Willems
de-fend that the origin of optimal ontrol oin ides with the birth of al ulus of variations, in
1697,dateofthepubli ationofthesolutionofthebra histo hroneproblembythe
mathemati- ianJohannBernoulli[114 ℄. Thebra histo hroneproblem(inGreekbrakhistos, theshortest,
and hronos, time) was studied by Galileu in 1638. The aim was to determine the urve
between twopointsonaverti alplanethatis overedin theleasttime byaspherethatstarts
point
B
, under the a tion of onstant gravity and assuming no fri tion (optimal sliding, seeFigure 1.2). In ontrast to what ould be our intuitive rst answer, the shortest time path
Figure 1.2: Bra histo hrone problem.
between two points is not a straight line! Galileo believed (wrongly) that the required urve
wasanar ofa ir le,buthehadalreadynoti edthatthestraightlineisnotthe shortesttime
path. In1696,JeanBernoulliposedtheproblemasa hallengetothebestmathemati iansof
his time. JeanBernoulli himself foundthe solution, aswell ashis brother Ja ques Bernoulli,
Newton, Leibniz and the Marquis de l'Hopital. The solution is a y loid ar starting with a
verti altangent [64,114 ℄. Skateboardingramps, aswell asthe fastestde reasesofaqua-parks,
have the form of y loid (Figure1.3).
Figure1.3: Cy loidar slead to fastestde reases andmaximal adrenaline.
Some authors go further,remarking that Newton's problemof aerodynami al resistan e,
proposed and solved by Isaa Newton in 1686, in his Prin ipia Mathemati a, is a typi al
optimal ontrol problem(see 1.4.8 ande.g. [102 ,118℄).
Inmathemati s, optimal ontrol theoryemerged afterthe Se ond World War responding
to pra ti al needsof engineering,parti ularly in the eldof aeronauti s and ight dynami s.
The formalization of this theory raised several new questions. For example, the theory of
optimal ontrol motivated the introdu tion of new on epts for generalized solutions in the
theory ofdierential equationsand generated new resultson the existen eof traje tories.
maximum prin iple by L.S. Pontryagin (Figure 1.4 ) and his group of ollaborators in 1956:
V.G. Boltyanskii,R.V. Gamkrelidze andE.F.Mish henko [96℄.
Figure1.4: LevSemenovi hPontryagin (3/September/1908 3/May/1988)
Pontryaginandhisasso iatesintrodu ed animportantepoint: theygeneralizedthetheory
of al ulusofvariationsto urvesthattakevalueson losedsets (withboundary). Thetheory
of optimal ontrol is loselyrelatedto lassi al me hani s, inparti ular variational prin iples
(Fermat'sprin iple,Euler-Lagrange,et .). Infa tthemaximumprin ipleofPontryagin
gener-alizes thene essary onditionsofEuler-Lagrange andWeierstrass. Somestrengthsofthe new
theorywasthedis overyofthe dynami programming method, theintrodu tionoffun tional
analysis to the theory ofoptimal systems and the dis overy of linksbetween the solutions of
an optimal ontrol problem and the results on stability of Lyapunov theory[120 ,122℄. Later
ame the foundations of sto hasti ontrol and ltering in dynami systems, game theory,
ontrolofpartialdierentialequationsandhybrid ontrolsystems,whi haresomeamongthe
manyareas of urrentresear h[2,106 ℄.
1.3 Linear optimal ontrol
Theoptimal ontroltheoryismu hmoresimplewhenthe ontrolsystemunderstudyislinear.
The nonlinear optimal ontrol theory will be re alled in Se tion 1.4 . Even in our days the
linear ontrol theory is one of the areas more used in engineering and its appli ations (see
e.g. [8℄).
1.3.1 Statement of the problem
Let
M
n,p
(
IR)
denote the set of matri es withn
rows andp
olumns, with entries in IR. LetI
be an interval of IR;A, B, r
three lo ally integrable mappings onI
(A, B ∈ L
1
values respe tively in
M
n,n
(
IR)
,M
n,m
(
IR)
andM
n,1
(
IR)
. LetΩ
be a subset of IRm
, and let
x
0
∈
IRn
. We onsiderthe linear ontrol system
˙x(t) = A(t)x(t) + B(t)u(t) + r(t) ,
∀t ∈ I ,
x(0) = x
0
,
(1.1)
where the ontrols
u
are mensurable lo ally bounded mappings overI
, taking values on asubset
Ω ⊂
IRm
.
Theexisten etheoremforsolutionsofdierentialequationsensures(seee.g.[121 ,Chapter
11℄), for every ontrol
u
, the existen eof aunique, absolutely ontinuous, solutionx(·) : I →
IR
n
forthe system(1.1 ). Let
M (·) : I → M
n,n
(
IR)
bethe fundamental matrix solutionofthehomogeneous linear system
˙x(t) = A(t)x(t)
, dened by˙
M (t) = A(t)M (t)
,M (0) = Id
. Note that ifA(t) = A
is onstant overI
, thenM (t) = e
tA
. Therefore, the solution
x(·)
of system(1.1 ) asso iated to the ontrol
u
is given byx(t) = M (t)x
0
+
Z
t
0
M (t)M (s)
−1
(B(s)u(s) + r(s)) ds ,
for every
t ∈ I
.Thismappingdependsonthe ontrol
u
. Therefore,ifwe hangethe fun tionu
we obtaina dierent traje tory
t 7→ x(t)
in IRn
(see Figure1.5).
x
0
Figure 1.5: The traje tory solution of the ontrol system(1.1 ) dependson the hoi e of the
ontrol
u
.Inthis ontext,some questionsarisenaturally:
(i)Given apoint
x
1
∈
IRn
,istherea ontrol
u
su hthattheasso iatedtraje toryx
steersx
0
tox
1
in a nitetimet
f
? (see Figure1.6) Thisisthe ontrollability problem.x
0
x(t)
x
1
= x(t
f
)
steers
x
0
tox
1
and minimizes a given fun tionalC(u)
(See Figure 1.7). It is an optimalontrol problem. The fun tional
C(u)
is the optimization riteria, and we all it ost. Forexampleifthe ostisthetransfertimefrom
x
0
tox
1
,thenwehavethe so- alledminimaltimeproblem.
x
0
x
1
= x(t
f
)
Figure1.7: Optimal ontrol problem
1.3.2 Controllability: denition and a essible set
Consider the linear ontrol system(1.1) . In what follows we introdu e a very important set:
the a essible set,also alled attainable set or rea hable set (see e.g. [53 ,62℄).
Denition 1.3.1. Thesetof a essiblepointsfrom
x
0
in timeT > 0
isdenoted byA(x
0
, T )
and dened by
A(x
0
, T ) = {x
u
(T ) | u ∈ L
∞
([0, T ], Ω)} ,
where
x
u
(·)
isthe solution ofsystem(1.1) asso iatedto the ontrolu
.In otherwords,
A(x
0
, T )
is the set of endpointsof the solutions of (1.1 ) in timeT
, whenthe ontrol
u
varies(seeFigure 1.8 ). We setA(x
0
, 0) = {x
0
}
.x
0
A(x
0
, T )
Figure1.8: A essibleset.
In what follows some properties of the a essible set for linear ontrol systems are given
Theorem 1.3.2. Consider the linear ontrol system in IR
n
˙x(t) = A(t)x(t) + B(t)u(t) + r(t)
where
Ω ⊂
IRm
is ompa t. Let
T > 0
andx
0
∈
IRn
. Then for every
t ∈ [0, T ]
,A(x
0
, t)
isompa t, onvexand varies ontinuously with
t
in[0, T ]
.Corollary 1.3.3. Ifwenoteby
A
Ω
(x
0
, t)
thea essiblesetstartingatx
0
intimet
for ontrolstaking values in
Ω
, then we setA
Ω
(x
0
, t) = A
Conv(Ω)
(x
0
, t) ,
where
Conv(Ω)
is the onvex envelope ofΩ
. In parti ular, we haveA
∂Ω
(x
0
, t) = A
Ω
(x
0
, t)
,where
∂Ω
isthe boundary ofΩ
.Thislast result illustratesthe bang-bang prin iple(see Theorem 1.3.15 ). Infa t, in many
optimal ontrol problems the optimal ontrols take values always onthe boundary
∂Ω
oftheontrol onstraint set
Ω
.Remark 1.3.4. We observe that if
r = 0
andx
0
= 0
, then the solution of˙x = Ax + Bu
,x(0) = 0
, isgiven byx(t) = M (t)
Z
t
0
M (s)
−1
B(s)u(s)ds ,
and is linearwith respe tto
u
.Thisremark leadus to the following proposition.
Proposition 1.3.5. Suppose that
r = 0
,x
0
= 0
andΩ =
IRm
. Then,
1.
∀ t > 0 A(0, t)
isa ve torial subspa e of IRn
. Moreover,
2.
∀t
1
, t
2
,
s.t.0 < t
1
< t
2
,A(0, t
1
) ⊂ A(0, t
2
)
.Denition 1.3.6. The set
A(0) = ∪
t≥0
A(0, t)
is the set of a essible points (at any time)starting at the origin.
Corollary 1.3.7. The set
A(0)
isa ve torial subspa e of IRn
.
The ontrollability denitionfor linear ontrol systemsfollows.
Denition1.3.8.The ontrolsystem
˙x(t) = A(t)x(t)+B(t)u(t)+r(t)
issaidtobe ontrollablein time
T
ifA(x
0
, T ) =
IRn
, that is, for every
x
0
, x
1
∈
IRn
, there existsa ontrol
u
su h thatthe asso iated traje torysteers
x
0
tox
1
in timeT
(see Figure1.9 ).The following theorem give us a ne essary and su ient ondition for ontrollability, in
the ase where
A
andB
do not depend oft
and there are no onstraints on the ontrol(
u(t) ∈
IRm
x
0
x
1
Figure1.9: Controllability
Theorem 1.3.9 (Kalman ondition). Suppose that
Ω =
IRm
(no onstraintson the ontrol).
The system
˙x(t) = Ax(t) + Bu(t) + r(t)
is ontrollable in timeT
(arbitrary) ifandonlyif thematrix
C = (B, AB, · · · , A
n−1
B)
isof rank
n
.Thematrix
C
is alled the Kalmanmatrix.Remark 1.3.10. TheKalman onditiondoesnot depend on
T
neither onx
0
. In otherwords,ifanautonomouslinearsystemis ontrollable in time
T
starting atx
0
,then is ontrollable inanytime starting at any point.
In Theorem 1.3.9 no onstraint on the ontrol is onsidered. Thenext theorem is a
on-trollabilityresultwhen the ontrol is s alar, i.e.,
m = 1
, andu(t) ∈ Ω ⊂
IR.Theorem 1.3.11. Let
b ∈
IRn
and
Ω ⊂
IRan interval having0
in its interior. Consider thesystem
˙x(t) = Ax(t) + bu(t)
, withu(t) ∈ Ω
. Then every point of IRn
an be steered to the
originin nitetime ifandonlyifthe ouple
(A, b)
satises theKalman onditionandthe realpartof ea h eigenvalue of
A
isless or equal than zero.1.3.3 Minimal time problem
We start by formalizing, with the help of the a essible set
A(x
0
, t)
, the notion of minimaltime.
Consider the ontrol systemonIR
n
˙x(t) = A(t)x(t) + B(t)u(t) + r(t) ,
where the ontrols
u
takevaluesina ompa t setΩ ⊂
IRm
withnonemptyinterior. Let
x
0
, x
1
be two points of IR
n
. Suppose that
x
1
is a essible fromx
0
, i.e., suppose that there existsat least one traje tory steering
x
0
tox
1
. Between all the traje tories that steerx
0
tox
1
wewouldlike to hara terize the onethat doesit in minimaltime
ˆ
t
f
(seeFigure 1.10 ).x
0
x
1
= x(t
f
)
If
ˆ
t
f
is the minimal time, then for everyt < ˆ
t
f
,x
1
6∈ A(x
0
, t)
(in ee t, otherwisex
1
wouldbea essiblefrom
x
0
in atime smallerthant
ˆ
f
andˆ
t
f
wouldnot bethe minimaltime).Therefore,
ˆ
t
f
= inf{t > 0 | x
1
∈ A(x
0
, t)} .
(1.2)The valueof
ˆ
t
f
iswelldened be ause,from Theorem1.3.2 ,A(x
0
, t)
varies ontinuouslywitht
, thus{t > 0 | x
1
∈ A(x
0
, t)}
is losed inIR. Inparti ular theinmum in(1.2) isaminimum. The timet = ˆ
t
f
is the rstinstant su h thatA(x
0
, t)
ontainsx
1
(seeFigure 1.11 ).x
0
x
1
A(x
0
, t
f
)
A(x
0
, t)
Figure1.11: Minimaltime.
Onthe other hand,we have
x
1
∈ ∂ A(x
0
, ˆ
t
f
) = A(x
0
, ˆ
t
f
)\int A(x
0
, ˆ
t
f
) .
In fa t, if
x
1
belongs to the interior ofA(x
0
, ˆ
t
f
)
, then fort < ˆ
t
f
lose toˆ
t
f
,x
1
alsobelongs to
A(x
0
, t)
sin eA(x
0
, t)
varies ontinuouslywitht
. This ontradi ts the fa tthatˆ
t
f
is minimaltime.
The next theoremstates that if a minimal time problem with a linear ontrol systemin
IR
n
is ontrollable then it hasat leastonesolution.
Theorem 1.3.12. If the point
x
1
is a essible fromx
0
then there exists a minimal timetraje tory steering
x
0
tox
1
.Remark 1.3.13. We an also onsider the steering problem to a target that does not redu e
to a singlepoint. Therefore,let
(M
1
(t))
0≤t≤t
f
be afamily of ompa t subsetsof IRn
varying
ontinuously with
t
. As before, we see that if there exists a ontrolu
taking values inΩ
steering
x
0
toM
1
(t
f
)
, then there exists a minimal time ontrol dened on[0, ˆ
t
f
]
steeringx
0
to
M (ˆ
t
f
)
.Thisremark give us ageometri visionof the notionof minimum time andlead us tothe
Denition 1.3.14. The ontrol
u
is an extremal on[0, t]
if the traje tory of system (1.1 )asso iated to
u
satisesx(t) ∈ ∂ A(x
0
, t)
.Every minimaltime ontrol isan extremal. The onverse doesnot holdin general.
Optimality ondition: maximum prin iple in the linear ase
The nexttheoremgiveus ane essaryand su ient onditionin orderthatextremal ontrols
are alsooptimal ontrols.
Theorem 1.3.15. Consider the linear ontrol system
˙x(t) = A(t)x(t) + B(t)u(t) + r(t) ,
x(0) = x
0
,
where the domain of ontrol onstraints
Ω ⊂
IRm
is ompa t. Let
t
f
> 0
. The ontrolu
is an extremal on
[0, t
f
]
if and only if there exists a nontrivial solutionp(t)
of the equation˙p(t) = −p(t)A(t)
su h thatp(t)B(t)u(t) = max
w∈Ω
p(t)B(t)w
(1.3)
for every
t ∈ [0, t
f
]
. The rowve torp(t) ∈
IRn
is alled the adjoint ve tor.
Remark 1.3.16. Inthe aseof a s alar ontrol, and ifmoreover
Ω = [−a, a]
wherea > 0
, themaximization ondition(1.3)impliesimmediatelythat
u(t) = a signhp(t), B(t)i
. Thefun tionϕ(t) = hp(t), B(t)i
is alledaswit hingfun tion,andthetimet
s
atwhi htheextremal ontrolu(t)
hange itssign is alled a swit hing time. Itis, in parti ular, a root of the fun tionϕ
.The initial ondition
p(0)
depends onx
1
. As this ondition is not dire tly known, theappli ation ofTheorem 1.3.15 ismostlydone indire tly. Letus seean example.
1.3.4 Example: optimal ontrol of an harmoni os illator (linear ase)
Consider a pun tual mass
m
, for ed to move along an axis (Ox
), atta hed to a spring (seeFigure 1.12 ).
x
m
~ι
O
Figure1.12: AspringThe massis then drawn towards the origin by a for e thatis assumed equal to
−k
1
(x −
l) − k
2
(x − l)
3
, wherel
is the length of the spring at rest, andk
1
,k
2
are the oe ients ofstiness. We applyto thismasspoint an external horizontal for e
u(t)~l
. The laws ofphysi sgive us the motion equation
m¨
x(t) + k
1
(x(t) − l) + k
2
(x(t) − l)
3
= u(t) .
(1.4)Moreover we impose a onstraint onthe external for e,
|u(t)| ≤ 1 ,
∀ t .
This means we an not apply any external horizontal for e to the point mass: the external
for e an only take values on the interval
[−1, 1]
, ree ting the fa tthat our power of a tionis limited.
Assumethatthe initialpositionand velo ityoftheobje tare,respe tively,
x(0) = x
0
and˙x(0) = y
0
. The problem onsists in driving the point mass to the equilibrium positionx = l
in minimal time ontrolling the external for e
u(t)
that is applied to this obje t, and takinginto a ount the onstraint
|u(t)| ≤ 1
. The fun tionu
is the ontrol.Problem 1.3.17. Given the initial onditions
x(0) = x
0
and˙x(0) = y
0
, the goal is to nda fun tion
u(t)
whi h allows the movement of the point mass to its equilibrium position inminimal time.
Mathemati modeling
To simplify the presentation, we will suppose that
m = 1 kg
,k
1
= 1 N.m
−1
andl = 0 m
(wepass to
l = 0
by translation). The equation of motion (1.4 ) is equivalent to the ontrolleddierential system
˙x(t) = y(t)
˙y(t) = −x(t) − k
2
x(t)
3
+ u(t)
(1.5) withx(0) = x
0
and˙x(0) = y
0
.Writing (1.5 ) in matri ialnotation wehave
˙
X(t) = AX(t) + f (X(t)) + Bu(t) , X(0) = X
0
,
(1.6) whereA =
0
1
−1 0
!
, B =
0
1
!
,
X =
x
y
!
, X
0
=
x
0
y
0
!
, f (X) =
0
−k
2
x
3
!
.
In this se tion we are onsidering linear ontrol systems, therefore we xe
k
2
= 0
, andwe do not take into a ount the nonlinear onservative ee ts (in Se tion 1.4, we onsider
nonlinear ontrol systems andtake
k
2
6= 0
). Ifk
2
= 0
thenf (X) ≡ 0
andthe ontrol system(1.6 ) has the form of (1.1) (linear ontrol system). We wish to answer the two following
questions.
1. Is therealways,foranyinitial ondition
x(0) = x
0
and˙x(0) = y
0
,anhorizontalexteriorfor e(a ontrol) thatallows tomove,in nitetime
t
f
, the point massto itsequilibriumposition
x(t
f
) = 0
and˙x(t
f
) = 0
?2. Ifthe answer tothe rstquestionisarmative,whi histhefor e(whi histhe ontrol)
that minimizes the transfertime of thepoint massto its equilibriumposition?
System ontrollability
Our systemwrites in the form
˙
X = AX + Bu
X(0) = X
0
withA =
0
1
−1 0
!
andB =
0
1
!
. Wehave thenrank (B, AB) = rank
0 1
1 0
!
= 2
and the eigenvalues of
A
have zero real part. Therefore,from Theorem 1.3.11 , the systemisontrollable, that is, there exist ontrols
u
satisfying the onstraint|u(t)| ≤ 1
su h that theasso iated traje tories steer
X
0
to0. Weansweredarmatively to the rstquestion.Thisanswer orrespondstothephysi alinterpretationoftheproblem. Infa t,ifwedonot
apply an exteriorfor e, thatis, if
u = 0
then the motion equation isx + x = 0
¨
and the pointmass will ontinues to os illate, never stopping, in a nite time, at its equilibrium position.
Onthe otherhand,whenexteriorfor esareapplied,we tendto dampentheos illations. The
ontrol theorypredi ts thatwe an stopthe obje tin anite time.
Computation of the optimal ontrol
We knowthat thereexist ontrolsthatallow to steer the systemfrom
X
0
to0
in nite time.Nowwe wantto ompute, on retely, whi hone ofthese ontrolsdoesitinminimaltime. To
do sowe apply the Theorem 1.3.15 andobtain
where
p(t) ∈
IR2
isthe solutionof
˙p = −pA
. Letp(t) =
p
1
(t)
p
2
(t)
!
. Then,
u(t) = sign (p
2
(t))
and
˙p
1
= p
2
,˙p
2
= −p
1
, that is,p
¨
2
+ p
2
= 0
. Thusp
2
(t) = λ cos t + µ sin t
. Therefore, theoptimal ontrolispie ewise onstant in intervalsof size
π
andtakealternately thevalues±1
.•
Ifu = −1
, we getthe dierential system
˙x = y ,
˙y = −x − 1 .
(1.7)•
Ifu = +1
, we get
˙x = y ,
˙y = −x + 1 .
(1.8)The optimal traje tory, steering
X
0
to0
, onsistsin on atenatedpie esof solutions of (1.7 )and (1.8). The solutions of (1.7) and (1.8 ) are obtained easily: from equation (1.7 ) we have
(x + 1)
2
+ y
2
= const = R
2
and we on lude that the solution urves of (1.7) are ir lesentered on
x = −1
andy = 0
of period2π
(in fa t,x(t) = −1 + R cos t
andy(t) = R sin t
);assolutions of (1.8)weget
x(t) = 1 + R cos t
andy(t) = R sin t
, i.e.,the solutionsof (1.8)areir les entered in
x = 1
andy = 0
of period2π
.Theoptimaltraje torythatsteers
X
0
to0
followsalternately anar ofa ir le enteredinx = −1
andy = 0
andanar ofa ir le enteredinx = 1
andy = 0
. Thedetailedstudyoftheoptimal traje tory and its numeri al implementation, for every
X
0
, an be founded in [121 ℄.See also Se tion 2.6where the optimal ontrol problemissolved.
1.4 Nonlinear optimal ontrol
We now present some te hniques to analyze nonlinear optimal ontrol problems (the proofs
of the presented results an be found, for example, in [62 ,121℄). In parti ular, we enun iate
the Pontryagin maximum prin iple in a more general form than the one we have seen in
Se tion 1.3 . Thenonlinear example ofthe spring will be one ofthe appli ation examples.
1.4.1 Statement of the problem
Fromageneralpointofview,theproblemshouldbepresentedinamanifold
M
,butourpointof view will be lo al and we work on an open
V
of IRn
small enough. The general optimal
ontrol problemis the following. Consider the ontrol system
where
f
is a mapping of lassC
1
1 fromI × V × U
into IRn
,I
is an interval of IR,V
is an open set ofIRn
,U
is anopen set of IRm
,
(t
0
, x
0
) ∈ I × V
. We supposethat the ontrolsu(·)
belong toa subset of
L
∞
loc
(I,
IRm
)
.
Thesehypothesesassure,forevery ontrol
u(·)
, theexisten eanduniqueness ofamaximalsolution
x
u
(·)
over anintervalJ ⊂ I
,oftheCau hyproblem(1.9 )(see e.g.[121,Chapter11℄).Inwhat followswewill onsider, withoutlossof generality,
t
0
= 0
.Denition 1.4.1. Let
t
f
> 0
,t
f
∈ I
. A ontrol fun tionu(·) ∈ L
∞
([0, t
f
],
IRm
)
is said
admissible on
[0, t
f
]
ifthe traje toryx(·)
, solutionof (1.9 ) asso iated tou(·)
, is well denedon
[0, t
f
]
. Thesetofadmissible ontrolson[0, t
f
]
isdenotedU
t
f
,
IRm
,andthe setofadmissibleontrols on
[0, t
f
]
taking theirvalues inΩ
isdenotedU
t
f
,Ω
.Inwhatfollows wewillabbreviatethe notationforadmissible ontrolstakingvaluein IR
m
writingU
t
f
. Letf
0
beafun tion of lassC
1
over
I × V × U
,andg
a ontinuousfun tion overV
. Forevery ontrol
u(·) ∈ U
t
f
we dene the ost ofthe asso iated traje toryx
u
(·)
over the interval[0, t
f
]
byC(t
f
, u) =
Z
t
f
0
f
0
(t, x
u
(t), u(t))dt + g(t
f
, x
u
(t
f
)) .
Let
M
0
andM
1
be two subsets ofV
. Theoptimal ontrol problem isto ompute thetraje -tories
x
u
(·)
solutions of˙x
u
(t) = f (t, x
u
(t), u(t)) ,
su hthat
x
u
(0) ∈ M
0
,x
u
(t
f
) ∈ M
1
,andminimizingthe ostC(t
f
, u)
. Wesaythattheoptimalontrol hasfree nal time if the nal time
t
f
is free, otherwise we say that the problemhasxed naltime.
1.4.2 End-point mapping
Consider for the system(1.9 ) the following optimal ontrol problem: given a point
x
1
∈
IRn
,
nd atime
t
f
and a ontrolu
over[0, t
f
]
su hthatthe traje toryx
u
asso iated tothe ontrolu
, solutionof (1.9 ),satisesx
u
(0) = x
0
, x
u
(t
f
) = x
1
.
This leadsus to the following denition.
1
F.H. Clarke is theauthorofthe so- alledNonsmoothAnalysis reatedintheseventieswhi hallows the
study ofmore general optimal ontrol problems, where the usedfun tions are not ne essarily dierentiable
in the lassi sense. For adetailedstudy onNonsmoothAnalysis see, e.g., [30 33 ℄ and thereferen es ited
Denition 1.4.2. Let
t
f
> 0
. The end-point mappingin timet
f
ofthe ontrol system (1.9 )starting in
x
0
isthe mappingE
t
f
: U
t
f
−→
IRn
u 7−→ x
u
(t
f
) .
Inotherwords, the end-point mappingin time
t
f
asso iesto a ontrolu
the nalpoint ofthe traje tory asso iated to the ontrol
u
(see Figure1.13).Remark 1.4.3. We also an denote the end-point mapping by
E(x
0
, t
f
, u)
(see, e.g., Se tion1.5 ).
x
0
x
u
(·, x
0
)
x
u
(t
f
, x
0
)
Figure1.13: End-point mapping.
A very important issue in the theory of optimal ontrol is the study of the map
E
t
f
,des ribing its image, singularities, regularity, et . The answer to these questions depends,
obviously, onthe spa e
U
t
f
andon the shape ofthe system(on the fun tionf
). Withallthegenerality wehave the following result(see,e.g., [16,53 ,106℄).
Proposition1.4.4. Considerthesystem(1.9 )where
f
isC
p
,
p ≥ 1
,andletU
t
f
⊂ L
∞
([0, t
f
],
IRm
)
be the domain of
E
t
f
, that is, the set of ontrols whose asso iated traje tory is well denedover
[0, t
f
]
. ThenU
t
f
is an openset ofL
∞
([0, t
f
],
IRm
)
, andE
t
f
isC
p
in theL
∞
sense.Moreover theFré het dierential of
E
t
f
atapointu ∈ U
t
f
isgivenby thelinearized systemat
u
in the following way. Let,for everyt ∈ [0, t
f
]
,A(t) =
∂f
∂x
(t, x
u
(t), u(t)) , B(t) =
∂f
∂u
(t, x
u
(t), u(t)) .
The linearized ontrol system
˙y
v
(t) = A(t)y
v
(t) + B(t)v(t)
y
v
(0) = 0
is alled the linearized system along the traje tory
x
u
. The Fré het dierential ofE
t
f
atu
isthen themapping