• Nenhum resultado encontrado

Bergen Andrea Thursday SA

N/A
N/A
Protected

Academic year: 2018

Share "Bergen Andrea Thursday SA"

Copied!
121
0
0

Texto

(1)

Sensitivity Analysis

Andrea Saltelli

Centre for the Study of the Sciences and the Humanities (SVT) - University of Bergen (UIB) Institut de Ciència i Tecnologia Ambientals (ICTA)

-Universitat Autonoma de Barcelona (UAB)

PhD-course

Numbers for policy: Practical problems in quantification

(2)
(3)

= more material on my web site

(4)
(5)
(6)

Definitions

Uncertainty analysis: Focuses on just quantifying the uncertainty in model output

Sensitivity analysis: The study of the relative importance of different input factors on the

(7)

[Global

*

] sensitivity analysis: The

study of how the uncertainty in the

output of a model (numerical or

otherwise) can be apportioned to

different sources of uncertainty in the

model input

(8)

8 Simulation Model parameters Resolution levels data

errors model structures

uncertainty analysis

sensitivity analysis

model output

feedbacks on input data and model factors

(9)

One can sample more than just factors

One can sample modelling assumptions

(10)

Assumption Alternatives

Number of indicators  all six indicators included or

one-at-time excluded (6 options)

Weighting method  original set of weights,

 factor analysis,

 equal weighting,

 data envelopment analysis

Aggregation rule  additive,

 multiplicative,

(11)

Space of alternatives Including/ excluding variables Normalisation Missing data Weights Aggregation Country 1 10 20 30 40 50 60

Country 2 Country 3

(12)
(13)
(14)

Sample matrix for uncertainty and

sensitivity analysis

(15)

Each column is a sample of size N from the

(16)

Model results:

Each entry is the

(17)

Input matrix Output vector:

In the simplest case y could be a function of - a simple mathematical expression of - the x1,x2,…xk

e.g. y= x1 sin(x2)/x3

(18)
(19)

European Commission, 2015

Office for the Management and Budget, 2006

Environmental Protection Agency, 2009

EPA, 2009, March. Guidance on the Development, Evaluation, and Application of Environmental Models. Technical Report EPA/100/K-09/003. Office of the Science Advisor, Council for Regulatory Environmental Modeling,

http://nepis.epa.gov/Exe/ZyPDF.cgi?Dockey=P1003E4R.PDF, Last accessed December 2015.

EUROPEAN COMMISSION, Better regulation toolbox, appendix to the Better Regulation Guidelines, Strasbourg, 19.5.2015, SWD(2015) 111 final, COM(2015) 215 final, http://ec.europa.eu/smart-regulation/guidelines/docs/swd_br_guidelines_en.pdf.

OMB, Proposed risk assessment bulletin, Technical report, The Office of Management and Budget’s –Office of Information and Regulatory Affairs (OIRA), January 2006,

(20)

http://ec.europa.eu/smart-regulation/

(21)

Page 391 Six steps for a global SA:

1. Select one output of interest;

2. Participatory step: discuss which input may matter;

3. Participatory step (extended peer review): define distributions;

4. Sample from the distributions;

5. Run (=evaluate) the model for the sampled values;

6. Obtain in this way bot the uncertainty of the

(22)

Limits of

sensitivity

(23)

Useless Arithmetic: Why

Environmental Scientists Can't Predict the Future

by Orrin H. Pilkey and Linda Pilkey-Jarvis

Orrin H. Pilkey Duke University,

(24)

<<It is important, however, to

recognize that the sensitivity of the parameter in the equation is what is being determined, not the sensitivity of the parameter in nature.

[…] If the model is wrong or if it is a

poor representation of reality, determining the sensitivity of an

(25)

One of the examples discussed concerns the

Yucca Mountain repository for radioactive waste. TSPA model (for total system performance

assessment) for safety analysis.

(26)

TSPA (like any other model)

relies on assumptions  one is the low permeability of the

geological formation  long

(27)

The confidence of the stakeholders in TSPA was not helped when evidence was produced

which could lead to an upward revision of 4 orders of magnitude of this parameter

(28)

Type III error in sensitivity: Examples:

In the case of TSPA (Yucca mountain) a range of 0.02 to 1 millimetre per year was used for

percolation of flux rate.

(29)

Scientific mathematical modelling

should involve constant efforts to

falsify the model

Ref.  Robert K. Merton s Organized skepticism Communalism- the common ownership of scient40

ific discoveries, according to which scientists give up intellectual property rights in exchange for recognition and esteem (Merton actually used the term Communism, but had this notion of communalism in mind, not Marxism);

Universalism- according to which claims to truth are evaluated in terms of universal or impersonal criteria, and not on the basis of race, class, gender, religion, or nationality;

Disinterestedness- according to which scientists are rewarded for acting in ways that outwardly appear to be selfless;

Organized Skepticism- all ideas must be tested and are subject to rigorous, structured community scrutiny.

(30)

Can I lie with

sensitivity

(31)

Will any sensitivity analysis do the

job? Can I lie with sensitivity analysis

as I can lie with statistics?

(32)

Why not just changing one factor

at a time (OAT)?

<< one-at-a-time (OAT) approach is most commonly used in Commission IAs>>

(33)

Sensitivity

analysis usually proceeds

by changing one variable or assumption

at a time, but it can also be done by

varying a combination of variables

simultaneously to learn more about the

robustness of your results to

widespread

changes .

Why not just changing one factor at a time (OAT)?

Source: Office for the management and Budget of the White House (OMB), Circular A4, 2003

(34)

Why not just changing one factor

at a time (OAT)?

(35)

OAT in 2 dimensions

Area circle / area

square =?

(36)

OAT in 3 dimensions

Volume sphere /

volume cube =?

(37)

OAT in 10 dimensions

Volume hypersphere / volume

(38)

OAT in k dimensions

K=2

K=3

(39)

Bottom-line: once a sensitivity

analysis is done via OAT there is no

guarantee that either uncertainty

analysis (UA) or sensitivity analysis

(SA) is any good:

 UA will be non conservative

(40)

OAT is still the most largely used technique in SA. Out of every 100 papers with modelling & SA only 4 are global in the sense discussed here.

(41)

In 2014 out of 1000

papers in modelling 12 have a sensitivity

(42)
(43)
(44)

Discussion points (1)

• Is the geometric argument necessary? Anyone experience in design of experiment (DOE)?

(45)

Discussion points (2)

(46)

Discussion points (3)

• If I keep a parameter fixed I am in error, if I give it a

distribution there are problems to justify it … is this a law

(47)
(48)
(49)

Input matrix:

• Each column is a sample from the distribution of a factor

• Each row is a sample trial to

(50)

Output vector:

• Just one output of interest; but y could also be a vector (function of time) or a map,

etc. …

(51)

-60 -40 -20 0 20 40 60

-4 -3 -2 -1 0 1 2 3 4

-60 -40 -20 0 20 40 60

-4 -3 -2 -1 0 1 2 3 4

Y plotted against two different factors xi and xj

The values of the output on the ordinate are the same

Input variable xi

Input variable xj

(52)

-60 -40 -20 0 20 40 60

-4 -3 -2 -1 0 1 2 3 4

-60 -40 -20 0 20 40 60

-4 -3 -2 -1 0 1 2 3 4

Scatterplots of y versus sorted factors

Can I do a

sensitivity analysis

just looking at the

plots?

Output variable

Output variable 

Input variable xi

(53)

-60 -40 -20 0 20 40 60

-4 -3 -2 -1 0 1 2 3 4

-60 -40 -20 0 20 40 60

-4 -3 -2 -1 0 1 2 3 4

Which factor is more important?

Output variable Output variable

Input variable xi

Input variable xj

(54)

-60 -40 -20 0 20 40 60

-4 -3 -2 -1 0 1 2 3 4

-60 -40 -20 0 20 40 60

-4 -3 -2 -1 0 1 2 3 4

~1,000 blue

points

Divide them

in 20 bins of

~ 50 points

Compute the

bin s average

(55)

 

Y

X

i

E

i

~

X

Each pink point is ~

-60 -40 -20 0 20 40 60

(56)

 

i

X

E

Y

X

V

i i X~

Take the variance of

the pink points and

you have a

sensitivity measure

-60 -40 -20 0 20 40 60

(57)

-60 -40 -20 0 20 40 60

-4 -3 -2 -1 0 1 2 3 4

-60 -40 -20 0 20 40 60

-4 -3 -2 -1 0 1 2 3 4

Which factor

has the highest

?

 

i

X

E

Y

X

V

i

(58)

 

Y

i i

V

X

Y

E

V

(59)

First order sensitivity index

Pearson s correlation

ratio

Smoothed curve

(60)

First order sensitivity index:

(61)

 

i

X

E

Y

X

V

i

i

X

~

First order effect, or top marginal

variance=

= the expected reduction in variance that

would be achieved if factor Xi could be

(62)

We need first to prove that

(63)

 

 

(

)

~ ~

Y

V

X

Y

V

E

X

Y

E

V

i X i X i i i i

X X

Because:

(64)

 

 

(

)

~ ~

Y

V

X

Y

V

E

X

Y

E

V

i X i X i i i i

X X

Because:

This is what variance would be left (on

(65)

 

 

(

)

~ ~

Y

V

X

Y

V

E

X

Y

E

V

i X i X i i i i

X X

… must be the

expected reduction

in variance that would be achieved

if factor Xi could be fixed

(66)

 

(

)

~

Y

X

V

Y

E

V

i

i

Xi i

X

For additive models one can

decompose the total variance as a

sum of first order effects

… which is also how additive

(67)

If an additive model is one where the variance

V

of the output is a linear combination of the partial variances of the inputs then:

- can I guess a formula for an additive model

f(x

1

,x

2

,x

3

,…)

?

(68)
(69)

Is S

i

=0?

-60 -40 -20 0 20 40 60

(70)

Is this factor non-important?

-60 -40 -20 0 20 40 60

(71)

There are terms which capture

two-

way, three way, … interactions

among variables.

(72)

Variance decomposition (ANOVA)

 

k

i

j

i

ij

i

i

V

V

(73)

Variance decomposition (ANOVA)

 

i

i

X

E

Y

X

V

V

i

i

X

~

(74)

Variance decomposition (ANOVA)

When the factors are

independent the total variance

can be decomposed into main

effects and interaction effects

up to the order k, the

(75)

Variance decomposition (ANOVA)

When the factors are not

independent the

decomposition loses its

(76)

If fact interactions terms are

(77)

Wouldn t it be handy to have just a

single importance terms for all

(78)

In fact such terms exist and can be

computed easily, without

(79)

Thus given a model

Y=f(X

1

,X

2

,X

3

)

Instead of

V=V

1

+V

2

+V

3

+

+V

12

+V

13

+V

23

+

+V

123

(80)

1=S

1

+S

2

+S

3

+

+S

12

+S

13

+S

23

+

(81)

and analogue formulae for

S

T2

, S

T3

which can be computed without

knowing

S

1

, S

12

, S

13

, S

123

S

T1

is called a total effect

sensitivity index

We can compute an index

S

Ti

so

that e.g. for factor

x

1

:

(82)

Total effect, or bottom marginal

variance=

= the expected variance that

would be left if all factors but Xi

could be fixed.

V

X

Y

i

E

i

i

~

~

X

(83)

 

Y

i

Ti

V

Y

V

E

(84)

What is the shortcoming

(85)

Ti i X

S

Y

V

Y

V

E

i i

)

(

~

~

X

X

 

i i X

S

Y

V

X

Y

E

V

i i

)

(

~

X

(86)

V

X

Y

i

E

i

i

~

~

X

X

Why these measures?

Factors

prioritization

 

i

X

E

Y

X

V

i

i

X

~

Fixing (dropping)

non important

factors

(87)

More about the settings:

•Factor prioritisation 

 

Y i i V X Y E V S

If the cost of discovering factors

(88)

•Factor fixing: Can I fix a factor [or a subset of

input factors] at any given value over their range of uncertainty without reducing significantly the

output?

 

Y i Ti

V

Y

V

E

(89)

Factor fixing

is useful to achieve

model simplification and

(90)

Can we use S

i

to fix a

factor?

If S

i

=0 is X

i

a

non-influential factor?

-60 -40 -20 0 20 40 60

(91)

We cannot use S

i

to fix a factor;

S

i

=0 is a necessary condition for

X

i

to be non-influential but not a

sufficient one

X

i

could be influent at the second

(92)

Can we use S

Ti

to fix a

factor?

If S

Ti

=0 is X

i

a

(93)

Variance is always a

positive number

V

X

Y

i

E

i

i

~

~

X

X

For a mean of

non-negative

entries to be zero

all entries must

be zero

(94)

Summary for variance based measures:

1.

Easy-to-code, Monte Carlo

better

on quasi-random points. Estimate of

the error available.

2. The main effect can be made

(95)

-60 -40 -20 0 20 40 60

-4 -3 -2 -1 0 1 2 3 4

(96)

3. The total effect is more expensive;

its computational cost is (

k+1)N

where

N

is one of the order of one

thousand (unless e.g. using

emulators …).

(97)
(98)
(99)

When to use Monte Carlo

Filtering?

When we are interested not in the

precise value of the output y but on

whether or not this value is permitted

(100)

NOT OK

OK

NOT OK

(101)

NOT OK

OK

NOT OK NOT OK

OK

NOT OK

(102)

NOT OK OK

NOT OK

(103)

Monte Carlo filtering

) (Xi B

i

X

Y

) (Xi B

B B

= OK

= not OK

(104)

Monte Carlo filtering

Step by step:

 Classifying simulations as either or . This allows distinguishing two sub-sets for each Xi: and

 The Smirnov two-sample test (two-sided version) is performed for each factor independently,

analyzing the maximum distance between the

cumulative distributions of the and sets.

) (Xi B

B B

) (Xi B

(105)

Monte Carlo filtering

(106)

How to generate

the random

sample?

We use quasi

random

sequences

(107)

sequence

(108)

X1,X2 plane, 100 Sobol’ points X1,X2 plane, 1000 Sobol’ points

Sobol sequences of quasi

(109)

Sobol sequences of quasi

-random points

(110)
(111)

Root mean square error over K=50 different trials. The error refers to the

numeric-versus-analytic value the integral of the function (for n=360) over its dominion.

Source: Kucherenko S., Feil B., Shah N., Mauntz W. The identification of model effective dimensions using global sensitivity analysis Reliability Engineering and System Safety 96 (2011) 440–449.

Why quasi-random

Sergei Kucherenko, Imperial College

(112)

Variance based measures are: -well scaled,

-concise,

-easy to communicate.

Further

- Si reduces to squared standard regression

coefficients for linear model.

- STi detect and describe interactions and

- Becomes a screening test at low sample size

(113)
(114)

First secret: The most important

question is the question.

Corollary 1: Sensitivity analysis is

not “run” on a model but on a

(115)

First secret: The most important question is the

question.

Corollary 2: The best setting for a sensitivity

analysis is one when one wants to prove that a

question cannot be answered given the model

It is better to be in a setting of falsification than in

one of confirmation (Oreskes et al., 1994 ).

[Normally the opposite is the case]

(116)

Second secret: Sensitivity analysis should

not be used to hide assumptions

(117)

Third secret: If sensitivity analysis shows that a

question cannot be answered by the model one

should find another question/model which can

be treated meaningfully.

(118)

Badly kept secret:

There is always one more bug!

(119)

And of course please don’t …

… run a sensitivity analysis where each

(120)

Discussion point

(121)

END

Twitter:

Referências

Documentos relacionados

But the way the Bush administration interpreted the war on terror has hindered its ability to deal with these threats, and, in an added irony, if Iran gets nuclear weapons, the

Esta circunstancia es la que hace interesante, nos atreveríamos a decir apasionante, los límites de nuestro trabajo, en el que, luego de sintetizar los antecedentes de la

It accomplishes that goal by using the Application Model , to find which tasks need to be executed, and the Assembly Distribution Policy , to find which are the assemblies to

Na zona reservada ao funcionamento interno da Farmácia Reis encontram-se não só armazenados os medicamentos sujeitos a receita médica (MSRM), como também

Nessa linha de pensamento o processo de escolarização contemporâneo deve consubs- tanciar-se em variados enfoques, que vai além da racionalidade científica, incluindo a ética, o

Segundo Huberman (1989, citado por Alves, 2001), as últimas fases do desenvolvimento profissio­ nal  dos  professores,   designadas  por   conservantismo   e

The higher the sampling variability, the greater the chances of creating a model that can be extrapolated. The development of a prediction model that can be applied to different

We included only randomized controlled trials about women with breast cancer or women after breast cancer surgery undergoing treatment and doing pilates exer- cises in one arm of