• Nenhum resultado encontrado

Observed sample H0 :µ=0 H0012 (¯x1, x¯2) ¯x1−x¯2 p-value p-value

(0.05,-0.05) 0.1 0.9753 0.8231

(0.09,-0.11) 0.2 0.9039 0.6547

(0.14,-0.16) 0.3 0.7977 0.5023

(0.19,-0.21) 0.4 0.6697 0.3711

(0.23,-0.27) 0.5 0.5331 0.2636

(0.28,-0.32) 0.6 0.4049 0.1797

(0.33,-0.37) 0.7 0.2926 0.1175

(0.37,-0.43) 0.8 0.2001 0.0736

(0.42,-0.48) 0.9 0.1308 0.0442

P-value definition and its limitations

Level curves (contour curves)

−1.0 −0.5 0.0 0.5 1.0

−1.0−0.50.00.51.0

Significance level 10%

µ1

µ2

●●

●●●●

●●

●●●●

●●●

●●●

●●●

●●●

●●●

●●●

●●●

●●

●●

●●●

●●

●●●●

●●

●●●●

●●●

●●●

●●●●

●●

●●●

●●●

●●●

●●

●●

●●●

●●

●●●●

●●●

●●●

●●●●

●●●

●●●●

●●

●●●

●●

●●

●●●

●●

●●●●

●●●

●●●●

●●●

●●●

●●●●

●●

●●●

●●

●●

●●●

●●

●●

●●●

●●

●●●●

●●●

●●●

●●●

●●

●●●●

●●

●●●

●●

●●

●●●

●●●

●●●

●●●

●●●

●●●

●●●

●●●●

●●

●●●●

●●

●●●

●●●

●●●

●●●

●●●●

●●●

Region of rejection Region of rejection

P-value definition and its limitations

Level curves (contour curves)

−0.50.00.51.0

Significance level 10%

µ2

●●●●

●●●

●●●

●●●●

●●

●●●

●●●

●●●

●●

●●

●●●

●●

●●●●

●●●

●●●

●●●●

●●●

●●●●

●●

●●●

●●

●●

●●●

●●

●●●●

●●●

●●●●

●●●

●●●

●●●●

●●

●●●

●●

●●

●●●

●●

●●

●●●

●●

●●●●

●●●

●●●

●●●

●●

●●●●

●●

●●●

●●

●●

●●●

●●●

●●●

●●●

●●●

●●●

●●●

●●●●

●●

●●●●

●●

●●●

●●●

●●●

●●●

●●●●

●●●

Region of rejection Region of rejection

H0012

P-value definition and its limitations

Level curves (contour curves)

−1.0 −0.5 0.0 0.5 1.0

−1.0−0.50.00.51.0

Significance level 10%

µ1

µ2

●●

●●●●

●●

●●●●

●●●

●●●

●●●

●●●

●●●

●●●

●●●

●●

●●

●●●

●●

●●●●

●●

●●●●

●●●

●●●

●●●●

●●

●●●

●●●

●●●

●●

●●

●●●

●●

●●●●

●●●

●●●

●●●●

●●●

●●●●

●●

●●●

●●

●●

●●●

●●

●●●●

●●●

●●●●

●●●

●●●

●●●●

●●

●●●

●●

●●

●●●

●●

●●

●●●

●●

●●●●

●●●

●●●

●●●

●●

●●●●

●●

●●●

●●

●●

●●●

●●●

●●●

●●●

●●●

●●●

●●●

●●●●

●●

●●●●

●●

●●●

●●●

●●●

●●●

●●●●

●●●

Region of rejection Region of rejection

H0012

⇑ H0 :µ=0

P-value definition and its limitations

Level curves (contour curves)

−0.50.00.51.0

Significance level 10%

µ2

●●●●

●●●

●●●

●●●●

●●

●●●

●●●

●●●

●●

●●

●●●

●●

●●●●

●●●

●●●

●●●●

●●●

●●●●

●●

●●●

●●

●●

●●●

●●

●●●●

●●●

●●●●

●●●

●●●

●●●●

●●

●●●

●●

●●

●●●

●●

●●

●●●

●●

●●●●

●●●

●●●

●●●

●●

●●●●

●●

●●●

●●

●●

●●●

●●●

●●●

●●●

●●●

●●●

●●●

●●●●

●●

●●●●

●●

●●●

●●●

●●●

●●●

●●●●

●●●

Region of rejection Region of rejection

H0012

H0 :µ=0

Rejection ofH00 but no rejection ofH0

and

An alternative measure of evidence and some of its properties

An alternative measure of evidence (parametric case)

In what follows, we present an alternative measure calleds-value to overcome the previous issue (Patriota, 2013, FSS, 233).

Thes-value is a function s : 2Θ× X → [0,1] such that s(Θ0,x) =

sup{α∈(0,1) : Λα(x)∩Θ0 6=∅}, if Θ0 6=∅,

0, if Θ0 =∅.

where Λα is a confidence set for θ with confidence level 1−α with some “nice” properties.

An alternative measure of evidence and some of its properties

An alternative measure of evidence (parametric case)

In what follows, we present an alternative measure calleds-value to overcome the previous issue (Patriota, 2013, FSS, 233).

Thes-value is a function s : 2Θ× X → [0,1] such that s(Θ0,x) =

sup{α∈(0,1) : Λα(x)∩Θ0 6=∅}, if Θ0 6=∅,

0, if Θ0 =∅.

where Λα is a confidence set for θ with confidence level 1−α

An alternative measure of evidence and some of its properties

Interpretation

Interpretation: s =s(Θ0,x) is the largest significance level α (or 1−s is the smallest confidence level1−α) for which the confidence set and the set Θ0 have at least one element in common.

Large values ofs indicate that there exists at least one element inΘ0 close to the center ofΛα (e.g., close to the ML estimate).

Small values of s indicate that ALLelements of Θ0 are far away from the center of Λα.

An alternative measure of evidence and some of its properties

Interpretation

Interpretation: s =s(Θ0,x) is the largest significance level α (or 1−s is the smallest confidence level1−α) for which the confidence set and the set Θ0 have at least one element in common.

Large values ofs indicate that there exists at least one element inΘ0 close to the center ofΛα (e.g., close to the ML estimate).

Small values of s indicate that ALLelements of Θ0 are far away from the center of Λα.

An alternative measure of evidence and some of its properties

Interpretation

Interpretation: s =s(Θ0,x) is the largest significance level α (or 1−s is the smallest confidence level1−α) for which the confidence set and the set Θ0 have at least one element in common.

Large values ofs indicate that there exists at least one element inΘ0 close to the center ofΛα (e.g., close to the ML estimate).

Small values of s indicate that ALL elements of Θ0 are far away from the center of Λα.

An alternative measure of evidence and some of its properties

Graphical illustration: s

1

= s (Θ

1

, x )

−1.0−0.50.00.51.01.5

θ2

Θ1

Θ2

Λs1

θ^ Θ

An alternative measure of evidence and some of its properties

Graphical illustration: s

2

= s (Θ

2

, x )

−1.5 −1.0 −0.5 0.0 0.5 1.0 1.5

−1.5−1.0−0.50.00.51.01.5

θ

θ2

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

Θ1

Θ2

Λs1

θ^ Θ

Λs2

An alternative measure of evidence and some of its properties

Properties: the s-value is a possibility measure

1 s(∅,x) = 0 and s(Θ,x) = 1,

2 If Θ1 ⊆Θ2, then s(Θ1,x)≤s(Θ2,x),

3 For anyΘ12 ⊆Θ, s(Θ1∪Θ2,x) = max{s(Θ1,x),s(Θ2,x)},

4 If Θ1 ⊆Θ, then s(Θ1,x) = supθ∈Θ1s({θ},x),

5 s(Θ1,x) = 1 or s(Θc1,x) = 1:

ifθb∈Θ1 (closure ofΘ1), then s(Θ1,x) = 1, ifθb∈Θc1, then s(Θc1,x) = 1.

where θˆis an element of the center of Λα, i.e., θˆ∈T

αΛα(x).

An alternative measure of evidence and some of its properties

Properties: the s-value is a possibility measure

1 s(∅,x) = 0 and s(Θ,x) = 1,

2 If Θ1 ⊆Θ2, then s(Θ1,x)≤s(Θ2,x),

3 For anyΘ12 ⊆Θ, s(Θ1∪Θ2,x) = max{s(Θ1,x),s(Θ2,x)},

4 If Θ1 ⊆Θ, then s(Θ1,x) = supθ∈Θ1s({θ},x),

5 s(Θ1,x) = 1 or s(Θc1,x) = 1:

ifθb∈Θ1 (closure ofΘ1), then s(Θ1,x) = 1, ifθb∈Θc1, then s(Θc1,x) = 1.

where θˆis an element of the center of Λα, i.e., θˆ∈T

αΛα(x).

An alternative measure of evidence and some of its properties

Properties: the s-value is a possibility measure

1 s(∅,x) = 0 and s(Θ,x) = 1,

2 If Θ1 ⊆Θ2, then s(Θ1,x)≤s(Θ2,x),

3 For anyΘ12 ⊆Θ, s(Θ1∪Θ2,x) = max{s(Θ1,x),s(Θ2,x)},

4 If Θ1 ⊆Θ, then s(Θ1,x) = supθ∈Θ1s({θ},x),

5 s(Θ1,x) = 1 or s(Θc1,x) = 1:

ifθb∈Θ1 (closure ofΘ1), then s(Θ1,x) = 1, ifθb∈Θc1, then s(Θc1,x) = 1.

where θˆis an element of the center of Λα, i.e., θˆ∈T

αΛα(x).

An alternative measure of evidence and some of its properties

Properties: the s-value is a possibility measure

1 s(∅,x) = 0 and s(Θ,x) = 1,

2 If Θ1 ⊆Θ2, then s(Θ1,x)≤s(Θ2,x),

3 For anyΘ12 ⊆Θ, s(Θ1∪Θ2,x) = max{s(Θ1,x),s(Θ2,x)},

4 If Θ1 ⊆Θ, then s(Θ1,x) = supθ∈Θ1s({θ},x),

5 s(Θ1,x) = 1 or s(Θc1,x) = 1:

ifθb∈Θ1 (closure ofΘ1), then s(Θ1,x) = 1, ifθb∈Θc1, then s(Θc1,x) = 1.

where θˆis an element of the center of Λα, i.e., θˆ∈T

αΛα(x).

An alternative measure of evidence and some of its properties

Properties: the s-value is a possibility measure

1 s(∅,x) = 0 and s(Θ,x) = 1,

2 If Θ1 ⊆Θ2, then s(Θ1,x)≤s(Θ2,x),

3 For anyΘ12 ⊆Θ, s(Θ1∪Θ2,x) = max{s(Θ1,x),s(Θ2,x)},

4 If Θ1 ⊆Θ, then s(Θ1,x) = supθ∈Θ1s({θ},x),

5 s(Θ1,x) = 1 or s(Θc1,x) = 1:

ifθb∈Θ1 (closure ofΘ1), then s(Θ1,x) = 1,

An alternative measure of evidence and some of its properties

Decisions about H

0

LetΦ be a function such that:

Φ(Θ0) =hs(Θ0),s(Θc0)i.

Then,

Φ(Θ0) = ha,1i =⇒ rejectionof H0 if a is “small” enough,

Φ(Θ0) = h1,bi =⇒ acceptance ofH0 if b is “small” enough. Φ(Θ0) = h1,1i =⇒ total ignorance about H0.

An alternative measure of evidence and some of its properties

Decisions about H

0

LetΦ be a function such that:

Φ(Θ0) =hs(Θ0),s(Θc0)i.

Then,

Φ(Θ0) = ha,1i =⇒ rejectionof H0 if a is “small” enough, Φ(Θ0) = h1,bi =⇒ acceptance ofH0 if b is “small” enough.

Φ(Θ0) = h1,1i =⇒ total ignorance about H0.

An alternative measure of evidence and some of its properties

Decisions about H

0

LetΦ be a function such that:

Φ(Θ0) =hs(Θ0),s(Θc0)i.

Then,

Φ(Θ0) = ha,1i =⇒ rejectionof H0 if a is “small” enough, Φ(Θ0) = h1,bi =⇒ acceptance ofH0 if b is “small” enough.

Φ(Θ0) = h1,1i =⇒ total ignorance about H0.

An alternative measure of evidence and some of its properties

How to find the thresholds for a and b to decide about H

0

?

This is still an open problem.

We could try to find those thresholds vialoss functions.

or viafrequentist criteria by employing the following asymptotic property:

Property: If the statistical model is regular and the confidence region is built from a statistics Tθ(X) that converges in

distribution to χ2k, then:

sa = 1−Fk(FH−10(1−pa)),

where pa = 1−FH0(t) is the asymptotic p-value to testH0.

An alternative measure of evidence and some of its properties

How to find the thresholds for a and b to decide about H

0

?

This is still an open problem.

We could try to find those thresholds vialoss functions.

or viafrequentist criteria by employing the following asymptotic property:

Property: If the statistical model is regular and the confidence region is built from a statistics Tθ(X) that converges in

distribution to χ2k, then:

sa = 1−Fk(FH−10(1−pa)),

where pa = 1−FH0(t) is the asymptotic p-value to testH0.

An alternative measure of evidence and some of its properties

How to find the thresholds for a and b to decide about H

0

?

This is still an open problem.

We could try to find those thresholds vialoss functions.

or viafrequentist criteria by employing the following asymptotic property:

Property: If the statistical model is regular and the confidence region is built from a statistics Tθ(X) that converges in

distribution to χ2k, then:

sa = 1−Fk(FH−10(1−pa)),

where pa = 1−FH0(t) is the asymptotic p-value to testH0.

An alternative measure of evidence and some of its properties

How to find the thresholds for a and b to decide about H

0

?

This is still an open problem.

We could try to find those thresholds vialoss functions.

or viafrequentist criteria by employing the following asymptotic property:

Property: If the statistical model is regular and the confidence region is built from a statistics Tθ(X) that converges in

distribution to χ2k, then:

sa = 1−Fk(FH−10(1−pa)),

where pa = 1−FH0(t) is the asymptotic p-value to testH0.

An alternative measure of evidence and some of its properties

How to find the thresholds for a and b to decide about H

0

?

This is still an open problem.

We could try to find those thresholds vialoss functions.

or viafrequentist criteria by employing the following asymptotic property:

Property: If the statistical model is regular and the confidence region is built from a statistics Tθ(X) that converges in

distribution to χ2k, then:

Numerical illustration

Example: Bivariate Normal distribution

LetX = (X1, . . . ,Xn)be a sample from a bivariate normal distribution with mean vectorµ= (µ1, µ2)> and identity variance matrix.

Notice that (−2 log of) the likelihood-ratio statistic is Tµ(x) =n( ¯X −µ)>( ¯X −µ)∼χ22,

The confidence setΛα is given by

Λα(x) = {µ∈R2 : Tµ(x)≤F2−1(1−α)}, where F2 is the cumulative chi-squared distribution with two degrees of freedom.

Numerical illustration

Example: Bivariate Normal distribution

LetX = (X1, . . . ,Xn)be a sample from a bivariate normal distribution with mean vectorµ= (µ1, µ2)> and identity variance matrix.

Notice that (−2 log of) the likelihood-ratio statistic is Tµ(x) =n( ¯X −µ)>( ¯X −µ)∼χ22,

The confidence setΛα is given by

Λα(x) = {µ∈R2 : Tµ(x)≤F2−1(1−α)}, where F2 is the cumulative chi-squared distribution with two degrees of freedom.

Numerical illustration

Example: Bivariate Normal distribution

LetX = (X1, . . . ,Xn)be a sample from a bivariate normal distribution with mean vectorµ= (µ1, µ2)> and identity variance matrix.

Notice that (−2 log of) the likelihood-ratio statistic is Tµ(x) =n( ¯X −µ)>( ¯X −µ)∼χ22,

The confidence setΛα is given by

Λα(x) = {µ∈R2 : Tµ(x)≤F2−1(1−α)}, where F2 is the cumulative chi-squared distribution with two degrees of freedom.

Numerical illustration

Numerical illustration

Observed sample H0 :µ=0 H0012 (¯x1, ¯x2) x¯1−¯x2 p/s-value p-value s-value (0.05,-0.05) 0.1 0.9753 0.8231 0.9753 (0.09,-0.11) 0.2 0.9039 0.6547 0.9048 (0.14,-0.16) 0.3 0.7977 0.5023 0.7985 (0.19,-0.21) 0.4 0.6697 0.3711 0.6703 (0.23,-0.27) 0.5 0.5331 0.2636 0.5353 (0.28,-0.32) 0.6 0.4049 0.1797 0.4066 (0.33,-0.37) 0.7 0.2926 0.1175 0.2938 (0.37,-0.43) 0.8 0.2001 0.0736 0.2019

Numerical illustration

Numerical illustration

Observed sample H0 :µ=0 H0012 (¯x1, ¯x2) x¯1−¯x2 p/s-value p-value s-value (0.05,-0.05) 0.1 0.9753 0.8231 0.9753 (0.09,-0.11) 0.2 0.9039 0.6547 0.9048 (0.14,-0.16) 0.3 0.7977 0.5023 0.7985 (0.19,-0.21) 0.4 0.6697 0.3711 0.6703 (0.23,-0.27) 0.5 0.5331 0.2636 0.5353 (0.28,-0.32) 0.6 0.4049 0.1797 0.4066 (0.33,-0.37) 0.7 0.2926 0.1175 0.2938 (0.37,-0.43) 0.8 0.2001 0.0736 0.2019 (0.42,-0.48) 0.9 0.1308 0.0442 0.1320 (0.47,-0.53) 1.0 0.0813 0.0253 0.0821

Numerical illustration

Numerical illustration

Observed sample H0 :µ=0 H0012 (¯x1, ¯x2) x¯1−¯x2 p/s-value p-value s-value (0.05,-0.05) 0.1 0.9753 0.8231 0.9753 (0.09,-0.11) 0.2 0.9039 0.6547 0.9048 (0.14,-0.16) 0.3 0.7977 0.5023 0.7985 (0.19,-0.21) 0.4 0.6697 0.3711 0.6703 (0.23,-0.27) 0.5 0.5331 0.2636 0.5353 (0.28,-0.32) 0.6 0.4049 0.1797 0.4066 (0.33,-0.37) 0.7 0.2926 0.1175 0.2938 (0.37,-0.43) 0.8 0.2001 0.0736 0.2019

Numerical illustration

Numerical illustration

Observed sample H0 :µ=0 H0012 (¯x1, ¯x2) x¯1−¯x2 p/s-value p-value s-value (0.05,-0.05) 0.1 0.9753 0.8231 0.9753 (0.09,-0.11) 0.2 0.9039 0.6547 0.9048 (0.14,-0.16) 0.3 0.7977 0.5023 0.7985 (0.19,-0.21) 0.4 0.6697 0.3711 0.6703 (0.23,-0.27) 0.5 0.5331 0.2636 0.5353 (0.28,-0.32) 0.6 0.4049 0.1797 0.4066 (0.33,-0.37) 0.7 0.2926 0.1175 0.2938 (0.37,-0.43) 0.8 0.2001 0.0736 0.2019 (0.42,-0.48) 0.9 0.1308 0.0442 0.1320 (0.47,-0.53) 1.0 0.0813 0.0253 0.0821

Numerical illustration

Graphical illustration: s ({µ

1

= µ

2

}, x

1

) = 0.9753

−0.6−0.4−0.20.00.2

µ2

Numerical illustration

Graphical illustration: s ({µ

1

= µ

2

}, x

2

) = 0.9048

−0.2 0.0 0.2 0.4 0.6 0.8

−0.8−0.6−0.4−0.20.00.2

µ µ2

Numerical illustration

Graphical illustration: s ({µ

1

= µ

2

}, x

3

) = 0.7985

−0.6−0.4−0.20.00.2

µ2

Numerical illustration

Graphical illustration: s ({µ

1

= µ

2

}, x

4

) = 0.6703

−0.2 0.0 0.2 0.4 0.6 0.8

−0.8−0.6−0.4−0.20.00.2

µ µ2

Numerical illustration

Graphical illustration: s ({µ

1

= µ

2

}, x

5

) = 0.5353

−0.6−0.4−0.20.00.2

µ2

Numerical illustration

Graphical illustration: s ({µ

1

= µ

2

}, x

6

) = 0.4066

−0.2 0.0 0.2 0.4 0.6 0.8

−0.8−0.6−0.4−0.20.00.2

µ µ2

Numerical illustration

Graphical illustration: s ({µ

1

= µ

2

}, x

7

) = 0.2938

−0.6−0.4−0.20.00.2

µ2

Numerical illustration

Graphical illustration: s ({µ

1

= µ

2

}, x

8

) = 0.2019

−0.2 0.0 0.2 0.4 0.6 0.8

−0.8−0.6−0.4−0.20.00.2

µ µ2

Numerical illustration

Graphical illustration: s ({µ

1

= µ

2

}, x

9

) = 0.1320

−0.6−0.4−0.20.00.2

µ2

Numerical illustration

Graphical illustration: s ({µ

1

= µ

2

}, x

10

) = 0.0821

−0.2 0.0 0.2 0.4 0.6 0.8

−0.8−0.6−0.4−0.20.00.2

µ µ2

Final remarks

Final remarks

The s-value:

can be applied directly whenever the log-likelihood function is concave by the formulas = 1−F(FH0(1−p))

is a possibilistic measure and can be studied by means of the Abstract belief Calculus ABC (Darwiche, Ginsberg, 1992).

can be justified bydesiderata(more basic axioms).

avoids the p-value problem of non-monotonicity.

Documentos relacionados