Independence - Probability: Theory and Examples

Measure theory ends and probability begins with the definition of independence. We begin with what is hopefully a familiar definition and then work our way up to a definition that is appropriate for our current setting.

Two eventsAandB areindependent ifP(A∩B) =P(A)P(B).

Two random variablesX andY areindependentif for all C, D∈ R, P(X ∈C, Y ∈D) =P(X ∈C)P(Y ∈D) i.e., the eventsA={X ∈C} andB={Y ∈D}are independent.

Twoσ-fieldsF andGareindependentif for allA∈ F andB ∈ Gthe eventsAand B are independent.

As the next exercise shows, the second definition is a special case of the third.

Exercise 2.1.1. (i) Show that ifX andY are independent thenσ(X) andσ(Y) are.

(ii) Conversely, if F andG are independent, X ∈ F, andY ∈ G, thenX and Y are independent.

The first definition is, in turn, a special case of the second.

Exercise 2.1.2. (i) Show that ifA andB are independent then so areA^c andB,A and B^c, andA^c and B^c. (ii) Conclude that events Aand B are independent if and only if their indicator random variables 1Aand 1B are independent.

In view of the fact that the first definition is a special case of the second, which is a special case of the third, we take things in the opposite order when we say what it means for several things to be independent. We begin by reducing to the case of finitely many objects. An infinite collection of objects (σ-fields, random variables, or sets) is said to be independent if every finite subcollection is.

σ-fields F1,F2, . . . ,Fn are independent if whenever Ai ∈ Fi for i = 1, . . . , n, we have

P(∩ⁿ_i=1Ai) =

i=1

P(Ai) 37

Random variablesX1, . . . , Xn areindependent if wheneverBi∈ Rfori= 1, . . . , n we have

P(∩ⁿ_i=1{Xi∈Bi}) =

i=1

P(Xi∈Bi)

SetsA1, . . . , An areindependentif whenever I⊂ {1, . . . n}we have P(∩i∈IA_i) =Y

i∈I

P(A_i)

At first glance, it might seem that the last definition does not match the other two.

However, if you think about it for a minute, you will see that if the indicator variables 1A_i, 1≤i≤nare independent and we takeBi={1} fori∈I, and Bi=Rfori6∈I then the condition in the definition results. Conversely,

Exercise 2.1.3. Let A₁, A₂, . . . , A_n be independent. Show (i) A^c₁, A₂, . . . , A_n are independent; (ii) 1_A₁, . . . ,1_A_n are independent.

One of the first things to understand about the definition of independent events is that it is not enough to assumeP(Ai∩Aj) =P(Ai)P(Aj) for alli6=j. A sequence of eventsA1, . . . , An with the last property is calledpairwise independent. It is clear that independent events are pairwise independent. The next example shows that the converse is not true.

Example 2.1.1. LetX1, X2,X3 be independent random variables with P(X_i = 0) =P(X_i = 1) = 1/2

Let A1 ={X2 =X3}, A2 = {X3 = X1} and A3 ={X1 =X2}. These events are pairwise independent since ifi6=j then

P(A_i∩A_j) =P(X₁=X₂=X₃) = 1/4 =P(A_i)P(A_j) but they are not independent since

P(A1∩A2∩A3) = 1/46= 1/8 =P(A1)P(A2)P(A3)

In order to show that random variables X and Y are independent, we have to check that P(X ∈ A, Y ∈ B) = P(X ∈ A)P(Y ∈ B) for all Borel sets A and B.

Since there are a lot of Borel sets, our next topic is

2.1.1 Sufficient Conditions for Independence

Our main result is Theorem 2.1.3. To state that result, we need a definition that generalizes all our earlier definitions.

Collections of sets A1,A2, . . . ,An ⊂ F are said to be independent if whenever Ai∈ Ai andI⊂ {1, . . . , n} we haveP(∩_i∈IAi) =Q

i∈IP(Ai)

If each collection is a single set i.e.,Ai={Ai}, this definition reduces to the one for sets.

Lemma 2.1.1. Without loss of generality we can suppose each A_i contains Ω. In this case the condition is equivalent to

P(∩ⁿ_i=1Ai) =

i=1

P(Ai) wheneverAi∈ Ai

since we can setAi= Ωfori6∈I.

2.1. INDEPENDENCE 39 Proof. IfA1,A2, . . . ,An are independent and ¯Ai=Ai∪ {Ω}then ¯A1,A¯2, . . . ,A¯n are independent, since ifA_i∈A¯iand I={j:A_j = Ω} ∩iA_i=∩i∈IA_i.

The proof of Theorem 2.1.3 is based on Dynkin’s π−λtheorem. To state this result, we need two definitions. We say that A is a π-system if it is closed under intersection, i.e., ifA, B ∈ A thenA∩B ∈ A. We say thatL is a λ-system if: (i) Ω∈ L. (ii) IfA, B∈ L andA⊂B then B−A∈ L. (iii) IfA_n∈ L andA_n ↑Athen A∈ L.

Theorem 2.1.2. π−λ Theorem. If P is a π-system and L is a λ-system that containsP thenσ(P)⊂ L.

The proof is hidden away in Section A.1 of the Appendix.

Theorem 2.1.3. SupposeA1,A2, . . . ,An are independent and eachAiis aπ-system.

Thenσ(A1), σ(A2), . . . , σ(An) are independent.

Proof. Let A₂, . . . , A_n be sets with A_i ∈ A_i, let F = A₂∩ · · · ∩A_n and let L = {A : P(A∩F) = P(A)P(F)}. Since P(Ω∩F) = P(Ω)P(F), Ω ∈ L. To check (ii) of the definition of a λ-system, we note that if A, B ∈ L with A ⊂ B then (B−A)∩F = (B∩F)−(A∩F). So using (i) in Theorem 1.1.1, the factA, B ∈ L and then (i) in Theorem 1.1.1 again:

P((B−A)∩F) =P(B∩F)−P(A∩F) =P(B)P(F)−P(A)P(F)

={P(B)−P(A)}P(F) =P(B−A)P(F)

and we have B−A ∈ L. To check (iii) let B_k ∈ L with B_k ↑ B and note that (B_k∩F)↑(B∩F) so using (iii) in Theorem 1.1.1, the fact that B_k ∈ L, and then (iii) in Theorem 1.1.1 again:

P(B∩F) = lim

k P(Bk∩F) = lim

k P(Bk)P(F) =P(B)P(F)

Applying the π−λtheorem now givesL ⊃σ(A1). It follows that ifA1 ∈σ(A1) andAi∈ Ai for 2≤i≤nthen

P(∩ⁿ_i=1Ai) =P(A1)P(∩ⁿ_i=2Ai) =

i=1

P(Ai)

Using Lemma 2.1.1 now, we have:

(∗) IfA1,A2, . . . ,An are independent thenσ(A1),A2, . . . ,An are independent.

Applying (∗) toA2, . . . ,An, σ(A1) (which are independent since the definition is un-changed by permuting the order) shows that σ(A2),A3, . . . ,An,σ(A1) are indepen-dent, and afterniterations we have the desired result.

Remark. The reader should note that it is not easy to show that if A, B ∈ Lthen A∩B ∈ L, orA∪B∈ L, but it is easy to check that ifA, B ∈ Lwith A⊂B then B−A∈ L.

Having worked to establish Theorem 2.1.3, we get several corollaries.

Theorem 2.1.4. In order for X1, . . . , Xn to be independent, it is sufficient that for allx₁, . . . , x_n ∈(−∞,∞]

P(X₁≤x₁, . . . , X_n≤x_n) =

i=1

P(X_i ≤x_i) Proof. LetAi= the sets of the form{Xi≤x_i}. Since

{Xi≤x} ∩ {Xi≤y}={Xi≤x∧y},

where (x∧y)_i = x_i∧y_i = min{xi, y_i}. Ai is a π-system. Since we have allowed x_i = ∞, Ω ∈ Ai. Exercise 1.3.1 implies σ(Ai) = σ(X_i), so the result follows from Theorem 2.1.3.

The last result expresses independence of random variables in terms of their distri-bution functions. The next two exercises treat density functions and discrete random variables.

Exercise 2.1.4. Suppose (X1, . . . , Xn) has densityf(x1, x2, . . . , xn), that is P((X1, X2, . . . , Xn)∈A) =

f(x)dxforA∈ Rⁿ

If f(x) can be written asg₁(x₁)· · ·g_n(x_n) where the g_m ≥0 are measurable, then X₁, X₂, . . . , X_n are independent. Note that theg_mare not assumed to be probability densities.

Exercise 2.1.5. SupposeX1, . . . , Xnare random variables that take values in count-able setsS₁, . . . , S_n. Then in order forX₁, . . . , X_n to be independent, it is sufficient that wheneverx_i∈S_i

P(X1=x1, . . . , Xn=xn) =

i=1

P(Xi =xi)

Our next goal is to prove that functions of disjoint collections of independent random variables are independent. See Theorem 2.1.6 for the precise statement. First we will prove an analogous result forσ-fields.

Theorem 2.1.5. Suppose F_i,j,1 ≤ i ≤ n,1 ≤ j ≤ m(i) are independent and let G_i=σ(∪_jF_i,j). ThenG₁, . . . ,G_n are independent.

Proof. LetAi be the collection of sets of the form∩jAi,j where Ai,j ∈ Fi,j. Ai is a π-system that contains Ω and contains∪jFi,j so Theorem 2.1.3 implies σ(Ai) =Gi

are independent.

Theorem 2.1.6. If for 1 ≤ i ≤ n, 1 ≤ j ≤ m(i), Xi,j are independent and fi : R^m(i)→Rare measurable then fi(Xi,1, . . . , X_i,m(i)) are independent.

Proof. Let Fi,j =σ(X_i,j) and Gi = σ(∪jFi,j). Sincef_i(X_i,1, . . . , X_i,m(i)) ∈ Gi, the desired result follows from Theorem 2.1.5 and Exercise 2.1.1.

A concrete special case of Theorem 2.1.6 that we will use in a minute is: if X1, . . . , Xnare independent thenX=X1andY =X2· · ·Xnare independent. Later, when we study sumsSm=X1+· · ·+Xmof independent random variablesX1, . . . , Xn, we will use Theorem 2.1.6 to conclude that ifm < nthenSn−Smis independent of the indicator function of the event{max1≤k≤mSk > x}.

No documento Probability: Theory and Examples (páginas 45-49)