Comput. Appl. Math. vol.28 número1

(1)

ISSN 0101-8205 www.scielo.br/cam

Unitary invariant and residual independent

matrix distributions

ARJUN K. GUPTA1_{, DAYA K. NAGAR}2∗

and ASTRID M. VÉLEZ-CARVAJAL2 1_{Department of Mathematics and Statistics,}

Bowling Green State University, Bowling Green, Ohio 43403-0221, USA

2_{Departamento de Matemáticas,}

Universidad de Antioquia, Calle 67, No. 53-108, Medellín, Colombia E-mails: [email protected] / [email protected]

Abstract. Define Z₁₃ ₌ A1/2Y A1/2H

(A and Y are independent) and Z₁₅ ₌ B1/2Y B1/2H₍

BandYare independent), whereY,AandBfollow inverted complex Wishart, complex beta type I and complex beta type II distributions, respectively. In this article sev-eral properties including expected values of scalar and matrix valued functions ofZ13 andZ15

are derived.

Mathematical subject classification: Primary: 62E15; Secondary: 62H99.

Key words: beta distribution, inverted complex Wishart, complex random matrix, Gauss hypergeometric function, residual independent, unitary invariant, zonal polynomial.

1 Introduction

This paper deals with complex random quadratic forms involving complex Wishart, inverted complex Wishart, complex beta type I and complex beta type II matrices. To define these distributions we need some notations from complex

#768/08. Received: 27/V/08. Accepted: 03/XII/08.

(2)

matrix algebra. Let A₌(ai j)be anm_×mmatrix of complex numbers. Then, AH _{denotes conjugate transpose of} _A_;

tr(A) ₌ a11+ ∙ ∙ ∙ +amm;

etr(A) ₌ exp(tr(A))_;

det(A) ₌ determinant ofA_;

A ₌ AH _>_{0 means that} _A_{is Hermitian positive definite; 0} _< _A

= AH _< _Im

means thatAandIm₋Aare Hermitian positive definite andA1/2denotes square

root ofA₌ AH _>_0.

Now, we define complex Wishart, inverted complex Wishart, complex matrix variate beta type I, complex matrix variate beta type II distributions.

Definition 1.1. The m _×m random Hermitian positive definite matrix X is said to have a complex Wishart distribution with parameters m, ν(_≥ m) and 6₌6H >0, written as X _∼C_Wm_{(ν, 6), if its p.d.f. is given by}

˜

Ŵm(ν)det(6)ν −1

det(X)ν−metr −6−1X, X =XH >0, whereŴ˜m(a)is the complex multivariate gamma function defined by

˜

Ŵm(a)=πm(m−1)/2 m Y

i=1

Ŵ (a₋i₊1) , Re(a) >m₋1.

Definition 1.2. The m_×m random Hermitian positive definite matrix Y is said to be distributed as inverted complex Wishart with parameters m,μ (_≥ m)and 9₌9H >0, denoted by Y _∼ IC_Wm_{(μ, 9), if its p.d.f. is given by}

˜

Ŵm(μ) − 1

det(9)μdet(Y)−(μ+m)etr ₋Y−19, Y ₌YH >0.

Note that if Y _∼ IC_Wm(μ, 9), then Y−1

∼ C_Wm(μ, 9−1₎_{. The inverted}

complex Wishart distribution was first derived by Tan [17] as posterior distri-bution of6 in a complex multivariate regression model. Later Shaman [16]

studied some of its properties and applied it to spectral estimation. For m ₌

1, the inverted complex Wishart density slides to an inverted gamma density given by

Ŵ(μ) −1ψμy−(μ+1)exp ₋ψy−1

, y >0, μ >0, ψ >0.

(3)

Definition 1.3. The m _×m random Hermitian positive definite matrix X is said to have a complex matrix variate beta type I distribution, denoted as X _∼

C_BI

m(a,b), if its p.d.f. is given by

˜

Bm(a,b) −1det(X)a−mdet(Im₋X)b−m, 0<X ₌ XH < Im,

where a > m₋1, b > m₋1and Bm˜ (a,b)is the complex multivariate beta function defined by

˜

Bm(a,b)= Ŵ˜m(a)Ŵ˜m(b) ˜

Ŵm(a₊b) , Re(a) >m−1, Re(b) >m−1.

Definition 1.4. The m _×m random Hermitian positive definite matrix Y is said to have a complex matrix variate beta type II distribution, denoted as Y _∼

C_BI I

m (a,b), if its p.d.f. is given by

˜

Bm(a,b) −1det(Y)a−mdet(Im ₊Y)−(a+b), Y ₌YH >0, where a>m₋1, and b>m₋1.

The complex matrix variate beta distributions arise in various problems in multivariate statistical analysis. Several test statistics in complex multivariate analysis of variance and covariance are functions of complex beta matrices. These distributions can be derived using complex Wishart matrices (Tan [18]). The relationship between beta type I and type II matrices can be stated as fol-lows. If X _∼ C_BI

m(a,b), then (Im − X)−1/2X(Im − X)−1/2 ∼ CBmI I(a,b).

Further, ifY _∼C_BI I

m (a,b), thenY− 1

∼C_BI I

m (b,a)and(Im +Y)− 1/2_Y_(Im

+ Y)−1/2

∼C_BI m(a,b).

It is well known in the statistical literature that the complex matrix variate beta type I and type II, complex Wishart (6 ₌ Im), and complex inverted

Wishart (9 ₌ Im) distributions are unitary invariant and residually

indepen-dent (Khatri [9], Tan [18], Nagar, Bedoya and Arias [14], Bedoya, Nagar and Gupta [1], Goodman [4], Shaman [16]) and therefore belong to the class of unitary invariant and residual independent matrix (UNIARIM) distributions,

˜

(4)

Definition 1.5. The m_×m random Hermitian positive definite matrix X is said to have an UNIARIM distribution if

(i) for any m _×m unitary matrix U , distributions of X and U XUH _are identical, and

(ii) for any lower triangular factorization X ₌T TH_{, T}

=(Ti j), Tii(mi_×mi), i ₌1, . . . ,k are independent, for any partition_{m1,m2, . . . ,mk}of m.

When X has UNIARIM distribution we will write X _{∈ ˜}C_m. Next, we state

several properties of UNIARIM distributions.

Theorem 1.6. Let X _{∈ ˜}C_m_{. Partition X as X} ₌ X11 X12 X21 X22

, X11(q×q). Then X₁₁and X₂₂_∙₁₌X₂₂₋X₂₁X₁₁−1X₁₂are independent, X11 ∈ ˜Cq, and X22∙1∈ ˜Cm−q.

Theorem 1.7. Let X _{∈ ˜}C_m_{and Y} _{∈ ˜}C_m _{be independent. Then, for any square}

root T of Y , the distribution of Z ₌ T X TH belongs toC˜_m_{. Further, if T}₁_and

T2are two different square roots of Y , then T1X T1H and T2X T2H have identical distributions.

From the above theorem it follows that if U ₌ (U1,U2), Ui(m × mi), i ₌ 1,2, m1+m2 = m is a random unitary matrix independent of Z ∈ ˜Cm,

thenUH

1 ZU1 ∈ ˜Cm1 and(U2HZ−1U2)−1 ∈ ˜Cm2 are independent. Further, for

c ∈ Cm_, _c

6= 0, (i)cHZc/cHchas same distribution as z11, where Z = (zi j),

and (ii)cHc/cHZ−1c has same distribution as 1/z11 where Z−1 ₌ (zi j).

Fur-thermore, ifE(Z),E(Z−1₎_{, and} _E_(Zα₎_,_α _{an integer, exist, then} _E_(Z₎

=a Im, E(Z−1)₌bIm andE(Zα)_{= ˜}cαIm, wherea = E(x11y11),b= E(x11)E(y11),

and the constant c_˜α depends on moments of order less than or equal to α of X andY.

Let Z[i] ₌ (zαβ), 1 ≤ α, β ≤ i, i ≤ m be a submatrix of Z = (zαβ),

1 _≤ α,β _≤ mandY ₌ T TH_, _X

= R RH _{be lower triangular factorizations.}

Then

vii ₌ det(Z [i]₎

det(Z[i−1]₎ =t 2 iir

2

ii, i =1, . . . ,m,

where det(Z[0])₌1, are independent andE_[det(Z)α

] =Qm

i=1E(viiα)provided

(5)

In Section 2, we give several UNIARIM distributions by using Theorem 1.7.

Section 3 gives a number of results on random quadratic forms A1/2Y(A1/2)H

(AandY are independent) andB1/2Y(B1/2)H(BandY are independent), where Y _∼IC_Wm(μ,Im),A_∼ C_BI

m(a,b)andB ∼ CBmI I(c,d)by exploiting the fact

that they too belong to the class of UNIARIM distributions and using properties that are available for UNIARIM distributions. Finally, in Appendix, we give certain known results on complex matrix variate beta type I and type II, complex Wishart and inverted complex Wishart distributions, confluent hypergeometric functions and zonal polynomials of Hermitian matrix argument.

2 Generating UNIARIM Distributions

In the previous section it is stated that ifX _{∈ ˜}C_m and_Y _{∈ ˜}C_m are independent,

then for any square root T ofY, the distribution of Z ₌ T X TH _{belongs to} ˜

C_m. In this section we exploit this property to generate a number of UNIARIM

distributions. Let Ai _∼ C_BI

m(ai,bi), Bi ∼ CBmI I(ci,di),i = 1,2, A ∼ CBmI(a,b), B ∼

C_BI I

m (c,d)and define

Z1 = A1/21 A2 A 1/2 1

H

A1and A2are independent, Z2 = B₁1/2B₂ B₁1/2H B1andB2are independent, Z3 = A1/2B A1/2H AandB are independent,

and

Z4 = B1/2A B1/2H AandBare independent.

Then, from Theorem 1.7, it follows that Zi _{∈ ˜}C_m,_i ₌ 1,2,3,4. From Cui,

Gupta and Nagar [3], the p.d.f. ofZ1is available as ˜

Ŵm(a1+b1)Ŵ˜m(a2+b2) ˜

Ŵm(a1)Ŵm˜ (a2)Ŵm˜ (b1+b2)

det(Z1)a2−mdet(Im−Z1)b1+b2−m

× 2F˜1 b1,a2+b2−a1;b1+b2;Im−Z1,0< Z1=Z1H <Im,

where2F˜1is the Gauss hypergeometric function of Hermitian matrix argument

(6)

Vélez-Carvajal [7]),

˜

Bm(d1+c2,c1+d2) ˜

Bm(c1,d1)Bm(c˜ 2,d2)

det(Z2)c2−m

× 2F˜1 d1+c2,c2+d2;c1+c2+d1+d2;Im −Z2, Z2=Z2H >0,

and

˜

Bm(b,a+d) ˜

Bm(a,b)Bm˜ (c,d)det(Z3) −d−m

× 2F˜1 c+d,a+d;a+b+d; −Z₃−1, Z3= Z3H >0,

respectively. Note that the distribution ofZ4is same as that ofZ3.

Next, let Xi _∼ C_Wm_(νi_,_Im), _i ₌ 1_,2, _Yi _∼ _IC_Wm_(μi_,_Im₎,_i ₌ 1_,2, _X _∼ C_Wm_(ν,_Im₎_{, and}_Y _∼ _IC_Wm_(μ,_Im₎_{, where}_X₁_and_X₂_{are independent,}_Y₁_and

Y2are independent, and XandY are independent. Let Z5 = X1/21 X2 X

1/2 1

H ,

Z6 = Y₁1/2Y₂ Y₁1/2H, Z7 = X1/2Y X1/2H,

and

Z8 = Y1/2X Y1/2H.

Then, the p.d.f. ofZ5is (Gupta and Nagar [6]),

˜

Ŵm(ν1)Ŵm˜ (ν2) −1det(Z5)ν1−mBν1−ν2(Z5), Z5=Z H 5 >0,

whereB˜δ(∙)is the Herz’s type II Bessel function of Hermitian matrix argument.

Since Z−₆1 ₌(Y₁1/2Y₂(Y₁1/2)H₎−1

=(Y₁−1/2)H_Y−1 2 Y−

1/2

1 ,Y1−1=(Y− 1/2 1 )HY−

1/2 1 ∼ C_Wm_(μ₁_,_Im₎, _Y₂−1 _∼ C_Wm_(μ₂_,_Im), the p.d.f. of _Z₆, obtained from the

p.d.f. ofZ5, is given as

˜

Ŵm(μ1)Ŵ˜m(μ2) −1det(Z6)−μ1−mB˜μ1−μ2 Z−

1 6

, Z6= Z6H >0.

Note thatZ7∼CBmI I(μ, ν)andZ8∼CB I I

m (μ, ν)which follows from Tan [18]

(7)

Further, define the following complex random matrices which again belong to the classC˜_m:

Z9 = A1/2X A1/2H, Z10 = X1/2A X1/2H, Z11 = B1/2X B1/2H, Z12 = X1/2B X1/2H, Z13 = A1/2Y A1/2H, Z14 = Y1/2A Y1/2H, Z15 = B1/2Y B1/2H,

and

Z16 = Y1/2B Y1/2H.

The random matricesZ9andZ10have the same density given by ˜

Ŵm(b)etr(−Z9)det(Z9)ν−m ˜

Ŵm(ν)Bm˜ (a,b)

˜

9(b,₋a₊ν₊m_;Z9), Z9= Z9H >0,

where9˜ is the confluent hypergeometric function of Hermitian matrix argument.

The random matricesZ11andZ12 have the same density derived as ˜

Ŵm(ν+d)det(Z11)ν−m ˜

Ŵm(ν)Bm˜ (c,d) ˜

9(ν₊d,₋c₊ν₊m_;Z11), Z11 =Z11H >0.

Similarly, the random matrices Z13 and Z14 have the same density derived in

Theorem 3.1 and the random matricesZ15andZ16have the same density obtained

in Theorem 3.3.

3 Properties of UNIARIM Distributions

In the previous sections we have discussed a number of properties of UNIARIM distributions and generated them by noting that if X _{∈ ˜}C_m and_Y _{∈ ˜}C_m are

independent, then for any square rootT ofY, the distribution of Z ₌ T X TH

(8)

Several properties, distributional results, expected values, etc. of random matrices defined in Section 2 are available in the literature. The distributional results and properties ofZ1, Z2, Z3and Z4were studied by Gupta, Nagar and

Vélez-Carvajal [7]. Results related to Z5 and Z6 were derived by Gupta and

Nagar [6] and properties of Z9, Z10, Z11 and Z12 were obtained by Khatri,

Khattree and Gupta [12].

In this section we derive distributions ofZ13andZ15and study their properties.

First we obtain the densities ofZ13 andZ15.

Theorem 3.1. Let A and Y be independent, A _∼ C_BI

m(a,b) and Y ∼ IC_Wm(μ,Im). Then, the density of Z13= A1/2Y(A1/2)H is derived as

˜

Ŵm(a+b)det(Z13)−μ−m ˜

Bm(μ,a)Ŵ˜m(a+b+μ)

× 1F˜1 a+μ;a+b+μ; −Z13−1

, Z₁₃ ₌Z₁₃H >0,

where1F˜1is the confluent hypergeometric function of Hermitian matrix argu-ment.

Proof. The joint density ofAandY is given by

etr(₋Y−1₎_det_(Y₎−μ−m_det₍_A)a−m_det_(Im

−A)b−m ˜

Ŵm(μ)Bm˜ (a,b)

,

where 0 < A ₌ AH _< _Im _and_Y

= YH _> _{0. Making the transformation} Z13 = A1/2Y(A1/2)H with the Jacobian J(A,Y → A,Z13) = det(A)−m and

integratingAwe obtain

det(Z13)−μ−m ˜

Ŵm(μ)Bm(a,˜ b) Z

0<A=AH_<_I m

etr(₋Z₁₃−1A)det(A)a+μ−mdet(Im−A)b−md A,

whereZ₁₃ ₌Z₁₃H >0. Now, integration using (A.1) yields the result.

(9)

Corollary 3.2. If x and y are mutually independent, x _∼ BI(a,b) and y _∼ I G(μ,1), then x y _∼ H₁(13)(μ,a,b). Further, the p.d.f. of z13 = x y is given by

Ŵ(a₊μ)Ŵ(a₊b) Ŵ(μ)Ŵ(a)Ŵ(a₊b₊μ)z

−μ−m

13 1F1 a+μ;a+b+μ; −z−₁₃1, z13>0, where 1F1 is the confluent hypergeometric function of scalar argument (Luke_[13_]).

Theorem 3.3. Let B and Y be independent, B ∼ C_BI I

m (c,d) and Y ∼ IC_Wm(μ,Im). Then, the density of Z15= B1/2Y(B1/2)H is derived as

˜

Ŵm(c₊μ)det(Z15)−μ−m ˜

Ŵm(μ)Bm˜ (c,d) ˜

9 c₊μ_;μ₋d₊m_{; −}Z₁₅−1, Z₁₅₌ Z₁₅H >0.

Proof. The joint density ofBandY is given by

etr(₋Y−1)det(Y)−μ−m_det_(B)c−m_det_(Im

+B)−(c+d) ˜

Ŵm(μ)Bm˜ (c,d) ,

where B ₌ BH > 0 andY ₌ YH > 0. Transforming Z15 = B1/2Y(B1/2)H

with the Jacobian J(B,Y _→ B,Z13)=det(B)−m and integrating Bwe obtain

det(Z15)−μ−m ˜

Ŵm(μ)Bm˜ (c,d) Z

B=BH_>0

etr(₋Z₁₅−1B)det(B)c+μ−m

det(Im ₊B)c+d d B,

whereZ₁₅ ₌ZH

15 >0. The desired result is obtained by using (A.2).

The above distribution will be denoted byH˜_m(15)(μ,c,d).

Corollary 3.4. If x and y are independent, x _∼BI I_(c,_d₎_{and y}

∼ I G(μ,1), then x y_∼ H₁(15)(μ,c,d). Further, the p.d.f. of z13 =x y is given by

Ŵ(c₊μ) Ŵ(μ)B(c,d)z

−μ−m

15 ψ c+μ;μ−d+m; −z−151

, z15>0,

where ψ is the confluent hypergeometric function of scalar argument (Luke_[13_]).

(10)

Theorem 3.5. Let A and Y be independent, A _∼ C_BI

m1+m2(a,b) and Y ∼ IC_Wm

1+m2(μ,Im).Then,

A₁₁1/2Y₁₁ A1/2₁₁ H _{∼ ˜}H_m(13)₁ μ₋m2,a,b and A1/2₂₂_∙₁Y₂₂_∙₁ A1/2₂₂_∙₁H

∼ ˜H_m(13)₂ μ,a₋m1,b are independent. Further,

A1/2₂₂ Y₂₂ A1/2₂₂ H

∼ ˜H_m(13)₂ μ₋m1,a,b and A1/2₁₁_∙₂Y₁₁_∙₂ A1/2₁₁_∙₂H _{∼ ˜}H_m(13)₁ μ,a₋m2,b

are independent where A11∙2, Y11∙2, A22∙1 and Y22∙1 are Schur complements of A22, Y22, A11 and Y11, respectively.

Proof. From Theorem A.8 and Theorem A.9, A11,Y11, A22_∙1andY22_∙1are

in-dependent,A11∼CBmI1(a,b),Y11 ∼ICWm1(μ−m2,Im1), A22∙1∼CBmI2(a− m1,b)andY22∙1 ∼ ICWm2(μ,Im2). Now, application of Theorem 3.1 yields the desired result. The proof of the second part is similar.

Theorem 3.6. Let B and Y be independent, Y _∼IC_Wm

1+m2(μ,Im) and B ∼

C_BI I

m1+m2(c,d).Then,

B₁₁1/2Y₁₁ B₁₁1/2H

∼ ˜H_m(15)₁ μ₋m2,c,d−m2 and B₂₂1/2_∙₁Y₂₂_∙₁ B₂₂1/2_∙₁H _{∼ ˜}H_m(15)₂ μ,c₋m1,d

are independent. Further,

B₂₂1/2Y₂₂ B₂₂1/2H _{∼ ˜}H_m(15)₂ (μ₋m1,c,d−m1) and B₁₁1/2_∙₂Y₁₁_∙₂ B₁₁1/2_∙₂H

∼ ˜H_m(15)₁ μ,c₋m2,d

are independent where Y11∙2, B11∙2, Y22∙1 and B22∙1 are Schur complements of Y22, B22, Y11and B11, respectively.

Proof. From Theorem A.8 and Theorem A.10, Y11, B11, Y22∙1 and B22∙1 are

independent, Y11 ∼ ICWm1(μ−m2,Im1), B11 ∼ CBmI I1(c,d −m2), Y22∙1 ∼ IC_Wm

(11)

3.1 Properties of Z13

In this section some properties of the random matrix Z13are derived using the

fact thatZ13belongs to the class of UNIARIM distributions.

(i) Let Z13 = ZZ13111321 ZZ13121322

, Z1311(m1×m1),m1 +m2 = m. Then,

us-ing Theorem 1.6, Theorem 1.7 and Theorem 3.5, Z1311 and its Schur

complement Z1322∙1 are independent, Z1311 ∼ ˜Hm(13)1 (μ−m2,a,b) and Z1322∙1 ∼ ˜Hm(13)2 (μ,a −m1,b). Further, Z1322 and its Schur comple-mentZ1311∙2are independent,Z1322∼ ˜Hm(13)2 (μ−m1,a,b)andZ1311∙2∼

˜

H_m(13)₁ (μ,a₋m2,b).

(ii) For aq_×mcomplex non-random matrixCof rankq(_≤m),

(CCH)−1/2C Z13CH(CCH)−1/2∼ ˜Hm(13)1 (μ−m2,a,b) and

(CCH)1/2(C Z−₁₃1CH)−1(CCH)1/2_{∼ ˜}Hm(13)2 (μ,a−m1,b). (iii) Forc∈Cm_,_c

6=0, cH_Z_13c

cH_c ∼ H (13)

1 (μ−m+1,a,b),

and

cHc cH_Z−1

13c

∼ H₁(13)(μ,a₋m₊1,b).

Note that the distributions of

cHZ13c cH_c and

cHc cH_Z−1

13c

do not depend on c. Thus, if y(m _×1) is a complex random vector

independent ofZ13, andP(y6=0)= 1, then it follows that yHZ13y

yH_y ∼ H (13)

1 (μ−m+1,a,b),

and

yHy yH_Z−1

13y

(12)

(iv) LetZ13 =(z13i j)andZ13−1=(z i j

13). Then,z13ii ∼ H (13)

1 (μ−m+1,a,b), i ₌1, . . . ,mand 1/zii

13 ∼ H1(13)(μ,a−m+1,b),i =1, . . . ,m.

(v) Let Z[₁₃i] ₌(z13j k), 1≤ j,k ≤i. Define

vi =

det(Z₁₃[i])

det(Z₁₃[i−1]),i =1, . . . ,m and det(Z [0] 13)=1.

Then, the random variablesv1, . . . , vm are mutually independent and

us-ing Theorem A.11, Theorem A.7 and Corollary 3.1,vi _∼ H₁(13)(μ₋i₊1, a,b), i ₌ 1, . . . ,m. Further the distribution of det(Z13) is the same as

that ofQm i=1vi.

3.2 Moments of functions of Z13

In this section we derive several expected values of scalar and complex matrix valued functions of the complex random matrixZ13.

Using the representationZ13 = A1/2Y(A1/2)Hand (A.10)–(A.17), one obtains E Z13 = a

(a₊b)(μ₋m)Im, μ >m,

E Z−₁₃1 = μ(a+b−m)

a₋m Im, a >m,

EC˜κ(Z13) =

(₋1)k_[a_]κ [−μ₊m]κ[a+b]κ

˜

Cκ(Im), μ≥k1+m,

EC˜κ(Z13−1)

= [μ]κ[−a−b+m]κ [−a₊m_]κ

˜

Cκ(Im),a≥k1+m, i=1,2.

Further, using (A.4), (A.5) and above expressions, the expected values of(trZ13)2

and(trZ₁₃−1)2are evaluated as

E(trZ13)2 = EC˜(2)(Z13)+EC˜(12₎(Z₁₃) = (−1)

2 [a_](2) [−μ₊m_](2)[a+b](2)

˜ C(2)(Im)

+ (−1) 2

[a_](12₎ [−μ₊m_](12₎[a+b]₍₁2₎

˜

(13)

and

E(trZ₁₃−1)2 ₌ EC˜(2)(Z13−1)

+EC˜(12₎(Z₁₃−1) = [μ](2)[−a−b+m](2)

[−a₊m_](2) ˜ C(2)(Im)

+[μ](12)[−a−b+m](12) [−a₊m_](12₎

˜

C(12₎(Im).

Now, applying the results_[n_](2) =n(n+1), [n](12₎ =n(n−1), [−n+m](2) = (n₋m)(n₋m₋1), _[−n₊m_](12₎ =(n−m)(n−m+1), C˜(2)(Im)=m(m+

1)/2 and C˜(12₎(Im)=m(m−1)/2 in the above expressions and simplifying, we obtain

E(trZ13)2 = ma

2(μ₋m)(a₊b)

(m+1)(a+1) (μ₋m₋1)(a₊b₊1)

+ (m−1)(a−1) (μ₋m₊1)(a₊b₋1)

, μ >m,

and

E

(trZ₁₃−1)2 ₌ mμ(a+b−m)

2(a₋m)

(μ₊1)(a₊b₋m₋1)(m₊1) (a₋m₋1)

+(μ−1)(a+b−m+1)(m−1) (a₋m₊1)

, a >m₊1.

Similarly, using (A.6)–(A.8), the expected values of(trZ13)3and(trZ13−1)3are

obtained as

E

(trZ13)3 = EC˜(3)(Z13)+EC˜(2,1)(Z13)+EC˜(13₎(Z13) = − [a](3)

[−μ₊m_](3)[a+b](3) ˜ C(3)(Im)

− [a](2,1)

[−μ₊m](2,1)[a+b](2,1) ˜

C(2,1)(Im)

− [a](13)

[−μ₊m_](13₎[a+b]₍₁3₎ ˜

(14)

and

E(trZ₁₃−1)3 ₌ EC˜(3)(Z13−1)

+EC˜(2,1)(Z13−1)

+E_{[ ˜}C(13₎(Z₁₃−1)] = [μ](3)[−a−b+m](3)

[−a₊m_](3) ˜ C(3)(Im)

+[μ](2,1)[−a−b+m](2,1) [−a₊m_](2,1)

˜

C(2,1)(Im)

+[μ](13)[−a−b+m](13) [−a₊m_](13₎

˜

C(13₎(Im),

respectively. Now, using the results_[n_](3) =n(n+1)(n+2),[n](2,1) =n(n+

1)(n₋1),_[n_](13₎=n(n−1)(n−2),[−n+m](3)= −(n−m)(n−m−1)(n−m−2), [−n₊m_](2,1) = −(n−m)(n−m−1)(n−m+1),[−n+m](13₎= −(n−m)(n− m₊1)(n₋m₊2),C˜(3)(Im)=m(m+1)(m+2)/6,C˜(2,1)(Im)=2m(m2−1)/3

andC˜(13₎(Im)=m(m−1)(m−2)/6 in the above expressions and simplifying, we obtain

E

(trZ13)3 = ma

6(μ₋m)(a₊b)

₄

(a2₋1)(m2₋1) [(μ₋m)2₋1_][(a₊b)2₋1_] + (a+1)(a+2)(m+1)(m+2)

(μ₋m₋1)(μ₋m₋2)(a₊b₊1)(a₊b₊2)

+ (a−1)(a−2)(m−1)(m−2)

(μ₋m₊1)(μ₋m₊2)(a₊b₋1)(a₊b₋2)

,

whereμ >m+1, and

E

(trZ−₁₃1)3 = mμ(a₆_(a+b−m)

−m) ₄

(μ2₋1)_[(a₊b₋m)2₋1_](m2₋1) [(a₋m)2₋1_]

+ (μ+1)(μ+2)(a+b−m−1)(a+b−m−2)(m+1)(m+2) (a₋m₋1)(a₋m₋2)

+ (μ−1)(μ−2)(a+b−m+1)(a+b−m+2)(m−1)(m−2) (a₋m₊1)(a₋m₊2)

(15)

where a > m ₊ 2. Similarly, the expected values of tr(Z13)tr(Z132) and

tr(Z−₁₃1)tr(Z₁₃−2) are obtained as E

tr(Z13)tr(Z213)

=E ˜

C(3)(Z13)−EC˜(13₎(Z13) = ₆ ma

(μ₋m)(a₊b)

(a+1)(a+2)(m+1)(m+2) (μ₋m₋1)(μ₋m₋2)(a₊b₊1)(a₊b₊2)

− (a−1)(a−2)(m−1)(m−2) (μ₋m₊1)(μ₋m₊2)(a₊b₋1)(a₊b₋2)

, μ >m₊1,

and

E

tr(Z₁₃−1)tr(Z−₁₃2)₌E ˜

C(3)(Z−131)

−E ˜

C(13₎(Z₁₃−1) = mμ(a₆ +b−m)

(a₋m)

(μ₊1)(μ₊2)(a₊b₋m₋1) (a₋m₋1)

(a₊b₋m₋2)(m₊1)(m₊2) (a₋m₋2)

−(μ−1)(μ−2)(a+b−m+1)(a+b−m+2)(m−1)(m−2) (a−m+1)(a−m+2)

,

a>m₊2,

respectively. Since, E(Zα

13)= ˜cαIm, we haveE[tr(Z13α)] = ˜cαm. Thus, the

co-efficient ofminE[tr(Z₁₃α)_]isc_˜α. Therefore, evaluatingE[tr(Z132)],E[tr(Z−132)], E_[tr(Z3₁₃)_] and E_[tr(Z−₁₃3_]using the technique describe above, and computing

the coefficients ofmin the resulting expressions, one obtains

E Z132

= ₂ a

(μ₋m)(a₊b)

(m₊1)(a₊1) (μ₋m₋1)(a₊b₊1)

− (m−1)(a−1) (μ₋m₊1)(a₊b₋1)

Im, μ >m,

E Z₁₃−2 ₌ μ(a+b−m)

2(a₋m)

(μ₊1)(a+b−m−1)(m+1) (a₋m₋1)

− (μ−1)(a_(a+b−m+1)(m−1) −m₊1)

(16)

E Z₁₃3

= ₆ a

(μ₋m)(a₊b)

(a₊1)(a₊2)(m₊1)(m₊2) (μ₋m₋1)(μ₋m₋2)(a₊b₊1)(a₊b₊2)

+ (a−1)(a−2)(m−1)(m−2)

(μ₋m+1)(μ₋m+2)(a+b−1)(a+b−2)

− 2(a 2

−1)(m2₋1) [(μ₋m)2₋1_][(a₊b)2₋1_]

Im, μ >m₊1,

and

E Z−₁₃3 ₌ μ(a+b−m)

6(a₋m)

(μ₊1)(μ₊2)(a₊b₋m₋1) (a₋m₋1)

(a₊b₋m₋2)(m₊1)(m₊2) (a₋m₋2)

+ (μ−1)(μ−2)(a+b−m+1)(a+b−m+2)(m−1)(m−2) (a₋m₊1)(a₋m₊2)

− 2(μ 2

−1)_[(a₊b₋m)2₋1_](m2₋1) [(a₋m)2₋1_]

Im, a>m₊2.

3.3 Properties of Z15

In this section we give several properties of Z15.

(i) Let Z15 =

Z1511 Z1512 Z1521 Z1522

, Z1511(m1×m1), m1+m2 = m.Then, using

Theorem 1.6, Theorem 1.7 and Theorem 3.6,Z1511andZ1522∙1are

indepen-dent,Z1511∼ ˜Hm(15)1 (μ−m2,c,d−m2)andZ1522∙1∼ ˜H

(15)

m2 (μ,c−m1,d). Further,Z1522andZ1511∙2are independent,Z1522∼ ˜Hm(15)2 (μ−m1,c,d− m1)andZ1511∙2∼ ˜Hm(15)1 (μ,c−m2,d).

(ii) For aq×mcomplex non-random matrixCof rankq(≤m),

(CCH)−1/2C Z15CH(CCH)−1/2∼ ˜Hm(15)1 (μ−m2,c,d−m2) and

(17)

(iii) Ify(m_×1)is a non-random complex vector withy ₆₌ 0, or a complex

random vector independent ofZ15withP(y6=0)=1, then it follows that yHZ15y

yH_y ∼ H (15)

1 (μ−m+1,c,d−m+1)

and

yHy yH_Z−1

15y

∼ H₁(15)(μ,c₋m₊1,d).

(iv) Let Z15 = (z15i j) and Z15−1 = (z i j

15). Then, for i = 1, . . . ,m, z15ii ∼ H₁(15)(μ₋m₊1,c,d₋m₊1), and 1/zii₁₅ _∼H₁(15)(μ,c₋m₊1,d).

(v) Let Z[₁₅i] ₌(z15j k), 1≤ j,k ≤i. Define

vi =

det(Z₁₅[i])

det(Z₁₅[i−1]), i =1, . . . ,m and det(Z [0] 15)=1.

Then, the random variables v1, . . . , vm are mutually independent and

using Theorem A.11, Theorem A.12 and Corollary 3.4,vi ∼ H (15) 1 (μ− i ₊1,c,d₋i₊1),i ₌1, . . . ,m. Further, the distribution of det(Z15)

is the same as that ofQm i=1vi. 3.4 Moments of functions of Z15

This section deals with expected values of scalar and complex matrix valued functions of the complex random matrixZ15.

Using the representation Z15 = Y1/2B(Y1/2)H ∼ ˜Hm(15)(μ,c,d), (A.10)–

(A.13), (A.18), (A.19), and the technique of computing expected values given in Subsection 3.2, we obtain

E Z15 = c

(μ−m)(d−m)Im, μ >m,d >m,

E Z₁₅−1

= μd

c−mIm,c>m,

EC˜κ(Z15) = [c]κ

[−μ+m]κ[−d+m]κ

˜

Cκ(Im), μ≥m+k1,d≥m+k1,

EC˜κ(Z₁₅−1) =

(₋1)k_[μ_]κ[d]κ

[−c+m]κ

˜

(18)

E Z₁₅2

= ₂ c

(μ₋m)(d−m)

₍

c+1)(m+1) (μ₋m−1)(d−m−1)

− (c−1)(m−1) (μ−m+1)(d−m+1)

Im, μ >m,d>m+1,

E Z₁₅−2

= ₂ μd (c−m)

_(μ

+1)(d+1)(m+1)

(c−m−1) −

(μ₋1)(d−1)(m−1) (c−m+1)

Im,

c>m+1,

E Z₁₅3

= ₆ c

(μ₋m)(d−m)

₍

c+1)(c+2)(m+1)(m+2)

(μ₋m−1)(μ₋m−2)(d−m−1)(d−m−2)

+ (c−1)(c−2)(m−1)(m−2)

(μ−m+1)(μ−m+2)(d−m+1)(d−m+2)

− 2(c

2

−1)(m2−1) [(μ−m)2−1][(d−m)2−1]

Im, μ >m+2,d >m+2,

E Z₁₅−3

= ₆ μd (c−m)

_(μ

+1)(μ₊2)(d+1)(d+2)(m+1)(m+2) (c−m−1)(c−m−2)

+(μ−1)(μ−2)(d−1)(d−2)(m−1)(m−2) (c−m+1)(c−m+2)

−2(μ

2

−1)(d2−1)(m2−1) [(c−m)2−1]

Im,c>m+2,

E

(trZ15)2 = mc

2(μ₋m)(d−m)

₍

c+1)(m+1) (μ₋m−1)(d−m−1)

+ (c−1)(m−1) (μ₋m+1)(d−m+1)

, μ >m,d>m+1,

E(trZ₁₅−1)2 ₌ mμd 2(c−m)

_(μ

+1)(d+1)(m+1) (c−m−1)

+(μ−1)(d−1)(m−1) (c−m+1)

,c>m+1,

E(trZ15)3 = ₆ mc

(μ₋m)(d−m)

_(c

+1)(c₊2)(m₊1)(m₊2)

(19)

+ (c−1)(c−2)(m−1)(m−2)

(μ₋m₊1)(μ₋m₊2)(d₋m₊1)(d₋m₊2)

+ 4(c 2

−1)(m2₋1) [(μ₋m)2₋1_][(d₋m)2₋1_]

, μ >m₊2, d >m₊2,

E(trZ−₁₅1)3

= ₆_(cmμd −m)

(μ₊1)(μ₊2)(d₊1)(d ₊2)(m₊1)(m₊2) (c₋m₋1)(c₋m₋2)

+ (μ−1)(μ−2)(d−1)(d−2)(m−1)(m−2) (c₋m₊1)(c₋m₊2)

+ 4(μ 2

−1)(d2₋1)(m2₋1) [(c₋m)2₋1_]

, c>m₊2,

E

tr(Z15)tr(Z215)

= ₆ mc (μ₋m)(d₋m)

(c₊1)(c₊2)(m₊1)(m₊2) (μ₋m₋1)(μ₋m₋2)(d₋m₋1)(d₋m₋2)

− (c−1)(c−2)(m−1)(m−2) (μ₋m₊1)(μ₋m₊2)(d₋m₊1)(d₋m₊2)

, μ,d >m₊2,

and

E

tr(Z₁₅−1)tr(Z₁₅−2)

= ₆mμd (c₋m)

(μ₊1)(μ₊2)(d₊1)(d₊2)(m₊1)(m₊2) (c₋m₋1)(c₋m₋2)

− (μ−1)(μ_(c−2)(d−1)(d−2)(m−1)(m−2) −m₊1)(c₋m₊2)

(20)

Appendix

The confluent hypergeometric function1F˜1 of Hermitian matrix argument has

the integral representation (James [8] and Chikuse [2]),

1F˜1(α;γ;X) = 1 ˜

Bm(α, γ ₋α) Z

0<Y=YH_<_I m

det(Y)α−m

×det(Im ₋Y)γ−α−metr(X Y)dY,

(A.1)

valid for all Hermitian X, Re(α) > m ₋1 and Re(γ ₋α) > m ₋1. The

confluent hypergeometric function9˜ ofm_×mHermitian matrix R is defined

by (Chikuse [2]),

˜

9(α, γ_;R) ₌ 1 ˜ Ŵm(α)

Z

Y=YH_>0

etr(₋RY)

×det(Y)α−mdet(Im ₊Y)γ−α−mdY,

(A.2)

where Re(R) >0 and Re(α) >m₋1.

Let C˜κ(X) be a zonal polynomial of an m ×m Hermitian matrix X

corre-sponding to the partitionκ ₌ (k1, . . . ,km),k1+ ∙ ∙ ∙ +km = k, k1 ≥ ∙ ∙ ∙ ≥ km _≥ 0. Then, for small values ofk, explicit formulas forC˜κ(X)are available

as (James [8], Khatri [11]),

˜

C(1)(X) = tr(X), (A.3)

˜

C(2)(X) = 1

2

(trX)2₊tr(X2)

, (A.4)

˜

C(12₎(X) = 1 2

(trX)2₋tr(X2), (A.5) ˜

C(3)(X) =

1 6

(trX)3₊3(trX)(trX2)₊2 tr(X3), (A.6) ˜

C(2,1)(X) = 2

3

(trX)3−tr(X3), (A.7) ˜

C(13₎(X) = 1 6

(trX)3₋3(trX)(trX2)₊2 tr(X3). (A.8)

Also, substituting X ₌ Im in (A.3)–(A.8), it is easy to see thatC˜(1)(Im) = m, ˜

(21)

2)/6, C˜(2,1)(Im) = 2m(m2−1)/3, C˜(13₎(Im) = m(m −1)(m −2)/6. The complex generalized hypergeometric coefficient_[a_]κis defined as

[a_]κ= m Y

i=1

(a₋i₊1)k_i, (A.9)

with(a)r = a(a+1)∙ ∙ ∙(a+r −1) = (a)r−1(a +r −1) forr = 1,2, . . .,

and(a)0= 1. Further, substituting appropriately in (A.9), it is easy to observe

that_[a_](2) =a(a+1),[a](12₎ =a(a−1),[a]₍₃₎ =a(a+1)(a+2),[a]_(2,1) = a(a₊1)(a₋1)and_[a_](13₎ =a(a−1)(a−2).

IfY ∼ IC_Wm_{(μ, 9)}_, _A _∼ C_BI

m(a,b)and B ∼ CBmI I(c,d) then (Bedoya,

Nagar and Gupta [1], James [8], Khatri [10], Nagar and Gupta [15], Shaman [16]),

E(Y−1) ₌ μ9−1, (A.10)

E(Y) ₌ (μ₋m)−19, μ >m, (A.11) E_{[ ˜}Cκ(Y−1R)] = [μ]κC˜κ(9−1R), (A.12) E_{[ ˜}Cκ(Y R)] =

(₋1)k [−μ₊m_]κ

˜

Cκ(9R), μ≥k1+m, (A.13) E(A) ₌ a

a₊bIm, (A.14)

E(A−1) ₌ a+b−m

a₋m Im, a >m, (A.15)

E_{[ ˜}Cκ(R A)] = [ a_]κ [a₊b_]κ

˜

Cκ(R), (A.16)

E_{[ ˜}Cκ(R A−1)] = [−

a₋b₊m_]κ [−a₊m_]κ

˜

Cκ(R),a ≥k1+m, (A.17) E(B) ₌ c

d₋mIm, d >m, (A.18)

and

E_{[ ˜}Cκ(R B)] =

(₋1)k_[c_]κ [−d+m]κ

˜

Cκ(R),d ≥k1+m, (A.19)

(22)

Theorem A.7. Let A _∼ C_Wm(ν,Im)and A ₌ T TH, where T ₌ (ti j) is a complex lower triangular matrix with tii >0and ti j ₌t1i j +

√

−1t2i j, j <i . Then, ti j, 1 _≤ j _≤ i _≤ m are independently distributed, t_ii2 _∼ G(ν ₋i₊1),

1_≤i _≤m and ti j _∼C_N(0,1),1_≤ j<i_≤m.

Proof. See Goodman [4].

The univariate gamma distribution denoted byG(a)is defined by the p.d.f.

Ŵ(a) −1exp(₋x)xa−1, x >0, a>0.

Theorem A.8. Let Y ∼IC_Wm_{(μ, 9)}_{and partition Y and}₉_as

Y = Y11 Y12 Y21 Y22 !

and 9 ₌ 911 912 921 922

! ,

where Y11 and911 are q×q matrices. Then, Y11 and its Schur complement Y22_∙1are independent, Y11 ∼ ICWq(μ−m+q, 911)and Y22_∙1∼ ICWm₋q(μ, 922∙1). Further, Y22 and its Schur complement Y11∙2 are independent, Y22 ∼ IC_Wm₋_q(μ₋_{q, 9}₂₂₎_{and Y}₁₁_∙₂_∼_IC_{Wq(μ, 9}₁₁_∙₂_).

Proof. See Tan [17].

Theorem A.9. Let A_∼C_BI

m(a,b)and partition A as

A₌ A11 A12 A21 A22 !

, A11(q×q).

Then,

(i) A11 and its Schur complement A22∙1 are independent, A11 ∼ CBqI(a,b) and A22∙1∼CBmI−q(a−q,b)and

(ii) A22and its Schur complement A11∙2are independent, A22∼CBmI−q(a,b) and A11∙2∼CBqI(a−m+q,b).

(23)

Theorem A.10. Let B _∼C_BI I

m (c,d)and partition B as

B ₌ B11 B12 B21 B22

!

, B11(q×q).

Then,(i) B11 and its Schur complement B22∙1are independent, B11 ∼CBqI I(c, d₋m₊q)and B22∙1∼CBmI I−q(c−q,d)and(ii)B22and its Schur complement B11∙2are independent, B22 ∼CBmI I−q(c,d−q)and B11∙2∼CBqI I(c−m+q,d).

Proof. See Tan [18].

Theorem A.11. Let A _∼ C_BI

m(a,b) and A = T TH, where T = (ti j) is a complex lower triangular matrix with positive diagonal elements. Then, t₁₁2, . . . ,t_mm2 are independently distributed, t_ii2 _∼BI_(a

−i₊1,b),i ₌1, . . . ,m.

The univariate beta type I distribution, denoted by BI_(a,_b)_{, is defined by}

the p.d.f.

B(a,b) −1xa−1(1₋x)b−1, 0<x <1,

whereB(a,b)₌ Ŵ(a)Ŵ(b) Ŵ(a₊b).

Theorem A.12. Let B _∼ C_BI I

m (c,d) and B = T TH, where T = (ti j) is a complex lower triangular matrix with positive diagonal elements. Then, t₁₁2, . . . ,t_mm2 are independently distributed, t_ii2 _∼ BI I_(c

−i ₊1,d ₋m ₊i), i=1, . . . ,m.

The univariate beta type II distribution, denoted by BI I(a,b), is defined by

the p.d.f.

B(a,b) −1xa−1(1₊x)−(a+b), x >0.

Acknowledgements. The authors would like to thank the worthy referee for

(24)

REFERENCES

[1] E. Bedoya, D.K. Nagar and A.K. Gupta,Moments of the complex matrix variate beta distri-bution.PanAmerican Mathematical Journal,17(2) (2007), 21–32.

[2] Y. Chikuse, Partial differential equations for hypergeometric functions of complex matrices and their applications.Annals of the Institute of Statistical Mathematics,28(1976), 187–199. [3] Xinping Cui, A.K. Gupta and D.K. Nagar,Wilks’ factorization of the complex matrix variate

Dirichlet distributions.Revista Matemática Complutense,18(2) (2005), 315–328.

[4] N.R. Goodman,Statistical analysis based on a certain multivariate complex Gaussian distri-bution (an introduction).Annals of Mathematical Statistics,34(1963), 152–177.

[5] A.K. Gupta and D.K. Nagar, Matrix Variate Distributions. Chapman & Hall/CRC, Boca Raton (2000).

[6] A.K. Gupta and D.K. Nagar,Product of independent complex Wishart matrices.International Journal of Statistics and System,2(1) (2007), 59–73.

[7] A.K. Gupta, D.K. Nagar and A.M. Vélez-Carvajal,Quadratic forms of independent complex beta matrices.International Journal of Applied Mathematics and Statistics,13(D08) (2008), 76–91.

[8] A.T. James, Distributions of matrix variate and latent roots derived from normal samples. Annals of Mathematical Statistics,35(1964), 475–501.

[9] C.G. Khatri,Classical statistical analysis based on a certain multivariate complex Gaussian distribution.Annals of Mathematical Statistics,36(1965), 98–114.

[10] C.G. Khatri,On certain distribution problems based on positive definite quadratic functions in normal vectors.Annals of Mathematical Statistics,37(2) (1966), 468–479.

[11] C.G. Khatri, On the moments of traces of two matrices in three situations for complex multivariate normal populations.Sankhy¯a,A32(1970), 65–80.

[12] C.G. Khatri, R. Khattree and R.D. Gupta,On a class of orthogonal invariant and residual independent matrix distributions.Sankhy¯a,B53(1) (1991), 1–10.

[13] Y.L. Luke, The Special Functions and Their Approximations. Volume I, Academic Press, New York (1969).

[14] D.K. Nagar, E. Bedoya and E.L. Arias,Non-central complex matrix-variate beta distribution. Advances and Applications in Statistics,4(3) (2004), 287–302.

[15] D.K. Nagar and A.K. Gupta,Expectations of functions of complex Wishart matrix.Applied Mathematics and Computations (submitted for publication).

[16] P. Shaman, The inverted complex Wishart distribution and its application to spectral estimates.Journal of Multivariate Analysis,10(1980), 51–59.

[17] W.Y. Tan, On the complex analogue of Bayesian estimation of a multivariate regression model.Annals of Institute of Statistical Mathematics,25(1973), 135–152.