• Nenhum resultado encontrado

On the number of subtrees on the fringe of random trees

N/A
N/A
Protected

Academic year: 2022

Share "On the number of subtrees on the fringe of random trees"

Copied!
86
0
0

Texto

(1)

On the number of subtrees on the fringe of random trees

(partly joined with Huilan Chang)

Michael Fuchs

Institute of Applied Mathematics National Chiao Tung University

Hsinchu, Taiwan

April 16th, 2008

(2)

The number of subtrees

Xn,k=number of subtrees of sizek on the fringe of random binary search trees of size n.

Example: Input: 4,7,6,1,8,5,3,2

4

1 7

3 6 8

2 5

1

2 3

4

5 6

7

8

X8,1= 2 X8,2= 2 X8,3= 1 X8,4= 0 X8,5= 1 X8,6= 0 X8,7= 0 X8,8= 1

(3)

The number of subtrees

Xn,k=number of subtrees of sizek on the fringe of random binary search trees of size n.

Example: Input: 4,7,6,1,8,5,3,2

4

1 7

3 6 8

2 5

1

2 3

4

5 6

7

8

X8,1= 2 X8,2= 2 X8,3= 1 X8,4= 0 X8,5= 1 X8,6= 0 X8,7= 0 X8,8= 1

(4)

The number of subtrees

Xn,k=number of subtrees of sizek on the fringe of random binary search trees of size n.

Example: Input: 4,7,6,1,8,5,3,2

4

1

7

3 6 8

2 5

1

2 3

4

5 6

7

8

X8,1= 2 X8,2= 2 X8,3= 1 X8,4= 0 X8,5= 1 X8,6= 0 X8,7= 0 X8,8= 1

(5)

The number of subtrees

Xn,k=number of subtrees of sizek on the fringe of random binary search trees of size n.

Example: Input: 4,7,6,1,8,5,3,2

4

1

7

3

6

8

2 5

1

2 3

4

5 6

7

8

X8,1= 2 X8,2= 2 X8,3= 1 X8,4= 0 X8,5= 1 X8,6= 0 X8,7= 0 X8,8= 1

(6)

The number of subtrees

Xn,k=number of subtrees of sizek on the fringe of random binary search trees of size n.

Example: Input: 4,7,6,1,8,5,3,2

4

1 7

3

6

8

2 5

1

2 3

4

5 6

7

8

X8,1= 2 X8,2= 2 X8,3= 1 X8,4= 0 X8,5= 1 X8,6= 0 X8,7= 0 X8,8= 1

(7)

The number of subtrees

Xn,k=number of subtrees of sizek on the fringe of random binary search trees of size n.

Example: Input: 4,7,6,1,8,5,3,2

4

1 7

3

6 8

2 5

1

2 3

4

5 6

7

8

X8,1= 2 X8,2= 2 X8,3= 1 X8,4= 0 X8,5= 1 X8,6= 0 X8,7= 0 X8,8= 1

(8)

The number of subtrees

Xn,k=number of subtrees of sizek on the fringe of random binary search trees of size n.

Example: Input: 4,7,6,1,8,5,3,2

4

1 7

3

6 8

2

5

1

2 3

4

5 6

7

8

X8,1= 2 X8,2= 2 X8,3= 1 X8,4= 0 X8,5= 1 X8,6= 0 X8,7= 0 X8,8= 1

(9)

The number of subtrees

Xn,k=number of subtrees of sizek on the fringe of random binary search trees of size n.

Example: Input: 4,7,6,1,8,5,3,2

4

1 7

3 6 8

2 1

2 3

4

5 6

7

8

X8,1= 2 X8,2= 2 X8,3= 1 X8,4= 0 X8,5= 1 X8,6= 0 X8,7= 0 X8,8= 1

(10)

The number of subtrees

Xn,k=number of subtrees of sizek on the fringe of random binary search trees of size n.

Example: Input: 4,7,6,1,8,5,3,2

4

1 7

3 6 8

2 5

1

2 3

4

5 6

7

8

X8,1= 2 X8,2= 2 X8,3= 1 X8,4= 0 X8,5= 1 X8,6= 0 X8,7= 0 X8,8= 1

(11)

The number of subtrees

Xn,k=number of subtrees of sizek on the fringe of random binary search trees of size n.

Example: Input: 4,7,6,1,8,5,3,2

4

1 7

3 6 8

1

3 4

6 7

8

X8,1= 2

X8,2= 2 X8,3= 1 X8,4= 0 X8,5= 1 X8,6= 0 X8,7= 0 X8,8= 1

(12)

The number of subtrees

Xn,k=number of subtrees of sizek on the fringe of random binary search trees of size n.

Example: Input: 4,7,6,1,8,5,3,2

4

1 7

3 6 8

2 5

1

2 3

4

5 6

7

8

X8,1= 2 X8,2= 2

X8,3= 1 X8,4= 0 X8,5= 1 X8,6= 0 X8,7= 0 X8,8= 1

(13)

The number of subtrees

Xn,k=number of subtrees of sizek on the fringe of random binary search trees of size n.

Example: Input: 4,7,6,1,8,5,3,2

4

1 7

3 6 8

1

3

4

5 6

7

8

X8,1= 2 X8,2= 2 X8,3= 1

X8,4= 0 X8,5= 1 X8,6= 0 X8,7= 0 X8,8= 1

(14)

The number of subtrees

Xn,k=number of subtrees of sizek on the fringe of random binary search trees of size n.

Example: Input: 4,7,6,1,8,5,3,2

4

1 7

3 6 8

2 5

1

2 3

4

5 6

7

8

X8,1= 2 X8,2= 2 X8,3= 1 X8,4= 0

X8,5= 1 X8,6= 0 X8,7= 0 X8,8= 1

(15)

The number of subtrees

Xn,k=number of subtrees of sizek on the fringe of random binary search trees of size n.

Example: Input: 4,7,6,1,8,5,3,2

4

1 7

3 6 8

1

2 3

4

6 7

8

X8,1= 2 X8,2= 2 X8,3= 1 X8,4= 0 X8,5= 1

X8,6= 0 X8,7= 0 X8,8= 1

(16)

The number of subtrees

Xn,k=number of subtrees of sizek on the fringe of random binary search trees of size n.

Example: Input: 4,7,6,1,8,5,3,2

4

1 7

3 6 8

2 5

1

2 3

4

5 6

7

8

X8,1= 2 X8,2= 2 X8,3= 1 X8,4= 0 X8,5= 1 X8,6= 0

X8,7= 0 X8,8= 1

(17)

The number of subtrees

Xn,k=number of subtrees of sizek on the fringe of random binary search trees of size n.

Example: Input: 4,7,6,1,8,5,3,2

4

1 7

3 6 8

1

2 3

4

5 6

7

8

X8,1= 2 X8,2= 2 X8,3= 1 X8,4= 0 X8,5= 1 X8,6= 0 X8,7= 0

X8,8= 1

(18)

The number of subtrees

Xn,k=number of subtrees of sizek on the fringe of random binary search trees of size n.

Example: Input: 4,7,6,1,8,5,3,2

4

1 7

3 6 8

2 5

1

2 3

4

5 6

7

8

X8,1= 2 X8,2= 2 X8,3= 1 X8,4= 0 X8,5= 1 X8,6= 0 X8,7= 0 X8,8= 1

(19)

Mean value and variance

Xn,k satisfies

Xn,k =d XIn,k+Xn−1−I n,k, where Xk,k = 1,XIn,k andXn−1−I

n,k are conditionally independent given In, andIn=Unif{0, . . . , n−1}.

This yields

µn,k :=E(Xn,k) = 2(n+ 1)

(k+ 1)(k+ 2), (n > k), and

σ2n,k:= Var(Xn,k) = 2k(4k2+ 5k−3)(n+ 1) (k+ 1)(k+ 2)2(2k+ 1)(2k+ 3) for n >2k+ 1.

(20)

Mean value and variance

Xn,k satisfies

Xn,k =d XIn,k+Xn−1−I n,k, where Xk,k = 1,XIn,k andXn−1−I

n,k are conditionally independent given In, andIn=Unif{0, . . . , n−1}.

This yields

µn,k:=E(Xn,k) = 2(n+ 1)

(k+ 1)(k+ 2), (n > k), and

σn,k2 := Var(Xn,k) = 2k(4k2+ 5k−3)(n+ 1) (k+ 1)(k+ 2)2(2k+ 1)(2k+ 3) for n >2k+ 1.

(21)

Some previous results

Aldous (1991): Weak law of large numbers Xn,k

µn,k

−→1 in probability.

Devroye (1991): Central limit theorem Xn,k−µn,k

σn,k

−→ Nd (0,1).

Flajolet, Gourdon, Martinez (1997):

Central limit theorem with optimal Berry-Esseen bound and LLT

−→ All the above results are for fixed k.

(22)

Some previous results

Aldous (1991): Weak law of large numbers Xn,k

µn,k

−→1 in probability.

Devroye (1991): Central limit theorem Xn,k−µn,k

σn,k

−→ Nd (0,1).

Flajolet, Gourdon, Martinez (1997):

Central limit theorem with optimal Berry-Esseen bound and LLT

−→ All the above results are for fixed k.

(23)

Some previous results

Aldous (1991): Weak law of large numbers Xn,k

µn,k

−→1 in probability.

Devroye (1991): Central limit theorem Xn,k−µn,k

σn,k

−→ Nd (0,1).

Flajolet, Gourdon, Martinez (1997):

Central limit theorem with optimal Berry-Esseen bound and LLT

−→ All the above results are for fixed k.

(24)

Some previous results

Aldous (1991): Weak law of large numbers Xn,k

µn,k

−→1 in probability.

Devroye (1991): Central limit theorem Xn,k−µn,k

σn,k

−→ Nd (0,1).

Flajolet, Gourdon, Martinez (1997):

Central limit theorem with optimal Berry-Esseen bound and LLT

−→ All the above results are for fixed k.

(25)

Results for k = k

n

Theorem (Feng, Mahmoud, Panholzer (2008)) (i) (Normal range) Letk=o(√

n)and k→ ∞ as n→ ∞. Then,

Xn,k−µn,k

p2n/k2

−→ Nd (0,1).

(ii) (Poisson range) Let k∼c√

n asn→ ∞. Then,

Xn,k−→d Poisson(2c−2).

(iii) (Degenerate range) Let k < nand√

n=o(k) as n→ ∞. Then, Xn,k

L1

−→0.

(26)

Why are we interested in X

n,k

?

Xn,k is a new kind of profile of a tree.

The phase change from normal to Poisson is a universal phenomenon expected to hold for many classes of random trees.

The methods for proving phase change results might be applicable to other parameters which are expected to exhibit the same phase change behavior as well.

Xn,k is related to parameters arising in genetics.

(27)

Why are we interested in X

n,k

?

Xn,k is a new kind of profile of a tree.

The phase change from normal to Poisson is a universal phenomenon expected to hold for many classes of random trees.

The methods for proving phase change results might be applicable to other parameters which are expected to exhibit the same phase change behavior as well.

Xn,k is related to parameters arising in genetics.

(28)

Why are we interested in X

n,k

?

Xn,k is a new kind of profile of a tree.

The phase change from normal to Poisson is a universal phenomenon expected to hold for many classes of random trees.

The methods for proving phase change results might be applicable to other parameters which are expected to exhibit the same phase change behavior as well.

Xn,k is related to parameters arising in genetics.

(29)

Why are we interested in X

n,k

?

Xn,k is a new kind of profile of a tree.

The phase change from normal to Poisson is a universal phenomenon expected to hold for many classes of random trees.

The methods for proving phase change results might be applicable to other parameters which are expected to exhibit the same phase change behavior as well.

Xn,k is related to parameters arising in genetics.

(30)

Yule generated random genealogical trees

Example:

5

4

2 3

1

1 2 3 4 5 6

5

4

2 3

1

1 2 3 4 5 6

genes

Random model: At every time point, two yellow nodes uniformly coalescent. Same model as random binary search tree model!

time

(31)

Yule generated random genealogical trees

Example:

5

4

2 3

1

1 2 3 4 5 6

5

4

2 3

1

1 2 3 4 5 6 genes

Random model:

At every time point, two yellow nodes uniformly coalescent.

Same model as random binary search tree model!

time

(32)

Yule generated random genealogical trees

Example:

5

4

2 3

1

1 2

3 4

5 6

5

4

2 3

1

1 2

3 4

5 6 genes

Random model:

At every time point, two yellow nodes uniformly coalescent.

Same model as random binary search tree model!

time

(33)

Yule generated random genealogical trees

Example:

5

4

2 3

1

1 2

3 4 5 6

5

4

2

3

1

1 2

3 4 5 6

genes

Random model:

At every time point, two yellow nodes uniformly coalescent.

Same model as random binary search tree model!

time

(34)

Yule generated random genealogical trees

Example:

5

4

2 3

1

1

2 3 4 5 6

5

4

2 3

1

1

2 3 4 5 6

genes

Random model:

At every time point, two yellow nodes uniformly coalescent.

Same model as random binary search tree model!

time

(35)

Yule generated random genealogical trees

Example:

5

4

2 3

1

1

2 3 4 5 6

5

4

2 3

1

1

2 3 4 5 6

genes

Random model:

At every time point, two yellow nodes uniformly coalescent.

Same model as random binary search tree model!

time

(36)

Yule generated random genealogical trees

Example:

5

4

2 3

1

1 2 3 4 5 6

5

4

2 3

1

1 2 3 4 5 6

genes

Random model:

At every time point, two yellow nodes uniformly coalescent.

Same model as random binary search tree model!

time

(37)

Yule generated random genealogical trees

Example:

5

4

2 3

1

1 2 3 4 5 6

5

4

2 3

1

1 2 3 4 5 6 genes

Random model:

At every time point, two yellow nodes uniformly coalescent.

Same model as random binary search tree model!

time

(38)

Yule generated random genealogical trees

Example:

5

4

2 3

1

1 2 3 4 5 6

5

4

2 3

1

1 2 3 4 5 6 genes

Random model:

At every time point, two yellow nodes uniformly coalescent.

Same model as random binary search tree model!

time

(39)

Shape parameters of genealogical trees

k-pronged nodes (Rosenberg 2006):

Nodes with an induced subtree with k−1internal nodes.

k-caterpillars (Rosenberg 2006):

Correspond to nodes whose induced subtree has k−1 internal nodes all of them with out-degree either0 or 1.

Nodes with minimal clade sizek (Blum and Fran¸cois (2005)): Ifk≥3, then they are internal nodes with induced subtree of size k−1 and either an empty right subtree or empty left subtree.

(40)

Shape parameters of genealogical trees

k-pronged nodes (Rosenberg 2006):

Nodes with an induced subtree with k−1internal nodes.

k-caterpillars (Rosenberg 2006):

Correspond to nodes whose induced subtree has k−1internal nodes all of them with out-degree either0 or 1.

Nodes with minimal clade sizek (Blum and Fran¸cois (2005)): Ifk≥3, then they are internal nodes with induced subtree of size k−1 and either an empty right subtree or empty left subtree.

(41)

Shape parameters of genealogical trees

k-pronged nodes (Rosenberg 2006):

Nodes with an induced subtree with k−1internal nodes.

k-caterpillars (Rosenberg 2006):

Correspond to nodes whose induced subtree has k−1internal nodes all of them with out-degree either0 or 1.

Nodes with minimal clade sizek (Blum and Fran¸cois (2005)):

Ifk≥3, then they are internal nodes with induced subtree of size k−1 and either an empty right subtree or empty left subtree.

(42)

Counting pattern in random binary search trees

ConsiderXn,k with

Xn,k

=d XIn,k+Xn−1−I n,k, where Xk,k =Bernoulli(pk),XIn,k andXn−1−I

n,k are conditionally independent givenIn, andIn=Unif{0, . . . , n−1}.

Then,

pk shape parameter

1 #ofk+ 1-pronged nodes

2/k #of nodes with minimal clade size k+ 1 2k−1/k! #of k+ 1caterpillars

(43)

Counting pattern in random binary search trees

ConsiderXn,k with

Xn,k

=d XIn,k+Xn−1−I n,k, where Xk,k =Bernoulli(pk),XIn,k andXn−1−I

n,k are conditionally independent givenIn, andIn=Unif{0, . . . , n−1}.

Then,

pk shape parameter

1 # ofk+ 1-pronged nodes

2/k #of nodes with minimal clade size k+ 1 2k−1/k! #of k+ 1caterpillars

(44)

Underlying recurrence and solution

All (centered or non-centered) moments satisfy an,k= 2

n

n−1

X

j=0

aj,k+bn,k,

where ak,k is given and an,k = 0 forn < k.

We have

an,k = 2(n+ 1)

(k+ 1)(k+ 2)ak,k+ 2(n+ 1) X

k<j<n

bj,k

(j+ 1)(j+ 2)+bn,k, where n > k.

(45)

Underlying recurrence and solution

All (centered or non-centered) moments satisfy an,k= 2

n

n−1

X

j=0

aj,k+bn,k,

where ak,k is given and an,k = 0 forn < k.

We have

an,k= 2(n+ 1)

(k+ 1)(k+ 2)ak,k+ 2(n+ 1) X

k<j<n

bj,k

(j+ 1)(j+ 2)+bn,k, where n > k.

(46)

Mean value and variance

We have

E(Xn,k) = 2(n+ 1)

(k+ 1)(k+ 2)pk, (n > k), and

Var(Xn,k) = 2pk(4k3+ 16k2+ 19k+ 6−(11k2+ 22k+ 6)pk)(n+ 1) (k+ 1)(k+ 2)2(2k+ 1)(2k+ 3)

for n >2k+ 1.

Note that

E(Xn,k)∼Var(Xn,k)∼ 2pk k2 n for n >2k+ 1andk→ ∞ as n→ ∞.

(47)

Mean value and variance

We have

E(Xn,k) = 2(n+ 1)

(k+ 1)(k+ 2)pk, (n > k), and

Var(Xn,k) = 2pk(4k3+ 16k2+ 19k+ 6−(11k2+ 22k+ 6)pk)(n+ 1) (k+ 1)(k+ 2)2(2k+ 1)(2k+ 3)

for n >2k+ 1.

Note that

E(Xn,k)∼Var(Xn,k)∼ 2pk k2 n for n >2k+ 1andk→ ∞ as n→ ∞.

(48)

Higher moments

Denote by

A(m)n,k :=E(Xn,k−E(Xn,k))m.

Then,

A(m)n,k = 2 n

n−1

X

j=0

A(m)j,k +Bn,k(m),

where

Bn,k(m):= X

i1+i2+i3=m 0≤i1,i2<m

m i1, i2, i3

1 n

n−1

X

j=0

A(ij,k1)A(in−1−j,k2)in,j,k3

and

n,j,k =E(Xj,k) +E(Xn−1−j,k)−E(Xn,k).

(49)

Higher moments

Denote by

A(m)n,k :=E(Xn,k−E(Xn,k))m.

Then,

A(m)n,k = 2 n

n−1

X

j=0

A(m)j,k +Bn,k(m),

where

Bn,k(m):= X

i1+i2+i3=m 0≤i1,i2<m

m i1, i2, i3

1 n

n−1

X

j=0

A(ij,k1)A(in−1−j,k2)in,j,k3

and

n,j,k =E(Xj,k) +E(Xn−1−j,k)−E(Xn,k).

(50)

Methods of moments

Theorem

Let Zn, Z be random variables. Assume that, asn→ ∞, E(Znm)−→E(Zm)

for allm≥1andZ is uniquely determined by its moment sequence. Then, Zn

−→d Z.

Inductive approach based on asymptotic transfers was extensively used in the AofA to prove numerous result (“moment pumping”).

Fuchs, Hwang, Neininger (2007): variation of the above scheme to study the profile of random binary search trees and random recursive trees.

(51)

Methods of moments

Theorem

Let Zn, Z be random variables. Assume that, asn→ ∞, E(Znm)−→E(Zm)

for allm≥1andZ is uniquely determined by its moment sequence. Then, Zn

−→d Z.

Inductive approach based on asymptotic transfers was extensively used in the AofA to prove numerous result (“moment pumping”).

Fuchs, Hwang, Neininger (2007): variation of the above scheme to study the profile of random binary search trees and random recursive trees.

(52)

Methods of moments

Theorem

Let Zn, Z be random variables. Assume that, asn→ ∞, E(Znm)−→E(Zm)

for allm≥1andZ is uniquely determined by its moment sequence. Then, Zn

−→d Z.

Inductive approach based on asymptotic transfers was extensively used in the AofA to prove numerous result (“moment pumping”).

Fuchs, Hwang, Neininger (2007): variation of the above scheme to study the profile of random binary search trees and random recursive trees.

(53)

Normal range

Proposition

Uniformly for n, k, m≥1andn > k

A(m)n,k = max

(2pkn k2 ,

2pkn k2

m/2) .

Proposition

For E(Xn,k)→ ∞ asn→ ∞,

A(2m−1)n,k =o

2pkn k2

m−1/2!

, A(2m)n,k ∼gm

2pkn k2

m

,

where

gm = (2m)!/(2mm!).

(54)

Normal range

Proposition

Uniformly for n, k, m≥1andn > k

A(m)n,k = max

(2pkn k2 ,

2pkn k2

m/2) .

Proposition

For E(Xn,k)→ ∞ asn→ ∞,

A(2m−1)n,k =o

2pkn k2

m−1/2!

, A(2m)n,k ∼gm

2pkn k2

m

,

where

gm= (2m)!/(2mm!).

(55)

Poisson range

Consider

(m)n,k =E(Xn,k(Xn,k−1)· · ·(Xn,k−m+ 1)).

Then, similarly as before: Proposition

(i) Uniformly for n, k, m≥1 andn > k

(m)n,k = max 2pkn

k2 , 2pkn

k2 m

. (ii) For E(Xn,k)→cand k < nasn→ ∞,

(m)n,k −→cm.

(56)

Poisson range

Consider

(m)n,k =E(Xn,k(Xn,k−1)· · ·(Xn,k−m+ 1)).

Then, similarly as before:

Proposition

(i) Uniformly for n, k, m≥1 andn > k

(m)n,k = max 2pkn

k2 , 2pkn

k2 m

. (ii) For E(Xn,k)→cand k < nasn→ ∞,

(m)n,k −→cm.

(57)

The phase change

Theorem

(i) (Normal range) LetE(Xn,k)→ ∞ andk→ ∞ asn→ ∞. Then, Xn,k−E(Xn,k)

p2pkn/k2

−→ Nd (0,1).

(ii) (Poisson range) Let E(Xn,k)→c >0and k < nasn→ ∞. Then,

Xn,k −→d Poisson(c).

(iii) (Degenerate range) Let E(Xn,k)→0as n→ ∞. Then,

Xn,k−→L1 0.

(58)

A comparison of the phase change

For k-caterpillars, we have

E(Xn,k) = 2k−1n (k+ 2)!. Note that either

E(Xn,k)→ ∞ or E(Xn,k)→0.

So, there is no Poisson range.

shape parameter location phase change

k-pronged nodes √

n normal - poisson - degenerate minimal clade sizek √3

n normal - poisson - degenerate k-caterpillars lnn/(ln lnn) normal - degenerate

(59)

A comparison of the phase change

For k-caterpillars, we have

E(Xn,k) = 2k−1n (k+ 2)!. Note that either

E(Xn,k)→ ∞ or E(Xn,k)→0.

So, there is no Poisson range.

shape parameter location phase change

k-pronged nodes √

n normal - poisson - degenerate minimal clade size k √3

n normal - poisson - degenerate k-caterpillars lnn/(ln lnn) normal - degenerate

(60)

Refined results (for # of subtrees)

Define

φn,k(y) =e−σ2n,ky2/2E

e(Xn,k−µn,k)y

. and

φ(m)n,k = dmφn,k(y) dym

y=0

.

Proposition

Uniformly for n, k≥1 andm≥0

(m)n,k| ≤m!Ammax n

k2,n k2

m/3

for a suitable constant A.

(61)

Refined results (for # of subtrees)

Define

φn,k(y) =e−σ2n,ky2/2E

e(Xn,k−µn,k)y

. and

φ(m)n,k = dmφn,k(y) dym

y=0

.

Proposition

Uniformly for n, k≥1 andm≥0

(m)n,k| ≤m!Ammax n

k2,n k2

m/3

for a suitable constant A.

(62)

Characteristic function

Let ϕn,k(y) =E(exp{(Xn,k−µn,k)iy/σn,k}).

Proposition Let 1≤k=o(√

n).

(i) For nlarge

ϕn,k(y) =e−y2/2

1 +O

|y|3 k

√n

,

uniformly for y with |y| ≤n1/6/k1/3. (ii) For nlarge

n,k(y)| ≤e−y2/2, where|y| ≤πσn,k.

(63)

Berry-Esseen bound and LLT for the normal range

Theorem (Rate of convergency) For 1≤k=o(√

n)as n→ ∞,

sup

x∈R

P Xn,k−µn,k

σn,k < x

!

−Φ(x)

=O k

√n

.

Theorem (LLT) For 1≤k=o(√

n)as n→ ∞,

P(Xn,k=bµn,k+xσn,kc) = e−x2/2

√2πσn,k

1 +O

(1 +|x|3) k

√n

,

uniformly in x=o(n1/6/k1/3).

(64)

Berry-Esseen bound and LLT for the normal range

Theorem (Rate of convergency) For 1≤k=o(√

n)as n→ ∞,

sup

x∈R

P Xn,k−µn,k

σn,k < x

!

−Φ(x)

=O k

√n

.

Theorem (LLT) For 1≤k=o(√

n)as n→ ∞,

P(Xn,k=bµn,k+xσn,kc) = e−x2/2

√ 2πσn,k

1 +O

(1 +|x|3) k

√n

,

uniformly in x=o(n1/6/k1/3).

(65)

LLT for the Poisson range

Define

φ¯n,k(y) =e−µn,k(y−1)E yXn,k . and

φ(m)n,k = dmφ¯n,k(y) dym

y=1

.

Proposition

Uniformly for n > k andm≥0

|φ¯(m)n,k| ≤m!Amn k3

m/2

for a suitable constant A.

(66)

LLT for the Poisson range

Define

φ¯n,k(y) =e−µn,k(y−1)E yXn,k . and

φ(m)n,k = dmφ¯n,k(y) dym

y=1

.

Proposition

Uniformly for n > k andm≥0

|φ¯(m)n,k| ≤m!Am n

k3 m/2

for a suitable constant A.

(67)

Poisson approximation

Theorem (LLT)

For k < nandn→ ∞,

P(Xn,k =l) =e−µn,kn,k)l

l! +On k3

uniformly in l.

Theorem (Poisson approximation)

Let k < nandk→ ∞ asn→ ∞. Then,

dT V(Xn,k,Poisson(µn,k))−→0. Remark: A rate can be given as well.

(68)

Poisson approximation

Theorem (LLT)

For k < nandn→ ∞,

P(Xn,k =l) =e−µn,kn,k)l

l! +On k3

uniformly in l.

Theorem (Poisson approximation)

Let k < nandk→ ∞ asn→ ∞. Then,

dT V(Xn,k,Poisson(µn,k))−→0.

Remark: A rate can be given as well.

(69)

Other types of random trees

Random recursive trees

Non-plane, labelled trees with every label sequence from the root to a leave increasing; random model is the uniform model.

Methods works as well (with minor modifications) and similar results can be proved.

Plane-oriented recursive trees (PORTs)

Plane, labelled trees with every label sequence from the root to a leave increasing; random model is the uniform model.

Method works as well, but details more involved.

(70)

Other types of random trees

Random recursive trees

Non-plane, labelled trees with every label sequence from the root to a leave increasing; random model is the uniform model.

Methods works as well (with minor modifications) and similar results can be proved.

Plane-oriented recursive trees (PORTs)

Plane, labelled trees with every label sequence from the root to a leave increasing; random model is the uniform model.

Method works as well, but details more involved.

(71)

# of subtrees for PORTs

Xn,k satisfies

Xn,k=d

N

X

i=1

XI(i)

i,k, where Xk,k = 1 andXI(i)

i,k are conditionally independent given (N, I1, I2, . . .).

This can be simplified to

Xn,k =d XIn,k+Xn−I n,k−1{n−In=k}, whereXk,k = 1,XIn,k andXn−I

n,k are conditionally independent givenIn

and

P(In=j) = 2(n−j)CjCn−j

nCn .

(72)

# of subtrees for PORTs

Xn,k satisfies

Xn,k=d

N

X

i=1

XI(i)

i,k, where Xk,k = 1 andXI(i)

i,k are conditionally independent given (N, I1, I2, . . .).

This can be simplified to

Xn,k =d XIn,k+Xn−I n,k−1{n−In=k}, whereXk,k = 1,XIn,k andXn−I

n,k are conditionally independent givenIn

and

P(In=j) = 2(n−j)CjCn−j

nCn .

(73)

Underlying recurrence and solution

All (centered or non-centered) moments satisfy an,k= 2

n−1

X

j=1

CjCn−j

Cn

aj,k+bn,k,

where ak,k is given and an,k = 0 forn < k.

We have

an,k= Ck(n+ 1−k)Cn+1−k

Cn ak,k+ X

k<j≤n

Cj(n+ 1−j)Cn+1−j

Cn bn,k, where n > k.

(74)

Underlying recurrence and solution

All (centered or non-centered) moments satisfy an,k= 2

n−1

X

j=1

CjCn−j

Cn

aj,k+bn,k,

where ak,k is given and an,k = 0 forn < k.

We have

an,k= Ck(n+ 1−k)Cn+1−k

Cn ak,k+ X

k<j≤n

Cj(n+ 1−j)Cn+1−j

Cn bn,k, where n > k.

(75)

Mean value and variance of PORTs

We have,

µn,k:=E(Xn,k) = 2n−1

4k2−1, (n > k).

Moreover, for fixed k asn→ ∞,

Var(Xn,k)∼ckn, where

ck = 8k2−4k−8

(4k2−1)2 − ((2k−3)!!)2

((k−1)!)24k−1k(2k+ 1), and, for k < nandk→ ∞ asn→ ∞,

E(Xn,k)∼Var(Xn,k)∼ n 2k2.

(76)

Mean value and variance of PORTs

We have,

µn,k:=E(Xn,k) = 2n−1

4k2−1, (n > k).

Moreover, for fixed k asn→ ∞,

Var(Xn,k)∼ckn, where

ck = 8k2−4k−8

(4k2−1)2 − ((2k−3)!!)2

((k−1)!)24k−1k(2k+ 1), and, for k < nandk→ ∞ asn→ ∞,

E(Xn,k)∼Var(Xn,k)∼ n 2k2.

(77)

The phase change

Theorem

(i) (Normal range) Letk=o(√

n)and k→ ∞ as n→ ∞. Then,

Xn,k−µn,k pn/(2k2)

−→ Nd (0,1).

(ii) (Poisson range) Let k∼c√

n asn→ ∞. Then, Xn,k d

−→Poisson((2c2)−1).

(iii) (Degenerate range) Let k < nand√

n=o(k) as n→ ∞. Then, Xn,k−→L1 0.

(78)

More results and future research

Parameters of genealogical trees under different random models

Universality of the phase change for the number of subtrees Very simple classes of increasing trees and more general classes of increasing trees (polynomial varieties, mobile trees, etc.)

Phase change results for the number of nodes with out-degree k Important in computer science.

A phase change from normal to degenerate is expected (no Poisson range).

(79)

More results and future research

Parameters of genealogical trees under different random models Universality of the phase change for the number of subtrees Very simple classes of increasing trees and more general classes of increasing trees (polynomial varieties, mobile trees, etc.)

Phase change results for the number of nodes with out-degree k Important in computer science.

A phase change from normal to degenerate is expected (no Poisson range).

(80)

Polynomial varieties

Bergeron, Flajolet, Salvy (1992): classes of increasing trees with degree functionφ(ω) under the uniform model.

Polynomial varieties: class of increasing tree with φ(ω) =φdωd+· · ·+φ0,

where d≥2 andφd, φ0 6= 0.

For mean value and variance of the number of subtrees, E(Xn,k)∼Var(Xn,k)∼ d

d−1 · n k2, where k→ ∞ as n→ ∞.

(81)

Polynomial varieties

Bergeron, Flajolet, Salvy (1992): classes of increasing trees with degree functionφ(ω) under the uniform model.

Polynomial varieties: class of increasing tree with φ(ω) =φdωd+· · ·+φ0, where d≥2 andφd, φ06= 0.

For mean value and variance of the number of subtrees, E(Xn,k)∼Var(Xn,k)∼ d

d−1 · n k2, where k→ ∞ as n→ ∞.

(82)

Polynomial varieties

Bergeron, Flajolet, Salvy (1992): classes of increasing trees with degree functionφ(ω) under the uniform model.

Polynomial varieties: class of increasing tree with φ(ω) =φdωd+· · ·+φ0,

where d≥2 andφd, φ06= 0.

For mean value and variance of the number of subtrees, E(Xn,k)∼Var(Xn,k)∼ d

d−1 · n k2, where k→ ∞ asn→ ∞.

(83)

Polynomial varieties - phase change

Theorem

(i) (Normal range) Letk=o(√

n)and k→ ∞ as n→ ∞. Then,

Xn,k−µn,k pnd/((d−1)k2)

−→ Nd (0,1).

(ii) (Poisson range) Let k∼c√

n asn→ ∞. Then, Xn,k d

−→Poisson(d/((d−1)c2)).

(iii) (Degenerate range) Let k < nand√

n=o(k) as n→ ∞. Then, Xn,k−→L1 0.

(84)

Mobile trees - phase change

Theorem

(i) (Normal range) Letk=o p

n/lnn

andk→ ∞ asn→ ∞. Then,

Xn,k−µn,k

pn/(k2lnk)

−→ Nd (0,1).

(ii) (Poisson range) Let k∼cp

n/lnn asn→ ∞. Then,

Xn,k−→d Poisson(2c−2).

(iii) (Degenerate range) Let k < nandp

n/lnn=o(k) as n→ ∞.

Then,

Xn,k−→L1 0.

(85)

More results and future research

Parameters of genealogical trees under different random models Universality of the phase change for the number of subtrees Very simple classes of increasing trees and more general classes of increasing trees (polynomial varieties, mobile trees, etc.)

Phase change results for the number of nodes with out-degree k Important in computer science.

A phase change from normal to degenerate is expected (no Poisson range).

(86)

More results and future research

Parameters of genealogical trees under different random models Universality of the phase change for the number of subtrees Very simple classes of increasing trees and more general classes of increasing trees (polynomial varieties, mobile trees, etc.)

Phase change results for the number of nodes with out-degree k Important in computer science.

A phase change from normal to degenerate is expected (no Poisson range).

Referências

Documentos relacionados

didático e resolva as ​listas de exercícios (disponíveis no ​Classroom​) referentes às obras de Carlos Drummond de Andrade, João Guimarães Rosa, Machado de Assis,

The probability of attending school four our group of interest in this region increased by 6.5 percentage points after the expansion of the Bolsa Família program in 2007 and

No campo, os efeitos da seca e da privatiza- ção dos recursos recaíram principalmente sobre agricultores familiares, que mobilizaram as comunidades rurais organizadas e as agências

It is underlined variability of number of tourists during the long period and one year and this should not be neglected in tourism planning on this mountain’s center.. Key words:

social assistance. The protection of jobs within some enterprises, cooperatives, forms of economical associations, constitute an efficient social policy, totally different from

H„ autores que preferem excluir dos estudos de prevalˆncia lesŽes associadas a dentes restaurados para evitar confus‚o de diagn€stico com lesŽes de

Na hepatite B, as enzimas hepáticas têm valores menores tanto para quem toma quanto para os que não tomam café comparados ao vírus C, porém os dados foram estatisticamente

Mas, apesar das recomendações do Ministério da Saúde (MS) sobre aleitamento materno e sobre as desvantagens de uso de bicos artificiais (OPAS/OMS, 2003), parece não