• Nenhum resultado encontrado

Probabilistic Forests An application to the rhythmic distinction between Brazilian and European Portuguese Antonio Galves Denis Lacerda Florencia Leonardi

N/A
N/A
Protected

Academic year: 2022

Share "Probabilistic Forests An application to the rhythmic distinction between Brazilian and European Portuguese Antonio Galves Denis Lacerda Florencia Leonardi"

Copied!
52
0
0

Texto

(1)

Probabilistic Forests

An application to the rhythmic distinction between Brazilian and European Portuguese

Antonio Galves Denis Lacerda Florencia Leonardi

Instituto de Matemática e Estatística Universidade de São Paulo

28th November, 2005

(2)

Remembering VLMC

A VLMC is a stochastic chain(X0,X1, . . .)taking values on a finite alphabetAand characterized by two elements:

The set of all contexts.

A contextXn−`, . . . ,Xn−1is the finite portion of the past X0, . . . ,Xn−1which is relevant to predict the next symbol Xn.

A family of transition probabilities associated to each context.

Given a context, its associated transition probability gives

(3)

Remembering VLMC

A VLMC is a stochastic chain(X0,X1, . . .)taking values on a finite alphabetAand characterized by two elements:

The set of all contexts.

A contextXn−`, . . . ,Xn−1is the finite portion of the past X0, . . . ,Xn−1which is relevant to predict the next symbol Xn.

A family of transition probabilities associated to each context.

Given a context, its associated transition probability gives

(4)

Remembering VLMC

A VLMC is a stochastic chain(X0,X1, . . .)taking values on a finite alphabetAand characterized by two elements:

The set of all contexts.

A contextXn−`, . . . ,Xn−1is the finite portion of the past X0, . . . ,Xn−1which is relevant to predict the next symbol Xn.

A family of transition probabilities associated to each context.

Given a context, its associated transition probability gives

(5)

Remembering VLMC

The set of all contexts is asuffix code. This means that no context is a suffix of another context.

For this reason this set can be represented as a tree. This tree with the associated transition probabilities is called a Probabilistic Tree.

(6)

Remembering VLMC

The set of all contexts is asuffix code. This means that no context is a suffix of another context.

For this reason this set can be represented as a tree. This tree with the associated transition probabilities is called a Probabilistic Tree.

(7)

Preliminary study

Variable Length Markov Chains (VLMC) have been used to discriminate rhythmic patterns in European and Brazilian Portuguese written texts (Galves et al. 2005).

The written texts where transformed into sequences over the alphabetA={0,1,2,3,4}.

The sequences where obtained using well defined rules taking into account the boundaries between prosodic words and the stressed syllabus.

This analysis suggests a typical contexts tree for each

(8)

Preliminary study

Variable Length Markov Chains (VLMC) have been used to discriminate rhythmic patterns in European and Brazilian Portuguese written texts (Galves et al. 2005).

The written texts where transformed into sequences over the alphabetA={0,1,2,3,4}.

The sequences where obtained using well defined rules taking into account the boundaries between prosodic words and the stressed syllabus.

This analysis suggests a typical contexts tree for each

(9)

Preliminary study

Variable Length Markov Chains (VLMC) have been used to discriminate rhythmic patterns in European and Brazilian Portuguese written texts (Galves et al. 2005).

The written texts where transformed into sequences over the alphabetA={0,1,2,3,4}.

The sequences where obtained using well defined rules taking into account the boundaries between prosodic words and the stressed syllabus.

This analysis suggests a typical contexts tree for each

(10)

Preliminary study

Variable Length Markov Chains (VLMC) have been used to discriminate rhythmic patterns in European and Brazilian Portuguese written texts (Galves et al. 2005).

The written texts where transformed into sequences over the alphabetA={0,1,2,3,4}.

The sequences where obtained using well defined rules taking into account the boundaries between prosodic words and the stressed syllabus.

This analysis suggests a typical contexts tree for each

(11)

Preliminary study

The typical trees for BP and EP are:

This study showed significant patterns that had been

(12)

But it is also true that...

The preceding results show high variability.

The Context Algorithm used to estimate the trees is very sensitive to spurious strings.

(13)

But it is also true that...

The preceding results show high variability.

The Context Algorithm used to estimate the trees is very sensitive to spurious strings.

(14)

A new approach: probabilistic forests

A probabilistic forest is a mixture of probabilistic trees, that is A setΓof probabilistic trees.

A probability distribution over this set{p(τ)}τ∈Γ.

(15)

A new approach: probabilistic forests

A probabilistic forest is a mixture of probabilistic trees, that is A setΓof probabilistic trees.

A probability distribution over this set{p(τ)}τ∈Γ.

(16)

A new approach: probabilistic forests

A probabilistic forest is a mixture of probabilistic trees, that is A setΓof probabilistic trees.

A probability distribution over this set{p(τ)}τ∈Γ.

(17)

How to define the set of trees Γ

The setΓdepends on the problem and can be any set with a reasonable number of trees.

For the linguistic problem, we choose the set of trees satisfying the algebraic restrictions given by the codification and with depth not bigger than 3.

Given a sample sequencex1,x2, . . . ,xn(or a set of sample sequences) the probability of each tree in the set is

estimated by the procedure followed in Eskinet al. (2002).

(18)

How to define the set of trees Γ

The setΓdepends on the problem and can be any set with a reasonable number of trees.

For the linguistic problem, we choose the set of trees satisfying the algebraic restrictions given by the codification and with depth not bigger than 3.

Given a sample sequencex1,x2, . . . ,xn(or a set of sample sequences) the probability of each tree in the set is

estimated by the procedure followed in Eskinet al. (2002).

(19)

How to define the set of trees Γ

The setΓdepends on the problem and can be any set with a reasonable number of trees.

For the linguistic problem, we choose the set of trees satisfying the algebraic restrictions given by the codification and with depth not bigger than 3.

Given a sample sequencex1,x2, . . . ,xn(or a set of sample sequences) the probability of each tree in the set is

estimated by the procedure followed in Eskinet al. (2002).

(20)

How to estimate the probability of each tree

Following Eskinet al. (2002):

First, propose apriorweightω0(τ).

Then, for each instant of timei update this weight by the formula:

ωi(τ) =

ωi−1(τ)|A|1 , ifi<d(Γ)

ωi−1(τ)ˆPiτ(xi|cτ(x0, . . . ,xi−1)), ifn≥d(Γ) wherePˆiτ(xi|cτ(x0, . . . ,xi−1))is theMaximum Likelihood Estimateof the transition probabilities inτ with the sample

(21)

How to estimate the probability of each tree

Following Eskinet al. (2002):

First, propose apriorweightω0(τ).

Then, for each instant of timei update this weight by the formula:

ωi(τ) =

ωi−1(τ)|A|1 , ifi<d(Γ)

ωi−1(τ)ˆPiτ(xi|cτ(x0, . . . ,xi−1)), ifn≥d(Γ) wherePˆiτ(xi|cτ(x0, . . . ,xi−1))is theMaximum Likelihood Estimateof the transition probabilities inτ with the sample

(22)

A closed formula for ω

n

(τ )

These weightsωn(τ)can be rewritten as:

ωn(τ) =ωn−1(τ)ˆPnτ(xn|cτ(x0, . . . ,xn−1))

0(τ) 1

|A|d(Γ)

n

Y

i=d(Γ)

iτ(xi|cτ(x0, . . . ,xi−1))

0(τ) 1

|A|d(Γ)

Y Y (Nn(s,a) +1)!

(Nn(s) +|A|)!

(23)

A closed formula for ω

n

(τ )

These weightsωn(τ)can be rewritten as:

ωn(τ) =ωn−1(τ)ˆPnτ(xn|cτ(x0, . . . ,xn−1))

0(τ) 1

|A|d(Γ)

n

Y

i=d(Γ)

iτ(xi|cτ(x0, . . . ,xi−1))

0(τ) 1

|A|d(Γ)

Y Y (Nn(s,a) +1)!

(Nn(s) +|A|)!

(24)

A closed formula for ω

n

(τ )

These weightsωn(τ)can be rewritten as:

ωn(τ) =ωn−1(τ)ˆPnτ(xn|cτ(x0, . . . ,xn−1))

0(τ) 1

|A|d(Γ)

n

Y

i=d(Γ)

iτ(xi|cτ(x0, . . . ,xi−1))

0(τ) 1

|A|d(Γ)

Y Y (Nn(s,a) +1)!

(Nn(s) +|A|)!

(25)

How to estimate the probability of each tree

Finally, we normalize the weights using the formula:

pn(τ) = ωn(τ) P

τ0∈Γωn0)

Therefore,pn(τ)is the probability of treeτ estimated with the samplex1,x2, . . . ,xnby the preceding procedure.

(26)

Empirical results for EP

In the case of European Portuguese, we estimate the weights of all possible trees with depth less or equal 3. We used 31 texts of Portuguese authors.

n=148887.

ω0(τ) = (500n)−t(τ), wheret(τ)is the number of terminal nodes ofτ.

pn(τ) =0.99 for the following tree:

(27)

Empirical results for EP

In the case of European Portuguese, we estimate the weights of all possible trees with depth less or equal 3. We used 31 texts of Portuguese authors.

n=148887.

ω0(τ) = (500n)−t(τ), wheret(τ)is the number of terminal nodes ofτ.

pn(τ) =0.99 for the following tree:

(28)

Empirical results for EP

In the case of European Portuguese, we estimate the weights of all possible trees with depth less or equal 3. We used 31 texts of Portuguese authors.

n=148887.

ω0(τ) = (500n)−t(τ), wheret(τ)is the number of terminal nodes ofτ.

pn(τ) =0.99 for the following tree:

(29)

Empirical results for EP

In the case of European Portuguese, we estimate the weights of all possible trees with depth less or equal 3. We used 31 texts of Portuguese authors.

n=148887.

ω0(τ) = (500n)−t(τ), wheret(τ)is the number of terminal nodes ofτ.

pn(τ) =0.99 for the following tree:

(30)

Empirical results for EP

n=148887.

ω0(τ) = (500n)−t(τ), wheret(τ)is the number of terminal nodes ofτ.

pn(τ) =0.99 for the following tree:

(31)

Empirical results for BP

In the case of Brazilian Portuguese, we also estimate the weights of all possible trees with depth less or equal 3. We used 35 texts of Brazilian authors.

n=619282.

ω0(τ) = (500n)−t(τ), wheret(τ)is the number of terminal nodes ofτ.

pn(τ) =0.8 for the following tree:

(32)

Empirical results for BP

In the case of Brazilian Portuguese, we also estimate the weights of all possible trees with depth less or equal 3. We used 35 texts of Brazilian authors.

n=619282.

ω0(τ) = (500n)−t(τ), wheret(τ)is the number of terminal nodes ofτ.

pn(τ) =0.8 for the following tree:

(33)

Empirical results for BP

In the case of Brazilian Portuguese, we also estimate the weights of all possible trees with depth less or equal 3. We used 35 texts of Brazilian authors.

n=619282.

ω0(τ) = (500n)−t(τ), wheret(τ)is the number of terminal nodes ofτ.

pn(τ) =0.8 for the following tree:

(34)

Empirical results for BP

In the case of Brazilian Portuguese, we also estimate the weights of all possible trees with depth less or equal 3. We used 35 texts of Brazilian authors.

n=619282.

ω0(τ) = (500n)−t(τ), wheret(τ)is the number of terminal nodes ofτ.

pn(τ) =0.8 for the following tree:

(35)

Empirical results for BP

n=619282.

ω0(τ) = (500n)−t(τ), wheret(τ)is the number of terminal nodes ofτ.

pn(τ) =0.8 for the following tree:

(36)

Consistency Theorem

Theorem

Letτ be a probabilistic tree. Then for almost all sequences x1,x2, . . .generated byτ, we have that

pn(τ)→1when n→ ∞.

(37)

Consistency Theorem

Idea of the proof

pn(τ) = ωn(τ) P

τ0∈Γωn0)

= X

τ0∈Γ

ωn0) ωn(τ)

!−1

=

1+ X ωn0)

n

−1

(38)

Consistency Theorem

Idea of the proof

pn(τ) = ωn(τ) P

τ0∈Γωn0)

= X

τ0∈Γ

ωn0) ωn(τ)

!−1

=

1+ X ωn0)

n

−1

(39)

Consistency Theorem

Idea of the proof

pn(τ) = ωn(τ) P

τ0∈Γωn0)

= X

τ0∈Γ

ωn0) ωn(τ)

!−1

=

1+ X ωn0)

n

−1

(40)

Consistency Theorem

Idea of the proof

ωn(τ) =ω0(τ) 1

|A|d(Γ) Y

s∈τ

Y

a∈A

(Nn(s,a) +1)!

(Nn(s) +|A|)!

0(τ) 1

|A|d(Γ) Y

s∈τ

KT,s(x1n)

(41)

Consistency Theorem

Idea of the proof

ωn(τ) =ω0(τ) 1

|A|d(Γ) Y

s∈τ

Y

a∈A

(Nn(s,a) +1)!

(Nn(s) +|A|)!

0(τ) 1

|A|d(Γ) Y

s∈τ

KT,s(x1n)

(42)

Consistency Theorem

Idea of the proof

ωn0)

ωn(τ) = ω00) ω0(τ)

Q

s0∈τ0KT,s0(x1n) Q

s∈τKT,s(x1n)

= ω00) ω0(τ)

Y

s∈τ,s≺τ0

Q

s0∈τ0,s0>sKT,s0(x1n) PˆKT,s(x1n)

Y PˆKT,s0(x1n) Q Pˆ (xn)

(43)

Consistency Theorem

Idea of the proof

ωn0)

ωn(τ) = ω00) ω0(τ)

Q

s0∈τ0KT,s0(x1n) Q

s∈τKT,s(x1n)

= ω00) ω0(τ)

Y

s∈τ,s≺τ0

Q

s0∈τ0,s0>sKT,s0(x1n) PˆKT,s(x1n)

Y PˆKT,s0(x1n) Q Pˆ (xn)

(44)

Consistency Theorem

Idea of the proof

logωn0)

ωn(τ) = logω00) ω0(τ) +

+ X

s∈τ,s≺τ0

 X

s0∈τ0,s0>s

logPˆKT,s0(x1n)−logPˆKT,s(x1n)

+ X

s0∈τ0,s0≺τ

logPˆKT,s0(x1n)− X

s∈τ,s>s0≺τ

logPˆKT,s(x1n)

(45)

Consistency Theorem

Idea of the proof

logωn0)

ωn(τ) = logω00) ω0(τ) +

+ X

s∈τ,s≺τ0

 X

s0∈τ0,s0>s

logPˆKT,s0(x1n)−logPˆKT,s(x1n)

+ X

s0∈τ0,s0≺τ

logPˆKT,s0(x1n)− X

s∈τ,s>s0≺τ

logPˆKT,s(x1n)

(46)

Consistency Theorem

Idea of the proof

logωn0)

ωn(τ) = logω00) ω0(τ) +

+ X

s∈τ,s≺τ0

 X

s0∈τ0,s0>s

logPˆKT,s0(x1n)−logPˆKT,s(x1n)

+ X

s0∈τ0,s0≺τ

logPˆKT,s0(x1n)− X

s∈τ,s>s0≺τ

logPˆKT,s(x1n)

(47)

Consistency Theorem

Idea of the proof

logωn0)

ωn(τ) = logω00) ω0(τ) +

+ X

s∈τ,s≺τ0

 X

s0∈τ0,s0>s

logPˆKT,s0(x1n)−logPˆKT,s(x1n)

+ X

s0∈τ0,s0≺τ

logPˆKT,s0(x1n)− X

s∈τ,s>s0≺τ

logPˆKT,s(x1n)

(48)

Consistency Theorem

Idea of the proof

Following the ideas of Csiszár and Talata (2005) we proved that Lemma

Let s∈τ such that s is a proper suffix of some context inτ0. Then there exists a constant c<0such that

X

s0∈τ0,s0>s

logPˆKT,s0(x1n)−logPˆKT,s(x1n)<clogn, eventually almost surely as n→ ∞.

(49)

Consistency Theorem

Idea of the proof

Lemma

Let s0 ∈τ0such that s0 is a proper suffix of some context inτ. Then there exists a constant c<0such that

logPˆKT,s0(x1n)− X

s∈τ,s>s0

logPˆKT,s(x1n)<cn, eventually almost surely as n→ ∞.

(50)

Consistency Theorem

Idea of the proof

Then there exist constantsC1≤0 andC2≤0, not vanishing simultaneously, such that

logωn0)

ωn(τ) <logω00)

ω0(τ) +C1logn+C2n

Withω0(τ) = (cn)−t(τ), wheret(τ)is the number of terminal nodes ofτ we have that

logωn0)

ωn(τ) → −∞,

(51)

Consistency Theorem

Idea of the proof

Then there exist constantsC1≤0 andC2≤0, not vanishing simultaneously, such that

logωn0)

ωn(τ) <logω00)

ω0(τ) +C1logn+C2n

Withω0(τ) = (cn)−t(τ), wheret(τ)is the number of terminal nodes ofτ we have that

logωn0)

ωn(τ) → −∞,

(52)

Conclusion

“Only one tree represents the forest”

Referências

Documentos relacionados

Cette observation de surface, s’abstrayant de modèles reconnus alors comme peu efficaces pour traiter des observables, va amener le professeur à se poser des

As pessoas, como parceiros, são um “cliente” muito particular que investe na organização com o seu saber, com o seu esforço, com a sua dedicação e com a sua

Apenas 45% apresentavam-se dentro do adequado, mostrando-se, portanto, que ainda há um grande desafio na luta de se atualizar a situação imune das crianças,

Sabendo que um dos pontos cruciais na compreensão de eletrônica é o estudo dos transistores, foi desenvolvido em um simulador, um circuito capaz de

É por isso que a família moderna é sempre socioafetiva, já que é um grupo social unido pela convivência afetiva, e que transformou o afeto numa ca- tegoria jurídica, por ser um

Posteriormente, são descritas as políticas públicas brasileiras na área de saúde mental tanto para a população em geral, quanto aquelas voltadas para crianças e

O presente trabalho teve como finalidade avaliar e monitorizar a qualidade da água da bacia hidrográfica do rio Tâmega nos concelhos de Mondim de Basto e Celorico de

Já atrás refcrimos que as ânforas Mana Pascua l A4 es tão ainda presentes em Castro Marim du r ante o período tardo republicano, parecendo mesmo que o auge da importação