• Nenhum resultado encontrado

Hervé Abdi - Cedric-Cnam

N/A
N/A
Protected

Academic year: 2023

Share "Hervé Abdi - Cedric-Cnam"

Copied!
91
0
0

Texto

(1)

Hervé Abdi

The University of Texas at Dallas

Paris CNAM November 2011

ANALYSE MULTI-TABLEAUX: LA FAMILLE STATIS

(2)

CNAM: 7 NOVEMBRE 2011 Part two: La Famille STATIS

(3)

CNAM: 7 NOVEMBRE 2011 La Famille STATIS

Biblio. Allez voir à www.utdallas.edu/~herve A87: STATIS & DISTATIS

A71, A59: DISTATIS C71: Rv Coefficient

C40: Multiple Factor Analysis C33: STATIS

(4)

START WITH K TABLES The STATIS Family

(5)

START WITH K TABLES The STATIS Family

(6)

BEFORE STATIS

•  Center and Normalize

(or not) columns (almost always)

(7)

BEFORE STATIS

•  Normalize (or not) rows (rarely) exception: CA

è sum of x = 1

Escofier/Volle/Rao/Hellinger (etc.) è sum of x2 = 1

(8)

BEFORE STATIS

•  What about the tables?

(9)

NORMALIZING TABLES: WHY?

(10)

HOW TO NORMALIZE TABLE K.

•  Divide all elements of Xk by Jk

(11)

HOW TO NORMALIZE TABLE K.

•  Divide all elements of Xk by Jk or better by Jk½

•  Plain multi-block èTucker 1 & consensus PCA

(12)

HOW TO NORMALIZE TABLE K.

•  Divide all elements of Xk by (sum Xk) ½

•  Plain multi-block è SUM-PCA

(13)

HOW TO NORMALIZE TABLE K.

•  Divide all elements of Xk by (sum XkXkT) ½

•  Plain multi-block è RV-PCA

(14)

HOW TO NORMALIZE TABLE K.

•  Divide all elements of Xk by first singular value

•  Plain multi-block è Multiple Factor Analysis

(15)

TABLE NORMALIZATION: MOST IMPORTANT STEP!

(16)

WHEN THE TABLES ARE NORMALIZED: WHAT TO DO?

•  STATIS: 1,2, 3 …

(17)

START WITH A MULTI TABLE MATRIX

1

i I

1 j J1 1 j Jk 1 jJK

X

C

< Sk ,Sk’ >

K

K 1

2

PCA of C

GPCA weighted by a

Compromise Tables in the compromise

J variables in K studies

I Observations

Inner Product matrix

Inner Product map

a

X1 Xk XK

(18)

COMPUTE THE BETWEEN TABLE SIMILARITY

1

i I

1 j J1 1 j Jk 1 jJK

X

C

< Sk ,Sk’ >

K

K 1

2

PCA of C

GPCA weighted by a

Compromise Tables in the compromise

J variables in K studies

I Observations

Inner Product matrix

Inner Product map

a

X1 Xk XK

(19)

GET A PCA OF THE BETWEEN TABLE SIMILARITY

1

i I

1 j J1 1 j Jk 1 jJK

X

C

< Sk ,Sk’ >

K

K 1

2

PCA of C

GPCA weighted by a

Compromise Tables in the compromise

J variables in K studies

I Observations

Inner Product matrix

Inner Product map

a

X1 Xk XK

(20)

PCA OF C GIVES OPTIMAL ALPHA WEIGHTS

1

i I

1 j J1 1 j Jk 1 jJK

X

C

< Sk ,Sk’ >

K

K 1

2

PCA of C

GPCA weighted by a

Compromise Tables in the compromise

J variables in K studies

I Observations

Inner Product matrix

Inner Product map

a

X1 Xk XK

(21)

ALPHA WEIGHTS ARE USED FOR GPCA OF X

1

i I

1 j J1 1 j Jk 1 jJK

X

C

< Sk ,Sk’ >

K

K 1

2

PCA of C

GPCA weighted by a

Compromise Tables in the compromise

J variables in K studies

I Observations

Inner Product matrix

Inner Product map

a

X1 Xk XK

(22)

START WITH A MULTI TABLE MATRIX

1

i I

1 j J1 1 j Jk 1 jJK

X

C

< Sk ,Sk’ >

K

K 1

2

PCA of C

GPCA weighted by a

Compromise Tables in the compromise

J variables in K studies

I Observations

Inner Product matrix

Inner Product map

a

X1 Xk XK

(23)

GPCA OF X è FACTOR SCORES F (COMPROMISE)

1

i I

1 j J1 1 j Jk 1 jJK

X

C

< Sk ,Sk’ >

K

K 1

2

PCA of C

GPCA weighted by a

Compromise Tables in the compromise

J variables in K studies

I Observations

Inner Product matrix

Inner Product map

a

X1 Xk XK

(24)

PROJECT THE XK ON COMPROMISE è FK

1

i I

1 j J1 1 j Jk 1j JK

X

C

< Sk ,Sk’ >

K

K 1

2

PCA of C

GPCA weighted by a

Compromise Tables in the compromise

J variables in K studies

I Observations

Inner Product matrix

Inner Product map

a

X1 Xk XK

(25)

AN EXAMPLE

(26)

EXAMPLE: 10 PARTICIPANTS TASTE 3*4 =12 WINES

•  Sauvignon Blanc Wines

•  From New-Zealand, France, and Canada

•  Chemical/Physical measurements

•  Specific scales + Four commons scales:

cat-pee, passion, green pepper, mineral

(27)

REMEMBER: THE STEPS OF STATIS

•  1. Between Table Structure

•  2. Derive Optimal Weights

•  3. Compute Compromise from Weights

•  4. Eigen-decompose Compromise

•  5. Project Original Tables (factor scores)

•  6. … Is the Earth round? …

(28)

THE STEPS OF STATIS

•  1. Between Table Structure

(29)

THE DATA: 10 ASSESSORS BY 3*4 = 12 WINES

(30)

SUPPLEMENTARY TABLE: CHEMISTRY

(31)

A MATRIX: ASSESSOR 1

(32)

PRE-PROCESSED (CENTER, SUM OF X2 = 1)

(33)

A CROSS-PRODUCT MATRIX: X1X1T

(34)

COSINE (OR RV) MATRIX

(35)

EIGEN DECOMPOSITION OF C

(36)

EIGEN OF C: THE ASSESSORSMAP

(37)

FACTOR SCORES FROM C

(38)

THE STEPS OF STATIS

•  1. Between Table Structure

•  2. Derive Optimal Weights

(39)

RESCALE FACTOR SCORES DIMENSION 1 TO SUM OF 1

(40)

WEIGHTS (EQUAL WEIGHTS = .10)

(41)

WEIGHTS (EQUAL WEIGHTS = .10)

•  From α get diagonal matrix Α

(42)

THE STEPS OF STATIS

•  1. Between Table Structure

•  2. Derive Optimal Weights

•  3. Compute Compromise from α weights

(43)

MASSES ARE FOR THE ROWS (EQUAL MASSES = .08)

•  Get diagonal matrix of masses for rows M

•  M = 1/I I (equal masses)

(44)

GET COMPROMISE. 1 GENERALIZED SVD OF X

•  X = PΔQT with QTAQ = PTMP = I

(45)

GET COMPROMISE. 2 FACTOR SCORES

•  X = PΔQT with QTAQ = PTMP = I

•  F = PΔ = XAQ

(46)

COMPROMISE: PLOT OF FACTOR SCORES

1 2

2 2

4 3

1 3

4

2 1 3

4 1 2

(47)

THE STEPS OF STATIS

•  1. Between Table Structure

•  2. Derive Optimal Weights

•  3. Compute Compromise from Weights

•  4. Eigen-decompose Compromise

•  5. Project Original Tables (factor scores)

(48)

PARTIAL FACTOR SCORES

•  X = PΔQT with QTAQ = PTMP = I

•  F = PΔ = XAQ

•  Fk = XkQk

(49)

PARTIAL FACTOR SCORES: BARYCENTRIC PROPERTY

•  X = PΔQT with QTAQ = PTMP = I

•  F = PΔ = XAQ

•  Fk = XkQk

•  F =

Σ

αkFk =

Σ

αk XkQk

(50)

COMPROMISE WITH “TABLES

(51)

THE TABLES ASBIPLOTS

(52)

WHAT ARE THE IMPORTANT TABLES?

(53)

CONTRIBUTIONS TO INERTIA

(54)

PARTIAL INERTIA

(55)

HOW TO HELP: PROJECTING NEW TABLES

(56)

PHYSICO AS SUP (FACTOR SCORES + LOADINGS)

1 2

1 1

1

2 2

3 2

3

4 3 4

4

Acidity pH

Alcohol Sugar

(57)

PHYSICO. RV AS SUP

1

2 3 4 5

6 7

8 9

101 2

(58)

THE STEPS OF STATIS

•  1. Between Table Structure

•  2. Derive Optimal Weights

•  3. Compute Compromise from Weights

•  4. Eigen-decompose Compromise

•  5. Project Original Tables (factor scores)

•  6… ? Is the Earth …

(59)

IS THE EARTH ROUND P < .05?

(60)

IS THE EARTH ROUND P < .05?

•  Bootstrap again: The assessors are random

(61)

COMPUTE CONFIDENCE INTERVALS

(62)

BOOTSTRAP 95% CI

(63)

BOOTSTRAP RATIOS (WHAT IS THAT?)

(64)
(65)

EXTENSIONS OF STATIS

(66)

PARTIAL TRIADIC ANALYSIS

(67)

PARTIAL TRIADIC ANALYSIS

•  Same variables all over:

(68)

PARTIAL TRIADIC ANALYSIS

•  Same variables all over:

use Xk in lieu of Sk

(69)

PARTIAL TRIADIC ANALYSIS

•  Same variables all over:

use Xk in lieu of Sk

Possible problem: negative cosine

(70)

PTA FACTOR SCORES

PeeCat

Passion Fruit

Green Pepper

Mineral

1

2

2

4 3

1

3 4

2 1 4 3

2 1

(71)

DISTATIS

(72)

DISTATIS: START WITH DISTANCE MATRICES

•  K (squared Euclidean) distance matrices Dk

(73)

TRANSFORMS THE DISTANCES INTO COVARIANCE

I

I

S

I

I

D

Double Centering

Distance

With a formula:

With matrices:

si,j = di,j(di,+ – d+,+)(d+,j – d+,+)

S = –.5ΞDΞT with Ξ = I 1mT and mT1 = 1

(74)

AND BACK TO STANDARD STATIS

(75)

N GROUPS: CANONICAL STATIS: CANOSTATIS.

•  Here 3 groups:

France, Canada, New Zealand

(76)

N GROUPS

•  Compute Mahalanobis distance per Table

(77)

N GROUPS

•  Compute Mahalanobis distance per Table

•  And back to DISTATIS

(78)

WINE EXAMPLE

(79)

CANOSTATIS: THE ASSESSORS

(80)

CANONICAL STATIS: 3 GROUPS

(81)

CANOSTATIS WITH CONFIDENCE

(82)

ONE MORE TABLE: (K+1) STATIS

(83)

ONE MORE TABLE: (K+1) STATIS

•  Use Sk* = HXk instead of Sk

(84)

ONE MORE TABLE: (K+1) STATIS

•  Use Sk* = HXk instead of Sk

•  and back to STATIS

(85)

(K+1) STATIS

1) COMPROMISE & 2) PHYSICO

A B

(86)

ANISO-STATIS

(87)

ANISO-STATIS

•  One weight per column

(88)

ANISOSTATIS

(89)

DOUBLE STATIS: DO-STATIS OR DO-ACT

(90)

DOUBLE STATIS: DO-STATIS OR DO-ACT

•  Two sets of matrices

(91)

RELATED TECHNIQUES

•  Generalized canonical correlation

•  Multiple factor analysis & SUM-PCA

•  INDSCAL

Referências

Documentos relacionados

Porém, se excluirmos a influência determinante do modelo teatral francês na recepção de Ernani, o facto de esta ópera se afirmar inequivocamente como o grande sucesso en- tre