Cochran's theorem

Cochran's theorem In statistics, Cochran's theorem, devised by William G. Cochran,[1] is a theorem used to justify results relating to the probability distributions of statistics that are used in the analysis of variance.[2] Contenu 1 Déclaration 1.1 Preuve 2 Exemples 2.1 Sample mean and sample variance 2.2 Distributions 2.3 Estimation of variance 3 Alternative formulation 4 Voir également 5 References Statement Let U1, ..., UN be i.i.d. standard normally distributed random variables, et {style d'affichage U=[U_{1},...,U_{N}]^{J}} . Laisser {style d'affichage B^{(1)},B^{(2)},ldots ,B^{(k)}} be symmetric matrices. Define ri to be the rank of {style d'affichage B^{(je)}} . Définir {style d'affichage Q_{je}=U^{J}B^{(je)}tu} , so that the Qi are quadratic forms. Further assume {somme de style d'affichage _{je}Q_{je}=U^{J}tu} .

Cochran's theorem states that the following are equivalent: {style d'affichage r_{1}+cdots +r_{k}=N} , the Qi are independent each Qi has a chi-squared distribution with ri degrees of freedom.[1][3] Often it's stated as {somme de style d'affichage _{je}UN_{je}=A} , où {style d'affichage A} is idempotent, et {somme de style d'affichage _{je}r_{je}=N} is replaced by {somme de style d'affichage _{je}r_{je}=rank(UN)} . But after an orthogonal transform, {displaystyle A=diag(JE_{M},0)} , and so we reduce to the above theorem.

Proof Claim: Laisser {style d'affichage X} be a standard Gaussian in {style d'affichage mathbb {R} ^{n}} , then for any symmetric matrices {style d'affichage Q,Q'} , si {style d'affichage X^{J}QX} et {style d'affichage X^{J}Q'X} have the same distribution, alors {style d'affichage Q,Q'} have the same eigenvalues (up to multiplicity).

Preuve: Let the eigenvalues of {style d'affichage Q} être {style d'affichage lambda _{1},...,lambda _{n}} , then calculate the characteristic function of {style d'affichage X^{J}QX} . It comes out to be {style d'affichage phi (t)=gauche(produit _{j}(1-2ilambda _{j}t)droit)^{-1/2}} (To calculate it, first diagonalize {style d'affichage Q} , change into that frame, then use the fact that the characteristic function of the sum of independent variables is the product of their characteristic functions.) Pour {style d'affichage X^{J}QX} et {style d'affichage X^{J}Q'X} to be equal, their characteristic functions must be equal, alors {style d'affichage Q,Q'} have the same eigenvalues (up to multiplicity).

Claim: {displaystyle I=sum _{je}B_{je}} .

Preuve: {style d'affichage U^{J}(I-sum _{je}B_{je})U=0} . Depuis {style d'affichage (I-sum _{je}B_{je})} is symmetric, et {style d'affichage U^{J}(I-sum _{je}B_{je})U=^{ré}U ^{J}0tu} , by the previous claim, {style d'affichage (I-sum _{je}B_{je})} has the same eigenvalues as 0.

Lemme: Si {somme de style d'affichage _{je}M_{je}=je} , all {style d'affichage M_{je}} symmetric, and have eigenvalues 0, 1, then they are simultaneously diagonalizable.

Fix i, and consider the eigenvectors v of {style d'affichage M_{je}} tel que {style d'affichage M_{je}v=v} . Then we have {displaystyle v^{J}v=v^{J}Iv=v^{J}v+sum _{jneq i}v^{J}M_{j}v} , so all {displaystyle v^{J}M_{j}v=0} . Thus we obtain a split of {style d'affichage mathbb {R} ^{N}} dans {displaystyle Voplus V^{perp }} , such that V is the 1-eigenspace of {style d'affichage M_{je}} , and in the 0-eigenspaces of all other {style d'affichage M_{j}} . Now induct by moving into {style d'affichage V^{perp }} .

Cas: All {style d'affichage Q_{je}} are independent Fix some {style d'affichage i} , définir {displaystyle C_{je}=I-B_{je}=somme _{jneq i}B_{j}} , and diagonalize {style d'affichage B_{je}} by an orthogonal transform {style d'affichage O} . Then consider {displaystyle OC_{je}O^{J}=I-OB_{je}O^{J}} . It is diagonalized as well.

Laisser {displaystyle W=OU} , then it is also standard Gaussian. Then we have {style d'affichage Q_{je}=W^{J}(OB_{je}O^{J})O;quad sum _{jneq i}Q_{j}=W^{J}(I-OB_{je}O^{J})O} Inspect their diagonal entries, to see that {style d'affichage Q_{je}perp sum _{jneq i}Q_{j}} implies that their nonzero diagonal entries are disjoint.

Thus all eigenvalues of {style d'affichage B_{je}} sommes 0, 1, alors {style d'affichage Q_{je}} est un {displaystyle chi ^{2}} dist with {style d'affichage r_{je}} degrees of freedom.

Cas: Chaque {style d'affichage Q_{je}} est un {displaystyle chi ^{2}(r_{je})} distribution.

Fix any {style d'affichage i} , diagonalize it by orthogonal transform {style d'affichage O} , and reindex, pour que {displaystyle OB_{je}O^{J}=diag(lambda _{1},...,lambda _{r_{je}},0,...,0)} . Alors {style d'affichage Q_{je}=somme _{j}lambda _{j}{U'}_{j}^{2}} pour certains {displaystyle U'_{j}} , a spherical rotation of {style d'affichage U_{je}} .

Depuis {style d'affichage Q_{je}sim chi ^{2}(r_{je})} , we get all {style d'affichage lambda _{j}=1} . So all {style d'affichage B_{je}succeq 0} , and have eigenvalues {style d'affichage 0,1} .

So diagonalize them simultaneously, add them up, to find {somme de style d'affichage _{je}r_{je}=N} .

Cas: {style d'affichage r_{1}+cdots +r_{k}=N} We first show that the matrices B(je) can be simultaneously diagonalized by an orthogonal matrix and that their non-zero eigenvalues are all equal to +1. Once that's shown, take this orthogonal transform to this simultaneous eigenbasis, in which the random vector {style d'affichage [U_{1},...,U_{N}]^{J}} becomes {style d'affichage [U'_{1},...,U'_{N}]^{J}} , but all {style d'affichage U_{je}'} are still independent and standard Gaussian. Then the result follows.

Each of the matrices B(je) has rank ri and thus ri non-zero eigenvalues. For each i, la somme {displaystyle C^{(je)}equiv sum _{jneq i}B^{(j)}} has at most rank {somme de style d'affichage _{jneq i}r_{j}=N-r_{je}} . Depuis {style d'affichage B^{(je)}+C^{(je)}=I_{Ntimes N}} , it follows that C(je) has exactly rank N − ri.

Therefore B(je) et C(je) can be simultaneously diagonalized. This can be shown by first diagonalizing B(je), by the spectral theorem. In this basis, it is of the form: {style d'affichage {commencer{bmatrice}lambda _{1}&0&0&cdots &cdots &&0\0&lambda _{2}&0&cdots &cdots &&0\0&0&ddots &&&&vdots \vdots &vdots &&lambda _{r_{je}}&&\vdots &vdots &&&0&\0&vdots &&&&ddots \0&0&ldots &&&&0end{bmatrice}}.} Thus the lower {style d'affichage (N-r_{je})} rows are zero. Depuis {displaystyle C^{(je)}=I-B^{(je)}} , it follows that these rows in C(je) in this basis contain a right block which is a {style d'affichage (N-r_{je})fois (N-r_{je})} unit matrix, with zeros in the rest of these rows. But since C(je) has rank N − ri, it must be zero elsewhere. Thus it is diagonal in this basis as well. It follows that all the non-zero eigenvalues of both B(je) et C(je) sommes +1. This argument applies for all i, thus all B(je) are positive semidefinite.

En outre, the above analysis can be repeated in the diagonal basis for {displaystyle C^{(1)}=B^{(2)}+somme _{j>2}B^{(j)}} . In this basis {displaystyle C^{(1)}} is the identity of an {style d'affichage (N-r_{1})fois (N-r_{1})} vector space, so it follows that both B(2) et {somme de style d'affichage _{j>2}B^{(j)}} are simultaneously diagonalizable in this vector space (and hence also together with B(1)). By iteration it follows that all B-s are simultaneously diagonalizable.

Thus there exists an orthogonal matrix {style d'affichage S} telle que pour tout {style d'affichage i} , {style d'affichage S^{mathrm {J} }B^{(je)}Sequiv B^{(je)prime }} is diagonal, where any entry {style d'affichage B_{X,y}^{(je)prime }} with indices {style d'affichage x=y} , {somme de style d'affichage _{j=1}^{i-1}r_{j}

Si vous voulez connaître d'autres articles similaires à Cochran's theorem vous pouvez visiter la catégorie Characterization of probability distributions.

Laisser un commentaire

Votre adresse email ne sera pas publiée.

Monter

Nous utilisons nos propres cookies et ceux de tiers pour améliorer l'expérience utilisateur Plus d'informations