fbpx
Wikipedia

Centering matrix

In mathematics and multivariate statistics, the centering matrix[1] is a symmetric and idempotent matrix, which when multiplied with a vector has the same effect as subtracting the mean of the components of the vector from every component of that vector.

Definition edit

The centering matrix of size n is defined as the n-by-n matrix

 

where   is the identity matrix of size n and   is an n-by-n matrix of all 1's.

For example

 ,
  ,
 

Properties edit

Given a column-vector,   of size n, the centering property of   can be expressed as

 

where   is a column vector of ones and   is the mean of the components of  .

  is symmetric positive semi-definite.

  is idempotent, so that  , for  . Once the mean has been removed, it is zero and removing it again has no effect.

  is singular. The effects of applying the transformation   cannot be reversed.

  has the eigenvalue 1 of multiplicity n − 1 and eigenvalue 0 of multiplicity 1.

  has a nullspace of dimension 1, along the vector  .

  is an orthogonal projection matrix. That is,   is a projection of   onto the (n − 1)-dimensional subspace that is orthogonal to the nullspace  . (This is the subspace of all n-vectors whose components sum to zero.)

The trace of   is  .

Application edit

Although multiplication by the centering matrix is not a computationally efficient way of removing the mean from a vector, it is a convenient analytical tool. It can be used not only to remove the mean of a single vector, but also of multiple vectors stored in the rows or columns of an m-by-n matrix  .

The left multiplication by   subtracts a corresponding mean value from each of the n columns, so that each column of the product   has a zero mean. Similarly, the multiplication by   on the right subtracts a corresponding mean value from each of the m rows, and each row of the product   has a zero mean. The multiplication on both sides creates a doubly centred matrix  , whose row and column means are equal to zero.

The centering matrix provides in particular a succinct way to express the scatter matrix,   of a data sample  , where   is the sample mean. The centering matrix allows us to express the scatter matrix more compactly as

 

  is the covariance matrix of the multinomial distribution, in the special case where the parameters of that distribution are  , and  .

References edit

  1. ^ John I. Marden, Analyzing and Modeling Rank Data, Chapman & Hall, 1995, ISBN 0-412-99521-2, page 59.

centering, matrix, mathematics, multivariate, statistics, centering, matrix, symmetric, idempotent, matrix, which, when, multiplied, with, vector, same, effect, subtracting, mean, components, vector, from, every, component, that, vector, contents, definition, . In mathematics and multivariate statistics the centering matrix 1 is a symmetric and idempotent matrix which when multiplied with a vector has the same effect as subtracting the mean of the components of the vector from every component of that vector Contents 1 Definition 2 Properties 3 Application 4 ReferencesDefinition editThe centering matrix of size n is defined as the n by n matrix C n I n 1 n J n displaystyle C n I n tfrac 1 n J n nbsp where I n displaystyle I n nbsp is the identity matrix of size n and J n displaystyle J n nbsp is an n by n matrix of all 1 s For example C 1 0 displaystyle C 1 begin bmatrix 0 end bmatrix nbsp C 2 1 0 0 1 1 2 1 1 1 1 1 2 1 2 1 2 1 2 displaystyle C 2 left begin array rrr 1 amp 0 0 amp 1 end array right frac 1 2 left begin array rrr 1 amp 1 1 amp 1 end array right left begin array rrr frac 1 2 amp frac 1 2 frac 1 2 amp frac 1 2 end array right nbsp C 3 1 0 0 0 1 0 0 0 1 1 3 1 1 1 1 1 1 1 1 1 2 3 1 3 1 3 1 3 2 3 1 3 1 3 1 3 2 3 displaystyle C 3 left begin array rrr 1 amp 0 amp 0 0 amp 1 amp 0 0 amp 0 amp 1 end array right frac 1 3 left begin array rrr 1 amp 1 amp 1 1 amp 1 amp 1 1 amp 1 amp 1 end array right left begin array rrr frac 2 3 amp frac 1 3 amp frac 1 3 frac 1 3 amp frac 2 3 amp frac 1 3 frac 1 3 amp frac 1 3 amp frac 2 3 end array right nbsp Properties editGiven a column vector v displaystyle mathbf v nbsp of size n the centering property of C n displaystyle C n nbsp can be expressed as C n v v 1 n J n 1 T v J n 1 displaystyle C n mathbf v mathbf v tfrac 1 n J n 1 textrm T mathbf v J n 1 nbsp where J n 1 displaystyle J n 1 nbsp is a column vector of ones and 1 n J n 1 T v displaystyle tfrac 1 n J n 1 textrm T mathbf v nbsp is the mean of the components of v displaystyle mathbf v nbsp C n displaystyle C n nbsp is symmetric positive semi definite C n displaystyle C n nbsp is idempotent so that C n k C n displaystyle C n k C n nbsp for k 1 2 displaystyle k 1 2 ldots nbsp Once the mean has been removed it is zero and removing it again has no effect C n displaystyle C n nbsp is singular The effects of applying the transformation C n v displaystyle C n mathbf v nbsp cannot be reversed C n displaystyle C n nbsp has the eigenvalue 1 of multiplicity n 1 and eigenvalue 0 of multiplicity 1 C n displaystyle C n nbsp has a nullspace of dimension 1 along the vector J n 1 displaystyle J n 1 nbsp C n displaystyle C n nbsp is an orthogonal projection matrix That is C n v displaystyle C n mathbf v nbsp is a projection of v displaystyle mathbf v nbsp onto the n 1 dimensional subspace that is orthogonal to the nullspace J n 1 displaystyle J n 1 nbsp This is the subspace of all n vectors whose components sum to zero The trace of C n displaystyle C n nbsp is n n 1 n n 1 displaystyle n n 1 n n 1 nbsp Application editAlthough multiplication by the centering matrix is not a computationally efficient way of removing the mean from a vector it is a convenient analytical tool It can be used not only to remove the mean of a single vector but also of multiple vectors stored in the rows or columns of an m by n matrix X displaystyle X nbsp The left multiplication by C m displaystyle C m nbsp subtracts a corresponding mean value from each of the n columns so that each column of the product C m X displaystyle C m X nbsp has a zero mean Similarly the multiplication by C n displaystyle C n nbsp on the right subtracts a corresponding mean value from each of the m rows and each row of the product X C n displaystyle X C n nbsp has a zero mean The multiplication on both sides creates a doubly centred matrix C m X C n displaystyle C m X C n nbsp whose row and column means are equal to zero The centering matrix provides in particular a succinct way to express the scatter matrix S X m J n 1 T X m J n 1 T T displaystyle S X mu J n 1 mathrm T X mu J n 1 mathrm T mathrm T nbsp of a data sample X displaystyle X nbsp where m 1 n X J n 1 displaystyle mu tfrac 1 n XJ n 1 nbsp is the sample mean The centering matrix allows us to express the scatter matrix more compactly as S X C n X C n T X C n C n X T X C n X T displaystyle S X C n X C n mathrm T X C n C n X mathrm T X C n X mathrm T nbsp C n displaystyle C n nbsp is the covariance matrix of the multinomial distribution in the special case where the parameters of that distribution are k n displaystyle k n nbsp and p 1 p 2 p n 1 n displaystyle p 1 p 2 cdots p n frac 1 n nbsp References edit John I Marden Analyzing and Modeling Rank Data Chapman amp Hall 1995 ISBN 0 412 99521 2 page 59 Retrieved from https en wikipedia org w index php title Centering matrix amp oldid 1138070810, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.