[go: nahoru, domu]

Scatter matrix

This is an old revision of this page, as edited by Aaron McDaid (talk | contribs) at 15:21, 24 July 2013 (→‎Definition: Another definition of S, more convenient computationally when you make a small change to the data and wish to update S). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

For the notion in quantum mechanics, see scattering matrix.

In multivariate statistics and probability theory, the scatter matrix is a statistic that is used to make estimates of the covariance matrix of the multivariate normal distribution.

Definition

Given n samples of m-dimensional data, represented as the m-by-n matrix,  , the sample mean is

 

where   is the jth column of  .

The scatter matrix is the m-by-m positive semi-definite matrix

 

where   denotes matrix transpose. The scatter matrix may be expressed more succinctly as

 

where   is the n-by-n centering matrix.

Application

The maximum likelihood estimate, given n samples, for the covariance matrix of a multivariate normal distribution can be expressed as the normalized scatter matrix

 

When the columns of   are independently sampled from a multivariate normal distribution, then   has a Wishart distribution.

See also