Multivariate Normal Distribution: 1 Random Vector
Multivariate Normal Distribution: 1 Random Vector
Random Vector
1. Discrete Case
If X takes value on a nite or countable set (or each Xi is a discrete random variable),we say X is a discrete random vector. In this case, the distribution of X is driven by the joint probability function. p(t1 , t2 , , tk ) = P (X1 = t1 , , Xk = tk ). 2. Continuous Case
In this case, the distribution of X is driven by the joint probability density function f (x1 , , xk ). The joint density function f satises that for any measurable set A Rk . P (X A) =
A
f (x1 , , xk )dx.
F (t1 , , tk ) = P (X1 t1 , , Xk tk ) =
It is easy to see that f (x1 , , xk ) = 3. Moments E (X ) = (E (X1 ), , E (Xk ))T COV (X ) = E ((X EX )(X EX )T ) = E (XX T ) E (X )E (X )T . It can be seen that for any matrix A, COV (AX ) = ACOV (X )AT . The moment generating function of X is dened as (for t Rk ) MX (t) = E (et
T X )).
k F (x1 , , xk ). x1 xk
Suppose X = (X1 , , Xk ) and Xi are i.i.d. standard normal random variables. Then it is obviously that E (X ) = (0, 0, , 0), COV (X ) = Ik . Then for a n dimensional vector and n k matrix A E ( + AX ) = , COV ( + AX ) = AAT . Denote AAT by , we have the following denition. Denition 1 The distribution of random vector AX is called a multivariate normal distribution with covariance matrix and is denoted by N (0, ). And the distribution of + AX is called a multivariate normal distribution with mean and covariance matrix , N (, ).
To make the denition valid, we need to verify that the distribution of AX depend on A only throuth AAT . We can use the moment generating function to do this.
Suppose the moment generateing function of X is M (t), we know that M (t) = et So the m.g.f. of AX is MAX (t) = E (et
T AX
T t/2
) = M (tT A) = et
T AAT t
This means the m.g.f. of AX depend on A only through AAT , so the distribution of AX only depends on AAT . Based on the denition, we can also calculate the joint pdf of N (, ) (when is full rank),
1 1 T 1 f (x) = ( )n (det||)1/2 e 2 (x) (x) 2
where x = (x1 , , xn )T is a n dimensional vector. We can also see that if Y follows N (, ) distribution then for any matrix B BY N (B, B B T ).