Problem 1
Find the population principal components Y1 and Y2 for the covariance matrix . Then, calculate the proportion of the total population variance explained by the first principal component.
Answer
Find the eigenvalues and the corresponding eigenvectors.
(λ-5)(λ-2) – (-2)(-2) = 0
(λ2 – 7λ + 10) – 4 = 0
λ2 – 7λ + 6 = 0
(λ-6)(λ-1) = 0
λ1 = 6 and λ2 = 1
From λ = 6 we get the eigenvector and from λ = 1 we get the eigenvector .
Determine the population principal components.
First principal component:
Second principal component:
The proportion of the total population variance explained by the first principal component:
Problem 2
Convert the covariance matrix in Problem 1 to a correlation matrix ρ.
- Determine the principal components Y1 and Y2 from ρ and compute the proportion of total population variance explained by Y1.
- Compare the components calculated in Part 1 with those obtained in Problem 1. Are they the same? Should they be?
- Compute the correlations and .
Answer
Part 1
The correlation matrix of Σ is:
Find the eigenvalues and eigenvectors.
The eigenvalue λ1 gives the eigenvector while λ2 yields .
First principal component:
Second principal component:
The proportion of total population variance explained by Y1:
Part 2
The principal components obtained from Σ and ρ are not the same. In general, the two matrices produce different eigenvalues and eigenvectors.
Part 3
Find the correlations and :
The formula used to calculate the correlation between the component Yi and the original variable Xk is (see Theorem 3 in the article: Population Principal Components). However, since the correlation matrix is used as the basis for determining the principal components, σ11 = σ22 = 1, thus in this case .
Problem 3
Let . Determine the principal components Y1, Y2, and Y3. What can you say about the eigenvectors (and principal components) associated with eigenvalues that are not distinct?
Answer
The characteristic equation of Σ is (λ-2)(λ-4)2 = 0 and this gives the eigenvalues λ1 = λ2 = 4 and λ3 = 2.
The eigenvector obtained from λ3 = 2 is . The resulting eigenspace from λ1 = λ2 = 4 has the vectors and as the basis vectors.
From these results, we obtain the following principal components.
Y1 = X2
Y2 = X3
Y3 = X1
Below, we show that the principal components are not unique.
Note that and are two independent vectors in E1, hence they collectively form a basis for E1. By applying the Gram-Schmidt process, the orthonormal basis vectors and can be determined as follows.
From , , and we get the principal components that differ from the ones previously-obtained, i.e.:
Note that other than the pair of and , there are infinitely other pairs of basis vectors for E1 and those other pairs of basis vectors will produce other principal components as well. In conclusion, if there are eigenvalues that are not distinct then the principal components corresponding to the eigenvalues are not unique.
Problem 4
Determine the principal components and the proportion of the total population variance explained by each component when the covariance matrix is:
where .
Answer
The characteristic equation of Σ is:
Solutions to the quadratic equation in λ are and . As a consequence, Σ yields three distinct eigenvalues λ1, λ2 and λ3, where , , and .
Moreover, it can be proved that λ1, λ2, and λ3 result in the eigenvectors , , and , respectively.
The principal components obtained from Σ are:
The proportions of the total population variance explained by the components are: