# Why is the second Principal Component orthogonal to the first one?

Because **the second Principal Component** should capture the highest variance **from what is left** after the first Principal Component explains the data as much as it can. (The first principal component has the largest possible variance, that is, accounts for as much of the variability in the data as possible.)

Then where should we look for the…