Status: half-baked
This all is true in any number of dimensions, but let’s say we measure two things for a bunch of people. Perhaps it is height and weight, who knows. A measurement is then a vector . If we have many measurements we can put them in a matrix .
Suppose we center these data by subtracting the average vector from each individual vector so that , with being the average. The total variance in these data is , the average squared length of the centered vectors.
The variance along a particular vector is just the average squared length of your centered vectors if you project them onto .
For any orthogonal basis we can have and because of Pythagoras we will have . This means that we can always express the total variance as the sum of variances in orthogonal directions.