The first three posts in this series showed that the orthogonal projection of a (n-1)-dimensional ellipsoid in a n-dimensional Euclidean space has a surface which is a (n-2)-dimensional ellipsoid. Our original ellipsoid is defined by a quadratic form and a respective matrix. The projection can be performed along the direction of an arbitrary vector onto a (n-1)-dimensional subspace, whose vector-elements are orthogonal to the projection vector. This sub-space orthogonally complements the 1-dimensional sub-space given by an infinite line along the projection vector.
In this post I briefly touch a subject which shows us again an interesting link between geometry and linear algebra. We show that the matrix which defines the quadratic form of the border ellipsoid of the projection image is given by a Schur complement of the original ellipsoid’s matrix. This insight can be extended to orthogonal projections onto p-dimensional sub-spaces.
Previous posts
- Post I: Orthogonal projections of multidimensional ellipsoids – I – points on the ellipsoid that give us the surface points of the projection
- Post II: Orthogonal projections of multidimensional ellipsoids – II – the surface of the projection image is a lower dimensional ellipsoid
- Post III: Orthogonal projections of multidimensional ellipsoids – III – arbitrary vectors for the projection direction and cuts of the unit sphere
Quadratic form
We have an ellipsoid in the ℝn. It is defined as a closed (n-1)-dim hypersurface E
As in previous posts we have named the matrix of the quadratic form Σ-1 to keep in touch with Multivariate Normal Distributions. The matrices Σ and Σ-1 are symmetric, invertible and positive-definite. Σ has a Cholesky decomposition
Matrix A generates the ellipsoid when applied to vectors u of the unit sphere 𝕊n-1 (u ∈ 𝕊n-1) :
The Cartesian Coordinate System [CCS] is defined by orthogonal unit-vectors ei (1 ≤ i ≤ n). We write down our vectors x and the matrix in sectioned form – one component of vector z along en and the rest in the (n-1)-dim sub-space spanned by ei (1 ≤ i ≤ n-1):
We can always reorganize the order of unit vectors that our chosen unit vector is the last one. Remember that the change of order of unit vectors corresponds to an orthogonal transformation to a new base of the ℝn. So, without loosing generality we choose en to give us the projection’s direction below. To cover the effect upon the matrix we write it down partitioned into blocks:
The introduction of Q is just to make writing simpler as we can omit the {-1}. We see
Orthogonal projection and condition for tangential points
We introduce the projection P along en as a projection matrix
What property characterizes the critical points xet,
on the ellipsoid that give us the surface of the projection image? Answer from previous posts: en must be tangential and orthogonal to the normal vector of the ellipsoid’s surface manifold at such a tangential point. As we have seen in previous posts that the normal vector to the ellipsoid’s surface at vector xe is given by Qxe. Thus the condition is:
Furthermore, we know
The vectors (yet , 0)T reside in a subspace Vp ℝn whose vectors are orthogonal to en:
This sub-space is a representation of the ℝ(n-1). So far, we learned nothing new compared to previous posts. But now let us draw conclusions regarding the structure of Q. A quick check of the matrix multiplication shows that condition (9) means
Now we do once again the same as in previous posts. We apply condition (1) to our tangential points xet, resolve the matrix multiplication and insert eq. (12) for zet:
We found again a matrix mediating a quadratic form – this time working on the reduced vectors yet.
The matrix of the quadratic form for the surface of the projection image is a Schur complement
Eq. (16) above gives us a certain symmetric reduced ((n-1) x (n-1)) matrix QP,red, which affects the projection image vectors (yet) ∈ SP of the vectors xet (giving the tangential points on the ellipsoid):
It is relatively easy to show that this matrix is positive definite because Q was it. Just regard eigenvalues and the above analysis in an eigen-system. The matrix defined by eq. (17) is just the so called Schur complement of block qzz of matrix Q (= Σ-1).
Now, we could repeat the whole argumentation for our (n-2)-dimensional ellipsoid defined by eq. (16) with a further projection down to an even lower-dimensional sub-space. However, we repeat the whole process for a projection operator that eliminates (n-p) dimensions to see effects regarding the matrix structure in cases of orthogonal projections down to p-dimensional sub-space.
Generalization – orthogonal projection down to a p-dimensional sub-space
We introduce a (nxn)-projection operators Pnp and Pn(n-p) as
A matrix 𝕀q is the (qxq)-identiy matrix. The matrices 𝕆mxn are (mxn) zero matrices. Let us look at special sets of vectors, Vp ⊂ ℝn and Vn-p ⊂ ℝn:
Vectors y ∈ Vp and z ∈ Vn-p are orthogonal to each other. The sets, Vp and Vn-p, correspond to two orthogonal sub-vector-spaces, SP and (SP)⊥, of the ℝn. Subspace SP is p-dimensional, subspace (SP)⊥ is (n-p)-dimensional. They are complementary regarding the full ℝn :
We introduce respective reduced vectors
Projection Pnp produces vectors y in Vp. It is therefore natural to write the vectors xe ∈ E as a sum of a vector ye ∈ Vp and a ze ∈ Vn-p :
To understand matrix-operations working on the reduced vectors, we partition the (n x n) matrix Q into respective blocks with respect to the orthogonal sub-spaces (SP ⊕ (SP)⊥):
We use the orthogonality condition for tangent vectors along the relevant ei (p+1 ≤ i ≤ n) spanning Vn-p. You can imagine the resulting condition as a summary of searching for the the relevant points for surfaces of (n-p) subsequent projections, each projection along one of the series ep+1, ep+2, … en, eventually down to VP (or regarding the reduced vectors to SP). Note also: Due to the quadratic form, for any vector xe on the ellipsoid the y-part is maximized if the z-related part is minimized:
The respective minimization condition reduces the number of relevant tangential points for the surface of the projection image:
Note that this actually means the respective original vectors uet of our xet must be vertical to all back-transformed A-1ej (p+1 ≤ j ≤ n)
This can be used to find the relevant vectors uet of the unit sphere 𝕊n-1 from which we can construct the tangential points xet with the help of matrix A. In one of the next post we will use this insight for a numerical experiment.
Let us return to the partitioned matrix Q. Expanding condition (28) and checking for relevant sub-matrices gives us:
Now we evaluate the quadratic form for our tangential points on the ellipsoid, which give us the surface points of the projection image,
Due to our solution of condition (28), the right side of eq. (27) reduces to 1 and eq.(28) can be simplified for our tangent vectors:
This leads to
The zero sub-vector on the left side leads to the following expressions on the left side to
We insert our result of eq. (30) to get
Again, we find a reduced matrix QP,red which provides a quadratic form for the reduced vectors in sub-space SP (corresponding to the target space of projection Pnp) :
As we projected the “tangential” vectors, this quadratic form controls the (p-1)-dimensional surface of the projection image of the original ellipsoid. This defined a (p-1)-dimensional ellipsoid ESp :
And, most importantly: The matrix QP,red is the Schur complement of the original matrix Q with respect to block Qzz (based on a block partitioning with respect to the complementary sub-spaces SP and (SP)⊥).
Conclusion
An orthogonal projection of an (n-1)-dimensional ellipsoid in the ℝn down to a p-dimensional sub-space results in a p-dimensional volume figure. The (p-1)-dimensional surface of this figure is an ellipsoid, too. The matrix controlling the quadratic form in the projection’s target sub-space is a Schur complement of the original matrix Q (= Σ-1). The Schur complement relates to matrix blocks which reflect complementary orthogonal p– and (n-p)-dimensional sub-spaces of the ℝn.
In the next post we are going to use this result to find out, how the respective inverse matrices Q-1= Σ and [QP,red]-1=ΣP,red are related. We shall see that this relation is rather simple. Stay tuned …