fbpx
Wikipedia

Projection (linear algebra)

In linear algebra and functional analysis, a projection is a linear transformation from a vector space to itself (an endomorphism) such that . That is, whenever is applied twice to any vector, it gives the same result as if it were applied once (i.e. is idempotent). It leaves its image unchanged.[1] This definition of "projection" formalizes and generalizes the idea of graphical projection. One can also consider the effect of a projection on a geometrical object by examining the effect of the projection on points in the object.

The transformation P is the orthogonal projection onto the line m.

Definitions edit

A projection on a vector space   is a linear operator   such that  .

When   has an inner product and is complete, i.e. when   is a Hilbert space, the concept of orthogonality can be used. A projection   on a Hilbert space   is called an orthogonal projection if it satisfies   for all  . A projection on a Hilbert space that is not orthogonal is called an oblique projection.

Projection matrix edit

  • A square matrix   is called a projection matrix if it is equal to its square, i.e. if  .[2]: p. 38 
  • A square matrix   is called an orthogonal projection matrix if   for a real matrix, and respectively   for a complex matrix, where   denotes the transpose of   and   denotes the adjoint or Hermitian transpose of  .[2]: p. 223 
  • A projection matrix that is not an orthogonal projection matrix is called an oblique projection matrix.

The eigenvalues of a projection matrix must be 0 or 1.

Examples edit

Orthogonal projection edit

For example, the function which maps the point   in three-dimensional space   to the point   is an orthogonal projection onto the xy-plane. This function is represented by the matrix

 

The action of this matrix on an arbitrary vector is

 

To see that   is indeed a projection, i.e.,  , we compute

 

Observing that   shows that the projection is an orthogonal projection.

Oblique projection edit

A simple example of a non-orthogonal (oblique) projection is

 

Via matrix multiplication, one sees that

 
showing that   is indeed a projection.

The projection   is orthogonal if and only if   because only then  

Properties and classification edit

 
The transformation T is the projection along k onto m. The range of T is m and the kernel is k.

Idempotence edit

By definition, a projection   is idempotent (i.e.  ).

Open map edit

Every projection is an open map, meaning that it maps each open set in the domain to an open set in the subspace topology of the image.[citation needed] That is, for any vector   and any ball   (with positive radius) centered on  , there exists a ball   (with positive radius) centered on   that is wholly contained in the image  .

Complementarity of image and kernel edit

Let   be a finite-dimensional vector space and   be a projection on  . Suppose the subspaces   and   are the image and kernel of   respectively. Then   has the following properties:

  1.   is the identity operator   on  :
     
  2. We have a direct sum  . Every vector   may be decomposed uniquely as   with   and  , and where  

The image and kernel of a projection are complementary, as are   and  . The operator   is also a projection as the image and kernel of   become the kernel and image of   and vice versa. We say   is a projection along   onto   (kernel/image) and   is a projection along   onto  .

Spectrum edit

In infinite-dimensional vector spaces, the spectrum of a projection is contained in   as

 
Only 0 or 1 can be an eigenvalue of a projection. This implies that an orthogonal projection   is always a positive semi-definite matrix. In general, the corresponding eigenspaces are (respectively) the kernel and range of the projection. Decomposition of a vector space into direct sums is not unique. Therefore, given a subspace  , there may be many projections whose range (or kernel) is  .

If a projection is nontrivial it has minimal polynomial  , which factors into distinct linear factors, and thus   is diagonalizable.

Product of projections edit

The product of projections is not in general a projection, even if they are orthogonal. If two projections commute then their product is a projection, but the converse is false: the product of two non-commuting projections may be a projection.

If two orthogonal projections commute then their product is an orthogonal projection. If the product of two orthogonal projections is an orthogonal projection, then the two orthogonal projections commute (more generally: two self-adjoint endomorphisms commute if and only if their product is self-adjoint).

Orthogonal projections edit

When the vector space   has an inner product and is complete (is a Hilbert space) the concept of orthogonality can be used. An orthogonal projection is a projection for which the range   and the kernel   are orthogonal subspaces. Thus, for every   and   in  ,  . Equivalently:

 

A projection is orthogonal if and only if it is self-adjoint. Using the self-adjoint and idempotent properties of  , for any   and   in   we have  ,  , and

 
where   is the inner product associated with  . Therefore,   and   are orthogonal projections.[3] The other direction, namely that if   is orthogonal then it is self-adjoint, follows from the implication from   to
 
for every   and   in  ; thus  .

The existence of an orthogonal projection onto a closed subspace follows from the Hilbert projection theorem.

Properties and special cases edit

An orthogonal projection is a bounded operator. This is because for every   in the vector space we have, by the Cauchy–Schwarz inequality:

 
Thus  .

For finite-dimensional complex or real vector spaces, the standard inner product can be substituted for  .

Formulas edit

A simple case occurs when the orthogonal projection is onto a line. If   is a unit vector on the line, then the projection is given by the outer product

 
(If   is complex-valued, the transpose in the above equation is replaced by a Hermitian transpose). This operator leaves u invariant, and it annihilates all vectors orthogonal to  , proving that it is indeed the orthogonal projection onto the line containing u.[4] A simple way to see this is to consider an arbitrary vector   as the sum of a component on the line (i.e. the projected vector we seek) and another perpendicular to it,  . Applying projection, we get
 
by the properties of the dot product of parallel and perpendicular vectors.

This formula can be generalized to orthogonal projections on a subspace of arbitrary dimension. Let   be an orthonormal basis of the subspace  , with the assumption that the integer  , and let   denote the   matrix whose columns are  , i.e.,  . Then the projection is given by:[5]

 
which can be rewritten as
 

The matrix   is the partial isometry that vanishes on the orthogonal complement of  , and   is the isometry that embeds   into the underlying vector space. The range of   is therefore the final space of  . It is also clear that   is the identity operator on  .

The orthonormality condition can also be dropped. If   is a (not necessarily orthonormal) basis with  , and   is the matrix with these vectors as columns, then the projection is:[6][7]

 

The matrix   still embeds   into the underlying vector space but is no longer an isometry in general. The matrix   is a "normalizing factor" that recovers the norm. For example, the rank-1 operator   is not a projection if   After dividing by   we obtain the projection   onto the subspace spanned by  .

In the general case, we can have an arbitrary positive definite matrix   defining an inner product  , and the projection   is given by  . Then

 

When the range space of the projection is generated by a frame (i.e. the number of generators is greater than its dimension), the formula for the projection takes the form:  . Here   stands for the Moore–Penrose pseudoinverse. This is just one of many ways to construct the projection operator.

If   is a non-singular matrix and   (i.e.,   is the null space matrix of  ),[8] the following holds:

 

If the orthogonal condition is enhanced to   with   non-singular, the following holds:

 

All these formulas also hold for complex inner product spaces, provided that the conjugate transpose is used instead of the transpose. Further details on sums of projectors can be found in Banerjee and Roy (2014).[9] Also see Banerjee (2004)[10] for application of sums of projectors in basic spherical trigonometry.

Oblique projections edit

The term oblique projections is sometimes used to refer to non-orthogonal projections. These projections are also used to represent spatial figures in two-dimensional drawings (see oblique projection), though not as frequently as orthogonal projections. Whereas calculating the fitted value of an ordinary least squares regression requires an orthogonal projection, calculating the fitted value of an instrumental variables regression requires an oblique projection.

A projection is defined by its kernel and the basis vectors used to characterize its range (which is a complement of the kernel). When these basis vectors are orthogonal to the kernel, then the projection is an orthogonal projection. When these basis vectors are not orthogonal to the kernel, the projection is an oblique projection, or just a projection.

A matrix representation formula for a nonzero projection operator edit

Let   be a linear operator,   such that   and assume that   is not the zero operator. Let the vectors   form a basis for the range of  , and assemble these vectors in the   matrix  . Therefore the integer  , otherwise   and   is the zero operator. The range and the kernel are complementary spaces, so the kernel has dimension  . It follows that the orthogonal complement of the kernel has dimension  . Let   form a basis for the orthogonal complement of the kernel of the projection, and assemble these vectors in the matrix  . Then the projection   (with the condition  ) is given by

 

This expression generalizes the formula for orthogonal projections given above.[11][12] A standard proof of this expression is the following. For any vector   in the vector space  , we can decompose  , where vector   is in the image of  , and vector   So  , and then   is in the kernel of  , which is the null space of   In other words, the vector   is in the column space of   so   for some   dimension vector   and the vector   satisfies   by the construction of  . Put these conditions together, and we find a vector   so that  . Since matrices   and   are of full rank   by their construction, the  -matrix   is invertible. So the equation   gives the vector   In this way,   for any vector   and hence  .

In the case that   is an orthogonal projection, we can take  , and it follows that  . By using this formula, one can easily check that  . In general, if the vector space is over complex number field, one then uses the Hermitian transpose   and has the formula  . Recall that one can define the Moore–Penrose inverse of the matrix   by   since   has full column rank, so  .

Singular values edit

Note that   is also an oblique projection. The singular values of   and   can be computed by an orthonormal basis of  . Let   be an orthonormal basis of   and let   be the orthogonal complement of  . Denote the singular values of the matrix   by the positive values  . With this, the singular values for   are:[13]

 
and the singular values for   are
 
This implies that the largest singular values of   and   are equal, and thus that the matrix norm of the oblique projections are the same. However, the condition number satisfies the relation  , and is therefore not necessarily equal.

Finding projection with an inner product edit

Let   be a vector space (in this case a plane) spanned by orthogonal vectors  . Let   be a vector. One can define a projection of   onto   as

 
where repeated indices are summed over (Einstein sum notation). The vector   can be written as an orthogonal sum such that  .   is sometimes denoted as  . There is a theorem in linear algebra that states that this   is the smallest distance (the orthogonal distance) from   to   and is commonly used in areas such as machine learning.
 
y is being projected onto the vector space V.

Canonical forms edit

Any projection   on a vector space of dimension   over a field is a diagonalizable matrix, since its minimal polynomial divides  , which splits into distinct linear factors. Thus there exists a basis in which   has the form

 

where   is the rank of  . Here   is the identity matrix of size  ,   is the zero matrix of size  , and   is the direct sum operator. If the vector space is complex and equipped with an inner product, then there is an orthonormal basis in which the matrix of P is[14]

 

where  . The integers   and the real numbers   are uniquely determined. Note that  . The factor   corresponds to the maximal invariant subspace on which   acts as an orthogonal projection (so that P itself is orthogonal if and only if  ) and the  -blocks correspond to the oblique components.

Projections on normed vector spaces edit

When the underlying vector space   is a (not necessarily finite-dimensional) normed vector space, analytic questions, irrelevant in the finite-dimensional case, need to be considered. Assume now   is a Banach space.

Many of the algebraic results discussed above survive the passage to this context. A given direct sum decomposition of   into complementary subspaces still specifies a projection, and vice versa. If   is the direct sum  , then the operator defined by   is still a projection with range   and kernel  . It is also clear that  . Conversely, if   is projection on  , i.e.  , then it is easily verified that  . In other words,   is also a projection. The relation   implies   and   is the direct sum  .

However, in contrast to the finite-dimensional case, projections need not be continuous in general. If a subspace   of   is not closed in the norm topology, then the projection onto   is not continuous. In other words, the range of a continuous projection   must be a closed subspace. Furthermore, the kernel of a continuous projection (in fact, a continuous linear operator in general) is closed. Thus a continuous projection   gives a decomposition of   into two complementary closed subspaces:  .

The converse holds also, with an additional assumption. Suppose   is a closed subspace of  . If there exists a closed subspace   such that X = UV, then the projection   with range   and kernel   is continuous. This follows from the closed graph theorem. Suppose xnx and Pxny. One needs to show that  . Since   is closed and {Pxn} ⊂ U, y lies in  , i.e. Py = y. Also, xnPxn = (IP)xnxy. Because   is closed and {(IP)xn} ⊂ V, we have  , i.e.  , which proves the claim.

The above argument makes use of the assumption that both   and   are closed. In general, given a closed subspace  , there need not exist a complementary closed subspace  , although for Hilbert spaces this can always be done by taking the orthogonal complement. For Banach spaces, a one-dimensional subspace always has a closed complementary subspace. This is an immediate consequence of Hahn–Banach theorem. Let   be the linear span of  . By Hahn–Banach, there exists a bounded linear functional   such that φ(u) = 1. The operator   satisfies  , i.e. it is a projection. Boundedness of   implies continuity of   and therefore   is a closed complementary subspace of  .

Applications and further considerations edit

Projections (orthogonal and otherwise) play a major role in algorithms for certain linear algebra problems:

As stated above, projections are a special case of idempotents. Analytically, orthogonal projections are non-commutative generalizations of characteristic functions. Idempotents are used in classifying, for instance, semisimple algebras, while measure theory begins with considering characteristic functions of measurable sets. Therefore, as one can imagine, projections are very often encountered in the context of operator algebras. In particular, a von Neumann algebra is generated by its complete lattice of projections.

Generalizations edit

More generally, given a map between normed vector spaces   one can analogously ask for this map to be an isometry on the orthogonal complement of the kernel: that   be an isometry (compare Partial isometry); in particular it must be onto. The case of an orthogonal projection is when W is a subspace of V. In Riemannian geometry, this is used in the definition of a Riemannian submersion.

See also edit

Notes edit

  1. ^ Meyer, pp 386+387
  2. ^ a b Horn, Roger A.; Johnson, Charles R. (2013). Matrix Analysis, second edition. Cambridge University Press. ISBN 9780521839402.
  3. ^ Meyer, p. 433
  4. ^ Meyer, p. 431
  5. ^ Meyer, equation (5.13.4)
  6. ^ Banerjee, Sudipto; Roy, Anindya (2014), Linear Algebra and Matrix Analysis for Statistics, Texts in Statistical Science (1st ed.), Chapman and Hall/CRC, ISBN 978-1420095388
  7. ^ Meyer, equation (5.13.3)
  8. ^ See also Linear least squares (mathematics) § Properties of the least-squares estimators.
  9. ^ Banerjee, Sudipto; Roy, Anindya (2014), Linear Algebra and Matrix Analysis for Statistics, Texts in Statistical Science (1st ed.), Chapman and Hall/CRC, ISBN 978-1420095388
  10. ^ Banerjee, Sudipto (2004), "Revisiting Spherical Trigonometry with Orthogonal Projectors", The College Mathematics Journal, 35 (5): 375–381, doi:10.1080/07468342.2004.11922099, S2CID 122277398
  11. ^ Banerjee, Sudipto; Roy, Anindya (2014), Linear Algebra and Matrix Analysis for Statistics, Texts in Statistical Science (1st ed.), Chapman and Hall/CRC, ISBN 978-1420095388
  12. ^ Meyer, equation (7.10.39)
  13. ^ Brust, J. J.; Marcia, R. F.; Petra, C. G. (2020), "Computationally Efficient Decompositions of Oblique Projection Matrices", SIAM Journal on Matrix Analysis and Applications, 41 (2): 852–870, doi:10.1137/19M1288115, OSTI 1680061, S2CID 219921214
  14. ^ Doković, D. Ž. (August 1991). "Unitary similarity of projectors". Aequationes Mathematicae. 42 (1): 220–224. doi:10.1007/BF01818492. S2CID 122704926.

References edit

  • Banerjee, Sudipto; Roy, Anindya (2014), Linear Algebra and Matrix Analysis for Statistics, Texts in Statistical Science (1st ed.), Chapman and Hall/CRC, ISBN 978-1420095388
  • Dunford, N.; Schwartz, J. T. (1958). Linear Operators, Part I: General Theory. Interscience.
  • Meyer, Carl D. (2000). Matrix Analysis and Applied Linear Algebra. Society for Industrial and Applied Mathematics. ISBN 978-0-89871-454-8.

External links edit

  • MIT Linear Algebra Lecture on Projection Matrices on YouTube, from MIT OpenCourseWare
  • Linear Algebra 15d: The Projection Transformation on YouTube, by Pavel Grinfeld.
  • Planar Geometric Projections Tutorial – a simple-to-follow tutorial explaining the different types of planar geometric projections.

projection, linear, algebra, orthogonal, projection, redirects, here, technical, drawing, concept, orthographic, projection, concrete, discussion, orthogonal, projections, finite, dimensional, linear, spaces, vector, projection, linear, algebra, functional, an. Orthogonal projection redirects here For the technical drawing concept see Orthographic projection For a concrete discussion of orthogonal projections in finite dimensional linear spaces see Vector projection In linear algebra and functional analysis a projection is a linear transformation P displaystyle P from a vector space to itself an endomorphism such that P P P displaystyle P circ P P That is whenever P displaystyle P is applied twice to any vector it gives the same result as if it were applied once i e P displaystyle P is idempotent It leaves its image unchanged 1 This definition of projection formalizes and generalizes the idea of graphical projection One can also consider the effect of a projection on a geometrical object by examining the effect of the projection on points in the object The transformation P is the orthogonal projection onto the line m Contents 1 Definitions 1 1 Projection matrix 2 Examples 2 1 Orthogonal projection 2 2 Oblique projection 3 Properties and classification 3 1 Idempotence 3 2 Open map 3 3 Complementarity of image and kernel 3 4 Spectrum 3 5 Product of projections 3 6 Orthogonal projections 3 6 1 Properties and special cases 3 6 1 1 Formulas 3 7 Oblique projections 3 7 1 A matrix representation formula for a nonzero projection operator 3 7 2 Singular values 3 8 Finding projection with an inner product 4 Canonical forms 5 Projections on normed vector spaces 6 Applications and further considerations 7 Generalizations 8 See also 9 Notes 10 References 11 External linksDefinitions editA projection on a vector space V displaystyle V nbsp is a linear operator P V V displaystyle P colon V to V nbsp such that P2 P displaystyle P 2 P nbsp When V displaystyle V nbsp has an inner product and is complete i e when V displaystyle V nbsp is a Hilbert space the concept of orthogonality can be used A projection P displaystyle P nbsp on a Hilbert space V displaystyle V nbsp is called an orthogonal projection if it satisfies Px y x Py displaystyle langle P mathbf x mathbf y rangle langle mathbf x P mathbf y rangle nbsp for all x y V displaystyle mathbf x mathbf y in V nbsp A projection on a Hilbert space that is not orthogonal is called an oblique projection Projection matrix edit A square matrix P displaystyle P nbsp is called a projection matrix if it is equal to its square i e if P2 P displaystyle P 2 P nbsp 2 p 38 A square matrix P displaystyle P nbsp is called an orthogonal projection matrix if P2 P PT displaystyle P 2 P P mathrm T nbsp for a real matrix and respectively P2 P P displaystyle P 2 P P nbsp for a complex matrix where PT displaystyle P mathrm T nbsp denotes the transpose of P displaystyle P nbsp and P displaystyle P nbsp denotes the adjoint or Hermitian transpose of P displaystyle P nbsp 2 p 223 A projection matrix that is not an orthogonal projection matrix is called an oblique projection matrix The eigenvalues of a projection matrix must be 0 or 1 Examples editOrthogonal projection edit For example the function which maps the point x y z displaystyle x y z nbsp in three dimensional space R3 displaystyle mathbb R 3 nbsp to the point x y 0 displaystyle x y 0 nbsp is an orthogonal projection onto the xy plane This function is represented by the matrixP 100010000 displaystyle P begin bmatrix 1 amp 0 amp 0 0 amp 1 amp 0 0 amp 0 amp 0 end bmatrix nbsp The action of this matrix on an arbitrary vector isP xyz xy0 displaystyle P begin bmatrix x y z end bmatrix begin bmatrix x y 0 end bmatrix nbsp To see that P displaystyle P nbsp is indeed a projection i e P P2 displaystyle P P 2 nbsp we computeP2 xyz P xy0 xy0 P xyz displaystyle P 2 begin bmatrix x y z end bmatrix P begin bmatrix x y 0 end bmatrix begin bmatrix x y 0 end bmatrix P begin bmatrix x y z end bmatrix nbsp Observing that PT P displaystyle P mathrm T P nbsp shows that the projection is an orthogonal projection Oblique projection edit A simple example of a non orthogonal oblique projection isP 00a1 displaystyle P begin bmatrix 0 amp 0 alpha amp 1 end bmatrix nbsp Via matrix multiplication one sees thatP2 00a1 00a1 00a1 P displaystyle P 2 begin bmatrix 0 amp 0 alpha amp 1 end bmatrix begin bmatrix 0 amp 0 alpha amp 1 end bmatrix begin bmatrix 0 amp 0 alpha amp 1 end bmatrix P nbsp showing that P displaystyle P nbsp is indeed a projection The projection P displaystyle P nbsp is orthogonal if and only if a 0 displaystyle alpha 0 nbsp because only then PT P displaystyle P mathrm T P nbsp Properties and classification edit nbsp The transformation T is the projection along k onto m The range of T is m and the kernel is k Idempotence edit By definition a projection P displaystyle P nbsp is idempotent i e P2 P displaystyle P 2 P nbsp Open map edit Every projection is an open map meaning that it maps each open set in the domain to an open set in the subspace topology of the image citation needed That is for any vector x displaystyle mathbf x nbsp and any ball Bx displaystyle B mathbf x nbsp with positive radius centered on x displaystyle mathbf x nbsp there exists a ball BPx displaystyle B P mathbf x nbsp with positive radius centered on Px displaystyle P mathbf x nbsp that is wholly contained in the image P Bx displaystyle P B mathbf x nbsp Complementarity of image and kernel edit Let W displaystyle W nbsp be a finite dimensional vector space and P displaystyle P nbsp be a projection on W displaystyle W nbsp Suppose the subspaces U displaystyle U nbsp and V displaystyle V nbsp are the image and kernel of P displaystyle P nbsp respectively Then P displaystyle P nbsp has the following properties P displaystyle P nbsp is the identity operator I displaystyle I nbsp on U displaystyle U nbsp x U Px x displaystyle forall mathbf x in U P mathbf x mathbf x nbsp We have a direct sum W U V displaystyle W U oplus V nbsp Every vector x W displaystyle mathbf x in W nbsp may be decomposed uniquely as x u v displaystyle mathbf x mathbf u mathbf v nbsp with u Px displaystyle mathbf u P mathbf x nbsp and v x Px I P x displaystyle mathbf v mathbf x P mathbf x left I P right mathbf x nbsp and where u U v V displaystyle mathbf u in U mathbf v in V nbsp The image and kernel of a projection are complementary as are P displaystyle P nbsp and Q I P displaystyle Q I P nbsp The operator Q displaystyle Q nbsp is also a projection as the image and kernel of P displaystyle P nbsp become the kernel and image of Q displaystyle Q nbsp and vice versa We say P displaystyle P nbsp is a projection along V displaystyle V nbsp onto U displaystyle U nbsp kernel image and Q displaystyle Q nbsp is a projection along U displaystyle U nbsp onto V displaystyle V nbsp Spectrum edit In infinite dimensional vector spaces the spectrum of a projection is contained in 0 1 displaystyle 0 1 nbsp as lI P 1 1lI 1l l 1 P displaystyle lambda I P 1 frac 1 lambda I frac 1 lambda lambda 1 P nbsp Only 0 or 1 can be an eigenvalue of a projection This implies that an orthogonal projection P displaystyle P nbsp is always a positive semi definite matrix In general the corresponding eigenspaces are respectively the kernel and range of the projection Decomposition of a vector space into direct sums is not unique Therefore given a subspace V displaystyle V nbsp there may be many projections whose range or kernel is V displaystyle V nbsp If a projection is nontrivial it has minimal polynomial x2 x x x 1 displaystyle x 2 x x x 1 nbsp which factors into distinct linear factors and thus P displaystyle P nbsp is diagonalizable Product of projections edit The product of projections is not in general a projection even if they are orthogonal If two projections commute then their product is a projection but the converse is false the product of two non commuting projections may be a projection If two orthogonal projections commute then their product is an orthogonal projection If the product of two orthogonal projections is an orthogonal projection then the two orthogonal projections commute more generally two self adjoint endomorphisms commute if and only if their product is self adjoint Orthogonal projections edit Main articles Hilbert projection theorem and Complemented subspace When the vector space W displaystyle W nbsp has an inner product and is complete is a Hilbert space the concept of orthogonality can be used An orthogonal projection is a projection for which the range U displaystyle U nbsp and the kernel V displaystyle V nbsp are orthogonal subspaces Thus for every x displaystyle mathbf x nbsp and y displaystyle mathbf y nbsp in W displaystyle W nbsp Px y Py x Px Py 0 displaystyle langle P mathbf x mathbf y P mathbf y rangle langle mathbf x P mathbf x P mathbf y rangle 0 nbsp Equivalently x Py Px Py Px y displaystyle langle mathbf x P mathbf y rangle langle P mathbf x P mathbf y rangle langle P mathbf x mathbf y rangle nbsp A projection is orthogonal if and only if it is self adjoint Using the self adjoint and idempotent properties of P displaystyle P nbsp for any x displaystyle mathbf x nbsp and y displaystyle mathbf y nbsp in W displaystyle W nbsp we have Px U displaystyle P mathbf x in U nbsp y Py V displaystyle mathbf y P mathbf y in V nbsp and Px y Py x P P2 y 0 displaystyle langle P mathbf x mathbf y P mathbf y rangle langle mathbf x left P P 2 right mathbf y rangle 0 nbsp where displaystyle langle cdot cdot rangle nbsp is the inner product associated with W displaystyle W nbsp Therefore P displaystyle P nbsp and I P displaystyle I P nbsp are orthogonal projections 3 The other direction namely that if P displaystyle P nbsp is orthogonal then it is self adjoint follows from the implication from x Px Py Px y Py 0 displaystyle langle mathbf x P mathbf x P mathbf y rangle langle P mathbf x mathbf y P mathbf y rangle 0 nbsp to x Py Px Py Px y x P y displaystyle langle mathbf x P mathbf y rangle langle P mathbf x P mathbf y rangle langle P mathbf x mathbf y rangle langle mathbf x P mathbf y rangle nbsp for every x displaystyle x nbsp and y displaystyle y nbsp in W displaystyle W nbsp thus P P displaystyle P P nbsp The existence of an orthogonal projection onto a closed subspace follows from the Hilbert projection theorem Properties and special cases edit An orthogonal projection is a bounded operator This is because for every v displaystyle mathbf v nbsp in the vector space we have by the Cauchy Schwarz inequality Pv 2 Pv Pv Pv v Pv v displaystyle left P mathbf v right 2 langle P mathbf v P mathbf v rangle langle P mathbf v mathbf v rangle leq left P mathbf v right cdot left mathbf v right nbsp Thus Pv v displaystyle left P mathbf v right leq left mathbf v right nbsp For finite dimensional complex or real vector spaces the standard inner product can be substituted for displaystyle langle cdot cdot rangle nbsp Formulas edit A simple case occurs when the orthogonal projection is onto a line If u displaystyle mathbf u nbsp is a unit vector on the line then the projection is given by the outer productPu uuT displaystyle P mathbf u mathbf u mathbf u mathsf T nbsp If u displaystyle mathbf u nbsp is complex valued the transpose in the above equation is replaced by a Hermitian transpose This operator leaves u invariant and it annihilates all vectors orthogonal to u displaystyle mathbf u nbsp proving that it is indeed the orthogonal projection onto the line containing u 4 A simple way to see this is to consider an arbitrary vector x displaystyle mathbf x nbsp as the sum of a component on the line i e the projected vector we seek and another perpendicular to it x x x displaystyle mathbf x mathbf x parallel mathbf x perp nbsp Applying projection we get Pux uuTx uuTx u sgn uTx x u 0 x displaystyle P mathbf u mathbf x mathbf u mathbf u mathsf T mathbf x parallel mathbf u mathbf u mathsf T mathbf x perp mathbf u left operatorname sgn left mathbf u mathsf T mathbf x parallel right left mathbf x parallel right right mathbf u cdot mathbf 0 mathbf x parallel nbsp by the properties of the dot product of parallel and perpendicular vectors This formula can be generalized to orthogonal projections on a subspace of arbitrary dimension Let u1 uk displaystyle mathbf u 1 ldots mathbf u k nbsp be an orthonormal basis of the subspace U displaystyle U nbsp with the assumption that the integer k 1 displaystyle k geq 1 nbsp and let A displaystyle A nbsp denote the n k displaystyle n times k nbsp matrix whose columns are u1 uk displaystyle mathbf u 1 ldots mathbf u k nbsp i e A u1 uk displaystyle A begin bmatrix mathbf u 1 amp cdots amp mathbf u k end bmatrix nbsp Then the projection is given by 5 PA AAT displaystyle P A AA mathsf T nbsp which can be rewritten as PA i ui ui displaystyle P A sum i langle mathbf u i cdot rangle mathbf u i nbsp The matrix AT displaystyle A mathsf T nbsp is the partial isometry that vanishes on the orthogonal complement of U displaystyle U nbsp and A displaystyle A nbsp is the isometry that embeds U displaystyle U nbsp into the underlying vector space The range of PA displaystyle P A nbsp is therefore the final space of A displaystyle A nbsp It is also clear that AAT displaystyle AA mathsf T nbsp is the identity operator on U displaystyle U nbsp The orthonormality condition can also be dropped If u1 uk displaystyle mathbf u 1 ldots mathbf u k nbsp is a not necessarily orthonormal basis with k 1 displaystyle k geq 1 nbsp and A displaystyle A nbsp is the matrix with these vectors as columns then the projection is 6 7 PA A ATA 1AT displaystyle P A A left A mathsf T A right 1 A mathsf T nbsp The matrix A displaystyle A nbsp still embeds U displaystyle U nbsp into the underlying vector space but is no longer an isometry in general The matrix ATA 1 displaystyle left A mathsf T A right 1 nbsp is a normalizing factor that recovers the norm For example the rank 1 operator uuT displaystyle mathbf u mathbf u mathsf T nbsp is not a projection if u 1 displaystyle left mathbf u right neq 1 nbsp After dividing by uTu u 2 displaystyle mathbf u mathsf T mathbf u left mathbf u right 2 nbsp we obtain the projection u uTu 1uT displaystyle mathbf u left mathbf u mathsf T mathbf u right 1 mathbf u mathsf T nbsp onto the subspace spanned by u displaystyle u nbsp In the general case we can have an arbitrary positive definite matrix D displaystyle D nbsp defining an inner product x y D y Dx displaystyle langle x y rangle D y dagger Dx nbsp and the projection PA displaystyle P A nbsp is given by PAx argminy range A x y D2 textstyle P A x operatorname argmin y in operatorname range A left x y right D 2 nbsp ThenPA A ATDA 1ATD displaystyle P A A left A mathsf T DA right 1 A mathsf T D nbsp When the range space of the projection is generated by a frame i e the number of generators is greater than its dimension the formula for the projection takes the form PA AA displaystyle P A AA nbsp Here A displaystyle A nbsp stands for the Moore Penrose pseudoinverse This is just one of many ways to construct the projection operator If AB displaystyle begin bmatrix A amp B end bmatrix nbsp is a non singular matrix and ATB 0 displaystyle A mathsf T B 0 nbsp i e B displaystyle B nbsp is the null space matrix of A displaystyle A nbsp 8 the following holds I AB AB 1 ATBT 1 ATBT AB ATBT AB 1 ATBT AB ATAOOBTB 1 ATBT A ATA 1AT B BTB 1BT displaystyle begin aligned I amp begin bmatrix A amp B end bmatrix begin bmatrix A amp B end bmatrix 1 begin bmatrix A mathsf T B mathsf T end bmatrix 1 begin bmatrix A mathsf T B mathsf T end bmatrix amp begin bmatrix A amp B end bmatrix left begin bmatrix A mathsf T B mathsf T end bmatrix begin bmatrix A amp B end bmatrix right 1 begin bmatrix A mathsf T B mathsf T end bmatrix amp begin bmatrix A amp B end bmatrix begin bmatrix A mathsf T A amp O O amp B mathsf T B end bmatrix 1 begin bmatrix A mathsf T B mathsf T end bmatrix 4pt amp A left A mathsf T A right 1 A mathsf T B left B mathsf T B right 1 B mathsf T end aligned nbsp If the orthogonal condition is enhanced to ATWB ATWTB 0 displaystyle A mathsf T WB A mathsf T W mathsf T B 0 nbsp with W displaystyle W nbsp non singular the following holds I AB ATWA 1AT BTWB 1BT W displaystyle I begin bmatrix A amp B end bmatrix begin bmatrix left A mathsf T WA right 1 A mathsf T left B mathsf T WB right 1 B mathsf T end bmatrix W nbsp All these formulas also hold for complex inner product spaces provided that the conjugate transpose is used instead of the transpose Further details on sums of projectors can be found in Banerjee and Roy 2014 9 Also see Banerjee 2004 10 for application of sums of projectors in basic spherical trigonometry Oblique projections edit The term oblique projections is sometimes used to refer to non orthogonal projections These projections are also used to represent spatial figures in two dimensional drawings see oblique projection though not as frequently as orthogonal projections Whereas calculating the fitted value of an ordinary least squares regression requires an orthogonal projection calculating the fitted value of an instrumental variables regression requires an oblique projection A projection is defined by its kernel and the basis vectors used to characterize its range which is a complement of the kernel When these basis vectors are orthogonal to the kernel then the projection is an orthogonal projection When these basis vectors are not orthogonal to the kernel the projection is an oblique projection or just a projection A matrix representation formula for a nonzero projection operator edit Let P displaystyle P nbsp be a linear operator P V V displaystyle P V to V nbsp such that P2 P displaystyle P 2 P nbsp and assume that P V V displaystyle P V to V nbsp is not the zero operator Let the vectors u1 uk displaystyle mathbf u 1 ldots mathbf u k nbsp form a basis for the range of P displaystyle P nbsp and assemble these vectors in the n k displaystyle n times k nbsp matrix A displaystyle A nbsp Therefore the integer k 1 displaystyle k geq 1 nbsp otherwise k 0 displaystyle k 0 nbsp and P displaystyle P nbsp is the zero operator The range and the kernel are complementary spaces so the kernel has dimension n k displaystyle n k nbsp It follows that the orthogonal complement of the kernel has dimension k displaystyle k nbsp Let v1 vk displaystyle mathbf v 1 ldots mathbf v k nbsp form a basis for the orthogonal complement of the kernel of the projection and assemble these vectors in the matrix B displaystyle B nbsp Then the projection P displaystyle P nbsp with the condition k 1 displaystyle k geq 1 nbsp is given byP A BTA 1BT displaystyle P A left B mathsf T A right 1 B mathsf T nbsp This expression generalizes the formula for orthogonal projections given above 11 12 A standard proof of this expression is the following For any vector x displaystyle mathbf x nbsp in the vector space V displaystyle V nbsp we can decompose x x1 x2 displaystyle mathbf x mathbf x 1 mathbf x 2 nbsp where vector x1 P x displaystyle mathbf x 1 P mathbf x nbsp is in the image of P displaystyle P nbsp and vector x2 x P x displaystyle mathbf x 2 mathbf x P mathbf x nbsp So P x2 P x P2 x 0 displaystyle P mathbf x 2 P mathbf x P 2 mathbf x mathbf 0 nbsp and then x2 displaystyle mathbf x 2 nbsp is in the kernel of P displaystyle P nbsp which is the null space of A displaystyle A nbsp In other words the vector x1 displaystyle mathbf x 1 nbsp is in the column space of A displaystyle A nbsp so x1 Aw displaystyle mathbf x 1 A mathbf w nbsp for some k displaystyle k nbsp dimension vector w displaystyle mathbf w nbsp and the vector x2 displaystyle mathbf x 2 nbsp satisfies BTx2 0 displaystyle B mathsf T mathbf x 2 mathbf 0 nbsp by the construction of B displaystyle B nbsp Put these conditions together and we find a vector w displaystyle mathbf w nbsp so that BT x Aw 0 displaystyle B mathsf T mathbf x A mathbf w mathbf 0 nbsp Since matrices A displaystyle A nbsp and B displaystyle B nbsp are of full rank k displaystyle k nbsp by their construction the k k displaystyle k times k nbsp matrix BTA displaystyle B mathsf T A nbsp is invertible So the equation BT x Aw 0 displaystyle B mathsf T mathbf x A mathbf w mathbf 0 nbsp gives the vector w BTA 1BTx displaystyle mathbf w B mathsf T A 1 B mathsf T mathbf x nbsp In this way Px x1 Aw A BTA 1BTx displaystyle P mathbf x mathbf x 1 A mathbf w A B mathsf T A 1 B mathsf T mathbf x nbsp for any vector x V displaystyle mathbf x in V nbsp and hence P A BTA 1BT displaystyle P A B mathsf T A 1 B mathsf T nbsp In the case that P displaystyle P nbsp is an orthogonal projection we can take A B displaystyle A B nbsp and it follows that P A ATA 1AT displaystyle P A left A mathsf T A right 1 A mathsf T nbsp By using this formula one can easily check that P PT displaystyle P P mathsf T nbsp In general if the vector space is over complex number field one then uses the Hermitian transpose A displaystyle A nbsp and has the formula P A A A 1A displaystyle P A left A A right 1 A nbsp Recall that one can define the Moore Penrose inverse of the matrix A displaystyle A nbsp by A A A 1A displaystyle A A A 1 A nbsp since A displaystyle A nbsp has full column rank so P AA displaystyle P AA nbsp Singular values edit Note that I P displaystyle I P nbsp is also an oblique projection The singular values of P displaystyle P nbsp and I P displaystyle I P nbsp can be computed by an orthonormal basis of A displaystyle A nbsp Let QA displaystyle Q A nbsp be an orthonormal basis of A displaystyle A nbsp and let QA displaystyle Q A perp nbsp be the orthogonal complement of QA displaystyle Q A nbsp Denote the singular values of the matrix QATA BTA 1BTQA displaystyle Q A T A B T A 1 B T Q A perp nbsp by the positive values g1 g2 gk displaystyle gamma 1 geq gamma 2 geq ldots geq gamma k nbsp With this the singular values for P displaystyle P nbsp are 13 si 1 gi21 i k0otherwise displaystyle sigma i begin cases sqrt 1 gamma i 2 amp 1 leq i leq k 0 amp text otherwise end cases nbsp and the singular values for I P displaystyle I P nbsp are si 1 gi21 i k1k 1 i n k0otherwise displaystyle sigma i begin cases sqrt 1 gamma i 2 amp 1 leq i leq k 1 amp k 1 leq i leq n k 0 amp text otherwise end cases nbsp This implies that the largest singular values of P displaystyle P nbsp and I P displaystyle I P nbsp are equal and thus that the matrix norm of the oblique projections are the same However the condition number satisfies the relation k I P s11 s1sk k P displaystyle kappa I P frac sigma 1 1 geq frac sigma 1 sigma k kappa P nbsp and is therefore not necessarily equal Finding projection with an inner product edit Let V displaystyle V nbsp be a vector space in this case a plane spanned by orthogonal vectors u1 u2 up displaystyle mathbf u 1 mathbf u 2 dots mathbf u p nbsp Let y displaystyle y nbsp be a vector One can define a projection of y displaystyle mathbf y nbsp onto V displaystyle V nbsp asprojV y y uiui uiui displaystyle operatorname proj V mathbf y frac mathbf y cdot mathbf u i mathbf u i cdot mathbf u i mathbf u i nbsp where repeated indices are summed over Einstein sum notation The vector y displaystyle mathbf y nbsp can be written as an orthogonal sum such that y projV y z displaystyle mathbf y operatorname proj V mathbf y mathbf z nbsp projV y displaystyle operatorname proj V mathbf y nbsp is sometimes denoted as y displaystyle hat mathbf y nbsp There is a theorem in linear algebra that states that this z displaystyle mathbf z nbsp is the smallest distance the orthogonal distance from y displaystyle mathbf y nbsp to V displaystyle V nbsp and is commonly used in areas such as machine learning nbsp y is being projected onto the vector space V Canonical forms editAny projection P P2 displaystyle P P 2 nbsp on a vector space of dimension d displaystyle d nbsp over a field is a diagonalizable matrix since its minimal polynomial divides x2 x displaystyle x 2 x nbsp which splits into distinct linear factors Thus there exists a basis in which P displaystyle P nbsp has the form P Ir 0d r displaystyle P I r oplus 0 d r nbsp where r displaystyle r nbsp is the rank of P displaystyle P nbsp Here Ir displaystyle I r nbsp is the identity matrix of size r displaystyle r nbsp 0d r displaystyle 0 d r nbsp is the zero matrix of size d r displaystyle d r nbsp and displaystyle oplus nbsp is the direct sum operator If the vector space is complex and equipped with an inner product then there is an orthonormal basis in which the matrix of P is 14 P 1s100 1sk00 Im 0s displaystyle P begin bmatrix 1 amp sigma 1 0 amp 0 end bmatrix oplus cdots oplus begin bmatrix 1 amp sigma k 0 amp 0 end bmatrix oplus I m oplus 0 s nbsp where s1 s2 sk gt 0 displaystyle sigma 1 geq sigma 2 geq dots geq sigma k gt 0 nbsp The integers k s m displaystyle k s m nbsp and the real numbers si displaystyle sigma i nbsp are uniquely determined Note that 2k s m d displaystyle 2k s m d nbsp The factor Im 0s displaystyle I m oplus 0 s nbsp corresponds to the maximal invariant subspace on which P displaystyle P nbsp acts as an orthogonal projection so that P itself is orthogonal if and only if k 0 displaystyle k 0 nbsp and the si displaystyle sigma i nbsp blocks correspond to the oblique components Projections on normed vector spaces editWhen the underlying vector space X displaystyle X nbsp is a not necessarily finite dimensional normed vector space analytic questions irrelevant in the finite dimensional case need to be considered Assume now X displaystyle X nbsp is a Banach space Many of the algebraic results discussed above survive the passage to this context A given direct sum decomposition of X displaystyle X nbsp into complementary subspaces still specifies a projection and vice versa If X displaystyle X nbsp is the direct sum X U V displaystyle X U oplus V nbsp then the operator defined by P u v u displaystyle P u v u nbsp is still a projection with range U displaystyle U nbsp and kernel V displaystyle V nbsp It is also clear that P2 P displaystyle P 2 P nbsp Conversely if P displaystyle P nbsp is projection on X displaystyle X nbsp i e P2 P displaystyle P 2 P nbsp then it is easily verified that 1 P 2 1 P displaystyle 1 P 2 1 P nbsp In other words 1 P displaystyle 1 P nbsp is also a projection The relation P2 P displaystyle P 2 P nbsp implies 1 P 1 P displaystyle 1 P 1 P nbsp and X displaystyle X nbsp is the direct sum rg P rg 1 P displaystyle operatorname rg P oplus operatorname rg 1 P nbsp However in contrast to the finite dimensional case projections need not be continuous in general If a subspace U displaystyle U nbsp of X displaystyle X nbsp is not closed in the norm topology then the projection onto U displaystyle U nbsp is not continuous In other words the range of a continuous projection P displaystyle P nbsp must be a closed subspace Furthermore the kernel of a continuous projection in fact a continuous linear operator in general is closed Thus a continuous projection P displaystyle P nbsp gives a decomposition of X displaystyle X nbsp into two complementary closed subspaces X rg P ker P ker 1 P ker P displaystyle X operatorname rg P oplus ker P ker 1 P oplus ker P nbsp The converse holds also with an additional assumption Suppose U displaystyle U nbsp is a closed subspace of X displaystyle X nbsp If there exists a closed subspace V displaystyle V nbsp such that X U V then the projection P displaystyle P nbsp with range U displaystyle U nbsp and kernel V displaystyle V nbsp is continuous This follows from the closed graph theorem Suppose xn x and Pxn y One needs to show that Px y displaystyle Px y nbsp Since U displaystyle U nbsp is closed and Pxn U y lies in U displaystyle U nbsp i e Py y Also xn Pxn I P xn x y Because V displaystyle V nbsp is closed and I P xn V we have x y V displaystyle x y in V nbsp i e P x y Px Py Px y 0 displaystyle P x y Px Py Px y 0 nbsp which proves the claim The above argument makes use of the assumption that both U displaystyle U nbsp and V displaystyle V nbsp are closed In general given a closed subspace U displaystyle U nbsp there need not exist a complementary closed subspace V displaystyle V nbsp although for Hilbert spaces this can always be done by taking the orthogonal complement For Banach spaces a one dimensional subspace always has a closed complementary subspace This is an immediate consequence of Hahn Banach theorem Let U displaystyle U nbsp be the linear span of u displaystyle u nbsp By Hahn Banach there exists a bounded linear functional f displaystyle varphi nbsp such that f u 1 The operator P x f x u displaystyle P x varphi x u nbsp satisfies P2 P displaystyle P 2 P nbsp i e it is a projection Boundedness of f displaystyle varphi nbsp implies continuity of P displaystyle P nbsp and therefore ker P rg I P displaystyle ker P operatorname rg I P nbsp is a closed complementary subspace of U displaystyle U nbsp Applications and further considerations editProjections orthogonal and otherwise play a major role in algorithms for certain linear algebra problems QR decomposition see Householder transformation and Gram Schmidt decomposition Singular value decomposition Reduction to Hessenberg form the first step in many eigenvalue algorithms Linear regression Projective elements of matrix algebras are used in the construction of certain K groups in Operator K theoryAs stated above projections are a special case of idempotents Analytically orthogonal projections are non commutative generalizations of characteristic functions Idempotents are used in classifying for instance semisimple algebras while measure theory begins with considering characteristic functions of measurable sets Therefore as one can imagine projections are very often encountered in the context of operator algebras In particular a von Neumann algebra is generated by its complete lattice of projections Generalizations editMore generally given a map between normed vector spaces T V W displaystyle T colon V to W nbsp one can analogously ask for this map to be an isometry on the orthogonal complement of the kernel that ker T W displaystyle ker T perp to W nbsp be an isometry compare Partial isometry in particular it must be onto The case of an orthogonal projection is when W is a subspace of V In Riemannian geometry this is used in the definition of a Riemannian submersion See also editCentering matrix which is an example of a projection matrix Dykstra s projection algorithm to compute the projection onto an intersection of sets Invariant subspace Least squares spectral analysis Orthogonalization Properties of traceNotes edit Meyer pp 386 387 a b Horn Roger A Johnson Charles R 2013 Matrix Analysis second edition Cambridge University Press ISBN 9780521839402 Meyer p 433 Meyer p 431 Meyer equation 5 13 4 Banerjee Sudipto Roy Anindya 2014 Linear Algebra and Matrix Analysis for Statistics Texts in Statistical Science 1st ed Chapman and Hall CRC ISBN 978 1420095388 Meyer equation 5 13 3 See also Linear least squares mathematics Properties of the least squares estimators Banerjee Sudipto Roy Anindya 2014 Linear Algebra and Matrix Analysis for Statistics Texts in Statistical Science 1st ed Chapman and Hall CRC ISBN 978 1420095388 Banerjee Sudipto 2004 Revisiting Spherical Trigonometry with Orthogonal Projectors The College Mathematics Journal 35 5 375 381 doi 10 1080 07468342 2004 11922099 S2CID 122277398 Banerjee Sudipto Roy Anindya 2014 Linear Algebra and Matrix Analysis for Statistics Texts in Statistical Science 1st ed Chapman and Hall CRC ISBN 978 1420095388 Meyer equation 7 10 39 Brust J J Marcia R F Petra C G 2020 Computationally Efficient Decompositions of Oblique Projection Matrices SIAM Journal on Matrix Analysis and Applications 41 2 852 870 doi 10 1137 19M1288115 OSTI 1680061 S2CID 219921214 Dokovic D Z August 1991 Unitary similarity of projectors Aequationes Mathematicae 42 1 220 224 doi 10 1007 BF01818492 S2CID 122704926 References editBanerjee Sudipto Roy Anindya 2014 Linear Algebra and Matrix Analysis for Statistics Texts in Statistical Science 1st ed Chapman and Hall CRC ISBN 978 1420095388 Dunford N Schwartz J T 1958 Linear Operators Part I General Theory Interscience Meyer Carl D 2000 Matrix Analysis and Applied Linear Algebra Society for Industrial and Applied Mathematics ISBN 978 0 89871 454 8 External links editMIT Linear Algebra Lecture on Projection Matrices on YouTube from MIT OpenCourseWare Linear Algebra 15d The Projection Transformation on YouTube by Pavel Grinfeld Planar Geometric Projections Tutorial a simple to follow tutorial explaining the different types of planar geometric projections Retrieved from https en wikipedia org w index php title Projection linear algebra amp oldid 1183181991, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.