fbpx
Wikipedia

Determinant

In mathematics, the determinant is a scalar value that is a function of the entries of a square matrix. The determinant of a matrix A is commonly denoted det(A), det A, or |A|. Its value characterizes some properties of the matrix and the linear map represented by the matrix. In particular, the determinant is nonzero if and only if the matrix is invertible and the linear map represented by the matrix is an isomorphism. The determinant of a product of matrices is the product of their determinants (which follows directly from the above properties).

The determinant of a 2 × 2 matrix is

and the determinant of a 3 × 3 matrix is

The determinant of an n × n matrix can be defined in several equivalent ways, the most common being Leibniz formula, which expresses the determinant as a sum of (the factorial of n) signed products of matrix entries. It can be computed by the Laplace expansion, which expresses the determinant as a linear combination of determinants of submatrices, or with Gaussian elimination, which expresses the determinant as the product of the diagonal entries of a diagonal matrix that is obtained by a succession of elementary row operations.

Determinants can also be defined by some of their properties: the determinant is the unique function defined on the n × n matrices that has the four following properties. The determinant of the identity matrix is 1; the exchange of two rows multiplies the determinant by −1; multiplying a row by a number multiplies the determinant by this number; and adding to a row a multiple of another row does not change the determinant. (The above properties relating to rows may be replaced by the corresponding statements with respect to columns.)

Determinants occur throughout mathematics. For example, a matrix is often used to represent the coefficients in a system of linear equations, and determinants can be used to solve these equations (Cramer's rule), although other methods of solution are computationally much more efficient. Determinants are used for defining the characteristic polynomial of a matrix, whose roots are the eigenvalues. In geometry, the signed n-dimensional volume of a n-dimensional parallelepiped is expressed by a determinant, and the determinant of (the matrix of) a linear transformation determines how the orientation and the n-dimensional volume are transformed. This is used in calculus with exterior differential forms and the Jacobian determinant, in particular for changes of variables in multiple integrals.

Two by two matrices Edit

The determinant of a 2 × 2 matrix   is denoted either by "det" or by vertical bars around the matrix, and is defined as

 

For example,

 

First properties Edit

The determinant has several key properties that can be proved by direct evaluation of the definition for  -matrices, and that continue to hold for determinants of larger matrices. They are as follows:[1] first, the determinant of the identity matrix   is 1. Second, the determinant is zero if two rows are the same:

 

This holds similarly if the two columns are the same. Moreover,

 

Finally, if any column is multiplied by some number   (i.e., all entries in that column are multiplied by that number), the determinant is also multiplied by that number:

 

Geometric meaning Edit

 
The area of the parallelogram is the absolute value of the determinant of the matrix formed by the vectors representing the parallelogram's sides.

If the matrix entries are real numbers, the matrix A can be used to represent two linear maps: one that maps the standard basis vectors to the rows of A, and one that maps them to the columns of A. In either case, the images of the basis vectors form a parallelogram that represents the image of the unit square under the mapping. The parallelogram defined by the rows of the above matrix is the one with vertices at (0, 0), (a, b), (a + c, b + d), and (c, d), as shown in the accompanying diagram.

The absolute value of adbc is the area of the parallelogram, and thus represents the scale factor by which areas are transformed by A. (The parallelogram formed by the columns of A is in general a different parallelogram, but since the determinant is symmetric with respect to rows and columns, the area will be the same.)

The absolute value of the determinant together with the sign becomes the oriented area of the parallelogram. The oriented area is the same as the usual area, except that it is negative when the angle from the first to the second vector defining the parallelogram turns in a clockwise direction (which is opposite to the direction one would get for the identity matrix).

To show that adbc is the signed area, one may consider a matrix containing two vectors u ≡ (a, b) and v ≡ (c, d) representing the parallelogram's sides. The signed area can be expressed as |u| |v| sin θ for the angle θ between the vectors, which is simply base times height, the length of one vector times the perpendicular component of the other. Due to the sine this already is the signed area, yet it may be expressed more conveniently using the cosine of the complementary angle to a perpendicular vector, e.g. u = (−b, a), so that |u| |v| cos θ′, which can be determined by the pattern of the scalar product to be equal to adbc:

 
 
The volume of this parallelepiped is the absolute value of the determinant of the matrix formed by the columns constructed from the vectors r1, r2, and r3.

Thus the determinant gives the scaling factor and the orientation induced by the mapping represented by A. When the determinant is equal to one, the linear mapping defined by the matrix is equi-areal and orientation-preserving.

The object known as the bivector is related to these ideas. In 2D, it can be interpreted as an oriented plane segment formed by imagining two vectors each with origin (0, 0), and coordinates (a, b) and (c, d). The bivector magnitude (denoted by (a, b) ∧ (c, d)) is the signed area, which is also the determinant adbc.[2]

If an n × n real matrix A is written in terms of its column vectors  , then

 

This means that   maps the unit n-cube to the n-dimensional parallelotope defined by the vectors   the region  

The determinant gives the signed n-dimensional volume of this parallelotope,   and hence describes more generally the n-dimensional volume scaling factor of the linear transformation produced by A.[3] (The sign shows whether the transformation preserves or reverses orientation.) In particular, if the determinant is zero, then this parallelotope has volume zero and is not fully n-dimensional, which indicates that the dimension of the image of A is less than n. This means that A produces a linear transformation which is neither onto nor one-to-one, and so is not invertible.

Definition Edit

Let A be a square matrix with n rows and n columns, so that it can be written as

 

The entries   etc. are, for many purposes, real or complex numbers. As discussed below, the determinant is also defined for matrices whose entries are in a commutative ring.

The determinant of A is denoted by det(A), or it can be denoted directly in terms of the matrix entries by writing enclosing bars instead of brackets:

 

There are various equivalent ways to define the determinant of a square matrix A, i.e. one with the same number of rows and columns: the determinant can be defined via the Leibniz formula, an explicit formula involving sums of products of certain entries of the matrix. The determinant can also be characterized as the unique function depending on the entries of the matrix satisfying certain properties. This approach can also be used to compute determinants by simplifying the matrices in question.

Leibniz formula Edit

3 × 3 matrices Edit

The Leibniz formula for the determinant of a 3 × 3 matrix is the following:

 

In this expression, each term has one factor from each row, all in different columns. For example, bdi has b from the first row second column, d from the second row first column, and i from the third row third column. The signs are determined by how many transpositions of factors are necessary to arrange the factors in increasing order of their columns: positive for an even number of transpositions and negative for an odd number. For the example of bdi, the single transposition of bd to db gives dbi, whose three factors are from the first, second and thrd columns respectively; this is an odd number of transpositions, so the term appears with negative sign.

 
Rule of Sarrus

The rule of Sarrus is a mnemonic for the expanded form of this determinant: the sum of the products of three diagonal north-west to south-east lines of matrix elements, minus the sum of the products of three diagonal south-west to north-east lines of elements, when the copies of the first two columns of the matrix are written beside it as in the illustration. This scheme for calculating the determinant of a 3 × 3 matrix does not carry over into higher dimensions.

n × n matrices Edit

Generalizing the above to higher dimensions, the determinant of an   matrix is an expression involving permutations and their signatures. A permutation of the set   is a bijective function   from this set to itself, with values  .exhausting the entire set. The set of all such permutations, called the symmetric group, is commonly denoted  . The signature   of a permutation   is   if the permutation can be obtained with an even number of transpositions (exchanges of two entries); otherwise, it is  

Given a matrix

 

the Leibniz formula for its determinant is, using sigma notation,

 

Using pi notation, this can be shortened into

 .

The Levi-Civita symbol   is defined on the n-tuples of integers in   as 0 if two of the integers are equal, and otherwise as the signature of the permutation defined by the n-tuple of integers. With the Levi-Civita symbol, the Leibniz formula becomes

 

where the sum is taken over all n-tuples of integers in   [4][5]

Properties of the determinant Edit

Characterization of the determinant Edit

The determinant can be characterized by the following three key properties. To state these, it is convenient to regard an  -matrix A as being composed of its   columns, so denoted as

 

where the column vector   (for each i) is composed of the entries of the matrix in the i-th column.

  1.  , where   is an identity matrix.
  2. The determinant is multilinear: if the jth column of a matrix   is written as a linear combination   of two column vectors v and w and a number r, then the determinant of A is expressible as a similar linear combination:
     
  3. The determinant is alternating: whenever two columns of a matrix are identical, its determinant is 0:
     

If the determinant is defined using the Leibniz formula as above, these three properties can be proved by direct inspection of that formula. Some authors also approach the determinant directly using these three properties: it can be shown that there is exactly one function that assigns to any  -matrix A a number that satisfies these three properties.[6] This also shows that this more abstract approach to the determinant yields the same definition as the one using the Leibniz formula.

To see this it suffices to expand the determinant by multi-linearity in the columns into a (huge) linear combination of determinants of matrices in which each column is a standard basis vector. These determinants are either 0 (by property 9) or else ±1 (by properties 1 and 12 below), so the linear combination gives the expression above in terms of the Levi-Civita symbol. While less technical in appearance, this characterization cannot entirely replace the Leibniz formula in defining the determinant, since without it the existence of an appropriate function is not clear.[citation needed]

Immediate consequences Edit

These rules have several further consequences:

  • The determinant is a homogeneous function, i.e.,
     
    (for an   matrix  ).
  • Interchanging any pair of columns of a matrix multiplies its determinant by −1. This follows from the determinant being multilinear and alternating (properties 2 and 3 above):
     
    This formula can be applied iteratively when several columns are swapped. For example
     
    Yet more generally, any permutation of the columns multiplies the determinant by the sign of the permutation.
  • If some column can be expressed as a linear combination of the other columns (i.e. the columns of the matrix form a linearly dependent set), the determinant is 0. As a special case, this includes: if some column is such that all its entries are zero, then the determinant of that matrix is 0.
  • Adding a scalar multiple of one column to another column does not change the value of the determinant. This is a consequence of multilinearity and being alternative: by multilinearity the determinant changes by a multiple of the determinant of a matrix with two equal columns, which determinant is 0, since the determinant is alternating.
  • If   is a triangular matrix, i.e.  , whenever   or, alternatively, whenever  , then its determinant equals the product of the diagonal entries:
     
    Indeed, such a matrix can be reduced, by appropriately adding multiples of the columns with fewer nonzero entries to those with more entries, to a diagonal matrix (without changing the determinant). For such a matrix, using the linearity in each column reduces to the identity matrix, in which case the stated formula holds by the very first characterizing property of determinants. Alternatively, this formula can also be deduced from the Leibniz formula, since the only permutation   which gives a non-zero contribution is the identity permutation.

Example Edit

These characterizing properties and their consequences listed above are both theoretically significant, but can also be used to compute determinants for concrete matrices. In fact, Gaussian elimination can be applied to bring any matrix into upper triangular form, and the steps in this algorithm affect the determinant in a controlled way. The following concrete example illustrates the computation of the determinant of the matrix   using that method:

 
Computation of the determinant of matrix  
Matrix  

 

 

 

Obtained by

add the second column to the first

add 3 times the third column to the second

swap the first two columns

add   times the second column to the first

Determinant  

 

 

 

Combining these equalities gives  

Transpose Edit

The determinant of the transpose of   equals the determinant of A:

 .

This can be proven by inspecting the Leibniz formula.[7] This implies that in all the properties mentioned above, the word "column" can be replaced by "row" throughout. For example, viewing an n × n matrix as being composed of n rows, the determinant is an n-linear function.

Multiplicativity and matrix groups Edit

The determinant is a multiplicative map, i.e., for square matrices   and   of equal size, the determinant of a matrix product equals the product of their determinants:

 

This key fact can be proven by observing that, for a fixed matrix  , both sides of the equation are alternating and multilinear as a function depending on the columns of  . Moreover, they both take the value   when   is the identity matrix. The above-mentioned unique characterization of alternating multilinear maps therefore shows this claim.[8]

A matrix   with entries in a field is invertible precisely if its determinant is nonzero. This follows from the multiplicativity of the determinant and the formula for the inverse involving the adjugate matrix mentioned below. In this event, the determinant of the inverse matrix is given by

 .

In particular, products and inverses of matrices with non-zero determinant (respectively, determinant one) still have this property. Thus, the set of such matrices (of fixed size   over a field  ) forms a group known as the general linear group   (respectively, a subgroup called the special linear group  . More generally, the word "special" indicates the subgroup of another matrix group of matrices of determinant one. Examples include the special orthogonal group (which if n is 2 or 3 consists of all rotation matrices), and the special unitary group.

Because the determinant respects multiplication and inverses, it is in fact a group homomorphism from   into the multiplicative group   of nonzero elements of  . This homomorphism is surjective and its kernel is   (the matrices with determinant one). Hence, by the first isomorphism theorem, this shows that   is a normal subgroup of  , and that the quotient group   is isomorphic to  .

The Cauchy–Binet formula is a generalization of that product formula for rectangular matrices. This formula can also be recast as a multiplicative formula for compound matrices whose entries are the determinants of all quadratic submatrices of a given matrix.[9][10]

Laplace expansion Edit

Laplace expansion expresses the determinant of a matrix   recursively in terms of determinants of smaller matrices, known as its minors. The minor   is defined to be the determinant of the  -matrix that results from   by removing the  -th row and the  -th column. The expression   is known as a cofactor. For every  , one has the equality

 

which is called the Laplace expansion along the ith row. For example, the Laplace expansion along the first row ( ) gives the following formula:

 

Unwinding the determinants of these  -matrices gives back the Leibniz formula mentioned above. Similarly, the Laplace expansion along the  -th column is the equality

 

Laplace expansion can be used iteratively for computing determinants, but this approach is inefficient for large matrices. However, it is useful for computing the determinants of highly symmetric matrix such as the Vandermonde matrix

 
The n-term Laplace expansion along a row or column can be generalized to write an n x n determinant as a sum of   terms, each the product of the determinant of a k x k submatrix and the determinant of the complementary (n−k) x (n−k) submatrix.

Adjugate matrix Edit

The adjugate matrix   is the transpose of the matrix of the cofactors, that is,

 

For every matrix, one has[11]

 

Thus the adjugate matrix can be used for expressing the inverse of a nonsingular matrix:

 

Block matrices Edit

The formula for the determinant of a  -matrix above continues to hold, under appropriate further assumptions, for a block matrix, i.e., a matrix composed of four submatrices   of dimension  ,  ,   and  , respectively. The easiest such formula, which can be proven using either the Leibniz formula or a factorization involving the Schur complement, is

 

If   is invertible, then it follows with results from the section on multiplicativity that

 

which simplifies to   when   is a  -matrix.

A similar result holds when   is invertible, namely

 

Both results can be combined to derive Sylvester's determinant theorem, which is also stated below.

If the blocks are square matrices of the same size further formulas hold. For example, if   and   commute (i.e.,  ), then[12]

 

This formula has been generalized to matrices composed of more than   blocks, again under appropriate commutativity conditions among the individual blocks.[13]

For   and  , the following formula holds (even if   and   do not commute)[citation needed]

 

Sylvester's determinant theorem Edit

Sylvester's determinant theorem states that for A, an m × n matrix, and B, an n × m matrix (so that A and B have dimensions allowing them to be multiplied in either order forming a square matrix):

 

where Im and In are the m × m and n × n identity matrices, respectively.

From this general result several consequences follow.

  1. For the case of column vector c and row vector r, each with m components, the formula allows quick calculation of the determinant of a matrix that differs from the identity matrix by a matrix of rank 1:
     
  2. More generally,[14] for any invertible m × m matrix X,
     
  3. For a column and row vector as above:
     
  4. For square matrices   and   of the same size, the matrices   and   have the same characteristic polynomials (hence the same eigenvalues).

Sum Edit

The determinant of the sum   of two square matrices of the same size is not in general expressible in terms of the determinants of A and of B. However, for positive semidefinite matrices  ,   and   of equal size,

 
with the corollary[15][16]
 
Conversely, if   and   are Hermitian, positive-definite, and size  , then the determinant has concave  th root;[17] this implies
 
by homogeneity.

Sum identity for 2×2 matrices Edit

For the special case of   matrices with complex entries, the determinant of the sum can be written in terms of determinants and traces in the following identity:

 
Proof of identity

This can be shown by writing out each term in components  . The left-hand side is

 

Expanding gives

 

The terms which are quadratic in   are seen to be  , and similarly for  , so the expression can be written

 

We can then write the cross-terms as

 

which can be recognized as

 

which completes the proof.

This has an application to   matrix algebras. For example, consider the complex numbers as a matrix algebra. The complex numbers have a representation as matrices of the form

 
with   and   real. Since  , taking   and   in the above identity gives
 

This result followed just from   and  .

Properties of the determinant in relation to other notions Edit

Eigenvalues and characteristic polynomial Edit

The determinant is closely related to two other central concepts in linear algebra, the eigenvalues and the characteristic polynomial of a matrix. Let   be an  -matrix with complex entries. Then, by the Fundamental Theorem of Algebra,   must have exactly n eigenvalues  . (Here it is understood that an eigenvalue with algebraic multiplicity μ occurs μ times in this list.) Then, it turns out the determinant of A is equal to the product of these eigenvalues,

 

The product of all non-zero eigenvalues is referred to as pseudo-determinant.

From this, one immediately sees that the determinant of a matrix   is zero iff   is an eigenvalue of  . In other words,   is invertible iff   is not an eigenvalue of  .

The characteristic polynomial is defined as[18]

 

Here,   is the indeterminate of the polynomial and   is the identity matrix of the same size as  . By means of this polynomial, determinants can be used to find the eigenvalues of the matrix  : they are precisely the roots of this polynomial, i.e., those complex numbers   such that

 

A Hermitian matrix is positive definite if all its eigenvalues are positive. Sylvester's criterion asserts that this is equivalent to the determinants of the submatrices

 

being positive, for all   between   and  .[19]

Trace Edit

The trace tr(A) is by definition the sum of the diagonal entries of A and also equals the sum of the eigenvalues. Thus, for complex matrices A,

 

or, for real matrices A,

 

Here exp(A) denotes the matrix exponential of A, because every eigenvalue λ of A corresponds to the eigenvalue exp(λ) of exp(A). In particular, given any logarithm of A, that is, any matrix L satisfying

 

the determinant of A is given by

 

For example, for n = 2, n = 3, and n = 4, respectively,

 

cf. Cayley-Hamilton theorem. Such expressions are deducible from combinatorial arguments, Newton's identities, or the Faddeev–LeVerrier algorithm. That is, for generic n, detA = (−1)nc0 the signed constant term of the characteristic polynomial, determined recursively from

 

In the general case, this may also be obtained from[20]

 

where the sum is taken over the set of all integers kl ≥ 0 satisfying the equation

 

The formula can be expressed in terms of the complete exponential Bell polynomial of n arguments sl = −(l – 1)! tr(Al) as

 

This formula can also be used to find the determinant of a matrix AIJ with multidimensional indices I = (i1, i2, …, ir) and J = (j1, j2, …, jr). The product and trace of such matrices are defined in a natural way as

 

An important arbitrary dimension n identity can be obtained from the Mercator series expansion of the logarithm when the expansion converges. If every eigenvalue of A is less than 1 in absolute value,

 

where I is the identity matrix. More generally, if

 

is expanded as a formal power series in s then all coefficients of sm for m > n are zero and the remaining polynomial is det(I + sA).

Upper and lower bounds Edit

For a positive definite matrix A, the trace operator gives the following tight lower and upper bounds on the log determinant

 

with equality if and only if A = I. This relationship can be derived via the formula for the Kullback-Leibler divergence between two multivariate normal distributions.

Also,

 

These inequalities can be proved by expressing the traces and the determinant in terms of the eigenvalues. As such, they represent the well-known fact that the harmonic mean is less than the geometric mean, which is less than the arithmetic mean, which is, in turn, less than the root mean square.

Derivative Edit

The Leibniz formula shows that the determinant of real (or analogously for complex) square matrices is a polynomial function from   to  . In particular, it is everywhere differentiable. Its derivative can be expressed using Jacobi's formula:[21]

 

where   denotes the adjugate of  . In particular, if   is invertible, we have

 

Expressed in terms of the entries of  , these are

 

Yet another equivalent formulation is

 ,

using big O notation. The special case where  , the identity matrix, yields

 

This identity is used in describing Lie algebras associated to certain matrix Lie groups. For example, the special linear group   is defined by the equation  . The above formula shows that its Lie algebra is the special linear Lie algebra   consisting of those matrices having trace zero.

Writing a  -matrix as   where   are column vectors of length 3, then the gradient over one of the three vectors may be written as the cross product of the other two:

 

History Edit

Historically, determinants were used long before matrices: A determinant was originally defined as a property of a system of linear equations. The determinant "determines" whether the system has a unique solution (which occurs precisely if the determinant is non-zero). In this sense, determinants were first used in the Chinese mathematics textbook The Nine Chapters on the Mathematical Art (九章算術, Chinese scholars, around the 3rd century BCE). In Europe, solutions of linear systems of two equations were expressed by Cardano in 1545 by a determinant-like entity.[22]

Determinants proper originated from the work of Seki Takakazu in 1683 in Japan and parallelly of Leibniz in 1693.[23][24][25][26] Cramer (1750) stated, without proof, Cramer's rule.[27] Both Cramer and also Bezout (1779) were led to determinants by the question of plane curves passing through a given set of points.[28]

Vandermonde (1771) first recognized determinants as independent functions.[24] Laplace (1772) gave the general method of expanding a determinant in terms of its complementary minors: Vandermonde had already given a special case.[29] Immediately following, Lagrange (1773) treated determinants of the second and third order and applied it to questions of elimination theory; he proved many special cases of general identities.

Gauss (1801) made the next advance. Like Lagrange, he made much use of determinants in the theory of numbers. He introduced the word "determinant" (Laplace had used "resultant"), though not in the present signification, but rather as applied to the discriminant of a quantic.[30] Gauss also arrived at the notion of reciprocal (inverse) determinants, and came very near the multiplication theorem.[clarification needed]

The next contributor of importance is Binet (1811, 1812), who formally stated the theorem relating to the product of two matrices of m columns and n rows, which for the special case of m = n reduces to the multiplication theorem. On the same day (November 30, 1812) that Binet presented his paper to the Academy, Cauchy also presented one on the subject. (See Cauchy–Binet formula.) In this he used the word "determinant" in its present sense,[31][32] summarized and simplified what was then known on the subject, improved the notation, and gave the multiplication theorem with a proof more satisfactory than Binet's.[24][33] With him begins the theory in its generality.

Jacobi (1841) used the functional determinant which Sylvester later called the Jacobian.[34] In his memoirs in Crelle's Journal for 1841 he specially treats this subject, as well as the class of alternating functions which Sylvester has called alternants. About the time of Jacobi's last memoirs, Sylvester (1839) and Cayley began their work. Cayley 1841 introduced the modern notation for the determinant using vertical bars.[35][36]

The study of special forms of determinants has been the natural result of the completion of the general theory. Axisymmetric determinants have been studied by Lebesgue, Hesse, and Sylvester; persymmetric determinants by Sylvester and Hankel; circulants by Catalan, Spottiswoode, Glaisher, and Scott; skew determinants and Pfaffians, in connection with the theory of orthogonal transformation, by Cayley; continuants by Sylvester; Wronskians (so called by Muir) by Christoffel and Frobenius; compound determinants by Sylvester, Reiss, and Picquet; Jacobians and Hessians by Sylvester; and symmetric gauche determinants by Trudi. Of the textbooks on the subject Spottiswoode's was the first. In America, Hanus (1886), Weld (1893), and Muir/Metzler (1933) published treatises.

Applications Edit

Cramer's rule Edit

Determinants can be used to describe the solutions of a linear system of equations, written in matrix form as  . This equation has a unique solution   if and only if   is nonzero. In this case, the solution is given by Cramer's rule:

 

where   is the matrix formed by replacing the  -th column of   by the column vector  . This follows immediately by column expansion of the determinant, i.e.

 

where the vectors   are the columns of A. The rule is also implied by the identity

 

Cramer's rule can be implemented in   time, which is comparable to more common methods of solving systems of linear equations, such as LU, QR, or singular value decomposition.[37]

Linear independence Edit

Determinants can be used to characterize linearly dependent vectors:   is zero if and only if the column vectors (or, equivalently, the row vectors) of the matrix   are linearly dependent.[38] For example, given two linearly independent vectors  , a third vector   lies in the plane spanned by the former two vectors exactly if the determinant of the  -matrix consisting of the three vectors is zero. The same idea is also used in the theory of differential equations: given functions   (supposed to be   times differentiable), the Wronskian is defined to be

 

It is non-zero (for some  ) in a specified interval if and only if the given functions and all their derivatives up to order   are linearly independent. If it can be shown that the Wronskian is zero everywhere on an interval then, in the case of analytic functions, this implies the given functions are linearly dependent. See the Wronskian and linear independence. Another such use of the determinant is the resultant, which gives a criterion when two polynomials have a common root.[39]

Orientation of a basis Edit

The determinant can be thought of as assigning a number to every sequence of n vectors in Rn, by using the square matrix whose columns are the given vectors. The determinant will be nonzero iff the sequence of vectors is a basis for Rn. In that case, the sign of the determinant determines whether the orientation of the basis is consistent with or opposite to the orientation of the standard basis. In the case of an orthogonal basis, the magnitude of the determinant is equal to the product of the lengths of the basis vectors. For instance, an orthogonal matrix with entries in Rn represents an orthonormal basis in Euclidean space, and hence has determinant of ±1 (since all the vectors have length 1). The determinant is +1 iff the basis has the same orientation. It is −1 iff the basis has the opposite orientation.

More generally, if the determinant of A is positive, A represents an orientation-preserving linear transformation (if A is an orthogonal 2 × 2 or 3 × 3 matrix, this is a rotation), while if it is negative, A switches the orientation of the basis.

Volume and Jacobian determinant Edit

As pointed out above, the absolute value of the determinant of real vectors is equal to the volume of the parallelepiped spanned by those vectors. As a consequence, if   is the linear map given by multiplication with a matrix  

determinant, this, article, about, mathematics, determinants, epidemiology, risk, factor, determinants, immunology, epitope, mathematics, determinant, scalar, value, that, function, entries, square, matrix, determinant, matrix, commonly, denoted, value, charac. This article is about mathematics For determinants in epidemiology see Risk factor For determinants in immunology see Epitope In mathematics the determinant is a scalar value that is a function of the entries of a square matrix The determinant of a matrix A is commonly denoted det A det A or A Its value characterizes some properties of the matrix and the linear map represented by the matrix In particular the determinant is nonzero if and only if the matrix is invertible and the linear map represented by the matrix is an isomorphism The determinant of a product of matrices is the product of their determinants which follows directly from the above properties The determinant of a 2 2 matrix is a b c d a d b c displaystyle begin vmatrix a amp b c amp d end vmatrix ad bc and the determinant of a 3 3 matrix is a b c d e f g h i a e i b f g c d h c e g b d i a f h displaystyle begin vmatrix a amp b amp c d amp e amp f g amp h amp i end vmatrix aei bfg cdh ceg bdi afh The determinant of an n n matrix can be defined in several equivalent ways the most common being Leibniz formula which expresses the determinant as a sum of n displaystyle n the factorial of n signed products of matrix entries It can be computed by the Laplace expansion which expresses the determinant as a linear combination of determinants of submatrices or with Gaussian elimination which expresses the determinant as the product of the diagonal entries of a diagonal matrix that is obtained by a succession of elementary row operations Determinants can also be defined by some of their properties the determinant is the unique function defined on the n n matrices that has the four following properties The determinant of the identity matrix is 1 the exchange of two rows multiplies the determinant by 1 multiplying a row by a number multiplies the determinant by this number and adding to a row a multiple of another row does not change the determinant The above properties relating to rows may be replaced by the corresponding statements with respect to columns Determinants occur throughout mathematics For example a matrix is often used to represent the coefficients in a system of linear equations and determinants can be used to solve these equations Cramer s rule although other methods of solution are computationally much more efficient Determinants are used for defining the characteristic polynomial of a matrix whose roots are the eigenvalues In geometry the signed n dimensional volume of a n dimensional parallelepiped is expressed by a determinant and the determinant of the matrix of a linear transformation determines how the orientation and the n dimensional volume are transformed This is used in calculus with exterior differential forms and the Jacobian determinant in particular for changes of variables in multiple integrals Contents 1 Two by two matrices 1 1 First properties 2 Geometric meaning 3 Definition 3 1 Leibniz formula 3 1 1 3 3 matrices 3 1 2 n n matrices 4 Properties of the determinant 4 1 Characterization of the determinant 4 2 Immediate consequences 4 2 1 Example 4 3 Transpose 4 4 Multiplicativity and matrix groups 4 5 Laplace expansion 4 5 1 Adjugate matrix 4 6 Block matrices 4 7 Sylvester s determinant theorem 4 8 Sum 4 8 1 Sum identity for 2 2 matrices 5 Properties of the determinant in relation to other notions 5 1 Eigenvalues and characteristic polynomial 5 2 Trace 5 3 Upper and lower bounds 5 4 Derivative 6 History 7 Applications 7 1 Cramer s rule 7 2 Linear independence 7 3 Orientation of a basis 7 4 Volume and Jacobian determinant 8 Abstract algebraic aspects 8 1 Determinant of an endomorphism 8 2 Square matrices over commutative rings 8 3 Exterior algebra 9 Generalizations and related notions 9 1 Determinants for finite dimensional algebras 9 2 Infinite matrices 9 3 Operators in von Neumann algebras 9 4 Related notions for non commutative rings 10 Calculation 10 1 Decomposition methods 10 2 Further methods 11 See also 12 Notes 13 References 13 1 Historical references 14 External linksTwo by two matrices EditThe determinant of a 2 2 matrix a b c d displaystyle begin pmatrix a amp b c amp d end pmatrix nbsp is denoted either by det or by vertical bars around the matrix and is defined as det a b c d a b c d a d b c displaystyle det begin pmatrix a amp b c amp d end pmatrix begin vmatrix a amp b c amp d end vmatrix ad bc nbsp For example det 3 7 1 4 3 7 1 4 3 4 7 1 19 displaystyle det begin pmatrix 3 amp 7 1 amp 4 end pmatrix begin vmatrix 3 amp 7 1 amp 4 end vmatrix 3 cdot 4 7 cdot 1 19 nbsp First properties Edit The determinant has several key properties that can be proved by direct evaluation of the definition for 2 2 displaystyle 2 times 2 nbsp matrices and that continue to hold for determinants of larger matrices They are as follows 1 first the determinant of the identity matrix 1 0 0 1 displaystyle begin pmatrix 1 amp 0 0 amp 1 end pmatrix nbsp is 1 Second the determinant is zero if two rows are the same a b a b a b b a 0 displaystyle begin vmatrix a amp b a amp b end vmatrix ab ba 0 nbsp This holds similarly if the two columns are the same Moreover a b b c d d a d d b b c a b c d a b c d displaystyle begin vmatrix a amp b b c amp d d end vmatrix a d d b b c begin vmatrix a amp b c amp d end vmatrix begin vmatrix a amp b c amp d end vmatrix nbsp Finally if any column is multiplied by some number r displaystyle r nbsp i e all entries in that column are multiplied by that number the determinant is also multiplied by that number r a b r c d r a d b r c r a d b c r a b c d displaystyle begin vmatrix r cdot a amp b r cdot c amp d end vmatrix rad brc r ad bc r cdot begin vmatrix a amp b c amp d end vmatrix nbsp Geometric meaning Edit nbsp The area of the parallelogram is the absolute value of the determinant of the matrix formed by the vectors representing the parallelogram s sides If the matrix entries are real numbers the matrix A can be used to represent two linear maps one that maps the standard basis vectors to the rows of A and one that maps them to the columns of A In either case the images of the basis vectors form a parallelogram that represents the image of the unit square under the mapping The parallelogram defined by the rows of the above matrix is the one with vertices at 0 0 a b a c b d and c d as shown in the accompanying diagram The absolute value of ad bc is the area of the parallelogram and thus represents the scale factor by which areas are transformed by A The parallelogram formed by the columns of A is in general a different parallelogram but since the determinant is symmetric with respect to rows and columns the area will be the same The absolute value of the determinant together with the sign becomes the oriented area of the parallelogram The oriented area is the same as the usual area except that it is negative when the angle from the first to the second vector defining the parallelogram turns in a clockwise direction which is opposite to the direction one would get for the identity matrix To show that ad bc is the signed area one may consider a matrix containing two vectors u a b and v c d representing the parallelogram s sides The signed area can be expressed as u v sin 8 for the angle 8 between the vectors which is simply base times height the length of one vector times the perpendicular component of the other Due to the sine this already is the signed area yet it may be expressed more conveniently using the cosine of the complementary angle to a perpendicular vector e g u b a so that u v cos 8 which can be determined by the pattern of the scalar product to be equal to ad bc Signed area u v sin 8 u v cos 8 b a c d a d b c displaystyle text Signed area boldsymbol u boldsymbol v sin theta left boldsymbol u perp right left boldsymbol v right cos theta begin pmatrix b a end pmatrix cdot begin pmatrix c d end pmatrix ad bc nbsp nbsp The volume of this parallelepiped is the absolute value of the determinant of the matrix formed by the columns constructed from the vectors r1 r2 and r3 Thus the determinant gives the scaling factor and the orientation induced by the mapping represented by A When the determinant is equal to one the linear mapping defined by the matrix is equi areal and orientation preserving The object known as the bivector is related to these ideas In 2D it can be interpreted as an oriented plane segment formed by imagining two vectors each with origin 0 0 and coordinates a b and c d The bivector magnitude denoted by a b c d is the signed area which is also the determinant ad bc 2 If an n n real matrix A is written in terms of its column vectors A a 1 a 2 a n displaystyle A left begin array c c c c mathbf a 1 amp mathbf a 2 amp cdots amp mathbf a n end array right nbsp then A 1 0 0 a 1 A 0 1 0 a 2 A 0 0 1 a n displaystyle A begin pmatrix 1 0 vdots 0 end pmatrix mathbf a 1 quad A begin pmatrix 0 1 vdots 0 end pmatrix mathbf a 2 quad ldots quad A begin pmatrix 0 0 vdots 1 end pmatrix mathbf a n nbsp This means that A displaystyle A nbsp maps the unit n cube to the n dimensional parallelotope defined by the vectors a 1 a 2 a n displaystyle mathbf a 1 mathbf a 2 ldots mathbf a n nbsp the region P c 1 a 1 c n a n 0 c i 1 i displaystyle P left c 1 mathbf a 1 cdots c n mathbf a n mid 0 leq c i leq 1 forall i right nbsp The determinant gives the signed n dimensional volume of this parallelotope det A vol P displaystyle det A pm text vol P nbsp and hence describes more generally the n dimensional volume scaling factor of the linear transformation produced by A 3 The sign shows whether the transformation preserves or reverses orientation In particular if the determinant is zero then this parallelotope has volume zero and is not fully n dimensional which indicates that the dimension of the image of A is less than n This means that A produces a linear transformation which is neither onto nor one to one and so is not invertible Definition EditLet A be a square matrix with n rows and n columns so that it can be written as A a 1 1 a 1 2 a 1 n a 2 1 a 2 2 a 2 n a n 1 a n 2 a n n displaystyle A begin bmatrix a 1 1 amp a 1 2 amp cdots amp a 1 n a 2 1 amp a 2 2 amp cdots amp a 2 n vdots amp vdots amp ddots amp vdots a n 1 amp a n 2 amp cdots amp a n n end bmatrix nbsp The entries a 1 1 displaystyle a 1 1 nbsp etc are for many purposes real or complex numbers As discussed below the determinant is also defined for matrices whose entries are in a commutative ring The determinant of A is denoted by det A or it can be denoted directly in terms of the matrix entries by writing enclosing bars instead of brackets a 1 1 a 1 2 a 1 n a 2 1 a 2 2 a 2 n a n 1 a n 2 a n n displaystyle begin vmatrix a 1 1 amp a 1 2 amp cdots amp a 1 n a 2 1 amp a 2 2 amp cdots amp a 2 n vdots amp vdots amp ddots amp vdots a n 1 amp a n 2 amp cdots amp a n n end vmatrix nbsp There are various equivalent ways to define the determinant of a square matrix A i e one with the same number of rows and columns the determinant can be defined via the Leibniz formula an explicit formula involving sums of products of certain entries of the matrix The determinant can also be characterized as the unique function depending on the entries of the matrix satisfying certain properties This approach can also be used to compute determinants by simplifying the matrices in question Leibniz formula Edit Main article Leibniz formula for determinants 3 3 matrices Edit The Leibniz formula for the determinant of a 3 3 matrix is the following a b c d e f g h i a e i b f g c d h c e g b d i a f h displaystyle begin vmatrix a amp b amp c d amp e amp f g amp h amp i end vmatrix aei bfg cdh ceg bdi afh nbsp In this expression each term has one factor from each row all in different columns For example bdi has b from the first row second column d from the second row first column and i from the third row third column The signs are determined by how many transpositions of factors are necessary to arrange the factors in increasing order of their columns positive for an even number of transpositions and negative for an odd number For the example of bdi the single transposition of bd to db gives dbi whose three factors are from the first second and thrd columns respectively this is an odd number of transpositions so the term appears with negative sign nbsp Rule of SarrusThe rule of Sarrus is a mnemonic for the expanded form of this determinant the sum of the products of three diagonal north west to south east lines of matrix elements minus the sum of the products of three diagonal south west to north east lines of elements when the copies of the first two columns of the matrix are written beside it as in the illustration This scheme for calculating the determinant of a 3 3 matrix does not carry over into higher dimensions n n matrices Edit Generalizing the above to higher dimensions the determinant of an n n displaystyle n times n nbsp matrix is an expression involving permutations and their signatures A permutation of the set 1 2 n displaystyle 1 2 dots n nbsp is a bijective function s displaystyle sigma nbsp from this set to itself with values s 1 s 2 s n displaystyle sigma 1 sigma 2 ldots sigma n nbsp exhausting the entire set The set of all such permutations called the symmetric group is commonly denoted S n displaystyle S n nbsp The signature sgn s displaystyle operatorname sgn sigma nbsp of a permutation s displaystyle sigma nbsp is 1 displaystyle 1 nbsp if the permutation can be obtained with an even number of transpositions exchanges of two entries otherwise it is 1 displaystyle 1 nbsp Given a matrix A a 1 1 a 1 n a n 1 a n n displaystyle A begin bmatrix a 1 1 ldots a 1 n vdots qquad vdots a n 1 ldots a n n end bmatrix nbsp the Leibniz formula for its determinant is using sigma notation det A a 1 1 a 1 n a n 1 a n n s S n sgn s a 1 s 1 a n s n displaystyle det A begin vmatrix a 1 1 ldots a 1 n vdots qquad vdots a n 1 ldots a n n end vmatrix sum sigma in S n operatorname sgn sigma a 1 sigma 1 cdots a n sigma n nbsp Using pi notation this can be shortened into det A s S n sgn s i 1 n a i s i displaystyle det A sum sigma in S n left operatorname sgn sigma prod i 1 n a i sigma i right nbsp The Levi Civita symbol e i 1 i n displaystyle varepsilon i 1 ldots i n nbsp is defined on the n tuples of integers in 1 n displaystyle 1 ldots n nbsp as 0 if two of the integers are equal and otherwise as the signature of the permutation defined by the n tuple of integers With the Levi Civita symbol the Leibniz formula becomes det A i 1 i 2 i n e i 1 i n a 1 i 1 a n i n displaystyle det A sum i 1 i 2 ldots i n varepsilon i 1 cdots i n a 1 i 1 cdots a n i n nbsp where the sum is taken over all n tuples of integers in 1 n displaystyle 1 ldots n nbsp 4 5 Properties of the determinant EditCharacterization of the determinant Edit The determinant can be characterized by the following three key properties To state these it is convenient to regard an n n displaystyle n times n nbsp matrix A as being composed of its n displaystyle n nbsp columns so denoted as A a 1 a n displaystyle A big a 1 dots a n big nbsp where the column vector a i displaystyle a i nbsp for each i is composed of the entries of the matrix in the i th column det I 1 displaystyle det left I right 1 nbsp where I displaystyle I nbsp is an identity matrix The determinant is multilinear if the jth column of a matrix A displaystyle A nbsp is written as a linear combination a j r v w displaystyle a j r cdot v w nbsp of two column vectors v and w and a number r then the determinant of A is expressible as a similar linear combination A a 1 a j 1 r v w a j 1 a n r a 1 v a n a 1 w a n displaystyle begin aligned A amp big a 1 dots a j 1 r cdot v w a j 1 dots a n amp r cdot a 1 dots v dots a n a 1 dots w dots a n end aligned nbsp The determinant is alternating whenever two columns of a matrix are identical its determinant is 0 a 1 v v a n 0 displaystyle a 1 dots v dots v dots a n 0 nbsp If the determinant is defined using the Leibniz formula as above these three properties can be proved by direct inspection of that formula Some authors also approach the determinant directly using these three properties it can be shown that there is exactly one function that assigns to any n n displaystyle n times n nbsp matrix A a number that satisfies these three properties 6 This also shows that this more abstract approach to the determinant yields the same definition as the one using the Leibniz formula To see this it suffices to expand the determinant by multi linearity in the columns into a huge linear combination of determinants of matrices in which each column is a standard basis vector These determinants are either 0 by property 9 or else 1 by properties 1 and 12 below so the linear combination gives the expression above in terms of the Levi Civita symbol While less technical in appearance this characterization cannot entirely replace the Leibniz formula in defining the determinant since without it the existence of an appropriate function is not clear citation needed Immediate consequences Edit These rules have several further consequences The determinant is a homogeneous function i e det c A c n det A displaystyle det cA c n det A nbsp for an n n displaystyle n times n nbsp matrix A displaystyle A nbsp Interchanging any pair of columns of a matrix multiplies its determinant by 1 This follows from the determinant being multilinear and alternating properties 2 and 3 above a 1 a j a i a n a 1 a i a j a n displaystyle a 1 dots a j dots a i dots a n a 1 dots a i dots a j dots a n nbsp This formula can be applied iteratively when several columns are swapped For example a 3 a 1 a 2 a 4 a n a 1 a 3 a 2 a 4 a n a 1 a 2 a 3 a 4 a n displaystyle a 3 a 1 a 2 a 4 dots a n a 1 a 3 a 2 a 4 dots a n a 1 a 2 a 3 a 4 dots a n nbsp Yet more generally any permutation of the columns multiplies the determinant by the sign of the permutation If some column can be expressed as a linear combination of the other columns i e the columns of the matrix form a linearly dependent set the determinant is 0 As a special case this includes if some column is such that all its entries are zero then the determinant of that matrix is 0 Adding a scalar multiple of one column to another column does not change the value of the determinant This is a consequence of multilinearity and being alternative by multilinearity the determinant changes by a multiple of the determinant of a matrix with two equal columns which determinant is 0 since the determinant is alternating If A displaystyle A nbsp is a triangular matrix i e a i j 0 displaystyle a ij 0 nbsp whenever i gt j displaystyle i gt j nbsp or alternatively whenever i lt j displaystyle i lt j nbsp then its determinant equals the product of the diagonal entries det A a 11 a 22 a n n i 1 n a i i displaystyle det A a 11 a 22 cdots a nn prod i 1 n a ii nbsp Indeed such a matrix can be reduced by appropriately adding multiples of the columns with fewer nonzero entries to those with more entries to a diagonal matrix without changing the determinant For such a matrix using the linearity in each column reduces to the identity matrix in which case the stated formula holds by the very first characterizing property of determinants Alternatively this formula can also be deduced from the Leibniz formula since the only permutation s displaystyle sigma nbsp which gives a non zero contribution is the identity permutation Example Edit These characterizing properties and their consequences listed above are both theoretically significant but can also be used to compute determinants for concrete matrices In fact Gaussian elimination can be applied to bring any matrix into upper triangular form and the steps in this algorithm affect the determinant in a controlled way The following concrete example illustrates the computation of the determinant of the matrix A displaystyle A nbsp using that method A 2 1 2 2 1 4 3 3 1 displaystyle A begin bmatrix 2 amp 1 amp 2 2 amp 1 amp 4 3 amp 3 amp 1 end bmatrix nbsp Computation of the determinant of matrix A displaystyle A nbsp Matrix B 3 1 2 3 1 4 0 3 1 displaystyle B begin bmatrix 3 amp 1 amp 2 3 amp 1 amp 4 0 amp 3 amp 1 end bmatrix nbsp C 3 5 2 3 13 4 0 0 1 displaystyle C begin bmatrix 3 amp 5 amp 2 3 amp 13 amp 4 0 amp 0 amp 1 end bmatrix nbsp D 5 3 2 13 3 4 0 0 1 displaystyle D begin bmatrix 5 amp 3 amp 2 13 amp 3 amp 4 0 amp 0 amp 1 end bmatrix nbsp E 18 3 2 0 3 4 0 0 1 displaystyle E begin bmatrix 18 amp 3 amp 2 0 amp 3 amp 4 0 amp 0 amp 1 end bmatrix nbsp Obtained by add the second column to the first add 3 times the third column to the second swap the first two columns add 13 3 displaystyle frac 13 3 nbsp times the second column to the firstDeterminant A B displaystyle A B nbsp B C displaystyle B C nbsp D C displaystyle D C nbsp E D displaystyle E D nbsp Combining these equalities gives A E 18 3 1 54 displaystyle A E 18 cdot 3 cdot 1 54 nbsp Transpose Edit The determinant of the transpose of A displaystyle A nbsp equals the determinant of A det A T det A displaystyle det left A textsf T right det A nbsp This can be proven by inspecting the Leibniz formula 7 This implies that in all the properties mentioned above the word column can be replaced by row throughout For example viewing an n n matrix as being composed of n rows the determinant is an n linear function Multiplicativity and matrix groups Edit The determinant is a multiplicative map i e for square matrices A displaystyle A nbsp and B displaystyle B nbsp of equal size the determinant of a matrix product equals the product of their determinants det A B det A det B displaystyle det AB det A det B nbsp This key fact can be proven by observing that for a fixed matrix B displaystyle B nbsp both sides of the equation are alternating and multilinear as a function depending on the columns of A displaystyle A nbsp Moreover they both take the value det B displaystyle det B nbsp when A displaystyle A nbsp is the identity matrix The above mentioned unique characterization of alternating multilinear maps therefore shows this claim 8 A matrix A displaystyle A nbsp with entries in a field is invertible precisely if its determinant is nonzero This follows from the multiplicativity of the determinant and the formula for the inverse involving the adjugate matrix mentioned below In this event the determinant of the inverse matrix is given by det A 1 1 det A det A 1 displaystyle det left A 1 right frac 1 det A det A 1 nbsp In particular products and inverses of matrices with non zero determinant respectively determinant one still have this property Thus the set of such matrices of fixed size n displaystyle n nbsp over a field K displaystyle K nbsp forms a group known as the general linear group GL n K displaystyle operatorname GL n K nbsp respectively a subgroup called the special linear group SL n K GL n K displaystyle operatorname SL n K subset operatorname GL n K nbsp More generally the word special indicates the subgroup of another matrix group of matrices of determinant one Examples include the special orthogonal group which if n is 2 or 3 consists of all rotation matrices and the special unitary group Because the determinant respects multiplication and inverses it is in fact a group homomorphism from GL n K displaystyle operatorname GL n K nbsp into the multiplicative group K displaystyle K times nbsp of nonzero elements of K displaystyle K nbsp This homomorphism is surjective and its kernel is SL n K displaystyle operatorname SL n K nbsp the matrices with determinant one Hence by the first isomorphism theorem this shows that SL n K displaystyle operatorname SL n K nbsp is a normal subgroup of GL n K displaystyle operatorname GL n K nbsp and that the quotient group GL n K SL n K displaystyle operatorname GL n K operatorname SL n K nbsp is isomorphic to K displaystyle K times nbsp The Cauchy Binet formula is a generalization of that product formula for rectangular matrices This formula can also be recast as a multiplicative formula for compound matrices whose entries are the determinants of all quadratic submatrices of a given matrix 9 10 Laplace expansion Edit Laplace expansion expresses the determinant of a matrix A displaystyle A nbsp recursively in terms of determinants of smaller matrices known as its minors The minor M i j displaystyle M i j nbsp is defined to be the determinant of the n 1 n 1 displaystyle n 1 times n 1 nbsp matrix that results from A displaystyle A nbsp by removing the i displaystyle i nbsp th row and the j displaystyle j nbsp th column The expression 1 i j M i j displaystyle 1 i j M i j nbsp is known as a cofactor For every i displaystyle i nbsp one has the equality det A j 1 n 1 i j a i j M i j displaystyle det A sum j 1 n 1 i j a ij M ij nbsp which is called the Laplace expansion along the i th row For example the Laplace expansion along the first row i 1 displaystyle i 1 nbsp gives the following formula a b c d e f g h i a e f h i b d f g i c d e g h displaystyle begin vmatrix a amp b amp c d amp e amp f g amp h amp i end vmatrix a begin vmatrix e amp f h amp i end vmatrix b begin vmatrix d amp f g amp i end vmatrix c begin vmatrix d amp e g amp h end vmatrix nbsp Unwinding the determinants of these 2 2 displaystyle 2 times 2 nbsp matrices gives back the Leibniz formula mentioned above Similarly the Laplace expansion along the j displaystyle j nbsp th column is the equality det A i 1 n 1 i j a i j M i j displaystyle det A sum i 1 n 1 i j a ij M ij nbsp Laplace expansion can be used iteratively for computing determinants but this approach is inefficient for large matrices However it is useful for computing the determinants of highly symmetric matrix such as the Vandermonde matrix 1 1 1 1 x 1 x 2 x 3 x n x 1 2 x 2 2 x 3 2 x n 2 x 1 n 1 x 2 n 1 x 3 n 1 x n n 1 1 i lt j n x j x i displaystyle begin vmatrix 1 amp 1 amp 1 amp cdots amp 1 x 1 amp x 2 amp x 3 amp cdots amp x n x 1 2 amp x 2 2 amp x 3 2 amp cdots amp x n 2 vdots amp vdots amp vdots amp ddots amp vdots x 1 n 1 amp x 2 n 1 amp x 3 n 1 amp cdots amp x n n 1 end vmatrix prod 1 leq i lt j leq n left x j x i right nbsp The n term Laplace expansion along a row or column can be generalized to write an n x n determinant as a sum of n k displaystyle tbinom n k nbsp terms each the product of the determinant of a k x k submatrix and the determinant of the complementary n k x n k submatrix Adjugate matrix Edit The adjugate matrix adj A displaystyle operatorname adj A nbsp is the transpose of the matrix of the cofactors that is adj A i j 1 i j M j i displaystyle operatorname adj A ij 1 i j M ji nbsp For every matrix one has 11 det A I A adj A adj A A displaystyle det A I A operatorname adj A operatorname adj A A nbsp Thus the adjugate matrix can be used for expressing the inverse of a nonsingular matrix A 1 1 det A adj A displaystyle A 1 frac 1 det A operatorname adj A nbsp Block matrices Edit The formula for the determinant of a 2 2 displaystyle 2 times 2 nbsp matrix above continues to hold under appropriate further assumptions for a block matrix i e a matrix composed of four submatrices A B C D displaystyle A B C D nbsp of dimension m m displaystyle m times m nbsp m n displaystyle m times n nbsp n m displaystyle n times m nbsp and n n displaystyle n times n nbsp respectively The easiest such formula which can be proven using either the Leibniz formula or a factorization involving the Schur complement is det A 0 C D det A det D det A B 0 D displaystyle det begin pmatrix A amp 0 C amp D end pmatrix det A det D det begin pmatrix A amp B 0 amp D end pmatrix nbsp If A displaystyle A nbsp is invertible then it follows with results from the section on multiplicativity that det A B C D det A det A B C D det A 1 A 1 B 0 I n det A 1 det A 1 det A det I m 0 C A 1 D C A 1 B det A det D C A 1 B displaystyle begin aligned det begin pmatrix A amp B C amp D end pmatrix amp det A det begin pmatrix A amp B C amp D end pmatrix underbrace det begin pmatrix A 1 amp A 1 B 0 amp I n end pmatrix det A 1 det A 1 amp det A det begin pmatrix I m amp 0 CA 1 amp D CA 1 B end pmatrix amp det A det D CA 1 B end aligned nbsp which simplifies to det A D C A 1 B displaystyle det A D CA 1 B nbsp when D displaystyle D nbsp is a 1 1 displaystyle 1 times 1 nbsp matrix A similar result holds when D displaystyle D nbsp is invertible namely det A B C D det D det A B C D det I m 0 D 1 C D 1 det D 1 det D 1 det D det A B D 1 C B D 1 0 I n det D det A B D 1 C displaystyle begin aligned det begin pmatrix A amp B C amp D end pmatrix amp det D det begin pmatrix A amp B C amp D end pmatrix underbrace det begin pmatrix I m amp 0 D 1 C amp D 1 end pmatrix det D 1 det D 1 amp det D det begin pmatrix A BD 1 C amp BD 1 0 amp I n end pmatrix amp det D det A BD 1 C end aligned nbsp Both results can be combined to derive Sylvester s determinant theorem which is also stated below If the blocks are square matrices of the same size further formulas hold For example if C displaystyle C nbsp and D displaystyle D nbsp commute i e C D D C displaystyle CD DC nbsp then 12 det A B C D det A D B C displaystyle det begin pmatrix A amp B C amp D end pmatrix det AD BC nbsp This formula has been generalized to matrices composed of more than 2 2 displaystyle 2 times 2 nbsp blocks again under appropriate commutativity conditions among the individual blocks 13 For A D displaystyle A D nbsp and B C displaystyle B C nbsp the following formula holds even if A displaystyle A nbsp and B displaystyle B nbsp do not commute citation needed det A B B A det A B det A B displaystyle det begin pmatrix A amp B B amp A end pmatrix det A B det A B nbsp Sylvester s determinant theorem Edit Sylvester s determinant theorem states that for A an m n matrix and B an n m matrix so that A and B have dimensions allowing them to be multiplied in either order forming a square matrix det I m A B det I n B A displaystyle det left I mathit m AB right det left I mathit n BA right nbsp where Im and In are the m m and n n identity matrices respectively From this general result several consequences follow For the case of column vector c and row vector r each with m components the formula allows quick calculation of the determinant of a matrix that differs from the identity matrix by a matrix of rank 1 det I m c r 1 r c displaystyle det left I mathit m cr right 1 rc nbsp More generally 14 for any invertible m m matrix X det X A B det X det I m B X 1 A displaystyle det X AB det X det left I mathit m BX 1 A right nbsp For a column and row vector as above det X c r det X det 1 r X 1 c det X r adj X c displaystyle det X cr det X det left 1 rX 1 c right det X r operatorname adj X c nbsp For square matrices A displaystyle A nbsp and B displaystyle B nbsp of the same size the matrices A B displaystyle AB nbsp and B A displaystyle BA nbsp have the same characteristic polynomials hence the same eigenvalues Sum Edit The determinant of the sum A B displaystyle A B nbsp of two square matrices of the same size is not in general expressible in terms of the determinants of A and of B However for positive semidefinite matrices A displaystyle A nbsp B displaystyle B nbsp and C displaystyle C nbsp of equal size det A B C det C det A C det B C displaystyle det A B C det C geq det A C det B C text nbsp with the corollary 15 16 det A B det A det B displaystyle det A B geq det A det B text nbsp Conversely if A displaystyle A nbsp and B displaystyle B nbsp are Hermitian positive definite and size n n displaystyle n times n nbsp then the determinant has concave n displaystyle n nbsp th root 17 this implies det A B n det A n det B n displaystyle sqrt n det A B geq sqrt n det A sqrt n det B nbsp by homogeneity Sum identity for 2 2 matrices Edit For the special case of 2 2 displaystyle 2 times 2 nbsp matrices with complex entries the determinant of the sum can be written in terms of determinants and traces in the following identity det A B det A det B tr A tr B tr A B displaystyle det A B det A det B text tr A text tr B text tr AB nbsp Proof of identity This can be shown by writing out each term in components A i j B i j displaystyle A ij B ij nbsp The left hand side is A 11 B 11 A 22 B 22 A 12 B 12 A 21 B 21 displaystyle A 11 B 11 A 22 B 22 A 12 B 12 A 21 B 21 nbsp Expanding gives A 11 A 22 B 11 A 22 A 11 B 22 B 11 B 22 A 12 A 21 B 12 A 21 A 12 B 21 B 12 B 21 displaystyle A 11 A 22 B 11 A 22 A 11 B 22 B 11 B 22 A 12 A 21 B 12 A 21 A 12 B 21 B 12 B 21 nbsp The terms which are quadratic in A displaystyle A nbsp are seen to be det A displaystyle det A nbsp and similarly for B displaystyle B nbsp so the expression can be written det A det B A 11 B 22 B 11 A 22 A 12 B 21 B 12 A 21 displaystyle det A det B A 11 B 22 B 11 A 22 A 12 B 21 B 12 A 21 nbsp We can then write the cross terms as A 11 A 22 B 11 B 22 A 11 B 11 A 12 B 21 A 21 B 12 A 22 B 22 displaystyle A 11 A 22 B 11 B 22 A 11 B 11 A 12 B 21 A 21 B 12 A 22 B 22 nbsp which can be recognized as tr A tr B tr A B displaystyle text tr A text tr B text tr AB nbsp which completes the proof This has an application to 2 2 displaystyle 2 times 2 nbsp matrix algebras For example consider the complex numbers as a matrix algebra The complex numbers have a representation as matrices of the forma I b i a 1 0 0 1 b 0 1 1 0 displaystyle aI b mathbf i a begin pmatrix 1 amp 0 0 amp 1 end pmatrix b begin pmatrix 0 amp 1 1 amp 0 end pmatrix nbsp with a displaystyle a nbsp and b displaystyle b nbsp real Since tr i 0 displaystyle text tr mathbf i 0 nbsp taking A a I displaystyle A aI nbsp and B b i displaystyle B b mathbf i nbsp in the above identity gives det a I b i a 2 det I b 2 det i a 2 b 2 displaystyle det aI b mathbf i a 2 det I b 2 det mathbf i a 2 b 2 nbsp This result followed just from tr i 0 displaystyle text tr mathbf i 0 nbsp and det I det i 1 displaystyle det I det mathbf i 1 nbsp Properties of the determinant in relation to other notions EditEigenvalues and characteristic polynomial Edit The determinant is closely related to two other central concepts in linear algebra the eigenvalues and the characteristic polynomial of a matrix Let A displaystyle A nbsp be an n n displaystyle n times n nbsp matrix with complex entries Then by the Fundamental Theorem of Algebra A displaystyle A nbsp must have exactly n eigenvalues l 1 l 2 l n displaystyle lambda 1 lambda 2 ldots lambda n nbsp Here it is understood that an eigenvalue with algebraic multiplicity m occurs m times in this list Then it turns out the determinant of A is equal to the product of these eigenvalues det A i 1 n l i l 1 l 2 l n displaystyle det A prod i 1 n lambda i lambda 1 lambda 2 cdots lambda n nbsp The product of all non zero eigenvalues is referred to as pseudo determinant From this one immediately sees that the determinant of a matrix A displaystyle A nbsp is zero iff 0 displaystyle 0 nbsp is an eigenvalue of A displaystyle A nbsp In other words A displaystyle A nbsp is invertible iff 0 displaystyle 0 nbsp is not an eigenvalue of A displaystyle A nbsp The characteristic polynomial is defined as 18 x A t det t I A displaystyle chi A t det t cdot I A nbsp Here t displaystyle t nbsp is the indeterminate of the polynomial and I displaystyle I nbsp is the identity matrix of the same size as A displaystyle A nbsp By means of this polynomial determinants can be used to find the eigenvalues of the matrix A displaystyle A nbsp they are precisely the roots of this polynomial i e those complex numbers l displaystyle lambda nbsp such that x A l 0 displaystyle chi A lambda 0 nbsp A Hermitian matrix is positive definite if all its eigenvalues are positive Sylvester s criterion asserts that this is equivalent to the determinants of the submatrices A k a 1 1 a 1 2 a 1 k a 2 1 a 2 2 a 2 k a k 1 a k 2 a k k displaystyle A k begin bmatrix a 1 1 amp a 1 2 amp cdots amp a 1 k a 2 1 amp a 2 2 amp cdots amp a 2 k vdots amp vdots amp ddots amp vdots a k 1 amp a k 2 amp cdots amp a k k end bmatrix nbsp being positive for all k displaystyle k nbsp between 1 displaystyle 1 nbsp and n displaystyle n nbsp 19 Trace Edit The trace tr A is by definition the sum of the diagonal entries of A and also equals the sum of the eigenvalues Thus for complex matrices A det exp A exp tr A displaystyle det exp A exp operatorname tr A nbsp or for real matrices A tr A log det exp A displaystyle operatorname tr A log det exp A nbsp Here exp A denotes the matrix exponential of A because every eigenvalue l of A corresponds to the eigenvalue exp l of exp A In particular given any logarithm of A that is any matrix L satisfying exp L A displaystyle exp L A nbsp the determinant of A is given by det A exp tr L displaystyle det A exp operatorname tr L nbsp For example for n 2 n 3 and n 4 respectively det A 1 2 tr A 2 tr A 2 det A 1 6 tr A 3 3 tr A tr A 2 2 tr A 3 det A 1 24 tr A 4 6 tr A 2 tr A 2 3 tr A 2 2 8 tr A 3 tr A 6 tr A 4 displaystyle begin aligned det A amp frac 1 2 left left operatorname tr A right 2 operatorname tr left A 2 right right det A amp frac 1 6 left left operatorname tr A right 3 3 operatorname tr A operatorname tr left A 2 right 2 operatorname tr left A 3 right right det A amp frac 1 24 left left operatorname tr A right 4 6 operatorname tr left A 2 right left operatorname tr A right 2 3 left operatorname tr left A 2 right right 2 8 operatorname tr left A 3 right operatorname tr A 6 operatorname tr left A 4 right right end aligned nbsp cf Cayley Hamilton theorem Such expressions are deducible from combinatorial arguments Newton s identities or the Faddeev LeVerrier algorithm That is for generic n detA 1 nc0 the signed constant term of the characteristic polynomial determined recursively from c n 1 c n m 1 m k 1 m c n m k tr A k 1 m n displaystyle c n 1 c n m frac 1 m sum k 1 m c n m k operatorname tr left A k right 1 leq m leq n nbsp In the general case this may also be obtained from 20 det A k 1 k 2 k n 0 k 1 2 k 2 n k n n l 1 n 1 k l 1 l k l k l tr A l k l displaystyle det A sum begin array c k 1 k 2 ldots k n geq 0 k 1 2k 2 cdots nk n n end array prod l 1 n frac 1 k l 1 l k l k l operatorname tr left A l right k l nbsp where the sum is taken over the set of all integers kl 0 satisfying the equation l 1 n l k l n displaystyle sum l 1 n lk l n nbsp The formula can be expressed in terms of the complete exponential Bell polynomial of n arguments sl l 1 tr Al as det A 1 n n B n s 1 s 2 s n displaystyle det A frac 1 n n B n s 1 s 2 ldots s n nbsp This formula can also be used to find the determinant of a matrix AIJ with multidimensional indices I i1 i2 ir and J j1 j2 jr The product and trace of such matrices are defined in a natural way as A B J I K A K I B J K tr A I A I I displaystyle AB J I sum K A K I B J K operatorname tr A sum I A I I nbsp An important arbitrary dimension n identity can be obtained from the Mercator series expansion of the logarithm when the expansion converges If every eigenvalue of A is less than 1 in absolute value det I A k 0 1 k j 1 1 j j tr A j k displaystyle det I A sum k 0 infty frac 1 k left sum j 1 infty frac 1 j j operatorname tr left A j right right k nbsp where I is the identity matrix More generally if k 0 1 k j 1 1 j s j j tr A j k displaystyle sum k 0 infty frac 1 k left sum j 1 infty frac 1 j s j j operatorname tr left A j right right k nbsp is expanded as a formal power series in s then all coefficients of s m for m gt n are zero and the remaining polynomial is det I sA Upper and lower bounds Edit For a positive definite matrix A the trace operator gives the following tight lower and upper bounds on the log determinant tr I A 1 log det A tr A I displaystyle operatorname tr left I A 1 right leq log det A leq operatorname tr A I nbsp with equality if and only if A I This relationship can be derived via the formula for the Kullback Leibler divergence between two multivariate normal distributions Also n tr A 1 det A 1 n 1 n tr A 1 n tr A 2 displaystyle frac n operatorname tr left A 1 right leq det A frac 1 n leq frac 1 n operatorname tr A leq sqrt frac 1 n operatorname tr left A 2 right nbsp These inequalities can be proved by expressing the traces and the determinant in terms of the eigenvalues As such they represent the well known fact that the harmonic mean is less than the geometric mean which is less than the arithmetic mean which is in turn less than the root mean square Derivative Edit The Leibniz formula shows that the determinant of real or analogously for complex square matrices is a polynomial function from R n n displaystyle mathbf R n times n nbsp to R displaystyle mathbf R nbsp In particular it is everywhere differentiable Its derivative can be expressed using Jacobi s formula 21 d det A d a tr adj A d A d a displaystyle frac d det A d alpha operatorname tr left operatorname adj A frac dA d alpha right nbsp where adj A displaystyle operatorname adj A nbsp denotes the adjugate of A displaystyle A nbsp In particular if A displaystyle A nbsp is invertible we have d det A d a det A tr A 1 d A d a displaystyle frac d det A d alpha det A operatorname tr left A 1 frac dA d alpha right nbsp Expressed in terms of the entries of A displaystyle A nbsp these are det A A i j adj A j i det A A 1 j i displaystyle frac partial det A partial A ij operatorname adj A ji det A left A 1 right ji nbsp Yet another equivalent formulation is det A ϵ X det A tr adj A X ϵ O ϵ 2 det A tr A 1 X ϵ O ϵ 2 displaystyle det A epsilon X det A operatorname tr operatorname adj A X epsilon O left epsilon 2 right det A operatorname tr left A 1 X right epsilon O left epsilon 2 right nbsp using big O notation The special case where A I displaystyle A I nbsp the identity matrix yields det I ϵ X 1 tr X ϵ O ϵ 2 displaystyle det I epsilon X 1 operatorname tr X epsilon O left epsilon 2 right nbsp This identity is used in describing Lie algebras associated to certain matrix Lie groups For example the special linear group SL n displaystyle operatorname SL n nbsp is defined by the equation det A 1 displaystyle det A 1 nbsp The above formula shows that its Lie algebra is the special linear Lie algebra s l n displaystyle mathfrak sl n nbsp consisting of those matrices having trace zero Writing a 3 3 displaystyle 3 times 3 nbsp matrix as A a b c displaystyle A begin bmatrix a amp b amp c end bmatrix nbsp where a b c displaystyle a b c nbsp are column vectors of length 3 then the gradient over one of the three vectors may be written as the cross product of the other two a det A b c b det A c a c det A a b displaystyle begin aligned nabla mathbf a det A amp mathbf b times mathbf c nabla mathbf b det A amp mathbf c times mathbf a nabla mathbf c det A amp mathbf a times mathbf b end aligned nbsp History EditHistorically determinants were used long before matrices A determinant was originally defined as a property of a system of linear equations The determinant determines whether the system has a unique solution which occurs precisely if the determinant is non zero In this sense determinants were first used in the Chinese mathematics textbook The Nine Chapters on the Mathematical Art 九章算術 Chinese scholars around the 3rd century BCE In Europe solutions of linear systems of two equations were expressed by Cardano in 1545 by a determinant like entity 22 Determinants proper originated from the work of Seki Takakazu in 1683 in Japan and parallelly of Leibniz in 1693 23 24 25 26 Cramer 1750 stated without proof Cramer s rule 27 Both Cramer and also Bezout 1779 were led to determinants by the question of plane curves passing through a given set of points 28 Vandermonde 1771 first recognized determinants as independent functions 24 Laplace 1772 gave the general method of expanding a determinant in terms of its complementary minors Vandermonde had already given a special case 29 Immediately following Lagrange 1773 treated determinants of the second and third order and applied it to questions of elimination theory he proved many special cases of general identities Gauss 1801 made the next advance Like Lagrange he made much use of determinants in the theory of numbers He introduced the word determinant Laplace had used resultant though not in the present signification but rather as applied to the discriminant of a quantic 30 Gauss also arrived at the notion of reciprocal inverse determinants and came very near the multiplication theorem clarification needed The next contributor of importance is Binet 1811 1812 who formally stated the theorem relating to the product of two matrices of m columns and n rows which for the special case of m n reduces to the multiplication theorem On the same day November 30 1812 that Binet presented his paper to the Academy Cauchy also presented one on the subject See Cauchy Binet formula In this he used the word determinant in its present sense 31 32 summarized and simplified what was then known on the subject improved the notation and gave the multiplication theorem with a proof more satisfactory than Binet s 24 33 With him begins the theory in its generality Jacobi 1841 used the functional determinant which Sylvester later called the Jacobian 34 In his memoirs in Crelle s Journal for 1841 he specially treats this subject as well as the class of alternating functions which Sylvester has called alternants About the time of Jacobi s last memoirs Sylvester 1839 and Cayley began their work Cayley 1841 introduced the modern notation for the determinant using vertical bars 35 36 The study of special forms of determinants has been the natural result of the completion of the general theory Axisymmetric determinants have been studied by Lebesgue Hesse and Sylvester persymmetric determinants by Sylvester and Hankel circulants by Catalan Spottiswoode Glaisher and Scott skew determinants and Pfaffians in connection with the theory of orthogonal transformation by Cayley continuants by Sylvester Wronskians so called by Muir by Christoffel and Frobenius compound determinants by Sylvester Reiss and Picquet Jacobians and Hessians by Sylvester and symmetric gauche determinants by Trudi Of the textbooks on the subject Spottiswoode s was the first In America Hanus 1886 Weld 1893 and Muir Metzler 1933 published treatises Applications EditCramer s rule Edit Determinants can be used to describe the solutions of a linear system of equations written in matrix form as A x b displaystyle Ax b nbsp This equation has a unique solution x displaystyle x nbsp if and only if det A displaystyle det A nbsp is nonzero In this case the solution is given by Cramer s rule x i det A i det A i 1 2 3 n displaystyle x i frac det A i det A qquad i 1 2 3 ldots n nbsp where A i displaystyle A i nbsp is the matrix formed by replacing the i displaystyle i nbsp th column of A displaystyle A nbsp by the column vector b displaystyle b nbsp This follows immediately by column expansion of the determinant i e det A i det a 1 b a n j 1 n x j det a 1 a i 1 a j a i 1 a n x i det A displaystyle det A i det begin bmatrix a 1 amp ldots amp b amp ldots amp a n end bmatrix sum j 1 n x j det begin bmatrix a 1 amp ldots amp a i 1 amp a j amp a i 1 amp ldots amp a n end bmatrix x i det A nbsp where the vectors a j displaystyle a j nbsp are the columns of A The rule is also implied by the identity A adj A adj A A det A I n displaystyle A operatorname adj A operatorname adj A A det A I n nbsp Cramer s rule can be implemented in O n 3 displaystyle operatorname O n 3 nbsp time which is comparable to more common methods of solving systems of linear equations such as LU QR or singular value decomposition 37 Linear independence Edit Determinants can be used to characterize linearly dependent vectors det A displaystyle det A nbsp is zero if and only if the column vectors or equivalently the row vectors of the matrix A displaystyle A nbsp are linearly dependent 38 For example given two linearly independent vectors v 1 v 2 R 3 displaystyle v 1 v 2 in mathbf R 3 nbsp a third vector v 3 displaystyle v 3 nbsp lies in the plane spanned by the former two vectors exactly if the determinant of the 3 3 displaystyle 3 times 3 nbsp matrix consisting of the three vectors is zero The same idea is also used in the theory of differential equations given functions f 1 x f n x displaystyle f 1 x dots f n x nbsp supposed to be n 1 displaystyle n 1 nbsp times differentiable the Wronskian is defined to be W f 1 f n x f 1 x f 2 x f n x f 1 x f 2 x f n x f 1 n 1 x f 2 n 1 x f n n 1 x displaystyle W f 1 ldots f n x begin vmatrix f 1 x amp f 2 x amp cdots amp f n x f 1 x amp f 2 x amp cdots amp f n x vdots amp vdots amp ddots amp vdots f 1 n 1 x amp f 2 n 1 x amp cdots amp f n n 1 x end vmatrix nbsp It is non zero for some x displaystyle x nbsp in a specified interval if and only if the given functions and all their derivatives up to order n 1 displaystyle n 1 nbsp are linearly independent If it can be shown that the Wronskian is zero everywhere on an interval then in the case of analytic functions this implies the given functions are linearly dependent See the Wronskian and linear independence Another such use of the determinant is the resultant which gives a criterion when two polynomials have a common root 39 Orientation of a basis Edit Main article Orientation vector space The determinant can be thought of as assigning a number to every sequence of n vectors in Rn by using the square matrix whose columns are the given vectors The determinant will be nonzero iff the sequence of vectors is a basis for Rn In that case the sign of the determinant determines whether the orientation of the basis is consistent with or opposite to the orientation of the standard basis In the case of an orthogonal basis the magnitude of the determinant is equal to the product of the lengths of the basis vectors For instance an orthogonal matrix with entries in Rn represents an orthonormal basis in Euclidean space and hence has determinant of 1 since all the vectors have length 1 The determinant is 1 iff the basis has the same orientation It is 1 iff the basis has the opposite orientation More generally if the determinant of A is positive A represents an orientation preserving linear transformation if A is an orthogonal 2 2 or 3 3 matrix this is a rotation while if it is negative A switches the orientation of the basis Volume and Jacobian determinant Edit As pointed out above the absolute value of the determinant of real vectors is equal to the volume of the parallelepiped spanned by those vectors As a consequence if f R n R n displaystyle f mathbf R n to mathbf R n nbsp is the linear map given by multiplication with a matrix A displaystyle A nbsp spa, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.