We define the determinant after a few preliminaries
Let Vect be the category of vector spaces over a field , and assume for the moment that the characteristic . For each , let
be the 1-dimensional sign representation? on the symmetric group , taking each transposition to . We may linearly extend the sign action of , so that names a (right) -module with underlying vector space . At the same time, acts on the tensor product of a vector space by permuting tensor factors, giving a left -module structure on . We define the Schur functor
by the formula
It is called the alternating power (of ).
If is -dimensional, then has dimension . In particular, is 1-dimensional.
If is a basis for , then expressions of the form form a basis for . Let denote the image of this element under the quotient map . We have
(consider the transposition in which swaps and ) and so we may take only such expressions on the left where as forming a spanning set for , and indeed these form a basis. The number of such expressions is .
In the case where , the same development may be carried out by simply decreeing that whenever for some pair of distinct indices , .
Now let be an -dimensional space, and let be a linear map. By the proposition, the map
being an endomorphism on a 1-dimensional space, is given by multiplying by a scalar . It is manifestly functorial since is, i.e., . The quantity is called the determinant of .
We see then that if is of dimension ,
is a homomorphism of multiplicative monoids; by commutativity of multiplication in , we infer that
for each invertible linear map .
If we choose a basis of so that we have an identification , then the determinant gives a function
that takes products of matrices to products in . The determinant however is of course independent of choice of basis, since any two choices are related by a change-of-basis matrix , where and its transform have the same determinant.
By following the definitions above, we can give an explicit formula:
We work over fields of arbitrary characteristic. The determinant satisfies the following properties, which taken together uniquely characterize the determinant. Write a square matrix as a row of column vectors .
is separately linear in each column vector:
whenever for distinct .
, where is the identity matrix.
Other properties may be worked out, starting from the explicit formula or otherwise:
If is a diagonal matrix, then is the product of its diagonal entries.
More generally, if is an upper (or lower) triangular matrix, then is the product of the diagonal entries.
If is an extension field and is a -linear map , then . Using the preceding properties and the Jordan normal form? of a matrix, this means that is the product of its eigenvalues (counted with multiplicity), as computed in the algebraic closure of .
If is the transpose of , then .
A simple observation which flows from these basic properties is
(Cramer’s Rule)
Let be column vectors of dimension , and suppose
Then for each we have
where occurs as the column vector on the right.
This follows straightforwardly from properties 1 and 2 above.
For instance, given a square matrix such that , and writing , this allows us to solve the equation
and we easily conclude that is invertible if .
This holds true even if we replace the field by an arbitrary commutative ring , and we replace the condition by the condition that is a unit. (The entire development given above goes through, mutatis mutandis.)
Let be a commutative ring, and let be an matrix with entries in . Then there exists an matrix with entries in such that .
We may as well take to be the polynomial ring , since we are then free to interpret the indeterminates however we like along a ring map . Let denote the corresponding generic matrix.
Guided by Cramer’s rule, put
the being columns of and , the column vector with in the row and ’s elsewhere, appearing as the column. If we pretend is invertible, then we know by Cramer’s rule. We claim this holds for general .
Indeed, we can interpret this as a polynomial equation in and check it there. As an equation between polynomial functions on the space of matrices , it holds on the dense subset . Therefore, by continuity, it holds on all of . But a polynomial function equation with coefficients in implies the corresponding polynomial identity, and the proof is complete.
(Cayley-Hamilton) Let be a finitely generated free module over a commutative ring , and let be an -module map. Let be the characteristic polynomial of , and let be the unique -algebra map sending to . Then is the zero map .
Via , regard as an -module, and with regard to some -basis of , represent by a matrix . Now consider as an matrix with entries in . By definition of the module structure, this matrix , seen as acting on , annihilates the length column vector whose row entry is .
By the previous lemma, there is such that is times the identity matrix. It follows that
i.e., for each . Since the form an -basis, the -scalar annihilates the -module , as was to be shown.
The Cayley-Hamilton theorem easily generalizes to finitely generated -modules (not necessarily free) as follows. Let be a module endomorphism, and suppose is an epimorphism. Since is projective, the map can be lifted through to a map . Let be the characteristic polynomial of .
.
Write . We already know . From , it follows that for any . Hence . Since is epic, follows.
We give an interesting and perhaps surprising consequence of the Cayley-Hamilton theorem below, after establishing a lemma close in spirit to Nakayama's lemma.
Suppose is a finitely generated -module, and is a module map such that for some ideal of . Then there is a polynomial , with all , such that .
For some finite , we have a surjective map , and by hypothesis we have a surjective map in
By projectivity of , we can lift the bottom composite to a map making the diagram commute. Let be the -module map , regarded as a matrix. Then the characteristic polynomial of satisfies the conclusion, by the Cayley-Hamilton theorem.
Let be a finitely generated module over a commutative ring , and let be a surjective module map. Then is an isomorphism.
Regard as a finitely generated -module via . Since is assumed surjective, we have for the ideal of . Now take as in the preceding lemma, a module map over the ring . By the lemma, we see that where , in other words the -scalar
as an operator on . Write for polynomials . Now we may rewrite the previous displayed equation as
for all , which translates into saying that , i.e., that is a retraction of . Since is epic, we now see is an isomorphism.
A useful intuition to have for determinants of real matrices is that they measure change of volume. That is, an matrix with real entries will map a standard unit cube in to a parallelpiped in (quashed to lie in a hyperplane if the matrix is singular), and the determinant is, up to sign, the volume of the parallelpiped. It is easy to convince oneself of this in the planar case by a simple dissection of a parallelogram, rearranging the dissected pieces in the style of Euclid to form a rectangle. In algebraic terms, the dissection and rearrangement amount to applying shearing or elementary column operations to the matrix which, by the properties discussed earlier, leave the determinant unchanged. These operations transform the matrix into a diagonal matrix whose determinant is the area of the corresponding rectangle. This procedure easily generalizes to dimensions.
The sign itself is a matter of interest. An invertible transformation is said to be orientation-preserving if is positive, and orientation-reversing if is negative. Orientations play an important role throughout geometry and algebraic topology, for example in the study of orientable manifolds (where the tangent bundle as -bundle can be lifted to a -bundle structure, being the subgroup of matrices of positive determinant). See also KO-theory?.
Finally, we include one more property of determinants which pertains to matrices with real coefficients (which works slightly more generally for matrices with coefficients in a local field):
see Pfaffian for the moment
The proof of the Cayley-Hamilton theorem follows the treatment in
The proof of Proposition 3 on surjective endomorphisms of finitely generated modules was extracted from