nLab
determinant

Redirected from "descent theory".

Contents

Definition

We define the determinant after a few preliminaries

Let Vect k be the category of vector spaces over a field k, and assume for the moment that the characteristic char(k)2. For each j0, let

sgn j:S jhom(k,k)sgn_j \colon S_j \to \hom(k, k)

be the 1-dimensional sign representation? on the symmetric group S j, taking each transposition (ij) to 1k *. We may linearly extend the sign action of S j, so that sgn names a (right) kS j-module with underlying vector space k. At the same time, S j acts on the j th tensor product of a vector space V by permuting tensor factors, giving a left kS j-module structure on V j. We define the Schur functor

Λ j:Vect kVect k\Lambda^j \colon Vect_k \to Vect_k

by the formula

Λ j(V)=sgn j kS jV j.\Lambda^j(V) = sgn_j \otimes_{k S_j} V^{\otimes j}.

It is called the j th alternating power (of V).

Proposition

If V is n-dimensional, then Λ j(V) has dimension (nj). In particular, Λ n(V) is 1-dimensional.

Proof

If e 1,,e n is a basis for V, then expressions of the form e n 1e n j form a basis for V j. Let e n 1e n j denote the image of this element under the quotient map V jΛ j(V). We have

e n 1e n ie n i+1e n j=e n 1e n i+1e n ie n je_{n_1} \wedge \ldots \wedge e_{n_i} \wedge e_{n_{i+1}} \wedge \ldots \wedge e_{n_j} = -e_{n_1} \wedge \ldots \wedge e_{n_{i+1}} \wedge e_{n_i} \wedge \ldots \wedge e_{n_j}

(consider the transposition in S j which swaps i and i+1) and so we may take only such expressions on the left where n 1<<n j as forming a spanning set for Λ j(V), and indeed these form a basis. The number of such expressions is (nj).

Remark

In the case where char(k)=2, the same development may be carried out by simply decreeing that e n 1e n j=0 whenever n i=n i for some pair of distinct indices i, i.

Now let V be an n-dimensional space, and let f:VV be a linear map. By the proposition, the map

Λ n(f):Λ n(V)Λ n(V),\Lambda^n(f) \colon \Lambda^n(V) \to \Lambda^n(V),

being an endomorphism on a 1-dimensional space, is given by multiplying by a scalar D(f)k. It is manifestly functorial since Λ n is, i.e., D(fg)=D(f)D(g). The quantity D(f) is called the determinant of f.

Determinant of a matrix

We see then that if V is of dimension n,

det:End(V)k\det \colon End(V) \to k

is a homomorphism of multiplicative monoids; by commutativity of multiplication in k, we infer that

det(UAU 1)=det(A)\det(U A U^{-1}) = \det(A)

for each invertible linear map UGL(V).

If we choose a basis of V so that we have an identification GL(V)Mat n(k), then the determinant gives a function

det:Mat n(k)k\det \colon Mat_n(k) \to k

that takes products of n×n matrices to products in k. The determinant however is of course independent of choice of basis, since any two choices are related by a change-of-basis matrix U, where A and its transform UAU 1 have the same determinant.

By following the definitions above, we can give an explicit formula:

det(A)= σS nsgn(σ) i=1 na iσ(i).\det(A) = \sum_{\sigma \in S_n} sgn(\sigma) \prod_{i = 1}^n a_{i \sigma(i)}.

Properties

We work over fields of arbitrary characteristic. The determinant satisfies the following properties, which taken together uniquely characterize the determinant. Write a square matrix A as a row of column vectors (v 1,,v n).

  1. det is separately linear in each column vector:

    det(v 1,,av+bw,,v n)=adet(v 1,,v,,v n)+bdet(v 1,,w,,v n)\det(v_1, \ldots, a v + b w, \ldots, v_n) = a\det(v_1, \ldots, v, \ldots, v_n) + b\det(v_1, \ldots, w, \ldots, v_n)
  2. det(v 1,,v n)=0 whenever v i=v j for distinct i,j.

  3. det(I)=1, where I is the identity matrix.

Other properties may be worked out, starting from the explicit formula or otherwise:

  • If A is a diagonal matrix, then det(A) is the product of its diagonal entries.

  • More generally, if A is an upper (or lower) triangular matrix, then det(A) is the product of the diagonal entries.

  • If E/k is an extension field and f is a k-linear map VV, then det(f)=det(E kf). Using the preceding properties and the Jordan normal form? of a matrix, this means that det(f) is the product of its eigenvalues (counted with multiplicity), as computed in the algebraic closure of k.

  • If A t is the transpose of A, then det(A t)=det(A).

Cramer’s rule

A simple observation which flows from these basic properties is

Proposition

(Cramer’s Rule)

Let v 1,,v n be column vectors of dimension n, and suppose

w= ja jv j.w = \sum_j a_j v_j.

Then for each i we have

a jdet(v 1,,v i,,v n)=det(v 1,,w,,v n)a_j \det(v_1, \ldots, v_i, \ldots, v_n) = \det(v_1, \ldots, w, \ldots, v_n)

where w occurs as the i th column vector on the right.

Proof

This follows straightforwardly from properties 1 and 2 above.

For instance, given a square matrix A such that det(A)0, and writing A=(v 1,,v n), this allows us to solve the equation

Aa=wA \cdot a = w

and we easily conclude that A is invertible if det(A)0.

Remark

This holds true even if we replace the field k by an arbitrary commutative ring R, and we replace the condition det(A)0 by the condition that det(A) is a unit. (The entire development given above goes through, mutatis mutandis.)

Cayley-Hamilton theorem

Lemma

Let R be a commutative ring, and let A be an n×n matrix with entries in R. Then there exists an n×n matrix A˜ with entries in R such that AA˜=A˜A=det(A)I n.

Proof

We may as well take R to be the polynomial ring [a ij] 1i,jn, since we are then free to interpret the indeterminates a ij however we like along a ring map [a ij]R. Let A denote the corresponding generic matrix.

Guided by Cramer’s rule, put

A˜ ji=det(a 1,,e i,a n),\tilde{A}_{j i} = \det(a_1, \ldots, e_i, \ldots a_n),

the a i being columns of A and e i, the column vector with 1 in the i th row and 0’s elsewhere, appearing as the j th column. If we pretend A is invertible, then we know AA˜=det(A)I n=A˜A by Cramer’s rule. We claim this holds for general A.

Indeed, we can interpret this as a polynomial equation in [a ij] and check it there. As an equation between polynomial functions on the space of matrices AMat n()=Spec([a ij]), it holds on the dense subset GL n()Mat n(). Therefore, by continuity, it holds on all of Mat n(). But a polynomial function equation with coefficients in implies the corresponding polynomial identity, and the proof is complete.

Theorem

(Cayley-Hamilton) Let V be a finitely generated free module over a commutative ring R, and let f:VV be an R-module map. Let p(t)R[t] be the characteristic polynomial det(t1 Vf) of f, and let ϕ f:R[t]Mod R(V,V) be the unique R-algebra map sending t to f. Then p(f)ϕ f(p) is the zero map 0:VV.

Proof

Via ϕ f, regard V as an R[t]-module, and with regard to some R-basis {v i} 1in of V, represent f by a matrix A. Now consider tI nA as an n×n matrix B(t) with entries in R[t]. By definition of the module structure, this matrix B(t), seen as acting on V n, annihilates the length n column vector c whose i th row entry is v i.

By the previous lemma, there is B˜(t) such that B˜(t)B(t) is det(tI nf) times the identity matrix. It follows that

det(tI nf)V=B˜(t)B(t)c=B˜(t)0=0\det(t \cdot I_n - f) V = \tilde{B}(t) B(t) c = \tilde{B}(t) 0 = 0

i.e., det(tI nf)v i=0 for each i. Since the v i form an R-basis, the R[t]-scalar det(tI nf) annihilates the R[t]-module V, as was to be shown.

The Cayley-Hamilton theorem easily generalizes to finitely generated R-modules (not necessarily free) as follows. Let f:VV be a module endomorphism, and suppose π:R nV is an epimorphism. Since R n is projective, the map fπ can be lifted through π to a map A:R nR n. Let P(t) be the characteristic polynomial of A.

Proposition

P(f)=0.

Proof

Write P(t)= ia it i. We already know P(A)=0. From fπ=πA, it follows that f iπ=πA i for any i0. Hence P(f)π=πP(A)=0. Since π is epic, P(f)=0 follows.

We give an interesting and perhaps surprising consequence of the Cayley-Hamilton theorem below, after establishing a lemma close in spirit to Nakayama's lemma.

Lemma

Suppose V is a finitely generated R-module, and g:VV is a module map such that g(V)IV for some ideal I of R. Then there is a polynomial p(t)=t n+a 1t n1++a n, with all a iI, such that p(g)=0.

Proof

For some finite n0, we have a surjective map R nM, and by hypothesis we have a surjective map I nim(g) in

I n R n M g im(g)\array{ & & & & I^n \\ & & & & \downarrow \\ R^n & \to & M & \stackrel{g}{\to} & im(g) }

By projectivity of R n, we can lift the bottom composite to a map R nI n making the diagram commute. Let A be the R-module map R nI nR n, regarded as a matrix. Then the characteristic polynomial of A satisfies the conclusion, by the Cayley-Hamilton theorem.

Proposition

Let V be a finitely generated module over a commutative ring R, and let f:VV be a surjective module map. Then f is an isomorphism.

Proof

Regard V as a finitely generated R[t]-module via ϕ f:R[t]Mod R(V,V). Since f is assumed surjective, we have IV=V for the ideal I=(t) of R[t]. Now take g=1 V as in the preceding lemma, a module map over the ring R=R[t]. By the lemma, we see that g n+a 1g n1++a n=0 where a i(t), in other words the R[t]-scalar

(1+a 1++a n)1 V=0(1 + a_1 + \ldots + a_n)1_V = 0

as an operator on V. Write a i=b i(t)t for polynomials b i(t)R[t]. Now we may rewrite the previous displayed equation as

1 V(v)=( ib i(t))tv1_V(v) = -(\sum_i b_i(t)) t \cdot v

for all vV, which translates into saying that 1 V= ib i(f)f, i.e., that ib i(f) is a retraction of f. Since f is epic, we now see f is an isomorphism.

Over the real numbers

A useful intuition to have for determinants of real matrices is that they measure change of volume. That is, an n×n matrix with real entries will map a standard unit cube in n to a parallelpiped in n (quashed to lie in a hyperplane if the matrix is singular), and the determinant is, up to sign, the volume of the parallelpiped. It is easy to convince oneself of this in the planar case by a simple dissection of a parallelogram, rearranging the dissected pieces in the style of Euclid to form a rectangle. In algebraic terms, the dissection and rearrangement amount to applying shearing or elementary column operations to the matrix which, by the properties discussed earlier, leave the determinant unchanged. These operations transform the matrix into a diagonal matrix whose determinant is the area of the corresponding rectangle. This procedure easily generalizes to n dimensions.

The sign itself is a matter of interest. An invertible transformation f:VV is said to be orientation-preserving if det(f) is positive, and orientation-reversing if det(f) is negative. Orientations play an important role throughout geometry and algebraic topology, for example in the study of orientable manifolds (where the tangent bundle as GL(n)-bundle can be lifted to a GL +(n)-bundle structure, GL +(n)GL(n) being the subgroup of matrices of positive determinant). See also KO-theory?.

Finally, we include one more property of determinants which pertains to matrices with real coefficients (which works slightly more generally for matrices with coefficients in a local field):

  • If A is an n×n matrix, then det(exp(A))= exp(trace(A))

In terms of Berezinian integrals

see Pfaffian for the moment

References

The proof of the Cayley-Hamilton theorem follows the treatment in

  • Serge Lange, Algebra (3 rd edition), Addison-Wesley, 1993.

The proof of Proposition 3 on surjective endomorphisms of finitely generated modules was extracted from

  • Stacks Project, Commutative Algebra, section 13 (pdf)