nLab rank-nullity theorem

Redirected from "Orzech's theorem".

Contents

Idea

In linear algebra, what is known as the rank-nullity theorem (e.g. 3.22 of Axler 2015, who calls it the fundamental theorem of linear maps) is the statement that for any linear map f:VWf \colon V \to W out of a finite-dimensional vector space, the sum of

  • the rank rk(f)dim(im(f))rk(f) \coloneqq dim\big(im(f)\big), i.e. the dimension of the image;

with

  • the nullity nl(f)dim(ker(f))nl(f) \coloneqq dim\big( ker(f) \big), i.e. the dimension of the kernel

equals

(1)d V=rk(f)+nl(f). d_V \;=\; rk(f) + nl(f) \,.

This rank-nullity theorem is the decategorification (under the dimension functor dim:FinDimVectdim \colon FinDimVect \to \mathbb{Z}) of the stronger statement that VV itself is the direct sum of its kernel and image vector spaces:

Vim(f)ker(f). V \;\simeq\; im(f) \oplus ker(f) \,.

This may be understood as an instance of the splitting lemma for vector spaces, or more precisely of the statement (here) that every short exact sequence of vector spaces, such as

0ker(f)Vim(f)0 0 \to ker(f) \longrightarrow V \longrightarrow im(f) \to 0

is a split exact sequence, hence of the form

0ker(f)ker(f)im(f)im(f)0. 0 \to ker(f) \longrightarrow ker(f) \oplus im(f) \longrightarrow im(f) \to 0 \,.

Partial generalization to modules

While the rank-nullity theorem (1) does not fully generalize from vector spaces over fields to modules over rings, some aspects do carry over. For instance:

Proposition

Let MM be a finitely generated module over a unital commutative ring, and NMN \subset M a submodule. Then surjective module homomorphisms f:NMf \colon N \twoheadrightarrow M are already isomorphisms.

This was claimed in Orzech 1971, though with a gap in the proof (cf. Grinberg 2014). A full proof was given by Grinberg 2016. For the special case N=MN = M see also SP 05G8.

Proof

The following proof essentially reproduces a proof by Thomas Browning using the Cayley-Hamilton theorem.

Let (m iM) i=1 k(m_i \in M)_{i = 1}^k be the assumed finite tuple of generators of the module MM, so that

M=m 1,,m k. M = \langle m_1,\ldots,m_k\rangle \mathrlap{\,.}

By the assumption that ff is surjective, there must be preimages (n iN) i=1 k(n_i \in N)_{i =1}^k, hence with

m i=f(n i). m_i = f(n_i) \mathrlap{\,.}

Moreover, by the assumption that NN is a submodule of MM, these preimages must be linear combinations of the generators,

n i= ja ijm j n_i = \textstyle{\sum_j} a_{i j} m_j

with coefficient matrix

A=(a ij), A = (a_{i j}) \mathrlap{\,,}

so that

n=Am. \vec{n}=A\vec{m} \mathrlap{\,.}

Now, by Cayley-Hamilton theorem, the matrix AA satisfies its own characteristic polynomial equation:

A k+c k1A k1++c 2A 2+c 1A+c 0I=0. A^k + c_{k-1} A^{k-1} + \cdots + c_2 A^2 + c_1 A + c_0 I \;=\; 0 \mathrlap{\,.}

Applying this equation to m\vec{m} gives

A km+c k1A k1m++c 2A 2m+c 1Am+c 0m=0. A^k\vec{m} + c_{k-1} A^{k-1}\vec{m} + \cdots + c_2 A^2 \vec{m} + c_1 A\vec{m} + c_0\vec{m} \;=\; 0 \mathrlap{\,.}

Now consider the step of first substituting Am=nA\vec{m}=\vec{n}

A k1n+c k1A k2n++c 2An+c 1n+c 0m=0. A^{k-1}\vec{n} + c_{k-1} A^{k-2}\vec{n} + \cdots + c_2A\vec{n} + c_1\vec{n} + c_0\vec{m} \;=\; 0 \mathrlap{\,.}

and then applying ff component-wise, to obtain:

A k1m+c k1A k2m++c 2Am+c 1m+f(c 0m)=0. A^{k-1}\vec{m} + c_{k-1}A^{k-2}\vec{m} + \cdots + c_2A\vec{m} + c_1\vec{m} + f(c_0\vec{m}) \;=\; 0 \mathrlap{\,.}

Observe that this has had the effect of reducing the order of the powers of AA. So, applying the same kind of step again, by first substituting Am=nA\vec{m}=\vec{n}

A k2n+c k1A k3n++c 2n+c 1m+f(c 0m)=0 A^{k-2}\vec{n} + c_{k-1}A^{k-3}\vec{n} + \cdots + c_2\vec{n} + c_1\vec{m} + f(c_0\vec{m}) \;=\; 0

and then applying ff component-wise, reduces the order by another unit, to yield:

A k2m+c k1A k3m++c 2m+f(c 1m+f(c 0m))=0. A^{k-2}\vec{m} + c_{k-1}A^{k-3}\vec{m} + \cdots + c_2\vec{m} + f\big( c_1\vec{m} + f(c_0\vec{m}) \big) \;=\; 0 \mathrlap{\,.}

Hence by iteration of this step we eventually deduce an identity of the form

m+f(c k1m+f(c k2m+f()))=0. \vec{m} + f\Big( c_{k-1}\vec{m} + f\big( c_{k-2}\vec{m} + f(\cdots) \big) \Big) \;=\; 0 \mathrlap{\,.}

But since the m=(m i)\vec m = (m_i) are generators, this same identity thus holds for arbitrary xMx \in M, by forming linear combinations:

x+f(c k1x+f(c k2x+f()))=0. x + f\Big( c_{k-1}x + f\big( c_{k-2}x + f(\cdots) \big) \Big) \;=\; 0 \mathrlap{\,.}

Evaluating this for any element xker(f)x \in \ker(f) in the kernel of ff clearly causes the nested terms to vanish iteratively and hence implies x=0x = 0, whence the kernel is trivial. This means that the surjective map ff is also injective and therefore a module isomorphism, as claimed.

The following special case is important in practice and may still be proven essentially by recourse to the ordinary rank-nullity theorem:

Example

A surjective linear map of the form n n\mathbb{Z}^n \longrightarrow \mathbb{Z}^n is already an isomorphism.

Proof

The linear map is represented by a square matrix with integer coefficients, and it being surjective means that this matrix has full rank. But under the canonical inclusion \mathbb{Z} \subset \mathbb{R} we may regard this also as a real matrix. As such it still has full rank, and hence vanishing kernel by the rank-nullity theorem (1), hence is injective.

References

Ordinary

Textbook accounts:

A formal proof of the rank-nullity theorem in the Isabelle proof assistant:

See also:

Partial generalization to modules

Last revised on April 13, 2026 at 08:57:14. See the history of this page for a list of all contributions to it.