nLab
Mac Lane's proof of the coherence theorem for monoidal categories

Context

Monoidal categories

Category theory

Contents

Throughout, we will use the notation and terminology of Ch. VII of CWM, with the exception that the product bifunctor of a monoidal category will be denoted by , instead of Mac Lane’s notation.

Introduction and statement

Let B, B,α (B),e B,λ (B),ϱ (B), be some monoidal category. At first, we would like the coherence theorem for monoidal categories to state that every diagram in B built up from instances of α, λ (B), ϱ (B) and identity arrows by multiplications B (such as the pentagon diagram) commutes. This, however, is not possible, because some formally different vertices of such a diagram might turn out to be the same in our particular monoidal category B, in a way that makes the diagram non-commutative.

As a concrete example of the problem in a particular monoidal category, consider Isbell’s famous example for showing that moving to skeleta does not assure that all monoidal categories are strict (see p. 164 of CWM). In this example, one shows that if D is the denumerable set in a skeleton Set 0 of Set, then, although D×(D×D) equals (D×D)×D, the associator α D,D,D (defined as usual to commute with the projections) cannot be the identity.

So, in order for coherence to work, we must change it to the statement that every ”formal” diagram commutes.
The formal diagrams that commute will be described through certain graphs 𝒢 n (n). The vertices of 𝒢 n will be binary words of length n, and its arrows will represent the sort of arrows that participate in the coherence theorem.

We begin by defining the set W of binary words inductively, as the set of all terms in one variable (denoted by ””) in the signature {e 0,}, where e 0 is an 0-ary function symbol (a constant), and is a binary function symbol. So, W is the smallest set of strings in the alphabet {e 0,,} satisfying W, e 0W, and if u,vW then uv (usually written as (u)(v) or even uv for convenience) is also in W.

Next, we define a set S of certain arrow symbols as follows (the reason for the tag in S will be clear later). Let S be the smallest set of strings in the alphabet W{1,α,α 1,λ,λ 1,ϱ,ϱ 1,^} satisfying αuvw,α 1uvwS for all u,v,wW, λw,λ 1w,ϱw,ϱ 1wS for all wW, and for all wW, if βS then also (β)^(1w),(1w)^(β)S (more precisely, the last terms should be called, ^β1w and ^1wβ, but as before, we will usually use the more convenient infix notation). Note that elements of S are comprised of a single associator/unitor symbol (or their inverses) tensored with one or more strings of the form 1w. Later, when we will interpret the elements of S as arrows of a particular monoidal category B, we will call them basic arrows. The idea is that every arrow of the ”right” kind in B (the kind considered in the coherence theorem) factors as the composition of such basic arrows (because is a bifunctor).

It can be verified that every string of S is of one and only one of the following forms: αuvw for some u,v,wW, α 1uvw for some u,v,wW, λw for some wW, …, β^1w for some wW and βS. Also, in each case the ”arguments” are determined uniquely (for example, both β and w are determined from β^1w), and so we may define functions on S inductively.

Let 𝒢 be the directed graph (quiver in nLab jargon) with W as the set of objects, S as the set of arrows, and with the obviously defined dom and cod (for example, For u,v,wW, dom(αuvw):=u(vw)=:cod(α 1uvw), dom(β^1w):=dom(β)w, etc.

Define inductively the length len(w) of a binary word w as follows:

len(e 0):=0,len():=1,len(uv):=len(u)+len(v).len(e_0):=0,\quad len(-):=1,\quad len(u\otimes v):= len(u) + len(v).

Since for all sS, len(dom(s))=len(cod(s)) (this can be verified by induction), 𝒢 is the union of disjoint graphs 𝒢 n, where for n, the vertices of 𝒢 n are the binary words of length n, and the edges are all sS with len(dom(s))=n.

Now, let us fix some monoidal category B, B,α (B),e B,λ (B),ϱ (B),, and some object bobj(B). We can interpret vertices and edges of 𝒢 in B by defining inductively functions () b:Wobj(B), :Sarr(B) as follows:

(e 0) b:=e B, () b:=b,(uv) b:=u b Bv b,(e_0)_b:=e_B,\text{ } (-)_b:=b, (u\otimes v)_b:= u_b\otimes_B v_b,
(αuvw):=α u b,v b,w b (B),(α 1uvw):=(α u b,v b,w b (B)) 1, (λu):=λ u b (B),(λ 1u):=(λ u b (B)) 1, (ϱu):=ϱ u b (B),(ϱ 1u):=(ϱ u b (B)) 1, (β^1w):=(β) B1 w b,(1w^β):=1 w b B(β)\begin{aligned} &&\mathcal{I}(\alpha u v w) := \alpha^{(B)}_{u_b,v_b,w_b},\quad \mathcal{I}(\alpha^{-1} u v w):= (\alpha^{(B)}_{u_b,v_b,w_b})^{-1},\\ && \mathcal{I}(\lambda u) := \lambda^{(B)}_{u_b}, \quad \mathcal{I}(\lambda^{-1} u) := (\lambda^{(B)}_{u_b})^{-1},\\ && \mathcal{I}(\varrho u) := \varrho^{(B)}_{u_b}, \quad \mathcal{I}(\varrho^{-1} u) := (\varrho^{(B)}_{u_b})^{-1},\\ && \mathcal{I}(\beta\hat{\otimes} 1w) := \mathcal{I}(\beta)\otimes_B 1_{w_b},\quad \mathcal{I}(1w\hat{\otimes} \beta) := 1_{w_b}\otimes_B\mathcal{I}(\beta) \end{aligned}

(for u,v,wW and βS).

It is straightforward to verify that the pair () b, is a morphism of graphs 𝒢UB (where UB is the underlying graph of B), that is, that for all sS, we have

(dom(s)) b=dom B((s)),(cod(s)) b=cod B((s)).(dom(s))_b = dom_B(\mathcal{I}(s)),\quad (cod(s))_b = cod_B(\mathcal{I}(s)).

Therefore, if s 1,,s m form a path in 𝒢 (i.e., dom(s i+1)=cod(s i) for all i), then (s 1),,(s m) (in that order) are composable arrows in B. We can finally state the coherence theorem:

Theorem

(Mac Lane’s Coherence Theorem, version 1). Let v,wW be two binary words of the same length n. Suppose that s 1,,s m and s 1,s m form two paths from v to w in 𝒢 n. Then the two corresponding sequences of arrows in B have the same composition in B:

(s m)(s 1)=(s m)(s 1)\mathcal{I}(s_m)\circ \cdots\circ \mathcal{I}(s_1) = \mathcal{I}(s'_{m'})\circ \cdots\circ \mathcal{I}(s'_1)

Note that at a first glance, this ”single variable coherence” looks quite limited: We only prove commutativity of diagrams in B whose vertices are obtained by tensoring a single element b (e.g., vertices such as b B((b Bb) Bb)). However, it turns out that the general case follows almost immediately from this case. The trick is to construct an appropriate monoidal category It(B) in a way that the coherence in one variable for It(B) and one special object of It(B) is the general coherence for B.

The coherence theorem in CWM (Theorem VII.2.1) is stated in a different way, but the main bulk of its proof is actually devoted to prove the above coherence theorem. To state also the version of Theorem VII.2.1 of CWM, let 𝒲 be the category whose objects are binary words, with a single arrow uv between any two binary words of the same length and with the obvious composition. The monoidal structure on 𝒲 is the obvious one: The unit is e 0, and the components of the associator and the unitors are each the appropriate single arrow of the relevant hom-set. The coherence theorem in CWM is as follows:

Theorem

(Mac Lane’s Coherence Theorem, version 2). For any monoidal category B and any object b of B, there is a unique strict morphism of monoidal categories 𝒲B with ()b.

Version 2 says that 𝒲 is the free monoidal category on one generator () (that is, that it is the object part of a universal arrow from () to the forgetful (object) functor Moncat sSet, where Moncat s stands for monoidal categories with strict morphisms of monoidal categories) . As we shall see later, this version follows almost immediately from version 1. While this is not clear at a first glance, it is also true that version 1 follows from version 2. For this, we will define a monoidal category F[1] that will be almost evidently free (and hence, given version 2, isomorphic to 𝒲).

Coherence only for the associator

It will be simpler to first prove coherence only for the associator, forgetting about the unit and the unitors. Let 𝒢 be the full subgraph of 𝒢 whose vertex set consists only of those binary words that do not involve e 0. We will write W 0 for the subset of W consisting of such binary words (that is, for the vertex set of 𝒢).

Let S be the subset of S consisting of the strings that
involve only instances of α and α 1 (but not λ, λ 1, ϱ, and ϱ 1) and also do not involve e 0 (e.g., as in αe 0vw). Actually, S could be defined inductively, similarly to S, omitting the reference to λ, λ 1, ϱ, and ϱ 1, and using W 0 instead of W. Note that S is the set of arrows of 𝒢: S={sSdom(s),cod(s)W 0}. For simplicity, we will use the same names dom,cod for the restriction of dom,cod to S.

Let 𝒢 n be the full subgraph of 𝒢 on the vertices of length n. The proof will use a certain vertex w (n) of 𝒢 n as a”pivot.” Specifically, we let w (n)W 0 be the unique word of length n in which all pairs of parentheses start in front (e.g., () for n=3, (()) for n=4, and ((())) for n=5; a precise inductive definition is w (1)=, w (n)=w (n1) for n1).

The proof will also use a measure ρ of how ”far” is a word wW 0 from being w (n). So, we define ρ inductively by

ρ()=0,ρ(vw)=ρ(v)+ρ(w)+len(w)1.\rho(-) = 0, \quad \rho(v\otimes w) = \rho(v)+\rho(w)+len(w)-1.

It is easily verified that ρ(u)0 for all uW 0, with equality iff u=w (n). In fact, ρ(u) is the length of the longest directed path (see ahead for what ”directed” means here) from u to w (n). Intuitively, the idea is that if u=vw, then we may first handle v, then handle w, in order to reach w (n 1)w (n 2) (with n 1:=len(v) and n 2:=len(w)) but not yet to w (n). Then, by n 21 additional applications of associators, we finally reach w (n).

We will call any sS that involves α a directed arrow, and any sS that involves α 1 an anti-directed arrow. A more precise inductive definition is this: For all u,v,wW 0, αuvw is directed, and if β is directed, then β^1w and 1w^β are directed for all wW 0. The definition of anti-directed is obtained by replacing α by α 1 and ”directed” by ”anti-directed” in the previous definition. It can be shown by induction that if sS is directed, then ρ(cod(s)) is strictly smaller that ρ(dom(s)).

For future reference, define the ”formal inverse” of sS in the obvious inductive way, by setting, for example, inv(αuvw):=α u,v,w 1, inv(α 1uvw):=αuvw inv(β^1w):=inv(β)^1w, etc. It can be verified that the formal inverse has all expected properties (such as being idempotent, switching domain and codomain, taking an anti-directed arrow to a directed one, and being interpreted as the inverse of the interpretation of the original arrow).

From each wW 0 of length n, there is a ”canonical” path, involving only directed arrows, from w to w (n) in 𝒢 n, obtained by successively moving the outermost parentheses to the left with the appropriate α. For an example, consider the paths and in the pentagon (5) on p. 162 of CWM. Note that if v,wW 0 are both of length n, then there is a path in 𝒢 n from v to w: Take the canonical path from v to w (n), and then extend it by reversing (using inv) every edge of the canonical path from w to w (n).

Given v,wW 0 of the same length n, our task is to show that for any path from v to w in 𝒢 n, applying to the arrows of the path and composing in B results in the same arrow v bw b in B, independently of the choice of path. In other words, we would like to prove the theorem obtained from replacing W by W 0 and 𝒢 n by 𝒢 n in the first version of the coherence theorem.

As a preparation, we start with the following degenerate case, that can be proved (as usual) by induction on s and s.

Proposition

There are no parallel directed arrows in 𝒢 n: If s,sS are directed arrows with dom(s)=dom(s) and cod(s)=cod(s), then s=s.

Now, take some path s 1,,s m from v to w, as in

Coh_fig_1

In B, we have the corresponding path (note that both in 𝒢 n and in B, it is possible that some of the vertices are in fact equal)

Coh_fig_2

In our original path, it is possible that some arrows (some s i’s) are anti-directed. We can replace each anti-directed arrow by its formal inverse so that we are left only with directed arrows. For example, if s 2 and s 3 are the only anti-directed arrows in our path, then after formally inverting them we get the following series of arrows in 𝒢 n:

Coh_fig_3

In B, we get

Coh_fig_4

Let us write can(u) for the canonical path from uW 0 with len(u)=n and with uw (n) to w (n). Let us also write (can(u)) for the arrow of B obtained by composing of all arrows of can(u). Suppose that all squares in the following diagram of B commute:

Coh_fig_5

(where we write w b (n) for [w (n)] b). This would imply that all squares in

Coh_fig_6

commute, which also implies that the outer rectangle commutes, that is:

(s m)(s 1)=(can(w)) 1(can(v)).\mathcal{I}(s_m)\circ \cdots \circ\mathcal{I}(s_1) = \mathcal{I}(can(w))^{-1}\circ \mathcal{I}(can(v)).

The right-hand side depends only on w and v (and not on the particular choice of the path s 1,s m), which is exactly what we would like to prove. Hence, it is sufficient to show that all squares in the last but one diagram commute. For this, it is clearly sufficient to prove the following proposition:

Proposition

Let vW 0 be a word of length n, different from w (n). Then applying to any two directed paths (paths consisting only of directed arrows) from v to w (n) results in two paths in B that have the same composition in B.

The proof is by induction on ρ(v). For the basis, if ρ(v)=0 then v=w (n) and the assertion is trivial. Suppose, therefore, that ρ(v)>0 and the assertion is true for all vW 0 with ρ(v)<ρ(v). For reasons that will be understood later, we will prove this part by one additional induction: on the length len(v).

Hence, we have to prove that if the assertion holds for all v of rank smaller than r:=ρ(v) and for all v of rank r and length smaller than n (for n>1 – the basis for n=1 is trivial as then v=()=w (1)), then it holds also for all v of rank r
and length n.

From ρ(v)>0 we know that v, and hence there exist u,wW 0 such that v=uw. Consider two paths from v to w (n). Let βS be the first edge of the first path, and let γS be the first edge of the second path (so that in particular dom(β)=v=dom(γ)). Write v:=cod(β) and v:=cod(γ), so that in 𝒢 n we have

Coh_fig_7

Since ρ(v)<ρ(v) and ρ(v)<ρ(v), by hypothesis any two paths from v to w (n) have the same composition in B, and similarly for v. In particular, if β=γ, then we are done:

Coh_fig_8

So, we will continue by assuming that βγ. Note that in this case vv, because it can verified by induction that there are no parallel arrows in 𝒢 n. In this case, it will be sufficient to find some zW 0 of length n and directed paths (recall that here a directed path is a path consisting only of directed arrows) in 𝒢 n from v to z and from v to z such that applying to the following diamond (upper part of the diagram) results in a commutative diamond in B:

Coh_fig_9

(the outer paths are our two paths from v to w (n)). This is indeed enough, because by hypothesis, the two lower trapezoids are commutative in B.

The existence of zW 0 and of directed paths from v and v to z such that the above diamond is commutative in B will be proved by considering all possible cases for β and γ. For β, we have three possibilities:

  • β=β^1w˜ for some w˜W 0 and some directed βS. In this case
v=uw=dom(β)=dom(β)w˜v = u\otimes w = dom(\beta) = dom(\beta')\otimes \tilde{w}

and therefore dom(β)=u and w˜=w (by standard ”unique decomposition” of terms in a first-order language). Also, v=cod(β)=cod(β)w.

  • β=1u˜^β for some u˜W 0 and some directed βS. As above, we get in this case that dom(β)=w and u˜=u. Also, v=cod(β)=ucod(β).

  • β=αu˜st for some u˜,s,tW 0, in which case

v=uw=dom(β)=u˜(st),v = u\otimes w = dom(\beta) = \tilde{u}\otimes (s\otimes t),

so that u˜=u and w=st. Also, v=cod(β)=(us)t.

In summary, we have the following 3 cases for β:

  • Case 1 β: β=β^1w for some directed βS with dom(β)=u. Also, v=cod(β)w.

  • Case 2 β: β=1u^β for some directed βS with dom(β)=w. Also, v=ucod(β).

  • Case 3 β: β=αust for some s,tW 0, and also w=st and v=(us)t.

Similarly, we have Case 1 γ, Case 2 γ and Case 3 γ for γ. We will now consider each one of the possible cases for β and γ:

1 β and 1 γ: The situation is as follows:

Coh_fig_10

Since len(u)<len(v) (as wW 0 cannot be e 0) and since

ρ(u)=ρ(v)(ρ(w) 0+len(w)1 0)ρ(v),\rho(u) = \rho(v) - (\underbrace{\rho(w)}_{\geq 0} + \underbrace{len(w) - 1}_{\geq 0}) \leq \rho(v),

it follows from the induction hypothesis that any two directed paths from u to w (len(u)) have the same composition in B. In particular, applying to the following diamond in 𝒢 len(u) results in a commutative diamond in B:

Coh_fig_11

Since B is a bifunctor, it follows that the following diagram is again commutative when interpreted in B:

Coh_fig_12

(where for xW 0 with len(x)=len(u), we write can(x)^1w for the directed path in 𝒢 n obtained by ^-ing each arrow of can(x) from the right with 1w). This completes the proof for the current case.

2 β and 2 γ: Proved similarly to the previous case.

1 β and 2 γ (or 2 β and 1 γ): We have the following diamond:

Coh_fig_13

which commutes in B, because B is a bifunctor.

3 β or 3 γ: W.l.o.g., we will assume 3 β. Note that in this case, it is not possible that also 3 γ holds, that is, that γ=αust for some s,t with w=st, for then st=st, which implies that s=s and t=t, so that γ=β, contradicting our assumption. So, we have to consider only 1 γ and 2 γ. For 1 γ, we have γ=γ^1st w and dom(γ)=u, and we get the following diamond:

Coh_fig_15

which commutes in B by the naturality of α (B).

For 2 γ (γ=1u^γ and dom(γ)=w=st), we have to consider the three possible cases for γ.

If γ=1s˜^δ for some s˜W 0 and some directed δS, then by comparing domains we get st=dom(γ)=s˜dom(δ), which implies that s˜=s and dom(δ)=t, that is, γ=1s^δ with dom(δ)=t. Hence we get the diamond

Coh_fig_15

which commutes in B, because α (B) is natural.

The case where γ=δ^1t˜ for some t˜W 0 and some directed δS can be treated similarly.

Finally, if γ=αrpq, then by comparing domains we get st=dom(γ)=r(pq), and therefore r=s and t=pq. Hence, γ=αspq and t=pq. In this case, the two first edges β, γ in our two paths from v to w (n) look like part of the coherent pentagon (Eq. (5) on p. 162 of CWM):

Coh_fig_16

All edges in the above pentagon are directed, and the diagram is obviously commutative in B (as B is monoidal). This completes the proof of the proposition.

Including the unit and the unitors

Note: By Kelly’s 1964 paper, we know that the pentagon identity and one triangle identity are all that is required (that the two other triangle identities follow from the above is shown both in CWM an in Kelly, while the fact that λ e B (B)=ϱ e B (B) follows from the pentagon and the triangle identities is proved in Kelly). From this point on, we will be using freely all identities from pp. 162-163 of CWM.

Still need to fill this. The following are the highlights.

First, show that any directed path in path in 𝒢 n may be replaced by a path with the same starting and ending vertices in which all associators (if any) are at the end, and with the same composition in B as the original path. The main ingredient in the following proposition:

Proposition

Let

ususuu \stackrel{s'}{\to} u' \stackrel{s''}{\to} u''

be two directed arrows of 𝒢 n with an associator appearing in s and a unitor appearing in s (note: the definition of ”directed” is extended in an obvious way to include unitors). Then either there exists a directed arrow us 1u containing a unitor with (s 1)=(s)(s), or there exist directed s 1, s 2 as in

us 1?s 2uu \stackrel{s_1}{\to} ? \stackrel{s_2}{\to} u''

with s 1 containing a unitor, s 2 containing an associator, and (s 2)(s 1)=(s)(s).

This proposition may be proved by a long and tedious induction on s and s.

For this I needed the triangle identities and the naturality of α (B) (as mentioned in CWM), but also the naturality of ϱ (B) and λ (B), as well as the bifunctoriality of B. For example, consider:

Coh_fig_17

(where some associator appears in β). So – what am I missing here?

Recall that our original goal is to show that any two paths in 𝒢 n with the same starting vertex and the same ending vertex have the same composition in B. There is still a path from any vertex of 𝒢 n to w (n) (first remove all units, then proceed as in the case of 𝒢 n), and we can use exactly the same method used above (when only the associator was considered) to reduce the problem to: Any two directed paths of 𝒢 n starting at some vertex v and ending at w (n) have the same composition in B. From what we have seen above, any two arbitrary such paths may be replaced be two paths starting with unitors and ending with associators, as in

Coh_fig_18

Here, clean(v) is the word obtained by removing all instances of the unit from v (this can be defined rigorously by induction), and r is the total number of units appearing in v. The right part of the last diagram, where there are no units and unitors, commutes in B by the ”coherence for the associator” that was already proved. Hence, it remains to prove that the left part, where all arrows involve unitors, commutes in B (”coherence for the unitors”).

First one must prove that two parallel directed arrows with unitors in 𝒢 n have the same interpretation in B .
Recall that in 𝒢 n (no units and no unitors), there are no parallel directed arrows. However, this is not the case in 𝒢 n (for example, consider λe 0e 0,1e 0^λe 0,1e 0^ϱe 0:e 0(e 0e 0)e 0e 0). Hence, we must settle for equality of the interpretations in B.

For this, I used induction on the total number of ^ in both s and s, using ϱ e B (B)=λ e B (B) and the naturality of ϱ (B) and λ (B). Am I missing something here? Is there an obvious one-line argument that I’m failing to see?

Now, we can finally prove coherence for the unitors, by induction on the number r of units in v. Considering the left part of the last diagram, if u 1=u 1, as in

Coh_fig_19

then the parallel arrows are equal in B, while the ”large diamond” on the right commutes in B by the induction hypothesis.

We can therefore assume that u 1u 1. As in the case of coherence for the associator, it is sufficient to
find a commutative diamond, that is, to find some vertex w of 𝒢 n and directed arrows γ and γ in S containing unitors, as in

Coh_fig_20

such that the diamond on the left commutes in :
If we find such w, γ and γ, then since by the induction hypothesis the upper and lower parallelograms on the right commute in B, we are done.

Finally, the required commutative diamond may be found by induction on v.

Proof of version 2 of the coherence (Theorem VII.2.1 of CWM)

I intend to come back and fill this. The most difficult part was already proved above.

The construction of F[1] and proof of universality

The category 𝒲 has one-element hom-sets by construction, but it was not at all obvious that it is indeed the free monoidal category on one generator. Alternatively, it is possible to construct a category F[1] that will be almost obviously a free monoidal category on one generator. This would mean that F[1] is isomorphic to 𝒲, and in particular has one-element hom-sets. The fact that F[1] is free and has one-element hom-sets may then serve as yet a third version of coherence.

This should still be filled.

Moving from one variable to any number of variables: The category It(B)

This should still be filled.

References

Revised on January 17, 2013 01:10:45 by Urs Schreiber (203.116.137.162)