# nLab adjunction

### Context

#### 2-Category theory

2-category theory

# Contents

## Idea

Two morphisms $C\stackrel{L}{\to }D$ and $D\stackrel{R}{\to }C$ in a 2-category $𝒞$ form an adjunction if they are dual to each other.

There are two archetypical examples:

• If $A$ is a monoidal category and $𝒞=BA$ is the one-object 2-category incarnation of $A$ (the delooping of $A$), so that the morphisms in $𝒞$ correspond to the objects of $A$, then the notion of adjoint morphisms in $𝒞$ coincides precisely with the notion of dual objects in a $A$.

• If $𝒞$ is the $2$-category Cat, so that the morphisms in $𝒞$ are functors, then the notion of adjoint morphisms in $𝒞$ coincides precisely with the notion of adjoint functors.

The notion of adjunction may usefully be thought of as a weakened version of the notion of equivalence in a 2-category: a morphism in an adjunction need not be invertible, but it has in some sense a left inverse from below and a right inverse from above. If the morphism in an adjunction does happen to be a genuine equivalence, then we speak of the adjunction being an adjoint equivalence.

Essentially everything that makes category theory nontrivial and interesting beyond groupoid theory can be derived from the concept of adjoint functors. In particular universal constructions such as limits and colimits are examples of certain adjunctions. Adjunctions are already interesting (but simpler) in 2-posets, such as the $2$-poset Pos of posets.

### From hom-functors to units and counits

At the cost of some repetition (compare adjoint functor), we outline how one gets from the hom-functor formulation of adjunction in Cat to the elementary definition in terms of units and counits. This will motivate the definition in the section that follows, which is elementary (definable in the first-order theory of categories) and portable to any 2-category.

We start from a familiar example. Let $U:\mathrm{Grp}\to \mathrm{Set}$ from groups to sets be the usual forgetful functor. When we say ”$F\left(X\right)$ is the free group generated by a set $X$”, we mean there is a function ${\eta }_{X}:X\to U\left(F\left(X\right)\right)$ which is universal among functions from $X$ to the underlying set of a group, which means in turn that given a function $f:X\to U\left(G\right)$, there is a unique group homomorphism $g:F\left(X\right)\to G$ such that

$f=\left(X\stackrel{{\eta }_{X}}{\to }U\left(F\left(X\right)\right)\stackrel{U\left(g\right)}{\to }U\left(G\right)\right)$f = (X \stackrel{\eta_X}{\to} U(F(X)) \stackrel{U(g)}{\to} U(G))

Here ${\eta }_{X}$ is a component of what we call the unit of the adjunction $F⊣U$, and the equation above is a recipe for the relationship between the map $g:F\left(X\right)\to G$ and the map $f:X\to U\left(G\right)$ in terms of the unit.

Now we work more generally. Suppose given functors $L:C\to D$, $R:D\to C$ and the structure of an adjunction in the form of a natural isomorphism

${\Psi }_{c,d}:{\mathrm{hom}}_{D}\left(L\left(c\right),d\right)\cong {\mathrm{hom}}_{C}\left(c,R\left(d\right)\right)$\Psi_{c, d}: \hom_D(L(c), d) \cong \hom_C(c, R(d))

Now the idea is that, a la the Yoneda lemma, $\Psi$ should be completely describable in terms of what it does to identity maps. With that in mind, define the unit $\eta :{1}_{C}\to RL$ by the formula ${\eta }_{c}={\Psi }_{c,L\left(c\right)}\left({1}_{L\left(c\right)}\right)$. Dually, define the counit $\epsilon :LR\to {1}_{D}$ by the formula ${\epsilon }_{d}={\Psi }_{R\left(d\right),d}^{-1}\left({1}_{R\left(d\right)}\right)$. Then given $g:L\left(c\right)\to d$, the claim is that

${\Psi }_{c,d}\left(g\right)=\left(c\stackrel{{\eta }_{c}}{\to }R\left(L\left(c\right)\right)\stackrel{R\left(g\right)}{\to }R\left(d\right)\right).$\Psi_{c, d}(g) = (c \stackrel{\eta_c}{\to} R(L(c)) \stackrel{R(g)}{\to} R(d)).

This may be left as an exercise in the yoga of the Yoneda lemma, applied to ${\mathrm{hom}}_{D}\left(L\left(c\right),-\right)\to {\mathrm{hom}}_{C}\left(c,R\left(-\right)\right)$. By duality, given $f:c\to R\left(d\right)$,

${\Psi }_{c,d}^{-1}\left(f\right)=\left(L\left(c\right)\stackrel{L\left(f\right)}{\to }L\left(R\left(d\right)\right)\stackrel{{\epsilon }_{d}}{\to }d\right).$\Psi^{-1}_{c, d}(f) = (L(c) \stackrel{L(f)}{\to} L(R(d)) \stackrel{\varepsilon_d}{\to} d).

(In fact, we spell out the Yoneda-lemma proof of this dual form below.)

Finally, these operations should obviously be mutually inverse, but that can again be entirely encapsulated Yoneda-wise in terms of the effect on identity maps. Thus, if ${\eta }_{c}≔{\Psi }_{c,L\left(c\right)}\left({1}_{L\left(c\right)}\right)$, the recipe just given for ${\Psi }^{-1}$ yields back

${1}_{L\left(c\right)}=\left(L\left(c\right)\stackrel{L\left({\eta }_{c}\right)}{\to }LRL\left(c\right)\stackrel{{\epsilon }_{L\left(c\right)}}{\to }L\left(c\right)\right)$1_{L(c)} = (L(c) \stackrel{L(\eta_c)}{\to} L R L(c) \stackrel{\varepsilon_{L(c)}}{\to} L(c))

and this is one of the famous triangular equations: ${1}_{L}=\left(L\stackrel{L\eta }{\to }LRL\stackrel{\epsilon L}{\to }L\right)$. Note that juxtaposition in the diagram above is neither functor application, nor vertical composition, nor horizontal composition, but is actually whiskering. By duality, we have the other triangular equation ${1}_{R}=\left(R\stackrel{\eta R}{\to }RLR\stackrel{R\epsilon }{\to }R\right)$. These two triangular equations are enough to guarantee that the recipes for $\Psi$ and ${\Psi }^{-1}$ are indeed mutually inverse.

Thus, it is perfectly sufficient to define an adjoint pair of functors in $\mathrm{Cat}$ as given by unit and counit transformations $\eta :{1}_{C}\to RL$, $\epsilon :LR\to {1}_{D}$, satisfying triangular equations as above.

One thing often heard is that the definition of adjunctions via units and counits is an “elementary” definition (so that by implication, the formulation in terms of hom-functors is not elementary). This means that whereas the hom-functor formulation relies on a background category of sets, the formulation in terms of units and counits is purely in the first-order language of categories and makes no reference to a background model of set theory. It is therefore a perfectly serviceable definition of adjunction without assumptions of local smallness.

###### Yoneda-lemma argument

We claim that ${\Psi }_{c,d}^{-1}:{\mathrm{hom}}_{C}\left(c,R\left(d\right)\right)\to {\mathrm{hom}}_{D}\left(L\left(c\right),d\right)$ can be defined by the formula

${\Psi }_{c,d}^{-1}\left(f:c\to R\left(d\right)\right)=\left(L\left(c\right)\stackrel{L\left(f\right)}{\to }L\left(R\left(d\right)\right)\stackrel{{\epsilon }_{d}}{\to }d\right)$\Psi^{-1}_{c, d}(f: c \to R(d)) = (L(c) \stackrel{L(f)}{\to} L(R(d)) \stackrel{\varepsilon_d}{\to} d)

where ${\epsilon }_{d}≔{\Psi }_{R\left(d\right),d}^{-1}\left({1}_{R\left(d\right)}\right)$. This is by appeal to the proof of the Yoneda lemma applied to the transformation

${\Psi }_{-,d}^{-1}:{\mathrm{hom}}_{C}\left(-,R\left(d\right)\right)\to {\mathrm{hom}}_{D}\left(L\left(-\right),d\right)$\Psi^{-1}_{-, d}: \hom_C(-, R(d)) \to \hom_D(L(-), d)

For the naturality of ${\Psi }^{-1}$ in the argument $\left(-\right)$ would imply that given $f:c\to R\left(d\right)$, we have a commutative square

$\begin{array}{ccc}{\mathrm{hom}}_{C}\left(R\left(d\right),R\left(d\right)\right)& \stackrel{{\Psi }_{R\left(d\right),d}^{-1}}{\to }& {\mathrm{hom}}_{D}\left(L\left(R\left(d\right)\right),d\right)\\ {\mathrm{hom}}_{C}\left(f,R\left(d\right)\right)↓& & ↓{\mathrm{hom}}_{D}\left(L\left(f\right),d\right)\\ {\mathrm{hom}}_{C}\left(c,R\left(d\right)\right)& \underset{{\Psi }_{c,d}^{-1}}{\to }& {\mathrm{hom}}_{D}\left(L\left(c\right),d\right)\end{array}$\array{ \hom_C(R(d), R(d)) & \stackrel{\Psi^{-1}_{R(d), d}}{\to} & \hom_D(L(R(d)), d) \\ \hom_C(f, R(d)) \downarrow & & \downarrow \hom_D(L(f), d) \\ \hom_C(c, R(d)) & \underset{\Psi^{-1}_{c, d}}{\to} & \hom_D(L(c), d) }

Chasing the element ${1}_{R\left(d\right)}$ down and then across, we get $f:c\to R\left(d\right)$ and then ${\Psi }_{c,d}^{-1}\left(f\right)$. Chasing across and then down, we get ${\epsilon }_{d}$ and then ${\epsilon }_{d}\circ L\left(f\right)$. This completes the verification of the claim.

## Definition

An adjunction in a 2-category is a pair of objects $C,D$ together with morphisms $L:C\to D$, $R:D\to C$ and 2-morphisms $\eta :{1}_{C}\to R\circ L$, $ϵ:L\circ R\to {1}_{D}$ satisfying the equations

$\left(Rϵ\right)\cdot \left(\eta R\right)={1}_{R}\phantom{\rule{2em}{0ex}}\text{i.e.}\phantom{\rule{2em}{0ex}}R\stackrel{\eta \circ {1}_{R}}{\to }R\circ L\circ R\stackrel{{1}_{R}\circ ϵ}{\to }R=R\stackrel{{1}_{R}}{\to }R\phantom{\rule{2em}{0ex}}\text{i.e.}\phantom{\rule{2em}{0ex}}\begin{array}{cccccc}& & & & {1}_{C}& \\ & & \multicolumn{5}{c}{ {⇓}^{\eta } }\\ D& \stackrel{R}{\to }& C& \stackrel{L}{\to }& D& \stackrel{R}{\to }& C\\ \multicolumn{4}{c}{ {⇓}^{ϵ} }\\ & & {1}_{D}& \end{array}\phantom{\rule{1em}{0ex}}=\phantom{\rule{1em}{0ex}}D\stackrel{R}{\to }C$(R \epsilon) \cdot (\eta R) = 1_R \qquad \text{i.e.} \qquad R \stackrel{\eta \circ 1_R}{\to} R \circ L \circ R \stackrel{1_R \circ \epsilon}{\to} R = R \stackrel{1_R}{\to} R \qquad \text{i.e.} \qquad \array{\arrayopts{ \padding{0} } &&&&1_C& \\ &&\cellopts{\colspan{5}}\begin{svg} <svg xmlns="http://www.w3.org/2000/svg" width="8.5em" height="2em" viewBox="0 0 85 20"> <defs> <marker id='svg195arrowhead' markerHeight='5' markerUnits='strokeWidth' markerWidth='8' orient='auto' refX='0' refY='5' viewBox='0 0 10 10'> <path d='M 0 0 L 10 5 L 0 10 z'/> </marker> </defs> <path marker-end='url(#svg195arrowhead)' stroke-width="1" stroke="#000" fill="none" d="M5 15q40-28 75 0"/> <foreignObject height='20' width='20' x='40' y='3' font-size='10'><math xmlns="http://www.w3.org/1998/Math/MathML" display='inline'><msup><mo>&#8659;</mo><mi>&#951;</mi></msup>[/itex]</foreignObject> </svg> \end{svg}\\ D & \stackrel{R}{\to}& C & \stackrel{L}{\to}& D & \stackrel{R}{\to}& C \\ \cellopts{\colspan{4}}\begin{svg} <svg xmlns="http://www.w3.org/2000/svg" width="8.5em" height="2em" viewBox="0 0 85 20"> <path marker-end='url(#svg195arrowhead)' stroke-width="1" stroke="#000" fill="none" d="M5 5q40 28 75 0"/> <foreignObject height='20' width='20' x='40' y='0' font-size='10'><math xmlns="http://www.w3.org/1998/Math/MathML" display='inline'><msup><mo>&#8659;</mo><mi>&#1013;</mi></msup>[/itex]</foreignObject> </svg> \end{svg} \\ &&1_D& } \quad = \quad D \stackrel{R}{\to} C

and

$\left(ϵL\right)\cdot \left(L\eta \right)={1}_{L}\phantom{\rule{2em}{0ex}}\text{i.e.}\phantom{\rule{2em}{0ex}}L\stackrel{{1}_{L}\circ \eta }{\to }L\circ R\circ L\stackrel{ϵ\circ {1}_{L}}{\to }L=L\stackrel{{1}_{L}}{\to }L\phantom{\rule{2em}{0ex}}\text{i.e.}\phantom{\rule{2em}{0ex}}\begin{array}{cccc}& & {1}_{C}& \\ \multicolumn{5}{c}{ {⇓}^{\eta } }\\ C& \stackrel{L}{\to }& D& \stackrel{R}{\to }& C& \stackrel{L}{\to }& D\\ & & \multicolumn{4}{c}{ {⇓}^{ϵ} }\\ & & & & {1}_{D}& \end{array}\phantom{\rule{1em}{0ex}}=\phantom{\rule{1em}{0ex}}C\stackrel{L}{\to }D$(\epsilon L) \cdot (L \eta) = 1_L \qquad \text{i.e.} \qquad L \stackrel{1_L \circ \eta}{\to} L \circ R \circ L \stackrel{\epsilon \circ 1_L}{\to} L = L \stackrel{1_L}{\to} L \qquad \text{i.e.} \qquad \array{\arrayopts{ \padding{0} } &&1_C& \\ \cellopts{\colspan{5}}\begin{svg} <svg xmlns="http://www.w3.org/2000/svg" width="8.5em" height="2em" viewBox="0 0 85 20"> <defs> <marker id='svg195arrowhead' markerHeight='5' markerUnits='strokeWidth' markerWidth='8' orient='auto' refX='0' refY='5' viewBox='0 0 10 10'> <path d='M 0 0 L 10 5 L 0 10 z'/> </marker> </defs> <path marker-end='url(#svg195arrowhead)' stroke-width="1" stroke="#000" fill="none" d="M5 15q40-28 75 0"/> <foreignObject height='20' width='20' x='40' y='3' font-size='10'><math xmlns="http://www.w3.org/1998/Math/MathML" display='inline'><msup><mo>&#8659;</mo><mi>&#951;</mi></msup>[/itex]</foreignObject> </svg> \end{svg}\\ C & \stackrel{L}{\to}& D & \stackrel{R}{\to}& C & \stackrel{L}{\to}& D \\ &&\cellopts{\colspan{4}}\begin{svg} <svg xmlns="http://www.w3.org/2000/svg" width="8.5em" height="2em" viewBox="0 0 85 20"> <path marker-end='url(#svg195arrowhead)' stroke-width="1" stroke="#000" fill="none" d="M5 5q40 28 75 0"/> <foreignObject height='20' width='20' x='40' y='0' font-size='10'><math xmlns="http://www.w3.org/1998/Math/MathML" display='inline'><msup><mo>&#8659;</mo><mi>&#1013;</mi></msup>[/itex]</foreignObject> </svg> \end{svg} \\ &&&&1_D& } \quad = \quad C \stackrel{L}{\to} D

variously called the triangle identities or the zig-zag identities. We call $L$ the left adjoint (of $R$) and $R$ the right adjoint (of $L$). We call $\eta$ the unit of the adjunction and $ϵ$ the counit of the adjunction.

When interpreted in the prototypical 2-category Cat, $C$ and $D$ are categories, $L$ and $R$ are functors, and $\eta$ and $ϵ$ are natural transformations. In this case (which was of course the first to be defined) there are a number of equivalent definitions of an adjunction, which can be found on the page adjoint functor. Conversely, the definition in any 2-category can be obtained by internalization from the definition in $Cat$.

### In terms of string diagrams

The definition of an adjunction may be nicely expressed using string diagrams. The data $L:C\to D$, $R:D\to C$ and 2-cells $\eta :{1}_{C}\to R\circ L$, $ϵ:L\circ R\to {1}_{D}$ are depicted as

(where 1-cells read from right to left and 2-cells from bottom to top), and the zigzag identities are expressed as “pulling zigzags straight” (hence the name):

Often, arrows on strings are used to distinguish $L$ and $R$, and most or all other labels are left implicit; so the zigzag identities, for instance, become:

## References

Revised on January 10, 2013 14:49:30 by Noam Zeilberger (89.80.149.17)