I want to discuss a neat theorem (theorem 2 below) which manages to roll several fundamental theorems of topos theory into one.
I’ve known this theorem for some time now (since about 1997), but for a long time it struck me as just a kind of curiosity – I didn’t know any exciting new applications, and I didn’t even know what I should name the new concept that the theorem depends on. But, after thinking about some recent discussion on the blog on modal logic, it occurred to me there might be some justification for christening this new concept “modal operator”.
Just giving the baby a jazzy new name (even if it doesn’t really suit her in the end) somehow rekindled the motivation to want to show her to people: write down the proofs, see if I could make it grow, and so on. And in the process, I found a way to construct modal operators that for some reason hadn’t occurred to me before (theorem 3). If anyone knows about this concept from somewhere else, I’d really like to know about it!
Although this article is more or less self-contained, it’s really addressed to those who know some topos theory at an introductory level, say through the first five chapters of the book by Mac Lane and Moerdijk. I’ve included a little appendix on a way to think about sheafification, which I’m sure is known to experts but which I’ve never seen written down.
There are three basic theorems for building new toposes from old ones:
Slice theorem: If is a topos and is an object, then is a topos.
Lex comonad theorem: If is a left exact comonad acting on a topos , then the category of -coalgebras is also a topos.
If the slice theorem is the one which passes to toposes of presheaves on internal discrete categories, then the lex comonad theorem is the one which, among other things, extends that further to toposes of presheaves on general internal categories.
This is the theorem which takes us from presheaves to sheaves. Here the monad is the composite of taking the associated sheaf functor (which is left exact) followed by the forgetful functor from sheaves to presheaves; the -algebras are the sheaves. The monad is idempotent because the associated sheaf of a sheaf is canonically isomorphic to (alternatively, because the embedding of sheaves in presheaves is full and faithful). In fact, every such lex idempotent monad on a topos arises in this way, by sheafifying with respect to a Lawvere-Tierney topology in .
Thus the three theorems taken together effect the passage from a base category as a topos, to Grothendieck toposes. We will show how to effect this passage in just one step, instead of three separate ones. In fact, we will see that all three theorems can be combined into one master theorem (theorem 2 below).
The first reduction combines the slice theorem and the lex comonad theorem into one. The idea is rather simple. Indeed, notice that the slice is the category of coalgebras for the comonad , where the comonad structure comes from the unique comonoid structure which exists on any object :
The comonad isn’t left exact (it doesn’t preserve the terminal object), but it does preserve pullbacks.
Thus, we will reproduce the following theorem. It has long been known as folklore to experts, and appears in Johnstone’s Sketches of an Elephant, section A, Remark 4.2.3.
If is a pullback-preserving comonad acting on a topos , then the category of coalgebras is also a topos.
It suffices to produce finite limits and power objects in . Pullbacks in are created by the forgetful functor because preserves pullbacks, and is terminal in if is terminal in , since the right adjoint preserves limits. So is finitely complete.
Since preserves pullbacks, it also preserves monos. In particular, if denotes the elementhood relation between and its power object in , we have a chain of monos
where the middle two objects are pullbacks of the diagram
in . The composite of the monos in (1) names a subobject of , which is classified by a map
in .
Let denote the comultiplication and the counit of , and suppose carries a -coalgebra . Define the object in to be the equalizer in of the following pair of coalgebra maps from the cofree coalgebra to itself:
where is the contravariant power object functor (whence the direction ). We will show that is the power object of in .
So, let be a coalgebra. We must show that coalgebra maps correspond precisely to subcoalgebras of the product in (whose underlying object in is the pullback ). The remainder of the proof will be proven with the help of three lemmas.
Lemma 0: If is a -coalgebra, then the coassociativity square
is a pullback. In any pullback, if the two pullback projections coincide (as here, where they are both ), then the pullback projection is monic.
In any commutative square
the arrows , coincide because and have a common left inverse . Therefore the pullback of the arrows , is the same as their equalizer, which is . For the second sentence, the condition that the two pullback projections coincide is equivalent to the condition that the projection is the equalizer, and equalizers are monic.
Suppose is a comonad which preserves pullbacks, and let be a -coalgebra. Then a pair defines a subcoalgebra of precisely when is monic and the compatibility square
commutes.
First, is monic assuming is monic, since preserves pullbacks. Therefore is uniquely determined. We must show that satisfies the axioms for a coalgebra structure, i.e., that
Since is monic, the second equation follows from the equation
but this equation holds since
using in turn naturality of , the square (3), and the fact that is a coalgebra.
Again, preserves pullbacks, so is monic assuming is. So the first equation of (4) follows from
and this last equation is left to the reader; it follows from naturality of , the square (3), and the fact that is a coalgebra.
In the notation of lemma 1, if is a monic coalgebra map, then the square
is a pullback.
Suppose we have a commutative square
First, we prove that equalizes the pair . We have a series of equations
using, in turn, (5) (and functoriality of ), (6), the coassociativity on the coalgebra , (6) again, and naturality of . Comparing the first and last terms of this series, and using the fact that is monic (because is monic and preserves pullbacks), we obtain .
Thus factors through the equalizer of . Write . We then have
and since is monic (cf. lemma 0), we infer that . Moreover, this is unique since is monic. Thus, we have shown that any square (6) factors uniquely, via the map , through the square (5); this completes the proof.
Now we return to the proof of Theorem 1; it remains to show that the equalizer of the pair of maps in (2) is the power object of the coalgebra . Let be a coalgebra, and a coalgebra map. Then equalizes the maps in (2). Moreover, since is cofree, we have that to each -coalgebra map , there exists a unique in the topos such that
Thus in or in corresponds to a subobject in . We need to check that the condition that equalizes the pair of maps in (2) corresponds precisely to the condition that is a subcoalgebra of (the product of and in ). Or, taking into account lemmas 0, 1, 2: precisely to the condition that there is a pullback diagram of the form
where is the coalgebra structure on , and is monic.
That equalizes the pair of maps in (2) translates to a condition on , that the following diagram commutes:
Or, that the subobject equals the subobject classified by
First, remembering how was defined, the subobject classified by appears as the left-hand column in
Then, the condition is that postcomposing by and precomposing with , the subobject of classified by , which is obtained by further pulling back as below, should match the subobject , as in the left-hand column below:
This condition amounts saying that the top left square is a pullback (and that factors through the inclusion ), since the other three squares are obviously pullbacks already if preserves pullbacks. That the top left square is a pullback is tantamount to having a pullback (through which the top left square factors)
precisely as in (7). Thus the proof of theorem 1 is complete.
Let us put this theorem to work, by showing how the passage from a topos to the topos of presheaves on an internal category can be accomplished in just one step, instead of two. Given an internal category in with underlying span
there is a comonad formed as the composite
where denotes pulling back along , and is right adjoint to .
This comonad takes an object of (“a set”) to
where denotes the set of all morphisms whose target is . This comonad is manifestly pullback-preserving. A coalgebra structure
sends to some summand at an index (so we think of this as giving a fibering ). More precisely, is mapped to a function , so that given in , hitting it with gives an element in . The assignments should be thought of as producing a (contravariant) action of on the fibered set , inasmuch as the coalgebra axioms guarantee equational axioms of the form
wherever they make sense. But such a -action is simply another name for a presheaf on , where the fibering corresponds to the category of elements of .
In summary, coalgebras of this comonad are exactly internal presheaves on , and theorem 1 shows that the category of presheaves is a topos, in a single shot.
We’ve now given a reduction of three fundamental theorems of topos theory to two:
Pullback-preserving comonad theorem: Given a pullback-preserving comonad acting on a topos , the category of coalgebras is a topos.
Lex idempotent monad theorem: Given a left exact idempotent monad acting on a topos , the category of algebras is a topos.
What is interesting in these cases is that the relevant constructions, in particular the construction of power objects, can be made to look exactly the same in as in . It is therefore very tempting to seek a further reduction, i.e., a generalization which combines these two theorems into one. But finding a common generalization of monads and comonads would seem to be an ill-advised task, to say the least.
In particular, there is no analogue of the counit
in the monad case, even when the monad is idempotent. However, looking over our earlier proofs, particularly of lemmas 0, 1, and 2, we found that the presence of a counit for our comonad played a very restricted role. In fact, it played no essential role whatsoever, except in one place in the proof of lemma 0, where it was used to prove that coalgebra structures fit into pullback squares
here we used only the fact that was a common left inverse to , . In other words, if we take this pullback property for granted, then the rest of the proof of theorem 1 goes through without invoking the counit again.
Let’s look now at the monad side of things. For general monads , there are two structure maps
where denotes the unit, but for idempotent monads these maps coincide: each is the inverse of the monad multiplication
which for an idempotent monad is an isomorphism, by definition. Moreover, an algebra structure for an idempotent monad,
is itself an isomorphism with inverse given by the unit . In particular, it is trivial that for a -algebra , the following square is a pullback:
since all arrows in sight are isomorphisms. If we then define to be , then this pullback square for -algebras has the same formal shape as the pullback square for -coalgebras above.
With these preliminaries in place, we now propose the following definition as a generalization of pullback-preserving comonad and lex idempotent monad:
Definition 1: Let be a finitely complete category.
A modal operator on is a pullback-preserving functor equipped with a transformation , such that the following coassociativity square is a pullback:
A modal operator is strict if
If is a modal operator on , then a -structure is an object of together with a morphism such that the coassociativity square
is a pullback. A morphism of -structures , is a map such that
commutes.
The category of -structures will be denoted . It is not hard to see that is strict if and only if is terminal in .
Remark: I invite discussion about the appropriateness of the term ‘modal operator’ to describe such structures. On the one hand, Tarski has given a topological interpretation of a modal operator ‘necessity’, by considering the operator on the Boolean algebra which maps a subset to its interior with respect to some given topology on ; such interior operators are the same as left exact comonads on . (See for instance the paper by Awodey-Kinisha.) On the other hand, Lawvere claims that a Lawvere-Tierney topology, that is a left exact internal monad on , or (on a categorified level) a left exact idempotent monad on the ambient topos , should be construed as a modal operator where truth of is read as saying ‘ is true locally’ (where the meaning of ‘local’ depends on the Lawvere-Tierney topology chosen). It’s clear that I’m trying to consider an abstract common (categorified) generalization that covers these cases, but at present I haven’t assembled more convincing arguments for why ‘modal operator’ is a good term. Maybe it is, maybe it isn’t…
I’m happy to hear other suggestions, of course. I seem to recall that modal operators as defined here bear some broad resemblance to the notion of “interpolad” due to Koslowski, but I have not studied this work.
Here then is our fundamental three-in-one theorem:
Theorem 2: Let be a strict modal operator on a topos . Then the category of -structures is also a topos.
Proof: This proceeds pretty much the same way as the proof given above for theorem 1. It is straightforward to check that the underlying functor , from -structures to , creates and preserves pullbacks if preserves pullbacks. Thus admits pullbacks, and it admits a terminal object under the strictness assumption; therefore admits all finite limits. In particular, the underlying object in of the product of two -structures , is .
Power objects in are constructed just as they were in theorem 1. First one constructs an auxiliary map in ,
namely the map which classifies the subobject defined by the composite mono
and then as before one defines, for a -structure , a putative power object in , namely the equalizer of the diagram
in . One then has the task of proving that this works: that given a -structure , that -structure maps are in natural bijection with -structure subobjects of , the product of and in . At the risk of some redundancy (by essentially repeating earlier arguments), let’s run through this now.
Lemma 3: Let be a modal operator, and a -structure. Then a mono in targeted at is given precisely by a mono in which fits into a pullback square of the form
Proof: A morphism in is monic if and only if its underlying morphism in monic in , because the underlying functor creates and preserves pullbacks and therefore monos.
So let be a mono in , and suppose (9) is a pullback square. The map is uniquely determined since is monic, and we claim it defines a -structure on . Indeed, we have a diagram
where the right-hand square is a pullback by assumption and the fact that preserves pullbacks. Therefore, to show the left-hand square is a pullback, it suffices to show the composite square is a pullback. But this composite is the same as the composite square for the diagram
in which both squares are pullbacks. Thus indeed defines a -structure, and commutativity of (9) says is a -structure map. Thus one direction of the lemma is established.
For the other direction, suppose that (9) defines a monomorphism of -structures. We must show (9) is a pullback. For this, suppose we have a commutative square
First, we prove that equalizes the pair . We have a series of equations
using, in turn, (9) (and functoriality of ), (10), the coassociativity on the -structure , (10) again, and naturality of . Comparing the first and last terms of this series, and using the fact that is monic, we obtain .
Thus factors through the equalizer of . Write . We then have
and since is monic, we infer that . Moreover, this is unique since is monic. Thus, we have shown that any square (10) factors uniquely, via the map , through the square (9); this completes the proof of lemma 3.
Now we resume the proof of theorem 2; it remains to show that the equalizer of the pair of maps in (8) is the power object of the -structure . Let be a -structure, and a -structure map. Then equalizes the maps in (8).
Let be the composite
Then we have
where the first equation is the equalizing condition on , the second uses the fact that is a -structure map, and the third uses functoriality of . Hence is retrievable from . In fact, the mapping
defines a bijection between maps satisfying the equalizing condition and maps such that
The map classifies a subobject in , and we need to check that condition (11) on corresponds precisely to the condition that is a sub--structure of . Or, taking into account lemma 3: precisely to the condition that there is a pullback diagram of the form
where is the -structure on , and is monic.
Proof: The condition is that the subobject equals the subobject classified by the composite given in (11):
The subobject classified by appears as the left-hand column in
Then, the condition is that postcomposing by and precomposing with , the subobject of classified by , which is obtained by further pulling back as below, should match the subobject , as in the left-hand column below:
This condition amounts saying that the top left square is a pullback (and that factors through the inclusion ), since the other three squares are obviously pullbacks already if preserves pullbacks. That the top left square is a pullback is tantamount to having a pullback (through which the top left square factors)
precisely as in (12). Thus the proof of theorem 2 is complete.
To tie up some loose ends, we should check that theorem 2 really does specialize to the pullback-preserving comonad theorem and the lex idempotent monad theorem. In other words, we should check that -structures really are the same things as -coalgebras or -algebras, where is either a pullback-preserving comonad or a lex idempotent monad, seen as a modal operator.
First, suppose is a pullback-preserving comonad. We have already seen that a -coalgebra satisfies the -structure axiom. Converse, if is a -structure, then it is also a -coalgebra: all we have to do is verify the equation
Since we have a pullback
we know is monic by lemma 0, so it suffices to show . But we have
using naturality of , the counit axiom for comonads, and commutativity of (13). Finally, since is monic, we have , as was to be shown.
Next, let be a lex idempotent monad, and define to be . Here we have a commuting coassociativity square for which consists of isomorphisms and is therefore a pullback, so is a modal operator. Also, for a -algebra , the map is an isomorphism, and so here algebra structure boils down to a condition that the unit is invertible (and then ). It is then automatic that satisfies the -structure pullback condition (again since all arrows in the coassociativity square (13) are isomorphisms).
In the other direction, if is a -structure, then the naturality square
factors through the pullback (13) via a unique map , and since is an isomorphism in this case, we must have , whence also . Thus is an isomorphism, so that is a -algebra. In other words, the category of -algebras for a lex idempotent monad is equivalent to the category of -structures where is viewed as a modal operator.
Finally, it is clear that is a -algebra, hence the -algebra map is an isomorphism. Hence is terminal in , so that is strict.
We have seen that pullback-preserving comonads and left exact idempotent monads (equivalently, pullback-preserving idempotent monads) are examples of modal operators. In order for the notion of modal operator to be a really viable concept, however, we should give some other interesting examples which fall outside these two classes. We should also make good on our promise to present the fundamental passage, from a topos to the topos of sheaves on an internal site in , in a single step (more precisely, a single application of theorem 2).
These goals are interconnected: we will construct, given an internal site in , a (strict) modal operator such that the category of sheaves is equivalent to the category of -structures. This is neither a pullback-preserving comonad nor a lex idempotent monad; it’s a hybrid which supports the viability of our notion of modal operator.
Indeed, it isn’t hard to imagine how this works. First, we have a two-step passage
where assigns to each presheaf its underlying “set” (object) of elements, and is right adjoint to ; the comonad was the one used to describe as the category of coalgebras in the application of theorem 1. The functor is the associated sheaf functor, and , its right adjoint, is the underlying functor from sheaves to presheaves. The composite is a lex idempotent monad whose category of algebras is , and naturally theorem 2 guarantees that is a topos, given that is.
Suppose we traverse the full circuit of arrows from back to itself, giving rise to an operator
This is a hybrid formed from the comonad and the monad , but is neither a comonad nor a monad itself. It is, as we shall see, a (strict) modal operator on , such that is equivalent to the category of -structures! This is the denouement we were waiting for: it shows that the passage from sets to sheaves is really a one-step process, if we choose the modal operator on sets right.
The entire discussion is concentrated in theorem 3 below, after a few preliminaries. Let be finitely complete, and let be a modal operator. For each -structure , the map is a map of -structures. Thus, if denotes the underlying functor and , by slight abuse of notation, denotes the evident functor , then we have a natural transformation
Of course, this isn’t generally a unit of an adjunction or anything like that, but much of the power of modal operators derives from a fact which is the essential point of lemma 3 above: that is cartesian with respect to monos. We take advantage of this fact repeatedly.
If furthermore is a modal operator on , then we have a round trip operator
and we define a transformation in the only reasonable way possible: as the composite
Theorem 3: defines a modal operator on , and . If the modal operators and are strict, then so is .
Proof: A completely detailed proof involves some largish diagrams, but we sketch the essential points. To prove is a modal operator, we need to contemplate a diagram of shape
where all four squares are pullbacks by more or less the same argument, which we give just for the square labeled (4): this square is obtained by applying the pullback-preserving functor (at the argument ) to the square
which is a pullback by lemma 3 ( is monic, and the transformation is cartesian with respect to monos). This completes the sketch for why is a modal operator.
Now we sketch the equivalence . In one direction, it is relatively easy to define a functor . For consider an object of . This consists of an object of together with structures
(where denotes the underlying object in of the value of applied to the -structure ), subject to various pullback conditions. Observe that may be regarded as a map in ; by applying to it, we obtain a map
and hence a composite
which is a map which defines a putative -structure. Indeed, in we have a diagram of the form
where (1) is a pullback because is the underlying map of an -structure, (2) is a pullback because is a monic -structure map (then use lemma 3), (3) is a pullback because preserves pullbacks and is also a monic -structure map (lemma 3 again), and (4) is a pullback because preserves pullbacks and is a monic -structure map.
In the other direction, we now define a functor . This is a bit trickier; the main hurdle to overcome is to produce a -structure from a -structure . In summary, the pullback of a diagram having the form
is (with both pullback projections being ), by taking advantage of the pullback axiom on the -structure together with naturality of . If we leave off the leftmost and topmost map from this diagram, we get a diagram
whose pullback is (with both pullback projections being ), by the pullback condition on and the fact that preserves pullbacks. Thus the first pullback factors through the second pullback , by a map I shall again call . It may be checked that this makes a -structure. From all this we get a pullback
so that is a -structure map, and it will be so regarded. Thus, we interpret this pullback square as lifted up to the category , and we hit it with the pullback-preserving functor . The result is the square appearing in the diagram
and the pullback of the right-hand column against the bottom map is, by the -structure axiom, with both pullback projections being . Thus we arrive at a diagram
in which . It may be checked that defines an -structure in , and this concludes our sketch of the equivalence .
Finally, if the modal operators and are strict, then is terminal in , and then is terminal in . Since is terminal in , it follows that is strict. This concludes the proof of theorem 3.
Theorem 3 gives one means of constructing new modal operators from old.
I thank Sridhar Ramesh and Mike Shulman for pointing out that Theorem 1 appears in print in Johnstone’s Elephant, and Mike for many thoughtful comments on a draft of this article, in particular for pointing out the closely related work by Toby Kenney on toposes constructed by means of diads [reference to be inserted].