Skip to main content
Springer logoLink to Springer
. 2022 Jan 31;30(4):685–753. doi: 10.1007/s10485-021-09671-9

Kan Extensions are Partial Colimits

Paolo Perrone 1,, Walter Tholen 2
PMCID: PMC9296069  PMID: 35875343

Abstract

One way of interpreting a left Kan extension is as taking a kind of “partial colimit”, whereby one replaces parts of a diagram by their colimits. We make this intuition precise by means of the partial evaluations sitting in the so-called bar construction of monads. The (pseudo)monads of interest for forming colimits are the monad of diagrams and the monad of small presheaves, both on the (huge) category CAT of locally small categories. Throughout, particular care is taken to handle size issues, which are notoriously delicate in the context of free cocompletion. We spell out, with all 2-dimensional details, the structure maps of these pseudomonads. Then, based on a detailed general proof of how the restriction-of-scalars construction of monads extends to the case of pseudoalgebras over pseudomonads, we consider a morphism of monads between them, which we call image. This morphism allows in particular to generalize the idea of confinal functors, i.e. of functors which leave colimits invariant in an absolute way. This generalization includes the concept of absolute colimit as a special case. The main result of this paper spells out how a pointwise left Kan extension of a diagram corresponds precisely to a partial evaluation of its colimit. This categorical result is analogous to what happens in the case of probability monads, where a conditional expectation of a random variable corresponds to a partial evaluation of its center of mass.

Keywords: Kan extensions, Colimits, Presheaves, Partial evaluations

Introduction

Kan extensions are a prominent tool of category theory, to the extent that, already in the preface to the first edition of [21], Mac Lane declared that “all concepts of category theory are Kan extensions”, a claim reinforced more recently in [27, Chapter 1]. However, they are also considered to be a notoriously slippery concept, especially by newcomers to the subject. One of the most powerful pictures that help understanding how they work may be the idea that Kan extensions, especially in their pointwise form, “replace parts of a diagram with their best approximations, either from the right or from the left”. In other words, Kan extensions can be seen as taking limits or colimits of “parts” of a diagram. The scope of this paper is making this intuition mathematically precise.

We make use of the concept of partial evaluation, which was introduced in [9], and which is a way to formalize “partially computed operations” in terms of monads. The standard example is that “1+2+3+4” may be evaluated to “10”, but also partially evaluated to “3+7”, whereby parts of the given sum have been replaced by their sums.

Just as monads on sets may be seen as encoding different algebraic structures and operations, here we consider pseudomonads on categories which encode the operation of taking colimits. We are in particular interested in two pseudomonads: the monad of diagrams and the monad of small presheaves (also known as the free (small) cocompletion monad). Both monads are known in the literature, but certainly not presented in sufficient detail as needed for our purposes. To make the paper more accessible, we therefore decided to spell out their definition in full detail, in Sects. 2 and 4.

There is a functor connecting diagrams and presheaves, which we call the image presheaf, and which appears in the literature under the name of connected component functor [23] (at least for the case of small categories). This construction takes a diagram and forms a presheaf that can be considered the “free colimit” of the diagram. We describe the construction in detail in Sect. 3, taking care of the size issues we face when the codomain of diagrams is not small.

The definitions of pseudomonads, pseudoalgebras, and their morphisms are also hard to find in the literature in sufficient detail. For this reason, to avoid any ambiguity, we have given a detailed account of them in Appendix A. Readers who are familiar with these pseudomonads, and with the concepts of pseudomonads in general, may skip these sections, with the exception of Sects. 2.5, 4.6 and Appendix A.3, which contain new results.

Here is what the novel content of this work consists of. First of all, we show that the image presheaf induces a morphism of monads from the monad of diagrams to the monad of small presheaves, which in turn gives a pullback functor between the categories of algebras (we prove the 2-dimensional version of this statement in Appendix A.3). This morphism of monads is not injective in any sense. Indeed, it turns out that diagrams with isomorphic image presheaves have the same colimit, in a very strong sense, analogous to “differing by a confinal functor”. We indeed generalize the theory of confinal functors, and connect it to the theory of absolute colimits – both because we need that in order to prove the subsequent statements, and because it should be interesting for its own sake.

We then turn to the central topic of this paper and study partial evaluations for both monads. We prove that partial evaluations for the monad of diagrams correspond to pointwise left Kan extensions along split opfibrations, by invoking the Grothendieck correspondence between split opfibrations and functors into Cat. For the monad of small presheaves, we show that partial evaluations correspond to pointwise left Kan extensions along arbitrary functors. This result may be summarized in the following way: given small presheaves P and Q on a locally small, small-cocomplete category, Q is a partial colimit of P if and only if they can be written as image presheaves of small diagrams D and D, in such a way that D is the left Kan extension of D along some functor. More concisely, Kan extensions are partial colimits, as claimed by the paper’s title.

This result is analogous to, and was motivated by, an analogous result in measure theory involving probability monads, where partial evaluations (or “partial expectations”) correspond exactly to conditional expectations (Theorem 5.13). Indeed, one could say that “if coends are like integrals, then Kan extensions are like conditional expectations”. (See Sect. 5.4 for more on this.)

As usual, when one talks about free cocompletion, one has to be very careful with size issues. This is why some parts of this work, such as the proof of Lemma 5.12, appear to be rather technical. The payoff is that the main theorems of this work will hold for arbitrary (small) colimits in arbitrary (locally small) categories, beyond the trivial case of preorders.

Outline. In Sect. 2 we study the category of diagrams in a given category, and show that the construction gives a pseudomonad on the 2-category of locally small categories. While this construction seems to be known, its details don’t seem to have been spelled out previously. The content of Sect. 2.5, however, seems to be entirely new. We show that cocomplete categories, equipped with a choice of colimit for each diagrams, are pseudoalgebras over this pseudomonad, and that not all pseudoalgebras are of that form.

In Sect. 3 we study the concept of image presheaf, which can be seen as a “free colimit of a diagram”, or as a “colimit blueprint”. We show that having the same image presheaf is a strong and consistent generalization both of the theory of confinal functors (Proposition 3.9), and of the concept of absolute colimit (Proposition 3.13). The main ideas are already known in the literature for the case of small categories. We restate some of them to take care of the size issues we face (since some of our categories are only locally small).

In Sect. 4 we study small presheaves and show that they form a pseudomonad. Again, this is known, but here we spell out the construction in much greater detail than previous accounts have done. This enables us to establish the new result presented in Sect. 4.6: the image presheaf construction forms a morphism of pseudomonads from diagrams to presheaves.

The principal new results of this paper appear in Sect. 5. Theorem 5.5 and Theorem 5.6 state that partial colimits for the monad of diagrams correspond to pointwise left Kan extensions of diagrams along split opfibrations. In Theorem 5.10 we prove that partial colimits for the free cocompletion monad correspond to pointwise left Kan extensions of diagrams along arbitrary functors. Then, in Sect. 5.4, we compare this categorical result to the analogous measure-theoretic fact that partial expectations for probability monads correspond (in some cases) to conditional expectations of random variables. This is in line with the famous analogy between coends and integrals.

Finally, in Appendix A we recall the (known) definition of pseudomonads and pseudoalgebras, and of the categories they form, which we use in the rest of the paper. We also provide a 2-dimensional version of the restriction of scalars construction (Theorem A.7), where a morphism of monads induces a functor between the categories of their algebras in opposite direction. As far as we know, this 2-dimensional version has not appeared in the literature previously.

Categorical setting, notation, and conventions. As it is to be expected when one talks about generic colimits, size issues are relevant. Here are our conventions.

All the categories in this work (except CAT) are assumed locally small. We denote by Cat the 2-category of small categories, and by CAT the 2-category of possibly large, locally small categories. Note that CAT is itself larger than a large category (some authors call it a huge category).

When we say “category”, without specifying the size, we will always implicitly refer to a possibly large, locally small category.

Similarly, by “cocomplete category” we always mean a possibly large, locally small category which admits all small colimits.

The Monad of Diagrams

In this section we define the monad of diagrams. The first source for it that we are aware of is Guitart’s article [12], but without an explicit construction, and some concepts date back at least to Kock’s PhD thesis [16] (without the higher-dimensional aspects yet). We give in detail all the structure maps, and in Sect. 2.5 we prove that cocomplete categories with a choice of colimits are pseudoalgebras (but not all pseudoalgebras are in this form). The notions of pseudomonad and pseudoalgebra that we use are given in detail in Appendix A.

Note that, differently from some of the literature, we use the following slightly generalized notion of morphism of diagrams (also used in Guitart’s original work [12], and dating back to the origins of category theory [7, Section 23]). Moreover, in order to avoid size issues, we require every diagram to be small.

Definition 2.1

Let C be a locally small category.

  • We call a diagram in C a small category J together with a functor D:JC. Throughout this work, all the diagrams will be implicitly assumed to be of this form (i.e. be small).

  • Given diagrams (J,D) and (J,D) in C, we call a morphism of diagrams a functor R:JJ together with a natural transformation ρ:DDR, i.e. a diagram in CAT as the following. graphic file with name 10485_2021_9671_Figa_HTML.jpg

  • Given diagrams (J,D) and (J,D) in D and morphisms of diagrams (R,ρ),(R,ρ):(J,D)(J,D), we call a 2-cell of diagrams a natural transformation α:RR such that the following 2-cells are equal. graphic file with name 10485_2021_9671_Figb_HTML.jpg

We denote by Diag(C) the 2-category of diagrams in C, their morphisms, and their 2-cells. (Sometimes we will still denote by Diag(C) the underlying 1-category.)

Note that the definition of morphism of diagrams is slightly more general than just a natural transformation between parallel functors. This is still compatible with the traditional intuitive picture of “deforming a diagram into another one”, provided that one notices the following. In principle R may be not essentially surjective, so one should visualize the natural transformation ρ as “deforming” the figure drawn by D into a subfigure of the one drawn by D.

Note moreover that:

  • For C locally small, Diag(C) is locally small too;

  • The forgetful functor Diag(C)Cat given by the domain is a fibration (via precomposition), and it is an opfibration (via left Kan extensions) if and only if C is cocomplete (see for example [26, Proposition 2.8]).

In the rest of this section we show that CDiag(C) is part of a pseudomonad on CAT, and that cocomplete categories with a choice of colimit for each diagram are pseudoalgebras, with the structure map given by such chosen colimits. For the precise definitions of pseudomonads and pseudoalgebras, see Appendix A.

Functoriality

We show that the assignment CDiag(C) is part of a 2-functor on CAT. First of all, let C and D be locally small categories, and let F:CD be a functor.

Consider now Diag(C) and Diag(D) as 1-categories. There is a (1-)functor F:Diag(C)Diag(D) induced by F via postcomposition and whiskering, as follows. graphic file with name 10485_2021_9671_Figc_HTML.jpggraphic file with name 10485_2021_9671_Figd_HTML.jpgTherefore, Diag is an endofunctor of CAT.

The functor F:Diag(C)Diag(D) extends to 2-cells giving a 2-functor, but we will not need this in order for Diag to be a pseudomonad on CAT.

On the other hand, we need to extend Diag to the 2-cells of CAT. So let C and D be locally small categories, let F,G:CD be functors, and let α:FG be a natural transformation. We have an induced natural transformation α:FG induced via whiskering, as follows. To each diagram (J,D) in C, we assign the morphism of diagrams (idJ,αD) of D, i.e. graphic file with name 10485_2021_9671_Fige_HTML.jpgNaturality follows from naturality of α. This makes Diag a strict 2-functor CATCAT.

Unit and Multiplication

The Unit: One-Object Diagrams

The unit of the monad is a map constructing “one-object diagrams”. In detail, let C be a locally small category. We construct the functor ηC:CDiag(C) as follows. graphic file with name 10485_2021_9671_Figf_HTML.jpggraphic file with name 10485_2021_9671_Figg_HTML.jpgFor brevity, we will denote ηC simply by η. This is (strictly) natural in the category C: given a functor F:CD, the following diagram commutes strictly. graphic file with name 10485_2021_9671_Figh_HTML.jpgIndeed, both paths in the diagram give the following assignment, graphic file with name 10485_2021_9671_Figi_HTML.jpggraphic file with name 10485_2021_9671_Figj_HTML.jpgusing the fact that F(C)=FC if we view C as a functor 1C, and that analogously Ff is given by whiskering f (seen as a natural transformation) with F.

Diagrams of Diagrams are Lax Cocones

Let’s now turn to the multiplication. We first notice that an object of Diag(Diag(C)) is the same as a lax cocone in CAT with tip C, where the indexing category and all the categories appearing in the cone except C are required to be small. Let’s see how. Let J be a small category. A functor D:JDiag(C) assigns to each object J of J a diagram in C, i.e. a small category D0J together with a functor D1J:D0JC: graphic file with name 10485_2021_9671_Figk_HTML.jpgand to each morphism j:JJ of J a morphism of diagrams, which amounts to a functor D0j:D0JD0J together with a natural transformation D1j as below: graphic file with name 10485_2021_9671_Figl_HTML.jpgMoreover, since we want D to be a functor, we need it to preserve identities and composition, i.e. D0 needs to be a functor, and D1 needs to satisfy the conditions D1(idJ)=idD1J and D1(jj)=D1jD0jD1J, which are exactly the conditions of lax naturality. In pictures, graphic file with name 10485_2021_9671_Figm_HTML.jpggraphic file with name 10485_2021_9671_Fign_HTML.jpgIn other words, a functor D:JDiag(C) consists of a functor D0:JCatCAT, together with a lax cocone D1 in CAT under D0 with tip C. A lax cocone is a lax natural transformation D1:D0ΔC, where ΔC is the constant functor at C. For brevity, we will also write constant functors simply as objects.

The Multiplication: the Grothendieck Construction

Given now D=(D0,D1) as above, take the Grothendieck construction D0 of D0:JCatCAT, which we recall.

  • An object of D0 consists of a pair (JX) where J is an object of J and X is an object of the category D0J;

  • A morphism (J,X)(J,X) of D0 consists of a pair (jf) where j:JJ is a morphism of J, and f:D0j(X)X is a morphism of the category D0J.

The short integral sign does not denote a coend here, it is standard for the Grothendieck construction (we use different sizes to avoid confusion, since both symbols are standard notation). Note that since J and all the D0J are small, D0 is small too. Its set of objects is given by

JJD0J.

Moreover:

  • For each object J of J, the inclusion maps iJ:D0JD0 defined by the coproduct above can be canonically made into functors via graphic file with name 10485_2021_9671_Figo_HTML.jpgWe will call the images of the iJ the fibers of D0.

  • For each morphism j:JJ of J, there is a natural transformation graphic file with name 10485_2021_9671_Figp_HTML.jpgwhose component at each object X of D0J is given by graphic file with name 10485_2021_9671_Figq_HTML.jpg

  • The iJ and ij assemble into a lax cocone D0ΔD0, i.e. the identity and composition conditions are satisfied.

It is well known that D0 is the oplax colimit of D0 in Cat, with the universal lax cocone given by the iJ. We now show that it is so also in CAT, and we also give a strict version of the universal property.1

Proposition 2.2

Let D0:JCatCAT be a small diagram of small categories. Let D1:D0ΔC be a lax cocone over D0 in CAT, with tip C locally small (but not necessarily small). There is a unique functor D0C such that

  • For all objects J of J, the following triangle commutes (strictly); graphic file with name 10485_2021_9671_Figr_HTML.jpg

  • For all morphisms j:JJ of J, the following 2-cells coincide. graphic file with name 10485_2021_9671_Figs_HTML.jpg

Denote this functor by μ(D0,D1), or more briefly by μ(D). It is a diagram in C. This will give the multiplication of the monad Diag.

Proof of Proposition 2.2

Since we want the diagram 2.1 to commute strictly, the only possibility to define μ(D) on objects is as follows. For every object J of J, and for every object X of D0J,

μ(D)(J,X):=D1J(X).

Just as well, for all the morphisms of D0 in the fiber, i.e. in the form (idJ,f) for a morphism f:XX of D0J, we are forced to define

μ(D)(idJ,f):=D1J(f).

Moreover, since we want the condition (2.2) to hold, for all morphisms j:JJ of J we have to require that μ(D) on the components of ij has to give the respective component of D1j. Explicitly, for each object X of D0J,

μ(D)(ij)X=μ(D)(j,idD0j(X)):=(D1j)X.

The generic morphism (j,f):(J,X)(J,X), for j:JJ and f:D0j(X)X can be decomposed as graphic file with name 10485_2021_9671_Figt_HTML.jpgand so we have determined the action of μ(D) for all morphisms of D0.

Functoriality of this assignment is routine.

Functoriality and Naturality of the Multiplication

We now have to show that the assignment (D0,D1)μ(D0,D1) is functorial, and that it is natural in the category C.

To address functoriality, we need to look at morphisms in Diag(Diag(C)).

Given diagrams D:JDiag(C) and E:KDiag(C), where J and K are small categories, a morphism of diagrams from D to E amounts to a functor F:JK together with a natural transformation graphic file with name 10485_2021_9671_Figu_HTML.jpgExplicitly, D consists of a functor D0:JCatCAT and a lax cocone D1:D0ΔC, and E has an analogous form. The natural transformation ϕ amounts to the following. For each object J of J, we have a morphism of diagrams graphic file with name 10485_2021_9671_Figv_HTML.jpgand for each morphism j:JJ of J, the following diagrams have to commute. First of all, this diagram of functors has to commute strictly. graphic file with name 10485_2021_9671_Figw_HTML.jpgMoreover, the following composite 2-cells have to coincide, forming a commutative pyramid with 2-cells as lateral faces, whose square base is the commutative square just described above. graphic file with name 10485_2021_9671_Figx_HTML.jpg

Form now the Grothendieck construction of E0. We can form a lax cocone over D0 with tip E0 as follows. graphic file with name 10485_2021_9671_Figy_HTML.jpggraphic file with name 10485_2021_9671_Figz_HTML.jpg

Note that this is a lax cocone D0ΔE0. By the universal property of the Grothendieck construction as an oplax colimit (Proposition 2.2), there is a unique functor D0E0 such that

  • for all objects J of J, the following square commutes; graphic file with name 10485_2021_9671_Figaa_HTML.jpg

  • for all morphisms j:JJ of J, the following composite 2-cells coincide. graphic file with name 10485_2021_9671_Figab_HTML.jpg

Denote this functor by μ0(F,ϕ). This gives a triangle graphic file with name 10485_2021_9671_Figac_HTML.jpgwhich does not necessarily commute. In order to get a morphism of diagram μ(D)μ(E) we need to fill the triangle above with a 2-cell which we form as follows. Consider the object (JX) of D0, where J is an object of J and X is an object of D0J. Note that, by the diagram (2.1), μ(D)(J,X)=D1J(X). Analogously, using the diagram (2.1) for E together with the diagram (2.4),

μ(E)(μ0(J,X))=μ(E)(J,ϕ0J(X))=E1FJ(ϕ0J(X)).

We now assign to the object (JX) the morphism of C given by the component of ϕ1J at X, graphic file with name 10485_2021_9671_Figad_HTML.jpgLet’s now show that this assignment is natural. We will again test this first along the fibers, and then on the opcartesian morphisms of D0. So let f:XY be a morphism of D0J. The following diagram commutes simply by naturality of ϕ1J. graphic file with name 10485_2021_9671_Figae_HTML.jpgLet now j:JJ be a morphism of J. We have to prove that the following diagram commutes. graphic file with name 10485_2021_9671_Figaf_HTML.jpgThis is however exactly Equation (2.3), written out in components. Therefore we have a natural transformation, which we denote by μ1(F,ϕ), and we get a morphism of diagrams graphic file with name 10485_2021_9671_Figag_HTML.jpgwhich makes μ functorial. (The identity and composition conditions follow by uniqueness.) Note that in particular this shows functoriality of the Grothendieck construction not only on strictly commutative triangles of functors (i.e. in Cat/C), but also more in general for lax-commutative triangles.

Proposition 2.3

The functor μ:Diag(Diag(C))Diag(C) is strictly natural in C.

Proof

Let F:CD be a functor. The induced functor graphic file with name 10485_2021_9671_Figah_HTML.jpgmaps a lax cocone with tip C to the lax cocone with tip D obtained simply via postcomposition with F. graphic file with name 10485_2021_9671_Figai_HTML.jpgIn other words, F(D0,D1)=(D0,FD1). If we take the Grothendieck construction in both cases we get morphisms μ(D):D0C and μ(FD):D0D. By uniqueness (Proposition 2.2), necessarily μ(FD)=Fμ(D), and therefore μ is strictly natural.

Unitors and Associators

Left Unitor

Let D:JC be a diagram. We can apply to it the unit ηDiag(C):Diag(C)Diag(Diag(C)) to form the diagram graphic file with name 10485_2021_9671_Figaj_HTML.jpgcorresponding to the following (rather trivial) lax cocone in CAT with tip C. graphic file with name 10485_2021_9671_Figak_HTML.jpgWe view this as a lax cocone over the diagram J:1CAT which maps the unique object of 1 to J. If we form the Grothendieck construction as prescribed by the multiplication of the monad, we get the category J which is isomorphic to J, explicitly given as follows:

  • Objects are pairs (,X), where is the unique object of 1 and X is an object of J;

  • A morphism (,X)(,Y) is simply a morphism f:XY of J.

The functor μ(η(D)) maps then (,X) to DX and f:(,X)(,Y) to Df in C.

The functor JJ given by (,X)X is an isomorphism of categories. This defines an isomorphism of diagrams μC(ηDiag(C)(D))D, in the category Diag(C). Denote this isomorphism by . This is the map that we take as left unitor.

Proposition 2.4

The map induces a modification μ(ηDiag)id.

Proof

Let’s first show that is natural in the diagram D. If (R,ρ):(J,D)(K,E) is a morphism of diagrams, it’s easy to see that this diagram commutes strictly, graphic file with name 10485_2021_9671_Figal_HTML.jpgwhere R~ is the functor mapping (,X) to (,RX), and acting similarly on morphisms. This commutative diagram induces an analogous commutative diagram in Diag(C), so that is a natural isomorphism of functors graphic file with name 10485_2021_9671_Figam_HTML.jpg

In order to show that is a modification, we have to show that is natural in the category C as well, in the sense that for each functor F:CD the following composite 2-cells are equal, graphic file with name 10485_2021_9671_Figan_HTML.jpgwhere the squares without a 2-cell commute (by naturality).

Explicitly, we have to check that given a diagram D:JC, the following parallel functors coincide. graphic file with name 10485_2021_9671_Figao_HTML.jpgNote however that F acts on the codomain of the diagram, by postcomposing with F, while acts on the domain of the diagram, mapping J to its isomorphic copy J. Therefore both arrows give the same morphism of diagrams, explicitly the following commutative diagram of CAT, graphic file with name 10485_2021_9671_Figap_HTML.jpgwhere D~(,X)=D(X), and the action on morphisms is similarly defined. Therefore is a modification.

Right Unitor

Let D:JC be a diagram. This time we apply to it the map η:Diag(C)Diag(Diag(C)) given by the functor image of η under Diag. Explicitly, the result is the following lax cocone in CAT, with tip C, indexed by J via the constant functor Δ1:JCAT at 1. In pictures, graphic file with name 10485_2021_9671_Figaq_HTML.jpg

If we form the Grothendieck construction, this time we get the category Inline graphic which is again isomorphic to J, explicitly given as follows:

  • Objects are pairs (X,), where X is an object of J and is the unique object of 1;

  • A morphism (X,)(Y,) is simply a morphism f:XY of J.

The functor μ(η(D)) maps (X,) to DX and f:(X,)(Y,) to Df in C.

Analogously to the left unitor case, we have a functor Δ1J given by (X,)X which induces an isomorphism of categories. This defines in turn an isomorphism of diagrams μ(η(D))D, which we denote by r-1 (and its inverse by r. See Definition A.1 for the convention we are using). The map r is the one that we take as right unitor.

Proposition 2.5

The map r induces a modification idμη.

We omit the proof, since it is analogous to the case of .

Associator

In order to define the associator, we have to look at Diag(Diag(Diag(C))). So let D:JDiag(Diag(C)) be a diagram which assigns to each object J of J a diagram of diagrams D1J:D0JDiag(C), itself mapping the object K of D0J to the diagram (D1J)1:(D1J)0KC, which, in turn, maps an object L of (D1J)0K to the object (D1J)1(L) of C. We could depict the situation as follows. For brevity we omit the action on morphisms, which is similarly constructed. graphic file with name 10485_2021_9671_Figar_HTML.jpgWe can now take the Grothendieck construction at two different depths, which we can think of as “joining levels J and K” and “joining levels K and L”. The former way, which is μ(D)Diag(Diag(C)), gives the following diagram of diagrams (only two levels). graphic file with name 10485_2021_9671_Figas_HTML.jpgThe latter way, which is μDDiag(Diag(C)), gives instead the following diagram of diagrams, which is in general not isomorphic to the former. graphic file with name 10485_2021_9671_Figat_HTML.jpgIf we apply once again the Grothendieck construction to the two, we do obtain isomorphic diagrams: graphic file with name 10485_2021_9671_Figau_HTML.jpgand graphic file with name 10485_2021_9671_Figav_HTML.jpgThese are isomorphic as diagrams through the map graphic file with name 10485_2021_9671_Figaw_HTML.jpgand so the following diagram commutes up to isomorphism. graphic file with name 10485_2021_9671_Figax_HTML.jpgWe call this isomorphism the associator, and denote it by a. Again, analogously as for the unitors, we have:

Proposition 2.6

The associators assemble to a modification μμμμ.

Higher Coherence Laws

The associator and unitors satisfy coherence conditions that are analogous to the ones of monoidal categories (see Definition A.1 for the precise definition). We spell them out in detail for this case.

Unit Condition

Instantiating the unit condition of Definition A.1 in our case, we get the following statement, which reminds us of the unit condition of monoidal categories.

  • Consider a diagram of diagrams as follows. graphic file with name 10485_2021_9671_Figay_HTML.jpgApplying directly the Grothendieck construction we would have a diagram (J,K)(D1J)(K). However, we instead want to insert a “bullet” via the unitor, and this can be done in two ways.

  • We can either apply the (inverse of the) left unitor at depth K, and then take the Grothendieck construction, obtaining the following diagram. graphic file with name 10485_2021_9671_Figaz_HTML.jpg

  • Alternatively, we can apply the right unitor r at depth J, and again take the Grothendieck construction, obtaining the following isomorphic diagram. graphic file with name 10485_2021_9671_Figba_HTML.jpg

  • Now, not only are the two diagrams isomorphic, but the isomorphism relating them is exactly the associator, graphic file with name 10485_2021_9671_Figbb_HTML.jpgwhich we can view as “rebracketing”.

Pentagon Equation

Instantiating the pentagon condition of Definition A.1 in our case, we get the following statement, which reminds us of the analogous condition for monoidal categories. Consider a four-level diagram, as follows. graphic file with name 10485_2021_9671_Figbc_HTML.jpgThere are several ways of obtaining a depth-one diagram via applying the Grothendieck construction three times, and they are related to one another via the associators. In particular, we can apply the Grothendieck construction repeatedly starting from the deepest (rightmost) level, graphic file with name 10485_2021_9671_Figbd_HTML.jpgor we can start from the outermost (leftmost) level. graphic file with name 10485_2021_9671_Figbe_HTML.jpgThere are now two ways of obtaining the former from the latter via associators, and they are equal. They are induced by the following rebracketings, which form a commutative pentagon (analogous to the one of monoidal categories). graphic file with name 10485_2021_9671_Figbf_HTML.jpg

Cocomplete Categories are Algebras

In this section we prove the following statements.

Theorem 2.7

Every cocomplete category C equipped with a choice of colimit for each diagram has the structure of a pseudoalgebra over Diag.

As shown in Sect. 2.5.4, the converse of the theorem does not hold: not all pseudoalgebras are in this form.

A definition of pseudoalgebra over a pseudomonad is given in Appendix A.

Structure Map

Let C be a cocomplete category. For each diagram D:JC, choose a colimit (they are all isomorphic, pick one in each equivalence class). Let’s see why this construction is functorial. A colimit does not just consist of an object of C, but also of the arrows of the colimiting cocone. Denote by c(D) the (chosen) colimit object of D, and by h(D):Dc(D) the colimiting cocone. In components, the cocone consists of arrows graphic file with name 10485_2021_9671_Figbg_HTML.jpgfor each object J of J. Now consider a morphism of diagrams as follows. graphic file with name 10485_2021_9671_Figbh_HTML.jpgUsing ρ and the colimit cocone h(D):Dc(D) we can construct a cocone under D, with tip c(D): the one of components graphic file with name 10485_2021_9671_Figbi_HTML.jpgfor each J of J. This cocone must then factor uniquely through c(D) by the universal property of c(D) as a colimit: graphic file with name 10485_2021_9671_Figbj_HTML.jpgDenote the resulting map c(D)c(D) by c(R,ρ). It is the unique map that makes the diagram above commute for each J of J. By uniqueness, this assignment preserves identities and composition, and so c is a functor Diag(C)C. Technically speaking, for each diagram one can choose many possible colimits within the same equivalence class. However, once c(D) and c(D) are fixed, the map c(D)c(D) is unique. So any choice of such colimit objects gives rise to a functor, and all these functors will be naturally isomorphic, again by uniqueness.

We can say even more: the map c:Diag(C)C is even 2-functorial if we view Diag(C) as a 2-category (as in Definition 2.1), and C (which is a 1-category) as a locally discrete 2-category. This is made precise by the following lemma.

Lemma 2.8

Let C be a cocomplete category and c:Diag(C)C a choice of colimit for each diagram. Consider diagrams (J,D) and (J,D) in C, morphisms of diagrams (R,ρ),(R,ρ):(J,D)(J,D), and suppose there exists a 2-cell of diagrams α:(R,ρ)(R,ρ). Then c(R,ρ)=c(R,ρ).

Proof

Recall that α consists of a natural transformation α:RR such that graphic file with name 10485_2021_9671_Figbk_HTML.jpgThis way, for each object J of J, the following diagram commutes, graphic file with name 10485_2021_9671_Figbl_HTML.jpgwhere h(D):Dc(D) denotes the colimit cone of c(D) under D. The maps c(R,ρ) and c(R,ρ):c(D)c(D) are both defined as the unique maps making the following diagram commute, graphic file with name 10485_2021_9671_Figbm_HTML.jpgand are therefore equal.

Structure 2-Cells

We now give the structure 2-cells of the pseudoalgebras, namely the unitor and the multiplicator.

Lemma 2.9

The following diagram commutes up to a canonical natural isomorphism. graphic file with name 10485_2021_9671_Figbn_HTML.jpg

We denote the natural isomorphism by ι:cηidC.

Proof

Let C be an object of C. The diagram η(C) is the one-object diagram whose unique node is given by C. A colimit cocone over η(C) consists of an object C together with a specified isomorphism CC. Therefore, for any choice of c, we get canonically an isomorphism ιC:Cc(η(C)). The maps ιC assemble to a natural isomorphism ι:cηidC, since for each f:CC of C the following diagram commutes, graphic file with name 10485_2021_9671_Figbo_HTML.jpgsince the map c(η(f)) is defined (by definition of how c acts on morphisms) as the unique map making the diagram above commute.

Lemma 2.10

The following diagram commutes up to a canonical natural isomorphism. graphic file with name 10485_2021_9671_Figbp_HTML.jpg

Denote the natural isomorphism by γ:cccμ.

This statement is known in the literature, see for example [4, Section 40], as well as [26, Theorem 3.2]. Here we present a direct proof. In the proof we can see how the objects in the top right corner of the square are, in some sense, partial colimits of the objects in the bottom left corner. This will be made precise in Sect. 5.

Proof

Let D:JDiag(C). The diagram cD:JC is given by the following postcomposition. graphic file with name 10485_2021_9671_Figbq_HTML.jpgIn other words, the nodes of D are diagrams (one for each object of J), and c replaces them by their (chosen) colimit, obtaining a diagram in C indexed by J.

Recall that μ is given by the Grothendieck construction, so that μ(D) is a diagram obtained by the union of the diagrams DJ for each J of J, plus additional arrows between those subdiagrams, induced by the morphisms of J. Specifically, for every morphism j:JJ of J and for every f:XY of D0J, the following square commutes. graphic file with name 10485_2021_9671_Figbr_HTML.jpgThe object c(μ(D)) is a colimit of the resulting diagram involving all the j and the f as above. Recall now the universal property of the Grothendieck construction, and in particular diagrams (2.1) and (2.2). For each object J of J, the morphism of diagrams given by the inclusion iJ of the fiber over J, graphic file with name 10485_2021_9671_Figbs_HTML.jpginduces a map between their (chosen) colimits c(iJ,id):c(DJ)c(μ(D)). The maps c(iJ,id):c(DJ)c(μ(D)), for each J, assemble to a cocone under cD:JC, with tip c(μ(D)), meaning that for each morphism j:JJ of J, the following diagram commutes. graphic file with name 10485_2021_9671_Figbt_HTML.jpgIndeed, we can rewrite (2.2) as follows, graphic file with name 10485_2021_9671_Figbu_HTML.jpgwhich is exactly the condition for a 2-cells of diagrams (iJ,id)(iJD0J,D1j) (see Definition 2.1). By Lemma 2.8, then, c(iJ,id)=c(iJD0J,D1j), which means that (2.5) commutes.

As we said, the maps c(iJ,id):c(DJ)c(μ(D)), assemble to a cocone under cD:JC, with tip c(μ(D)). Therefore, by the universal property of c(cD) as a colimit, there exists a unique arrow c(cD)c(μ(D)), which we denote by γD, making the following diagram commute for all J of J, graphic file with name 10485_2021_9671_Figbv_HTML.jpgwhere the h(cD)J denote the arrows of the colimiting cocone. Note that for each object X of D0J we can extend the diagram above to the following commutative diagram, graphic file with name 10485_2021_9671_Figbw_HTML.jpgwhere h(DJ)X and h(μ(D))X are the components at X of the colimiting cocones of c(DJ) and c(μ(D)).

To show that γD is an isomorphism, we invoke the Yoneda embedding. Let K be any object of C. We want to show that the function graphic file with name 10485_2021_9671_Figbx_HTML.jpgwhich is natural in K, is a bijection. To this end, we note that by the universal property of colimits, the set on the left is naturally isomorphic (via composing with the components of h(μ(D))) to the subset

SJJXD0JHomC(D1J(X),K)

whose elements are families of arrows (kJ,X:D1J(X)K) such that for each j:JJ of J and each f:XY of D0J the following diagram commutes. graphic file with name 10485_2021_9671_Figby_HTML.jpgNow, for each J of J, the quantity appearing in S given by

SJXD0JHomC(D1J(X),K)

and whose elements are arrows (kJ,X:D1J(X)K) such that f:XY of D0J the following diagram commutes, graphic file with name 10485_2021_9671_Figbz_HTML.jpgis in natural bijection with

HomC(c(DJ),K)

by universal property of the colimit, via composing with the cocones h(DJ). In other words, S is in natural bijection with the subset of

SJJHomC(c(DJ),K)

whose elements are arrows (kJ:c(DJ)K) such that the following diagram commutes. graphic file with name 10485_2021_9671_Figca_HTML.jpgThe subset S, again by the universal property of the colimit, is in bijection (via composing with h(cD)) with HomC(c(cD),K), which is exactly at the right side of (2.7). Since (2.6) commutes, composing with γD has the same effect as applying the bijections given by (the inverse of) composing with h(cD) and h(DJ)X (all the bijections are invertible), and then composing with h(μ(D))X. Therefore (2.7) is a bijection too. By the Yoneda lemma, then, γD is an isomorphism.

Coherence Laws

In order to prove Theorem 2.7 it remains to be checked that the unit and multiplication coherence conditions of Definition A.4 hold. Intuitively, such coherence conditions hold by the “uniqueness property of maps between colimits”. In other words, not only do objects satisfying the same universal property admit an isomorphism between them, but they admit a unique one compatible with the universal property (in our case, the cocone): while colimit objects of a diagram may have many automorphisms (as objects), colimit cocones over the same diagram form a contractible groupoid.

Let’s see this more explicitly. The unit condition of Definition A.4, instantiated in our case, says the following. Let C be a cocomplete locally small category, and construct (choose) the functor c:Diag(C)C as before. Consider now a diagram D:JC. We can apply the map η:Diag(C)Diag(Diag(C)) as in Sect. 2.3.2 and obtain the diagram ηD:JDiag(C) as follows. graphic file with name 10485_2021_9671_Figcb_HTML.jpgNow we can either

  • apply to ηD the map c, which replaces each one-object diagram DJ with its chosen colimit c(DJ) (isomorphic to DJ via the unitor ι), giving the diagram c(D-):JC; or

  • form the Grothendieck construction and obtain the diagram (J,)DJ, with exactly the same image in C as D, but indexed by a nominally different category, and isomorphic to D via the counit ρ .

Both ways give isomorphic diagrams in C, which then have isomorphic colimits. The isomorphism between the colimits can be written a priori in two ways:

  • It is the one induced by γ:cccμ;

  • It is the one induced by the isomorphism of diagrams of components ι:DJc(DJ) for each object J of J.

The unit condition of pseudoalgebras says that these two isomorphism should be equal. This is indeed the case, by uniqueness of the morphism γ: forming the colimit cocones of D and of c(D-), which are isomorphic diagrams via ι, we have a unique morphism making the following diagram commute for all J of J, graphic file with name 10485_2021_9671_Figcc_HTML.jpgwhich can be seen as either the map γ (by definition), or as the map induced by ι, after suitably translating D into (J,)DJ via the right unitor r.

The multiplication condition of Definition A.4, again instantiated in our case, says the following. As in Sect. 2.3.3, let DDiag(Diag(Diag(C))) be a diagram as follows. graphic file with name 10485_2021_9671_Figcd_HTML.jpgWe can now take the colimit progressively, a priori in two ways: first of all, “from the inside out”, that is,

  • For each J of J and K of D0J, take the (chosen) colimits of the diagrams (D1J)1K, obtaining the following diagram of diagrams; graphic file with name 10485_2021_9671_Figce_HTML.jpg

  • Then, for each J of J, take the (chosen) colimit of the remaining innermost level diagram c(D1J), obtaining the following diagram; graphic file with name 10485_2021_9671_Figcf_HTML.jpg

  • Finally, take the colimit c(c(cD)) of the diagram just obtained.

Alternatively, we could

  • Form the Grothendieck construction of D joining levels J and K, obtaining the following diagram of diagrams; graphic file with name 10485_2021_9671_Figcg_HTML.jpg

  • Form again the Grothendieck construction, joining level L as well; graphic file with name 10485_2021_9671_Figch_HTML.jpg

  • Finally, take the colimit c(μ(μ(D))) of the resulting diagram.

The colimits constructed this way are isomorphic, a priori, in two different ways, using the maps obtained by γ in different orders (first inner level, then outer, or vice versa). However, the different ways coincide, since both colimits come equipped with the following cocones, graphic file with name 10485_2021_9671_Figci_HTML.jpgand there is a unique map making the diagram above commute for all J, K and L.

This finally proves that cocomplete categories are pseudoalgebras of Diag (Theorem 2.7).

Not All Algebras are of this Form

We now want to show the following statement.

Proposition 2.11

Not every pseudoalgebra over Diag is in the form of Theorem 2.7.

We use the following known result [26, Theorem 2.7].

Theorem 2.12

Let C be a cocomplete category. Then the category Diag(C) is cocomplete too. Moreover, the functor Diag(C)Cat which assigns to each diagram D:JC its domain J preserves colimits.

We are now ready to prove the proposition. We will prove it by showing that for free pseudoalgebras, in the form (Diag(C),μ), the map μ is in general not taking colimits of diagrams (of diagrams).

Proof of Proposition 2.11

Let C be a cocomplete category with at least two non-isomorphic objects X and Y and a morphism f:XY. Consider now the morphism η(f) of Diag(C), which can be seen as the morphism of diagrams, graphic file with name 10485_2021_9671_Figcj_HTML.jpgand so, in particular, also as a diagram of diagrams (indexed by the walking arrow 2). Denote by D:2Diag(C) this diagram of diagrams. We have that μ(D), as given by the Grothendieck construction, is a diagram indexed again by 2. Instead, by Theorem 2.12, the colimit of D in Diag(C) is a diagram whose domain must be the colimit of id:11 in Cat, which is 1. In particular, this colimit is not isomorphic to μ(D) .

Therefore, for the free algebra (Diag(C),μ), the algebra structure map μ is not in the form of Theorem 2.7. (See the end of Appendix A.2 for why (Diag(C),μ) is indeed a pseudoalgebra.)

A structural reason for why not all Diag-algebras arise this way will be given in Sect 4.6.1. Conjecturally, the generic Diag-algebras may be given by taking oplax colimits, instead of strict (in 2-categories rather than categories).

Image Presheaves

In this section we define the notion of image presheaf of a diagram, which may be interpreted as its “free” or “prototype colimit”. As far as we can tell, it was first introduced by Paré under the name “connected component functor” [23]. This concepts allows to extend and generalize the theory of cofinal functors (which we call confinal, see Sect 3.2), giving conditions for when certain diagrams have isomorphic colimits even after applying a functor to them (Proposition 3.9), and it incorporates absolute colimits as a special case (see Proposition 3.13 and the subsequent discussion).

Diagrams and Presheaves

Given a diagram D:JC, we obtain a presheaf ImD on C canonically, as follows.

Definition 3.1

The image presheaf of the diagram D, which we denote by ImD, is the colimit of the following composite functor, graphic file with name 10485_2021_9671_Figck_HTML.jpg where y denotes the Yoneda embedding.

We can view the image as a “free colimit”, the presheaf obtained as the colimit of representables indexed by the diagram D. As usual, by the universal property of colimits this assignment is functorial. The terminology “image” is motivated by a factorization (see Sect 3.2.1), as well as by analogy with random variables (see Sect 5.4).

Equivalently, ImD is the (pointwise) left Kan extension

ImD:=LanDop1,

as in the following diagram, graphic file with name 10485_2021_9671_Figcl_HTML.jpgwhere 1:JopSet is the constant presheaf at the singleton set 1, and λD denotes the universal 2-cell. This way one could generalize the definition to the case of weighted diagrams, which is however beyond the scope of the present paper.

Concretely, given an object C of C, the set (ImD)(C) is the set

colimJJ(HomC(C,DJ)).

Its elements are the equivalence classes of arrows of C of the form CDJ, for some object J of J, where we identify any two arrows f:CDJ and f:CDJ whenever there exists a morphism g:JJ of J such that f=Dgf, as in the following diagram. graphic file with name 10485_2021_9671_Figcm_HTML.jpgFunctoriality of Im, as a functor Diag(C)[Cop,Set], is given by pasting arrows and commutative diagrams.

In general, two arrows f:CDJ and f:CDJ are identified if there is a zig-zag of arrows of J connecting J and J, which we write as JJ, such that the following diagram “commutes”. graphic file with name 10485_2021_9671_Figcn_HTML.jpgBy convention, we say that a triangle containing a zig-zag as the one above commutes if and only if each arrow in the zig-zag gives a commutative triangle. For later use, we denote by [Jf] the equivalence class in ImD represented by f:CDJ.

This description in terms of equivalence classes of arrows can be stated more concisely as follows: the presheaf ImD applied to the object C is the set of connected components of the comma category C/D. This is how this construction was first introduced (for small categories) in [23, Section 2], where it is called the “connected component functor” or π0. The equivalence of the two definitions is already proven in [23, Theorem 2.1].

The Category of Elements

Let P:CopSet be a presheaf. Recall the discrete fibration given by the category of elements

graphic file with name 10485_2021_9671_Equ47_HTML.gif

Note that we use again the short integral sign, as we had used for the Grothendieck construction – but here it denotes the category of elements, as we are using the contravariant version. The category oP is the category where

  • Objects consist of pairs (Cx), where C is an object of C and xC is an element of the set PC.

  • A morphism (C,x)(C,y) is a morphism g:CC of C such that the function Pg:PCPC sends yPC to xPC.

If C is small (resp. locally small), oP is small too (resp. locally small). The functor oPC, which is a discrete fibration, maps (Cx) to C and a morphism of oP to the underlying morphism of C. The category of elements is functorial in the following way. Let α:PQ be a morphism of presheaves, i.e. a natural transformation graphic file with name 10485_2021_9671_Figco_HTML.jpgwe can construct a functor oα:oPoQ which makes the following diagram commute, graphic file with name 10485_2021_9671_Figcp_HTML.jpgwhere the morphisms into C are the canonical discrete fibrations. The functor oα is constructed as follows.

  • It maps the object (Cp), where C is an object of C and pPC, to the object (C,αC(p)). Note that αC(p)QC;

  • It maps the morphism (C,Pf(q))(C,q) induced by the morphism f:CC of C to the morphism (C,Qf(αC(q)))(C,αC(q)) again induced by f. Note that the following naturality diagram commutes. graphic file with name 10485_2021_9671_Figcq_HTML.jpg

Consider now a diagram D:JC, take its image presheaf ImD and form its category of elements oImD. Let’s see what we get explicitly.

  • An object of oImD consists of an object C of C together with an equivalence class [Jf] represented by an object J of J and a morphism f:CDJ of C.

  • A morphism (C,[J,f])(C,[J,f]) consists of a morphism g:CC of C such that (ImD)(g)([J,f])=[J,f]. This means that there is a zig-zag of arrows of J connecting J and J, which we write as JJ, such that the following diagram commutes. graphic file with name 10485_2021_9671_Figcr_HTML.jpg

Proposition 3.2

Given a morphism of diagrams graphic file with name 10485_2021_9671_Figcs_HTML.jpgthe 2-cell ρ factors in the following form, graphic file with name 10485_2021_9671_Figct_HTML.jpgwhere the triangle on the right commutes, with the vertical arrow given by oIm(R,ρ) (recall that both o and Im are functorial).

Before the proof, let’s see what the functor oIm(R,ρ):oImFoImF looks like. It maps an object

(C,[J,CfFJ])

of oImF to the object

(C,[RJ,CfFJρJFRJ])

of oImF. On morphisms, given g:CC in C and a zig-zag JJ in J making the diagram on the left commute, we get the diagram on the right. graphic file with name 10485_2021_9671_Figcu_HTML.jpgThe right-most square commutes by naturality of ρ applied to the zig-zag.

Proof

Let J be an object of J. Let’s give the component of ρ~ at J explicitly. Note first that

graphic file with name 10485_2021_9671_Equ48_HTML.gif

and that

F~(R(J))=(RJ,[FRJidFRJ]).

The morphism ρ~J:oIm(R,ρ)(F~(J))F~(R(J)) is then given by the following diagram, graphic file with name 10485_2021_9671_Figcv_HTML.jpgwith the zig-zag given by the identity. By construction, whiskering ρ~ with the forgetful functor to C we get back ρ.

Connecting Confinal Functors and Absolute Colimits

As shown in [23, Section 3], the concept of image presheaf allows us to extend a bit the theory of confinal functors,2 and unifies it with the theory of absolute colimits. Here we restate some of those ideas, since we have slightly different size conditions. A reference for the standard theory is for example given in [2, Section 2.11] (note that there the term “final functor” is used instead, for limit-invariant functors, rather than colimit-invariant).

Definition 3.3

A functor F:CD is called confinal if for every object D of D, the comma category D/F is non-empty and connected.

Note that, if we view F as a (possibly large) diagram, ImF(D) for an object D of D can be seen as the set of connected components of D/F. Therefore F is confinal if and only if ImF(D) is the terminal presheaf. (See also [23, Corollary 3.4].)

The importance of confinal functors is due to the following well-known statement, which is actually an equivalent characterization of confinality.

Proposition 3.4

If F:CD is confinal, for every functor G:DE admitting a colimit, the functor GF:CE admits a colimit too, and the map between colimits

colimCCG(F(C))colimDDG(D)

induced by the following morphism of (possibly large) diagrams graphic file with name 10485_2021_9671_Figcw_HTML.jpgis an isomorphism.

For a proof, see for example the proof of the very similar statement [2, Proposition 2.11.2] (again, note the different conventions there).

Refining the Comprehension Factorization

We would like now to prove the following statement.

Proposition 3.5

Let C be (small-)cocomplete. The (large) colimit of the fibration

graphic file with name 10485_2021_9671_Equ49_HTML.gif

exists, and it coincides with the (small) colimit of the diagram F:JC.

This is almost an instance of the following known result, sometimes called the “comprehension factorization schema”.

Theorem 3.6

[28] There is an orthogonal (EM)-factorization system on Cat, where E are the confinal functors and M are the discrete fibrations.

However, in our case we are not requiring C to be small, only locally small. Because of this, and because we need the construction explicitly, we give a dedicated proof. We construct a functor F~:JoImF as follows.

  • For each object J of J, define
    F~J:=(FJ,[J,idFJ]),
    i.e. assign to J the equivalence class represented by the identity arrow FJFJ of C.
  • For each morphism f:JJ, take the map Ff:FJFJ. Notice that we have the following commutative diagram, graphic file with name 10485_2021_9671_Figcx_HTML.jpgso that we have a well-defined morphism of oImF (the zig-zag is simply given by the morphism f).

Proposition 3.7

The functor F~:JoImF is confinal.

This suffices to deduce Proposition 3.5, since the following diagram commutes. graphic file with name 10485_2021_9671_Figcy_HTML.jpg

Proof of Proposition 3.7

We need to show that for every object (C, [Jf]) of oImF, the comma category (C,[J,f])/F~ is non-empty and connected. This is guaranteed by the way the category oImF is constructed, as follows.

First, since F~J is the equivalence class represented by the identity FJFJ, we can consider f as an arrow (C,[J,f])F~J of oImF and, hence, an object of (C,[J,f])/F~, as we have the trivially commuting diagram graphic file with name 10485_2021_9671_Figcz_HTML.jpgwith the zig-zag given by the identity. Now, given any other arrow f:(C,[J,f])F~J in oImF, we have [J,f]=ImF(f)([J,1DJ])=[J,f]. So there is a zig-zag JJ in J, which actually links f with f in the comma category (C,[J,f])/F~, as required.

Mutually Confinal Diagrams

We will make use of the following well-known fact:

Lemma 3.8

Consider the functors graphic file with name 10485_2021_9671_Figda_HTML.jpgwhere A, B and C are locally small categories. If GF is confinal and G is fully faithful, then F and G separately are confinal too.

Proposition 3.9

Let C be a locally small category. Let D:JC and E:KC be small diagrams. The following conditions are equivalent.

  1. ImD and ImE are naturally isomorphic;

  2. for every locally small category D and every functor F:CD, the composite diagram FD:JD admits a colimit if and only if FE:KD does, and in that case the two colimits are isomorphic;

  3. D and E are connected by a zigzag in Cat/C such that all the arrows of the underlying zigzag JK in Cat are confinal functors.

Definition 3.10

If the diagrams D:JC and E:KC satisfy any (and, hence, all) of the conditions above, we call them mutually confinal.

One should view the property of being mutually confinal as the absolute coincidence of their colimits: existence granted, their colimits remain the same even after applying any other functor.

This idea, minus the size issues, appears already in [23, Theorem 3.2].

Proof of Proposition 3.9

The statement (c)(b) is part of the standard theory of confinal functors (see the references). The statement (b)(a) follows from choosing for F:CD the Yoneda embedding η:CPC.

The real work is to prove (a)(c). To this end, suppose that α is an isomorphism ImDImE. We have an isomorphism between the corresponding categories of elements, graphic file with name 10485_2021_9671_Figdb_HTML.jpgtogether with functors D~:JoImD and E~:KoImE which are confinal by Proposition 3.7. We have the following diagram of confinal functors. graphic file with name 10485_2021_9671_Figdc_HTML.jpgDenote now by S the full subcategory of oImE given by the joint full image of the two functors oαD~ and E~. The situation is depicted in the following commutative diagram. graphic file with name 10485_2021_9671_Figdd_HTML.jpgBy construction, S is small, since its cardinality is bounded by the one of the disjoint union of the sets of objects of J and K, which are small. Moreover, the resulting functors JS and KS are confinal by Lemma 3.8. The resulting diagram graphic file with name 10485_2021_9671_Figde_HTML.jpggives the desired zigzag (of length 2).

Absolute Colimits

An absolute colimit is a colimit which is preserved by every functor [24]. We can redefine the concept of absolute colimits in terms of mutually confinal functor as follows. As we will see, this is equivalent to the usual definition. Again, these ideas, minus size issues, already appear in [23], see Corollary 3.3 therein and the subsequent discussion.

Definition 3.11

Let C be a locally small category, and let D:JC be a small diagram. An absolute colimit of D is an object X of C such that the diagrams D:JC and X:1C are mutually confinal.

The image presheaf of a one-object diagram is the one given by the Yoneda embedding y:C[Cop,Set], as the following proposition show.

Proposition 3.12

For each locally small category C, the following diagram commutes up to natural isomorphism. graphic file with name 10485_2021_9671_Figdf_HTML.jpg

Proof

Using the definition of image in terms of Kan extensions, and recalling that η(X) is the diagram X:1C that picks out the object X, we have that Im(η(X)) is given by the following Kan extension, graphic file with name 10485_2021_9671_Figdg_HTML.jpgwhich is isomorphic to HomC(-,X), i.e. the image of X under the Yoneda embedding. The isomorphism is moreover natural in X, by the universal property of (free) colimits.

Therefore, equivalently, an object X is an absolute colimit of the diagram D:JC if and only if ImD is naturally isomorphic to the representable presheaf HomC(-,X).

If we instance Proposition 3.9 for this case, we get the following statement.

Proposition 3.13

Let C be a locally small category. Let D:JC and be a small diagram, and let X be an object of C. The following conditions are equivalent.

  1. X is an absolute colimit of D (i.e. ImDHomC(-,X) naturally);

  2. for every locally small category D and every functor F:CD, the object F(X) is the colimit in D of the composite diagram FD:JD;

  3. D:JC and X:1C are connected by a zigzag in Cat/C such that all the arrows of the underlying zigzag J1 in Cat are confinal functors.

Note that condition (b) implies, in particular, that indeed X is a colimit of D (take F to be the identity). Denote the colimit cone by h:DX.

We can now rewrite condition (c) in a more elementary way. Recall that in the proof of Proposition 3.9 we had obtained condition (c) from (a) by forming the category of elements of the (common) image presheaf, and taking the joint image of the confinal functors from J and from 1 to this category of elements. The category of elements of the representable presheaf HomC(-,X) is isomorphic to the slice category C/X. We therefore have to take the joint image in C/X of the two functors at the top of this diagram, graphic file with name 10485_2021_9671_Figdh_HTML.jpgwhere the functor JC/X maps an object JJ to the arrow of the colimit cone hJ:DJX. Just as in the proof of Proposition 3.9, denote this joint full image by S. Now, the resulting functor 1S is trivially confinal, since it maps the unique object of 1 to idXC/X. More interestingly, the proof of Proposition 3.9 says that also the resulting functor JS is confinal. The condition is nontrivial for the only object of S that does not come from J, which is the one coming from 1, namely idXC/X. For this object, the confinality condition of the functor JS says the following:

  • (d)

    There exist an object J of J and an arrow f:XDJ of C such that the following diagram commutes: (d)and such that moreover, for each object J of J and arrow f:XDJ making a similar diagram commute, there exists a zigzag JJ in J making the following diagram in C commute. (d)

Intuitively, we can interpret this condition as “the colimit cone eventually has a section, which is in some sense unique”. This is similar to very well-known statements in the literature, see for example Theorems 2.1 and 4.1 in [24]. Therefore we can view our theory of mutually confinal functors as a joint generalization both of confinal functors and of absolute colimits.

The Monad of Small Presheaves

In this section we study small presheaves, and show that they also form a pseudomonad. Moreover, the image map of the previous section gives a morphism of pseudomonads (also explicitly defined in Appendix A). Again, cocomplete categories are pseudoalgebras of this monad, but this time, every pseudoalgebra is of this form. Indeed, considering the long history of (co)completion theory of categories (see [14, 18] for early contributions), one should view the monad of small presheaves as the “free small-cocompletion monad”. The fact that Diag admits cocomplete categories as algebras is then to be thought of as an instance of the “restriction of scalars” construction, where algebras of a monad can be pulled back along a morphism of monads, see Appendix A.3.

It is known that small presheaves form a pseudomonad [6]. However, we did not find an explicit construction in the literature, so we give one in the present section. Compared to the pseudomonad of Sect. 2, this one is weaker: the underlying pseudofunctor is not a strict 2-functor. A short review of the relevant basic definitions can be found in Appendix A. The fact that Im defines a morphism of pseudomonads (Sect. 4.6) seems to be new.

Small Presheaves

Definition 4.1

A presheaf is called small if it is (naturally isomorphic to) the image presheaf of a small diagram.

Denote by PC the full subcategory of [Cop,Set] whose objects are small presheaves.

The image presheaf of a (small) diagram is by definition a small presheaf, so that the functor Im:Diag(C)[Cop,Set] actually lands in PC. We denote the resulting functor Diag(C)PC again by Im. This will not cause confusion, since from now on we will only consider small presheaves.

Despite the slightly new terminology, this is a known concept, see for example [6]. We recall the following facts.

  • A presheaf is small if and only if it can be written as a small colimit of representables [6, Section 2]. Therefore we can think of small presheaves as of forming the free small cocompletion of a category.

  • The category PC of small presheaves on a locally small category C is itself locally small. This allows us to avoid several size issues when talking about the free cocompletion.

Notice also the following fact.

Remark 4.2

Let C be a locally small category, and let P:CopSet be a small presheaf. Then we know (Proposition 3.7) that there exist a small category S and a confinal functor F~:SoP. By Lemma 3.8, we can assume that F~ is fully faithful, or equivalently that it is the inclusion of a full subcategory.

For later use in this section, we recall the following known statement, sometimes called the co-Yoneda lemma (see [15, Section 3.10], as well as [19, Section 2.2]).3

Proposition 4.3

Let C be a category, and let H:CSet be a functor. There is an isomorphism

H(C)CCHomC(C,C)×H(C),

for each object C of C and natural in C, given by mapping each element xH(C) to the equivalence class in the coend above the ordered pair (idC,x)HomC(C,C)×H(C).

The Pseudofunctor

Given locally small categories C and D and a functor F:CD, we would like to find an assignment PCPD, which maps small presheaves to small presheaves.

Definition 4.4

Let F:CD be a functor between locally small categories, and let P be a small presheaf on C. The pushforward of P along F is the presheaf on D given by the following left Kan extension. graphic file with name 10485_2021_9671_Figdk_HTML.jpg We denote the resulting presheaf by FP.

Equivalently, FP is given by the free colimit of F, weighted by P. By the universal property of (weighted) colimits, it is therefore functorial in F. Note that this definition specifies FP only up to isomorphism. As usual, the choice of a particular object within its isomorphism class is de facto irrelevant.

Recall the following fact, which says that Kan extension diagrams can be pasted vertically. While the statement is folklore and a consequence of the simple fact that universal arrows [21] compose in an obvious sense, we provide a proof because the explicit isomorphism given in the proof will be of use later.

Proposition 4.5

Let A, B, C and D be categories, and let F:AB, G:BC, H:AD be functors. The left Kan extensions LanG(LanFH) and LanGFH are naturally isomorphic. graphic file with name 10485_2021_9671_Figdl_HTML.jpg

Proof

By the universal property of LanFH, the natural transformation λGF on the right of (4.1) factors uniquely through λF, i.e. there exists a unique 2-cell ν:LanFHLanGFHG such that the following 2-cells are equal. graphic file with name 10485_2021_9671_Figdm_HTML.jpgMoreover, by the universal property of LanG(LanFH), the natural transformation ν factors uniquely through λG, meaning that there exists a unique natural transformation κ:LanG(LanFH)LanGFH such that the following 2-cells are equal, graphic file with name 10485_2021_9671_Figdn_HTML.jpgwhere the unlabeled arrow (for reasons of space) denotes LanG(LanFH). We now show that κ is an isomorphism, by providing an inverse. By the universal property of LanGFH, the composite natural transformation on the left of (4.1) factors uniquely through λGF, meaning that there exists a unique natural transformation δ:LanGFHLanG(LanFH) such that the following 2-cells are equal. graphic file with name 10485_2021_9671_Figdo_HTML.jpgwhere this time the unlabeled arrow denotes LanGFH. By the universal properties of the respective Kan extensions, we then have that κδ has to be the identity natural transformation at LanGFH, and δκ has to be the identity natural transformation at LanG(LanFH).

Corollary 4.6

Pushforwards of small presheaves exist, are given by pointwise left Kan extensions, and are small.

Remark 4.7

Since we are dealing with pointwise Kan extensions, we can also express this vertical pasting law in terms of coends, where it is an instance of the co-Yoneda lemma (Proposition 4.3). In particular, let A and B be locally small categories, let F:AB be a functor, and let P:AopSet be a small presheaf. Then

FP(B)AAP(A)×HomB(B,FA).

Let moreover C be locally small, and G:BC be a functor. Then

GFP(C)AABBP(A)×HomB(B,FA)×HomC(C,GB)AAP(A)×HomC(C,GFA)(GF)P(C),

where the middle isomorphism, which in the proof Proposition 4.5 was denoted by κ, is given by the co-Yoneda lemma.

Now, given F:CD, we have a (chosen) mapping F:PCPD. For P to be a pseudofunctor, we first of all need F to be a functor. To this end, let α:PQ be a natural transformation between small presheaves on C. By the universal property of FP as a Kan extension, there is a unique 2-cell FPFQ, which we denote by Fα, which makes the following 2-cells equal. graphic file with name 10485_2021_9671_Figdp_HTML.jpgThis makes F a functor PCPD, where functoriality holds by uniqueness of the cell Fα. Uniqueness of such cell holds once a choice of F has been made. (One can obtain this 2-cell also using the pointwise characterization of F as a coend.)

Unitor and Compositor

The left Kan extension of PPC along the identity functor id:CC is naturally isomorphic to P itself, and this isomorphism is natural in P as well. In other words, id:PCPC is naturally isomorphic to id:PCPC. Since we are free to choose idP within its isomorphism class, we can in particular pick idP=P, so that the unitor of our pseudofunctor is the identity (one speaks of a normal pseudofunctor).

With composition, the matters are not so simple. By Proposition 4.5, or by Remark 4.7, we know that Kan extensions preserve compositions up to a specified natural isomorphism, which we had denoted by κ. In general we cannot assume that κ is the identity, we cannot make that choice consistently across the whole category. However, we can show that κ satisfies all the properties of a compositor, and so it makes P pseudofunctorial.

As in Remark 4.7, let A, B and C be locally small categories, and let F:AB and G:BC be functors. Let moreover P be a small presheaf on A. The isomorphism κ:G(FP)(GF)P of PC given by the co-Yoneda lemma, as in Remark 4.7, is (strictly) natural in P, in F, and in G, by the universal property of coends.

In order to have pseudofunctoriality it remains to be shown that the compositor κ is associative and unital. Unitality is guaranteed by our choice of unitor (identities), we now prove associativity.

Proposition 4.8

The following “associativity” diagram commutes for all locally small categories A, B, C and D and functors F:AB, G:BC and H:CD. graphic file with name 10485_2021_9671_Figdq_HTML.jpg

One could again invoke the co-Yoneda lemma, but it may be instructive to give a proof by explicitly pasting Kan extensions vertically. For simplicity, we equivalently prove the statement in terms of the inverse κ-1.

Proof

Let P be a small presheaf on A. By iterating (4.2), both composite cells graphic file with name 10485_2021_9671_Figdr_HTML.jpgare equal to the following composition. graphic file with name 10485_2021_9671_Figds_HTML.jpgBy the universal property of (HGF)P as a Kan extension, then, the two composite 2-cells (HGF)PHGFP are equal.

This proves that P is a pseudofunctor CATCAT.

Naturality of the Image

Consider a functor F:CD between locally small categories, and let D:IC be a (small) diagram in C. One can either form the image presheaf of D and then push it forward along F, or one can first form the diagram FD:ID, and then take the image presheaf. As we will see shortly, the result is the same, up to coherent isomorphism.

Proposition 4.9

The functors Im:Diag(C)PC form a pseudonatural transformation DiagP.

Proof

First of all, for each functor F:CD of CAT we need a natural isomorphism in the following form. graphic file with name 10485_2021_9671_Figdt_HTML.jpgLet now D:IC be a small diagram. Using the definition of image in terms of Kan extensions, the two routes of Equation (4.4) are given by the following Kan extensions, respectively, graphic file with name 10485_2021_9671_Figdu_HTML.jpgwhich we know are naturally isomorphic via the map κ of Proposition 4.5 and Sect. 4.2.1, which is also natural both in D and in F. The unit and multiplication conditions correspond to the unitality and associativity condition for κ.

Unit and Multiplication

The Unit: the Yoneda Embedding

Let C be a locally small category. By Proposition 3.12, the Yoneda embedding y:C[Cop,Set] lands in PC: representable presheaves are small. We denote again by η:CPC the functor induced by the Yoneda embedding CHomC(-,C). When this causes confusion because both monads Diag and P are present, we will denote the two units by ηDiag and ηP.

Proposition 4.10

The unit η:CPC is canonically pseudonatural in C.

Proof

Let C and D be locally small, and let F:CD be a functor. We have to prove that the following diagram commutes up to coherent isomorphism. graphic file with name 10485_2021_9671_Figdv_HTML.jpgIn practice, using the coend description of F (via Kan extensions), this amounts to a natural isomorphism of presheaves,

HomD(-,FC)CCHomC(C,C)×HomD(-,FC)

which is given by the co-Yoneda lemma (Proposition 4.3), by setting H(C)=HomD(-,FC). In particular, the isomorphism is given pointwise, for each object D of D by mapping f:DFC to the equivalence class of (idC,f)HomC(C,C)×HomD(D,FC). It can be checked that, defined this way, the isomorphism respects identities and composition.

The Multiplication: Free Weighted Colimits

Let’s now turn to the multiplication of the monad. One could define it as the left-adjoint to the unit, since the monad turns out to be lax idempotent (a.k.a. Kock-Zöberlein). Here, instead, we define the multiplication directly.

Definition 4.11

Let C be locally small, and let Φ be an object of PPC (i.e. a small presheaf on small presheaves). We define μ(Φ) as the object of PC specified, up to isomorphism, by the following “free weighted colimit”,

μ(Φ)(C):=PPCΦ(P)×P(C)

for each object C of C.

Remark 4.12

By functoriality of colimits, μ(Φ) is a presheaf CopSet. Since PC is in general not small, let’s show why the coend exists. Since Φ is small, there exists a small diagram D:IPC such that Im(D)Φ. In other words,

μ(Φ)(C)PPCIIHomPC(P,DI)×P(C),

which by Fubini and by the co-Yoneda lemma (Proposition 4.3) is naturally isomorphic to

IIDI(C).

This coend exists, since it is a coend in Set indexed by a small category, and it gives a small presheaf (since PC is cocomplete).

Therefore μ is a functor PPCPC (as usual, defined up to natural isomorphism).

Proposition 4.13

The functor μ:PPCPC is pseudonatural in the category C.

Proof

We have to prove that the following diagram commutes up to coherent natural isomorphism. graphic file with name 10485_2021_9671_Figdw_HTML.jpgGiven ΦPPC, the top right path gives the presheaf

CCPPCHomD(-,FC)×Φ(P)×P(C),

which (by Fubini) is isomorphic to

PPCΦ(P)×FP. 4.5

The bottom left path gives the presheaf

QPDPPCHomPD(Q,FP)×Φ(P)×Q,

which by the co-Yoneda lemma (Proposition 4.3) is isomorphic to (4.5). One can check that this isomorphism respects identities and composition.

Unitors, Associators, Coherence

The left unitor graphic file with name 10485_2021_9671_Figdx_HTML.jpgis given as follows. Starting with a presheaf P in PC, we can apply the unit to get the following representable presheaf on PC,

HomPC(-,P),

and then, applying the multiplication, we get the following presheaf,

QPCHomPC(Q,P)×Q(-)P,

where the last isomorphism, filling the diagram above, is given by the co-Yoneda lemma (Proposition 4.3), and defines the left unitor . Naturality in P follows from naturality of the co-Yoneda isomorphism. The modification property for against functors F:CD is again an instance of the coherence of colimits (uniqueness of the isomorphism), just as in Sect. 2.5.3 and Proposition 4.8, this time for weighted colimits.

The right unitor graphic file with name 10485_2021_9671_Figdy_HTML.jpgis given as follows. Again start with a presheaf P in PC. This time we apply the map η, to get the following presheaf.

QCCPC×HomPC(Q,ηC(C))=CCPC×HomPC(Q,HomC(-,C))

(Note that we cannot apply the Yoneda lemma to simplify the expression on the right.) We now apply the multiplication again, to obtain

XQPCCCPC×HomPC(Q,HomC(-,C))×Q(X)CCP(C)×HomC(X,C)P(X),

where both isomorphisms are given again by the co-Yoneda lemma (Proposition 4.3). This gives the right unitor r, and the reason why it’s a modification is analogous to the one for the left unitor .

The associator graphic file with name 10485_2021_9671_Figdz_HTML.jpgis again an instance of the co-Yoneda lemma (Proposition 4.3). Namely, given ΦPPPC, the top-right path of the diagram gives the following presheaf

CPPCΨPPCP(C)×Φ(Ψ)×HomPCP,QPCΨ(Q)×Q(-)

while the bottom-left path gives the following,

CΨPPCQPCΦ(Ψ)×Φ(Q)×Q(C)

and the two differ by one application of the co-Yoneda lemma (over P). This gives the associator a, which is a modification for reasons analogous to the above.

Again, the higher coherence conditions hold by the uniqueness of isomorphisms given by the universal property, as in Sect. 2.5.3.

Algebras

It is well-known that the pseudoalgebras of the pseudomonad P are cocomplete categories with a choice of (weighted) colimit, and pseudomorphisms of pseudoalgebras are cocontinuous functors [6]. Differently from the case of Diag, this is a complete characterization. Let’s see in detail how the structure maps look.

Given a small-cocomplete, locally small category C, let e:PCC be a choice of weighted colimits, that is,

e(P)XCP(X)X, 4.6

where denotes the copower (also known as tensor, see for example [15, Section 3.7]).

The copower satisfies a (more general) version of the “co-Yoneda lemma” of Proposition 4.3, as follows. See again [15, Section 3.10] for more details.

Proposition 4.14

Let C be a category, let D be a cocomplete category, and let H:CD be a functor. There is an isomorphism

H(C)CCHomC(C,C)H(C),

for each object C of C and natural in C, given by selecting the component of idCHomC(C,C) in the copower.

A similar argument as Remark 4.12 shows then that the coend in (4.6) exists. The action of e on morphisms is the one given by the universal property, as usual.

The unitor and multiplicator of the algebras are given as follows. First of all, the unitor graphic file with name 10485_2021_9671_Figea_HTML.jpgis the canonical isomorphism given by the general co-Yoneda lemma (Proposition 4.14),

XCHomC(X,C)XX.

for all CC.

The multiplicator graphic file with name 10485_2021_9671_Figeb_HTML.jpgis also given by the generalized co-Yoneda lemma, as follows,

YCPPCΦ(P)×HomCY,XCP(X)XYPPCXCΦ(P)×P(X)X

for all ΦPPC. This can be seen as a “generalized Fubini” for coends or weighted colimits, analogous to Lemma 2.10.

Again, the coherence conditions can be seen as a matter of uniqueness of the isomorphism by the universal property of weighted colimits.

The Image is a Morphism of Monads

Here we want to show that the image is a (pseudo)morphism of (pseudo)monads, following Definition A.2. The unit modification u is given by the isomorphism of Proposition 3.12. Again, the fact that this gives a modification comes from the universal property.

The multiplication modification m is in the following form, graphic file with name 10485_2021_9671_Figec_HTML.jpgwhere ImIm is shorthand for the following composite. graphic file with name 10485_2021_9671_Figed_HTML.jpg(Note that, since the interchange law of pseudonatural transformations holds only up to natural isomorphism, a priori horizontal composition is not uniquely defined, as in a weak bicategory. The choice we make, which will be consistent throughout the document, is motivated by later convenience.)

Explicitly, m is given as follows. Let D:JDiag(C) be the following diagram of diagrams. graphic file with name 10485_2021_9671_Figee_HTML.jpgThen, for every CC, writing images and μ in terms of coends, the two paths of diagram (4.7) are the following objects,

PPCJJHomPCP,KD0JHomC(-,D1J(K))×P(C)

and

JJKD0(J)HomC(X,D1J(K))

and they are again isomorphic by the co-Yoneda lemma (Proposition 4.3), over P. This isomorphism is what we take as the multiplicator m. Again, the fact that it forms a modification follows from uniqueness, and so do the higher coherence conditions of Definition A.2.

The Pullback Functor of Algebras

In algebra, given a ring morphism f:RR, every R-module is canonically an R-module too, via the map f, and morphisms of R-modules are morphisms of R-modules too. The resulting “pullback” functor between the categories of R-modules and R-modules is known as the “restriction of scalars” [3, Chapter II], or “Weil restriction” in algebraic geometry [29, Section 1.3]. Not every R-module arises this way if f is not an isomorphism (for example, R itself, seen as an R-module, does not).

More generally, given a morphism of monads λ:TT, every T-algebra is canonically a T-algebra via λ, and morphisms of T-algebras are automatically morphisms of T-algebras. This is well known (see, for example, [1, Theorem 3 in Section 3.6]), and the pullback functor is again called “restriction of scalars”, after its instance for the case of rings. Again, not every T-algebra arises this way (if λ is not an isomorphism), for example, the free algebra (TX,μ), where X is any object, in general does not.

With Im:DiagP we are witnessing an instance of this phenomenon in higher dimensions: every P-algebra, i.e. a cocomplete category, is also a Diag-algebra, via the map Im, which is a morphism of monads. As we have seen in Sect. 2.5.4, not every Diag-algebra arises this way, for example, in general free algebras do not. The 2-dimensional restriction-of-scalars theorem is given in Appendix A.3 as Theorem A.7.

To see that cocomplete categories as Diag-algebras indeed arise in this way, note that the following diagram commutes up to natural isomorphism for each cocomplete category C (and any choices of colimits c and coends e). graphic file with name 10485_2021_9671_Figef_HTML.jpgIndeed, the fact that this diagram commutes is, for the last time, an instance of the general co-Yoneda lemma (Proposition 4.14): given D:JC,

XCJJHomC(X,DJ)XJDJ,

which is indeed just the colimit of D, given up to isomorphism by c. (Compare this with the analogous “free” case of Remark 4.12.)

Partial Colimits

We review here the basic ideas of partial evaluations, which are a categorical formalization of the idea of “computing only pieces of an operation”. We will apply this to the operation of colimit encoded by the monads Diag and P. We will then show that, for both monads, we have a correspondence between Kan extensions and partial evaluations of colimits (Theorems 5.5, 5.6 and 5.10). Intuitively, these results may be interpreted as the fact that “a left Kan extension is a partially computed colimit”. While the statement for the Diag monad is rather straightforward, the corresponding statement for P requires quite more work.

Partial Evaluations

Partial evaluations were introduced in [25, Chapter 4] for the case of probability monads, and defined for the general case in [9]. A detailed study of their compositional structure (in general they don’t form a category) is given in [5].

Definition 5.1

Let (T,μ,ν) be a monad on Set, and let (Ae) be a T-algebra. Consider the parallel pair of maps graphic file with name 10485_2021_9671_Figeg_HTML.jpgof which e:TAA is the coequalizer. Given elements p,qTA, a partial evaluation from p to q is an element rTTA such that μA(r)=p and Te(r)=q.

If such a partial evaluation exists, we also say that q is a partial evaluation of p and that p can be partially evaluated to q.

Example 5.2

Let (T,μ,ν) be the free commutative monoid monad. Given a set X, the elements of TX can be thought of as formal sums of elements of X, for example in the form x+y+z with x,y,zX. Natural numbers with addition form a commutative monoid, and so N forms a T-algebra. The formal sum

3+4+5+6,

seen as an element of TN, can be partially evaluated to the formal sum

7+11

via the element [3+4]+[5+6]TTN. The interpretation is that “we haven’t summed everything together, but only some of the terms”.

An essential property of partial evaluations is that “they don’t change the total result”. This is reflected by the multiplication diagram of the algebra, graphic file with name 10485_2021_9671_Figeh_HTML.jpgwhich can be seen as saying that evaluating (via the map e) two formal expressions which differ by a partial evaluation, the (total) result of the evaluation is the same. In the example above, both formal sums evaluate to 18.

Let’s also remark that “total evaluations are a special case of partial evaluations”. We refer the interested reader to [9] and [5] for more details and examples. In our case we need a higher-dimensional, weaker analogue of the concept, since we are dealing with pseudomonads on CAT rather than monads on Set.

Definition 5.3

Let (T,μ,ν) be a pseudomonad on CAT, and let (A,e) be a T-pseudoalgebra. Given objects P,QA, a partial evaluation from P to Q is an object R of TTA such that μA(R)P and Te(R)Q.

If such a partial evaluation exists, we also say that Q is a partial evaluation of P and that P can be partially evaluated to Q.

Partial Evaluations of Diagrams

We now instance Definition 5.3 for the case of the monad Diag, keeping in mind the following multiplication square, which commutes up to isomorphism. graphic file with name 10485_2021_9671_Figei_HTML.jpg

Definition 5.4

Let C be a cocomplete category, and let D:JC and D:KC be small diagrams. A partial evaluation from D to D for the monad Diag is an object E of Diag(Diag(C)) such that μ(E)D and cED. If such an object exists, we also say that D is a partial colimit of D for the monad Diag.

As we see shortly, we now establish an equivalence between partial evaluations of diagrams and left Kan extensions along split opfibrations. There are now two ways of talking about the correspondence. One, which is probably the easiest to state, is as an equivalence of properties.

Theorem 5.5

Let C be a cocomplete category, and let D:JC and D:KC be small diagrams. Then D is a partial colimit of D (for the monad Diag) if and only if D can be written as the (pointwise) left Kan extension of D along a split opfibration.

Instead of an equivalence of properties we can also write the result as an equivalence of structures, as follows.

Theorem 5.6

Let C be a cocomplete category, and let D:JC and D:KC be small diagrams. There is a bijection between partial evaluations of colimits from D to D and split opfibrations F:JK exhibiting D as a (pointwise) left Kan extension of D along F.

graphic file with name 10485_2021_9671_Figej_HTML.jpg

In particular, if K is the terminal category 1, the Kan extension is an ordinary colimit. This is an instance of the fact that “total evaluations are a special case of partial evaluations”.

We will prove the latter statement, Theorem 5.6, since it clearly implies the former. We will use the following property of Kan extensions along opfibrations, which can be interpreted as “fiberwise colimits”. The following two statements are well known (see for example the nLab page on Kan extensions).

Proposition 5.7

Let F:EB be an opfibration between small categories. For each object E of E, the inclusion of the fiber F-1(E) into the comma category F/E has a left adjoint. Hence, it is a confinal functor.

Corollary 5.8

Let F:EB be an opfibration between small categories, and let C be cocomplete and locally small. The (pointwise) left Kan extension of a functor G:EC along F at B can then be computed by a colimit labeled by the fiber of F at B:

LanFG(B)colimEF-1(B)G(E).

We are now ready to prove the theorem. The crucial point of the theorem is the correspondence between split opfibrations and (strict!) functors into Cat.

Proof of Theorem 5.6

First of all, suppose that we have a partial evaluation from D to D. That is, let E=(E0,E1) be an object of Diag(Diag(C)) such that μ(E)=D and cE=D. Note that this implies that E0 is necessarily indexed by K (up to isomorphism), it is a functor KCat. We can now express c(E) as the left Kan extension of μ(E) along the Grothendieck fibration π:E0K. Indeed, using Corollary 5.8, we have that for each object K of K,

Lanπμ(E)(K)colim(K,X)π-1(K)μ(E)(K,X)=colimXE0KE1K(X)c(E)(K).

Conversely, let F:JK be a split opfibration and suppose that D is the left Kan extension of D along F. graphic file with name 10485_2021_9671_Figek_HTML.jpgLet now E0:KCatCAT be the functor associated with the opfibration F. Concretely, this is the functor mapping each object K of K to its preimage F-1(K), and each morphism k:KK to the functor F-1(K)F-1(K) given by the opcartesian lifts. Define also, for each object K of K, the functor E1K:E0KC to be given by graphic file with name 10485_2021_9671_Figel_HTML.jpgFor each morphism k:KK, define E1j to be the 2-cell given by the opcartesian lifts in J, whiskered by D. We have that, by construction, μ(E0,E1)=D. Moreover, by Corollary 5.8, D=c(E0,E1).

Note also that, in the hypotheses above, D and D must necessarily have isomorphic colimits, either because “Kan extensions can be stacked vertically” (Proposition 4.5), or because of the multiplication square (5.1). A different but related picture is given in [26, Proposition 2.9].

Partial Evaluations of Presheaves

We now give and prove a similar statement for the monad P. Let’s instance Definition 5.3 for the case of the monad P.

Definition 5.9

Let C be a cocomplete category, and let P and Q be small presheaves on C. A partial evaluation from P to Q for the monad P is a presheaf on presheaves Φ in PPC such that μ(Φ)P and eΦQ. If such an object exists, we also say that Q is a partial colimit of P for the monad P.

This time we establish a correspondence with Kan extensions along any functor, not just along a split opfibration. Moreover, we only have a weak statement, analogous to Theorem 5.5, an equivalence of properties rather than of structures.

Theorem 5.10

Let C be a cocomplete category, and let P,QPC. The following conditions are equivalent.

  1. There exists a small diagram D:JC such that ImDP, and a small category K with a functor F:JK such that Im(LanFD)Q. graphic file with name 10485_2021_9671_Figem_HTML.jpg

  2. There exists a partial evaluation from P to Q for the monad P, i.e. an object ΦPPC such that μ(Φ)P and e(Φ)Q. graphic file with name 10485_2021_9671_Figen_HTML.jpg

In other words, Q is a partial colimit of P if and only if it can be written as the image presheaf of the left Kan extension of a diagram with image presheaf P.

Again, if K is the terminal category 1, the Kan extension is an ordinary colimit (“total evaluations are a special case of partial evaluations”).

The proof of the theorem, which can be considered the main result of this paper, requires more work than the analogous statement for Diag, and will have to use two auxiliary lemmas.

Lemma 5.11

Let F:JK be a functor between small categories. There exist a small category H, a confinal functor G:HJ, and a split opfibration π:HK such that for every locally small category C and every diagram D:JC, the pointwise left Kan extension of DG along π exists if and only if the one of D along F exists, and in that case the two Kan extensions are naturally isomorphic.

We can depict the situation as follows: graphic file with name 10485_2021_9671_Figeo_HTML.jpgNote that the diagram above does not necessarily commute, nor does any of its two subdiagrams.

Proof of Lemma 5.11

Given an object K of K, the comma category F/K has

  • As objects, pairs (Jk) where J is an object of J and k:FJK is a morphism of K;

  • As morphisms, commutative diagrams in the following form, graphic file with name 10485_2021_9671_Figep_HTML.jpgwhere j:JJ is a morphism of J.

Now define the functor F/-:KCat as follows.

  • To each object K of K assign the comma category F/K;

  • To each morphism :KK of K, assign the functor F/KF/K given by post-composition with .

Choose now as H the Grothendieck construction F/-, and notice that it is small. Up to isomorphism,

  • Its objects are triplets (KJk) where K is an object of K, J is an object of J, and k:FJK is a morphism of K;

  • Each morphism :KK of K defines an “opcartesian” morphism :(K,J,k:FJK)(K,J,k:FJK);

  • For each object K of K, a morphism j:(J,k:FJK)(J,k:FJK) of F/K (i.e. a morphism j:JJ of J with kFj=k) defines a morphism “in the fiber” (K,J,k)(K,J,k), which we denote again by j.

  • Any other morphism of H is a composition of two morphisms in the two forms above.

The forgetful functor π:HK which maps (KJk) to K is a split opfibration. We can also construct the forgetful functor G:HJ which maps (KJk) to J. This functor is confinal: indeed, let J be an object of J.

  • The category J/G is nonempty: it always contains graphic file with name 10485_2021_9671_Figeq_HTML.jpg

  • The category J/G is connected: let (K,J,k:FJK) and (K,J,k:FJK) be objects of H, and let f:JJ and g:JJ be morphisms of J. We can then form the following zigzag in H: graphic file with name 10485_2021_9671_Figer_HTML.jpg

Suppose now the pointwise left Kan extension graphic file with name 10485_2021_9671_Figes_HTML.jpgexists. We claim that LanπDGLanFD. To see this, let K be an object of K. Since π is an opfibration, by Corollary 5.8 we can write Lanπ(DG)(K) up to isomorphism as the colimit of the following composite functor. graphic file with name 10485_2021_9671_Figet_HTML.jpgRecall now that H=F/-, so that π-1(K)H is isomorphic to F/K, and the following diagram commutes, graphic file with name 10485_2021_9671_Figeu_HTML.jpgwhere U is the forgetful functor mapping (J,k:FJK) to just J. Therefore we can rewrite Lanπ(DG)(K) up to isomorphism as the colimit of this simpler composition, graphic file with name 10485_2021_9671_Figev_HTML.jpgwhich gives the pointwise left Kan extension of D along F.

Lemma 5.12

Let C be a locally small category. The functor ImIm:Diag(Diag(C))PPC is essentially surjective on objects.

Recall that the following diagram commutes only up to natural isomorphism, graphic file with name 10485_2021_9671_Figew_HTML.jpgand that, by convention, we denoted by ImIm the top-right path.

Proof of Lemma 5.12

Since Im is essentially surjective, it suffices to prove that DiagIm is as well. In other words, we have to prove that every small diagram of small presheaves can be obtained from a diagram of diagrams (by taking the image presheaf of each subdiagram).

Let I be a small category, and let D:IPC be a diagram. Explicitly, for each object I of I we have a small presheaf DI:CopSet, and for each morphism of I we have a morphism of presheaves. We can now take the category of elements of each DI, as described in Sect. 3.1.1. For each I of I we have a discrete fibration πI:oDIC, and for each morphism i:IJ of I we have a commutative triangle as follows. graphic file with name 10485_2021_9671_Figex_HTML.jpgHowever, the categories oDI are large in general, and so they do not form the desired diagram of (small) diagrams yet. We then proceed as follows. Since each presheaf DI is small, for each I of I there exists (Remark 4.2) a small full subcategory SI of oDI, such that the inclusion functor is confinal. Denote by FI:SIC the composition (or restriction) graphic file with name 10485_2021_9671_Figey_HTML.jpgBy construction, Im(Fi)DI. The assignment ISI is not functorial in I: for each morphism i:IJ of I, we get the following commutative diagram. graphic file with name 10485_2021_9671_Figez_HTML.jpgHowever, in principle we cannot lift oDi to get a commutative square, that is, a priori the restriction of oDi to SI does not necessarily land in SJ. We now extend the SI, as follows.

Let I be an object of I, and consider the slice category I/I, whose objects are pairs (Hh) where H is an object of I, and h:HI is an arrow of I. This category is small, since its set of objects is given by

HOb(I)HomI(H,I),

which is a small union of small sets. Now define TI to be the full subcategory of oDI whose set of objects is given by graphic file with name 10485_2021_9671_Figfa_HTML.jpgThat is, an object of EI is in the form oDh(C,x) for some object (CC,xDH(C)) in SH and some morphism h:HI of I. The category TI is small, since its set of objects is a small union of small sets. Moreover, since we can pick h to be the identity II, the category SI is fully embedded into TI, i.e. we have a commutative diagram of inclusions graphic file with name 10485_2021_9671_Figfb_HTML.jpgwhich are all confinal by Lemma 3.8. In particular, if we denote by EI:TIC the composition (or restriction) graphic file with name 10485_2021_9671_Figfc_HTML.jpgwe have again that Im(EI)DI.

This time, the assignment ITI is functorial. To see this, let i:IJ be a morphism of I, and form the following commutative diagram. graphic file with name 10485_2021_9671_Figfd_HTML.jpgWe claim that the restriction of oDi to the subcategory TI lands in TJ. Any object of TI is of the form oDh(C,x) for some (Cx) in SH and some h:HI of I. We then have

graphic file with name 10485_2021_9671_Equ50_HTML.gif

which lies in TJ since ih:HJ belongs to the slice category I/J. We can then complete the diagram to a commutative diagram graphic file with name 10485_2021_9671_Figfe_HTML.jpgThe assignment ITI, iTi is a functor ICat, and the corresponding assignment I(TI,EI:EIC) gives a functor E:IDiag(C).

This functor E:IDiag(C) is the diagram of diagrams that we want. Indeed, Diag(Im)(E) is given by the postcomposition with the image graphic file with name 10485_2021_9671_Figff_HTML.jpgand we know that for each I of I, Im(EI)DI. To show that Diag(Im)(E)D, and hence conclude the proof, we have to show that the isomorphism Im(EI)DI is natural in I. So let i:IJ be a morphism of I. We have to show that the following diagram of PC commutes, graphic file with name 10485_2021_9671_Figfg_HTML.jpgwhere the isomorphisms are the ones given in Proposition 3.4. Diagram (5.2) commutes since the vertices are the colimits of the functors ηEi, ηEJ, ηπI and ηπJ obtained by the following commutative diagram of CAT, graphic file with name 10485_2021_9671_Figfh_HTML.jpgand the induced maps between their colimits are the arrows of (5.2). Since the square in (5.3) commutes, (5.2) commutes too (by uniqueness of the induced map).

We are now finally ready to prove the theorem.

Proof of Theorem 5.10

Since Im is a morphism of monads, the following diagram commutes up to natural isomorphism. graphic file with name 10485_2021_9671_Figfi_HTML.jpgIn particular, this can be interpreted as “Im, as a morphism of monads, preserves partial evaluations”. With this and the previous lemmas in mind, let’s proceed with the proof.

  • (a)(b): Consider D:JC and F:JK as in (a). By Lemma 5.11 there exists a diagram D:HC together with a confinal functor G:HJ and an opfibration π:HK with LanπDGLanFD. Since G is confinal, by Proposition 3.9 we have that Im(DG)ImDP. Now, by Theorem 5.5, there exists a diagram of diagrams EDiag(Diag(C)) such that μ(E)DG and Diagcolim(E)LanπDG. Chasing diagram (5.4), we see that ImIm(E) is the desired partial evaluation between Im(DG)ImDP and Im(LanπDG)Im(LanFD)Q.

  • (b)(a): Let Φ be a partial evaluation from P to Q. By Lemma 5.12, ImIm is essentially surjective on objects, so that there exists EDiag(Diag(C)) with ImIm(E)Φ. Chasing diagram (5.4), let D=μ(E) and D=Diagcolim(E), so that ImDP and ImDQ. By Theorem 5.5, D can be written as the pointwise left Kan extension of D along a split opfibration.

Comparison with Measure Theory

Coends, in particular when they denote weighted colimits, are often considered similar to integrals in analysis, which is why they are normally denoted by an integral sign. The correspondence can roughly be summarized in terms of monads, by saying that the monad P on CAT behaves similarly to the Giry monad P on the category of (say) measurable spaces [11], and to other probability and measure monads. In particular,

  • Small presheaves on a category are similar to measures on a measurable space;

  • Cocomplete categories (categories where one can take colimits) are similar to algebras over probability monads (spaces where one can take integrals or expectation values, such as the real line);

  • In a (sufficiently) cocomplete category C, given a small presheaf PPC the following coend
    XCPXX
    is similar to the following integral of a measure p on, say, real numbers,
    Rxdp(x);
  • As Kleisli morphisms, (small) profunctors are similar to Markov kernels.

An introduction to probability monads, for the interested reader, can be found in [25, Chapter 1] as well as in [9, Section 6].

We now wish to emphasize that Theorem 5.10 adds a further analogy between integrals and coends: just like Kan extensions can be thought of as “partial colimits”, conditional expectations can be thought of as “partial expectations”. In particular, we compare Theorem 5.10 to a similar theorem for a probability monad on the category of metric spaces, the Kantorovich monad (see [8] as well as the more introductory material in [9, Section 6]).

The following statement is known ([9, Theorem 6.13] or [25, Theorem 4.2.14]).

Theorem 5.13

Let (Ae) be a Banach space (an algebra of the Kantorovich monad). Let p,qPA be Radon probability measures on A of finite first moment. The following conditions are equivalent:

  • There exist probability spaces (Ω,μ) and (Ω,μ), random variables f:ΩA and g:ΩA with image measures p and q respectively, and a measure-preserving map m:ΩΩ such that g is the conditional expectation of f along (the pull-back sigma-algebra induced by) m – as in the following not necessarily commutative diagram: graphic file with name 10485_2021_9671_Figfj_HTML.jpg

  • There is a partial evaluation kPPA from p to q (for the Kantorovich monad). graphic file with name 10485_2021_9671_Figfk_HTML.jpg

The similarity to Theorem 5.10 is evident. One could say intuitively that if coends are analogous to integrals, Kan extensions are analogous to conditional expectations. The former can be interpreted as partial (weighted) colimits, and the latter as partial (weighted) averages. Notice also that the diagram (5.5) does not commute, but the conditional expectation map, just like a Kan extension, can be thought of as the one “making the diagram as close as possible to commuting”.

It has to be noted that, while we proved Theorem 5.10 by invoking an analogous (but simpler and stronger) statement for the monad of diagrams, Theorem 5.13 was proven directly, using measure-theoretic methods. In the statement of the Theorem 5.13 it seems that random variables play somewhat the role that diagrams play in Theorem 5.10 (or at least, of diagrams equipped with weights, analogous to the measures on Ω and Ω). However, currently it is not clear whether random variables, or related structures, may form a monad analogous to Diag, and on which category such a structure could be found.

Acknowledgements

The first author would like to thank Bartosz Milewski and David Jaz Myers for the insight on coends and weighted limits, Joachim Kock and Emily Riehl for enlightenment on some of the higher-dimensional aspects, Tobias Fritz for further helpful insight, and Robert Paré and Evan Patterson for the helpful pointers. The first author would also like to thank Sean McKenna, as well as David Spivak and MIT as a whole, for all the support during the 2020 pandemic, and Nathanael Arkor for the development of the app Quiver, which proved to be very helpful in writing some of the diagrams in this document. The first author was affiliated to the Massachusetts Institute of Technology (MIT) for most of the time of writing. This research was partially funded by the Fields Institute (Canada) and by AFOSR grants FA9550-19-1-0113 and FA9550-17-1-0058 (U.S.A.). The second author acknowledges partial financial support by the Natural Science and Engineering Council of Canada under the Discovery Grants Program (Grant No. 501260).

Open Access

This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Appendix

A. Some 2-Dimensional Monad Theory

Here we recall some definitions of 2-dimensional monad theory, and spell out explicitly the definitions that we use in the rest of this work. We also prove a result, Theorem A.7, which extends to pseudomonads the “restriction of scalars” construction. We follow the definitions given in [20, Section 2.12]. We refer the interested reader to that article, as well as [17] and [22], for further details.

A.1. Pseudomonads and Their Morphisms

Definition A.1

A pseudomonad on a strict 2-category K consists of

  • A pseudofunctor T:KK, together with

  • Pseudonatural transformations η:idKT and μ:TTT, which we call unit and multiplication respectively, and

  • Invertible modifications graphic file with name 10485_2021_9671_Figfl_HTML.jpgwhich we call (left and right) unitors and associator, respectively, such that

  • the following two coherence laws are satisfied, graphic file with name 10485_2021_9671_Figfm_HTML.jpg which we call the unit and pentagon4 condition, respectively, where T^=υ-1Tκ, Ta^=κ-1Taκ, and κ and υ are the compositor and unitor, respectively, of the pseudofunctor T. The arrow μμ is the invertible 3-cell filling the pseudonaturality square of μ along the morphism μ itself, of (2-cell) components graphic file with name 10485_2021_9671_Figfn_HTML.jpgfor each object X of K.

For some readers it may be easier to interpret the coherence conditions above if we express them as 2-dimensional diagrams. In particular, the unit and pentagon conditions can be expressed as the following commutative cubes, respectively, graphic file with name 10485_2021_9671_Figfo_HTML.jpgwhere the unfilled 3-cells are identities, and T^ denotes the following composite cell, graphic file with name 10485_2021_9671_Figfp_HTML.jpgand graphic file with name 10485_2021_9671_Figfq_HTML.jpgwhere Ta^ denotes the following composite cell. graphic file with name 10485_2021_9671_Figfr_HTML.jpg

We now give the definition of a morphism of pseudomonads (over the same category).

Definition A.2

Let K be a strict 2-category. Let (T,μ,η,,r,a) and (T,μ,η,,r,a) be pseudomonads on K. A (pseudo-) morphism of pseudomonads consists of

  • A pseudonatural transformation λ:TT, together with

  • Invertible modifications as follows, graphic file with name 10485_2021_9671_Figfs_HTML.jpgwhere we recall that λλ, as in Section 4.6, is shorthand for the following composition, graphic file with name 10485_2021_9671_Figft_HTML.jpgsuch that

  • the following diagrams commute, graphic file with name 10485_2021_9671_Figfu_HTML.jpgwhich we call left unitality, right unitality, and associativity conditions, respectively, where
    • uTη^ denotes (λTηλ)(uTλ),
    • Tuλ^ denotes λT(κ-1Tu)λη-1,
    • mTμ^ denotes (λTμλ)(mTTTλ), and
    • Tmλ^ denotes (λT(κ-1Tmκκ))(λμ-1TλTTTλ).

We can write the coherence conditions 2-dimensionally, as follows. The unitality conditions are the following two commutative prisms, graphic file with name 10485_2021_9671_Figfv_HTML.jpgwhere uTη^ denotes the following composite, graphic file with name 10485_2021_9671_Figfw_HTML.jpgand graphic file with name 10485_2021_9671_Figfx_HTML.jpgwhere Tuλ^ denotes the following composite and the unfilled 3-cells are intended to be identities; graphic file with name 10485_2021_9671_Figfy_HTML.jpgassociativity is the following commutative cube, graphic file with name 10485_2021_9671_Figfz_HTML.jpgwhere mTμ^ and Tmλ^ denote the following compositions, respectively, graphic file with name 10485_2021_9671_Figga_HTML.jpgand graphic file with name 10485_2021_9671_Figgb_HTML.jpgFinally, 2-cells of pseudomonads are defined as follows.

Definition A.3

Let K be a strict 2-category. Let (T,μ,η,,r,a) and (T,μ,η,,r,a) be pseudomonads on K. Let (λ,u,m) and (ξ,v,n) be (pseudo)morphisms TT. A 2-cell of monads is a modification graphic file with name 10485_2021_9671_Figgc_HTML.jpgsuch that the following diagrams commute, graphic file with name 10485_2021_9671_Figgd_HTML.jpgwhich we call “unit” and “multiplication” conditions, respectively.

As above, it may be helpful to write the conditions 2-dimensionally. The unit condition forms the following commutative “cone”, graphic file with name 10485_2021_9671_Figge_HTML.jpgand the multiplication condition forms the following commutative “cylinder”, graphic file with name 10485_2021_9671_Figgf_HTML.jpgIn line with our usual convention for horizontal composition, tt denotes the following 3-cell. graphic file with name 10485_2021_9671_Figgg_HTML.jpg

A.2. Pseudoalgebras and Their Morphisms

Definition A.4

Let (T,μ,η,,r,a) be a pseudomonad on K. A pseudoalgebra over the pseudomonad T consists of

  • an object A of K, together with

  • a morphism e:TAA, and

  • invertible 2-cells graphic file with name 10485_2021_9671_Figgh_HTML.jpgsuch that

  • the following diagrams commute, graphic file with name 10485_2021_9671_Figgi_HTML.jpggraphic file with name 10485_2021_9671_Figgj_HTML.jpgwhich we call unit and multiplication conditions, respectively, where Tι^=υ-1Tικ, and Tγ^=κ-1Tγκ.

We call the pseudoalgebra normal if the 2-cell ι is the identity.

We can write the coherence diagrams 2-dimensionally as well, as follows. Here is the unit condition, graphic file with name 10485_2021_9671_Figgk_HTML.jpgwhere the unfilled 2-cells are identities, and Tι^ denotes the following composite cell, graphic file with name 10485_2021_9671_Figgl_HTML.jpgand here is the multiplication condition, graphic file with name 10485_2021_9671_Figgm_HTML.jpgwhere Tγ^ denotes the following composite cell. graphic file with name 10485_2021_9671_Figgn_HTML.jpg

Definition A.5

Let (A,eA,ιA,γA) and (B,eB,ιB,γB) be pseudoalgebras over the pseudomonad (T,μ,η,,r,a). A (strong) morphism of pseudoalgebras consists of

  • a morphism f:AB of K, together with

  • an invertible 2-cell graphic file with name 10485_2021_9671_Figgo_HTML.jpgsuch that

  • the following diagrams commute, graphic file with name 10485_2021_9671_Figgp_HTML.jpgwhich we call the unit and multiplication condition, respectively, where Tϕ^=κ-1Tϕκ.

As above, it may be helpful to draw the coherence conditions in a 2-dimensional way. The unit condition is the following commutative prism, graphic file with name 10485_2021_9671_Figgq_HTML.jpgwhere again the unfilled 2-cell is an identity, and the multiplication condition is the following commutative cube, graphic file with name 10485_2021_9671_Figgr_HTML.jpgwhere Tϕ^ denotes the following composite cell. graphic file with name 10485_2021_9671_Figgs_HTML.jpg

Definition A.6

Let (A,eA,ιA,γA) and (B,eB,ιB,γB) be pseudoalgebras over the pseudomonad (T,μ,η,,r,a). Let (f,ϕ) and (g,χ) be strong morphisms of pseudoalgebras. A 2-cell of pseudoalgebras is a 2-cell graphic file with name 10485_2021_9671_Figgt_HTML.jpgsuch that the following diagram commutes. graphic file with name 10485_2021_9671_Figgu_HTML.jpg

If we draw the condition in a 2-dimensional way, we get the following commutative “cylinder”: graphic file with name 10485_2021_9671_Figgv_HTML.jpg

It is immediate from the definitions to check that, given any object X of K, the object TX is canonically a pseudoalgebra, with structure morphism μX:TTXTX and 2-cells X:μXηTXidTX and aX:μXTμXμXμTX. We call this a “free algebra”, analogously to the 1-dimensional case. Moreover, by naturality of μ, for any morphism f:XY of K, the morphism Tf:TXTY gives a morphism of pseudoalgebras. Similarly, for every f,g:XY and α:fg, the 2-cell Tα:TfTg is automatically a 2-cell of algebras.

A.3. Restriction of Scalars for Pseudomonads

It is very well known that in the one-dimensional context, a morphism of monads induces a pullback functor between the algebras, sometimes named “restriction of scalars” after its instance in ring theory [1, Theorem 3 in Section 3.6]. A similar phenomenon occurs in two dimensions, as follows. We use this statement in Sect. 4.6.1, see there also for further context.

Theorem A.7

Let K be a (strict) 2-category. Let (T,μ,η,,r,a) and (T,μ,η,,r,a) be pseudomonads on K, and let (λ,u,m) be a pseudomorphism of monads from T to T. Each T-pseudoalgebra (A,e,ι,γ), defines canonically a T-pseudoalgebra structure on A with the following structure 1- and 2-cells. graphic file with name 10485_2021_9671_Figgw_HTML.jpgMoreover, this construction defines a 2-functor between the categories of pseudoalgebras λ:KTKT.

In analogy with the 1-dimensional case, we call λ the restriction of scalar functor.

Proof

The unit diagram for (A,e,ι,γ), obtained by plugging (A.1) into the unit diagram of Definition A.4, can be decomposed in the following way, where the whiskerings have been suppressed for reasons of space. graphic file with name 10485_2021_9671_Figgx_HTML.jpgNow,

  • the region on the right commutes by the right unit condition of (λ,u,m), as in Definition A.2 (recall that all the arrows of the diagram are invertible);

  • the triangle in the center bottom is the right unitality condition of the algebra (A,e,ι,γ), as in Definition A.4;

  • the region on the bottom left commutes by pseudonaturality of λ;

  • finally, the remaining parallelograms commute by the interchange law.

The multiplication diagram for (A,e,ι,γ), analogously obtained by plugging (A.1) into the multiplication diagram of Definition A.4, can be decomposed as follows, where again the whiskerings have been suppressed, and the hat denotes the suitable application of the compositors. graphic file with name 10485_2021_9671_Figgy_HTML.jpgNow,

  • the bottom right region commutes by the associativity condition of (λ,u,m), as in Definition A.2;

  • the top right hexagon commutes by pseudonaturality of λ;

  • the center hexagon commutes by the multiplication condition of the algebra (A,e,ι,γ), as in Definition A.4;

  • the top left hexagon commutes by the modification property for m;

  • all the remaining parallelograms commute by the interchange law.

Therefore, (A,e,ι,γ) is a T-pseudoalgebra.

For functoriality, consider now T-pseudoalgebras (A,eA,ιA,γA) and (B,eB,ιB,γB), and a morphism of T-pseudoalgebras (f,ϕ) from A to B. The composite 2-cell graphic file with name 10485_2021_9671_Figgz_HTML.jpgmakes f a morphism between the T-pseudoalgebra structures defined above. Indeed, the unit condition, obtained by plugging (A.1) into Definition A.5, can be decomposed as follows, again omitting the whiskering. graphic file with name 10485_2021_9671_Figha_HTML.jpgNow

  • the top region commutes by the modification property for u;

  • the bottom left rectangle commutes by the unit condition for (f,ϕ) as in Definition A.5;

  • the bottom right rectangle commutes by the interchange law.

Similarly, the multiplication condition can be decomposed as follows, where again the whiskerings are omitted. graphic file with name 10485_2021_9671_Fighb_HTML.jpgNow,

  • the region on the far left commutes by the modification property for m;

  • the hexagon on the bottom commutes by the multiplication condition for (f,ϕ) as in Definition A.5;

  • the hexagon on the top right commutes by pseudonaturality of λ;

  • all the remaining parallelograms commute by the interchange law.

This makes (f,ϕλ) a pseudomorphism of T-pseudoalgebras.

Finally, to prove 2-functoriality, let (f,ϕ) and (g,χ) be pseudomorphisms of T-pseudoalgebras AB, and let α:fg be a 2-cell of T-pseudoalgebras. We have that α is canonically also a 2-cell of T-pseudoalgebras, since the relevant diagram can be decomposed as follows, graphic file with name 10485_2021_9671_Fighc_HTML.jpgagain omitting the whiskerings, and now

  • the left square commutes by pseudonaturality of λ;

  • the right square commutes since α is a 2-cell of pseudoalgebras, as in Definition A.6.

This action on pseudoalgebras, their morphisms and their 2-cells defines then a 2-functor from the 2-category of T-pseudoalgebras to the 2-category of T-pseudoalgebras (notice the direction).

We encourage the readers more familiar with 2-dimensional diagrams to rewrite the proof using 2-cells and, for clarity, we suggest to dedicate one direction in each diagram to the transformation λ.

Footnotes

1

By “strict” here we mean “we give an isomorphism of hom-categories, not just an equivalence”. The colimit is still oplax, not strict.

2

These are known in the literature also as “cofinal”, “coinitial” and “final”, terms which may cause confusion. The “co” in “cofinal” (e.g. in “cofinal subnet”) does not denote duality, but rather, follows the Latin particle “cum” which means “with, together”. As such, we feel that “confinal” is both closer to the original etymology, and less prone to cause confusion. The term “confinal” (or in German, “konfinal”) was introduced by Hausdorff for the case of ordered sets [13, Section IV.4, page 86], and it has been in use at least until [10, Definition 2.12].

3

Often a stronger statement is called “co-Yoneda lemma”, see Proposition 4.14.

4

This name comes from the theory of monoidal categories, where the diagram has the shape of a pentagon (in that case, the analogue of μ is strictly natural, and so the arrow corresponding to μμ is an identity).

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Barr, M., Wells, C.: Toposes, triples, and theories. Grundlehren der mathematischen Wissenschaften 278. Springer, Berlin (1983)
  • 2.Borceux F. Handbook of Categorical Algebra I. Cambridge: Cambridge University Press; 1994. [Google Scholar]
  • 3.Bourbaki, N.: Algebra I. Hermann (1974)
  • 4.Chachólski, W., Scherer, J.: Homotopy theory of diagrams. Memoirs Am. Math. Soc., 736 (2002)
  • 5.Constantin, C., Fritz, T., Perrone, P., Shapiro, B.: Partial evaluations and the compositional structure of the bar construction. arXiv:2009.07302. (2020)
  • 6.Day BJ, Lack S. Limits of small functors. J. Pure Appl. Algebra. 2007;210:651–663. doi: 10.1016/j.jpaa.2006.10.019. [DOI] [Google Scholar]
  • 7.Eilenberg S, MacLane S. General theory of natural equivalences. Trans. Am. Math. Soc. 1945;58.2:231–294. doi: 10.1090/S0002-9947-1945-0013131-6. [DOI] [Google Scholar]
  • 8.Fritz T, Perrone P. A probability monad as the colimit of spaces of finite samples. Theory Appl. Categ. 2019;34.7:170–220. [Google Scholar]
  • 9.Fritz, T., Perrone, P.: Monads, partial evaluations, and rewriting. In: Proceedings of MFPS 36, ENTCS (2020)
  • 10.Gabriel, P., Ulmer, F.: Lokal präsentierbare Kategorien. Lecture Notes in Mathematics 221. Springer, Berlin (1971)
  • 11.Giry, M.: A categorical approach to probability theory. In: Lecture Notes in Mathematics 915: Categorical Aspects of Topology and Analysis. Springer, Berlin, pp. 68–85 (1982). 10.1007/BFb0092872
  • 12.Guitart R. Sur le foncteur diagramme. Cahiers de topologie et géométrie différentielle catégoriques. 1973;14.2:181–182. [Google Scholar]
  • 13.Hausdorff F. Grundzüge der Mengenlehre. Leipzig: Veit & comp; 1914. [Google Scholar]
  • 14.Isbell JR. Adequate subcategories. Illinois J. Math. 1960;4:1–33. doi: 10.1215/ijm/1255456274. [DOI] [Google Scholar]
  • 15.Kelly, G. M.: Basic concepts in enriched category theory. Available at Reprints in TAC, No. 10, 2005. Cambridge University Press, Cambridge (1982)
  • 16.Kock, A.: Limit monads in categories. Ph.D thesis. University of Chicago (1967)
  • 17.Lack S. A coherent approach to pseudomonads. Adv. Math. 2000;52:179–202. doi: 10.1006/aima.1999.1881. [DOI] [Google Scholar]
  • 18.Lambek, J.: Completions of Categories. Lecture Notes in Mathematics 24. Springer, Berlin (1966)
  • 19.Loregian, F.: Coend calculus. arXiv:1501.02503. (2015)
  • 20.Nunes FL. Pseudo–Kan extensions and descent theory. Theory Appl. Categ. 2018;33(15):390–444. [Google Scholar]
  • 21.Lane SM. Categories for the Working Mathematician. 2. Berlin: Springer; 1998. [Google Scholar]
  • 22.Marmolejo F. Doctrines whose structures forms a fully faithful adjoint string. Theory Appl. Categories. 1997;3.2:22–42. [Google Scholar]
  • 23.Paré R. Connected components and colimits. J. Pure Appl. Algebr. 1973;3:21–42. doi: 10.1016/0022-4049(73)90003-0. [DOI] [Google Scholar]
  • 24.Paré, R.: On absolute colimits. J. Algebra, 19 (11971), 80–95
  • 25.Perrone, P.: Categorical Probability and Stochastic Dominance in Metric Spaces. Available at http://paoloperrone.org/phdthesis.pdf. Ph.D thesis. University of Leipzig (2018)
  • 26.Peschke, G., Tholen, W.: Diagrams, Fibrations, and the Decomposition of Colimits. arXiv:2006.10890 (2020)
  • 27.Riehl E. Categorical Homotopy Theory. Cambridge: Cambridge University Press; 2014. [Google Scholar]
  • 28.Street, R., Walters, R. F. C.: The comprehensive factorization of a functor. Bull. Am. Math. Soc. 79(5) (1973)
  • 29.Weil A. Adeles and algebraic groups. Progress Math. 1982;23:1–3. doi: 10.1007/BF01583776. [DOI] [Google Scholar]

Articles from Applied Categorical Structures are provided here courtesy of Springer

RESOURCES