nLab geometry of physics -- perturbative quantum field theory

Contents

These notes (pdf, 323 pages) mean to give an expository but rigorous introduction to the basic concepts of relativistic perturbative quantum field theories, specifically those that arise as the perturbative quantization of Lagrangian field theories – such as quantum electrodynamics, quantum chromodynamics, and perturbative quantum gravity appearing in the standard model of particle physics.

\,

This is one chapter of geometry of physics.

Previous chapters: smooth sets, supergeometry.

\,

Contents

\,

For broad introduction of the idea of the topic of perturbative quantum field theory see there and see

Here, first we consider classical field theory (or rather pre-quantum field theory), complete with BV-BRST formalism; then its deformation quantization via causal perturbation theory to perturbative quantum field theory. This mathematically rigorous (i.e. clear and precise) formulation of the traditional informal lore has come to be known as perturbative algebraic quantum field theory.

We aim to give a fully local discussion, where all structures arise on the “jet bundle over the field bundle” (introduced below) and “transgress” from there to the spaces of field histories over spacetime (discussed further below). This “Higher Prequantum Geometry” streamlines traditional constructions and serves the conceptualization in the theory. This is joint work with Igor Khavkine.

In full beauty these concepts are extremely general and powerful; but the aim here is to give a first precise idea of the subject, not a fully general account. Therefore we concentrate on the special case where spacetime is Minkowski spacetime (def. below), where the field bundle (def. below) is an ordinary trivial vector bundle (example below) and hence the Lagrangian density (def. below) is globally defined. Similarly, when considering gauge theory we consider just the special case that the gauge parameter-bundle is a trivial vector bundle and we concentrate on the case that the gauge symmetries are “closed irreducible” (def. below). But we aim to organize all concepts such that the structure of their generalization to curved spacetime and non-trivial field bundles is immediate.

This comparatively simple setup already subsumes what is considered in traditional texts on the subject; it captures the established perturbative BRST-BV quantization of gauge fields coupled to fermions on curved spacetimes – which is the state of the art. Further generalization, necessary for the discussion of global topological effects, such as instanton configurations of gauge fields, will be discussed elsewhere (see at homotopical algebraic quantum field theory).

Alongside the theory we develop the concrete examples of the real scalar field, the electromagnetic field and the Dirac field; eventually combining these to a disussion of quantum electrodynamics.

running examples

fieldfield bundleLagrangian densityequation of motion
real scalar fieldexpl. expl. expl.
Dirac fieldexpl. expl. expl.
electromagnetic fieldexpl. expl. expl.
Yang-Mills fieldexpl. ,
expl.
expl. expl.
B-fieldexpl. expl expl.

\,

fieldPoisson bracketcausal propagatorWightman propagatorFeynman propagator
real scalar fieldexpl. ,
expl.
prop. def. def.
Dirac fieldexpl. ,
expl.
prop. def. def.
electromagnetic fieldprop. prop.

\,

fieldgauge symmetrylocal BRST complexgauge fixing
electromagnetic fieldexpl. expl. expl.
Yang-Mills fieldexpl. expl.
B-fieldexpl. expl.

\,

interacting field theoryinteraction Lagrangian densityinteraction Wick algebra-element
phi^n theoryexp. expl.
quantum electrodynamicsexpl. expl.

\,

References

Pointers to the literature are given in each chapter, alongside the text. The following is a selection of these references.

The discussion of spinors in chapter 2. Spacetime follows Baez-Huerta 09.

The functorial geometry of supergeometric spaces of field histories in 3. Fields follows Schreiber 13, further developed by Giotopoulos & Sati 2023.

For the jet bundle-formulation of variational calculus of Lagrangian field theory in 4. Field variations, and 5. Lagrangians we follow Anderson 89 and Olver 86, further developed by Giotopoulos & Sati 2023; for 6. Symmetries augmented by Fiorenza-Rogers-Schreiber 13b.

The identification of polynomial observables with distributions in 7. Observables was observed by Paugam 12.

The discussion of the Peierls-Poisson bracket in 8. Phase space is based on Khavkine 14.

The derivation of wave front sets of propagators in 9. Propagators takes clues from Radzikowski 96 and uses results from Gelfand-Shilov 66.

For the general idea of BV-BRST formalism a good review is Henneaux 90.

The Lie algebroid-perspective on BRST complexes developed in chapter 10. Gauge symmetries, may be compared to Barnich 10.

For the local BV-BRST theory laid out in chapter 11. Reduced phase space we are following Barnich-Brandt-Henneaux 00.

For the BV-gauge fixing developed in 12. Gauge fixing we take clues from Fredenhagen-Rejzner 11a.

For the free quantum BV-operators in 13. Free quantum fields and the interacting quantum master equation in 15. Interacting quantum fields we are following Fredenhagen-Rejzner 11b, Rejzner 11, which in turn is taking clues from Hollands 07.

The discussion of quantization in 13. Quantization takes clues from Hawkins 04, Collini 16 and spells out the derivation of the Moyal star product from geometric quantization of symplectic groupoids due to Gracia-Bondia & Varilly 94.

The perspective on the Wick algebra in 14. Free quantum fields goes back to Dito 90 and was revived for pAQFT in Dütsch-Fredenhagen 00. The proof of the folklore result that the perturbative Hadamard vacuum state on the Wick algebra is indeed a state is cited from Dütsch 18.

The discussion of causal perturbation theory in 15. Interacting quantum fields follows the original Epstein-Glaser 73. The relevance here of the star product induced by the Feynman propagator was highlighted in Fredenhagen-Rejzner 12. The proof that the interacting field algebra of observables defined by Bogoliubov's formula is a causally local net in the sense of the Haag-Kastler axioms is that of Brunetti-Fredenhagen 00.

Our derivation of Feynman diagrammatics follows Keller 10, chapter IV, our derivation of the quantum master equation follows Rejzner 11, section 5.1.3, and our discussion of Ward identities is informed by Dütsch 18, chapter 4.

In chapter 16. Renormalization we take from Brunetti-Fredenhagen 00 the perspective of Epstein-Glaser renormalization via extension of distributions and from Brunetti-Dütsch-Fredenhagen 09 and Dütsch 10 the rigorous formulation of Gell-Mann Low renormalization group flow, UV-regularization, effective quantum field theory and Polchinski's flow equation.

\,

Acknowledgement

These notes profited greatly from discussions with Igor Khavkine and Michael Dütsch.

Thanks also to Marco Benini, Klaus Fredenhagen, Arnold Neumaier and Kasia Rejzner for helpful discussion.

\,

\,

Geometry

The geometry of physics is differential geometry. This is the flavor of geometry which is modeled on Cartesian spaces n\mathbb{R}^n with smooth functions between them. Here we briefly review the basics of differential geometry on Cartesian spaces.

In principle the only background assumed of the reader here is

  1. usual naive set theory (e.g. Lawvere-Rosebrugh 03);

  2. the concept of the continuum: the real line \mathbb{R}, the plane 2\mathbb{R}^2, etc.

  3. the concepts of differentiation and integration of functions on such Cartesian spaces;

hence essentially the content of multi-variable differential calculus.

We now discuss:

As we uncover Lagrangian field theory further below, we discover ever more general concepts of “space” in differential geometry, such as smooth manifolds, diffeological spaces, infinitesimal neighbourhoods, supermanifolds, Lie algebroids and super Lie ∞-algebroids. We introduce these incrementally as we go along:

more general spaces in differential geometry introduced further below

higher differential geometry
differential geometrysmooth manifolds
(def. )
\hookrightarrowdiffeological spaces
(def. )
\hookrightarrowsmooth sets
(def. )
\hookrightarrowformal smooth sets
(def. )
\hookrightarrowsuper formal smooth sets
(def. )
\hookrightarrowsuper formal smooth ∞-groupoids
(not needed in fully perturbative QFT)
infinitesimal geometry,
Lie theory
infinitesimally thickened points
(def. )
superpoints
(def. )
Lie ∞-algebroids
(def. )
higher Lie theory
needed in QFT for:spacetime (def. )space of field histories
(def. )
Cauchy surface (def. ),
perturbation theory (def. )
Dirac field (expl. ), Pauli exclusion principleinfinitesimal gauge symmetry/BRST complex
(expl. )

\,

Abstract coordinate systems

What characterizes differential geometry is that it models geometry on the continuum, namely the real line \mathbb{R}, together with its Cartesian products n\mathbb{R}^n, regarded with its canonical smooth structure (def. below). We may think of these Cartesian spaces n\mathbb{R}^n as the “abstract coordinate systems” and of the smooth functions between them as the “abstract coordinate transformations”.

We will eventually consider below much more general “smooth spacesXX than just the Cartesian spaces n\mathbb{R}^n; but all of them are going to be understood by “laying out abstract coordinate systems” inside them, in the general sense of having smooth functions f: nXf \colon \mathbb{R}^n \to X mapping a Cartesian space smoothly into them. All structure on generalized smooth spaces XX is thereby reduced to compatible systems of structures on just Cartesian spaces, one for each smooth “probe” f: nXf\colon \mathbb{R}^n \to X. This is called “functorial geometry”.

Notice that the popular concept of a smooth manifold (def./prop. below) is essentially that of a smooth space which locally looks just like a Cartesian space, in that there exist sufficiently many f: nXf \colon \mathbb{R}^n \to X which are (open) isomorphisms onto their images. Historically it was a long process to arrive at the insight that it is wrong to fix such local coordinate identifications ff, or to have any structure depend on such a choice. But it is useful to go one step further:

In functorial geometry we do not even focus attention on those f: nXf \colon \mathbb{R}^n \to X that are isomorphisms onto their image, but consider all “probes” of XX by “abstract coordinate systems”. This makes differential geometry both simpler as well as more powerful. The analogous insight for algebraic geometry is due to Grothendieck 65; it was transported to differential geometry by Lawvere 67.

This allows to combine the best of two superficially disjoint worlds: On the one hand we may reduce all constructions and computations to coordinates, the way traditionally done in the physics literature; on the other hand we have full conceptual control over the coordinate-free generalized spaces analyzed thereby. What makes this work is that all coordinate-constructions are functorially considered over all abstract coordinate systems.

\,

Definition

(Cartesian spaces and smooth functions between them)

For nn \in \mathbb{N} we say that the set n\mathbb{R}^n of n-tuples of real numbers is a Cartesian space. This comes with the canonical coordinate functions

x k: n x^k \;\colon\; \mathbb{R}^n \longrightarrow \mathbb{R}

which send an n-tuple of real numbers to the kkth element in the tuple, for k{1,,n}k \in \{1, \cdots, n\}.

For

f: n n f \;\colon\; \mathbb{R}^{n} \longrightarrow \mathbb{R}^{n'}

any function between Cartesian spaces, we may ask whether its partial derivative along the kkth coordinate exists, denoted

fx k: n n. \frac{\partial f}{\partial x^k} \;\colon\; \mathbb{R}^{n} \longrightarrow \mathbb{R}^{n'} \,.

If this exists, we may in turn ask that the partial derivative of the partial derivative exists

2fx k 1x k 2x k 2fx k 1 \frac{\partial^2 f}{\partial x^{k_1} \partial x^{k_2}} \coloneqq \frac{\partial}{\partial x^{k_2}} \frac{\partial f}{\partial x^{k_1}}

and so on.

A general higher partial derivative obtained this way is, if it exists, indexed by an n-tuple of natural numbers α n\alpha \in \mathbb{N}^n and denoted

(1) α |α|f α 1x 1 α 2x 2 α nx n, \partial_\alpha \;\coloneqq\; \frac{ \partial^{\vert \alpha \vert} f }{ \partial^{\alpha_1} x^1 \partial^{\alpha_2} x^2 \cdots \partial^{\alpha_n} x^n } \,,

where |α|ni=1α i{\vert \alpha\vert} \coloneqq \underoverset{n}{i = 1}{\sum} \alpha_i is the total order of the partial derivative.

If all partial derivative to all orders α n\alpha \in \mathbb{N}^n of a function f: n nf \colon \mathbb{R}^n \to \mathbb{R}^{n'} exist, then ff is called a smooth function.

Of course the composition gfg \circ f of two smooth functions is again a smooth function.

n 2 f g n 1 gf n 3. \array{ && \mathbb{R}^{n_2} \\ & {}^{\mathllap{f}}\nearrow && \searrow^{\mathrlap{g}} \\ \mathbb{R}^{n_1} && \underset{g \circ f}{\longrightarrow} && \mathbb{R}^{n_3} } \,.

The inclined reader may notice that this means that Cartesian spaces with smooth functions between them constitute a category (“CartSp”); but the reader not so inclined may ignore this.

For the following it is useful to think of each Cartesian space as an abstract coordinate system. We will be dealing with various generalized smooth spaces (see the table below), but they will all be characterized by a prescription for how to smoothly map abstract coordinate systems into them.

Example

(coordinate functions are smooth functions)

Given a Cartesian space n\mathbb{R}^n, then all its coordinate functions (def. )

x k: n x^k \;\colon\; \mathbb{R}^n \longrightarrow \mathbb{R}

are smooth functions (def. ).

For

f: n 1 n 2 f \colon \mathbb{R}^{n_1} \longrightarrow \mathbb{R}^{n_2}

any smooth function and a{1,2,,n 2}a \in \{1, 2, \cdots, n_2\} write

f ax af: n 1f n 2x a f^a \coloneqq x^a \circ f \;\colon\; \mathbb{R}^{n_1} \overset{f}{\longrightarrow} \mathbb{R}^{n_2} \overset{x^a}{\longrightarrow} \mathbb{R}

. for its composition with this coordinate function.

Example

(algebra of smooth functions on Cartesian spaces)

For each nn \in \mathbb{N}, the set

C ( n)Hom CartSp( n,) C^\infty(\mathbb{R}^n) \;\coloneqq\; Hom_{CartSp}(\mathbb{R}^n, \mathbb{R})

of real number-valued smooth functions f: nf \colon \mathbb{R}^n \to \mathbb{R} on the nn-dimensional Cartesian space (def. ) becomes a commutative associative algebra over the ring of real numbers by pointwise addition and multiplication in \mathbb{R}: for f,gC ( n)f,g \in C^\infty(\mathbb{R}^n) and x nx \in \mathbb{R}^n

  1. (f+g)(x)f(x)+g(x)(f + g)(x) \coloneqq f(x) + g(x)

  2. (fg)(x)f(x)g(x)(f \cdot g)(x) \coloneqq f(x) \cdot g(x).

The inclusion

constC ( n) \mathbb{R} \overset{const}{\hookrightarrow} C^\infty(\mathbb{R}^n)

is given by the constant functions.

We call this the real algebra of smooth functions on n\mathbb{R}^n:

C ( n)Alg. C^\infty(\mathbb{R}^n) \;\in\; \mathbb{R} Alg \,.

If

f: n 1 n 2 f \;\colon\; \mathbb{R}^{n_1} \longrightarrow \mathbb{R}^{n_2}

is any smooth function (def. ) then pre-composition with ff (“pullback of functions”)

C ( n 2) f * C ( n 1) g f *ggf \array{ C^\infty(\mathbb{R}^{n_2}) &\overset{f^\ast}{\longrightarrow}& C^\infty(\mathbb{R}^{n_1}) \\ g &\mapsto& f^\ast g \coloneqq g \circ f }

is an algebra homomorphism. Moreover, this is clearly compatible with composition in that

f 1 *(f 2 *g)=(f 2f 1) *g. f_1^\ast(f_2^\ast g) = (f_2 \circ f_1)^\ast g \,.

Stated more abstractly, this means that assigning algebras of smooth functions is a functor

C ():CartSpAlg op C^\infty(-) \;\colon\; CartSp \longrightarrow \mathbb{R} Alg^{op}

from the category CartSp of Cartesian spaces and smooth functions between them (def. ), to the opposite of the category \mathbb{R}Alg of \mathbb{R}-algebras.

Definition

(local diffeomorphisms and open embeddings of Cartesian spaces)

A smooth function f: n nf \colon \mathbb{R}^{n} \to \mathbb{R}^{n} from one Cartesian space to itself (def. ) is called a local diffeomorphism, denoted

f: net n f \;\colon\; \mathbb{R}^{n} \overset{et}{\longrightarrow} \mathbb{R}^n

if the determinant of the matrix of partial derivatives (the “Jacobian” of ff) is everywhere non-vanishing

det(f 1x 1(x) f nx 1(x) f 1x n(x) f nx n(x))0AAAAfor allx n. det \left( \array{ \frac{\partial f^1}{\partial x^1}(x) &\cdots& \frac{\partial f^n}{\partial x^1}(x) \\ \vdots && \vdots \\ \frac{\partial f^1}{\partial x^n}(x) &\cdots& \frac{\partial f^n}{\partial x^n}(x) } \right) \;\neq\; 0 \phantom{AAAA} \text{for all} \, x \in \mathbb{R}^n \,.

If the function ff is both a local diffeomorphism, as above, as well as an injective function then we call it an open embedding, denoted

f: nAetA n. f \;\colon\; \mathbb{R}^n \overset{\phantom{A}et\phantom{A}}{\hookrightarrow} \mathbb{R}^n \,.
Definition

(good open cover of Cartesian spaces)

For n\mathbb{R}^n a Cartesian space (def. ), a differentiably good open cover is

  • an indexed set

    { netAAf iAA n} iI \left\{ \mathbb{R}^n \underoverset{et}{\phantom{AA}f_i\phantom{AA}}{\hookrightarrow} \mathbb{R}^n \right\}_{i \in I}

    of open embeddings (def. )

such that the images

U iim(f i) n U_i \coloneqq im(f_i) \subset \mathbb{R}^n

satisfy:

  1. (open cover) every point of n\mathbb{R}^n is contained in at least one of the U iU_i;

  2. (good) all finite intersections U i 1U i k nU_{i_1} \cap \cdots \cap U_{i_k} \subset \mathbb{R}^n are either empty set or themselves images of open embeddings according to def. .

The inclined reader may notice that the concept of differentiably good open covers from def. is a coverage on the category CartSp of Cartesian spaces with smooth functions between them, making it a site, but the reader not so inclined may ignore this.

(Fiorenza-Schreiber-Stasheff 12, def. 6.3.9)

\,

\,

fiber bundles

Given any context of objects and morphisms between them, such as the Cartesian spaces and smooth functions from def. it is of interest to fix one object XX and consider other objects parameterized over it. These are called bundles (def. ) below. For reference, we briefly discuss here the basic concepts related to bundles in the context of Cartesian spaces.

Of course the theory of bundles is mostly trivial over Cartesian spaces; it gains its main interest from its generalization to more general smooth manifolds (def./prop. below). It is still worthwhile for our development to first consider the relevant concepts in this simple case first.

For more exposition see at fiber bundles in physics.

\,

Definition

(bundles)

We say that a smooth function EfbXE \overset{fb}{\to} X (def. ) is a bundle just to amplify that we think of it as exhibiting EE as being a “space over XX”:

E fb X. \array{ E \\ \downarrow\mathrlap{fb} \\ X } \,.

For xXx \in X a point, we say that the fiber of this bundle over xx is the pre-image

(2)E xfb 1({x})E E_x \coloneqq fb^{-1}(\{x\}) \subset E

of the point xx under the smooth function. We think of fbfb as exhibiting a “smoothly varying” set of fiber spaces over XX.

Given two bundles E 1fb 1XE_1 \overset{fb_1}{\to} X and E 2fb 2XE_2 \overset{fb_2}{\to} X over XX, a homomorphism of bundles between them is a smooth function f:E 1E 2f \colon E_1 \to E_2 (def. ) between their total spaces which respects the bundle projections, in that

fb 2f=fb 1AAAAi.e.AAAE 1 f E 2 fb 1 fb 2 X. fb_2 \circ f = fb_1 \phantom{AAAA} \text{i.e.} \phantom{AAA} \array{ E_1 && \overset{f}{\longrightarrow} && E_2 \\ & {}_{\mathllap{fb_1}}\searrow && \swarrow_{\mathrlap{fb_2}} \\ && X } \,.

Hence a bundle homomorphism is a smooth function that sends fibers to fibers over the same point:

f((E 1) x)(E 2) x. f\left( (E_1)_x \right) \;\subset\; (E_2)_x \,.

The inclined reader may notice that this defines a category of bundles over XX, which is in fact just the slice category CartSp /XCartSp_{/X}; the reader not so inclined may ignore this.

Definition

(sections)

Given a bundle EfbXE \overset{fb}{\to} X (def. ) a section is a smooth function s:XEs \colon X \to E such that

fbs=id XAAAAA E s fb X = X. fb \circ s = id_X \phantom{AAAAA} \array{ && E \\ & {}^{\mathllap{s}}\nearrow & \downarrow\mathrlap{fb} \\ X &=& X } \,.

This means that ss sends every point xXx \in X to an element in the fiber over that point

s(x)E x. s(x) \in E_x \,.

We write

Γ X(E){ E s fb X = Xfb} \Gamma_X(E) \coloneqq \left\{ \array{ && E \\ & {}^{\mathllap{s}}\nearrow & \downarrow^\mathrlap{fb} \\ X &=& X } \phantom{fb} \right\}

for the set of sections of a bundle.

For E 1f 1XE_1 \overset{f_1}{\to} X and E 2f 2XE_2 \overset{f_2}{\to} X two bundles and for

E 1 f E 2 fb 1 fb 2 X \array{ E_1 && \overset{f}{\longrightarrow} && E_2 \\ & {}_{\mathllap{fb_1}}\searrow && \swarrow_{\mathrlap{fb_2}} \\ && X }

a bundle homomorphism between them (def. ), then composition with ff sends sections to sections and hence yields a function denoted

Γ X(E 1) f * Γ X(E 2) s fs. \array{ \Gamma_X(E_1) &\overset{f_\ast}{\longrightarrow}& \Gamma_X(E_2) \\ s &\mapsto& f \circ s } \,.
Example

(trivial bundle)

For XX and FF Cartesian spaces, then the Cartesian product X×FX \times F equipped with the projection

X×F pr 1 X \array{ X \times F \\ \downarrow^\mathrlap{pr_1} \\ X }

to XX is a bundle (def. ), called the trivial bundle with fiber FF. This represents the constant smoothly varying set of fibers, constant on FF

If F=*F = \ast is the point, then this is the identity bundle

X id X. \array{ X \\ \downarrow\mathrlap{id} \\ X } \,.

Given any bundle EfbXE \overset{fb}{\to} X, then a bundle homomorphism (def. ) from the identity bundle to EfbXE \overset{fb}{\to} X is equivalently a section of EfbXE \overset{fb}{\to} X (def. )

X s E id fb X \array{ X && \overset{s}{\longrightarrow} && E \\ & {}_{\mathllap{id}}\searrow && \swarrow_{\mathrlap{fb}} \\ && X }
Definition

(fiber bundle)

A bundle EfbXE \overset{fb}{\to} X (def. ) is called a fiber bundle with typical fiber FF if there exists a differentiably good open cover {U iX} iI\{U_i \hookrightarrow X\}_{i \in I} (def. ) such that the restriction of fbfb to each U iU_i is isomorphic to the trivial fiber bundle with fiber FF over U iU_i. Such diffeomorphisms f i:U i×FE| U if_i \colon U_i \times F \overset{\simeq}{\to} E\vert_{U_i} are called local trivializations of the fiber bundle:

U i×F f i E| U i pr 1 fb| U i U i. \array{ U_i \times F &\underoverset{\simeq}{f_i}{\longrightarrow}& E\vert_{U_i} \\ & {}_{\mathllap{pr_1}}\searrow & \downarrow\mathrlap{fb\vert_{U_i}} \\ && U_i } \,.
Definition

(vector bundle)

A vector bundle is a fiber bundle EvbXE \overset{vb}{\to} X (def. ) with typical fiber a vector space VV such that there exists a local trivialization {U i×Vf iE| U i} iI\{U_i \times V \underoverset{\simeq}{f_i}{\to} E\vert_{U_i}\}_{i \in I} whose gluing functions

U iU j×Vf i| U iU jE| U iU jf j 1| U iU jU iU j×V U_i \cap U_j \times V \overset{f_i\vert_{U_i \cap U_j}}{\longrightarrow} E\vert_{U_i \cap U_j} \overset{f_j^{-1}\vert_{U_i \cap U_j}}{\longrightarrow} U_i \cap U_j \times V

for all i,jIi,j \in I are linear functions over each point xU iU jx \in U_i \cap U_j.

A homomorphism of vector bundle is a bundle morphism ff (def. ) such that there exist local trivializations on both sides with respect to which gg is fiber-wise a linear map.

The inclined reader may notice that this makes vector bundles over XX a category (denoted Vect /XVect_{/X}); the reader not so inclined may ignore this.

Example

(module of sections of a vector bundle)

Given a vector bundle EvbXE \overset{vb}{\to} X (def. ), then its set of sections Γ X(E)\Gamma_X(E) (def. ) becomes a real vector space by fiber-wise multiplication with real numbers. Moreover, it becomes a module over the algebra of smooth functions C (X)C^\infty(X) (example ) by the same fiber-wise multiplication:

C (X) Γ X(E) Γ X(E) (f,s) (xf(x)s(x)). \array{ C^\infty(X) \otimes_{\mathbb{R}} \Gamma_X(E) &\longrightarrow& \Gamma_X(E) \\ (f,s) &\mapsto& (x \mapsto f(x) \cdot s(x)) } \,.

For E 1fb 1XE_1 \overset{fb_1}{\to} X and E 2fb 2XE_2 \overset{fb_2}{\to} X two vector bundles and

E 1 f E 2 fb 1 fb 2 X \array{ E_1 && \overset{f}{\longrightarrow} && E_2 \\ & {}_{\mathllap{fb_1}}\searrow && \swarrow_{\mathrlap{fb_2}} \\ && X }

a vector bundle homomorphism (def. ) then the induced function on sections (def. )

f *:Γ X(E 1)Γ X(E 2) f_\ast \;\colon\; \Gamma_X(E_1) \longrightarrow \Gamma_X(E_2)

is compatible with this action by smooth functions and hence constitutes a homomorphism of C (X)C^\infty(X)-modules.

The inclined reader may notice that this means that taking spaces of sections yields a functor

Γ X():Vect /XC (X)Mod \Gamma_X(-) \;\colon\; Vect_{/X} \longrightarrow C^\infty(X) Mod

from the category of vector bundles over XX to that over modules over C (X)C^\infty(X).

Example

(tangent vector fields and tangent bundle)

For n\mathbb{R}^n a Cartesian space (def. ) the trivial vector bundle (example , def. )

T n n× n tb pr 1 n = n \array{ T \mathbb{R}^n &\coloneqq& \mathbb{R}^n \times \mathbb{R}^n \\ \mathllap{tb}\downarrow && \downarrow\mathrlap{pr_1} \\ \mathbb{R}^n &=& \mathbb{R}^n }

is called the tangent bundle of n\mathbb{R}^n. With (x a) a=1 n(x^a)_{a = 1}^n the coordinate functions on n\mathbb{R}^n (def. ) we write ( a) a=1 n(\partial_a)_{a = 1}^n for the corresponding linear basis of n\mathbb{R}^n regarded as a vector space. Then a general section (def. )

T n v tb n = n \array{ && T \mathbb{R}^n \\ & {}^{\mathllap{v}}\nearrow& \downarrow\mathrlap{tb} \\ \mathbb{R}^n &=& \mathbb{R}^n }

of the tangent bundle has a unique expansion of the form

v=v a a v = v^a \partial_a

where a sum over indices is understood (Einstein summation convention) and where the components (v aC ( n)) a=1 n(v^a \in C^\infty(\mathbb{R}^n))_{a = 1}^n are smooth functions on n\mathbb{R}^n (def. ).

Such a vv is also called a smooth tangent vector field on n\mathbb{R}^n.

Each tangent vector field vv on n\mathbb{R}^n determines a partial derivative on smooth functions

C ( n) D v C ( n) f D vfv a a(f) av afx a. \array{ C^\infty(\mathbb{R}^n) &\overset{D_v}{\longrightarrow}& C^\infty(\mathbb{R}^n) \\ f &\mapsto& \mathrlap{ D_v f \coloneqq v^a \partial_a (f) \coloneqq \sum_a v^a \frac{\partial f}{\partial x^a} } } \,.

By the product law of differentiation, this is a derivation on the algebra of smooth functions (example ) in that

  1. it is an \mathbb{R}-linear map in that

    D v(c 1f 1+c 2f 2)=c 1D vf 1+c 2D vf 2 D_v( c_1 f_1 + c_2 f_2 ) = c_1 D_v f_1 + c_2 D_v f_2
  2. it satisfies the Leibniz rule

    D v(f 1f 2)=(D vf 1)f 2+f 1(D vf 2) D_v(f_1 \cdot f_2) = (D_v f_1) \cdot f_2 + f_1 \cdot (D_v f_2)

for all c 1,c 2c_1, c_2 \in \mathbb{R} and all f 1,f 2C ( n)f_1, f_2 \in C^\infty(\mathbb{R}^n).

Hence regarding tangent vector fields as partial derivatives constitutes a linear function

D:Γ n(T n)Der(C ( n)) D \;\colon\; \Gamma_{\mathbb{R}^n}(T \mathbb{R}^n) \longrightarrow Der(C^\infty(\mathbb{R}^n))

from the space of sections of the tangent bundle. In fact this is a homomorphism of C ( n)C^\infty(\mathbb{R}^n)-modules (example ), in that for fC ( n)f \in C^\infty(\mathbb{R}^n) and vΓ n(T n)v \in \Gamma_{\mathbb{R}^n}(T \mathbb{R}^n) we have

D fv()=fD v(). D_{f v}(-) = f \cdot D_v(-) \,.
Example

(vertical tangent bundle)

Let EfbΣE \overset{fb}{\to} \Sigma be a fiber bundle. Then its vertical tangent bundle T ΣETfbΣT_\Sigma E \overset{T fb}{\to} \Sigma is the fiber bundle (def. ) over Σ\Sigma whose fiber over a point is the tangent bundle (def. ) of the fiber of EfbΣE \overset{fb}{\to}\Sigma over that point:

(T ΣE) xT(E x). (T_\Sigma E)_x \coloneqq T(E_x) \,.

If EΣ×FE \simeq \Sigma \times F is a trivial fiber bundle with fiber FF, then its vertical vector bundle is the trivial fiber bundle with fiber TFT F.

Definition

(dual vector bundle)

For EvbΣE \overset{vb}{\to} \Sigma a vector bundle (def. ), its dual vector bundle is the vector bundle whose fiber (2) over xΣx \in \Sigma is the dual vector space of the corresponding fiber of EΣE \to \Sigma:

(E *) x(E x) *. (E^\ast)_x \;\coloneqq\; (E_x)^\ast \,.

The defining pairing of dual vector spaces (E x) *E x(E_x)^\ast \otimes E_x \to \mathbb{R} applied pointwise induces a pairing on the modules of sections (def. ) of the original vector bundle and its dual with values in the smooth functions (def. ):

(3)Γ Σ(E) C (X)Γ Σ(E *) C (Σ) (v,α) (vα:xα x(v x)) \array{ \Gamma_\Sigma(E) \otimes_{C^\infty(X)} \Gamma_\Sigma(E^\ast) &\longrightarrow& C^\infty(\Sigma) \\ (v,\alpha) &\mapsto& (v \cdot \alpha \colon x \mapsto \alpha_x(v_x) ) }

\,

synthetic differential geometry

Below we encounter generalizations of ordinary differential geometry that include explicit “infinitesimals” in the guise of infinitesimally thickened points, as well as “super-graded infinitesimals”, in the guise of superpoints (necessary for the description of fermion fields such as the Dirac field). As we discuss below, these structures are naturally incorporated into differential geometry in just the same way as Grothendieck introduced them into algebraic geometry (in the guise of “formal schemes”), namely in terms of formally dual rings of functions with nilpotent ideals. That this also works well for differential geometry rests on the following three basic but important properties, which say that smooth functions behave “more algebraically” than their definition might superficially suggest:

Proposition

(the three magic algebraic properties of differential geometry)

  1. embedding of Cartesian spaces into formal duals of R-algebras

    For XX and YY two Cartesian spaces, the smooth functions f:XYf \colon X \longrightarrow Y between them (def. ) are in natural bijection with their induced algebra homomorphisms C (X)f *C (Y)C^\infty(X) \overset{f^\ast}{\longrightarrow} C^\infty(Y) (example ), so that one may equivalently handle Cartesian spaces entirely via their \mathbb{R}-algebras of smooth functions.

    Stated more abstractly, this means equivalently that the functor C ()C^\infty(-) that sends a smooth manifold XX to its \mathbb{R}-algebra C (X)C^\infty(X) of smooth functions (example ) is a fully faithful functor:

    C ():SmthMfdAAAAAlg op. C^\infty(-) \;\colon\; SmthMfd \overset{\phantom{AAAA}}{\hookrightarrow} \mathbb{R} Alg^{op} \,.

    (Kolar-Slovak-Michor 93, lemma 35.8, corollaries 35.9, 35.10)

  2. embedding of smooth vector bundles into formal duals of R-algebra modules

    For E 1vb 1XE_1 \overset{vb_1}{\to} X and E 2vb 2XE_2 \overset{vb_2}{\to} X two vector bundle (def. ) there is then a natural bijection between vector bundle homomorphisms f:E 1E 2f \colon E_1 \to E_2 and the homomorphisms of modules f *:Γ X(E 1)Γ X(E 2)f_\ast \;\colon\; \Gamma_X(E_1) \to \Gamma_X(E_2) that these induces between the spaces of sections (example ).

    More abstractly this means that the functor Γ X()\Gamma_X(-) is a fully faithful functor

    Γ X():VectBund XAAAAC (X)Mod \Gamma_X(-) \;\colon\; VectBund_X \overset{\phantom{AAAA}}{\hookrightarrow} C^\infty(X) Mod

    (Nestruev 03, theorem 11.29)

    Moreover, the modules over the \mathbb{R}-algebra C (X)C^\infty(X) of smooth functions on XX which arise this way as sections of smooth vector bundles over a Cartesian space XX are precisely the finitely generated free modules over C (X)C^\infty(X).

    (Nestruev 03, theorem 11.32)

  3. vector fields are derivations of smooth functions.

    For XX a Cartesian space (example ), then any derivation D:C (X)C (X)D \colon C^\infty(X) \to C^\infty(X) on the \mathbb{R}-algebra C (X)C^\infty(X) of smooth functions (example ) is given by differentiation with respect to a uniquely defined smooth tangent vector field: The function that regards tangent vector fields with derivations from example

    Γ X(TX) AA Der(C (X)) v D v \array{ \Gamma_X(T X) &\overset{\phantom{A}\simeq\phantom{A}}{\longrightarrow}& Der(C^\infty(X)) \\ v &\mapsto& D_v }

    is in fact an isomorphism.

    (This follows directly from the Hadamard lemma.)

Actually all three statements in prop. hold not just for Cartesian spaces, but generally for smooth manifolds (def./prop. below; if only we generalize in the second statement from free modules to projective modules. However for our development here it is useful to first focus on just Cartesian spaces and then bootstrap the theory of smooth manifolds and much more from that, which we do below.

\,

\,

differential forms

We introduce and discuss differential forms on Cartesian spaces.

Definition

(differential 1-forms on Cartesian spaces and the cotangent bundle)

For nn \in \mathbb{N} a smooth differential 1-form ω\omega on a Cartesian space n\mathbb{R}^n (def. ) is an n-tuple

(ω iCartSp( n,)) i=1 n \left(\omega_i \in CartSp\left(\mathbb{R}^n,\mathbb{R}\right)\right)_{i = 1}^n

of smooth functions (def. ), which we think of equivalently as the coefficients of a formal linear combination

ω=ω idx i \omega = \omega_i d x^i

on a set {dx 1,dx 2,,dx n}\{d x^1, d x^2, \cdots, d x^n\} of cardinality nn.

Here a sum over repeated indices is tacitly understood (Einstein summation convention).

Write

Ω 1( k)CartSp( k,) ×kSet \Omega^1(\mathbb{R}^k) \simeq CartSp(\mathbb{R}^k, \mathbb{R})^{\times k}\in Set

for the set of smooth differential 1-forms on k\mathbb{R}^k.

We may think of the expressions (dx a) a=1 n(d x^a)_{a = 1}^n as a linear basis for the dual vector space n\mathbb{R}^n. With this the differential 1-forms are equivalently the sections (def. ) of the trivial vector bundle (example , def. )

T * n n×( n) * cb pr 1 n = n \array{ T^\ast \mathbb{R}^n &\coloneqq& \mathbb{R}^n \times (\mathbb{R}^n)^\ast \\ \mathllap{cb}\downarrow && \downarrow\mathrlap{pr_1} \\ \mathbb{R}^n &=& \mathbb{R}^n }

called the cotangent bundle of n\mathbb{R}^n (def. ):

Ω 1( n)=Γ n(T * n). \Omega^1(\mathbb{R}^n) = \Gamma_{\mathbb{R}^n}(T^\ast \mathbb{R}^n) \,.

This amplifies via example that Ω 1( n)\Omega^1(\mathbb{R}^n) has the structure of a module over the algebra of smooth functions C ( n)C^\infty(\mathbb{R}^n), by the evident multiplication of differential 1-forms with smooth functions:

  1. The set Ω 1( k)\Omega^1(\mathbb{R}^k) of differential 1-forms in a Cartesian space (def. ) is naturally an abelian group with addition given by componentwise addition

    ω+λ =ω idx i+λ idx i =(ω i+λ i)dx i, \begin{aligned} \omega + \lambda & = \omega_i d x^i + \lambda_i d x^i \\ & = (\omega_i + \lambda_i) d x^i \end{aligned} \,,
  2. The abelian group Ω 1( k)\Omega^1(\mathbb{R}^k) is naturally equipped with the structure of a module over the algebra of smooth functions C ( k)C^\infty(\mathbb{R}^k) (example ), where the action C ( k)×Ω 1( k)Ω 1( k)C^\infty(\mathbb{R}^k) \times\Omega^1(\mathbb{R}^k) \to \Omega^1(\mathbb{R}^k) is given by componentwise multiplication

    fω=(fω i)dx i. f \cdot \omega = ( f \cdot \omega_i) d x^i \,.

Accordingly there is a canonical pairing between differential 1-forms and tangent vector fields (example )

(4)Γ n(T n) Γ n(T* n) ι ()() C ( n) (v,ω) ι vωv aω a \array{ \Gamma_{\mathbb{R}^n}(T \mathbb{R}^n) \otimes_{\mathbb{R}} \Gamma_{\mathbb{R}^n}(T \ast \mathbb{R}^n) &\overset{\iota_{(-)}(-) }{\longrightarrow}& C^\infty(\mathbb{R}^n) \\ (v,\omega) &\mapsto& \mathrlap{ \iota_v \omega \coloneqq v^a \omega_a } }

With differential 1-forms in hand, we may collect all the first-order partial derivatives of a smooth function into a single object: the exterior derivative or de Rham differential is the \mathbb{R}-linear function

(5)C ( n) d Ω 1( n) f dffx adx a. \array{ C^\infty(\mathbb{R}^n) &\overset{d}{\longrightarrow}& \Omega^1(\mathbb{R}^n) \\ f &\mapsto& \mathrlap{ d f \coloneqq \frac{\partial f}{ \partial x^a} d x^a } } \,.

Under the above pairing with tangent vector fields vv this yields the particular partial derivative along vv:

ι vdf=D vf=v afx a. \iota_v d f = D_v f = v^a \frac{\partial f}{\partial x^a} \,.

We think of dx id x^i as a measure for infinitesimal displacements along the x ix^i-coordinate of a Cartesian space. If we have a measure of infintesimal displacement on some n\mathbb{R}^n and a smooth function f: n˜ nf \colon \mathbb{R}^{\tilde n} \to \mathbb{R}^n, then this induces a measure for infinitesimal displacement on n˜\mathbb{R}^{\tilde n} by sending whatever happens there first with ff to n\mathbb{R}^n and then applying the given measure there. This is captured by the following definition:

Definition

(pullback of differential 1-forms)

For ϕ: k˜ k\phi \colon \mathbb{R}^{\tilde k} \to \mathbb{R}^k a smooth function, the pullback of differential 1-forms along ϕ\phi is the function

ϕ *:Ω 1( k)Ω 1( k˜) \phi^* \colon \Omega^1(\mathbb{R}^{k}) \to \Omega^1(\mathbb{R}^{\tilde k})

between sets of differential 1-forms, def. , which is defined on basis-elements by

ϕ *dx iϕ ix˜ jdx˜ j \phi^* d x^i \;\coloneqq\; \frac{\partial \phi^i}{\partial \tilde x^j} d \tilde x^j

and then extended linearly by

ϕ *ω =ϕ *(ω idx i) (ϕ *ω) iϕ ix˜ jdx˜ j =(ω iϕ)ϕ ix˜ jdx˜ j. \begin{aligned} \phi^* \omega & = \phi^* \left( \omega_i d x^i \right) \\ & \coloneqq \left(\phi^* \omega\right)_i \frac{\partial \phi^i }{\partial \tilde x^j} d \tilde x^j \\ & = (\omega_i \circ \phi) \cdot \frac{\partial \phi^i }{\partial \tilde x^j} d \tilde x^j \end{aligned} \,.

This is compatible with identity morphisms and composition in that

(6)(id n) *=id Ω 1( n)AAAA(gf) *=f *g *. (id_{\mathbb{R}^n})^\ast = id_{\Omega^1(\mathbb{R}^n)} \phantom{AAAA} (g \circ f)^\ast = f^\ast \circ g^\ast \,.

Stated more abstractly, this just means that pullback of differential 1-forms makes the assignment of sets of differential 1-forms to Cartesian spaces a contravariant functor

Ω 1():CartSp opSet. \Omega^1(-) \;\colon\; CartSp^{op} \longrightarrow Set \,.

The following definition captures the idea that if dx id x^i is a measure for displacement along the x ix^i-coordinate, and dx jd x^j a measure for displacement along the x jx^j coordinate, then there should be a way to get a measure, to be called dx idx jd x^i \wedge d x^j, for infinitesimal surfaces (squares) in the x ix^i-x jx^j-plane. And this should keep track of the orientation of these squares, with

dx jdx i=dx idx j d x^j \wedge d x^i = - d x^i \wedge d x^j

being the same infinitesimal measure with orientation reversed.

Definition

(exterior algebra of differential n-forms)

For k,nk,n \in \mathbb{N}, the smooth differential forms on a Cartesian space k\mathbb{R}^k (def. ) is the exterior algebra

Ω ( k) C ( k) Ω 1( k) \Omega^\bullet(\mathbb{R}^k) \coloneqq \wedge^\bullet_{C^\infty(\mathbb{R}^k)} \Omega^1(\mathbb{R}^k)

over the algebra of smooth functions C ( k)C^\infty(\mathbb{R}^k) (example ) of the module Ω 1( k)\Omega^1(\mathbb{R}^k) of smooth 1-forms.

We write Ω n( k)\Omega^n(\mathbb{R}^k) for the sub-module of degree nn and call its elements the differential n-forms.

Explicitly this means that a differential n-form ωΩ n( k)\omega \in \Omega^n(\mathbb{R}^k) on k\mathbb{R}^k is a formal linear combination over C ( k)C^\infty(\mathbb{R}^k) (example ) of basis elements of the form dx i 1dx i nd x^{i_1} \wedge \cdots \wedge d x^{i_n} for i 1<i 2<<i ni_1 \lt i_2 \lt \cdots \lt i_n:

ω=ω i 1,,i ndx i 1dx i n. \omega = \omega_{i_1, \cdots, i_n} d x^{i_1} \wedge \cdots \wedge d x^{i_n} \,.

Now all the constructions for differential 1-forms above extent naturally to differential n-forms:

Definition

(exterior derivative or de Rham differential)

For n\mathbb{R}^n a Cartesian space (def. ) the de Rham differential d:C ( n)Ω 1( n)d \colon C^\infty(\mathbb{R}^n) \to \Omega^1(\mathbb{R}^n) (5) uniquely extended as a derivation of degree +1 to the exterior algebra of differential forms (def. )

d:Ω ( n)Ω ( n) d \;\colon\; \Omega^\bullet(\mathbb{R}^n) \longrightarrow \Omega^\bullet(\mathbb{R}^n)

meaning that for ω iΩ k i()\omega_i \in \Omega^{k_i}(\mathbb{R}) then

d(ω 1ω 2)=(dω 1)ω 2+ω 1dω 2. d(\omega_1 \wedge \omega_2) \;=\; (d \omega_1) \wedge \omega_2 + \omega_1 \wedge d \omega_2 \,.

In components this simply means that

dω =d(ω i 1i kdx i 1dx i k) =ω i 1i kx adx adx i 1dx i k. \begin{aligned} d \omega & = d \left(\omega_{i_1 \cdots i_k} d x^{i_1} \wedge \cdots \wedge d x^{i_k}\right) \\ & = \frac{\partial \omega_{i_1 \cdots i_k}}{\partial x^{a}} d x^a \wedge d x^{i_1} \wedge \cdots \wedge d x^{i_k} \end{aligned} \,.

Since partial derivatives commute with each other, while differential 1-form anti-commute, this implies that dd is nilpotent

d 2=dd=0. d^2 = d \circ d = 0 \,.

We say hence that differential forms form a cochain complex, the de Rham complex (Ω ( n),d)(\Omega^\bullet(\mathbb{R}^n), d).

Definition

(contraction of differential n-forms with tangent vector fields)

The pairing ι vω=ω(v)\iota_v \omega = \omega(v) of tangent vector fields vv with differential 1-forms ω\omega (4) uniquely extends to the exterior algebra Ω ( n)\Omega^\bullet(\mathbb{R}^n) of differential forms (def. ) as a derivation of degree -1

ι v:Ω +1( n)Ω ( n). \iota_v \;\colon\; \Omega^{\bullet+1}(\mathbb{R}^n) \longrightarrow \Omega^\bullet(\mathbb{R}^n) \,.

In particular for ω 1,ω 2Ω 1( n)\omega_1, \omega_2 \in \Omega^1(\mathbb{R}^n) two differential 1-forms, then

ι v(ω 1ω 2)=ω 1(v)ω 2ω 2(v)ω 1Ω 1( n). \iota_{v} (\omega_1 \wedge \omega_2) \;=\; \omega_1(v) \omega_2 - \omega_2(v) \omega_1 \;\in\; \Omega^1(\mathbb{R}^n) \,.
Proposition

(pullback of differential n-forms)

For f: n 1 n 2f \colon \mathbb{R}^{n_1} \to \mathbb{R}^{n_2} a smooth function between Cartesian spaces (def. ) the operationf of pullback of differential 1-forms of def. extends as an C ( k)C^\infty(\mathbb{R}^k)-algebra homomorphism to the exterior algebra of differential forms (def. ),

f *:Ω ( n 2)Ω ( n 1) f^\ast \;\colon\; \Omega^\bullet(\mathbb{R}^{n_2}) \longrightarrow \Omega^\bullet(\mathbb{R}^{n_1})

given on basis elements by

f *(dx i 1dx i n)=(f *dx i 1f *dx i n). f^* \left( dx^{i_1} \wedge \cdots \wedge dx^{i_n} \right) = \left(f^* dx^{i_1} \wedge \cdots \wedge f^* dx^{i_n} \right) \,.

This commutes with the de Rham differential dd on both sides (def. ) in that

df *=f *dAAAAAΩ (X) f * Ω (Y) d d Ω (X) f * Ω (Y) d \circ f^\ast = f^\ast \circ d \phantom{AAAAA} \array{ \Omega^\bullet(X) &\overset{f^\ast}{\longleftarrow}& \Omega^\bullet(Y) \\ \mathllap{d}\downarrow && \downarrow\mathrlap{d} \\ \Omega^\bullet(X) &\underset{f^\ast}{\longleftarrow}& \Omega^\bullet(Y) }

hence that pullback of differential forms is a chain map of de Rham complexes.

This is still compatible with identity morphisms and composition in that

(7)(id n) *=id Ω 1( n)AAAA(gf) *=f *g *. (id_{\mathbb{R}^n})^\ast = id_{\Omega^1(\mathbb{R}^n)} \phantom{AAAA} (g \circ f)^\ast = f^\ast \circ g^\ast \,.

Stated more abstractly, this just means that pullback of differential n-forms makes the assignment of sets of differential n-forms to Cartesian spaces a contravariant functor

Ω n():CartSp opSet. \Omega^n(-) \;\colon\; CartSp^{op} \longrightarrow Set \,.
Proposition

(Cartan's homotopy formula)

Let XX be a Cartesian space (def. ), and let vΓ(TX)v \in \Gamma(T X) be a smooth tangent vector field (example ).

For tt \in \mathbb{R} write exp(tv):XX\exp(t v) \colon X \overset{\simeq}{\to} X for the flow by diffeomorphisms along vv of parameter length tt.

Then the derivative with respect to tt of the pullback of differential forms along exp(tv)\exp(t v), hence the Lie derivative v:Ω (X)Ω (X)\mathcal{L}_v \colon \Omega^\bullet(X) \to \Omega^\bullet(X), is given by the anticommutator of the contraction derivation ι v\iota_v (def. ) with the de Rham differential dd (def. ):

v ddtexp(tv) *ω| t=0 =ι vdω+dι vω. \begin{aligned} \mathcal{L}_v &\coloneqq \frac{d}{d t } \exp(t v)^\ast \omega \vert_{t = 0} \\ & = \iota_v d \omega + d \iota_v \omega \,. \end{aligned}

Finally we turn to the concept of integration of differential forms (def. below). First we need to say what it is that differential forms may be integrated over:

Definition

(smooth singular simplicial chains in Cartesian spaces)

For nn \in \mathbb{N}, the standard n-simplex in the Cartesian space n\mathbb{R}^n (def. ) is the subset

Δ n{(x i) i=1 n|0x 1x n} n. \Delta^n \;\coloneqq\; \left\{ (x^i)_{i = 1}^n \;\vert\; 0 \leq x^1 \leq \cdots \leq x^n \right\} \;\subset\; \mathbb{R}^n \,.

More generally, a smooth singular n-simplex in a Cartesian space k\mathbb{R}^k is a smooth function (def. )

σ: n k, \sigma \;\colon\; \mathbb{R}^n \longrightarrow \mathbb{R}^k \,,

to be thought of as a smooth extension of its restriction

σ| Δ n:Δ n k. \sigma\vert_{\Delta^n} \;\colon\; \Delta^n \longrightarrow \mathbb{R}^k \,.

(This is called a singular simplex because there is no condition that Σ\Sigma be an embedding in any way, in particular σ\sigma may be a constant function.)

A singular chain in k\mathbb{R}^k of dimension nn is a formal linear combination of singular nn-simplices in k\mathbb{R}^k.

In particular, given a singular n+1n+1-simplex σ\sigma, then its boundary is a singular chain of singular nn-simplices σ\partial \sigma.

Definition

(fiber-integration of differential forms) over smooth singular chains in Cartesian spaces)

For nn \in \mathbb{N} and ωΩ n( n)\omega \in \Omega^n(\mathbb{R}^n) a differential n-form (def. ), which may be written as

ω=fdx 1dx n, \omega = f d x^1 \wedge \cdots d x^n \,,

then its integration over the standard n-simplex Δ n n\Delta^n \subset \mathbb{R}^n (def. ) is the ordinary integral (e.g. Riemann integral)

Δ nω0x 1x n1f(x 1,,x n)dx 1dx n. \int_{\Delta^n} \omega \;\coloneqq\; \underset{0 \leq x^1 \leq \cdots \leq x^n \leq 1}{\int} f(x^1, \cdots, x^n) \, d x^1 \cdots d x^n \,.

More generally, for

  1. ωΩ n( k)\omega \in \Omega^n(\mathbb{R}^k) a differential n-forms;

  2. C=ic iσ iC = \underset{i}{\sum} c_i \sigma_i a singular nn-chain (def. )

in any Cartesian space k\mathbb{R}^k. Then the integration of ω\omega over xx is the sum of the integrations, as above, of the pullback of differential forms (def. ) along all the singular n-simplices in the chain:

Cωic i Δ n(σ i) *ω. \int_C \omega \;\coloneqq\; \underset{i}{\sum} c_i \int_{\Delta^n} (\sigma_i)^\ast \omega \,.

Finally, for UU another Cartesian space, then fiber integration of differential forms along U×CUU \times C \to U is the linear map

C:Ω +dim(C)(U×C)Ω (U) \int_C \;\colon\; \Omega^{\bullet + dim(C)}(U \times C) \longrightarrow \Omega^\bullet(U)

which on differential forms of the form ω Uω\omega_U \wedge \omega with (ω U\omega_U pulled back from UU and ω\omega from CC) is given by:

Cω Uω(1) |ω U|( Cω)ω U. \int_C \omega_U \wedge \omega \;\coloneqq\; (-1)^{\vert \omega_U\vert} \Big( \textstyle{\int}_C \omega \Big) \omega_U \,.
Proposition

(Stokes theorem for fiber-integration of differential forms)

For Σ\Sigma a smooth singular simplicial chain (def. ) the operation of fiber-integration of differential forms along U×Σpr 1UU \times \Sigma \overset{pr_1}{\longrightarrow} U (def. ) is compatible with the exterior derivative d Ud_U on UU (def. ) in that

d Σω =(1) dim(Σ) Σd Uω =(1) dim(Σ)( Σdω Σω), \begin{aligned} d \int_\Sigma \omega & = (-1)^{dim(\Sigma)} \int_\Sigma d_U \omega \\ & = (-1)^{dim(\Sigma)} \left( \int_\Sigma d \omega - \int_{\partial \Sigma} \omega \right) \end{aligned} \,,

where d=d U+d Σd = d_U + d_\Sigma is the de Rham differential on U×ΣU \times \Sigma (def. ) and where the second equality is the Stokes theorem along Σ\Sigma:

Σd Σω= Σω. \int_\Sigma d_\Sigma \omega = \int_{\partial \Sigma} \omega \,.

\,

This concludes our review of the basics of (synthetic) differential geometry on which the following development of quantum field theory is based. In the next chapter we consider spacetime and spin.

Spacetime

Relativistic field theory takes place on spacetime.

The concept of spacetime makes sense for every dimension p+1p+1 with pp \in \mathbb{N}. The observable universe has macroscopic dimension 3+13+1, but quantum field theory generally makes sense also in lower and in higher dimensions. For instance quantum field theory in dimension 0+1 is the “worldline” theory of particles, also known as quantum mechanics; while quantum field theory in dimension >p+1\gt p+1 may be “KK-compactified” to an “effective” field theory in dimension p+1p+1 which generally looks more complicated than its higher dimensional incarnation.

However, every realistic field theory, and also most of the non-realistic field theories of interest, contain spinor fields such as the Dirac field (example below) and the precise nature and behaviour of spinors does depend sensitively on spacetime dimension. In fact the theory of relativistic spinors is mathematically most natural in just the following four spacetime dimensions:

p+1=AAAAA2+1, 3+1, 5+1, 9+1 p +1 = \phantom{AAAAA} \array{ 2+1,\; & 3+1,\; & \, & 5+1,\; &\, & \, & \, & \, 9+1 }

In the literature one finds these four dimensions advertized for two superficially unrelated reasons:

  1. in precisely these dimensions “twistors” exist (see there);

  2. in precisely these dimensions “GS-superstrings” exist (see there).

However, both these explanations have a common origin in something simpler and deeper: Spacetime in these dimensions appears from the “Pauli matrices” with entries in the real normed division algebras. (In fact it goes deeper still, but this will not concern us here.)

This we explain now, and then we use this to obtain a slick handle on spinors in these dimensions, using simple linear algebra over the four real normed division algebras. At the end (in remark ) we give a dictionary that expresses these constructions in terms of the “two-component spinor notation” that is traditionally used in physics texts (remark below).

The relation between real spin representations and division algebras, is originally due to Kugo-Townsend 82, Sudbery 84 and others. We follow the streamlined discussion in Baez-Huerta 09 and Baez-Huerta 10.

A key extra structure that the spinors impose on the underlying Cartesian space of spacetime is its causal structure, which determines which points in spacetime (“events”) are in the future or the past of other points (def. below). This causal structure will turn out to tightly control the quantum field theory on spacetime in terms of the “causal additivity of the S-matrix” (prop. below) and the induced “causal locality” of the algebra of quantum observables (prop. below). To prepare the discussion of these constructions, we end this chapter with some basics on the causal structure of Minkowski spacetime.

\,

  1. Real division algebras

  2. Spacetime in dimensions 3, 4, 6 and 10

  3. Lorentz group and Spin group

  4. Spinors in dimensions 3, 4, 6 and 10

  5. Causal structure

\,

Real division algebras

To amplify the following pattern and to fix our notation for algebra generators, recall these definitions:

Definition

(complex numbers)

The complex numbers \mathbb{C} is the commutative algebra over the real numbers \mathbb{R} which is generated from one generators {e 1}\{e_1\} subject to the relation

  • (e 1) 2=1(e_1)^2 = -1.
Definition

(quaternions)

The quaternions \mathbb{H} is the associative algebra over the real numbers which is generated from three generators {e 1,e 2,e 3}\{e_1, e_2, e_3\} subject to the relations

quaternion multiplication table
  1. for all ii

    (e i) 2=1(e_i)^2 = -1

  2. for (i,j,k)(i,j,k) a cyclic permutation of (1,2,3)(1,2,3) then

    1. e ie j=e ke_i e_j = e_k

    2. e je i=e ke_j e_i = -e_k

(graphics grabbed from Baez 02)

Definition

(octonions)

The octonions 𝕆\mathbb{O} is the nonassociative algebra over the real numbers which is generated from seven generators {e 1,,e 7}\{e_1, \cdots, e_7\} subject to the relations

octonion multiplication table
  1. for all ii

    (e i) 2=1(e_i)^2 = -1

  2. for e ie je ke_i \to e_j \to e_k an edge or circle in the diagram shown (a labeled version of the Fano plane) then

    1. e ie j=e ke_i e_j = e_k

    2. e je i=e ke_j e_i = -e_k

    and all relations obtained by cyclic permutation of the indices in these equations.

(graphics grabbed from Baez 02)

One defines the following operations on these real algebras:

Definition

(conjugation, real part, imaginary part and absolute value)

For 𝕂{,,,𝕆}\mathbb{K} \in \{\mathbb{R}, \mathbb{C}, \mathbb{H}, \mathbb{O}\}, let

() *:𝕂𝕂 (-)^\ast \;\colon\; \mathbb{K} \longrightarrow \mathbb{K}

be the antihomomorphism of real algebras

(ra) *=ra * ,forr,a𝕂 (ab) *=b *a * ,fora,b𝕂 \begin{aligned} (r a)^\ast = r a^\ast &, \text{for}\;\; r \in \mathbb{R}, a \in \mathbb{K} \\ (a b)^\ast = b^\ast a^\ast &,\text{for}\;\; a,b \in \mathbb{K} \end{aligned}

given on the generators of def. , def. and def. by

(e i) *=e i. (e_i)^\ast = - e_i \,.

This operation makes 𝕂\mathbb{K} into a star algebra. For the complex numbers \mathbb{C} this is called complex conjugation, and in general we call it conjugation.

Let then

Re:𝕂 Re \;\colon\; \mathbb{K} \longrightarrow \mathbb{R}

be the function

Re(a)12(a+a *) Re(a) \;\coloneqq\; \tfrac{1}{2}(a + a^\ast)

(“real part”) and

Im:𝕂 Im \;\colon\; \mathbb{K} \longrightarrow \mathbb{R}

be the function

Im(a)12(aa *) Im(a) \;\coloneqq \; \tfrac{1}{2}(a - a^\ast)

(“imaginary part”).

It follows that for all a𝕂a \in \mathbb{K} then the product of a with its conjugate is in the real center of 𝕂\mathbb{K}

aa *=a *a𝕂 a a^\ast = a^\ast a \;\in \mathbb{R} \hookrightarrow \mathbb{K}

and we write the square root of this expression as

|a|aa * {\vert a\vert} \;\coloneqq\; \sqrt{a a^\ast}

called the norm or absolute value function

||:𝕂. {\vert -\vert} \;\colon\; \mathbb{K} \longrightarrow \mathbb{R} \,.

This norm operation clearly satisfies the following properties (for all a,b𝕂a,b \in \mathbb{K})

  1. |a|0\vert a \vert \geq 0;

  2. |a|=0a=0{\vert a \vert } = 0 \;\;\;\;\; \Leftrightarrow\;\;\;\;\;\; a = 0;

  3. |ab|=|a||b|{\vert a b \vert } = {\vert a \vert} {\vert b \vert}

and hence makes 𝕂\mathbb{K} a normed algebra.

Since \mathbb{R} is a division algebra, these relations immediately imply that each 𝕂\mathbb{K} is a division algebra, in that

ab=0a=0orb=0. a b = 0 \;\;\;\;\;\; \Rightarrow \;\;\;\;\;\; a = 0 \;\; \text{or} \;\; b = 0 \,.

Hence the conjugation operation makes 𝕂\mathbb{K} a real normed division algebra.

Remark

(sequence of inclusions of real normed division algebras)

Suitably embedding the sets of generators in def. , def. and def. into each other yields sequences of real star-algebra inclusions

𝕆. \mathbb{R} \hookrightarrow \mathbb{C} \hookrightarrow \mathbb{H} \hookrightarrow \mathbb{O} \,.

For example for the first two inclusions we may send each generator to the generator of the same name, and for the last inclusion me may choose

1 1 e 1 e 3 e 2 e 4 e 3 e 6 \array{ 1 &\mapsto& 1 \\ e_1 &\mapsto & e_3 \\ e_2 &\mapsto& e_4 \\ e_3 &\mapsto& e_6 }
Proposition

(Hurwitz theorem: \mathbb{R}, \mathbb{C}, \mathbb{H} and 𝕆\mathbb{O} are the normed real division algebras)

The four algebras of real numbers \mathbb{R}, complex numbers \mathbb{C}, quaternions \mathbb{H} and octonions 𝕆\mathbb{O} from def. , def. and def. respectively, which are real normed division algebras via def. , are, up to isomorphism, the only real normed division algebras that exist.

Remark

(Cayley-Dickson construction and sedenions)

While prop. says that the sequence from remark

𝕆 \mathbb{R} \hookrightarrow \mathbb{C} \hookrightarrow \mathbb{H} \hookrightarrow \mathbb{O}

is maximal in the category of real normed non-associative division algebras, there is a pattern that does continue if one disregards the division algebra property. Namely each step in this sequence is given by a construction called forming the Cayley-Dickson double algebra. This continues to an unbounded sequence of real nonassociative star-algebras

𝕆𝕊 \mathbb{R} \hookrightarrow \mathbb{C} \hookrightarrow \mathbb{H} \hookrightarrow \mathbb{O} \hookrightarrow \mathbb{S} \hookrightarrow \cdots

where the next algebra 𝕊\mathbb{S} is called the sedenions.

What actually matters for the following relation of the real normed division algebras to real spin representations is that they are also alternative algebras:

Definition

(alternative algebras)

Given any non-associative algebra AA, then the trilinear map

[,,]AAAA [-,-,-] \;-\; A \otimes A \otimes A \longrightarrow A

given on any elements a,b,cAa,b,c \in A by

[a,b,c](ab)ca(bc) [a,b,c] \coloneqq (a b) c - a (b c)

is called the associator (in analogy with the commutator [a,b]abba[a,b] \coloneqq a b - b a ).

If the associator is completely antisymmetric (in that for any permutation σ\sigma of three elements then [a σ 1,a σ 2,a σ 3]=(1) |σ|[a 1,a 2,a 3][a_{\sigma_1}, a_{\sigma_2}, a_{\sigma_3}] = (-1)^{\vert \sigma\vert} [a_1, a_2, a_3] for |σ|\vert \sigma \vert the signature of the permutation) then AA is called an alternative algebra.

If the characteristic of the ground field is different from 2, then alternativity is readily seen to be equivalent to the conditions that for all a,bAa,b \in A then

(aa)b=a(ab)and(ab)b=a(bb). (a a)b = a (a b) \;\;\;\;\; \text{and} \;\;\;\;\; (a b) b = a (b b) \,.

We record some basic properties of associators in alternative star-algebras that we need below:

Proposition

(properties of alternative star algebras)

Let AA be an alternative algebra (def. ) which is also a star algebra. Then (using def. ):

  1. the associator vanishes when at least one argument is real

    [Re(a),b,c] [Re(a),b,c]
  2. the associator changes sign when one of its arguments is conjugated

    [a,b,c]=[a *,b,c]; [a,b,c] = -[a^\ast,b,c] \,;
  3. the associator vanishes when one of its arguments is the conjugate of another

    [a,a *,b]=0; [a,a^\ast, b] = 0 \,;
  4. the associator is purely imaginary

    Re([a,b,c])=0. Re([a,b,c]) = 0 \,.
Proof

That the associator vanishes as soon as one argument is real is just the linearity of an algebra product over the ground ring.

Hence in fact

[a,b,c]=[Im(a),Im(b),Im(c)]. [a,b,c] = [Im(a), Im(b), Im(c)] \,.

This implies the second statement by linearity. And so follows the third statement by skew-symmetry:

[a,a *,b]=[a,a,b]=0. [a,a^\ast,b] = -[a,a,b] = 0 \,.

The fourth statement finally follows by this computation:

[a,b,c] * =[c *,b *,a *] =[c,b,a] =[a,b,c]. \begin{aligned} \,[ a, b, c]^\ast & = -[c^\ast, b^\ast, a^\ast] \\ & = -[c,b,a] \\ & = -[a,b,c] \end{aligned} \,.

Here the first equation follows by inspection and using that (ab) *=b *a *(a b)^\ast = b^\ast a^\ast, the second follows from the first statement above, and the third is the anti-symmetry of the associator.

It is immediate to check that:

Proposition

(\mathbb{R}, \mathbb{C}, \mathbb{H} and 𝕆\mathbb{O} are real alternative algebras)

The real algebras of real numbers, complex numbers, def. ,quaternions def. and octonions def. are alternative algebras (def. ).

Proof

Since the real numbers, complex numbers and quaternions are associative algebras, their associator vanishes identically. It only remains to see that the associator of the octonions is skew-symmetric. By linearity it is sufficient to check this on generators. So let e ie je ke_i \to e_j \to e_k be a circle or a cyclic permutation of an edge in the Fano plane. Then by definition of the octonion multiplication we have

(e ie j)e j =e ke j =e je k =e i =e i(e je j) \begin{aligned} (e_i e_j) e_j &= e_k e_j \\ &= - e_j e_k \\ & = -e_i \\ & = e_i (e_j e_j) \end{aligned}

and similarly

(e ie i)e j =e j =e ke i =e ie k =e i(e ie j). \begin{aligned} (e_i e_i ) e_j &= - e_j \\ &= - e_k e_i \\ &= e_i e_k \\ &= e_i (e_i e_j) \end{aligned} \,.

The analog of the Hurwitz theorem (prop. ) is now this:

Proposition

(\mathbb{R}, \mathbb{C}, \mathbb{H} and 𝕆\mathbb{O} are precisely the alternative real division algebras)

The only division algebras over the real numbers which are also alternative algebras (def. ) are the real numbers themselves, the complex numbers, the quaternions and the octonions from prop. .

This is due to (Zorn 30).

For the following, the key point of alternative algebras is this equivalent characterization:

Proposition

(alternative algebra detected on subalgebras spanned by any two elements)

A nonassociative algebra is alternative, def. , precisely if the subalgebra? generated by any two elements is an associative algebra.

This is due to Emil Artin, see for instance (Schafer 95, p. 18).

Proposition is what allows to carry over a minimum of linear algebra also to the octonions such as to yield a representation of the Clifford algebra on 9,1\mathbb{R}^{9,1}. This happens in the proof of prop. below.

So we will be looking at a fragment of linear algebra over these four normed division algebras. To that end, fix the following notation and terminology:

Definition

(hermitian matrices with values in real normed division algebras)

Let 𝕂\mathbb{K} be one of the four real normed division algebras from prop. , hence equivalently one of the four real alternative division algebras from prop. .

Say that an n×nn \times n matrix with coefficients in 𝕂\mathbb{K}

AMat n×n(𝕂) A\in Mat_{n\times n}(\mathbb{K})

is a hermitian matrix if the transpose matrix (A t) ijA ji(A^t)_{i j} \coloneqq A_{j i} equals the componentwise conjugated matrix (def. ):

A t=A *. A^t = A^\ast \,.

Hence with the notation

() (() t) * (-)^\dagger \coloneqq ((-)^t)^\ast

we have that AA is a hermitian matrix precisely if

A=A . A = A^\dagger \,.

We write Mat 2×2 her(𝕂)Mat_{2 \times 2}^{her}(\mathbb{K}) for the real vector space of hermitian matrices.

Definition

(trace reversal)

Let AMat 2×2 her(𝕂)A \in Mat_{2 \times 2}^{her}(\mathbb{K}) be a hermitian 2×22 \times 2 matrix as in def. . Its trace reversal is the result of subtracting its trace times the identity matrix:

A˜A(trA)1 n×n. \tilde A \;\coloneqq\; A - (tr A) 1_{n\times n} \,.

\,

Minkowski spacetime in dimensions 3,4,6 and 10

We now discover Minkowski spacetime of dimension 3,4,6 and 10, in terms of the real normed division algebras 𝕂\mathbb{K} from prop. , equivalently the real alternative division algebras from prop. : this is prop./def. and def. below.

Proposition/Definition

(Minkowski spacetime as real vector space of hermitian matrices in real normed division algebras)

Let 𝕂\mathbb{K} be one of the four real normed division algebras from prop. , hence one of the four real alternative division algebras from prop. .

Then the real vector space of 2×22 \times 2 hermitian matrices over 𝕂\mathbb{K} (def. ) equipped with the inner product η\eta whose quadratic form || η 2{\vert -\vert^2_\eta} is the negative of the determinant operation on matrices is Minkowski spacetime:

(8) dim (𝕂)+1,1 ( dim (𝕂)+2,|| η 2) (Mat 2×2 her(𝕂),det). \begin{aligned} \mathbb{R}^{dim_{\mathbb{R}}(\mathbb{K})+1,1} & \coloneqq \left( \mathbb{R}^{dim_{\mathbb{R}(\mathbb{K})}+2} , {\vert -\vert^2_\eta} \right) & \coloneqq \left(Mat_{2 \times 2}^{her}(\mathbb{K}), -det \right) \end{aligned} \,.

hence

  1. 2,1\mathbb{R}^{2,1} for 𝕂=\mathbb{K} = \mathbb{R};

  2. 3,1\mathbb{R}^{3,1} for 𝕂=\mathbb{K} = \mathbb{C};

  3. 5,1\mathbb{R}^{5,1} for 𝕂=\mathbb{K} = \mathbb{H};

  4. 9,1\mathbb{R}^{9,1} for 𝕂=𝕆\mathbb{K} = \mathbb{O}.

Here we think of the vector space on the left as p,1\mathbb{R}^{p,1} with

pdim (𝕂)+1 p \coloneqq dim_{\mathbb{R}}(\mathbb{K})+1

equipped with the canonical coordinates labeled (x μ) μ=0 p(x^\mu)_{\mu = 0}^p.

As a linear map the identification is given by

(x 0,x 1,,x d1)(x 0+x 1 y y * x 0x 1)withyx 21+x 3e 1+x 4e 2++x 2+dim (𝕂)e dim (𝕂)1. (x^0, x^1, \cdots, x^{d-1}) \;\mapsto\; \left( \array{ x^0 + x^1 & y \\ y^\ast & x^0 - x^1 } \right) \;\;\; \text{with}\; y \coloneqq x^2 1 + x^3 e_1 + x^4 e_2 + \cdots + x^{2 + dim_{\mathbb{R}(\mathbb{K})}} \,e_{dim_{\mathbb{R}}(\mathbb{K})-1} \,.

This means that the quadratic form || η 2{\vert - \vert^2_\eta} is given on an element v=(v μ) μ=0 pv = (v^\mu)_{\mu = 0}^p by

|v| η 2=(v 0) 2+j=1p(x j) 2. {\vert v \vert}^2_{\eta} \;=\; - (v^0)^2 + \underoverset{j = 1}{p}{\sum} (x^j)^2 \,.

By the polarization identity the quadratic form || η 2{\vert - \vert^2_\eta} induces a bilinear form

η: p,1 p,1 \eta \;\colon\; \mathbb{R}^{p,1}\otimes \mathbb{R}^{p,1} \longrightarrow \mathbb{R}

given by

η(v 1,v 2) =η μνv 1 μv 1 ν v 1 0v 2 0+j=1pv 1 jv 2 j. \begin{aligned} \eta(v_1, v_2) & = \eta_{\mu \nu} v_1^\mu v_1^\nu \\ & \coloneqq - v_1^0 v_2^0 + \underoverset{j = 1}{p}{\sum} v_1^j v_2^j \end{aligned} \,.

This is called the Minkowski metric.

Finally, under the above identification the operation of trace reversal from def. corresponds to time reversal in that

(x 0+x 1 y y * x 0x 1)˜=(x 0+x 1 y y * x 0x 1). \widetilde{ \left( \array{ x^0 + x^1 & y \\ y^\ast & x^0 - x^1 } \right) } \;=\; \left( \array{ -x^0 + x^1 & y \\ y^\ast & -x^0 - x^1 } \right) \,.
Proof

We need to check that under the given identification, the Minkowski norm-square is indeed given by minus the determinant on the corresponding hermitian matrices. This follows from the nature of the conjugation operation () *(-)^\ast from def. :

det(x 0+x 1 y y * x 0x 1) =(x 0+x 1)(x 0x 1)+yy * =(x 0) 2+i=1p(x i) 2. \begin{aligned} - det \left( \array{ x^0 + x^1 & y \\ y^\ast & x^0 - x^1 } \right) & = -(x^0 + x^1)(x^0 - x^1) + y y^\ast \\ & = -(x^0)^2 + \underoverset{i = 1}{p}{\sum} (x^i)^2 \end{aligned} \,.
Remark

(physical units of length)

As the term “metric” suggests, in application to physics, the Minkowski metric η\eta in prop./def. is regarded as a measure of length: for vΓ x(T p,1)v \in \Gamma_x(T \mathbb{R}^{p,1}) a tangent vector at a point xx in Minkowski spacetime, interpreted as a displacement from event xx to event x+vx + v, then

  1. if η(v,v)>0\eta(v,v) \gt 0 then

    η(v,v) \sqrt{\eta(v,v)} \in \mathbb{R}

    is interpreted as a measure for the spatial distance between xx and x+vx + v;

  2. if η(v,v)<0\eta(v,v) \lt 0 then

    η(v,v) \sqrt{-\eta(v,v)} \in \mathbb{R}

    is interpreted as a measure for the time distance between xx and x+vx + v.

But for this to make physical sense, an operational prescription needs to be specified that tells the experimentor how the real number η(v,v)\sqrt{\eta(v,v)} is to be translated into an physical distance between actual events in the observable universe.

Such an operational prescription is called a physical unit of length. For example “centimetercmcm is a physical unit of length, another one is “femtometerfmfm.

The combined information of a real number η(v,v)\sqrt{\eta(v,v)} \in \mathbb{R} and a physical unit of length such as meter, jointly written

η(v,v)cm \sqrt{\eta(v,v)} \, cm

is a prescription for finding actual distance in the observable universe. Alternatively

η(v,v)fm \sqrt{\eta(v,v)} \, fm

is another prescription, that translates the same real number η(v,v)\sqrt{\eta(v,v)} into another physical distance.

But of course they are related, since physical units form a torsor over the group >0\mathbb{R}_{\gt 0} of non-negative real numbers, meaning that any two are related by a unique rescaling. For example

fm=10 13cm, fm = 10^{-13} cm \,,

with 10 13 >010^{-13} \in \mathbb{R}_{\gt 0}.

This means that once any one prescription of turning real numbers into spacetime distances is specified, then any other such prescription is obtained from this by rescaling these real numbers. For example

η(v,v)fm =(10 13η(v,v))cm =10 26η(v,v)cm. \begin{aligned} \sqrt{\eta(v,v)} \, fm & = \left( 10^{-13} \sqrt{\eta(v,v)}\right) \,cm \\ & = \sqrt{ 10^{-26} \eta(v,v) } \, cm \end{aligned} \,.

The point to notice here is that, via the last line, we may think of this as rescaling the metric from η\eta to 10 30η10^{-30} \eta.

In quantum field theory physical units of length are typically expressed in terms of a physical unit of “action”, called “Planck's constant\hbar, via the combination of units called the Compton wavelength

(9) m=2πmc. \ell_m = \frac{2\pi \hbar}{m c} \,.

parameterized, in turn, by a physical unit of mass mm. For the mass of the electron, the Compton wavelength is

e=2πm ec386fm. \ell_e = \frac{2\pi \hbar}{m_e c} \sim 386 \, fm \,.

Another physical unit of length parameterized by a mass mm is the Schwarzschild radius r m2mG/c 2r_m \coloneqq 2 m G/c^2, where GG is the gravitational constant. Solving the equation

m = r m 2π/mc = 2mG/c 2 \array{ & \ell_m &=& r_m \\ \Leftrightarrow & 2\pi\hbar / m c &=& 2 m G / c^2 }

for mm yields the Planck mass

m P1πm =r=cG. m_{P} \coloneqq \tfrac{1}{\sqrt{\pi}} m_{\ell = r} = \sqrt{\frac{\hbar c}{G}} \,.

The corresponding Compton wavelength m P\ell_{m_{P}} is given by the Planck length P\ell_P

P12π m P=Gc 3. \ell_{P} \coloneqq \tfrac{1}{2\pi} \ell_{m_P} = \sqrt{ \frac{\hbar G}{c^3} } \,.
Definition

(Minkowski spacetime as a pseudo-Riemannian Cartesian space)

Prop./def. introduces Minkowski spacetime p,1\mathbb{R}^{p,1} for p+1{3,4,6,10}p+1 \in \{3,4,6,10\} as a a vector space p,1\mathbb{R}^{p,1} equipped with a norm || η{\vert - \vert_\eta}. The genuine spacetime corresponding to this is this vector space regaded as a Cartesian space, i.e. with smooth functions (instead of just linear maps) to it and from it (def. ). This still carries one copy of p,1\mathbb{R}^{p,1} over each point x p,1x \in \mathbb{R}^{p,1}, as its tangent space (example )

T x p,1 p,1 T_x \mathbb{R}^{p,1} \simeq \mathbb{R}^{p,1}

and the Cartesian space p,1\mathbb{R}^{p,1} equipped with the Lorentzian inner product from prop./def. on each tangent space T x p,1T_x \mathbb{R}^{p,1} (a “pseudo-Riemannian Cartesian space”) is Minkowski spacetime as such.

We write

(10)dvol Σdx 0dx 1dx pΩ p+1( p,1) dvol_\Sigma \;\coloneqq\; d x^0 \wedge d x^1 \wedge \cdots \wedge d x^p \in \Omega^{p+1}(\mathbb{R}^{p,1})

for the canonical volume form on Minkowski spacetime.

We use the Einstein summation convention: Expressions with repeated indices indicate summation over the range of indices.

For example a differential 1-form αΩ 1( p,1)\alpha \in \Omega^1(\mathbb{R}^{p,1}) on Minkowski spacetime may be expanded as

α=α μdx μ. \alpha = \alpha_\mu d x^\mu \,.

Moreover we use square brackets around indices to indicate skew-symmetrization. For example a differential 2-form βΩ 2( p,1)\beta \in \Omega^2(\mathbb{R}^{p,1}) on Minkowski spacetime may be expanded as

β =β μνdx μdx ν =β [μν]dx μdx ν \begin{aligned} \beta & = \beta_{\mu \nu} d x^\mu \wedge d x^\nu \\ & = \beta_{[\mu \nu]} d x^\mu \wedge d x^\nu \end{aligned}

\,

The identification of Minkowski spacetime (def. ) in the exceptional dimensions with the generalized Pauli matrices (prop./def. ) has some immediate useful implications:

Proposition

(Minkowski metric in terms of trace reversal)

In terms of the trace reversal operation ()˜\widetilde{(-)} from def. , the determinant operation on hermitian matrices (def. ) has the following alternative expression

det(A) =AA˜ =A˜A. \begin{aligned} -det(A) & = A \tilde A \\ & = \tilde A A \end{aligned} \,.

and the Minkowski inner product from prop. has the alternative expression

η(A,B) =12Re(tr(AB˜)) =12Re(tr(A˜B)). \begin{aligned} \eta(A,B) & = \tfrac{1}{2}Re(tr(A \tilde B)) \\ & = \tfrac{1}{2} Re(tr(\tilde A B)) \end{aligned} \,.

(Baez-Huerta 09, prop. 5)

Proposition

(special linear group SL(2,𝕂)SL(2,\mathbb{K}) acts by linear isometries on Minkowski spacetime )

For 𝕂{,,,𝕆}\mathbb{K} \in \{\mathbb{R}, \mathbb{C}, \mathbb{H}, \mathbb{O}\} one of the four real normed division algebras (prop. ) the special linear group SL(2,𝕂)SL(2,\mathbb{K}) acts on Minkowski spacetime p,1\mathbb{R}^{p,1} in dimension p+1{2+1,3+1,5+1.9+1}p+1 \in \{2+1, \,3+1, \, 5+1. \, 9+1\} (def. ) by linear isometries given under the identification with the Pauli matrices in prop./def. by conjugation:

SL(2,𝕂)× dim(𝕂+1,1) SL(2,𝕂)×Mat 2×2 herm(𝕂) Mat 2×2 herm(𝕂) dim(𝕂+1,1) (G,A) GAG \array{ SL(2,\mathbb{K}) \times \mathbb{R}^{dim(\mathbb{K}+1,1)} & \simeq & SL(2, \mathbb{K}) \times Mat^{herm}_{2 \times 2}(\mathbb{K}) &\overset{}{\longrightarrow}& Mat^{herm}_{2 \times 2}(\mathbb{K}) & \simeq & \mathbb{R}^{dim(\mathbb{K}+1,1)} \\ && (G, A) &\mapsto& G \, A \, G^\dagger }
Proof

For 𝕂{,,}\mathbb{K} \in \{\mathbb{R}, \mathbb{C}, \mathbb{H}\} this is immediate from matrix calculus, but we spell it out now. While the argument does not directly apply to the case 𝕂=𝕆\mathbb{K} = \mathbb{O} of the octonions, one can check that it still goes through, too.

First we need to see that the action is well defined. This follows from the associativity of matrix multiplication and the fact that forming conjugate transpose matrices is an antihomomorphism: (G 1G 2) =G 2 G 1 (G_1 G_2)^\dagger = G_2^\dagger G_1^\dagger. In particular this implies that the action indeed sends hermitian matrices to hermitian matrices:

(GAG ) =(G )=GA =AG =GAG . \begin{aligned} \left( G \, A \, G^\dagger \right)^\dagger & = \underset{= G}{\underbrace{\left( G^\dagger \right)}} \, \underset{= A}{\underbrace{A^\dagger}} \, G^\dagger \\ & = G \, A \, G^\dagger \end{aligned} \,.

By prop./def. such an action is an isometry precisely if it preserves the determinant. This follows from the multiplicative property of determinants: det(AB)=det(A)det(B)det(A B) = det(A) det(B) and their compativility with conjugate transposition: det(A )=det(A *)det(A^\dagger) = det(A^\ast), and finally by the assumption that GSL(2,𝕂)G \in SL(2,\mathbb{K}) is an element of the special linear group, hence that its determinant is 1𝕂1 \in \mathbb{K}:

det(GAG ) =det(G)=1det(A)det(G )=1 *=1 =det(A). \begin{aligned} det\left( G \, A \, G^\dagger \right) & = \underset{ = 1}{\underbrace{det(G)}} \, det(A) \, \underset{= 1^\ast = 1}{\underbrace{det(G^\dagger)}} \\ & = det(A) \end{aligned} \,.

In fact the special linear groups of linear isometries in prop. are the spin groups (def. below) in these dimensions.

exceptional spinors and real normed division algebras

Lorentzian
spacetime
dimension
AA\phantom{AA}spin groupnormed division algebra\,\, brane scan entry
3=2+13 = 2+1Spin(2,1)SL(2,)Spin(2,1) \simeq SL(2,\mathbb{R})A\phantom{A} \mathbb{R} the real numberssuper 1-brane in 3d
4=3+14 = 3+1Spin(3,1)SL(2,)Spin(3,1) \simeq SL(2, \mathbb{C})A\phantom{A} \mathbb{C} the complex numberssuper 2-brane in 4d
6=5+16 = 5+1Spin(5,1)Spin(5,1) \simeq SL(2,H)A\phantom{A} \mathbb{H} the quaternionslittle string
10=9+110 = 9+1Spin(9,1) {\simeq}SL(2,O)A\phantom{A} 𝕆\mathbb{O} the octonionsheterotic/type II string

This we explain now.

\,

Lorentz group and spin group

Definition

(Lorentz group)

For dd \in \mathbb{N}, write

O(d1,1)GL( d) O(d-1,1) \hookrightarrow GL(\mathbb{R}^d)

for the subgroup of the general linear group on those linear maps AA which preserve this bilinear form on Minkowski spacetime (def ), in that

η(A(),A())=η(,). \eta(A(-),A(-)) = \eta(-,-) \,.

This is the Lorentz group in dimension dd.

The elements in the Lorentz group in the image of the special orthogonal group SO(d1)O(d1,1)SO(d-1) \hookrightarrow O(d-1,1) are rotations in space. The further elements in the special Lorentz group SO(d1,1)SO(d-1,1), which mathematically are “hyperbolic rotations” in a space-time plane, are called boosts in physics.

One distinguishes the following further subgroups of the Lorentz group O(d1,1)O(d-1,1):

  • the proper Lorentz group

    SO(d1,1)O(d1,1) SO(d-1,1) \hookrightarrow O(d-1,1)

    is the subgroup of elements which have determinant +1 (as elements SO(d1,1)GL(d)SO(d-1,1)\hookrightarrow GL(d) of the general linear group);

  • the proper orthochronous (or restricted) Lorentz group

    SO +(d1,1)SO(d1,1) SO^+(d-1,1) \hookrightarrow SO(d-1,1)

    is the further subgroup of elements AA which preserve the time orientation of vectors vv in that (v 0>0)((Av) 0>0)(v^0 \gt 0) \Rightarrow ((A v)^0 \gt 0).

Proposition

(connected component of Lorentz group)

As a smooth manifold, the Lorentz group O(d1,1)O(d-1,1) (def. ) has four connected components. The connected component of the identity is the proper orthochronous Lorentz group SO +(3,1)SO^+(3,1) (def. ). The other three components are

  1. SO +(d1,1)PSO^+(d-1,1)\cdot P

  2. SO +(d1,1)TSO^+(d-1,1)\cdot T

  3. SO +(d1,1)PTSO^+(d-1,1)\cdot P T,

where, as matrices,

Pdiag(1,1,1,,1) P \coloneqq diag(1,-1,-1, \cdots, -1)

is the operation of point reflection at the origin in space, where

Tdiag(1,1,1,,1) T \coloneqq diag(-1,1,1, \cdots, 1)

is the operation of reflection in time and hence where

PT=TP=diag(1,1,,1) P T = T P = diag(-1,-1, \cdots, -1)

is point reflection in spacetime.

The following concept of the Clifford algebra (def. ) of Minkowski spacetime encodes the structure of the inner product space d1,1\mathbb{R}^{d-1,1} in terms of algebraic operation (“geometric algebra”), such that the action of the Lorentz group becomes represented by a conjugation action (example below). In particular this means that every element of the proper orthochronous Lorentz group may be “split in half” to yield a double cover: the spin group (def. below).

Definition

(Clifford algebra)

For dd \in \mathbb{N}, we write

Cl( d1,1) Cl(\mathbb{R}^{d-1,1})

for the /2\mathbb{Z}/2-graded associative algebra over \mathbb{R} which is generated from dd generators {Γ 0,Γ 1,Γ 2,,Γ d1}\{\Gamma_0, \Gamma_1, \Gamma_2, \cdots, \Gamma_{d-1}\} in odd degree (“Clifford generators”), subject to the relation

(11)Γ aΓ b+Γ bΓ a=2η ab \Gamma_{a} \Gamma_b + \Gamma_b \Gamma_a = - 2\eta_{a b}

where η\eta is the inner product of Minkowski spacetime as in def. .

These relations say equivalently that

Γ 0 2=+1 Γ i 2=1fori{1,,d1} Γ aΓ b=Γ bΓ aforab. \begin{aligned} & \Gamma_0^2 = +1 \\ & \Gamma_i^2 = -1 \;\; \text{for}\; i \in \{1,\cdots, d-1\} \\ & \Gamma_a \Gamma_b = - \Gamma_b \Gamma_a \;\;\; \text{for}\; a \neq b \end{aligned} \,.

We write

Γ a 1a p1p!permutationsσ(1) |σ|Γ a σ(1)Γ a σ(p) \Gamma_{a_1 \cdots a_p} \;\coloneqq\; \frac{1}{p!} \underset{{permutations \atop \sigma}}{\sum} (-1)^{\vert \sigma\vert } \Gamma_{a_{\sigma(1)}} \cdots \Gamma_{a_{\sigma(p)}}

for the antisymmetrized product of pp Clifford generators. In particular, if all the a ia_i are pairwise distinct, then this is simply the plain product of generators

Γ a 1a n=Γ a 1Γ a nifi,j(a ia j). \Gamma_{a_1 \cdots a_n} = \Gamma_{a_1} \cdots \Gamma_{a_n} \;\;\; \text{if} \; \underset{i,j}{\forall} (a_i \neq a_j) \,.

Finally, write

()¯:Cl( d1,1)Cl( d1,1) \overline{(-)} \;\colon\; Cl(\mathbb{R}^{d-1,1}) \longrightarrow Cl(\mathbb{R}^{d-1,1})

for the algebra anti-automorphism given by

Γ a¯Γ a \overline{\Gamma_a} \coloneqq \Gamma_a
Γ aΓ b¯Γ bΓ a. \overline{\Gamma_a \Gamma_b} \coloneqq \Gamma_b \Gamma_a \,.
Remark

(vectors inside Clifford algebra)

By construction, the vector space of linear combinations of the generators in a Clifford algebra Cl( d1,1)Cl(\mathbb{R}^{d-1,1}) (def. ) is canonically identified with Minkowski spacetime d1,1\mathbb{R}^{d-1,1} (def. )

()^: d1,1Cl( d1,1) \widehat{(-)} \;\colon\; \mathbb{R}^{d-1,1} \hookrightarrow Cl(\mathbb{R}^{d-1,1})

via

x aΓ a, x_a \mapsto \Gamma_a \,,

hence via

v=v ax av^=v aΓ a, v = v^a x_a \mapsto \hat v = v^a \Gamma_a \,,

such that the defining quadratic form on d1,1\mathbb{R}^{d-1,1} is identified with the anti-commutator in the Clifford algebra

η(v 1,v 2)=12(v^ 1v^ 2+v^ 2v^ 1), \eta(v_1,v_2) = -\tfrac{1}{2}( \hat v_1 \hat v_2 + \hat v_2 \hat v_1) \,,

where on the right we are, in turn, identifying \mathbb{R} with the linear span of the unit in Cl( d1,1)Cl(\mathbb{R}^{d-1,1}).

The key point of the Clifford algebra (def. ) is that it realizes spacetime reflections, rotations and boosts via conjugation actions:

Example

(Clifford conjugation)

For dd \in \mathbb{N} and d1,1\mathbb{R}^{d-1,1} the Minkowski spacetime of def. , let v d1,1v \in \mathbb{R}^{d-1,1} be any vector, regarded as an element v^Cl( d1,1)\hat v \in Cl(\mathbb{R}^{d-1,1}) via remark .

Then

  1. the conjugation action v^Γ a 1v^Γ a\hat v \mapsto -\Gamma_a^{-1} \hat v \Gamma_a of a single Clifford generator Γ a\Gamma_a on v^\hat v sends vv to its

reflection at the hyperplane x a=0x_a = 0;

  1. the conjugation action

    v^exp(α2Γ ab)v^exp(α2Γ ab) \hat v \mapsto \exp(- \tfrac{\alpha}{2} \Gamma_{a b}) \hat v \exp(\tfrac{\alpha}{2} \Gamma_{a b})

    sends vv to the result of rotating it in the (a,b)(a,b)-plane through an angle α\alpha.

Proof

This is immediate by inspection:

For the first statement, observe that conjugating the Clifford generator Γ b\Gamma_b with Γ a\Gamma_a yields Γ b\Gamma_b up to a sign, depending on whether a=ba = b or not:

Γ a 1Γ bΓ a={Γ b |ifa=b Γ b |otherwise. - \Gamma_a^{-1} \Gamma_b \Gamma_a = \left\{ \array{ -\Gamma_b & \vert \text{if}\, a = b \\ \Gamma_b & \vert \text{otherwise} } \right. \,.

Therefore for v^=v bΓ b\hat v = v^b \Gamma_b then Γ a 1v^Γ a\Gamma_a^{-1} \hat v \Gamma_a is the result of multiplying the aa-component of vv by 1-1.

For the second statement, observe that

12[Γ ab,Γ c]=Γ aη bcΓ bη ac. -\tfrac{1}{2}[\Gamma_{a b}, \Gamma_c] = \Gamma_a \eta_{b c} - \Gamma_b \eta_{a c} \,.

This is the canonical action of the Lorentzian special orthogonal Lie algebra 𝔰𝔬(d1,1)\mathfrak{so}(d-1,1). Hence

exp(α2Γ ab)v^exp(α2Γ ab)=exp(12[Γ ab,])(v^) \exp(-\tfrac{\alpha}{2} \Gamma_{ab}) \hat v \exp(\tfrac{\alpha}{2} \Gamma_{ab}) = \exp(\tfrac{1}{2}[\Gamma_{a b}, -])(\hat v)

is the rotation action as claimed.

Remark

Since the reflections, rotations and boosts in example are given by conjugation actions, there is a crucial ambiguity in the Clifford elements that induce them:

  1. the conjugation action by Γ a\Gamma_a coincides precisely with the conjugation action by Γ a-\Gamma_a;

  2. the conjugation action by exp(α4Γ ab)\exp(\tfrac{\alpha}{4} \Gamma_{a b}) coincides precisely with the conjugation action by exp(α2Γ ab)-\exp(\tfrac{\alpha}{2}\Gamma_{a b}).

Definition

(spin group)

For dd \in \mathbb{N}, the spin group Spin(d1,1)Spin(d-1,1) is the group of even graded elements of the Clifford algebra Cl( d1,1)Cl(\mathbb{R}^{d-1,1}) (def. ) which are unitary with respect to the conjugation operation ()¯\overline{(-)} from def. :

Spin(d1,1){ACl( d1,1) even|A¯A=1}. Spin(d-1,1) \;\coloneqq\; \left\{ A \in Cl(\mathbb{R}^{d-1,1})_{even} \;\vert\; \overline{A} A = 1 \right\} \,.
Proposition

The function

Spin(d1,1)GL( d1,1) Spin(d-1,1) \longrightarrow GL(\mathbb{R}^{d-1,1})

from the spin group (def. ) to the general linear group in dd-dimensions given by sending ASpin(d1,1)Cl( d1,1)A \in Spin(d-1,1) \hookrightarrow Cl(\mathbb{R}^{d-1,1}) to the conjugation action

A¯()A \overline{A}(-) A

(via the identification of Minkowski spacetime as the subspace of the Clifford algebra containing the linear combinations of the generators, according to remark )

is

  1. a group homomorphism onto the proper orthochronous Lorentz group (def. ):

    Spin(d1,1)SO +(d1,1) Spin(d-1,1) \longrightarrow SO^+(d-1,1)
  2. exhibiting a /2\mathbb{Z}/2-central extension.

Proof

That the function is a group homomorphism into the general linear group, hence that it acts by linear transformations on the generators follows by using that it clearly lands in automorphisms of the Clifford algebra.

That the function lands in the Lorentz group O(d1,1)GL(d)O(d-1,1) \hookrightarrow GL(d) follows from remark :

η(A¯v 1A,A¯v 2A) =12((A¯v^ 1A)(A¯v^ 2A)+(A¯v^ 2A)(A¯v^ 1A)) =12(A¯(v^ 1v^ 2+v^ 2v^ 1)A) =A¯A12(v^ 1v^ 2+v^ 2v^ 1) =η(v 1,v 2). \begin{aligned} \eta(\overline{A}v_1A , \overline{A} v_2 A) &= \tfrac{1}{2} \left( \left(\overline{A} \hat v_1 A\right) \left(\overline{A}\hat v_2 A\right) + \left(\overline{A} \hat v_2 A\right) \left(\overline{A} \hat v_1 A\right) \right) \\ & = \tfrac{1}{2} \left( \overline{A}(\hat v_1 \hat v_2 + \hat v_2 \hat v_1) A \right) \\ & = \overline{A} A \tfrac{1}{2}\left( \hat v_1 \hat v_2 + \hat v_2 \hat v_1\right) \\ & = \eta(v_1, v_2) \end{aligned} \,.

That it moreover lands in the proper Lorentz group SO(d1,1)SO(d-1,1) follows from observing (example ) that every reflection is given by the conjugation action by a linear combination of generators, which are excluded from the group Spin(d1,1)Spin(d-1,1) (as that is defined to be in the even subalgebra).

To see that the homomorphism is surjective, use that all elements of SO(d1,1)SO(d-1,1) are products of rotations in hyperplanes. If a hyperplane is spanned by the bivector (ω ab)(\omega^{a b}), then such a rotation is given, via example by the conjugation action by

exp(α2ω abΓ ab) \exp(\tfrac{\alpha}{2} \omega^{a b}\Gamma_{a b})

for some α\alpha, hence is in the image.

That the kernel is /2\mathbb{Z}/2 is clear from the fact that the only even Clifford elements which commute with all vectors are the multiples aCl( d1,1)a \in \mathbb{R} \hookrightarrow Cl(\mathbb{R}^{d-1,1}) of the identity. For these a¯=a\overline{a} = a and hence the condition a¯a=1\overline{a} a = 1 is equivalent to a 2=1a^2 = 1. It is clear that these two elements {+1,1}\{+1,-1\} are in the center of Spin(d1,1)Spin(d-1,1). This kernel reflects the ambiguity from remark .

\,

Spinors in dimensions 3, 4, 6 and 10

We now discuss how real spin representations (def. ) in spacetime dimensions 3,4, 6 and 10 are naturally induced from linear algebra over the four real alternative division algebras (prop. ).

Definition

(Clifford algebra via normed division algebra)

Let 𝕂\mathbb{K} be one of the four real normed division algebras from prop. , hence one of the four real alternative division algebras from prop. .

Define a real linear map

Γ: dim (𝕂)+1,1End (𝕂 4) \Gamma \;\colon\; \mathbb{R}^{dim_{\mathbb{R}}(\mathbb{K})+1,1} \longrightarrow End_{\mathbb{R}}(\mathbb{K}^4)

from (the real vector space underlying) Minkowski spacetime to real linear maps on 𝕂 4\mathbb{K}^4

Γ(A)(ψ ϕ)(A˜ϕ Aψ). \Gamma(A) \left( \array{ \psi \\ \phi } \right) \;\coloneqq\; \left( \array{ - \tilde A \phi \\ A \psi } \right) \,.

Here on the right we are using the isomorphism from prop. for identifying a spacetime vector with a 2×22 \times 2-matrix, and we are using the trace reversal (˜)\widetilde(-) from def. .

Remark

(Clifford multiplication via octonion-valued matrices)

Each operation of Γ(A)\Gamma(A) in def. is clearly a linear map, even for 𝕂\mathbb{K} being the non-associative octonions. The only point to beware of is that for 𝕂\mathbb{K} the octonions, then the composition of two such linear maps is not in general given by the usual matrix product.

Proposition

(real spin representations via normed division algebras)

The map Γ\Gamma in def. gives a representation of the Clifford algebra Cl( dim (𝕂)+1,1)Cl(\mathbb{R}^{dim_{\mathbb{R}}(\mathbb{K})+1,1} ) (this def.), i.e of

  1. Cl( 2,1)Cl(\mathbb{R}^{2,1}) for 𝕂=\mathbb{K} = \mathbb{R};

  2. Cl( 3,1)Cl(\mathbb{R}^{3,1}) for 𝕂=\mathbb{K} = \mathbb{C};

  3. Cl( 5,1)Cl(\mathbb{R}^{5,1}) for 𝕂=\mathbb{K} = \mathbb{H};

  4. Cl( 9,1)Cl(\mathbb{R}^{9,1}) for 𝕂=𝕆\mathbb{K} = \mathbb{O}.

Hence this Clifford representation induces representations of the spin group Spin(dim (𝕂)+1,1)Spin(dim_{\mathbb{R}}(\mathbb{K})+1,1) on the real vector spaces

S ±𝕂 2. S_{\pm } \coloneqq \mathbb{K}^2 \,.

and hence on

SS +S . S \coloneqq S_+ \oplus S_- \,.

(Baez-Huerta 09, p. 6)

Proof

We need to check that the Clifford relation

(Γ(A)) 2 =η(A,A)1 =+det(A) \begin{aligned} (\Gamma(A))^2 & = -\eta(A,A)1 \\ & = + det(A) \end{aligned}

is satisfied (where we used (11) and (8)). Now by definition, for any (ϕ,ψ)𝕂 4(\phi,\psi) \in \mathbb{K}^4 then

(Γ(A)) 2(ϕ ψ)=(A˜(Aϕ) A(A˜ψ)), (\Gamma(A))^2 \left( \array{ \phi \\ \psi } \right) \;=\; - \left( \array{ \tilde A(A \phi) \\ A(\tilde A \psi) } \right) \,,

where on the right we have in each component ordinary matrix product expressions.

Now observe that both expressions on the right are sums of triple products that involve either one real factor or two factors that are conjugate to each other:

A(A˜ψ) =(x 0+x 1 y y * x 0x 1)((x 0+x 1)ϕ 1+yϕ 2 y *ϕ 1(x 0+x 1)ϕ 2) =((x 0 2+x 1 2)ϕ 1+(x 0+x 1)(yϕ 2)+y(y *ϕ 1)y((x 0+x 1)ϕ 2) ). \begin{aligned} A (\tilde A \psi) & = \left( \array{ x_0 + x_1 & y \\ y^\ast & x_0 - x_1 } \right) \cdot \left( \array{ (-x_0 + x_1) \phi_1 + y \phi_2 \\ y^\ast \phi_1 - (x_0 + x_1)\phi_2 } \right) \\ & = \left( \array{ (-x_0^2 + x_1^2) \phi_1 + (x_0 + x_1)(y \phi_2) + y (y^\ast \phi_1) - y( (x_0 + x_1) \phi_2 ) \\ \cdots } \right) \end{aligned} \,.

Since the associators of triple products that involve a real factor and those involving both an element and its conjugate vanish by prop. (hence ultimately by Artin’s theorem, prop. ). In conclusion all associators involved vanish, so that we may rebracket to obtain

(Γ(A)) 2(ϕ ψ)=((A˜A)ϕ (AA˜)ψ). (\Gamma(A))^2 \left( \array{ \phi \\ \psi } \right) \;=\; - \left( \array{ (\tilde A A) \phi \\ (A \tilde A) \psi } \right) \,.

This implies the statement via the equality AA˜=A˜A=det(A)-A \tilde A = -\tilde A A = det(A) (prop. ).

Proposition

(spinor bilinear pairings)

Let 𝕂\mathbb{K} be one of the four real normed division algebras and S ± 𝕂 2S_\pm \simeq_{\mathbb{R}}\mathbb{K}^2 the corresponding spin representation from prop. .

Then there are bilinear maps from two spinors (according to prop. ) to the real numbers

()¯():S +S \overline{(-)}(-) \;\colon\; S_+ \otimes S_-\longrightarrow \mathbb{R}

as well as to dim(𝕂+1,1)\mathbb{R}^{dim(\mathbb{K}+1,1)}

()¯Γ():S ±S ± dim(𝕂+1,1) \overline{(-)}\Gamma (-) \;\colon\; S_\pm \otimes S_{\pm}\longrightarrow \mathbb{R}^{dim(\mathbb{K}+1,1)}

given, respectively, by forming the real part (def. ) of the canonical 𝕂\mathbb{K}-inner product

()¯():S +S \overline{(-)}(-) \colon S_+\otimes S_- \longrightarrow \mathbb{R}
(ψ,ϕ)ψ¯ϕRe(ψ ϕ) (\psi,\phi)\mapsto \overline{\psi} \phi \coloneqq Re(\psi^\dagger \cdot \phi)

and by forming the product of a column vector with a row vector to produce a matrix, possibly up to trace reversal (def. ) under the identification dim(𝕂)+1,1Mat 2×2 her(𝕂)\mathbb{R}^{dim(\mathbb{K})+1,1} \simeq Mat^{her}_{2 \times 2}(\mathbb{K}) from prop. :

S +S + dim(𝕂)+1,1 S_+ \otimes S_+ \longrightarrow \mathbb{R}^{dim(\mathbb{K})+1,1}
(ψ,ϕ)ψ¯Γϕψϕ +ϕψ ˜ (\psi , \phi) \mapsto \overline{\psi}\Gamma \phi \coloneqq \widetilde{\psi \phi^\dagger + \phi \psi^\dagger}

and

S S dim(𝕂+1,1) S_- \otimes S_- \longrightarrow \mathbb{R}^{dim(\mathbb{K}+1,1)}
(ψ,ϕ)ψϕ +ϕψ (\psi , \phi) \mapsto {\psi \phi^\dagger + \phi \psi^\dagger}

For AMat 2×2 her(𝕂)A \in Mat^{her}_{2 \times 2}(\mathbb{K}) the AA-component of this map is

η(ψ¯Γϕ,A)=Re(ψ (Aϕ)). \eta(\overline{\psi}\Gamma \phi, A) = Re (\psi^\dagger (A\phi)) \,.

These pairings have the following properties

  1. both are Spin(dim(𝕂)+1,1)Spin(dim(\mathbb{K})+1,1)-equivalent;

  2. the pairing ()¯Γ()\overline{(-)}\Gamma(-) is symmetric:

    (12)ψ 1¯Γψ 2=+ϕ 2¯Γψ 1AAAAforAAψ 1,ψ 2S +S \overline{\psi_1} \,\Gamma\, \psi_2 = + \overline{\phi_2}\, \Gamma\, \psi_1 \phantom{AAAA} \text{for} \phantom{AA} \psi_1, \psi_2 \in S_+ \oplus S_-

(Baez-Huerta 09, prop. 8, prop. 9).

Remark

(two-component spinor notation)

In the physics/QFT literature the expressions for spin representations given by prop. are traditionally written in two-component spinor notation as follows:

  • An element of S +S_+ is denoted (χ a𝕂) a=1,2(\chi_a \in \mathbb{K})_{a = 1,2} and called a left handed spinor;

  • an element of S S_- is denoted (ξ a˙) a˙=1,2(\xi^{\dagger \dot a})_{\dot a = 1,2} and called a right handed spinor;

  • an element of S=S +S S = S_+ \oplus S_- is denoted

    (13)(ψ α)=((χ a),(ξ a˙)) (\psi^\alpha) = \left( (\chi_a), (\xi^{\dagger \dot a}) \right)

    and called a Dirac spinor;

and the Clifford action of prop. corresponds to the generalized “Pauli matrices”:

  • a hermitian matrix AMat 2×2 her(𝕂)A \in Mat^{her}_{2\times 2}(\mathbb{K}) as in prop regarded as a linear map S S +S_- \to S_+ via def. is denoted

    (x μσ aa˙ μ)(x 0+x 1 y y * x 0x 1); \left(x_\mu \sigma^\mu_{a \dot a}\right) \;\coloneqq\; \left( \array{ x_0 + x_1 & y \\ y^\ast & x_0 - x_1 } \right) \,;
  • the negative of the trace-reversal (def. ) of such a hermitian matrix, regarded as a linear map S +S S_+ \to S_-, is denoted

    (x μσ˜ μa˙a)(x 0+x 1 y y * x 0x 1). \left( x_\mu \widetilde \sigma^{\mu \dot a a} \right) \;\coloneqq\; - \left( \array{ -x_0 + x_1 & y \\ y^\ast & -x_0 - x_1 } \right) \,.
  • the corresponding Clifford generator Γ(A):S +S S +S \Gamma(A) \;\colon\; S_+ \oplus S_- \to S_+ \oplus S_- (def. ) is denoted

    x μ(γ μ) αβ(0 x μσ ab˙ μ x μσ˜ μa˙b) x_\mu (\gamma^\mu)_{\alpha \beta} \;\coloneqq\; \left( \array{ 0 & x_\mu \sigma^\mu_{a \dot b} \\ x_\mu \widetilde \sigma^{\mu \dot a b} } \right)
  • the bilinear spinor-to-vector pairing from prop. is written as the matrix multiplication

    (ψ¯γ μϕ)ψ¯Γϕ, \left( \overline{\psi} \, \gamma^\mu \, \phi\right) \;\coloneqq\; \overline{\psi}\,\Gamma \,\phi \,,

    where the Dirac conjugate ψ¯\overline{\psi} on the left is given on (ψ α)=(χ a,ξ c˙)(\psi_\alpha) = (\chi_a, \xi^{\dagger \dot c}) by

    (14)ψ¯ ψ γ 0 =(ξ a,χ a˙ ) \begin{aligned} \overline{\psi} & \coloneqq \psi^\dagger \gamma^0 \\ & = ( \xi^a, \chi^\dagger_{\dot a} ) \end{aligned}

    hence, with (13):

    (15)ψ 1¯γ μψ 2 =ψ 1 γ 0γ μψ 2 =(ξ 1) aσ ac˙ μ(ξ 2) c˙+(χ 1) a˙ σ˜ μa˙c(χ 2) c \begin{aligned} \overline{\psi_1} \,\gamma^\mu\, \psi_2 & = \psi_1^\dagger \, \gamma^0 \gamma^\mu \, \psi_2 \\ & = (\xi_1)^a \, \sigma^\mu_{a \dot c}\, (\xi_2)^{\dagger \dot c} + (\chi_1)^\dagger_{\dot a} \, \widetilde \sigma^{\mu \dot a c} \, (\chi_2)_c \end{aligned}

Finally, it is common to abbreviate contractions with the Clifford algebra generators (γ μ)(\gamma^\mu) by a slash, as in

k/γ μk μ k\!\!\!/\, \;\coloneqq\; \gamma^\mu k_\mu

or

(16)i/iγ μx μ. i \partial\!\!\!/\, \;\coloneqq\; i \gamma^\mu \frac{\partial}{\partial x^\mu} \,.

This is called the Feynman slash notation.

(e.g. Dermisek I-8, Dermisek I-9)

Below we spell out the example of the Lagrangian field theory of the Dirac field in detail (example ). For discussion of massive chiral spinor fields one also needs the following, here we just mention this for completeness:

Proposition

(chiral spinor mass pairing)

In dimension 2+1 and 3+1, there exists a non-trivial skew-symmetric pairing

ϵ:SS \epsilon \;\colon\; S \wedge S \longrightarrow \mathbb{R}

which may be normalized such that in the two-component spinor basis of remark we have

(17)σ˜ μa˙a=ϵ abϵ a˙b˙σ bb˙ μ. \tilde \sigma^{\mu \dot a a} = \epsilon^{a b} \epsilon^{\dot a \dot b} \sigma^\mu_{b \dot b} \,.
Proof

Take the non-vanishing components of ϵ\epsilon to be

ϵ 12=ϵ 1˙2˙=ϵ 21=ϵ 2˙1˙=1 \epsilon^{1 2} = \epsilon^{\dot 1 \dot 2} = \epsilon_{21} = \epsilon_{\dot 2 \dot 1} = 1

and

ϵ 21=ϵ 2˙1˙=ϵ 12=ϵ 1˙2˙=1. \epsilon^{2 1} = \epsilon^{\dot 2 \dot 1} = \epsilon_{1 2} = \epsilon_{\dot 1 \dot 2} = -1 \,.

With this equation (17) is checked explicitly. It is clear that ϵ\epsilon thus defined is skew symmetric as long as the component algebra is commutative, which is the case for 𝕂\mathbb{K} being \mathbb{R} or \mathbb{C}.

\,

Causal structure

We need to consider the following concepts and constructions related to the causal structure of Minkowski spacetime Σ\Sigma (def. ).

Definition

(spacelike, timelike, lightlike directions; past and future)

Given two points x,yΣx,y \in \Sigma in Minkowski spacetime (def. ), write

vyx p,1 v \coloneqq y - x \in \mathbb{R}^{p,1}

for their difference, using the vector space structure underlying Minkowski spacetime.

Recall the Minkowski inner product η\eta on p,1\mathbb{R}^{p,1}, given by prop./def. . Then via remark we say that the difference vector vv is

  1. spacelike if η(v,v)>0\eta(v,v) \gt 0,

  2. timelike if η(v,v)<0\eta(v,v) \lt 0,

  3. lightlike if η(v,v)=0\eta(v,v) = 0.

If vv is timelike or lightlike then we say that

  1. yy is in the future of xx if y 0x 00y^0 - x^0 \geq 0;

  2. yy is in the past of xx if y 0x 00y^0 - x^0 \leq 0.

Definition

(causal cones)

For xΣx \in \Sigma a point in spacetime (an event), we write

V +(x),V (x)Σ V^+(x), V^-(x) \subset \Sigma

for the subsets of events that are in the timelike future or in the timelike past of xx, respectively (def. ) called the open future cone and open past cone, respectively, and

V¯ +(x),V¯ (x)Σ \overline{V}^+(x), \overline{V}^-(x) \subset \Sigma

for the subsets of events that are in the timelike or lightlike future or past, respectivel, called the closed future cone and closed past cone, respectively.

The union

J(x)V¯ +(x)V¯ (x) J(x) \coloneqq \overline{V}^+(x) \cup \overline{V}^-(x)

of the closed future cone and past cone is called the full causal cone of the event xx. Its boundary is the light cone.

More generally for SΣS \subset \Sigma a subset of events we write

V¯ ±(S)xSV¯ ±(x) \overline{V}^\pm(S) \;\coloneqq\; \underset{x \in S}{\cup} \overline{V}^{\pm}(x)

for the union of the future/past closed cones of all events in the subset.

Definition

(compactly sourced causal support)

Consider a vector bundle EΣE \overset{}{\to} \Sigma (def. ) over Minkowski spacetime (def. ).

Write Γ Σ(E)\Gamma_{\Sigma}(E) for the spaces of smooth sections (def. ), and write

Γ cp(E) compact support Γ Σ,±cp(E) compactly sourced future/past support Γ Σ,scp(E) spacelike compact support Γ Σ,(f/p)cp(E) future/past compact support Γ Σ,tcp(E) timelike compact support \begin{aligned} \Gamma_{cp}(E) & \,\text{compact support} \\ \Gamma_{\Sigma,\pm cp}(E) & \,\text{compactly sourced future/past support} \\ \Gamma_{\Sigma,scp}(E) & \,\text{spacelike compact support} \\ \Gamma_{\Sigma,(f/p)cp}(E) & \,\text{future/past compact support} \\ \Gamma_{\Sigma,tcp}(E) & \,\text{timelike compact support} \end{aligned}

for the subsets on those smooth sections whose support is

  1. (cpcp) inside a compact subset,

  2. (±cp\pm cp) inside the closed future cone/closed past cone, respectively, of a compact subset,

  3. (scpscp) inside the closed causal cone of a compact subset, which equivalently means that the intersection with every (spacelike) Cauchy surface is compact (Sanders 13, theorem 2.2),

  4. (fcpfcp) inside the past of a Cauchy surface (Sanders 13, def. 3.2),

  5. (pcppcp) inside the future of a Cauchy surface (Sanders 13, def. 3.2),

  6. (tcptcp) inside the future of one Cauchy surface and the past of another (Sanders 13, def. 3.2).

(Bär 14, section 1, Khavkine 14, def. 2.1)

Definition

(causal order)

Consider the relation on the set P(Σ)P(\Sigma) of subsets of spacetime which says a subset S 1ΣS_1 \subset \Sigma is not prior to a subset S 2ΣS_2 \subset \Sigma, denoted S 1S 2S_1 {\vee\!\!\!\wedge} S_2, if S 1S_1 does not intersect the causal past of S 2S_2 (def. ), or equivalently that S 2S_2 does not intersect the causal future of S 1S_1:

S 1S 2 S 1V¯ (S 2)= S 2V¯ +(S 1)=. \begin{aligned} S_1 {\vee\!\!\!\wedge} S_2 & \;\;\coloneqq\;\; S_1 \cap \overline{V}^-(S_2) = \emptyset \\ & \;\;\Leftrightarrow\;\; S_2 \cap \overline{V}^+(S_1) = \emptyset \end{aligned} \,.

(Beware that this is just a relation, not an ordering, since it is not relation.)

If S 1S 2S_1 {\vee\!\!\!\wedge} S_2 and S 2S 1S_2 {\vee\!\!\!\wedge} S_1 we say that the two subsets are spacelike separated and write

S 1><S 2S 1S 2andS 2S 1. S_1 {\gt\!\!\!\!\lt} S_2 \;\;\;\coloneqq\;\;\; S_1 {\vee\!\!\!\wedge} S_2 \;\text{and}\; S_2 {\vee\!\!\!\wedge} S_1 \,.
Definition

(causal complement and causal closure of subset of spacetime)

For SXS \subset X a subset of spacetime, its causal complement S S^\perp is the complement of the causal cone:

S SJ X(S). S^\perp \;\coloneqq\; S \setminus J_X(S) \,.

The causal complement S S^{\perp \perp} of the causal complement S S^\perp is called the causal closure. If

S=S S = S^{\perp \perp}

then the subset SS is called a causally closed subset.

Given a spacetime Σ\Sigma, we write

CausClsdSubsets(Σ)Cat CausClsdSubsets(\Sigma) \;\in\; Cat

for the partially ordered set of causally closed subsets, partially ordered by inclusion 𝒪 1𝒪 2\mathcal{O}_1 \subset \mathcal{O}_2.

Definition

(adiabatic switching)

For a causally closed subset 𝒪Σ\mathcal{O} \subset \Sigma of spacetime (def. ) say that an adiabatic switching function or infrared cutoff function for 𝒪\mathcal{O} is a smooth function g swg_{sw} of compact support (a bump function) whose restriction to some neighbourhood UU of 𝒪\mathcal{O} is the constant function with value 11:

Cutoffs(𝒪){g swC c (Σ)|U𝒪neighbourhood(g sw| U=1)}. Cutoffs(\mathcal{O}) \;\coloneqq\; \left\{ g_{sw} \in C^\infty_c(\Sigma) \;\vert\; \underset{ {U \supset \mathcal{O}} \atop { \text{neighbourhood} } }{\exists} \left( g_{sw}\vert_U = 1 \right) \right\} \,.

Often we consider the vector space space C (Σ)gC^\infty(\Sigma)\langle g \rangle spanned by a formal variable gg (the coupling constant) under multiplication with smooth functions, and consider as adiabatic switching functions the corresponding images in this space,

C c (Σ) C c (X)g \array{ C_c^\infty(\Sigma) &\overset{\simeq}{\longrightarrow}& C_c^\infty(X)\langle g\rangle }

which are thus bump functions constant over a neighbourhood UU of 𝒪\mathcal{O} not on 1 but on the formal parameter gg:

g sw| U=g g_{sw}\vert_U = g \,

In this sense we may think of the adiabatic switching as being the spacetime-depependent coupling “constant”.

The following lemma will be key in the derivation (proof of prop. below) of the causal locality of algebra of quantum observables in perturbative quantum field theory:

Lemma

(causal partition)

Let 𝒪Σ\mathcal{O} \subset \Sigma be a causally closed subset (def. ) and let fC cp (Σ)f \in C^\infty_{cp}(\Sigma) be a compactly supported smooth function which vanishes on a neighbourhood U𝒪U \supset \mathcal{O}, i.e. f| U=0f\vert_U = 0.

Then there exists a causal partition of ff in that there exist compactly supported smooth functions a,rC cp (Σ)a,r \in C^\infty_{cp}(\Sigma) such that

  1. they sum up to ff:

    f=a+r f = a + r
  2. their support satisfies the following causal ordering (def. )

    supp(a)𝒪supp(r). supp(a) {\vee\!\!\!\wedge} \mathcal{O} {\vee\!\!\!\wedge} supp(r) \,.
Proof idea

By assumption 𝒪\mathcal{O} has a Cauchy surface. This may be extended to a Cauchy surface Σ p\Sigma_p of Σ\Sigma, such that this is one leaf of a foliation of Σ\Sigma by Cauchy surfaces, given by a diffeomorphism Σ(1,1)×Σ p\Sigma \simeq (-1,1) \times \Sigma_p with the original Σ p\Sigma_p at zero. There exists then ϵ(0,1)\epsilon \in (0,1) such that the restriction of supp(f)supp(f) to the interval (ϵ,ϵ)(-\epsilon, \epsilon) is in the causal complement 𝒪¯\overline{\mathcal{O}} of the given region (def. ):

supp(f)(ϵ,ϵ)×Σ p𝒪¯. supp(f) \cap (-\epsilon, \epsilon) \times \Sigma_p \;\subset\; \overline{\mathcal{O}} \,.

Let then χ:Σ\chi \colon \Sigma \to \mathbb{R} be any smooth function with

  1. χ| (1,0]×Σ p=1\chi\vert_{(-1,0] \times \Sigma_p} = 1

  2. χ| (ϵ,1)×Σ p=0\chi\vert_{(\epsilon,1) \times \Sigma_p} = 0.

Then

rχfAAAandAAAa(1χ)f r \coloneqq \chi \cdot f \phantom{AAA} \text{and} \phantom{AAA} a \coloneqq (1-\chi) \cdot f

are smooth functions as required.

\,

This concludes our discussion of spin and spacetime. In the next chapter we consider the concept of fields on spacetime.

Fields

In this chapter we discuss these topics:

\,

A field history on a given spacetime Σ\Sigma (a history of spatial field configurations, see remark below) is a quantity assigned to each point of spacetime (each event), such that this assignment varies smoothly with spacetime points. For instance an electromagnetic field history (example below) is at each point of spacetime a collection of vectors that encode the direction in which a charged particle passing through that point would feel a force (the “Lorentz force”, see example below).

This is readily formalized (def. below): If FF denotes the smooth manifold of “values” that the given kind of field may take at any spacetime point, then a field history Φ\Phi is modeled as a smooth function from spacetime to this space of values:

Φ:ΣF. \Phi \;\colon\; \Sigma \longrightarrow F \,.

It will be useful to unify spacetime and the space of field values (the field fiber) into a single manifold, the Cartesian product

EΣ×F E \;\coloneqq\; \Sigma \times F

and to think of this equipped with the projection map onto the first factor as a fiber bundle of spaces of field values over spacetime

E Σ×F fb pr 1 Σ. \array{ E &\coloneqq& \Sigma \times F \\ {}^{\mathllap{fb}}\downarrow & \swarrow_{\mathrlap{pr_1}} \\ \Sigma } \,.

This is then called the field bundle, which specifies the kind of values that the given field species may take at any point of spacetime. Since the space FF of field values is the fiber of this fiber bundle (def. ), it is sometimes also called the field fiber. (See also at fiber bundles in physics.)

Given a field bundle EfbΣE \overset{fb}{\to}\Sigma, then a field history is a section of that bundle (def. ). The discussion of field theory concerns the space of all possible field histories, hence the space of sections of the field bundle (example below). This is a very “large” generalized smooth space, called a diffeological space (def. below).

Or rather, in the presence of fermion fields such as the Dirac field (example below), the Pauli exclusion principle demands that the field bundle is a super-manifold, and that the fermionic space of field histories (example below) is a super-geometric generalized smooth space: a super smooth set (def. below).

This smooth structure on the space of field histories will be crucial when we discuss observables of a field theory below, because these are smooth functions on the space of field histories. In particular it is this smooth structure which allows to derive that linear observables of a free field theory are given by distributions (prop. ) below. Among these are the point evaluation observables (delta distributions) which are traditionally denoted by the same symbol as the field histories themselves.

Hence there are these aspects of the concept of “field” in physics, which are closely related, but crucially different:

\,

aspects of the concept of fields

aspecttermtypedescriptiondef.
field componentϕ a\phi^a, ϕ ,μ a\phi^a_{,\mu}J Σ (E)J^\infty_\Sigma(E) \to \mathbb{R}coordinate function on jet bundle of field bundledef. , def.
field historyΦ\Phi, Φx μ\frac{\partial \Phi}{\partial x^\mu}ΣJ Σ (E)\Sigma \to J^\infty_\Sigma(E)jet prolongation of section of field bundledef. , def.
field observableΦ a(x)\mathbf{\Phi}^a(x), μΦ a(x),\partial_{\mu} \mathbf{\Phi}^a(x), Γ Σ(E)\Gamma_{\Sigma}(E) \to \mathbb{R}derivatives of delta-functional on space of sectionsdef. , example
averaging of field observableα *Σα a *(x)Φ a(x)dvol Σ(x)\alpha^\ast \mapsto \underset{\Sigma}{\int} \alpha^\ast_a(x) \mathbf{\Phi}^a(x) \, dvol_\Sigma(x)Γ Σ,cp(E *)Obs(E scp,L)\Gamma_{\Sigma,cp}(E^\ast) \to Obs(E_{scp},\mathbf{L})observable-valued distributiondef.
algebra of quantum observables(Obs(E,L) μc,)\left( Obs(E,\mathbf{L})_{\mu c},\, \star\right)Alg\mathbb{C}Algnon-commutative algebra structure on field observablesdef. , def.

\,

field bundles

Definition

(fields and field histories)

Given a spacetime Σ\Sigma, then a type of fields on Σ\Sigma is a smooth fiber bundle (def. )

E fb Σ \array{E \\ \downarrow^{\mathrlap{fb}} \\ \Sigma }

called the field bundle,

Given a type of fields on Σ\Sigma this way, then a field history of that type on Σ\Sigma is a term of that type, hence is a smooth section (def. ) of this bundle, namely a smooth function of the form

Φ:ΣE \Phi \;\colon\; \Sigma \longrightarrow E

such that composed with the projection map it is the identity function, i.e. such that

fbΦ=idAAAAAAA E Φ fb Σ = Σ. fb \circ \Phi = id \phantom{AAAAAAA} \array{ && E \\ & {}^{\mathllap{\Phi}}\nearrow & \downarrow^{\mathrlap{fb}} \\ \Sigma & = & \Sigma } \,.

The set of such sections/field histories is to be denoted

(18)Γ Σ(E){ E Φ fb Σ = Σfb} \Gamma_\Sigma(E) \;\coloneqq\; \left\{ \array{ && E \\ & {}^{\mathllap{\Phi}}\nearrow & \downarrow^{\mathrlap{fb}} \\ \Sigma &=& \Sigma } \phantom{fb} \right\}
Remark

(field histories are histories of spatial field configurations)

Given a section ΦΓ Σ(E)\Phi \in \Gamma_\Sigma(E) of the field bundle (def. ) and given a spacelike (def. ) submanifold Σ pΣ\Sigma_p \hookrightarrow \Sigma (def. ) of spacetime in codimension 1, then the restriction Φ| Σ p\Phi\vert_{\Sigma_p} of Φ\Phi to Σ p\Sigma_p may be thought of as a field configuration in space. As different spatial slices Σ p\Sigma_p are chosen, one obtains such field configurations at different times. It is in this sense that the entirety of a section ΦΓ Σ(E)\Phi \in \Gamma_\Sigma(E) is a history of field configurations, hence a field history (def ).

Remark

(possible field histories)

After we give the set Γ Σ(E)\Gamma_\Sigma(E) of field histories (18) differential geometric structure, below in example and example , we call it the space of field histories. This should be read as space of possible field histories; containing all field histories that qualify as being of the type specified by the field bundle EE.

After we obtain equations of motion below in def. , these serve as the “laws of nature” that field histories should obey, and they define the subspace of those field histories that do solve the equations of motion; this will be denoted

Γ Σ(E) δ ELL=0AAAΓ Σ(E) \Gamma_\Sigma(E)_{\delta_{EL}\mathbf{L}= 0} \overset{\phantom{AAA}}{\hookrightarrow} \Gamma_\Sigma(E)

and called the on-shell space of field histories (41).

For the time being, not to get distracted from the basic idea of quantum field theory, we will focus on the following simple special case of field bundles:

Example

(trivial vector bundle as a field bundle)

In applications the field fiber F=VF = V is often a finite dimensional vector space. In this case the trivial field bundle with fiber FF is of course a trivial vector bundle (def. ).

Choosing any linear basis (ϕ a) a=1 s(\phi^a)_{a = 1}^s of the field fiber, then over Minkowski spacetime (def. ) we have canonical coordinates on the total space of the field bundle

((x μ),(ϕ a)), ( (x^\mu), ( \phi^a ) ) \,,

where the index μ\mu ranges from 00 to pp, while the index aa ranges from 1 to ss.

If this trivial vector bundle is regarded as a field bundle according to def. , then a field history Φ\Phi is equivalently an ss-tuple of real-valued smooth functions Φ a:Σ\Phi^a \colon \Sigma \to \mathbb{R} on spacetime:

Φ=(Φ a) a=1 s. \Phi = ( \Phi^a )_{a = 1}^s \,.
Example

(field bundle for real scalar field)

If Σ\Sigma is a spacetime and if the field fiber

F F \coloneqq \mathbb{R}

is simply the real line, then the corresponding trivial field bundle (def. )

Σ× pr 1 Σ \array{ \Sigma \times \mathbb{R} \\ {}^{\mathllap{pr_1}}\downarrow \\ \Sigma }

is the trivial real line bundle (a special case of example ) and the corresponding field type (def. ) is called the real scalar field on Σ\Sigma. A configuration of this field is simply a smooth function on Σ\Sigma with values in the real numbers:

(19)Γ Σ(Σ×)C (Σ). \Gamma_\Sigma(\Sigma \times \mathbb{R}) \;\simeq\; C^\infty(\Sigma) \,.
Example

(field bundle for electromagnetic field)

On Minkowski spacetime Σ\Sigma (def. ), let the field bundle (def. ) be given by the cotangent bundle

ET *Σ. E \coloneqq T^\ast \Sigma \,.

This is a trivial vector bundle (example ) with canonical field coordinates (a μ)(a_\mu).

A section of this bundle, hence a field history, is a differential 1-form

AΓ Σ(T *Σ)=Ω 1(Σ) A \in \Gamma_\Sigma(T^\ast \Sigma) = \Omega^1(\Sigma)

on spacetime (def. ). Interpreted as a field history of the electromagnetic field on Σ\Sigma, this is often called the vector potential. Then the de Rham differential (def. ) of the vector potential is a differential 2-form

FdA F \coloneqq d A

known as the Faraday tensor. In the canonical coordinate basis 2-forms this may be expanded as

(20)F=i=1pE idx 0dx i+1i<jpB ijdx idx j. F = \underoverset{i = 1}{p}{\sum} E_i d x^0 \wedge d x^i + \underset{1 \leq i \lt j \leq p}{\sum} B_{i j} d x^i \wedge d x^j \,.

Here (E i) i=1 p(E_i)_{i = 1}^p are called the components of the electric field, while (B ij)(B_{i j}) are called the components of the magnetic field.

Example

(field bundle for Yang-Mills field over Minkowski spacetime)

Let 𝔤\mathfrak{g} be a Lie algebra of finite dimension with linear basis (t α)(t_\alpha), in terms of which the Lie bracket is given by

(21)[t α,t β]=γ γ αβt γ. [t_\alpha, t_\beta] \;=\; \gamma^\gamma{}_{\alpha \beta} t_\gamma \,.

Over Minkowski spacetime Σ\Sigma (def. ), consider then the field bundle which is the cotangent bundle tensored with the Lie algebra 𝔤\mathfrak{g}

ET *Σ𝔤. E \coloneqq T^\ast \Sigma \otimes \mathfrak{g} \,.

This is the trivial vector bundle (example ) with induced field coordinates

(a μ α). ( a_\mu^\alpha ) \,.

A section of this bundle is a Lie algebra-valued differential 1-form

AΓ Σ(T *Σ𝔤)=Ω 1(Σ,𝔤). A \in \Gamma_\Sigma(T^\ast \Sigma \otimes \mathfrak{g}) = \Omega^1(\Sigma, \mathfrak{g}) \,.

with components

A *(a μ α)=A μ α. A^\ast(a_\mu^\alpha) = A^\alpha_\mu \,.

This is called a field history for Yang-Mills gauge theory (at least if 𝔤\mathfrak{g} is a semisimple Lie algebra, see example below).

For 𝔤=\mathfrak{g} = \mathbb{R} is the line Lie algebra, this reduces to the case of the electromagnetic field (example ).

For 𝔤=𝔰𝔲(3)\mathfrak{g} = \mathfrak{su}(3) this is a field history for the gauge field of the strong nuclear force in quantum chromodynamics.

For readers familiar with the concepts of principal bundles and connections on a bundle we include the following example which generalizes the Yang-Mills field over Minkowski spacetime from example to the situation over general spacetimes.

Example

(general Yang-Mills field in fixed topological sector)

Let Σ\Sigma be any spacetime manifold and let GG be a compact Lie group with Lie algebra denoted 𝔤\mathfrak{g}. Let PisΣP \overset{is}{\to} \Sigma be a GG-principal bundle and 0\nabla_0 a chosen connection on it, to be called the background GG-Yang-Mills field.

Then the field bundle (def. ) for GG-Yang-Mills theory in the topological sector PP is the tensor product of vector bundles

E(P× G ad𝔤) Σ(T *Σ) E \coloneqq \left(P \times^{ad}_G \mathfrak{g}\right) \otimes_\Sigma \left( T^\ast \Sigma \right)

of the adjoint bundle of PP and the cotangent bundle of Σ\Sigma.

With the choice of 0\nabla_0, every (other) connection \nabla on PP uniquely decomposes as

= 0+A, \nabla = \nabla_0 + A \,,

where

AΓ Σ(E) A \in \Gamma_\Sigma(E)

is a section of the above field bundle, hence a Yang-Mills field history.

The electromagnetic field (def. ) and the Yang-Mills field (def. , def. ) with differential 1-forms as field histories are the basic examples of gauge fields (we consider this in more detail below in Gauge symmetries). There are also higher gauge fields with differential n-forms as field histories:

Example

(field bundle for B-field)

On Minkowski spacetime Σ\Sigma (def. ), let the field bundle (def. ) be given by the skew-symmetrized tensor product of vector bundles of the cotangent bundle with itself

E Σ 2T *Σ. E \coloneqq \wedge^2_\Sigma T^\ast \Sigma \,.

This is a trivial vector bundle (example ) with canonical field coordinates (b μν)(b_{\mu \nu}) subject to

b μν=b νμ. b_{\mu \nu} \;=\; - b_{\nu \mu} \,.

A section of this bundle, hence a field history, is a differential 2-form (def. )

BΓ Σ( Σ 2T *Σ)=Ω 2(Σ) B \in \Gamma_\Sigma(\wedge^2_\Sigma T^\ast \Sigma) = \Omega^2(\Sigma)

on spacetime.

\,

space of field histories

Given any field bundle, we will eventually need to regard the set of all field histories Γ Σ(E)\Gamma_\Sigma(E) as a “smooth set” itself, a smooth space of sections, to which constructions of differential geometry apply (such as for the discussion of observables and states below ). Notably we need to be talking about differential forms on Γ Σ(E)\Gamma_\Sigma(E).

However, a space of sections Γ Σ(E)\Gamma_\Sigma(E) does not in general carry the structure of a smooth manifold; and it carries the correct smooth structure of an infinite dimensional manifold only if Σ\Sigma is a compact space (see at manifold structure of mapping spaces). Even if it does carry infinite dimensional manifold structure, inspection shows that this is more structure than actually needed for the discussion of field theory. Namely it turns out below that all we need to know is what counts as a smooth family of sections/field histories, hence which functions of sets

Φ (): nΓ Σ(E) \Phi_{(-)} \;\colon\; \mathbb{R}^n \longrightarrow \Gamma_\Sigma(E)

from any Cartesian space n\mathbb{R}^n (def. ) into Γ Σ(E)\Gamma_\Sigma(E) count as smooth functions, subject to some basic consistency condition on this choice.

This structure on Γ Σ(E)\Gamma_\Sigma(E) is called the structure of a diffeological space:

Definition

(diffeological space)

A diffeological space XX is

  1. a set X sX_s \in Set;

  2. for each nn \in \mathbb{N} a choice of subset

    X( n)Hom Set( s n,X s)={ s nX s} X(\mathbb{R}^n) \subset Hom_{Set}(\mathbb{R}^n_s, X_s) = \left\{ \mathbb{R}^n_s \to X_s \right\}

    of the set of functions from the underlying set s n\mathbb{R}^n_s of n\mathbb{R}^n to X sX_s, to be called the smooth functions or plots from n\mathbb{R}^n to XX;

  3. for each smooth function f: n 1 n 2f \;\colon\; \mathbb{R}^{n_1} \longrightarrow \mathbb{R}^{n_2} between Cartesian spaces (def. ) a choice of function

    f *:X( n 2)X( n 1) f^\ast \;\colon\; X(\mathbb{R}^{n_2}) \longrightarrow X(\mathbb{R}^{n_1})

    to be thought of as the precomposition operation

    ( n 2ΦX)f *( n 1f n 2ΦX) \left( \mathbb{R}^{n_2} \overset{\Phi}{\longrightarrow} X \right) \;\overset{f^\ast}{\mapsto}\; \left( \mathbb{R}^{n_1} \overset{f}{\to} \mathbb{R}^{n_2} \overset{\Phi}{\to} X \right)

such that

  1. (constant functions are smooth)

    X( 0)=X s, X(\mathbb{R}^0) = X_s \,,
  2. (functoriality)

    1. If id n: n nid_{\mathbb{R}^n} \;\colon\; \mathbb{R}^n \to \mathbb{R}^n is the identity function on n\mathbb{R}^n, then (id n) *:X( n)X( n)\left(id_{\mathbb{R}^n}\right)^\ast \;\colon\; X(\mathbb{R}^n) \to X(\mathbb{R}^n) is the identity function on the set of plots X( n)X(\mathbb{R}^n);

    2. If n 1f n 2g n 3\mathbb{R}^{n_1} \overset{f}{\to} \mathbb{R}^{n_2} \overset{g}{\to} \mathbb{R}^{n_3} are two composable smooth functions between Cartesian spaces (def. ), then pullback of plots along them consecutively equals the pullback along the composition:

      f *g *=(gf) * f^\ast \circ g^\ast = (g \circ f)^\ast

      i.e.

      X( n 2) f * g * X( n 1) (gf) * X( n 3) \array{ && X(\mathbb{R}^{n_2}) \\ & {}^{\mathllap{f^\ast}}\swarrow && \nwarrow^{\mathrlap{g^\ast}} \\ X(\mathbb{R}^{n_1}) && \underset{ (g \circ f)^\ast }{\longleftarrow} && X(\mathbb{R}^{n_3}) }
  3. (gluing)

    If {U if i n} iI\{ U_i \overset{f_i}{\to} \mathbb{R}^n\}_{i \in I} is a differentiably good open cover of a Cartesian space (def. ) then the function which restricts n\mathbb{R}^n-plots of XX to a set of U iU_i-plots

    X( n)((f i) *) iIiIX(U i) X(\mathbb{R}^n) \overset{( (f_i)^\ast )_{i \in I} }{\hookrightarrow} \underset{i \in I}{\prod} X(U_i)

    is a bijection onto the set of those tuples (Φ iX(U i)) iI(\Phi_i \in X(U_i))_{i \in I} of plots, which are “matching families” in that they agree on intersections:

    ϕ i| U iU j=ϕ j| U iU jAAAAAA U iU j U i U j Φ i Φ j X \phi_i\vert_{U_i \cap U_j} = \phi_j \vert_{U_i \cap U_j} \phantom{AAAAAA} \array{ && U_i \cap U_j \\ & \swarrow && \searrow \\ U_i && && U_j \\ & {}_{\mathrlap{\Phi_i}}\searrow && \swarrow_{\mathrlap{\Phi_j}} \\ && X }

Finally, given X 1X_1 and X 2X_2 two diffeological spaces, then a smooth function between them

f:X 1X 2 f \;\colon\; X_1 \longrightarrow X_2

is

  • a function of the underlying sets

    f s:(X 1) s(X 2) s f_s \;\colon\; (X_1)_s \longrightarrow (X_2)_s

such that

  • for ΦX( n)\Phi \in X(\mathbb{R}^n) a plot of X 1X_1, then the composition f sΦ sf_s \circ \Phi_s is a plot f *(Φ)f_\ast(\Phi) of X 2X_2:

    n Φ f *(Φ) X 1 f X 2. \array{ && \mathbb{R}^n \\ & {}^{\mathllap{\Phi}}\swarrow && \searrow^{\mathrlap{f_\ast(\Phi)}} \\ X_1 && \underset{f}{\longrightarrow} && X_2 } \,.

(Stated more abstractly, this says simply that diffeological spaces are the concrete sheaves on the site of Cartesian spaces from def. .)

For more background on diffeological spaces see also geometry of physics – smooth sets.

Example

(Cartesian spaces are diffeological spaces)

Let XX be a Cartesian space (def. ) Then it becomes a diffeological space (def. ) by declaring its plots ΦX( n)\Phi \in X(\mathbb{R}^n) to the ordinary smooth functions Φ: nX\Phi \colon \mathbb{R}^n \to X.

Under this identification, a function f:(X 1) s(X 2) sf \;\colon\; (X_1)_s \to (X_2)_s between the underlying sets of two Cartesian spaces is a smooth function in the ordinary sense precisely if it is a smooth function in the sense of diffeological spaces.

Stated more abstractly, this statement is an example of the Yoneda embedding over a subcanonical site.

More generally, the same construction makes every smooth manifold a smooth set.

Example

(diffeological space of field histories)

Let EfbΣE \overset{fb}{\to} \Sigma be a smooth field bundle (def. ). Then the set Γ Σ(E)\Gamma_\Sigma(E) of field histories/sections (def. ) becomes a diffeological space (def. )

(22)Γ Σ(E)DiffeologicalSpaces \Gamma_\Sigma(E) \in DiffeologicalSpaces

by declaring that a smooth family Φ ()\Phi_{(-)} of field histories, parameterized over any Cartesian space UU is a smooth function out of the Cartesian product manifold of Σ\Sigma with UU

U×Σ Φ ()() E (u,x) Φ u(x) \array{ U \times \Sigma &\overset{\Phi_{(-)}(-)}{\longrightarrow}& E \\ (u,x) &\mapsto& \Phi_u(x) }

such that for each uUu \in U we have pΦ u()=id Σp \circ \Phi_{u}(-) = id_\Sigma, i.e.

E Φ ()() fb U×Σ pr 2 Σ. \array{ && E \\ & {}^{\mathllap{\Phi_{(-)}(-)}}\nearrow & \downarrow^{\mathrlap{fb}} \\ U \times \Sigma &\underset{pr_2}{\longrightarrow}& \Sigma } \,.

The following example is included only for readers who wonder how infinite-dimensional manifolds fit in. Since we will never actually use infinite-dimensional manifold-structure, this example is may be ignored.

Example

(Fréchet manifolds are diffeological spaces)

Consider the particular type of infinite-dimensional manifolds called Fréchet manifolds. Since ordinary smooth manifolds UU are an example, for XX a Fréchet manifold there is a concept of smooth functions UXU \to X. Hence we may give XX the structure of a diffeological space (def. ) by declaring the plots over UU to be these smooth functions UXU \to X, with the evident postcomposition action.

It turns out that then that for XX and YY two Fréchet manifolds, there is a natural bijection between the smooth functions XYX \to Y between them regarded as Fréchet manifolds and [regarded as . Hence it does not matter which of the two perspectives we take (unless of course a more general than a enters the picture, at which point the second definition generalizes, whereas the first does not).]

Stated more abstractly, this means that Fréchet manifolds form a full subcategory of that of diffeological spaces (this prop.):

FrechetManifoldsDiffeologicalSpaces. FrechetManifolds \hookrightarrow DiffeologicalSpaces \,.

If Σ\Sigma is a compact smooth manifold and EΣ×FΣE \simeq \Sigma \times F \to \Sigma is a trivial fiber bundle with fiber FF a smooth manifold, then the set of sections Γ Σ(E)\Gamma_\Sigma(E) carries a standard structure of a Fréchet manifold (see at manifold structure of mapping spaces). Under the above inclusion of Fréchet manifolds into diffeological spaces, this smooth structure agrees with that from example (see this prop.)

Once the step from smooth manifolds to diffeological spaces (def. ) is made, characterizing the smooth structure of the space entirely by how we may probe it by mapping smooth Cartesian spaces into it, it becomes clear that the underlying set X sX_s of a diffeological space XX is not actually crucial to support the concept: The space is already entirely defined structurally by the system of smooth plots it has, and the underlying set X sX_s is recovered from these as the set of plots from the point 0\mathbb{R}^0.

This is crucial for field theory: the spaces of field histories of fermionic fields (def. below) such as the Dirac field (example below) do not have underlying sets of points the way diffeological spaces have. Informally, the reason is that a point is a bosonic object, while and the nature of fermionic fields is the opposite of bosonic.

But we may just as well drop the mentioning of the underlying set X sX_s in the definition of generalized smooth spaces. By simply stripping this requirement off of def. we obtain the following more general and more useful definition (still “bosonic”, though, the supergeometric version is def. below):

Definition

(smooth set)

A smooth set XX is

  1. for each nn \in \mathbb{N} a choice of set

    X( n)Set X(\mathbb{R}^n) \in Set

    to be called the set of smooth functions or plots from n\mathbb{R}^n to XX;

  2. for each smooth function f: n 1 n 2f \;\colon\; \mathbb{R}^{n_1} \longrightarrow \mathbb{R}^{n_2} between Cartesian spaces a choice of function

    f *:X( n 2)X( n 1) f^\ast \;\colon\; X(\mathbb{R}^{n_2}) \longrightarrow X(\mathbb{R}^{n_1})

    to be thought of as the precomposition operation

    ( n 2ΦX)f *( n 1f n 2ΦX) \left( \mathbb{R}^{n_2} \overset{\Phi}{\longrightarrow} X \right) \;\overset{f^\ast}{\mapsto}\; \left( \mathbb{R}^{n_1} \overset{f}{\to} \mathbb{R}^{n_2} \overset{\Phi}{\to} X \right)

such that

  1. (functoriality)

    1. If id n: n nid_{\mathbb{R}^n} \;\colon\; \mathbb{R}^n \to \mathbb{R}^n is the identity function on n\mathbb{R}^n, then (id n) *:X( n)X( n)\left(id_{\mathbb{R}^n}\right)^\ast \;\colon\; X(\mathbb{R}^n) \to X(\mathbb{R}^n) is the identity function on the set of plots X( n)X(\mathbb{R}^n).

    2. If n 1f n 2g n 3\mathbb{R}^{n_1} \overset{f}{\to} \mathbb{R}^{n_2} \overset{g}{\to} \mathbb{R}^{n_3} are two composable smooth functions between Cartesian spaces, then consecutive pullback of plots along them equals the pullback along the composition:

      f *g *=(gf) * f^\ast \circ g^\ast = (g \circ f)^\ast

      i.e.

      X( n 2) f * g * X( n 1) (gf) * X( n 3) \array{ && X(\mathbb{R}^{n_2}) \\ & {}^{\mathllap{f^\ast}}\swarrow && \nwarrow^{\mathrlap{g^\ast}} \\ X(\mathbb{R}^{n_1}) && \underset{ (g \circ f)^\ast }{\longleftarrow} && X(\mathbb{R}^{n_3}) }
  2. (gluing)

    If {U if i n} iI\{ U_i \overset{f_i}{\to} \mathbb{R}^n\}_{i \in I} is a differentiably good open cover of a Cartesian space (def. ) then the function which restricts n\mathbb{R}^n-plots of XX to a set of U iU_i-plots

    X( n)((f i) *) iIiIX(U i) X(\mathbb{R}^n) \overset{( (f_i)^\ast )_{i \in I} }{\hookrightarrow} \underset{i \in I}{\prod} X(U_i)

    is a bijection onto the set of those tuples (Φ iX(U i)) iI(\Phi_i \in X(U_i))_{i \in I} of plots, which are “matching families” in that they agree on intersections:

    ϕ i| U iU j=ϕ j| U iU jAAAAi.e.AAAA U iU j U i U j Φ i Φ j X \phi_i\vert_{U_i \cap U_j} = \phi_j \vert_{U_i \cap U_j} \phantom{AAAA} \text{i.e.} \phantom{AAAA} \array{ && U_i \cap U_j \\ & \swarrow && \searrow \\ U_i && && U_j \\ & {}_{\mathrlap{\Phi_i}}\searrow && \swarrow_{\mathrlap{\Phi_j}} \\ && X }

Finally, given X 1X_1 and X 2X_2 two smooth sets, then a smooth function between them

f:X 1X 2 f \;\colon\; X_1 \longrightarrow X_2

is

  • for each nn \in \mathbb{N} a function

    f *( n):X 1( n)X 2( n) f_\ast(\mathbb{R}^n) \;\colon\; X_1(\mathbb{R}^n) \longrightarrow X_2(\mathbb{R}^n)

such that

  • for each smooth function g: n 1 n 2g \colon \mathbb{R}^{n_1} \to \mathbb{R}^{n_2} between Cartesian spaces we have

    g 2 *f *( n 2)=f *( n 1)g 1 *AAAAAi.e.AAAAAi.e.AAAAAX 1( n 2) f *( n 2) X 2( n 2) g 1 * g 2 * X 1( n 1) f *( n 1) X 2( n 1) g^\ast_2 \circ f_\ast(\mathbb{R}^{n_2}) = f_\ast(\mathbb{R}^{n_1}) \circ g^\ast_1 \phantom{AAAAA} \text{i.e.} \phantom{AAAAA} \text{i.e.} \phantom{AAAAA} \array{ X_1(\mathbb{R}^{n_2}) &\overset{f_\ast(\mathbb{R}^{n_2})}{\longrightarrow}& X_2(\mathbb{R}^{n_2}) \\ \mathllap{g_1^\ast}\downarrow && \downarrow\mathrlap{g^\ast_2} \\ X_1(\mathbb{R}^{n_1}) &\underset{f_\ast(\mathbb{R}^{n_1})}{\longrightarrow}& X_2(\mathbb{R}^{n_1}) }

Stated more abstractly, this simply says that smooth sets are the sheaves on the site of Cartesian spaces from def. .

Basing differential geometry on smooth sets is an instance of the general approach to geometry called functorial geometry or topos theory. For more background on this see at geometry of physics – smooth sets.

First we verify that the concept of smooth sets is a consistent generalization:

Example

(diffeological spaces are smooth sets)

Every diffeological space XX (def. ) is a smooth set (def. ) simply by forgetting its underlying set of points and remembering only its sets of plot.

In particular therefore each Cartesian space n\mathbb{R}^n is canonically a smooth set by example .

Moreover, given any two diffeological spaces, then the morphisms f:XYf \colon X \to Y between them, regarded as diffeological spaces, are the same as the morphisms as smooth sets.

Stated more abstractly, this means that we have full subcategory inclusions

CartesianSpacesAAADiffeologicalSpacesAAASmoothSets. CartesianSpaces \overset{\phantom{AAA}}{\hookrightarrow} DiffeologicalSpaces \overset{\phantom{AAA}}{\hookrightarrow} SmoothSets \,.

Recall, for the next proposition , that in the definition of a smooth set XX the sets X( n)X(\mathbb{R}^n) are abstract sets which are to be thought of as would-be smooth functions “ nX\mathbb{R}^n \to X”. Inside def. this only makes sense in quotation marks, since inside that definition the smooth set XX is only being defined, so that inside that definition there is not yet an actual concept of smooth functions of the form “ nX\mathbb{R}^n \to X”.

But now that the definition of smooth sets and of morphisms between them has been stated, and seeing that Cartesian space n\mathbb{R}^n are examples of smooth sets, by example , there is now an actual concept of smooth functions nX\mathbb{R}^n \to X, namely as smooth sets. For the concept of smooth sets to be consistent, it ought to be true that this a posteriori concept of smooth functions from Cartesian spaces to smooth sets coincides wth the a priori concept, hence that we “may remove the quotation marks” in the above. The following proposition says that this is indeed the case:

Proposition

(plots of a smooth set really are the smooth functions into the smooth set)

Let XX be a smooth set (def. ). For nn \in \mathbb{R}, there is a natural function

Hom SmoothSet( n,X)AAAAX( n) Hom_{SmoothSet}(\mathbb{R}^n , X) \overset{\phantom{AA}\simeq\phantom{AA}}{\longrightarrow} X(\mathbb{R}^n)

from the set of homomorphisms of smooth sets from n\mathbb{R}^n (regarded as a smooth set via example ) to XX, to the set of plots of XX over n\mathbb{R}^n, given by evaluating on the identity plot id nid_{\mathbb{R}^n}.

This function is a bijection.

This says that the plots of XX, which initially bootstrap XX into being as declaring the would-be smooth functions into XX, end up being the actual smooth functions into XX.

Proof

This elementary but profound fact is called the Yoneda lemma, here in its incarnation over the site of Cartesian spaces (def. ).

A key class of examples of smooth sets (def. ) that are not diffeological spaces (def. ) are universal smooth moduli spaces of differential forms:

Example

(universal smooth moduli spaces of differential forms)

For kk \in \mathbb{N} there is a smooth set (def. )

Ω kSmoothSet \mathbf{\Omega}^k \;\in\; SmoothSet

defined as follows:

  1. for nn \in \mathbb{N} the set of plots from n\mathbb{R}^n to Ω k\mathbf{\Omega}^k is the set of smooth differential k-forms on n\mathbb{R}^n (def. )

    Ω k( n)Ω k( n) \mathbf{\Omega}^k(\mathbb{R}^n) \;\coloneqq\; \Omega^k(\mathbb{R}^n)
  2. for f: n 1 n 2f \colon \mathbb{R}^{n_1} \to \mathbb{R}^{n_2} a smooth function (def. ) the operation of pullback of plots along ff is just the pullback of differential forms f *f^\ast from prop.

    n 1 Ω k( n 1) f f * n 2 Ω k( n 2) \array{ \mathbb{R}^{n_1} && \Omega^k(\mathbb{R}^{n_1}) \\ \downarrow^{\mathrlap{f}} && \uparrow^{\mathrlap{f^\ast}} \\ \mathbb{R}^{n_2} && \Omega^k(\mathbb{R}^{n_2}) }

That this is functorial is just the standard fact (7) from prop. .

For k=1k = 1 the smooth set Ω 0\mathbf{\Omega}^0 actually is a diffeological space, in fact under the identification of example this is just the real line:

Ω 0 1. \mathbf{\Omega}^0 \simeq \mathbb{R}^1 \,.

But for k1k \geq 1 we have that the set of plots on 0=*\mathbb{R}^0 = \ast is a singleton

Ω k1( 0){0} \mathbf{\Omega}^{k \geq 1}(\mathbb{R}^0) \simeq \{0\}

consisting just of the zero differential form. The only diffeological space with this property is 0=*\mathbb{R}^0 = \ast itself. But Ω k1\mathbf{\Omega}^{k \geq 1} is far from being that trivial: even though its would-be underlying set is a single point, for all nkn \geq k it admits an infinite set of plots. Therefore the smooth sets Ω k\mathbf{\Omega}^k for kk \geq are not diffeological spaces.

That the smooth set Ω k\mathbf{\Omega}^k indeed deserves to be addressed as the universal moduli space of differential k-forms follows from prop. : The universal moduli space of kk-forms ought to carry a universal differential kk-forms ω univΩ k(Ω k)\omega_{univ} \in \Omega^k(\mathbf{\Omega}^k) such that every differential kk-form ω\omega on any n\mathbb{R}^n arises as the pullback of differential forms of this universal one along some modulating morphism f ω:XΩ kf_\omega \colon X \to \mathbf{\Omega}^k:

{ω} (f ω) * {ω univ} X f ω Ω k \array{ \{\omega\} &\overset{(f_\omega)^\ast}{\longleftarrow}& \{\omega_{univ}\} \\ \\ X &\underset{f_\omega}{\longrightarrow}& \mathbf{\Omega}^k }

But with prop. this is precisely what the definition of the plots of Ω k\mathbf{\Omega}^k says.

Similarly, all the usual operations on differential form now have their universal archetype on the universal moduli spaces of differential forms

In particular, for kk \in \mathbb{N} there is a canonical morphism of smooth sets of the form

Ω kdΩ k+1 \mathbf{\Omega}^k \overset{\mathbf{d}}{\longrightarrow} \mathbf{\Omega}^{k+1}

defined over n\mathbb{R}^n by the ordinary de Rham differential (def. )

(23)Ω k( n)dΩ k+1( n). \Omega^k(\mathbb{R}^n) \overset{d}{\longrightarrow} \Omega^{k+1}(\mathbb{R}^n) \,.

That this satisfies the compatibility with precomposition of plots

n 1 Ω k( n 1) d Ω k+1( n 1) f f * f * n 2 Ω k( n 2) d Ω k( n 2) \array{ \mathbb{R}^{n_1} && \Omega^k(\mathbb{R}^{n_1}) &\overset{d}{\longrightarrow}& \Omega^{k+1}(\mathbb{R}^{n_1}) \\ {}^{\mathllap{f}}\downarrow && \uparrow^{\mathrlap{f^\ast}} && \uparrow^{\mathrlap{f^\ast}} \\ \mathbb{R}^{n_2} && \Omega^k(\mathbb{R}^{n_2}) &\underset{d}{\longrightarrow}& \Omega^k( \mathbb{R}^{n_2} ) }

is just the compatibility of pullback of differential forms with the de Rham differential of from prop. .

The upshot is that we now have a good definition of differential forms on any diffeological space and more generally on any smooth set:

Definition

(differential forms on smooth sets)

Let XX be a diffeological space (def. ) or more generally a smooth set (def. ) then a differential k-form ω\omega on XX is equivalently a morphism of smooth sets

XΩ k X \longrightarrow \mathbf{\Omega}^k

from XX to the universal smooth moduli space of differential froms from example .

Concretely, by unwinding the definitions of Ω k\mathbf{\Omega}^k and of morphisms of smooth sets, this means that such a differential form is:

  • for each nn \in \mathbb{N} and each plot nΦX\mathbb{R}^n \overset{\Phi}{\to} X an ordinary differential form

    Φ *(ω)Ω ( n) \Phi^\ast(\omega) \in \Omega^\bullet(\mathbb{R}^n)

such that

  • for each smooth function f: n 1 n 2f \;\colon\; \mathbb{R}^{n_1} \to \mathbb{R}^{n_2} between Cartesian spaces the ordinary pullback of differential forms along ff is compatible with these choices, in that for every plot n 2ΦX\mathbb{R}^{n_2} \overset{\Phi}{\to} X we have

    f *(Φ *(ω))=(f *Φ) *(ω) f^\ast\left(\Phi^\ast(\omega)\right) = ( f^\ast \Phi )^\ast(\omega)

    i.e.

    n 1 f n 2 f *Φ Φ XAAAAΩ ( n 1) f * Ω ( n 2) (f *Φ) * Φ * Ω (X). \array{ \mathbb{R}^{n_1} && \overset{f}{\longrightarrow} && \mathbb{R}^{n_2} \\ & {}_{\mathllap{f^\ast \Phi}}\searrow && \swarrow_{\mathrlap{\Phi}} \\ && X } \phantom{AAAA} \array{ \Omega^\bullet( \mathbb{R}^{n_1} ) && \overset{f^\ast}{\longleftarrow} && \Omega^\bullet(\mathbb{R}^{n_2}) \\ & {}_{\mathllap{(f^\ast \Phi)^\ast}}\nwarrow && \nearrow_{\mathrlap{\Phi^\ast}} \\ && \Omega^\bullet(X) } \,.

We write Ω (X)\Omega^\bullet(X) for the set of differential forms on the smooth set XX defined this way.

Moreover, given a differential k-form

XωΩ k X \overset{\omega}{\longrightarrow} \mathbf{\Omega}^k

on a smooth set XX this way, then its de Rham differential dωΩ k+1(X)d \omega \in \Omega^{k+1}(X) is given by the composite of morphisms of smooth sets with the universal de Rham differential from (23):

(24)dω:XωΩ kdΩ k+1. d \omega \;\colon\; X \overset{\omega}{\longrightarrow} \mathbf{\Omega}^k \overset{d}{\longrightarrow} \mathbf{\Omega}^{k+1} \,.

Explicitly this means simply that for Φ:UX\Phi \colon U \to X a plot, then

Φ *(dω)=d(Φ *ω)Ω k+1(U). \Phi^\ast (d\omega) \;=\; d\left( \Phi^\ast \omega\right) \;\in\; \Omega^{k+1}(U) \,.

The usual operations on ordinary differential forms directly generalize plot-wise to differential forms on diffeological spaces and more generally on smooth sets:

Definition

(exterior differential and exterior product on smooth sets)

Let XX be a diffeological space (def. ) or more generally a smooth set (def. ). Then

  1. For ωΩ n(X)\omega \in \Omega^n(X) a differential form on XX (def. ) its exterior differential

    dωΩ n+1(X) d \omega \in \Omega^{n+1}(X)

    is defined on any plot nΦX\mathbb{R}^n \overset{\Phi}{\to} X as the ordinary exterior differential of the pullback of ω\omega along that plot:

    Φ *(dω)dΦ *(ω). \Phi^\ast(d \omega) \coloneqq d \Phi^\ast(\omega) \,.
  2. For ω 1Ω n 1\omega_1 \in \Omega^{n_1} and ω 2Ω n 2(X)\omega_2 \in \Omega^{n_2}(X) two differential forms on XX (def. ) then their exterior product

    ω 1ω 2Ω n 1+n 2(X) \omega_1 \wedge \omega_2 \;\in\; \Omega^{n_1 + n_2}(X)

    is the differential form defined on any plot nΦX\mathbb{R}^n \overset{\Phi}{\to} X as the ordinary exterior product of the pullback of th differential forms ω 1\omega_1 and ω 2\omega_2 to this plot:

    Φ *(ω 1ω 2)Φ *(ω 1)Φ *(ω 2). \Phi^\ast(\omega_1 \wedge \omega_2) \;\coloneqq\; \Phi^\ast(\omega_1) \wedge \Phi^\ast(\omega_2) \,.

\,

Infinitesimal geometry

It is crucial in field theory that we consider field histories not only over all of spacetime, but also restricted to submanifolds of spacetime. Or rather, what is actually of interest are the restrictions of the field histories to the infinitesimal neighbourhoods (example below) of these submanifolds. This appears notably in the construction of phase spaces below. Moreover, fermion fields such as the Dirac field (example below) take values in graded infinitesimal spaces, called super spaces (discussed below). Therefore “infinitesimal geometry”, sometimes called formal geometry (as in “formal scheme”) or synthetic differential geometry or synthetic differential supergeometry, is a central aspect of field theory.

In order to mathematically grasp what infinitesimal neighbourhoods are, we appeal to the first magic algebraic property of differential geometry from prop. , which says that we may recognize smooth manifolds XX dually in terms of their commutative algebras C (X)C^\infty(X) of smooth functions on them

C ():SmoothManifoldsAAA(Algebras) op. C^\infty(-) \;\colon\; SmoothManifolds \overset{\phantom{AAA}}{\hookrightarrow} (\mathbb{R} Algebras)^{op} \,.

But since there are of course more algebras AAlgebrasA \in \mathbb{R}Algebras than arise this way from smooth manifolds, we may turn this around and try to regard any algebra AA as defining a would-be space, which would have AA as its algebra of functions.

For example an infinitesimally thickened point should be a space which is “so small” that every smooth function ff on it which vanishes at the origin takes values so tiny that some finite power of them is not just even more tiny, but actually vanishes:

Definition

(infinitesimally thickened Cartesian space)

An infinitesimally thickened point

𝔻Spec(A) \mathbb{D} \coloneqq Spec(A)

is represented by a commutative algebra AAlgA \in \mathbb{R}Alg which as a real vector space is a direct sum

A 1V A \simeq_{\mathbb{R}} \langle 1 \rangle \oplus V

of the 1-dimensional space 1=\langle 1 \rangle = \mathbb{R} of multiples of 1 with a finite dimensional vector space VV that is a nilpotent ideal in that for each element aVa \in V there exists a natural number nn \in \mathbb{N} such that

a n+1=0. a^{n+1} = 0 \,.

More generally, an infinitesimally thickened Cartesian space

n×𝔻 n×Spec(A) \mathbb{R}^n \times \mathbb{D} \;\coloneqq\; \mathbb{R}^n \times Spec(A)

is represented by a commutative algebra

C ( n)AAlg C^\infty(\mathbb{R}^n) \otimes A \;\in\; \mathbb{R} Alg

which is the tensor product of algebras of the algebra of smooth functions C ( n)C^\infty(\mathbb{R}^n) on an actual Cartesian space of some dimension nn (example ), with an algebra of functions A 1VA \simeq_{\mathbb{R}} \langle 1\rangle \oplus V of an infinitesimally thickened point, as above.

We say that a smooth function between two infinitesimally thickened Cartesian spaces

n 1×Spec(A 1)f n 2×Spec(A 2) \mathbb{R}^{n_1} \times Spec(A_1) \overset{f}{\longrightarrow} \mathbb{R}^{n_2} \times Spec(A_2)

is by definition dually an \mathbb{R}-algebra homomorphism of the form

C ( n 1)A 1f *C ( n 2)A 2. C^\infty(\mathbb{R}^{n_1}) \otimes A_1 \overset{f^\ast}{\longleftarrow} C^\infty(\mathbb{R}^{n_2}) \otimes A_2 \,.
Example

(infinitesimal neighbourhoods in the real line )

Consider the quotient algebra of the formal power series algebra [[ϵ]]\mathbb{R}[ [\epsilon] ] in a single parameter ϵ\epsilon by the ideal generated by ϵ 2\epsilon^2:

([[ϵ]])/(ϵ 2) ϵ. (\mathbb{R}[ [\epsilon] ])/(\epsilon^2) \;\simeq_{\mathbb{R}}\; \mathbb{R} \oplus \epsilon \mathbb{R} \,.

(This is sometimes called the algebra of dual numbers, for no good reason.) The underlying real vector space of this algebra is, as show, the direct sum of the multiples of 1 with the multiples of ϵ\epsilon. A general element in this algebra is of the form

a+bϵ([ϵ])/(ϵ 2) a + b \epsilon \in (\mathbb{R}[\epsilon])/(\epsilon^2)

where a,ba,b \in \mathbb{R} are real numbers. The product in this algebra is given by “multiplying out” as usual, and discarding all terms proportional to ϵ 2\epsilon^2:

(a 1+b 1ϵ)(a 2+b 2ϵ)=a 1a 2+(a 1b 2+b 1a 2)ϵ. \left( a_1 + b_1 \epsilon \right) \cdot \left( a_2 + b_2 \epsilon \right) \;=\; a_1 a_2 + ( a_1 b_2 + b_1 a_2 ) \epsilon \,.

We may think of an element a+bϵa + b \epsilon as the truncation to first order of a Taylor series at the origin of a smooth function on the real line

f: f \;\colon\; \mathbb{R} \to \mathbb{R}

where a=f(0)a = f(0) is the value of the function at the origin, and where b=fx(0)b = \frac{\partial f}{\partial x}(0) is its first derivative at the origin.

Therefore this algebra behaves like the algebra of smooth function on an infinitesimal neighbourhood 𝔻 1\mathbb{D}^1 of 00 \in \mathbb{R} which is so tiny that its elements ϵ𝔻 1\epsilon \in \mathbb{D}^1 \hookrightarrow \mathbb{R} become, upon squaring them, not just tinier, but actually zero:

ϵ 2=0. \epsilon^2 = 0 \,.

This intuitive picture is now made precise by the concept of infinitesimally thickened points def. , if we simply set

𝔻 1Spec([[ϵ]]/(ϵ 2)) \mathbb{D}^1 \;\coloneqq\; Spec\left( \mathbb{R}[ [\epsilon] ]/(\epsilon^2) \right)

and observe that there is the inclusion of infinitesimally thickened Cartesian spaces

𝔻 1AAiAA 1 \mathbb{D}^1 \overset{\phantom{AA}i\phantom{AA} }{\hookrightarrow} \mathbb{R}^1

which is dually given by the algebra homomorphism

ϵ i * C ( 1) f(0)+fx(0) {f} \array{ \mathbb{R} \oplus \epsilon \mathbb{R} &\overset{i^\ast}{\longleftarrow}& C^\infty(\mathbb{R}^1) \\ f(0) + \frac{\partial f}{\partial x}(0) &\longleftarrow& \{f\} }

which sends a smooth function to its value f(0)f(0) at zero plus ϵ\epsilon times its derivative at zero. Observe that this is indeed a homomorphism of algebras due to the product law of differentiation, which says that

i *(fg) =(fg)(0)+fgx(0)ϵ =f(0)g(0)+(fx(0)g(0)+f(0)gx(0))ϵ =(f(0)+fx(0)ϵ)(g(0)+gx(0)ϵ) \begin{aligned} i^\ast(f \cdot g) & = (f \cdot g)(0) + \frac{\partial f \cdot g}{\partial x}(0) \epsilon \\ & = f(0) \cdot g(0) + \left( \frac{\partial f}{\partial x}(0) \cdot g(0) + f(0) \cdot \frac{\partial g}{\partial x}(0) \right) \epsilon \\ & = \left( f(0) + \frac{\partial f}{\partial x}(0) \epsilon \right) \cdot \left( g(0) + \frac{\partial g}{\partial x}(0) \epsilon \right) \end{aligned}

Hence we see that restricting a smooth function to the infinitesimal neighbourhood of a point is equivalent to restricting attention to its Taylor series to the given order at that point:

𝔻 1 i 1 (ϵf(0)+fx(0)ϵ) f 1 \array{ \mathbb{D}^1 &\overset{i}{\hookrightarrow}& \mathbb{R}^1 \\ & {}_{\mathllap{(\epsilon \mapsto f(0) + \frac{\partial f}{\partial x}(0) \epsilon) }}\searrow & \downarrow_{\mathrlap{f}} \\ && \mathbb{R}^1 }

Similarly for each kk \in \mathbb{N} the algebra

([[ϵ]])/(ϵ k+1) (\mathbb{R}[ [ \epsilon ] ])/(\epsilon^{k+1})

may be thought of as the algebra of Taylor series at the origin of \mathbb{R} of smooth functions \mathbb{R} \to \mathbb{R}, where all terms of order higher than kk are discarded. The corresponding infinitesimally thickened point is often denoted

𝔻 1(k)Spec(([[ϵ]])/(ϵ k+1)). \mathbb{D}^1(k) \;\coloneqq\; Spec\left( \left(\mathbb{R}[ [\epsilon] ]\right)/(\epsilon^{k+1}) \right) \,.

This is now the subobject of the real line

𝔻 1(k)AAA 1 \mathbb{D}^1(k) \overset{\phantom{AAA}}{\hookrightarrow} \mathbb{R}^1

on those elements ϵ\epsilon such that ϵ k+1=0\epsilon^{k+1} = 0.

(Kock 81, Kock 10)

The following example shows that infinitesimal thickening is invisible for ordinary spaces when mapping out of these. In contrast example further below shows that the morphisms into an ordinary space out of an infinitesimal space are interesting: these are tangent vectors and their higher order infinitesimal analogs.

Example

(infinitesimal line 𝔻 1\mathbb{D}^1 has unique global point)

For n\mathbb{R}^n any ordinary Cartesian space (def. ) and D 1(k) 1D^1(k) \hookrightarrow \mathbb{R}^1 the order-kk infinitesimal neighbourhood of the origin in the real line from example , there is exactly only one possible morphism of infinitesimally thickened Cartesian spaces from n\mathbb{R}^n to 𝔻 1(k)\mathbb{D}^1(k):

n ! 6𝔻 1(k) ! ! 0=*. \array{ \mathbb{R}^n && \overset{\exists !}{\longrightarrow} &6 \mathbb{D}^1(k) \\ & {}_{\mathllap{\exists !}}\searrow && \nearrow_{\mathrlap{\exists !}} \\ && \mathbb{R}^0 = \ast } \,.
Proof

By definition such a morphism is dually an algebra homomorphism

C ( n)f *([[ϵ]])/(ϵ k+1) 𝒪(ϵ) C^\infty(\mathbb{R}^n) \overset{f^\ast}{\longleftarrow} \left( \mathbb{R}[ [\epsilon] ])/(\epsilon^{k+1} \right) \simeq_{\mathbb{R}} \mathbb{R} \oplus \mathcal{O}(\epsilon)

from the higher order “algebra of dual numbers” to the algebra of smooth functions (example ).

Now this being an \mathbb{R}-algebra homomorphism, its action on the multiples cc \in \mathbb{R} of the identity is fixed:

f *(1)=1. f^\ast(1) = 1 \,.

All the remaining elements are proportional to ϵ\epsilon, and hence are nilpotent. However, by the homomorphism property of an algebra homomorphism it follows that it must send nilpotent elements ϵ\epsilon to nilpotent elements f(ϵ)f(\epsilon), because

(f *(ϵ)) k+1 =f *(ϵ k+1) =f *(0) =0 \begin{aligned} \left(f^\ast(\epsilon)\right)^{k+1} & = f^\ast\left( \epsilon^{k+1}\right) \\ & = f^\ast(0) \\ & = 0 \end{aligned}

But the only nilpotent element in C ( n)C^\infty(\mathbb{R}^n) is the zero element, and hence it follows that

f *(ϵ)=0. f^\ast(\epsilon) = 0 \,.

Thus f *f^\ast as above is uniquely fixed.

Example

(synthetic tangent vector fields)

Let n\mathbb{R}^n be a Cartesian space (def. ), regarded as an infinitesimally thickened Cartesian space (def. ) and consider 𝔻 1Spec(([[ϵ]])/(ϵ 2))\mathbb{D}^1 \coloneqq Spec( (\mathbb{R}[ [\epsilon] ])/(\epsilon^2) ) the first order infinitesimal line from example .

Then homomorphisms of infinitesimally thickened Cartesian spaces of the form

n×𝔻 1 v˜ n pr 1 id n \array{ \mathbb{R}^n \times \mathbb{D}^1 && \overset{\tilde v}{\longrightarrow} && \mathbb{R}^n \\ & {}_{\mathllap{pr_1}}\searrow && \swarrow_{\mathrlap{id}} \\ && \mathbb{R}^n }

hence smoothly XX-parameterized collections of morphisms

v˜ x:𝔻 1 n \tilde v_x \;\colon\; \mathbb{D}^1 \longrightarrow \mathbb{R}^n

which send the unique base point (𝔻 1)=*\Re(\mathbb{D}^1) = \ast (example ) to x nx \in \mathbb{R}^n, are in natural bijection with tangent vector fields vΓ n(T n)v \in \Gamma_{\mathbb{R}^n}(T \mathbb{R}^n) (example ).

Proof

By definition, the morphisms in question are dually \mathbb{R}-algebra homomorphisms of the form

(C ( n)ϵC ( n))C ( n) (C^\infty(\mathbb{R}^n) \oplus \epsilon C^\infty(\mathbb{R}^n)) \longleftarrow C^\infty(\mathbb{R}^n)

which are the identity modulo ϵ\epsilon. Such a morphism has to take any function fC ( n)f \in C^\infty(\mathbb{R}^n) to

f+(f)ϵ f + (\partial f) \epsilon

for some smooth function (f)C ( n)(\partial f) \in C^\infty(\mathbb{R}^n). The condition that this assignment makes an algebra homomorphism is equivalent to the statement that for all f 1,f 2C ( n)f_1,f_2 \in C^\infty(\mathbb{R}^n) we have

(f 1f 2+((f 1f 2))ϵ)=(f 1+(f 1)ϵ)(f 2+(f 2)ϵ). (f_1 f_2 + (\partial (f_1 f_2))\epsilon ) \;=\; (f_1 + (\partial f_1) \epsilon) \cdot (f_2 + (\partial f_2) \epsilon) \,.

Multiplying this out and using that ϵ 2=0\epsilon^2 = 0, this is equivalent to

(f 1f 2)=(f 1)f 2+f 1(f 2). \partial(f_1 f_2) = (\partial f_1) f_2 + f_1 (\partial f_2) \,.

This in turn means equivalently that :C ( n)C ( n)\partial\colon C^\infty(\mathbb{R}^n)\to C^\infty(\mathbb{R}^n) is a derivation.

With this the statement follows with the third magic algebraic property of smooth functions (prop. ): derivations of smooth functions are vector fields.

We need to consider infinitesimally thickened spaces more general than the thickenings of just Cartesian spaces in def. . But just as Cartesian spaces (def. ) serve as the local test geometries to induce the general concept of diffeological spaces and smooth sets (def. ), so using infinitesimally thickened Cartesian spaces as test geometries immediately induces the corresponding generalization of smooth sets with infinitesimals:

Definition

(formal smooth set)

A formal smooth set XX is

  1. for each infinitesimally thickened Cartesian space n×Spec(A)\mathbb{R}^n \times Spec(A) (def. ) a set

    X( n×Spec(A))Set X(\mathbb{R}^n \times Spec(A)) \in Set

    to be called the set of smooth functions or plots from n×Spec(A)\mathbb{R}^n \times Spec(A) to XX;

  2. for each smooth function f: n 1×Spec(A 1) n 2×Spec(A 2)f \;\colon\; \mathbb{R}^{n_1} \times Spec(A_1) \longrightarrow \mathbb{R}^{n_2} \times Spec(A_2) between infinitesimally thickened Cartesian spaces a choice of function

    f *:X( n 2×Spec(A 2))X( n 1×Spec(A 1)) f^\ast \;\colon\; X(\mathbb{R}^{n_2} \times Spec(A_2)) \longrightarrow X(\mathbb{R}^{n_1} \times Spec(A_1))

    to be thought of as the precomposition operation

    ( n 2ΦX)f *( n 1×Spec(A 1)f n 2×Spec(A 2)ΦX) \left( \mathbb{R}^{n_2} \overset{\Phi}{\longrightarrow} X \right) \;\overset{f^\ast}{\mapsto}\; \left( \mathbb{R}^{n_1}\times Spec(A_1) \overset{f}{\to} \mathbb{R}^{n_2} \times Spec(A_2) \overset{\Phi}{\to} X \right)

such that

  1. (functoriality)

    1. If id n×Spec(A): n×Spec(A) n×Spec(A)id_{\mathbb{R}^n \times Spec(A)} \;\colon\; \mathbb{R}^n \times Spec(A) \to \mathbb{R}^n \times Spec(A) is the identity function on n×Spec(A)\mathbb{R}^n \times Spec(A), then (id n×Spec(A)) *:X( n×Spec(A))X( n×Spec(A))\left(id_{\mathbb{R}^n \times Spec(A)}\right)^\ast \;\colon\; X(\mathbb{R}^n \times Spec(A)) \to X(\mathbb{R}^n \times Spec(A)) is the identity function on the set of plots X( n×Spec(A))X(\mathbb{R}^n \times Spec(A));

    2. If n 1×Spec(A 1)f n 2×Spec(A 2)g n 3×Spec(A 3)\mathbb{R}^{n_1}\times Spec(A_1) \overset{f}{\to} \mathbb{R}^{n_2} \times Spec(A_2) \overset{g}{\to} \mathbb{R}^{n_3} \times Spec(A_3) are two composable smooth functions between infinitesimally thickened Cartesian spaces, then pullback of plots along them consecutively equals the pullback along the composition:

      f *g *=(gf) * f^\ast \circ g^\ast = (g \circ f)^\ast

      i.e.

      X( n 2×Spec(A 2)) f * g * X( n 1×Spec(A 1)) (gf) * X( n 3×Spec(A 3)) \array{ && X(\mathbb{R}^{n_2} \times Spec(A_2)) \\ & {}^{\mathllap{f^\ast}}\swarrow && \nwarrow^{\mathrlap{g^\ast}} \\ X(\mathbb{R}^{n_1} \times Spec(A_1)) && \underset{ (g \circ f)^\ast }{\longleftarrow} && X(\mathbb{R}^{n_3} \times Spec(A_3)) }
  2. (gluing)

    If {U i×Spec(A)f i×id Spec(A) n×Spec(A)} iI\{ U_i \times Spec(A) \overset{f_i \times id_{Spec(A)}}{\to} \mathbb{R}^n \times Spec(A)\}_{i \in I} is such that

    {U if i n} iI\{ U_i \overset{f_i }{\to} \mathbb{R}^n \}_{i \in I}

    in a differentiably good open cover (def. ) then the function which restricts n×Spec(A)\mathbb{R}^n \times Spec(A)-plots of XX to a set of U i×Spec(A)U_i \times Spec(A)-plots

    X( n×Spec(A))((f i) *) iIiIX(U i×Spec(A)) X(\mathbb{R}^n \times Spec(A)) \overset{( (f_i)^\ast )_{i \in I} }{\hookrightarrow} \underset{i \in I}{\prod} X(U_i \times Spec(A))

    is a bijection onto the set of those tuples (Φ iX(U i)) iI(\Phi_i \in X(U_i))_{i \in I} of plots, which are “matching families” in that they agree on intersections:

    ϕ i| ((U iU j)×Spec(A)=ϕ j| (U iU j)×Spec(A) \phi_i\vert_{((U_i \cap U_j) \times Spec(A)} = \phi_j \vert_{(U_i \cap U_j)\times Spec(A)}

    i.e.

    (U iU j)×Spec(A) U i×Spec(A) U j×Spec(A) Φ i Φ j X \array{ && (U_i \cap U_j) \times Spec(A) \\ & \swarrow && \searrow \\ U_i\times Spec(A) && && U_j \times Spec(A) \\ & {}_{\mathrlap{\Phi_i}}\searrow && \swarrow_{\mathrlap{\Phi_j}} \\ && X }

Finally, given X 1X_1 and X 2X_2 two formal smooth sets, then a smooth function between them

f:X 1X 2 f \;\colon\; X_1 \longrightarrow X_2

is

  • for each infinitesimally thickened Cartesian space n×Spec(A)\mathbb{R}^n \times Spec(A) (def. ) a function

    f *( n×Spec(A)):X 1( n×Spec(A))X 2( n×Spec(A)) f_\ast(\mathbb{R}^n \times Spec(A)) \;\colon\; X_1(\mathbb{R}^n \times Spec(A)) \longrightarrow X_2(\mathbb{R}^n \times Spec(A))

such that

  • for each smooth function g: n 1×Spec(A 1) n 2×Spec(A 2)g \colon \mathbb{R}^{n_1} \times Spec(A_1) \to \mathbb{R}^{n_2} \times Spec(A_2) between infinitesimally thickened Cartesian spaces we have

    g 2 *f *( n 2×Spec(A 2))=f *( n 1×Spec(A 1))g 1 * g^\ast_2 \circ f_\ast(\mathbb{R}^{n_2} \times Spec(A_2)) = f_\ast(\mathbb{R}^{n_1} \times Spec(A_1)) \circ g^\ast_1

    i.e.

    X 1( n 2×Spec(A 2)) f *( n 2×Spec(A 2)) X 2( n 2×Spec(A 2)) g 1 * g 2 * X 1( n 1×Spec(A 1)) f *( n 1) X 2( n 1×Spec(A 1)) \array{ X_1(\mathbb{R}^{n_2} \times Spec(A_2)) &\overset{f_\ast(\mathbb{R}^{n_2}\times Spec(A_2) )}{\longrightarrow}& X_2(\mathbb{R}^{n_2} \times Spec(A_2)) \\ \mathllap{g_1^\ast}\downarrow && \downarrow\mathrlap{g^\ast_2} \\ X_1(\mathbb{R}^{n_1} \times Spec(A_1)) &\underset{f_\ast(\mathbb{R}^{n_1})}{\longrightarrow}& X_2(\mathbb{R}^{n_1} \times Spec(A_1)) }

(Dubuc 79)

Basing infinitesimal geometry on formal smooth sets is an instance of the general approach to geometry called functorial geometry or topos theory. For more background on this see at geometry of physics – manifolds and orbifolds.

We have the evident generalization of example to smooth geometry with infinitesimals:

Example

(infinitesimally thickened Cartesian spaces are formal smooth sets)

For XX an infinitesimally thickened Cartesian space (def. ), it becomes a formal smooth set according to def. by taking its plots out of some n×𝔻\mathbb{R}^n \times \mathbb{D} to be the homomorphism of infinitesimally thickened Cartesian spaces:

X( n×𝔻)Hom FormalCartSp( n×𝔻,X). X(\mathbb{R}^n \times \mathbb{D}) \;\coloneqq\; Hom_{FormalCartSp}( \mathbb{R}^n \times \mathbb{D}, X ) \,.

(Stated more abstractly, this is an instance of the Yoneda embedding over a subcanonical site.)

Example

(smooth sets are formal smooth sets)

Let XX be a smooth set (def. ). Then XX becomes a formal smooth set (def. ) by declaring the set of plots X( n×𝔻)X(\mathbb{R}^n \times \mathbb{D}) over an infinitesimally thickened Cartesian space (def. ) to be equivalence classes of pairs

n×𝔻 k,AA kX \mathbb{R}^n \times \mathbb{D} \longrightarrow \mathbb{R}^{k} \,, \phantom{AA} \mathbb{R}^k \longrightarrow X

of a morphism of infinitesimally thickened Cartesian spaces and of a plot of XX, as shown, subject to the equivalence relation which identifies two such pairs if there exists a smooth function f: k kf \colon \mathbb{R}^k \to \mathbb{R}^{k'} such that

n×𝔻 k f k k f k X \array{ && \mathbb{R}^n \times \mathbb{D} \\ & \swarrow && \searrow \\ \mathbb{R}^k && \overset{f}{\longrightarrow} && \mathbb{R}^{k'} \\ \mathbb{R}^k && \underset{f}{\longrightarrow} && \mathbb{R}^{k'} \\ & \searrow && \swarrow \\ && X }

Stated more abstractly this says that XX as a formal smooth set is the left Kan extension (see this example) of XX as a smooth set along the functor that includes Cartesian spaces (def. ) into infinitesimally thickened Cartesian spaces (def. ).

Definition

(reduction and infinitesimal shape)

For n×𝔻\mathbb{R}^n \times \mathbb{D} an infinitesimally thickened Cartesian space (def. ) we say that the underlying ordinary Cartesian space n\mathbb{R}^n (def. ) is its reduction

( n×𝔻) n. \Re\left( \mathbb{R}^n \times \mathbb{D} \right) \;\coloneqq\; \mathbb{R}^n \,.

There is the canonical inclusion morphism

( n×𝔻)= nAAAA n×𝔻 \Re\left( \mathbb{R}^n \times \mathbb{D} \right) = \mathbb{R}^n \overset{\phantom{AAAA}}{\hookrightarrow} \mathbb{R}^n \times \mathbb{D}

which dually corresponds to the homomorphism of commutative algebras

C ( n)C ( n) A C^\infty(\mathbb{R}^n) \longleftarrow C^\infty(\mathbb{R}^n) \otimes_{\mathbb{R}} A

which is the identity on all smooth functions fC ( n)f \in C^\infty(\mathbb{R}^n) and is zero on all elements aVAa \in V \subset A in the nilpotent ideal of AA (as in example ).

Given any formal smooth set XX, we say that its infinitesimal shape or de Rham shape (also: de Rham stack) is the formal smooth set X\Im X (def. ) defined to have as plots the reductions of the plots of XX, according to the above:

(X)(U)X((U)). (\Im X)( U ) \;\coloneqq\: X(\Re(U)) \,.

There is a canonical morphism of formal smooth set

η X:XX \eta_X \;\colon\; X \longrightarrow \Im X

which takes a plot

U= n×𝔻fX U = \mathbb{R}^n \times \mathbb{D} \overset{f}{\longrightarrow} X

to the composition

n n×𝔻fX \mathbb{R}^n \hookrightarrow \mathbb{R}^n \times \mathbb{D} \overset{f}{\hookrightarrow} X

regarded as a plot of X\Im X.

Example

(mapping space out of an infinitesimally thickened Cartesian space)

Let XX be an infinitesimally thickened Cartesian space (def. ) and let YY be a formal smooth set (def. ). Then the mapping space

[X,Y]FormalSmoothSet [X,Y] \;\in\; FormalSmoothSet

of smooth functions from XX to YY is the formal smooth set whose UU-plots are the morphisms of formal smooth sets from the Cartesian product of infinitesimally thickened Cartesian spaces U×XU \times X to YY, hence the U×XU \times X-plots of YY:

[X,Y](U)Y(U×X). [X,Y](U) \;\coloneqq\; Y(U \times X) \,.
Example

(synthetic tangent bundle)

Let X nX \coloneqq \mathbb{R}^n be a Cartesian space (def. ) regarded as an infinitesimally thickened Cartesian space () and thus regarded as a formal smooth set (def. ) by example . Consider the infinitesimal line

𝔻 1 1 \mathbb{D}^1 \hookrightarrow \mathbb{R}^1

from example . Then the mapping space [𝔻 1,X][\mathbb{D}^1, X] (example ) is the total space of the tangent bundle TXT X (example ). Moreover, under restriction along the reduction *𝔻 1\ast \longrightarrow \mathbb{D}^1, this is the full tangent bundle projection, in that there is a natural isomorphism of formal smooth sets of the form

TX [𝔻 1,X] tb [*𝔻 1,X] X [*,X] \array{ T X &\simeq& [\mathbb{D}^1, X] \\ {}^{\mathllap{tb}}\downarrow && \downarrow^{\mathrlap{ [ \ast \to \mathbb{D}^1, X ] }} \\ X &\simeq& [\ast, X] }

In particular this implies immediately that smooth sections (def. ) of the tangent bundle

[𝔻 1,X] TX v X = X \array{ && [\mathbb{D}^1, X] & \simeq T X \\ & {}^{\mathllap{v}}\nearrow & \downarrow \\ X &=& X }

are equivalently morphisms of the form

X v˜ id X×𝔻 1 pr 1 X \array{ && X \\ & {}^{\mathllap{\tilde v}}\nearrow & \downarrow^{\mathrlap{id}} \\ X \times \mathbb{D}^1 &\underset{pr_1}{\longrightarrow}& X }

which we had already identified with tangent vector fields (def. ) in example .

Proof

This follows by an analogous argument as in example , using the Hadamard lemma.

While in infinitesimally thickened Cartesian spaces (def. ) only infinitesimals to any finite order may exist, in formal smooth sets (def. ) we may find infinitesimals to any arbitrary finite order:

Example

(infinitesimal neighbourhood)

Let XX be a formal smooth sets (def. ) YXY \hookrightarrow X a sub-formal smooth set. Then the infinitesimal neighbourhood to arbitrary infinitesimal order of YY in XX is the formal smooth set N XYN_X Y whose plots are those plots of XX

n×Spec(A)fX \mathbb{R}^n \times Spec(A) \overset{f}{\longrightarrow} X

such that their reduction (def. )

n n×Spec(A)fX \mathbb{R}^n \hookrightarrow \mathbb{R}^n \times Spec(A) \overset{f}{\longrightarrow} X

factors through a plot of YY.

This allows to grasp the restriction of field histories to the infinitesimal neighbourhood of a submanifold of spacetime, which will be crucial for the discussion of phase spaces below.

Definition

(field histories on infinitesimal neighbourhood of submanifold of spacetime)

Let EfbΣE \overset{fb}{\to} \Sigma be a field bundle (def. ) and let SΣS \hookrightarrow \Sigma be a submanifold of spacetime.

We write N Σ(S)ΣN_\Sigma(S) \hookrightarrow \Sigma for its infinitesimal neighbourhood in Σ\Sigma (def. ).

Then the set of field histories restricted to SS, to be denoted

(25)Γ S(E)Γ N Σ(S)(E| N ΣS)H \Gamma_{S}(E) \coloneqq \Gamma_{N_\Sigma(S)}( E\vert_{N_\Sigma S} ) \in \mathbf{H}

is the set of section of EE restricted to the infinitesimal neighbourhood N Σ(S)N_\Sigma(S) (example ).

\,

We close the discussion of infinitesimal differential geometry by explaining how we may recover the concept of smooth manifolds inside the more general formal smooth sets (def./prop. below). The key point is that the presence of infinitesimals in the theory allows an intrinsic definition of local diffeomorphisms/formally étale morphism (def. and example below). It is noteworthy that the only role this concept plays in the development of field theory below is that smooth manifolds admit triangulations by smooth singular simplices (def. ) so that the concept of fiber integration of differential forms is well defined over manifolds.

Definition

(local diffeomorphism of formal smooth sets)

Let X,YX,Y be formal smooth sets (def. ). Then a morphism between them is called a local diffeomorphism or formally étale morphism, denoted

f:XetY, f \;\colon\; X \overset{et}{\longrightarrow} Y \,,

if ff if for each infinitesimally thickened Cartesian space (def. ) n×𝔻\mathbb{R}^n \times \mathbb{D} we have a natural identification between the n×𝔻\mathbb{R}^n \times \mathbb{D}-plots of XX with those nn×𝔻\mathbb{R}^n n\times \mathbb{D}-plots of YY whose reduction (def. ) comes from an n\mathbb{R}^n-plot of XX, hence if we have a natural fiber product of sets of plots

X( n×𝔻)Y( n×𝔻)× fY( n)X( n) X(\mathbb{R}^n \times \mathbb{D}) \;\simeq\; Y(\mathbb{R}^n \times \mathbb{D}) \underset{Y(\mathbb{R}^n)}{\times^f} X(\mathbb{R}^n)

i. e.

X( n×𝔻) Y( n×𝔻) (pb) X( n) Y( n) \array{ && X(\mathbb{R}^n \times \mathbb{D}) \\ & \swarrow && \searrow \\ Y(\mathbb{R}^n \times \mathbb{D}) && \text{(pb)} && X(\mathbb{R}^n) \\ & \searrow && \swarrow \\ && Y(\mathbb{R}^n ) }

for all infinitesimally thickened Cartesian spaces n×𝔻\mathbb{R}^n \times \mathbb{D}.

Stated more abstractly, this means that the naturality square of the unit of the infinitesimal shape \Im (def. ) is a pullback square

X η X X f (pb) f Y η Y Y \array{ X &\overset{\eta_X}{\longrightarrow}& \Im X \\ {}^{\mathllap{f}}\downarrow &\text{(pb)}& \downarrow^{\mathrlap{\Im f}} \\ Y &\underset{\eta_Y}{\longrightarrow}& \Im Y }

(Khavkine-Schreiber 17, def. 3.1)

Example

(local diffeomorphism between Cartesian spaces from the general definition)

For X,YCartSpX,Y \in CartSp two ordinary Cartesian spaces (def. ), regarded as formal smooth sets by example then a morphism f:XYf \colon X \to Y between them is a local diffeomorphism in the general sense of def. precisely if it is so in the ordinary sense of def. .

(Khavkine-Schreiber 17, prop. 3.2)

Definition/Proposition

(smooth manifolds)

A smooth manifold XX of dimension nn \in \mathbb{N} is

such that

  1. there exists an indexed set { nϕ iX} iI\{ \mathbb{R}^n \overset{\phi_i}{\to} X\}_{i \in I} of morphisms of formal smooth sets (def. ) from Cartesian spaces n\mathbb{R}^n (def. ) (regarded as formal smooth sets via example , example and example ) into XX, (regarded as a formal smooth set via example and example ) such that

    1. every point xX sx \in X_s is in the image of at least one of the ϕ i\phi_i;

    2. every ϕ i\phi_i is a local diffeomorphism according to def. ;

  2. the final topology induced by the set of plots of XX makes X sX_s a paracompact Hausdorff space.

(Khavkine-Schreiber 17, example 3.4)

For more on smooth manifolds from the perspective of formal smooth sets see at geometry of physics – manifolds and orbifolds.

\,

fermion fields and supergeometry

Field theories of interest crucially involve fermionic fields (def. below), such as the Dirac field (example below), which are subject to the “Pauli exclusion principle”, a key reason for the stability of matter. Mathematically this principle means that these fields have field bundles whose field fiber is not an ordinary manifold, but an odd-graded supermanifold (more on this in remark and remark below).

This “supergeometry” is an immediate generalization of the infinitesimal geometry above, where now the infinitesimal elements in the algebra of functions may be equipped with a grading: one speaks of superalgebra.

The “super”-terminology for something as down-to-earth as the mathematical principle behind the stability of matter may seem unfortunate. For better or worse, this terminology has become standard since the middle of the 20th century. But the concept that today is called supercommutative superalgebra was in fact first considered by Grassmann 1844 who got it right (“Ausdehnungslehre”) but apparently was too far ahead of his time and remained unappreciated.

Beware that considering supergeometry does not necessarily involve considering “supersymmetry”. Supergeometry is the geometry of fermion fields (def. below), and that all matter fields in the observable universe are fermionic has been experimentally established since the Stern-Gerlach experiment in 1922. Supersymmetry, on the other hand, is a hypothetical extension of spacetime-symmetry within the context of supergeometry. Here we do not discuss supersymmetry; for details see instead at geometry of physics – supersymmetry.

Definition

(supercommutative superalgebra)

A real /2\mathbb{Z}/2-graded algebra or superalgebra is an associative algebra AA over the real numbers together with a direct sum decomposition of its underlying real vector space

A A evenA odd, A \simeq_{\mathbb{R}} A_{even} \oplus A_{odd} \,,

such that the product in the algebra respects the multiplication in the cyclic group of order 2 /2={even,odd}\mathbb{Z}/2 = \{even, odd\}:

A evenA even A oddA odd}A evenAAAAA oddA even A evenA odd}A odd. \left. \array{ A_{even} \cdot A_{even} \\ A_{odd} \cdot A_{odd} } \right\} \subset A_{even} \phantom{AAAA} \left. \array{ A_{odd} \cdot A_{even} \\ A_{even} \cdot A_{odd} } \right\} \subset A_{odd} \,.

This is called a supercommutative superalgebra if for all elements a 1,a 2Aa_1, a_2 \in A which are of homogeneous degree |a i|/2={even,odd}{\vert a_i \vert} \in \mathbb{Z}/2 = \{even, odd\} in that

a iA |a i|A a_i \in A_{{\vert a_i\vert}} \subset A

we have

a 1a 2=(1) |a 1||a 2|a 2a 1. a_1 \cdot a_2 = (-1)^{{\vert a_1 \vert \vert a_2 \vert}} a_2 \cdot a_1 \,.

A homomorphism of superalgebras

f:AA f \;\colon\; A \longrightarrow A'

is a homomorphism of associative algebras over the real numbers such that the /2\mathbb{Z}/2-grading is respected in that

f(A even)A evenAAAAAf(A odd)A odd. f(A_{even}) \subset A'_{even} \phantom{AAAAA} f(A_{odd}) \subset A'_{odd} \,.

For more details on superalgebra see at geometry of physics – superalgebra.

Example

(basic examples of supercommutative superalgebras)

Basic examples of supercommutative superalgebras (def. ) include the following:

  1. Every commutative algebra AA becomes a supercommutative superalgebra by declaring it to be all in even degree: A=A evenA = A_{even}.

  2. For VV a finite dimensional real vector space, then the Grassmann algebra A V *A \coloneqq \wedge^\bullet_{\mathbb{R}} V^\ast is a supercommutative superalgebra with A even evenV *A_{even} \coloneqq \wedge^{even} V^\ast and A odd oddV *A_{odd} \coloneqq \wedge^{odd} V^\ast.

    More explicitly, if V= sV = \mathbb{R}^s is a Cartesian space with canonical dual coordinates (θ i) i=1 s(\theta^i)_{i = 1}^s then the Grassmann algebra ( s) *\wedge^\bullet (\mathbb{R}^s)^\ast is the real algebra which is generated from the θ i\theta^i regarded in odd degree and hence subject to the relation

    θ iθ j=θ jθ i. \theta^i \cdot \theta^j = - \theta^j \cdot \theta^i \,.

    In particular this implies that all the θ i\theta^i are infinitesimal (def. ):

    θ iθ i=0. \theta^i \cdot \theta^i = 0 \,.
  3. For A 1A_1 and A 2A_2 two supercommutative superalgebras, there is their tensor product supercommutative superalgebra A 1 A 2A_1 \otimes_{\mathbb{R}} A_2. For example for XX a smooth manifold with ordinary algebra of smooth functions C (X)C^\infty(X) regarded as a supercommutative superalgebra by the first example above, the tensor product with a Grassmann algebra (second example above) is the supercommutative superalgebta

    C (X) (( s)*) C^\infty(X) \otimes_{\mathbb{R}} \wedge^\bullet((\mathbb{R}^s)\ast)

    whose elements may uniquely be expanded in the form

    f+f iθ i+f ijθ iθ j+f ijkθ iθ jθ k++f i 1i sθ i 1θ i s, f + f_i \theta^i + f_{i j} \theta^i \theta^j + f_{i j k} \theta^i \theta^j \theta^k + \cdots + f_{i_1 \cdots i_s} \theta^{i_1} \cdots \theta^{i_s} \,,

    where the f i 1i kC (X)f_{i_1 \cdots i_k} \in C^\infty(X) are smooth functions on XX which are skew-symmetric in their indices.

The following is the direct super-algebraic analog of the definition of infinitesimally thickened Cartesian spaces (def. ):

Definition

(super Cartesian space)

A superpoint Spec(A)Spec(A) is represented by a super-commutative superalgebra AA (def. ) which as a /2\mathbb{Z}/2-graded vector space (super vector space) is a direct sum

A 1V A \simeq_{\mathbb{R}} \langle 1 \rangle \oplus V

of the 1-dimensional even vector space 1=\langle 1 \rangle = \mathbb{R} of multiples of 1, with a finite dimensional super vector space VV that is a nilpotent ideal in AA in that for each element aVa \in V there exists a natural number nn \in \mathbb{N} such that

a n+1=0. a^{n+1} = 0 \,.

More generally, a super Cartesian space n×Spec(A)\mathbb{R}^n \times Spec(A) is represented by a super-commutative algebra C ( n)AAlgC^\infty(\mathbb{R}^n) \otimes A \in \mathbb{R} Alg which is the tensor product of algebras of the algebra of smooth functions C ( n)C^\infty(\mathbb{R}^n) on an actual Cartesian space of some dimension nn, with an algebra of functions A 1VA \simeq_{\mathbb{R}} \langle 1\rangle \oplus V of a superpoint (example ).

Specifically, for ss \in \mathbb{N}, there is the superpoint

(26) 0|sSpec( ( s) *) \mathbb{R}^{0 \vert s} \;\coloneqq\; Spec\left( \wedge^\bullet (\mathbb{R}^s)^\ast \right)

whose algebra of functions is by definition the Grassmann algebra on ss generators (θ i) i=1 s(\theta^i)_{i = 1}^s in odd degree (example ).

We write

b|s b× 0|s = b×Spec( ( s) *) =Spec(C ( b) ( s) *) \begin{aligned} \mathbb{R}^{b\vert s} & \coloneqq \mathbb{R}^b \times \mathbb{R}^{0 \vert s} \\ & = \mathbb{R}^b \times Spec( \wedge^\bullet(\mathbb{R}^s)^\ast ) \\ & = Spec\left( C^\infty(\mathbb{R}^b) \otimes_{\mathbb{R}} \wedge^\bullet (\mathbb{R}^s)^\ast \right) \end{aligned}

for the corresponding super Cartesian spaces whose algebra of functions is as in example .

We say that a smooth function between two super Cartesian spaces

n 1×Spec(A 1)f n 2×Spec(A 2) \mathbb{R}^{n_1} \times Spec(A_1) \overset{f}{\longrightarrow} \mathbb{R}^{n_2} \times Spec(A_2)

is by definition dually a homomorphism of supercommutative superalgebras (def. ) of the form

C ( n 1)A 1f *C ( n 2)A 2. C^\infty(\mathbb{R}^{n_1}) \otimes A_1 \overset{f^\ast}{\longleftarrow} C^\infty(\mathbb{R}^{n_2}) \otimes A_2 \,.
Example

(superpoint induced by a finite dimensional vector space)

Let VV be a finite dimensional real vector space. With V *V^\ast denoting its dual vector space write V *\wedge^\bullet V^\ast for the Grassmann algebra that it generates. This being a supercommutative algebra, it defines a superpoint (def. ).

We denote this superpoint by

V odd 0|dim(V). V_{odd} \simeq \mathbb{R}^{0 \vert dim(V)} \,.

All the differential geometry over Cartesian space that we discussed above generalizes immediately to super Cartesian spaces (def. ) if we stricly adhere to the super sign rule which says that whenever two odd-graded elements swap places, a minus sign is picked up. In particular we have the following generalization of the de Rham complex on Cartesian spaces discussed above.

Definition

(super differential forms on super Cartesian spaces)

For b|s\mathbb{R}^{b\vert s} a super Cartesian space (def. ), hence the formal dual of the supercommutative superalgebra of the form

C ( b|s)=C ( b) s C^\infty(\mathbb{R}^{b\vert s}) \;=\; C^\infty(\mathbb{R}^b) \otimes_{\mathbb{R}} \wedge^\bullet \mathbb{R}^s

with canonical even-graded coordinate functions (x i) i=1 b(x^i)_{i = 1^b} and odd-graded coordinate functions (θ j) j=1 s(\theta^j)_{j = 1}^s.

Then the de Rham complex of super differential forms on b|s\mathbb{R}^{b\vert s} is, in super-generalization of def. , the ×(/2)\mathbb{Z} \times (\mathbb{Z}/2)-graded commutative algebra

Ω ( b|s)C ( b|s) dx 1,,dx b,dθ 1,,dθ s \Omega^\bullet(\mathbb{R}^{b|s}) \;\coloneqq\; C^\infty(\mathbb{R}^{b|s}) \otimes_{\mathbb{R}} \wedge^\bullet \langle d x^1, \cdots, d x^b, \; d \theta^1, \cdots, d\theta^s \rangle

which is generated over C ( b|s)C^\infty(\mathbb{R}^{b\vert s}) from new generators

dx ideg=(1,even)AAAAAdθ jdeg=(1,odd) \underset{ \text{deg} = (1,even) }{\underbrace{ d x^i }} \phantom{AAAAA} \underset{ \text{deg} = (1,odd) }{ \underbrace{ d \theta^j } }

whose differential is defined in degree-0 by

dffx idx i+fθ jdθ j d f \;\coloneqq\; \frac{\partial f}{\partial x^i} d x^i + \frac{\partial f}{\partial \theta^j} d \theta^j

and extended from there as a bigraded derivation of bi-degree (1,even)(1,even), in super-generalization of def. .

Accordingly, the operation of contraction with tangent vector fields (def. ) has bi-degree (1,σ)(-1,\sigma) if the tangent vector has super-degree σ\sigma:

generatorbi-degree
AAx a\phantom{AA} x^a(0,even)
AAθ α\phantom{AA} \theta^\alpha(0,odd)
AAdx a\phantom{AA} dx^a(1,even)
AAdθ α\phantom{AA} d\theta^\alpha(1,odd)
derivationbi-degree
AAd\phantom{AA} d(1,even)
AAι x a\phantom{AA}\iota_{\partial x^a}(-1, even)
AAι θ α\phantom{AA}\iota_{\partial \theta^\alpha}(-1,odd)

This means that if αΩ ( b|s)\alpha \in \Omega^\bullet(\mathbb{R}^{b\vert s}) is in bidegree (n α,σ α)(n_\alpha, \sigma_\alpha), and βΩ ( b|σ)\beta \in \Omega^\bullet(\mathbb{R}^{b \vert \sigma}) is in bidegree (n β,σ β)(n_\beta, \sigma_\beta), then

αβ=(1) n αn β+σ ασ ββα. \alpha \wedge \beta \; = \; (- 1)^{n_\alpha n_\beta + \sigma_\alpha \sigma_\beta} \; \beta \wedge \alpha \,.

Hence there are two contributions to the sign picked up when exchanging two super-differential forms in the wedge product:

  1. there is a “cohomological sign” which for commuting an n 1n_1-forms past an n 2n_2-form is (1) n 1n 2(-1)^{n_1 n_2};

  2. in addition there is a “super-grading” sign which for commuting a σ 1\sigma_1-graded coordinate function past a σ 2\sigma_2-graded coordinate function (possibly under the de Rham differential) is (1) σ 1σ 2(-1)^{\sigma_1 \sigma_2}.

For example:

x a 1(dx a 2)=+(dx a 2)x a 1 x^{a_1} (dx^{a_2}) \;=\; + (dx^{a_2}) x^{a_1}
θ α(dx a)=+(dx a)θ α \theta^\alpha (dx^a) \;=\; + (dx^a) \theta^\alpha
θ α 1(dθ α 2)=(dθ α 2)θ α 1 \theta^{\alpha_1} (d\theta^{\alpha_2}) \;=\; - (d\theta^{\alpha_2}) \theta^{\alpha_1}
dx a 1dx a 2=dx a 2dx a 1 dx^{a_1} \wedge d x^{a_2} \;=\; - d x^{a_2} \wedge d x^{a_1}
dx adθ α=dθ αdx a dx^a \wedge d \theta^{\alpha} \;=\; - d\theta^{\alpha} \wedge d x^a
dθ α 1dθ α 2=+dθ α 2dθ α 1 d\theta^{\alpha_1} \wedge d \theta^{\alpha_2} \;=\; + d\theta^{\alpha_2} \wedge d \theta^{\alpha_1}

(e.g. Castellani-D’Auria-Fré 91 (II.2.106) and (II.2.109), Deligne-Freed 99, section 6)

Beware that there is also another sign rule for super differential forms used in the literature. See at signs in supergeometry for further discussion.

\,

It is clear now by direct analogy with the definition of formal smooth sets (def. ) what the corresponding supergeometric generalization is. For definiteness we spell it out yet once more:

Definition

(super smooth set)

A super smooth set XX is

  1. for each super Cartesian space n×Spec(A)\mathbb{R}^n \times Spec(A) (def. ) a set

    X( n×Spec(A))Set X(\mathbb{R}^n \times Spec(A)) \in Set

    to be called the set of smooth functions or plots from n×Spec(A)\mathbb{R}^n \times Spec(A) to XX;

  2. for each smooth function f: n 1×Spec(A 1) n 2×Spec(A 2)f \;\colon\; \mathbb{R}^{n_1} \times Spec(A_1) \longrightarrow \mathbb{R}^{n_2} \times Spec(A_2) between super Cartesian spaces a choice of function

    f *:X( n 2×Spec(A 2))X( n 1×Spec(A 1)) f^\ast \;\colon\; X(\mathbb{R}^{n_2} \times Spec(A_2)) \longrightarrow X(\mathbb{R}^{n_1} \times Spec(A_1))

    to be thought of as the precomposition operation

    ( n 2ΦX)f *( n 1×Spec(A 1)f n 2×Spec(A 2)ΦX) \left( \mathbb{R}^{n_2} \overset{\Phi}{\longrightarrow} X \right) \;\overset{f^\ast}{\mapsto}\; \left( \mathbb{R}^{n_1}\times Spec(A_1) \overset{f}{\to} \mathbb{R}^{n_2} \times Spec(A_2) \overset{\Phi}{\to} X \right)

such that

  1. (functoriality)

    1. If id n×Spec(A): n×Spec(A) n×Spec(A)id_{\mathbb{R}^n \times Spec(A)} \;\colon\; \mathbb{R}^n \times Spec(A) \to \mathbb{R}^n \times Spec(A) is the identity function on n×Spec(A)\mathbb{R}^n \times Spec(A), then (id n×Spec(A)) *:X( n×Spec(A))X( n×Spec(A))\left(id_{\mathbb{R}^n \times Spec(A)}\right)^\ast \;\colon\; X(\mathbb{R}^n \times Spec(A)) \to X(\mathbb{R}^n \times Spec(A)) is the identity function on the set of plots X( n×Spec(A))X(\mathbb{R}^n \times Spec(A)).

    2. If n 1×Spec(A 1)f n 2×Spec(A 2)g n 3×Spec(A 3)\mathbb{R}^{n_1}\times Spec(A_1) \overset{f}{\to} \mathbb{R}^{n_2} \times Spec(A_2) \overset{g}{\to} \mathbb{R}^{n_3} \times Spec(A_3) are two composable smooth functions between infinitesimally thickened Cartesian spaces, then pullback of plots along them consecutively equals the pullback along the composition:

      f *g *=(gf) * f^\ast \circ g^\ast = (g \circ f)^\ast

      i.e.

      X( n 2×Spec(A 2)) f * g * X( n 1×Spec(A 1)) (gf) * X( n 3×Spec(A 3)) \array{ && X(\mathbb{R}^{n_2} \times Spec(A_2)) \\ & {}^{\mathllap{f^\ast}}\swarrow && \nwarrow^{\mathrlap{g^\ast}} \\ X(\mathbb{R}^{n_1} \times Spec(A_1)) && \underset{ (g \circ f)^\ast }{\longleftarrow} && X(\mathbb{R}^{n_3} \times Spec(A_3)) }
  2. (gluing)

    If {U i×Spec(A)f i×id Spec(A) n×Spec(A)} iI\{ U_i \times Spec(A) \overset{f_i \times id_{Spec(A)}}{\to} \mathbb{R}^n \times Spec(A)\}_{i \in I} is such that

    {U if i n} iI\{ U_i \overset{f_i }{\to} \mathbb{R}^n \}_{i \in I}

    is a differentiably good open cover (def. ) then the function which restricts n×Spec(A)\mathbb{R}^n \times Spec(A)-plots of XX to a set of U i×Spec(A)U_i \times Spec(A)-plots

    X( n×Spec(A))((f i) *) iIiIX(U i×Spec(A)) X(\mathbb{R}^n \times Spec(A)) \overset{( (f_i)^\ast )_{i \in I} }{\hookrightarrow} \underset{i \in I}{\prod} X(U_i \times Spec(A))

    is a bijection onto the set of those tuples (Φ iX(U i)) iI(\Phi_i \in X(U_i))_{i \in I} of plots, which are “matching families” in that they agree on intersections:

    ϕ i| ((U iU j)×Spec(A)=ϕ j| (U iU j)×Spec(A) \phi_i\vert_{((U_i \cap U_j) \times Spec(A)} = \phi_j \vert_{(U_i \cap U_j)\times Spec(A)}

    i.e.

    (U iU j)×Spec(A) U i×Spec(A) U j×Spec(A) Φ i Φ j X \array{ && (U_i \cap U_j) \times Spec(A) \\ & \swarrow && \searrow \\ U_i\times Spec(A) && && U_j \times Spec(A) \\ & {}_{\mathrlap{\Phi_i}}\searrow && \swarrow_{\mathrlap{\Phi_j}} \\ && X }

Finally, given X 1X_1 and X 2X_2 two super formal smooth sets, then a smooth function between them

f:X 1X 2 f \;\colon\; X_1 \longrightarrow X_2

is

  • for each super Cartesian space n×Spec(A)\mathbb{R}^n \times Spec(A) a function

    f *( n×Spec(A)):X 1( n×Spec(A))X 2( n×Spec(A)) f_\ast(\mathbb{R}^n \times Spec(A)) \;\colon\; X_1(\mathbb{R}^n \times Spec(A)) \longrightarrow X_2(\mathbb{R}^n \times Spec(A))

such that

  • for each smooth function g: n 1×Spec(A 1) n 2×Spec(A 2)g \colon \mathbb{R}^{n_1} \times Spec(A_1) \to \mathbb{R}^{n_2} \times Spec(A_2) between super Cartesian spaces we have

    g 2 *f *( n 2×Spec(A 2))=f *( n 1×Spec(A 1))g 1 * g^\ast_2 \circ f_\ast(\mathbb{R}^{n_2} \times Spec(A_2)) = f_\ast(\mathbb{R}^{n_1} \times Spec(A_1)) \circ g^\ast_1

    i.e.

    X 1( n 2×Spec(A 2)) f *( n 2×Spec(A 2)) X 2( n 2×Spec(A 2)) g 1 * g 2 * X 1( n 1×Spec(A 1)) f *( n 1) X 2( n 1×Spec(A 1)) \array{ X_1(\mathbb{R}^{n_2} \times Spec(A_2)) &\overset{f_\ast(\mathbb{R}^{n_2}\times Spec(A_2) )}{\longrightarrow}& X_2(\mathbb{R}^{n_2} \times Spec(A_2)) \\ \mathllap{g_1^\ast}\downarrow && \downarrow\mathrlap{g^\ast_2} \\ X_1(\mathbb{R}^{n_1} \times Spec(A_1)) &\underset{f_\ast(\mathbb{R}^{n_1})}{\longrightarrow}& X_2(\mathbb{R}^{n_1} \times Spec(A_1)) }

(Yetter 88)

Basing supergeometry on super formal smooth sets is an instance of the general approach to geometry called functorial geometry or topos theory. For more background on this see at geometry of physics – supergeometry.

In direct generalization of example we have:

Example

(super Cartesian spaces are super smooth sets)

Let XX be a super Cartesian space (def. ) Then it becomes a super smooth set (def. ) by declaring its plots ΦX( n×𝔻)\Phi \in X(\mathbb{R}^n \times \mathbb{D}) to the algebra homomorphisms C ( n×𝔻)C ( b|s) C^\infty(\mathbb{R}^n \times \mathbb{D}) \leftarrow C^\infty(\mathbb{R}^{b\vert s}).

Under this identification, morphisms between super Cartesian spaces are in natural bijection with their morphisms regarded as super smooth sets.

Stated more abstractly, this statement is an example of the Yoneda embedding over a subcanonical site.

Similarly, in direct generalization of prop. we have:

Proposition

(plots of a super smooth set really are the smooth functions into the smooth smooth set)

Let XX be a super smooth set (def. ). For n×𝔻\mathbb{R}^n \times \mathbb{D} any super Cartesian space (def. ) there is a natural function

Hom SmoothSet( n,X)X( n) Hom_{SmoothSet}(\mathbb{R}^n , X) \overset{\simeq}{\longrightarrow} X(\mathbb{R}^n)

from the set of homomorphisms of super smooth sets from n×𝔻\mathbb{R}^n \times \mathbb{D} (regarded as a super smooth set via example ) to XX, to the set of plots of XX over n×𝔻\mathbb{R}^n \times \mathbb{D}, given by evaluating on the identity plot id n×𝔻id_{\mathbb{R}^n \times \mathbb{D}}.

This function is a bijection.

This says that the plots of XX, which initially bootstrap XX into being as declaring the would-be smooth functions into XX, end up being the actual smooth functions into XX.

Proof

This is the statement of the Yoneda lemma over the site of super Cartesian spaces.

We do not need to consider here supermanifolds more general than the super Cartesian spaces (def. ). But for those readers familiar with the concept we include the following direct analog of the characterization of smooth manifolds according to def./prop. :

Definition/Proposition

(supermanifolds)

A supermanifold XX of dimension super-dimension (b,s)×(b,s) \in \mathbb{N} \times \mathbb{N} is

such that

  1. there exists an indexed set { b|sϕ iX} iI\{ \mathbb{R}^{b\vert s} \overset{\phi_i}{\to} X\}_{i \in I} of morphisms of super smooth sets (def. ) from super Cartesian spaces b|s\mathbb{R}^{b\vert s} (def. ) (regarded as super smooth sets via example into XX, such that

    1. for every plot n×𝔻X\mathbb{R}^n \times \mathbb{D} \to X there is a differentiably good open cover (def. ) restricted to which the plot factors through the i b|s\mathbb{R}^{b\vert s}_i;

    2. every ϕ i\phi_i is a local diffeomorphism according to def. , now with respect not just to infinitesimally thickened points, but with respect to superpoints;

  2. the bosonic part of XX is a smooth manifold according to def./prop. .

Finally we have the evident generalization of the smooth moduli space Ω \mathbf{\Omega}^\bullet of differential forms from example to supergeometry

Example

(universal smooth moduli spaces of super differential forms)

For nMn \in \mathbf{M} write

Ω nSuperSmoothSet \mathbf{\Omega}^n \;\in\; SuperSmoothSet

for the super smooth set (def. ) whose set of plots on a super Cartesian space USuperCartSpU \in SuperCartSp (def. ) is the set of super differential forms (def. ) of cohomolgical degree nn

Ω n(U)Ω n(U) \mathbf{\Omega}^n(U) \;\coloneqq\; \Omega^n(U)

and whose maps of plots is given by pullback of super differential forms.

The de Rham differential on super differential forms applied plot-wise yields a morpism of super smooth sets

(27)d:Ω nΩ n+1. d \;\colon\; \mathbf{\Omega}^n \longrightarrow \mathbf{\Omega}^{n+1} \,.

As before in def. we then define for any super smooth set XSuperSmoothSetX \in SuperSmoothSet its set of differential nn-forms to be

Ω n(X)Hom SuperSmoothSet(X,Ω n) \Omega^n(X) \;\coloneqq\; Hom_{SuperSmoothSet}(X,\mathbf{\Omega}^n)

and we define the de Rham differential on these to be given by postcomposition with (27).

\,

Definition

(bosonic fields and fermionic fields)

For Σ\Sigma a spacetime, such as Minkowski spacetime (def. ) if a fiber bundle EfbΣE \overset{fb}{\longrightarrow} \Sigma with total space a super Cartesian space (def. ) (or more generally a supermanifold, def./prop. ) is regarded as a super-field bundle (def. ), then

In components, if E=Σ×FE = \Sigma \times F is a trivial bundle with fiber a super Cartesian space (def. ) with even-graded coordinates (ϕ a)(\phi^a) and odd-graded coordinates (ψ A)(\psi^A), then the ϕ a\phi^a are called the bosonic field coordinates, and the ψ A\psi^A are called the fermionic field coordinates.

What is crucial for the discussion of field theory is the following immediate supergeometric analog of the smooth structure on the space of field histories from example :

Example

(supergeometric space of field histories)

Let EfbΣE \overset{fb}{\to} \Sigma be a super-field bundle (def. , def. ).

Then the space of sections, hence the space of field histories, is the super formal smooth set (def. )

Γ Σ(E)SuperSmoothSet \Gamma_\Sigma(E) \in SuperSmoothSet

whose plots Φ ()\Phi_{(-)} for a given Cartesian space n\mathbb{R}^n and superpoint 𝔻\mathbb{D} (def. ) with the Cartesian products U n×𝔻U \coloneqq \mathbb{R}^n \times \mathbb{D} and U×ΣU \times \Sigma regarded as super smooth sets according to example are defined to be the morphisms of super smooth set (def. )

U×Σ Φ ()() E \array{ U \times \Sigma &\overset{\Phi_{(-)}(-)}{\longrightarrow}& E }

which make the following diagram commute:

E Φ ()() fb U×Σ pr 2 Σ. \array{ && E \\ & {}^{\mathllap{\Phi_{(-)}(-)}}\nearrow & \downarrow^{\mathrlap{fb}} \\ U \times \Sigma &\underset{pr_2}{\longrightarrow}& \Sigma } \,.

Explicitly, if Σ\Sigma is a Minkowski spacetime (def. ) and E=Σ×FE = \Sigma \times F a trivial field bundle with field fiber a super vector space (example , example ) this means dually that a plot Φ ()\Phi_{(-)} of the super smooth set of field histories is a homomorphism of supercommutative superalgebras (def. )

C (U×Σ) (Φ ()()) * C (E) \array{ C^\infty(U \times \Sigma) &\overset{\left(\Phi_{(-)}(-)\right)^\ast}{\longleftarrow}& C^\infty(E) }

which make the following diagram commute:

C (E) (Φ ()()) * fb * C (U×Σ) pr 2 * C (Σ). \array{ && C^\infty(E) \\ & {}^{\mathllap{\left( \Phi_{(-)}(-) \right)^\ast }}\nearrow & \uparrow^{\mathrlap{fb^\ast}} \\ C^\infty(U \times \Sigma) &\underset{pr_2^\ast}{\longleftarrow}& C^\infty(\Sigma) } \,.

We will focus on discussing the supergeometric space of field histories (example ) of the Dirac field (def. below). This we consider below in example ; but first we discuss now some relevant basics of general supergeometry.

Example is really a special case of a general relative mapping space-construction as in example . This immediately generalizes also to the supergeometric context.

Definition

(super-mapping space out of a super Cartesian space)

Let XX be a super Cartesian space (def. ) and let YY be a super smooth set (def. ). Then the mapping space

[X,Y]SuperSmoothSet [X,Y] \;\in\; SuperSmoothSet

of super smooth functions from XX to YY is the super formal smooth set whose UU-plots are the morphisms of super smooth set from the Cartesian product of super Cartesian space U×XU \times X to YY, hence the U×XU \times X-plots of YY:

[X,Y](U)Y(U×X). [X,Y](U) \;\coloneqq\; Y(U \times X) \,.

In direct generalization of the synthetic tangent bundle construction (example ) to supergeometry we have

Definition

(odd tangent bundle)

Let XX be a super smooth set (def. ) and 0|1\mathbb{R}^{0 \vert 1} the superpoint (26) then the supergeometry-mapping space

T oddX [ 0|1,X] tb odd [* 0|1,X] X = X \array{ T_{odd} X & \coloneqq& [\mathbb{R}^{0\vert 1}, X] \\ {}^{\mathllap{tb_{odd}}}\downarrow && \downarrow^{\mathrlap{ [ \ast \to \mathbb{R}^{0 \vert 1}, X ] }} \\ X & = & X }

is called the odd tangent bundle of XX.

Example

(mapping space of superpoints)

Let VV be a finite dimensional real vector space and consider its corresponding superpoint V oddV_{odd} from exampe . Then the mapping space (def. ) out of the superpoint 0|1\mathbb{R}^{0\vert 1} (def. ) into V oddV_{odd} is the Cartesian product V odd×VV_{odd} \times V

[ 0|1,V odd]V odd×V. [\mathbb{R}^{0\vert 1}, V_{odd}] \;\simeq\; V_{odd} \times V \,.

By def. this says that V odd×VV_{odd} \times V is the “odd tangent bundle” of V oddV_{odd}.

Proof

Let UU be any super Cartesian space. Then by definition we have the following sequence of natural bijections of sets of plots

[ 0|1,V odd](U) =Hom SuperSmoothSet( 0|1×U,V odd) Hom sAlg( (V *),C (U)[θ]/(θ 2)) Hom Vect(V *,(C (U) oddC (U) evenθ) Hom Vect(V *,C (U) odd)×Hom Vect(V *,C (U) even) V odd(U)×V(U) (V odd×V)(U) \begin{aligned} \left[ \mathbb{R}^{0\vert 1}, V_{odd} \right](U) & = Hom_{SuperSmoothSet}( \mathbb{R}^{0\vert 1} \times U, V_{odd} ) \\ & \simeq Hom_{\mathbb{R}sAlg}( \wedge^\bullet(V^\ast)\,,\, C^\infty(U)[\theta]/(\theta^2) ) \\ & \simeq Hom_{\mathbb{R}Vect}( V^\ast \,,\, (C^\infty(U)_{odd} \oplus C^\infty(U)_{even}\langle \theta\rangle ) \\ & \simeq Hom_{\mathbb{R}Vect}( V^\ast\,,\, C^\infty(U)_{odd} ) \,\times\, Hom_{\mathbb{R}Vect}( V^\ast, C^\infty(U)_{even} ) \\ & \simeq V_{odd}(U) \times V(U) \\ & \simeq (V_{odd} \times V)(U) \end{aligned}

Here in the third line we used that the Grassmann algebra V *\wedge^\bullet V^\ast is free on its generators in V *V^\ast, meaning that a homomorphism of supercommutative superalgebras out of the Grassmann algebra is uniquely fixed by the underlying degree-preserving linear function on these generators. Since in a Grassmann algebra all the generators are in odd degree, this is equivalently a linear map from V *V^\ast to the odd-graded real vector space underlying C (U)[θ](θ 2)C^\infty(U)[\theta](\theta^2), which is the direct sum C (U) oddC (U) evenθC^\infty(U)_{odd} \oplus C^\infty(U)_{even}\langle \theta \rangle.

Then in the fourth line we used that finite direct sums are Cartesian products, so that linear maps into a direct sum are pairs of linear maps into the direct summands.

That all these bijections are natural means that they are compatible with morphisms UUU \to U' and therefore this says that [ 0|1,V odd][\mathbb{R}^{0\vert 1}, V_{odd}] and V odd×VV_{odd} \times V are the same as seen by super-smooth plots, hence that they are isomorphic as super smooth sets.

With this supergeometry in hand we finally turn to defining the Dirac field species:

Example

(field bundle for Dirac field)

For Σ\Sigma being Minkowski spacetime (def. ), of dimension 2+12+1, 3+1