nLab
locale of real numbers

The locale of real numbers

Summary

This page gives an elementary description of the locale of real numbers, that is the localic real line. The development is manifestly constructive and even predicative over the natural numbers (although we are somewhat careless with the language and do not always point out when a set may predicatively be a proper class). Ideally, we will show that our construction satisfies the seven ‘headline properties’ of the real line described by Bauer & Taylor (although so far we cover only the Heine–Borel theorem).

The exposition here is pretty much my own work, although of course the basic ideas are well known to many. In particular, the technical zigzag lemma is mine (but it is not a very deep result!).

Idea

To describe the locale of the real numbers, we need first of all to describe what an open in the real line is. The key insight is that an open set is determined by the open intervals of rational numbers that it contains. This is analogous to the key insight of Richard Dedekind's definition of real number: that a point in the real line is determined by which open intervals of rational numbers that it belongs to. (Once you're talking about rational numbers, things are manageable.)

Opens

An open in the real line is a binary relation on the rational numbers that satisfies the four properties listed below. Intuitively, we have ab iff the open interval (a,b) is contained in the corresponding open set.

  1. If ab, then ab.
  2. If abcd, then ad.
  3. If ab>cd, then ad.
  4. If bc whenever a<b and c<d, then ad.

Property (1) is motivated because (a,b) is empty whenever ab. Property (2) is motivated because (a,d)(b,c) whenever ab and cd. Property (4) is somewhat technical; it keeps {a,ba<b0<a<b<1} from being an open, while {a,ba<b0a<b1} is allowed.

The really interesting property is property (3). It in fact generalises as follows:

  • If

    (1)a 1b 1>a 2b 2>>a nb n,a_1 \sim b_1 \gt a_2 \sim b_2 \gt \cdots \gt a_n \sim b_n ,

    then a 1b n.

We call the combined hypothesis of this property a zigzag; each hypothesis a ib i is a zig, and each hypothesis b i>a i + is a zag. To indicate the length of a zigzag, we will count the zigs; the zigzag (1) has n zigs (and n1 zags). A typical nondegenerate zigzag with 3 zigs is shown below; it consists of 3 overlapping open intervals, each of which belongs to a given open set; we are motivated to conclude that the entire interval from a 1 to b 3 belongs to that open set.

Layer 1 a 1 a 2 b 1 a 3 b 2 b 3 a_1\qquad a_2\longleftarrow\; b_1\qquad a_3\longleftarrow\; b_2\qquad b_3 ~ ~ ~ ~ ~ ~

Thus the zigzag property for n=2 is property (3), the zigzag property for n=1 is trivial (if ab, then ab), and the zigzag property for n>2 may be proved by induction. (There is also a sense in which the zigzag property for n=0 is property (1), but I haven't quite got my head around that. It would be awesome if the zigzag property for n= could be interepreted as property (4), but that seems unlikely so far.)

If G is an open in the real line, then we write (a,b)G to mean that a is related to b through G and say that G contains the interval from a to b.

The zigzag lemma

We will often have occasion to consider a zigzag as in (1), but where the various zigs are taken relative to different opens. Nevertheless, we will usually want to say something about the relationship of a 1 to b n, and it will often be convenient to assume, for all i, that a 1a i and b ib n. We may do so without loss of generality as follows: consider the smallest value of i such that b ib n, truncate the zigzag so that it now has only i zigs, and change the new endpoint b i to the old endpoint b n. The final zig (a i,b i)G i becomes (a i,b n)G i, which will hold by property (2) since b ib n. Now consider the largest value of ji such that a 1a j, and truncate similarly on the other side. We now have a zigzag with ij+1 zigs that never falls below a 1 on a zag nor rises above b n on a zig.

To be precise:

Zigzag lemma

Given a collection 𝒞 of opens and a zigzag in which each zig holds relative to some open in 𝒞, there exists a zigzag (1) in which each zig holds relative to some open in 𝒞, a 1a i for each i, and b ib n for each i.

(Since each b i>a i +, it follows further that a i<b n for each i>1 and a 1<b i for each i<n, although this corollary doesn't seem to be particularly useful.)

At the moment, this lemma is used only in the proof of the infinite distributivity law.

The frame of opens

The opens form a sub-poset of the power set 𝒫(×). This poset is in fact a frame, as we will now show. (It is not a subframe of the power set, since the joins are different. It is a sub-inflattice of the power set, although this seems to be a red herring at least for infinitary meets, since those are not part of the frame structure that we need.)

The top open, denoted , is the binary relation which is always true. Given two opens G and H, their meet in the poset of opens, denoted GH, is simply their conjunction, that is their intersection as subsets of ×. (In fact, given any collection of opens, their meet is their conjunction.) It is straightforward to check that and GH are opens and to prove that these are the desired meets. Intuitively, this all works because an open interval will be contained in the intersection of a family of open sets if and only if it is contained in each individual open set.

The bottom open, denoted , is the binary relation . That is, (a,b) iff ab. It is easy to check that this is an open; it precedes every open by property (1). Intuitively, this corresponds to the empty subset of the real line; (a,b) is empty if and only if ab. However, note that is not the empty subset of ×; the notation follows our topological intuition rather than the algebra of relations.

Given two opens G and H, their join in the poset of opens, denoted GH, is defined as follows: (a,b)GH if and only if there exists a zigzag (1) with a=a 1 and b=b n in which each zig is of the form (a i,b i)G or (a i,b i)H. It is immediate that this is an open in which G and H are both contained. Conversely, any open in which G and H are contained must contain this open GH, by property (3).

More generally, given any family (G k) k of opens, their join in the poset of opens, denoted kG k, is defined as follows: (a,b) kG k if and only if ab or there exists a zigzag (1) with a=a 1 and b=b n in which each zig is of the form (a i,b i)G k for some k. (We can leave out the ab clause if the family is inhabited.) The same argument applies as before. Note that each individual zigzag has finitely many zigs, and therefore involves finitely many of the opens G k, even when taking the join of an infinite family.

Finally, we must check the distributive law G kH k k(GH k). That is, if ab directly through G and ab through a zigzag of Hs (or ab), then we need that ab through a zigzag in which each zig is related both through G and through some H (or ab). To prove this (ignoring the trivial case where ab), start with the zigzag of Hs, and apply the zigzag lemma to get a zigzag of Hs in which each zig involves values bounded by a and b. Then these zigs hold for G as well, by property (2). Therefore, we may interpret each zig using GH k for some k, proving the desired result.

This frame of opens, interpreted as a locale, is the locale of real numbers. As usual, we denote this locale with the same symbol as the top element of its frame, in this case . (Of course, the true etymology of the symbols runs in the other order.)

Open intervals as opens

Given rational numbers a and b, the open interval (a,b) may itself be interpreted as an open in the real line, also denoted (a,b), as follows: let (c,d)(a,b) hold if every rational number strictly between c and d (in that order) is also strictly between a and b (in that order). In other words, we interpret ‘’ literally as comparing subsets of . It is straightforward to check that this condition does indeed define an open. There is now a third way to interpret ‘(c,d)(a,b)’; interpreting both intervals as opens in the real line, this states that the first is contained in the second. But again, it is easy to check that this is equivalent; (c,d)(a,b) (in the set-theoretic sense) if and only if (e,f)(a,b) whenever (e,f)(c,d). Notice that (a,b)= whenever ab.

We can actually generalise this somewhat. Given any set L of rational numbers and any set U of rational numbers, we may define the open (infU,supL) as follows: let (a,b)(infU,supL) hold if every rational number strictly between a and b (in that order) is greater than some element of U and less than some element of L. If the infimum of U and the supremum of L exist in the usual sense as rational numbers, then this agrees with the previous paragraph. If instead U or L is the set of all rational numbers, then we write for infU or for supL. In general, we may interpret infU as an extended upper real and supL as an extended lower real. Classically, every extended upper or lower real is either a real number, , or ; only the converse holds constructively. Notice that (,)=.

Since we will refer to them below, we will state for the record the complete definitions of (,a) and (a,) for a rational number a. We have (b,c)(,a) iff every rational number strictly between b and c is less than a, that is iff bc or ca. Similarly, we have (b,c)(a,) iff every rational number strictly between b and c is greater than a, that is iff bc or ba. A fortiori, (b,c)(,0) if c=0, and (b,c)(1,) if b=1.

Open sets, closed sets, and points

We think of each open as defining an open subset of the real line, but we can equally well think of it as defining a closed subset. The difference between these perspectives is reflected in complementary criteria for when a real number belongs to the set. So to make sense of this, we must identify the points of the real line.

Recall that a real number may be defined as a pair (L,U) of inhabited subsets of satisfying the following properties:

  1. If aL, then a<b for some bL.
  2. If bU, then a<b for some aU.
  3. If aL and bU, then a<b.
  4. If a<b, then aL or bU.

We define a point of the real line to be a real number in this sense. Given such a point x=(L,U) and a rational number a, we write a<x to mean that aL and a>x to mean that aU. If we wish to refer to L and U directly, we may call L the lower set of x and U the upper set.

Given a point x and an open G, we say that x belongs to G, written xG, if (a,b)G for some a<x and b>x; that is, G contains an interval from some element of the lower set to some element of the upper set of x. We have x since its lower and upper sets are inhabited. If xG and xH, with (a,b)G and (c,d)H, then (max(a,c),min(b,d))GH, so xGH; note that this argument fails for infinitary intersections. (The converse, that xG and xH if xGH, is immediate.) Dually, suppose that x kG k, as shown by some zigzag (since ab is impossible when a<x and b>x, by 3). Applying condition (4) of the definition of real number to each zag, we have a i +<x or b i>x, for each i<n. Checking all 2 n1 possibilities, and knowing that a 1<x and b n>x in any case, we must have a i<x and b i>x for some i. Then we have xG k for some k, whichever corresponds to the ith zig. (The converse, that x kG k if xG k for some k, is immediate.)

Therefore, each point defines a completely prime filter on the frame of all opens, which is the definition of a point in general locale theory. Conversely, given such a completely prime filter P, let L be the set of all rational numbers a such that the open interval (a,) (when interpreted as an open in the real line as defined above) is in P, and symmetrically let U be the set of all a such that (,a)P.

Given a point x and an open G, we say that x co-belongs to G, written xG, if we never have a<x, b>x, and (a,b)G, which is precisely the negation of the property that xG. We think of this condition as defining a closed set to which x does belong. Notice that x kG k if and only if xG k for each k, giving the desired behaviour for an arbitrary intersection of closed sets (which corresponds to union of open sets under de Morgan duality). We also have that x always fails, and xGH if xG or xH. To prove that xG or xH whenever xGH, however, we must use excluded middle; constructively, closed sets don't behave well under union.

A related question is whether we can reconstruct G from the set of points which belong to it. This should be equivalent to the fan theorem, which is classically true and also accepted by Brouwer's school of intuitionism, but refuted in the Russian school in which all real numbers are assumed to be computable?. (I should check this.) Arguably, the real lesson of these logical technicalities is that we should remember that opens, not points, are the fundamental concept in a locale.

The Heine–Borel theorem

The classical Heine–Borel theorem, as a statement about sets of real numbers, may fail constructively; this is related to the comments above about the fan theorem. But the beauty of the localic approach is that Heine–Borel necessarily holds when interpreted as a statement about opens in the locale of real numbers. To state the theorem, we must define what it means for a collection of opens to cover the unit interval. We will give an ad-hoc definition, but this may also be derived from the general theory of closed sublocales which allows us to interpret the unit interval as a compact locale in its own right.

Definition

An open cover of the unit interval is a collection 𝒞 of opens in the real line such that is the join of (,0), (1,), and the elements of 𝒞.

Theorem

(Heine–Borel)

Every open cover of the unit interval has a finite subcover.

Proof

The proof is almost embarrassingly simple. The key point is that the construction of joins in terms of zigzags involves only finite zigzags, even for an infinitary join.

Let J be the join of (,0), (1,), and the elements of 𝒞. If this equals , then in particular (1,2)J, a fact which is given by some zigzag ζ. This zigzag involves only finitely many of the elements of 𝒞; let 𝒟 be the collection of these, and let K be the join of (,0), (1,), and 𝒟. Now if (a,b) is any pair of rational numbers, we construct a zigzag showing that (a,b)K as follows: the zig (a,0)(,0), the zag 0>1, the zigzag ζ from 1 to 2, the zag 2>1, and the zig (1,b)(1,). This is always a valid zigzag, so K=. Therefore, the finite collection 𝒟 covers the unit interval.

This proof generalises immediately to any closed interval [a,b], for a any upper real and b any lower real. But note that we do not say ‘extended’ here; we need to find some rational number (analogous to 1 in the proof above) smaller than a and some rational number (analogous to 2 above) larger than b. So the Heine–Borel theorem applies only to bounded closed intervals.

Revised on August 12, 2011 21:20:01 by Toby Bartels (64.89.53.88)