Contents lists available atScienceDirect
Theoretical Computer Science
journal homepage:www.elsevier.com/locate/tcs
Affine connections, midpoint formation, and point reflection
✩Anders Kock
∗Department of Mathematics, University of Aarhus, DK 8000 Aarhus C, Denmark
a r t i c l e i n f o
Keywords:
Affine connection Geodesic
Synthetic differential geometry
a b s t r a c t
We describe some differential-geometric structures in combinatorial terms: namely affine connections and their torsion and curvature, and we show that torsion free affine connec- tions may equivalently be presented in terms of some simpler combinatorial structure:
midpoint formation, and point reflection (geodesic symmetry). The method employed is that of synthetic differential geometry, which is briefly explained.
©2011 Elsevier B.V. All rights reserved.
Preface
It is a striking fact that differential calculus exists not only in analysis (based on the real numbersRand limits therein), but also in algebraic geometry, where no limit processes are available. In algebraic geometry, one rather uses the idea of nilpotent elementsin the ‘‘affine line’’R; they act as infinitesimals. (Recall that an elementxin a ringRis callednilpotentif xk
=
0 for suitable non-negative integerk.)Synthetic differential geometry (SDG) is an axiomatic theory, based on such nilpotent infinitesimals. It can be proved, via topos theory, that the axiomatics covers both the differential-geometric notions of algebraic geometry and those of calculus.
I shall illustrate this synthetic method, by presenting its application to three particular types of differential-geometric structures, namely that ofaffine connection,midpoint formation, andpoint reflection(geodesic symmetry).
I shall not go much into the foundations of SDG, whose core is the so-called KL1axiom scheme. This is a very strong kind of axiomatics; in fact, a salient feature of it is:it is inconsistent— if you allow yourself the luxury of reasoning with so-called classical logic, i.e. use the ‘‘law of excluded middle’’, ‘‘proof by contradiction’’, etc. Rather, in SDG, one uses a weaker kind of logic, often called ‘‘constructive’’ or ‘‘intuitionist’’. Note the evident logical fact that there is a trade-off: with aweaker logic,strongeraxiom systems become consistent. For the SDG axiomatics, it follows for instance that any function from the affine line to itself is infinitely often differentiable (smooth); a very useful simplifying feature in differential geometry — but incompatible with the law of excluded middle, which allows you to construct the non-smooth function
f
(
x) =
1 ifx
=
0 0 if not.
1. Nilpotents, and neighboursThe axiomatics mentioned in the introduction deals with a commutative ringR, which we think of as theaffine lineor number line. Out of such, one constructs other objects that deserve names with geometric connotation, like the ‘‘unit sphere
✩Expanded version of Kock (2009) [8]. The notion of midpoint formation considered in [8] has been generalized, so that a notion of point reflection can be considered, together with their interdependence of these concepts, cf. Theorem 4.2 and Figure (4.1). Also, some proofs have been supplied, using Christoffel symbols for affine connections.
∗Tel.: +45 30353337.
E-mail address:kock@imf.au.dk.
1 for ‘‘Kock-Lawvere’’.
0304-3975/$ – see front matter©2011 Elsevier B.V. All rights reserved.
doi:10.1016/j.tcs.2010.12.037
(relative toR)’’, meaningS2
:= { (
x,
y,
z) ∈
R3|
x2+
y2+
z2=
1}
, the ‘‘coordinate planeR2(relative toR), and also some objects that are trivial in caseRhappens to be a field, like the setsDk:= {
x∈
R|
xk+1=
0}
(likewise relative toR; in the following,Ris fixed); the elements in the setDkare the ‘‘kth orderinfinitesimals’’.2These nilpotent infinitesimals come in a precise hierarchy, sincexk
=
0 implies xk+1=
0,
or equivalentlyDk−1
⊆
Dk⊆ · · ·
(note that this is opposite to the hierarchy of functions given by Landau’s ‘‘little-oh’’ notion, where ‘‘o(
xk+1)
implieso(
xk)
). The setD1of first order infinitesimals (i.e. the set ofx∈
Rwithx2=
0) is also denotedD.The basic instance of the KL axiom scheme says that any mapDk
→
Rextends uniquely to a polynomial mapR→
Rof degree≤
k. Thus, given any mapf:
R→
R, the restriction off toDkextends uniquely to a polynomial map of degree≤
k, thekthTaylor polynomialoffat 0. (Clearly, ifRis a field, this fails, since thenDk= {
0}
. SoR=
R, the ‘‘real’’ numbers, is not a model of the axiomatics.)ForxandyinR, we say thatxandyarekth orderneighboursifx
−
y∈
Dk, and we writex∼
ky. It is clear that∼
kis a reflexive and symmetric relation. It is not transitive. For instance, ifx∈
Dandy∈
D, thenx+
y∈
D2, by binomial expansion of(
x+
y)
3; but we cannot concludex+
y∈
D. Sox∼
1yandy∼
1zimplyx∼
2z, and similarly for higherk.We now turn to the (first order) neighbour relations in the coordinate planeR2. It is, in analogy with the 1-dimensional case, defined in terms of a subsetD
(
2) ⊆
R2; we putD
(
2) = { (
x1,
x2) ∈
R×
R|
x21=
0,
x22=
0,
x1·
x2=
0} .
SoD
(
2) ⊆
D×
D. We define the ‘‘first order neighbour relation’’∼
(or∼
1) onR2by puttingx∼
yifx−
y∈
D(
2)
, wherex= (
x1,
x2)
andy= (
y1,
y2)
. Similarly forD(
n) ⊆
Rn, and the resulting first order neighbour relation on the higher‘‘coordinate vector spaces’’Rn.
Ifx
∈
D(
n)
, we haveB(
x,
x) =
0 for any bilinearB:
Rn×
Rn→
Rm, and ifBis furthermore symmetric, we therefore also have the usefulB
(
x+
y,
x+
y) =
2B(
x,
y)
(1.1)forxandyinD
(
n)
.There is also akth order neighbour relation
∼
konRn, defined in a completely analogous manner from the set Dk(
n) := { (
x1, . . . ,
xn) ∈
Rn|
the product of anyk+
1 of thexis is 0} ,
namelyx
∼
kyifx−
y∈
Dk(
n)
.A higher dimensional version of the KL axiom scheme says that any mapD
(
n) →
Rextends uniquely to an affine map.More generally, any mapDk
(
n) →
Rextends uniquely to a polynomial mapRn→
Rof degree≤
k. The codomainRhere may be replaced byRm. So, if a mapD(
n) →
Rmtakes 0 to 0, it extends uniquely to a linear mapRn→
Rm. It is then easy to prove that if a mapΓ:
D(
n) ×
D(
n) →
Rm‘‘vanishes on the two axes’’, i.e. ifΓ(
d,
0) =
0=
Γ(
0,
d)
for alld∈
D(
n)
, then Γextends uniquely to a bilinear mapRn×
Rn→
Rm; and this bilinear map is symmetric iffΓ itself is so.The following (cf. [9], Proposition 1.5.1) is another consequence of the KL axiom scheme:
Theorem 1.1. Any map f
:
Rn→
Rmpreserves the kth order neighbour relation, x∼
ky implies f(
x) ∼
kf(
y).
Proof sketch. Forn
=
2,m=
1, for the first order neighbour relation∼
1. It suffices to see thatx∼
10 impliesf(
x) ∼
1f(
0)
, i.e. to prove thatx∈
D(
2)
impliesf(
x) −
f(
0) ∈
D. Now from a suitable version of the KL axiom scheme it follows that on D(
2)
,f agrees with a unique affine functionT1f:
R2→
R, so forx= (
x1,
x2) ∈
D(
2)
,f
(
x) −
f(
0) =
a1x1+
a2x2.
Squaring the right hand side here yields 0, since not onlyx1
∈
Dandx2∈
D, but alsox1·
x2=
0. Sof(
x) −
f(
0) ∈
D.From the Theorem it follows that the relation
∼
konRn iscoordinate free, i.e. is a truly geometric notion: any re- coordinatization ofRn(byanymap, not just by a linear or affine one) preserves the relation∼
k.Let us call a subsetUofRnopenif for anyk, one has thatx
∈
Uandy∼
kximplyy∈
U. (There are other, stronger, notions of ‘open’ compatible with SDG, cf. [9].) Using such an openness notion, one can define a notion ofn-dimensional manifold (relative toR), namely something that locally can be coordinatized with open subsets ofRn. From the invariance of∼
kunder re-coordinatization, one concludes that on any manifold, there are canonical reflexive symmetric relations∼
k: they may2 They are not to be compared to the infinitesimals of non-standard analysis.
be defined in terms of a local coordinatization, but, by the Theorem, are independent of the coordinatization chosen. The sphereS2
⊆
R3is an example of a 2-dimensional manifold3Any map between manifolds preserves the relations
∼
k.We shall mainly be interested in thefirstorder neighbour relation
∼
1(also written as just∼
). In Sections3and4, we study aspects of the second order neighbour relation∼
2.So for a manifoldM, we have a subsetM(1)
⊆
M×
M, the ‘‘first neighbourhood of the diagonal’’, consisting of(
x,
y)
∈
M×
Mwithx∼
y. It was in terms of this ‘‘scheme’’M(1)that algebraic geometers in the 1950s gave nilpotent infinitesimals a rigorous role in geometry. Note that forM=
Rn, we haveM(1)∼ =
M×
D(
n)
, by the map(
x,
y) → (
x,
x−
y)
.Let us consider some notions from ‘‘infinitesimal geometry’’ which can be expressed in terms of the first order neighbour relation
∼
on an arbitrary manifoldM. Given three pointsx,
y,
zinM. Ifx∼
yandx∼
zwe call the triple(
x,
y,
z)
a2- whisker at x(sometimes: aninfinitesimal2-whisker, for emphasis); since∼
is not transitive, we cannot in general conclude thaty∼
z; ifyhappens to be∼
z, we call the triple(
x,
y,
z)
a2-simplex(sometimes aninfinitesimal2-simplex). Similarly fork-whiskers andk-simplices. Ak-simplex is thus ak+
1-tuple of mutual neighbour points. Thek-simplices form, ask ranges, a simplicial complex, which in fact contains the information of differential forms, and the de Rham complex ofM, see [2,6,1,9].(When we say that
(
x0,
x1, . . . ,
xk)
is ak-whisker, we mean to say that it is ak-whiskerat x0, i.e. thatx0∼
xi for all i=
1, . . . ,
k. On the other hand, in a simplex, none of the points have a special status.)Given ak-whisker
(
x0, . . . ,
xk)
inM. IfUis an open subset ofMcontainingx0, it will also contain the otherxis, and ifU is coordinatized byRn, we may use coordinates to define the affine combinationk
−
i=0
ti
·
xi,
(1.2)(where∑
ti
=
1; recall that this is the condition that a linear combination of vectorsxi deserves the name ofaffine combination). The affine combination (1.2) can again be proved to belong toU, and thus it defines a point inM. The point thus obtained has in generalnota good geometric significance, since it will depend on the coordinatization chosen. However (cf. [5,9] 2.1), it does have geometric significance, if the whisker is a simplex:Theorem 1.2. Let
(
x0, . . . ,
xk)
be a k-simplex in M. Then the affine combination(1.2)is independent of the coordinatization used to define it. All the points that arise in this way are mutual neigbours. And any map to another manifold M′preserves such combinations.Proof sketch. This is much in the spirit of the proof ofTheorem 1.1: it suffices to see that any mapRn
→
Rm(not just a linear or affine one) preserves affine combinations of mutual neighbour points. This follows by considering a suitable first Taylor polynomial off (expand fromx0), and using the following purely algebraic fact: Ifx1, . . . ,
xkare inD(
n)
, then any linear combination of them will again be inD(
n)
providedthexis aremutualneighbours.Examples. Ifx
∼
yin a manifold (so they form a 1-simplex), we have the affine combinations ‘‘midpoint ofxandy’’, and‘‘reflection ofyinx’’, 1
2x
+
12y and 2x
−
y,
respectively. Ifx
,
y,
zform a 2-simplex, we may form the affine combinationu:=
y−
x+
z;
geometrically, it means completing the simplex into a parallelogram by adjoining the pointu. Here is the relevant picture:✘✘✘✘✘✘
☎
☎
☎☎
☎
☎
☎☎
✘✘✘✘✘✘
x
y z
q q
q
qu
=
y−
x+
z(1.3)
Theuthus constructed will be a neighbour of each of the three given points.
Remark. Ifx
,
y,
zanduare as above, and ifx,
y, andzbelong to a subsetS⊆
Mgiven as the zero set of a functionf:
M→
R, then so doesu=
y−
x+
z; forf preserves this affine combination.3 One of the referees asked ‘‘what does it mean that two points are neighbours on the real sphere ?’’. If ‘‘real sphere’’ meansS2(R)(whereRdenotes the field of ‘‘real’’ numbers, as constructed by Cauchy or Dedekind) then the answer is: points are only neighbours if they are equal. ForRis a field. It does not satisfy the KL axiomatics, and the theory presented here becomes void. On the other hand, if ‘‘real sphere’’ means a sphere in the real world, like the surface of the earth, there is no definite answer to the question; — nor is it there in general definite answers to similar questions in other situations where one applies a mathematical theory to something real; the mathematical theory is a simplification, and answers to such questions may depend on the purpose.
What is for instance the answer to the question whether two given streets in a real city like Montreal areparallel? Some further remarks on the issue of reality and ‘‘real’’ numbers are found in [7].
2. Affine connections
Ifx
,
y,
zform a 2-whisker atxin a manifold (sox∼
yandx∼
z), we cannot canonically form a parallelogram as in (1.3);rather, parallelogram formation is an addedstructure:
Definition 2.1. Anaffine connectionon a manifoldMis a law
λ
which to any 2-whiskerx,
y,
zinM associates a point u= λ(
x,
y,
z) ∈
M, subject to the conditionsλ(
x,
x,
z) =
z, λ(
x,
y,
x) =
y.
(2.1)(Cf. [3].) It can be verified (cf. [9] 2.3) that several other laws follow; in a more abstract combinatorial situation than manifolds, these laws should probably be postulated. Some of the laws are: for any 2-whisker
(
x,
y,
z)
λ(
x,
y,
z) ∼
y andλ(
x,
y,
z) ∼
z (2.2)λ(
y,
x, λ(
x,
y,
z)) =
z.
(2.3)One will not in general have or require the ‘‘symmetry’’ condition
λ(
x,
y,
z) = λ(
x,
z,
y) ;
(2.4)nor do we in general have, for 2-simplicesx
,
y,
z, thatλ(
x,
y,
z) =
y−
x+
z.
(2.5)The laws (2.4) and (2.5) are in fact equivalent, and affine connections satisfying either are calledsymmetricortorsion free.
We return to the torsion of an affine connection below.
Ifx
,
y,
z,
uare four points inMsuch that(
x,
y,
z)
is a 2-whisker atx, the statementu= λ(
x,
y,
z)
can be rendered by a diagram✘✘✘✘✘✘
☎
☎
☎☎
☎
☎
☎☎
☎
☎
☎☎
☎
☎
☎☎
✘✘✘✘✘✘
x
y z
q q
qu
= λ(
x,
y,
z)
q (2.6)
The figure4is meant to indicate that the data of
λ
provides a way of closing a whisker(
x,
y,
z)
into aparallelogram(one may say thatλ
provides a notion ofinfinitesimal parallelogram); but note thatλ
is not required to be symmetric inyandz, which is why we in the figure use different signatures for the line segments connectingxtoyand toz, respectively.Here, a line segment (whether single or double) indicates that the points connected by the line segment are neighbours.
Ifx
,
y,
z,
uare four points inMthat come about in the way described, we say that the 4-tuple forms aλ
-parallelogram.The fact that we in the picture did not make the four line segmentsorientedcontains some symmetry assertions, which can be proved by working in a coordinatized situation; namely that the 4-groupZ2
×
Z2acts on the set ofλ
-parallelograms; so for instance(
u,
z,
y,
x)
is aλ
-parallelogram, equivalentlyλ(λ(
x,
y,
z),
z,
y) =
x.
On the other handλ(λ(
x,
y,
z),
y,
z) ∼
x,
(2.7)but it will not in general be equal tox; its discrepancy from beingxis an expression of thetorsionof
λ
. Even wheny∼
z (sox,
y,
zform a simplex), the left hand side of (2.7) need not bex. Rather, we may define thetorsionofλ
to be the lawb which to any 2-simplexx,
y,
zassociatesλ(λ(
x,
y,
z),
y,
z)
. Thenb(
x,
y,
z) =
xfor all simplices iffλ
is symmetric, cf. [9]Proposition 2.3.1.
There is also a notion ofcurvatureof
λ
: LetMbe a manifold equipped with an affine connectionλ
. Note thatλ(
x,
y, − )
takes any neighbourv
ofxinto a neighbour ofy(‘‘λ
-paralleltransport ofv
fromxtoy’’). If nowx,
y,
zform an infinitesimal 2-simplex andv
is a neighbour ofx, we may successively make three transports ofv
, fromxtoy, then fromytoz, and finally fromzback tox; we will arrive at a neighbourv
′ofxagain, butv
′is not necessarily thev
we started with. Thus the 2-simplex x,
y,
zgives rise to a permutationv → v
′of the set of neighbour pointsv
ofx, and the connection isflat(curvature free) if all permutations arising this way are the identity permutation. More generally, the curvaturerofλ
is defined as the law, which to an infinitesimal 2-simplexx,
y,
zprovides the permutation of the neighbours ofxjust described. (In the terminology of [9],ris a group-bundle valued combinatorial 2-form.)We give two examples of affine connections on the unit sphereS.
Example 1. The unit sphereS
=
S2(
R)
sits inside Euclidean 3-space,S⊆
R3. SinceR3is in particular an affine space, we may for any three pointsx,
y,
zin it formy−
x+
z∈
R3. Forx,
y,
zinS, the pointy−
x+
zwill in general be outsideS;4 Note the difference between this figure and the figure (1.3), in whichyandzare assumed to be neighbours, and where the parallelogramcanonically may be formed.
ifx
,
y,
zare mutual neighbours, however,y−
x+
zwill be inS, cf. Remark at the end of Section1. What ifx,
y,
zform an infinitesimal 2-whisker? Then we cannot expecty−
x+
zto be inS; we rather defineλ(
x,
y,
z) ∈
Sto be the intersection point of the half-linelwith the sphere, wherelis the half-line from origo toy−
x+
z. IfSis the surface of the earth, this just means thatλ(
x,
y,
z)
is the point onSwhich is vertically (in the physical sense given by gravitation) belowy−
x+
z.This affine connection
λ
is evidently symmetric inyandz, so is torsion free; it does, however, have curvature, which one can see from the ‘‘integrated version’’ (holonomy) of the connection, i.e. the parallel transport (according toλ
) along curves on the sphere: for instance, transporting along a spherical triangle, whose sides each are 90◦, will provide a permutation of the neigbour points of any vertex, namely a rotation by 90◦(make a picture!). This connection is the Riemann- or Levi-Civita connection on a sphere.Example 2. (This example does not work on the whole sphere, only away from the two poles.) Givenx
,
yandzwithx∼
z (x∼
yis presently not relevant). Sincexandzare quite close, we can uniquely describezin a rectangular two-dimensional coordinate system atxwith coordinate axes pointing East and North. Now takeλ(
x,
y,
z)
to be that point neary, which in the East–North coordinate system atyhas the same coordinates as the ones obtained forzin the coordinate system that we considered atx.The description of this affine connection is asymmetric inyandz, and it is indeed easy to calculate that it has torsion ([9], Section 2.4). It has no curvature.
Connections constructed in a similar way also occur in materials science: for a crystalline substance, one may attach a coordinate system at each point, by using the crystalline structure to define directions (call them ‘‘East’’ and ‘‘North’’ and
‘‘Up’’, say). The torsion for a connection
λ
constructed from such coordinate systems is a measure for the imperfection of the crystal lattice (dislocations) — see [11,4] and the references therein.For calculations, and even for communication, one usually needs coordinates. Coordinate expressions for an affine connection are the ‘‘Christoffel symbols’’. Let
λ
be an affine connection on ann-dimensional manifoldM; assume that we have identified some open subset ofMwith some open subset ofRn. Ifx∼
yin this subset,y=
x+
dfor somed∈
D(
n)
, by definition of the neighbour relation. So a whisker(
x,
y,
z)
atxmay be written(
x,
x+
d1,
x+
d2)
with(
d1,
d2) ∈
D(
n) ×
D(
n)
. Define, for fixedx, the functionΓ:
D(
n) ×
D(
n) →
Rnby the ruleΓ
(
d1,
d2) = λ(
x,
x+
d1,
x+
d2) −
x
+
d1+
d2.
(To record the dependence ofΓ on the pointx, we may writeΓ
(
x;
d1,
d2)
.) Thus,Γ measures the discrepancy betweenλ
and the canonical affine connectionλ
0in the affine spaceRn. From the lawλ(
x,
x,
z) =
zfollowsΓ(
0,
d2) =
0, and similarlyλ(
x,
y,
x) =
ygivesΓ(
d1,
0) =
0. From the discussion prior toTheorem 1.1it follows thatΓ extends uniquely to a bilinear mapΓ:
Rn×
Rn→
Rn, which is theChristoffel symbolforλ
at the pointx— relative to the coordinatization assumed around x. It is clear thatλ
is asymmetric(=
torsion free) affine connection iff the Christoffel symbolsΓ(
x; − , − )
are symmetric bilinear maps, for allx∈
M.We can rephrase the relation between
λ
and the Christoffel symbolsΓ (in a coordinatized situation) byλ(
x,
x+
d1,
x+
d2) =
x+
d1+
d2+
Γ(
x;
d1,
d2)
(2.8)whered1andd2are inD
(
n)
.3. Second order notions; midpoint formation
We describe a geometric notion ofmidpoint formation structurewhich can be used to construct torsion free affine connections.
LetM(2)
⊆
M×
Mdenote the set of pairs(
x,
u)
of second order neighbours;M(2)is the ‘‘second neighbourhood of the diagonal’’, in analogy with the first neighbourhoodM(1)described in Section1. We haveM(1)⊆
M(2).Recall that forx
∼
1yinM, we have canonically the affine combination12x+
12y, the midpoint formation for first order neighbours; it defines a mapM(1)
→
M.Definition 3.1. Amidpoint formation structureonMis a map
µ :
M(2)→
Mwhich extends the midpoint formationM(1)→
Mfor pairs of first order neighbour points.Thus,
µ(
x,
u)
is defined wheneverx∼
2 u; andµ(
x,
u) =
12x
+
12uwheneverx
∼
1 u. It can be proved that suchµ
is automatically symmetric,µ(
x,
u) = µ(
u,
x)
. (The proof follows the same lines as the symmetry ofλ
in the proof of Theorem 4.2below.) It can also be proved thatµ(
x,
u) ∼
2xand∼
2u. Beware that ‘‘µ(
x,
u)
is midpoint ofxandu’’ (where x∼
2u) does not imply neitherx∼
1µ(
x,
u)
noru∼
1µ(
x,
u)
; in fact, neither of these will hold unlessxis already∼
1u.Theorem 3.2. Any midpoint formation structure
µ
on M gives rise canonically to a symmetric affine connectionλ
on M.Proof. Given
µ
, and given an infinitesimal 2-whisker(
x,
y,
z)
. Sincex∼
1 y, we may form the affine combination 2y−
x (reflection ofxiny), and it is still a first order neigbour ofx. Similarly for 2z−
x. So(
2y−
x) ∼
2(
2z−
x)
, and so we may formµ(
2y−
x,
2z−
x)
, and we defineλ(
x,
y,
z) := µ(
2y−
x,
2z−
x).
The relevant picture is here:
✘✘✘✘✘✘✘✘✘✘✘✘
☎
☎
☎
☎
☎
☎
☎☎
☎
☎
☎☎
✘✘✘✘✘✘
x
y z
2y
−
x 2z−
x.
λ(
x,
y,
z)
❛❛
❛❛
❛❛
❛❛
❛❛❛
.
. .
.
It is symmetric inyandz, by the symmetry of
µ
. Also, ify=
x, we getλ(
x,
x,
z) = µ(
x,
2z−
x) =
12x
+
12
(
2z−
x) =
z,
sincex
∼
1 2z−
xand sinceµ
extends the canonical midpoint formation for first order neighbours. This proves the first equation in (2.1), and the second one then follows by symmetry.Theorem 3.3. Any symmetric affine connection
λ
on M gives rise canonically to a midpoint formationµ
.This is less evident. First it requires an ‘‘interpolation’’ axiom, consistent with the KL axiomatics, namely that any
δ ∈
D2(
n)
may be writtend1+
d2for suitabled1andd2inD(
n)
. This implies that for any manifoldM, and anyx∼
2 uinM, we may interpolate ay, in the sense thatx∼
1y∼
1u. IfMis equipped with an affine connectionλ
, we may therefore also forx∼
2u find aλ
-parallelogram(
x,
y,
z,
u)
(takez= λ(
y,
x,
u)
).Now given a
λ
-parallelogram(
x,
y,
z,
u)
, and given a scalart∈
R. Sincex∼
y, we may form the affine combination yt:= (
1−
t)
x+
ty, and similarly, givens∈
R, we may formzs:= (
1−
s)
x+
sz; both these points are∼
x, and so we may form the pointλ(
x, (
1−
t)
x+
ty, (
1−
s)
x+
sz)
. See the picture:✘✘✘✘✘✘
☎
☎
☎☎
☎
☎
☎☎
✘✘✘✘✘✘
x
y z
q q
q
qu q q
✘✘✘✘
☎
☎☎
yt
zs (3.1)
The picture suggests that fors
=
t, we could define(
1−
t)
x+
tuasλ(
x,
yt,
zt)
(this will certainly be correct in an affine space). Thus we have described a candidate for this affine combination ofxandu, even thoughxanduare in general only secondorder neighbours. In particular, takingt=
12, we would get a candidate for the midpoint
µ(
x,
u)
. The problem with this definition is of course that it depends not only onxandu, but also (seemingly) on the ‘‘interpolating’’yandz. By working in coordinates, using the Christoffel symbolΓ=
Γ(
x; − , − )
forλ
atx, we shall prove that this dependence is only apparent for symmetricλ
. We may writey=
x+
d1andu=
y+
d2=
x+
d1+
d2. It is not in general true thatz=
x+
d2, but ratherz
=
x+
d2−
Γ(
d1,
d2).
(3.2)For, since
λ(
x,
y, − )
maps the set of neighbours ofxbijectively to the set of neighbours ofy, it suffices to see thatλ(
x,
y,
x+
d2−
Γ(
d1,
d2)) =
x+
d1+
d2.
Calculating the left hand side withΓyields (sincey
−
x=
d1) x+
d1+
d2−
Γ(
d1,
d2) +
Γ(
d1,
d2−
Γ(
d1,
d2))
=
x+
d1+
d2−
Γ(
d1,
d2) +
Γ(
d1,
d2) −
Γ(
d1,
Γ(
d1,
d2)),
using bilinearity ofΓ, so we end up withx
+
d1+
d2−
Γ(
d1,
Γ(
d1,
d2))
. Now the last term vanishes:d1here occurs in a quadratic fashion, andd1∈
D(
n)
; more precisely, sinceΓ( − ,
Γ( − ,
r))
is bilinear, it vanishes if a vector fromD(
n)
is put in both the empty slots. Thus we finally end up withx+
d1+
d2, which isu.Substituting the expression (3.2) forzgives
λ(
x,
yt,
zt) = λ(
x, (
1−
t)
x+
ty, (
1−
t)
x−
tz) = λ(
x,
x+
td1,
x+
t(
d2−
Γ(
d1,
d2)) ;
let us calculate this usingΓ; we get that it equalsx
+
td1+
td2−
tΓ(
d1,
d2) +
Γ(
td1,
td2−
tΓ(
d1,
d2)).
As before, the ‘‘nested’’ appearance ofΓ vanishes sinced1
∈
D(
n)
, and we are left withx
+
td1+
td2−
tΓ(
d1,
d2) +
Γ(
td1,
td2).
(3.3)Now we use that
λ
was assumed symmetric, so thatΓis a symmetric bilinear form. Then by (1.1),Γ(
d1,
d2) =
12Γ
(
d1+
d2)
, and similarly forΓ(
td1,
td2)
. Thus the expression (3.3) only depends onxandd1+
d2, that is, onxanduonly. This proves the desired independence of the choice ofd1andd2.Let us show that one gets the symmetric affine connection
λ
back from the midpoint formationµ
to whichλ
gives rise.Let
λ ˜
be the affine connection constructed fromµ
, so for a whiskerx,
y,
zatx, usexas the interpolating point between 2y−
x and 2z−
x; soλ( ˜
x,
y,
z) = µ(
2y−
x,
2z−
x) = λ
x
,
12x
+
12
(
2y−
x),
1 2x+
12
(
2z−
x)
,
but 12x+
12
(
2y−
x) =
yand12x+
12
(
2z−
x) =
z, by purely affine calculations; so we getλ(
x,
y,
z)
back.In [5], it is shown how a Riemannian metric geometrically gives rise to a midpoint formation (out of which, in turn, the Levi-Civita affine connection may be constructed, by the process given by the Theorem).
4. Point reflection (geodesic symmetry)
For a pair of first order neighbours,x
∼
1y, on a manifoldM, one has canonically the point reflection ofyinx, namely the affine combination 2x−
y; it thus defines a mapM(1)→
M.Definition 4.1. Apoint reflectionon a manifoldMis a map
∗ :
M(2)→
M, which extends the point reflectionM(1)→
Mfor pairs of first order neighbour points.We writex
∗
yfor the values of this map, ‘‘x∗
yis the reflection ofyinx’’. It should be thought of as an infinitesimal geodesic symmetry.Problem. One would like to investigate the conditions when a point reflection structure satisfies the equation,
(
x∗
y) ∗ (
x∗
z) =
x∗ (
y∗
z)
for ‘‘symmetric spaces in the sense of Loos’’ (cf. [10]), wheneverx
,
y,
zare mutual 2-neighbours. The equation does not immediately make sense in our context, since we cannot assert thatx∼
2(
y∗
z)
. However, it can be proved thatx∼
3(
y∗
z)
, and one can prove that∗ :
M(2)→
Mextends uniquely to a map∗ :
M(3)→
Msatisfyingx∗ (
x∗
u) =
ufor allu∼
3x, and with this extended∗
, the expressionx∗ (
y∗
z)
makes sense.Theorem 4.2. Any point reflection structure
∗
on M gives canonically rise to a symmetric affine connectionλ
on M.Proof. Given
∗
, and given an infinitesimal 2-whisker(
x,
y,
z)
. Sincex∼
1z, we may form the affine combinationz′:=
2x−
z (reflection ofzinx), and it is still a first order neigbour ofx. Also, we may formm:=
12x
+
12y, likewise a first order neighbour ofx, som
∼
2z′, and therefore we may formm∗
z′. We defineλ(
x,
y,
z)
to be this point. The relevant picture is here:✘✘✘✘✘✘
☎
☎
☎☎
☎
☎
☎☎
✘✘✘✘✘✘
x
y z
q q
q
q
λ(
x,
y,
z)
☎
☎
☎☎ q z′ ✪
✪
✪
✪
✪
✪
✪
✪✪
qm (4.1)
It is easy to see that
λ(
x,
x,
z) =
zandλ(
x,
y,
x) =
y, soλ
is indeed an affine connection. The fact that it is symmetric is not immediate from the definition; we prove it by considering a coordinatized situation, so identify an open neighbourhood of x,
y,
zwith an open subset ofRn, soy=
x+
d1,z=
x+
d2ford1andd2inD(
n)
. Now, since∗
extends the canonical point reflection for first order neighbours, it is by KL of the formx∗
u=
2x−
u+
Qx(
u−
x)
whereQx:
Rn→
Rnis a quadratic map, i.e.Qx(v) =
Γ(
x; v, v)
for some (unique) symmetric bilinear mapΓ(
x; − , − ) :
Rn×
Rn→
Rn. The symmetry of the constructedλ
will now follow by proving that the Christoffel symbols forλ
are theΓs. Here is the calculation. Note that m=
x+
12d1, and thatz′
=
x−
d2, soλ(
x,
y,
z) =
m∗
z′=
2 x
+
12d1
− (
x−
d2) +
Qm x
−
d2−
x
+
12d1
=
x+
d1+
d2+
Γ
m
; −
d2−
12d1
, −
d2−
12d1
.
TheΓ term may be rewritten by (1.1) using symmetry and bilinearity ofΓ
(
m; − , − ) =
Γ(
x+
12d1
; − , − )
; it gives Γ(
x+
12d1
;
d1,
d2)
. Here,d1appears as an argument in a linear position (after the semicolon), and then a Taylor expansionargument gives that thex
+
12d1in front of the semicolon may be replaced byx. (This is becaused1is inD
(
n)
; for a precise formulation of this ‘‘Taylor principle’’, see [9] (1.4.2).) Putting things together, we thus haveλ(
x,
y,
z) =
y−
x+
z+
Γ(
x;
d1,
d2),
proving thatΓ
(
x; − , − )
is indeed the Christoffel symbol atxforλ
. This proves the symmetry.Theorem 4.3. Any symmetric affine connection
λ
on M gives rise canonically to a point reflection structure (geodesic symmetry)∗
.Just as inTheorem 3.3, the construction depends on interpolating a
λ
-parallelogramx,
y,
z,
ubetweenxanduforx∼
2 u (see figure (3.1): one then putsx
∗
u:= λ(
x,
2x−
y,
2x−
z),
and the proof that this is independent of the choice of the interpolation is as forTheorem 3.3. Again, the constructions are the inverse of each other. We may summarize the results of the last two sections in
Theorem 4.4. On any manifold M, there are canonical bijective correspondences between the following three kinds of geometric structure:
•
symmetric affine connectionsλ
•
midpoint formationsµ :
M(2)→
M•
point reflection structures∗ :
M(2)→
M.Any mapf
:
M′→
Mbetween manifolds preserves the neighbour relations∼
1and∼
2(Theorem 1.1); therefore ifM′ andMare equipped with affine connectionsλ
′andλ
, respectively, it makes sense to ask whetherfis connection preservingf
(λ
′(
x,
y,
z)) = λ(
f(
x),
f(
y),
f(
z)),
for anyx
∼
1y,
x∼
1zinM′. IfM′andMare equipped with midpoint formation structuresµ
′andµ
, respectively, it makes sense to ask whetherfpreserves midpoint formation,f
(µ
′(
x,
u)) = µ(
f(
x),
f(
u))
for pairs of second order neighboursx
∼
2uinM′. Similarly ifM′andMare equipped with point reflection structures.Symmetric affine connections, midpoint formation structures, and point reflection structures correspond, by Theorem 4.4; the correspondences are constructed using affine combinations of first order neighbour points, preserved by anyf byTheorem 1.2. Therefore it follows that the assertions ‘‘fis connection preserving ’’, ‘‘f is midpoint preserving’’, and ‘‘f is point-reflection preserving’’ are equivalent (for symmetric affine connections, and the corresponding
µ
and∗
).Such mapsf
:
M′→
Mdeserve the namegeodesicmaps.In particular, the number lineR(being an affine space) carries canonical structures
λ
′,µ
′,∗
(which correspond to each other), namelyλ
′(
x,
y,
z) =
y−
x+
z, µ
′(
x,
u) =
1 2x+
12u
,
x∗
u=
2x−
u,
(in fact without any restrictions likex
∼
2 u). A mapf:
R→
Minto a manifoldMequipped with a symmetric affine connectionλ
deserves the name(parametrized) geodesic curveiff is geodesic in the general sense just defined. This is equivalent tofpreserving midpoint formation or point reflection. In particular,fis geodesic iff(
x+
2d) =
f(
x+
d) ∗
f(
x)
for d∈
D2. A subsetC⊆
Mdeserves the nameunparametrized curveif there is an embeddingf:
R→
MmappingRbijectively ontoC. In this case,Cdeserves the namegeodesicifx∼
2uinCimpliesx∗
u∈
C, in other words, ifCis stable under point reflection.Acknowledgement
Most of the content of Sections 1–3 of the present article was presented as an invited lecture at the 15th IAPR International Conference, DCGI 2009 (Montréal September/October 2009), cf. [8].
References
[1] L. Breen, W. Messing, Combinatorial differential forms, Adv. Math. 164 (2001) 203–282.
[2] A. Kock, Synthetic Differential Geometry, in: London Math. Soc. Lecture Notes Series, vol. 51, Cambridge University Press, 1981, Second edition: London Math. Soc. Lecture Notes Series 333, Cambridge University Press 2006.
[3] A. Kock, A combinatorial theory of connections, in: J. Gray (Ed.), Mathematical Applications of Category Theory, Proceedings 1983, Contemporary Math. 30 (1984), 132–144.
[4] A. Kock, Introduction to synthetic differential geometry, and a synthetic theory of dislocations, in: F.W. Lawvere, S. Schanuel (Eds.), Categories in Continuum Physics, Proceedings Buffalo 1982, in: Springer Lecture Notes, vol. 1174, 1986.
[5] A. Kock, Geometric construction of the Levi-Civita parallelism, Theory Appl. Categ. 4 (1998) 195–207.
[6] A. Kock, Differential forms as infinitesimal cochains, J. Pure Appl. Algebra 154 (2000) 257–264.
[7] A. Kock, Differential calculus and nilpotent real numbers, Bull. Symbolic Logic 9 (2003) 225–230.
[8] A. Kock, Affine connections, and midpoint formation, in: Discrete Geometry for Computer Imagery, Proceedings Montréal, in: LNCS, vol. 5810, Springer, 2009, pp. 13–21.
[9] A. Kock, Synthetic Geometry of Manifolds, in: Cambridge Tracts in Mathematics, vol. 180, 2010.
[10] O. Loos, Symmetric Spaces, Benjamin, 1969, pp. 1–2.
[11] W. Noll, Materially uniform simple bodies with inhomogeneities, Arch. Ration Mech. Anal. 27 (1967) 1–32.