Or: waves that take the shortest path through infinity
guest post by Tim van Beek
Water waves can do a lot of things that light waves cannot, like “breaking”:
In mathematical models this difference shows up through the kind of partial differential equation (PDE) that models the waves:
• light waves are modelled by linear equations while
• water waves are modelled by nonlinear equations.
Physicists like to point out that linear equations model things that do not interact, while nonlinear equations model things that interact with each other. In quantum field theory, people speak of “free fields” versus “interacting fields”.
Some nonlinear PDE that describe fluid flows turn out to also describe geodesics on infinite-dimensional Riemannian manifolds. This fascinating observation is due to the Russian mathematician Vladimir Arnold. In this blog post I would like to talk a little bit about the concepts involved and show you a little toy example.
Fluid Flow modelled by Diffeomorphisms
The Euler viewpoint on fluids is that a fluid is made of tiny “packages” or “particles”. The fluid flow is described by specifying where each package or particle is at a given time . When we start at some time
on a given manifold
, the flow of every fluid package is described by a path on
parametrized by time, and for every time
there is a diffeomorphism
defined by the requirement that it maps the initial position
of each fluid package to its position
at time
:
This picture is taken from the book
• V.I. Arnold and B.A. Khesin, Topological Methods in Hydrodynamics, Springer, Berlin, 1998. (Review at Zentralblatt Mathematik.)
We will take as a model of the domain of the fluid flow a compact Riemannian manifold . A fluid flow, as pictured above, is then a path in the diffeomorphism group
. In order to apply geometric concepts in this situation, we will have to turn
or some closed subgroup of it into a manifold, which will be infinite dimensional.
The curvature of such a manifold can provide a great deal about the stability of fluid flows: On a manifold with negative curvature geodesics will diverge from each other. If we can model fluid flows as geodesics in a Riemannian manifold and calculate the curvature, we could try to infer a bound on weather forecasts (in fact, that is what Vladimir Arnold did!): The solution that you calculate is one geodesic. But if you take into account errors with determining your starting point (involving the measurement of the state of the flow at the given start time), what you are actually looking at is a bunch of geodesics starting in a neighborhood of your starting point. If they diverge fast, that means that measurement errors make your result unreliable fast.
If you never thought about manifolds in infinite dimensions, you may feel a little bit insecure as to how the concepts that you know from differential geometry can be generalized from finite dimensions. At least I felt this way when I first read about it. But it turns out that the part of the theory one needs to know in order to understand Arnold’s insight is not that scary, so I will talk a little bit about it next.
What you should know about infinite-dimensional manifolds
The basic strategy when handling finite-dimensional, smooth, real manifolds is that you have a complicated manifold , but also locally for every point
a neighborhood
and an isomorphism (a “chart”) of
to an open subset of the vastly simpler space
, the “model space”. These isomorphisms can be used to transport concepts from
to
. In infinite dimensions it is however not that clear what kind of model space
should be taken in place of
. What structure should
have?
Since we would like to differentiate, we should for example be able to define the derivative of a curve in :
If we write down the usual formula for a derivative
we see that to make sense of this we need to be able to add elements, have a scalar multiplication, and a topology such that the algebraic operations are continuous. Sets with this structure are called topological vector spaces.
A curve that has a first derivative, second derivative, third derivative… and so on at every point is called a smooth curve, just as in the finite dimensional case.
So should at least be a topological vector space. We can, of course, put more structure on
to make it “more similar” to
, and choose as model space in ascending order of generality:
1) A Hilbert space, which has an inner product,
2) a Banach space that does not have a inner product, but a norm,
3) a Fréchet space that does not have a norm, but a metric,
4) a general topological vector space that need not be metrizable.
People talk accordingly of Hilbert, Banach and Fréchet manifolds. Since the space consisting of smooth maps from
to
is not a Banach space but a Fréchet space, we should not expect that we can model diffeomorphism groups on Banach spaces, but on Fréchet spaces. So we will use the concept of Fréchet manifolds.
But if you are interested in a more general theory using locally convex topological vector spaces as model spaces, you can look it up here:
• Andreas Kriegl and Peter W. Michor, The Convenient Setting of Global Analysis, American Mathematical Society, Providence Rhode Island, 1999.
Note that Kriegl and Michor use a different definition of “smooth function of Fréchet spaces” than we will below.
If you learn functional analysis, you will most likely start with operators on Hilbert spaces. One could say that the theory of topological vector spaces is about abstracting away as much structure from a Hilbert space and look what structure you need for important theorems to still hold true, like the open mapping/closed graph theorem. If you would like to learn more about this, my favorite book is this one:
• Francois Treves, Topological vector Spaces, Distributions and Kernels, Dover Publications, 2006.
Since we replace the model space with a Fréchet space
, there will be certain things that won’t work out as easily as for the finite dimensional
, or not at all.
It is nevertheless possible to define both integrals and differentials that behave much in the expected way. You can find a nice exposition of how this can be done in this paper:
• Richard S. Hamilton, The inverse function theorem of Nash and Moser, Bulletin of the American Mathematical Society 7 (1982), pages 65-222.
The story starts with the definition of the directional derivative that can be done just as in finite dimensions:
Let and
be Fréchet spaces,
open and
a continuous map. The derivative of
at the point
in the direction
is the map
given by:
A simple but nontrivial example is the operator
given by:
with the derivative
It is possible to define higher derivatives and also prove that the chain rule holds, so that we can define that a function between Fréchet spaces is smooth if it has derivatives at every point of all orders. The definition of a smooth Fréchet manifold is then straightforward: you can copy the usual definition of a smooth manifold word for word, replacing by some Fréchet space.
With tangent vector s, you may remember that there are several different ways to define them in the finite dimensional case, which turn out to be equivalent. Since there are situations in infinite dimensions where these definitions turn out to not be equivalent, I will be explicit and define tangent vector s in the “kinematic way”:
The (kinematic) tangent vector space of a Fréchet manifold
at a point
consists of all pairs
where
is a smooth curve
With this definition, the set of pairs forms a Fréchet manifold, the tangent bundle
, just as in finite dimensions.
The first serious (more or less) problem we encounter is the definition of the cotangent bundle: is isomorphic to its dual vector space. This is still true for every Hilbert space (this is known as the Riesz representation theorem). It fails already for Banach spaces: The dual space will still be a Banach space, but a Banach space does not need to be isomorphic to its dual, or even the dual of its dual (though the latter situation happens quite often, and such Banach spaces are called reflexive).
With Fréchet spaces things are even a little bit worse, because the dual of a Fréchet space (which is not a Banach space) is not even a Fréchet space! Since I did not know that and could not find a reference, I asked about this on Mathoverflow here and promptly got an answer. Mathoverflow is a really amazing platform for this kind of question!
So, if we naively define the cotangent space as in finite dimensions by taking the dual space of every tangent space, then the cotangent bundle won’t be a Fréchet manifold.
We will therefore have to be careful with the definition of differential forms for Fréchet manifolds and define it explicitly:
A differential form (a one form) is a smooth map
where is the tangent bundle, such that
restricts to a linear map on every tangent space
.
Another pitfall is that theorems from multivariable calculus may fail in Fréchet spaces, like the existence and uniqueness theorem of Picard-Lindelöf for ordinary differential equations. Things are much easier in Banach spaces: If you take a closer look at multivariable calculus, you will notice that a lot of definitions and theorems actually make use of the Banach space structure of only, so that a lot generalizes straight forward to infinite dimensional Banach spaces. But that is less so for Fréchet spaces.
By now you should feel reasonably comfortable with the notion of a Fréchet manifold, so let us talk about the kind of gadget that Arnold used to describe fluid flows: diffeomorphism groups that are both infinite-dimensional Riemannian manifolds and Lie groups.
The geodesic equation for an invariant metric
If is both a Riemannian manifold and a Lie group, it is possible to define the concept of left or right invariant metric. A left or right invariant metric
on
is one that does not change if we multiply the arguments with a group element:
A metric is left invariant iff for all
:
Similarly, is right invariant iff:
How does one get a one-sided invariant metric?
Here is one possibility: If you take a Lie group off the shelf, you get two automorphisms for free, namely the left and right multiplication by a group element
:
given by:
Pictorially speaking, you can use the differentials of these to transport vector s from the Lie algebra of
– which is the tangent space at the identity of the group,
– to any other tangent space
. If you can define an inner product on the Lie algebra, you can use this trick to transport the inner product to all the other tangent spaces by left or right multiplication, which will get you a left or right invariant metric.
To be more precise, for every tangent vectors of a tangent space
there are unique vectors
that are mapped to
by the differential of the right multiplication
, that is
and we can define the inner product of and
to have the value of that of
and
:
This works for the left multiplication , too, of course.
For a one-sided invariant metric, the geodesic equation looks somewhat simpler than for general metrics. Let us take a look at that:
On a Riemannian manifold with tangent bundle
there is a unique connection, the Levi-Civita connection, with the following properties for vector fields
:
If we combine both formulas we get
If the inner products are constant along every flow, i.e. the metric is (left or right) invariant, then the first three terms on the right hand side vanish, so that we get
This latter formula can be written in a more succinct way if we introduce the coadjoint operator. Remeber the adjoint operator defined to be
With the help of the inner product we can define the adjoint of the adjoint operator:
Beware! We’re using the word ‘adjoint’ in two completely different ways here, both of which are very common in math. One way is to use ‘adjoint’ for the operation of taking a Lie bracket: . Another is to use ‘adjoint’ for the linear map
coming from a linear map between inner product spaces
given by
Please don’t blame me for this.
Then the formula above for the covariant derivative can be written as
Since the inner product is nondegenerate, we can eliminate and get
A geodesic curve is one whose tangent vector is transported parallel to itself. That is, we have
Using the formula for the covariant derivative for an invariant metric above we get
as a reformulation of the geodesic equation.
For time dependent dynamical systems, we have the time axis as an additional dimension and every vector field has as an additional summand. So, in this case we get as the geodesic equation (again, for an invariant metric):
A simple example: the circle
As a simple example we will look at the circle and its diffeomorphism group
The Lie algebra
of
can be identified with the space of all vector fields on
If we sloppily identify
with
with coordinate
, then we can write for vector fields
and
the commutator
where is short for the derivative:
And of course we have an inner product via
which we can use to define either a left or a right invariant metric on , by transporting it via left or right multiplication to every tangent space.
Let us evaluate the geodesic equation for this example. We have to calculate the effect of the coadjoint operator:
If we write for the vector fields ,
and
, this results in
where the last step employs integration by parts and uses the periodic boundary condition for the involved functions.
So we get for the coadjoint operator
Finally, the geodesic equation
turns out to be
A similar equation,
is known as the Hopf equation or inviscid Burgers’ equation. It looks simple, but its solutions can produce behaviour that looks like turbulence, so it is interesting in its own right.
If we take a somewhat more sophisticated diffeomorphism group, we can get slightly more complicated and therefore more interesting partial differential equations like the Korteweg-de Vries equation. But since this post is quite long already, that topic will have to wait for another post!

Reblogged this on mathenaetor.
Thank you for the post, I’ve been meaning to brush up my differential geometry, I now look forward to the next post. I do have a couple of questions:
1) I don’t quite understand the Levi-Cevita connection part. Is there a term missing when you combine the two formulae? Also I don’t see how to make the next jump. Is this compatibility of the inner product with the Lie bracket?
2) In your final example, what generality are we working in? When we integrate around the circle, is that an inner product, does it define a norm, or is it just the metric of a Fréchet manifold? I wish I had the time to investigate myself! Also is there any kind of physical system associated to this example?
I’ll only answer the question Tim already pointed to an answer for:
Yes, near the end of this post Tim is deriving the inviscid Burgers equation in the case where space is a circle. The same idea works in higher dimensions, for example 2 or 3. The idea is that we have a pressure-free, zero-viscosity gas of particles each of which move along a straight line (or geodesic) at constant speed. Thus, the convective derivative of the velocity vector field is zero:
I hope you (and indeed everyone in the universe) knows about the convective derivative of the velocity vector field, which is the left-hand side of the above equation. It’s the time derivative of the velocity vector field as seen by a particle moving along with that velocity—that’s what adds that second term! Does everyone still learn about this in school?
There’s a funny factor of 3 in Tim’s version of this equation, which I find puzzling, but we can make it go away by doing a simple transformation:
One fun thing about the inviscid Burgers equation is that it develops ‘shock waves’ when particles with different velocities crash into each other. Here we see the shock wave developing as the faster particles move into a region of zero-velocity particles and speed of the particles becomes a discontinuous function:
This is the beginning of a longer story…
Thank you, that’s really helped. So perhaps the most natural solutions for this equation aren’t continuous at all, they’re delta or Green’s functions. Steps travel around the circle with velocity proportional to their height.
Obviously you can’t just add any two solutions to these differential equations; but from the physical description you can take the unions of the two sets of particles, but I suppose we are assuming that they don’t ‘overlap’. I feel that there should be an algebraic analogue of this by which you can combine any two solutions, but perhaps you have to re-parametrize either the circle or time to make it work.
Correct me if I’m wrong, but doesn’t the formation of a shock just correspond to the geodesic hitting the boundary (I think
is noncompact and has a boundary)? So the shock would simply be an endpoint after which the solution is no longer defined (at least in the differential geometric setting).
also, how do TeX here?!
James Griffin wrote:
Hmm, that’s interesting! Your somewhat ‘algebraic’ outlook on this problem sounds worth pursuing. The awkwardness of assuming two sets don’t overlap seems related to the existence of shocks, since shocks happen when they start to overlap.
It might be possible to enlarge
to the space of smooth maps from the circle to itself and extend the solutions of the equations beyond shocks that way. A map that’s not 1-1 would have several particles at the same point, typically with different velocities.
There’s been a lot of work on this stuff already, most of which I don’t know.
Heiki Arponen wrote:
I get what you mean, and it’s a good intuition, but technically it’s a bit slippery.
First of all, it’s hard for to make sense of the idea that a topological group has a ‘boundary’. In a group every point is just like every other (thanks to left or right translation symmetry), so it doesn’t seem to make sense to say one is ‘on the boundary’. We can try to imagine some sort of ideal boundary added on to
consisting of points that describe shocks. But I haven’t worked out the details. Maybe someone has.
Second, on a lesser note,
isn’t even locally compact, because it’s infinite-dimensional.
The safest thing to say for now is that some geodesics in
‘run off to infinity in a finite amount of time’.
This can’t happen for geodesics on a finite-dimensional Lie group equipped with some left-invariant metric, even a noncompact Lie group.
Note the shock isn’t actually in
, so to make sense of this ‘endpoint’ we’d need to add some ‘boundary points’ to this group.
On this blog, and all WordPress blogs, you need to type
$latex x = y$
to get the equation
The ‘latex’ must occur directly after the dollar sign, with no space in between. There must be a space after it. And double dollar signs don’t work, at least on my blog.
Thanks for the reply John, that makes a lot of sense! I meant with a “boundary” a similar concept from finite dimensional noncompact symmetric spaces, where indeed the geodesic distance to the boundary is infinite, unlike in
(insofar as a concept of a boundary makes sense here).
Again, this is kind of a stretch, but I guess
is isomorphic to the group of conformal maps on the upper half plane (preserving the real axis), so that might provide the “analytic continuation”?
Heikki wrote:
Interesting idea. It needs to be polished a bit:
There’s a problem with how a rotation of
corresponds to a transformation that maps
to a finite real number.
Also, we’d better work with real-analytic diffeomorphisms of
, not just smooth ones, if we want something like this.
I think it’s a bit easier to work with the unit disk than the upper half-plane, and try to show any real-analytic diffeomorphism of the unit circle extends to an analytic map from the unit disk to itself. Is this true? If it’s true, clearly the extension is unique.
What I really want is a larger space such that when a solution of the inviscid Burgers’ equation develops a shock, it ceases to be an element of
but still lies in this larger space.
I think the obvious choice is the space of smooth maps from the circle to itself. Using the ‘particle picture’ of the inviscid Burgers’ equation, a shock occurs when two or more particles hit, and this means we get a smooth map that’s no longer one-to-one.
Someone should have studied this already! Time to do a little poking around in the literature…
For question (1a) Yes, the Koszul formula is missing one term. See e.g. http://en.wikipedia.org/wiki/Fundamental_theorem_of_Riemannian_geometry For question (1b) that is the compatibility of the choice of the vector fields
. In other words, it is postulated that
etc. are 0. In the case of the Lie group, the assumption probably is that the vector fields
is given by transporting tangent vectors at the identity with the group action.
Sorry for being absent, I have been terribly busy yesterday and am today.
(1a) Willie is right, but the missing term is recovered in the next line :-)
(1b) Willie is right again. I should have been clearer: The step is actually that we assume that we look at vector fields such that the scalar product is constant along all flows generated by all fields. This happens with an invariant metric and invariant vector fields.
I respectfully submit that Arnold, in all his glory, is not “the” Russian mathematician. :-)
Excellent post! I just happened to be getting to know these things in the course of my work, so it’s doubly interesting. I wish all math textbooks were written this clearly :)
Now only if there was a way of describing viscous flows as geodesic flows on Diff(M)… Maybe some stochasticity would give rise to viscosity?
Interesting question, but unfortunately I have no idea. I’m still working on an understanding of ideal incompressible flows. That’s what Arnold actually worked on.
Stochasticity contributes to the wave energy spectrum of typical open-ocean waves.
First, we make a maximum entropy estimation of the energy of a one-dimensional propagating wave driven by a prevailing wind direction. The mean energy of the wave is related to the wave height by the square of the height, H. This makes sense because a taller wave needs a broader base to support that height, leading to a scaled pseudo-triangular shape, as shown in this figure.
Since the area of such a scaled triangle goes as H^2, the MaxEnt cumulative probability is:
where a is related to the mean energy of an ensemble of waves. This relationship is empirically observed from measurements of ocean wave heights over a sufficient time period. However, we can proceed further and try to derive the dispersion results of wave frequency, which is the very common oceanography measure. So we consider — based on the energy stored in a specific wave — the time, t, it will take to drop a height, H, by the Newton’s law relation:
and since t goes as 1/f, then we can create a new PDF from the height cumulative as follows:
where
then
which is just the Pierson-Moskowitz wave spectra that oceanographers have observed for years (developed first in 1964, variations of this include the Bretschneider and ITTC wave spectra).
One can actually test this by visiting one of the real-time buoy measuring stations, this one off of San Diego.
http://cdip.ucsd.edu/?nav=historic&sub=data&units=metric&tz=UTC&pub=public&map_stati=1,2,3&stn=167&stream=p1&xyrmo=201201&xitem=product25
A typical spectrum with a fit looks like this
This is an interesting derivation that I have not seen anywhere before and it is one of those cases that first-order physics (Newton’s Law plus MaxEnt) works pretty effectively to describe real-world behavior for a seemingly complex phenomena.
Excellent article, Tim!
I have a few naive questions:
Besides the correspondence of the fluid parcel pathlines and the curvature of the manifold, is there any other information we can get from the manifold that will tell us about the nature of the fluid flow?
For instance, if we were ignorant about the nature of the solutions to the inviscid Burger equation, are there any properties of $Diff(S^1)$ that would alert us to the fact that the characteristics of the equation governing the geodesics cross?
Can you assign a corresponding geodesic equation to the dynamical equation of any inviscid fluid flow? Is there then necessarily a manifold that corresponds with this geodesic equation?
Although most of this is out of my league, I find the connection interesting and beautiful and in my ignorance I am trying to find places where this alternative way of looking at fluids would yield new information that would not be apparent, or would be very difficult to deduce, in the classical description.
Nick
Those are not naive questions, at least not from where I am standing.
Both John and I plan to write a follow-up to this post where we will address this (although that is easier announced than done, of course). A relevant paper about shock waves and how they can be described using the viewpoint of manifolds is this:
• B. Khesina and G. Misiolek, Shock waves for the Burgers equation and curvatures of diffeomorphism groups.
I have no idea, but I guess that we’d have to make “any” in “any inviscid fluid flow” more precise in order to get an answer.
I recently developed a non-breaking-wave derivation here:
http://theoilconundrum.blogspot.com/2012/01/wave-energy-spectrum.html
The math may seem a bit pedantic, but I like pedantic. Nothing but Newton’s law, MaxEnt principle, and inverting probability densities are involved.
I believe the key to describing all these physical phenomenon in practice is to assume disorder. I would appreciate getting some feedback on the approach, especially in regards to finding an alternate derivation for wave spectra. Every oceanography text I have flipped through just states the Pierson-Moskowitz wave spectra as a given without providing a first-principles derivation.
I think that is kind of common in the earth and geosciences disciplines and is why I appreciate the Azimuth — as the understanding always seems to come first here.
Interesting discussions. I’d just like to make three points:
1) It’s sort of obvious but I only just noticed that if you integrate u around the circle then you get a conserved quantity (w.r.t. time). Both the physical and geodesic interpretations are quite natural.
2) If you make sure that the above integral is zero then a step function describes a periodic geodesic (just integrate by hand starting at the identity, or constant map). Having said that I haven’t done this rigorously, I could be mistaken.
2) One could try to do the differential geometry of a function which takes multiple values (multiple ‘particles’ with differing velocities) by replacing
with a category of spans. When composing one has to be careful taking the pullbacks (I think it’s possible to keep our pullbacks being closed 1-manifolds if not a single circle). The Lie algebra looks like the tensor product of two copies of the Lie algebra above, however there is an issue with the invariant metric, because it isn’t both left and right invariant I think you have to change the Lie algebra slightly. You also need to change the metric a little and make sure that everything you do is invariant with respect to a diagonal copy of
which corresponds to equivalence of spans, or reparametrisation of our curve.
Having said all this I’m not convinced that what you get behaves nicely. I’ve so far failed to fill in all the details.
Sorry that second 2) should be a 3).
I didn’t say what a span was; John has helpfully written about them in the past:
http://math.ucr.edu/home/baez/span/
A few more observations:
It seems clear that there will be a Hamiltonian structure associated with the geodesics of the manifold, since these flow lines are inherently minimizing a quantity.
From these will we be able to deduce relevant conservation laws of the fluid? Also, I would expect the Hamiltonian structure to yield additional information about the dynamic equations of the geodesics, but precisely what I’m unsure of.
Since speculation gets us nowhere, I’ll start to scour the internet for relevant resources.
These seem relevant/interesting:
http://terrytao.wordpress.com/2010/06/07/the-euler-arnold-equation/
http://arxiv.org/pdf/0906.0184v1.pdf
http://arxiv.org/pdf/1004.2462v1.pdf
Thanks for the input, I will have a look at it. I wrote a stub on the Azimuth Projekt, analytical hydrodynamics, with three references to textbooks that I use to fill me in. All three of them seem to be quite useful to me, and you can find some answers to your questions there.
[…] Last time in this series, we set the stage by explaining infinite dimensional manifolds. Then we looked at a simple example: the inviscid Burgers equation. We saw this was the equation for geodesics in the diffeomorphism group of the circle. […]
Hello,
I’m quite puzzled by the burgers equation… If we dealing with this group can we identify the
then the eulerian velocity
so the velocity is only imaginary?
I don’t know where you’re getting that
in your
. There are no complex numbers in Tim’s post or in the Burgers equation. Physicists doing quantum mechanics like to insert
in the definition of Lie algebras, so that group elements look like
where
is a Lie algebra element. But mathematicians know this is unnecessary, and in contexts like the Burgers equation this it would be very confusing. If
is a vector field on a compact manifold, we can exponentiate it and get a diffeomorphism 
Hello,
I’m quite puzzled by the burgers equation… If we dealing with this group can we identify the
then the eulerian velocity
so the velocity is only imaginary?