The Euler-Arnold equation
7 June, 2010 in expository, math.AP, math.DG, math.DS | Tags: Euler equation, Euler-Arnold equation, geodesic flow, Lax pair, Vladimir Arnold | by Terence Tao
A (smooth) Riemannian manifold is a smooth manifold {M} without boundary, equipped with a Riemannian metric {{\rm g}}, which assigns a length {|v|_{{\rm g}(x)} \in {\bf R}^+} to every tangent vector {v \in T_x M} at a point {x \in M}, and more generally assigns an inner product
\displaystyle \langle v, w \rangle_{{\rm g}(x)} \in {\bf R}
to every pair of tangent vectors {v, w \in T_x M} at a point {x \in M}. (We use Roman font for {g} here, as we will need to use {g} to denote group elements later in this post.) This inner product is assumed to symmetric, positive definite, and smoothly varying in {x}, and the length is then given in terms of the inner product by the formula
\displaystyle |v|_{{\rm g}(x)}^2 := \langle v, v \rangle_{{\rm g}(x)}.
In coordinates (and also using abstract index notation), the metric {{\rm g}} can be viewed as an invertible symmetric rank {(0,2)} tensor {{\rm g}_{ij}(x)}, with
\displaystyle \langle v, w \rangle_{{\rm g}(x)} = {\rm g}_{ij}(x) v^i w^j.
One can also view the Riemannian metric as providing a (self-adjoint) identification between the tangent bundle {TM} of the manifold and the cotangent bundle {T^* M}; indeed, every tangent vector {v \in T_x M} is then identified with the cotangent vector {\iota_{TM \rightarrow T^* M}(v) \in T_x^* M}, defined by the formula
\displaystyle \iota_{TM \rightarrow T^* M}(v)(w) := \langle v, w \rangle_{{\rm g}(x)}.
In coordinates, {\iota_{TM \rightarrow T^* M}(v)_i = {\rm g}_{ij} v^j}.
A fundamental dynamical system on the tangent bundle (or equivalently, the cotangent bundle, using the above identification) of a Riemannian manifold is that of geodesic flow. Recall that geodesics are smooth curves {\gamma: [a,b] \rightarrow M} that minimise the length
\displaystyle |\gamma| := \int_a^b |\gamma'(t)|_{{\rm g}(\gamma(t))}\ dt.
There is some degeneracy in this definition, because one can reparameterise the curve {\gamma} without affecting the length. In order to fix this degeneracy (and also because the square of the speed is a more tractable quantity analytically than the speed itself), it is better if one replaces the length with the energy
\displaystyle E(\gamma) := \frac{1}{2} \int_a^b |\gamma'(t)|_{{\rm g}(\gamma(t))}^2\ dt.
Minimising the energy of a parameterised curve {\gamma} turns out to be the same as minimising the length, together with an additional requirement that the speed {|\gamma'(t)|_{{\rm g}(\gamma(t))}} stay constant in time. Minimisers (and more generally, critical points) of the energy functional (holding the endpoints fixed) are known as geodesic flows. From a physical perspective, geodesic flow governs the motion of a particle that is subject to no external forces and thus moves freely, save for the constraint that it must always lie on the manifold {M}.
One can also view geodesic flows as a dynamical system on the tangent bundle (with the state at any time {t} given by the position {\gamma(t) \in M} and the velocity {\gamma'(t) \in T_{\gamma(t)} M}) or on the cotangent bundle (with the state then given by the position {\gamma(t) \in M} and the momentum {\iota_{TM \rightarrow T^* M}( \gamma'(t) ) \in T_{\gamma(t)}^* M}). With the latter perspective (sometimes referred to as cogeodesic flow), geodesic flow becomes a Hamiltonian flow, with Hamiltonian {H: T^* M \rightarrow {\bf R}} given as
\displaystyle H( x, p ) := \frac{1}{2} \langle p, p \rangle_{{\rm g}(x)^{-1}} = \frac{1}{2} {\rm g}^{ij}(x) p_i p_j
where {\langle ,\rangle_{{\rm g}(x)^{-1}}: T^*_x M \times T^*_x M \rightarrow {\bf R}} is the inverse inner product to {\langle, \rangle_{{\rm g}(x)}: T_x M \times T_x M \rightarrow {\bf R}}, which can be defined for instance by the formula
\displaystyle \langle p_1, p_2 \rangle_{{\rm g}(x)^{-1}} = \langle \iota_{TM \rightarrow T^* M}^{-1}(p_1), \iota_{TM \rightarrow T^* M}^{-1}(p_2)\rangle_{{\rm g}(x)}.
In coordinates, geodesic flow is given by Hamilton’s equations of motion
\displaystyle \frac{d}{dt} x^i = {\rm g}^{ij} p_j; \quad \frac{d}{dt} p_i = - \frac{1}{2} (\partial_i {\rm g}^{jk}(x)) p_j p_k.
In terms of the velocity {v^i := \frac{d}{dt} x^i = {\rm g}^{ij} p_j}, we can rewrite these equations as the geodesic equation
\displaystyle \frac{d}{dt} v^i = - \Gamma^i_{jk} v^j v^k
where
\displaystyle \Gamma^i_{jk} = \frac{1}{2} {\rm g}^{im} (\partial_k {\rm g}_{mj} + \partial_j {\rm g}_{mk} - \partial_m {\rm g}_{jk} )
are the Christoffel symbols; using the Levi-Civita connection {\nabla}, this can be written more succinctly as
\displaystyle (\gamma^* \nabla)_t v = 0.
If the manifold {M} is an embedded submanifold of a larger Euclidean space {R^n}, with the metric {{\rm g}} on {M} being induced from the standard metric on {{\bf R}^n}, then the geodesic flow equation can be rewritten in the equivalent form
\displaystyle \gamma''(t) \perp T_{\gamma(t)} M,
where {\gamma} is now viewed as taking values in {{\bf R}^n}, and {T_{\gamma(t)} M} is similarly viewed as a subspace of {{\bf R}^n}. This is intuitively obvious from the geometric interpretation of geodesics: if the curvature of a curve {\gamma} contains components that are transverse to the manifold rather than normal to it, then it is geometrically clear that one should be able to shorten the curve by shifting it along the indicated transverse direction. It is an instructive exercise to rigorously formulate the above intuitive argument. This fact also conforms well with one’s physical intuition of geodesic flow as the motion of a free particle constrained to be in {M}; the normal quantity {\gamma''(t)} then corresponds to the centripetal force necessary to keep the particle lying in {M} (otherwise it would fly off along a tangent line to {M}, as per Newton’s first law). The precise value of the normal vector {\gamma''(t)} can be computed via the second fundamental form as {\gamma''(t) = \Pi_{\gamma(t)}( \gamma'(t), \gamma'(t) )}, but we will not need this formula here.
In a beautiful paper from 1966, Vladimir Arnold (who, sadly, passed away last week), observed that many basic equations in physics, including the Euler equations of motion of a rigid body, and also (by which is a priori a remarkable coincidence) the Euler equations of fluid dynamics of an inviscid incompressible fluid, can be viewed (formally, at least) as geodesic flows on a (finite or infinite dimensional) Riemannian manifold. And not just any Riemannian manifold: the manifold is a Lie group (or, to be truly pedantic, a torsor of that group), equipped with a right-invariant (or left-invariant, depending on one’s conventions) metric. In the context of rigid bodies, the Lie group is the group {SE(3) = {\bf R}^3 \rtimes SO(3)} of rigid motions; in the context of incompressible fluids, it is the group {Sdiff({\bf R}^3}) of measure-preserving diffeomorphisms. The right-invariance makes the Hamiltonian mechanics of geodesic flow in this context (where it is sometimes known as the Euler-Arnold equation or the Euler-Poisson equation) quite special; it becomes (formally, at least) completely integrable, and also indicates (in principle, at least) a way to reformulate these equations in a Lax pair formulation. And indeed, many further completely integrable equations, such as the Korteweg-de Vries equation, have since been reinterpreted as Euler-Arnold flows.
From a physical perspective, this all fits well with the interpretation of geodesic flow as the free motion of a system subject only to a physical constraint, such as rigidity or incompressibility. (I do not know, though, of a similarly intuitive explanation as to why the Korteweg de Vries equation is a geodesic flow.)
One consequence of being a completely integrable system is that one has a large number of conserved quantities. In the case of the Euler equations of motion of a rigid body, the conserved quantities are the linear and angular momentum (as observed in an external reference frame, rather than the frame of the object). In the case of the two-dimensional Euler equations, the conserved quantities are the pointwise values of the vorticity (as viewed in Lagrangian coordinates, rather than Eulerian coordinates). In higher dimensions, the conserved quantity is now the (Hodge star of) the vorticity, again viewed in Lagrangian coordinates. The vorticity itself then evolves by the vorticity equation, and is subject to vortex stretching as the diffeomorphism between the initial and final state becomes increasingly sheared.
The elegant Euler-Arnold formalism is reasonably well-known in some circles (particularly in Lagrangian and symplectic dynamics, where it can be viewed as a special case of the Euler-Poincaré formalism or Lie-Poisson formalism respectively), but not in others; I for instance was only vaguely aware of it until recently, and I think that even in fluid mechanics this perspective to the subject is not always emphasised. Given the circumstances, I thought it would therefore be appropriate to present Arnold’s original 1966 paper here. (For a more modern treatment of these topics, see the books of Arnold-Khesin and Marsden-Ratiu.)
In order to avoid technical issues, I will work formally, ignoring questions of regularity or integrability, and pretending that infinite-dimensional manifolds behave in exactly the same way as their finite-dimensional counterparts. In the finite-dimensional setting, it is not difficult to make all of the formal discussion below rigorous; but the situation in infinite dimensions is substantially more delicate. (Indeed, it is a notorious open problem whether the Euler equations for incompressible fluids even forms a global continuous flow in a reasonable topology in the first place!) However, I do not want to discuss these analytic issues here; see this paper of Ebin and Marsden for a treatment of these topics.
— 1. Geodesic flow using a right-invariant metric —
Let {G} be a Lie group. From a physical perspective, one should think of a group element {g} as describing the relationship between a fixed reference observer {O}, and a moving object {A = gO}; mathematically, one could think of {A} and {O} as belonging to a (left) torsor {M} of {G}. (One could also work with right torsors; this would require a number of sign conventions below to be altered.) For instance, in the case of rigid motions, {O} would be the reference state of a rigid body, {A} would be the current state, and {g = A/O} would be the element of the rigid motion group {SE(3) = {\bf R}^3 \rtimes SO(3)} that moves {O} to {A}; {M} is then the configuration space of the rigid body. Similarly, in the case of incompressible fluids, {O} would be a reference state of the fluid (e.g. the initial state at time {t=0}), {A} would be the current state, and {g \in SDiff({\bf R}^3)} would be the measure-preserving diffeomorphism required to map the location each particle of the fluid at {O} to the corresponding location of the same particle at {A}. Again, {M} would be the configuration space of the fluid.
Once one fixes the reference observer {O}, one can set up a bijection between the torsor {M} and the group {G}; but one can also adopt a “coordinate-free” perspective in which the observer {O} is not present, in which case one should keep {M} and {G} distinct. Strictly speaking, the geodesic flow we will introduce will be on {M} rather than on {G}, but for some minor notational reasons it is convenient to fix a reference observer {O} in order to identify the two objects.
Let {{\mathfrak g} := T_{id} G} denote the Lie algebra of {G}, i.e. the tangent space of {G} at the identity. This Lie algebra can be identified with the tangent space {T_A M} of a state {A} in {M} in two different ways: an intrinsic one that does not use a reference observer {O}, and an extrinsic one which does rely on this observer.
- (Intrinsic identification) If {V \in T_A M} is a tangent vector to {A}, we let {V/A \in {\mathfrak g}} be the associated element of the Lie algebra defined infinitesimally as
\displaystyle A + \epsilon V = (1 + \epsilon V/A) A
modulo higher order terms for infinitesimal {\epsilon}, or more traditionally by requiring {\gamma'(0)/\gamma(0) = g'(0)} whenever {\gamma: {\bf R} \rightarrow M}, {g: {\bf R} \rightarrow G} are smooth curves such that {\gamma(t) = g(t) \gamma(0)}. Conversely, if {X \in {\mathfrak g}}, we let {XA \in T_A M} be the tangent vector at {A} defined infinitesimally as
\displaystyle A + \epsilon X A = (1 + \epsilon X) A
modulo higher order terms for infinitesimal {\epsilon}, or more traditionally by requiring {\gamma'(0) = g'(0) \gamma(0)} whenever {\gamma: {\bf R} \rightarrow M}, {g: {\bf R} \rightarrow G} are smooth curves such that {\gamma(t) = g(t) \gamma(0)} (so {g(0) = id}). Clearly, these two operations invert each other.
- (Extrinsic identification) If {V \in T_A M} is a tangent vector to {A}, and {A=gO} for some fixed reference {O}, we let {g^{-1} V/O \in {\mathfrak g}} be the element of the Lie algebra defined infinitesimally as
\displaystyle A + \epsilon V = g( (1 + \epsilon g^{-1} V/O) O )
or more traditionally by requiring {g^{-1} \gamma'(0)/O = h'(0)} whenever {\gamma: {\bf R} \rightarrow M}, {h: {\bf R} \rightarrow G} are such that {\gamma(t) = g h(t) O} and {h(0)=id}.
The distinction between intrinsic and extrinsic identifications is closely related to the distinction between active and passive transformations: {V/A} denotes the direction in which {A} must move in order to effect a change of {V} in the apparent position of {A} relative to any observer {O}, whereas {g^{-1} V/O} is the (inverse of the) direction in which the reference {O} would move to effect the same change in the apparent position. The two quantities are related to each other by conjugation:
\displaystyle g^{-1} V/O = g^{-1} ( V/A ) g; \quad V/A = g (g^{-1} V/O) g^{-1}
where we define conjugation {X \mapsto g X g^{-1}} of a Lie algebra element {X} by a Lie group element {g} in the usual manner.
If {A(t) \in M} is the state of a rigid body at time {t}, then {A'(t)/A(t)} is the linear and angular velocity of {A(t)} as measured in {A(t)}‘s current spatial reference frame, while if {A(t) = g(t) O}, then {g(t)^{-1} A'(t) / O} is the linear and angular velocity of {A(t)} as measured in the frame of {O}. Similarly, if {A(t) \in M} is the state of an incompressible fluid at time {t}, then {A'(t)/A(t)} is the velocity field {u(t)} in Eulerian coordinates, while {g(t)^{-1} A'(t) / O} is the velocity field {u \circ g(t)} in Lagrangian coordinates.
The left action of {G} {g: A \mapsto gA} on the torsor {M} induces a corresponding action {g: V \rightarrow gV} on the tangent bundle {TM}. Indeed, this action was implicitly present in the notation {g^{-1} V/O} used earlier.
Now suppose we choose a non-degenerate inner product {\langle,\rangle_{{\mathfrak g}}} on the Lie algebra {{\mathfrak g}}. We do not assume any symmetries or invariances of this inner product with respect to the group structure, such as conjugation invariance; in particular, this inner product will usually not be the Cartan-Killing form. At any rate, once we select an inner product, we can construct a right-invariant Riemannian metric {{\rm g}} on {M} by the formula
\displaystyle \langle V, W \rangle_{{\rm g}(A)} := \langle V/A, W/A \rangle_{{\mathfrak g}}. \ \ \ \ \ (1)
Because we do not require the inner product to be conjugation invariant, this metric will usually not be bi-invariant, instead being merely right-invariant.
The quantity {H(V) := \frac{1}{2} \langle V, V \rangle_{{\rm g}(A)}} is the Hamiltonian associated to this metric. For rigid bodies, this Hamiltonian is the total kinetic energy of the body, which is the sum of the kinetic energy {\frac{1}{2} m |v|^2} of the centre of mass, plus the rotational kinetic energy {\frac{1}{2} I( \omega, \omega)} which is determined by the moments of inertia {I}. For incompressible fluids, the Hamiltonian is (up to a normalising constant) the energy {\frac{1}{2} \int_{{\bf R}^3} |u|^2 = \frac{1}{2} \int_{{\bf R}^3} |u \circ A|^2} of the fluid, which can be computed either in Eulerian coordinates or in Lagrangian coordinates (there are no Jacobian factors here thanks to incompressibility).
Another important object in the Euler-Arnold formalism is the bilinear form {B: {\mathfrak g} \times {\mathfrak g} \rightarrow {\mathfrak g}} associated to the inner product {\langle, \rangle}, defined via the Lie bracket and duality using the formula
\displaystyle \langle [X,Y], Z \rangle = \langle B(Z,Y), X \rangle, \ \ \ \ \ (2)
thus {B} is a partial adjoint of the Lie bracket operator. (The conventions here differ slightly from those in Arnold’s paper.) Note that this form need not be symmetric. The importance of this form comes from the fact that it describes the geodesic flow:
Theorem 1 (Euler-Arnold equation) Let {\gamma: {\bf R} \rightarrow M} be a geodesic flow on {M} using the right-invariant metric {{\rm g}} defined above, and let {X(t) := \gamma'(t) / \gamma(t) \in {\mathfrak g}} be the intrinsic velocity vector. Then {X} obeys the equation
\displaystyle \frac{d}{dt} X(t) = B(X(t), X(t)). \ \ \ \ \ (3)
The Euler-Arnold equation is also known as the Euler-Poincaré equation; see for instance this paper of Cendra, Marsden, Pekarsky, and Ratiu for further discussion.
Proof: For notational reasons, we will prove this in the model case when {G} is a matrix group (so that we can place {G}, {M}, and {{\mathfrak g}} in a common vector space, or more precisely a common matrix space); the general case is similar but requires more abstract notation. We consider a variation {\gamma(s,t)} of the original curve {\gamma(t)=\gamma(s,t)}, and consider the first variation of the energy
\displaystyle \partial_s \frac{1}{2} \int_a^b \langle \partial_t \gamma(s,t), \partial_t \gamma(s,t) \rangle_{{\rm g}(\gamma(s,t))}\ dt
which we write using (1) as
\displaystyle \partial_s \frac{1}{2} \int_a^b \langle \gamma_t \gamma^{-1}, \gamma_t \gamma^{-1} \rangle\ dt.
We move the derivative inside and use symmetry to write this as
\displaystyle \int_a^b \langle \partial_s(\gamma_t \gamma^{-1}), \gamma_t \gamma^{-1} \rangle\ dt
or
\displaystyle \int_a^b \langle \partial_s(\gamma_t \gamma^{-1}), X \rangle\ dt
We expand
\displaystyle \partial_s(\gamma_t \gamma^{-1}) = \gamma_{ts} \gamma^{-1} - \gamma_t \gamma^{-1} \gamma_s \gamma^{-1}.
Similarly
\displaystyle \partial_t(\gamma_s \gamma^{-1}) = \gamma_{ts} \gamma^{-1} - \gamma_s \gamma^{-1} \gamma_t \gamma^{-1}
and thus
\displaystyle \partial_s(\gamma_t \gamma^{-1}) = \partial_t(\gamma_s \gamma^{-1}) + [\gamma_s \gamma^{-1}, X ].
Inserting this into the first variation and integrating by parts, we obtain
\displaystyle \int_a^b \langle [\gamma_s \gamma^{-1}, X], X \rangle - \langle \gamma_s \gamma^{-1}, \partial_t X \rangle\ dt;
using (2), this is
\displaystyle \int_a^b \langle \gamma_s \gamma^{-1}, B(X,X) \rangle - \langle \gamma_s \gamma^{-1}, \partial_t X \rangle\ dt
and so the first variation vanishes for arbitrary choices of perturbation {\gamma_s} precisely when {\partial_t X = B(X,X)}, as required. \Box
It is instructive to verify that the Hamiltonian {H = \frac{1}{2} \langle X, X \rangle} is preserved by this equation, as it should be. In the case of rigid motions, (3) is essentially Euler’s equations of motion.
The right-invariance of the Riemannian manifold implies that the geodesic flow is similarly right-invariant. And this is reflected by the fact that the Euler-Arnold equation (3) does not involve the position {\gamma(t)}. This position of course evolves by the equation
\displaystyle \frac{d}{dt} \gamma = X \gamma \ \ \ \ \ (4)
which is just the definition of {X}.
Note that while the velocity {X} influences the evolution of the position {\gamma}, the position {\gamma} does not influence the evolution of the velocity {X}. This is of course a manifestation of the right-invariance of the problem. This reduction of the flow is known as Euler-Poincaré reduction, and is essentially a basic example of both Lagrangian reduction and symplectic reduction, in which the symmetries of a Lagrangian or Hamiltonian evolution are used to reduce the dimension of the dynamics while preserving the Lagrangian or Hamiltonian structure.
We can rephrase the Euler equation in a Lax pair formulation by introducing the Cartan-Killing form
\displaystyle (X, Y) := \hbox{tr}( \hbox{ad}(X) \hbox{ad}(Y) ).
Like {\langle, \rangle}, the Cartan-Killing form {(,)} is a symmetric bilinear form on the Lie algebra {{\mathfrak g}}. If we assume that the group {G} is semisimple (and finite-dimensional), then this form will be non-degenerate. It obeys the identity
\displaystyle ([X,Y],Z) = -(Y,[X,Z]), \ \ \ \ \ (5)
thus adjoints are automatically skew-adjoint in the Cartan-Killing form.
If the Cartan-Killing form is non-degenerate, it can be used to express the inner product {\langle, \rangle} via a formula of the form
\displaystyle \langle X, Y \rangle := ( X, \Lambda^{-1} Y ) \ \ \ \ \ (6)
where {\Lambda: {\mathfrak g} \rightarrow {\mathfrak g}} is an invertible linear transformation which is self-adjoint with respect to both {\langle,\rangle} and {(,)}. We then define the intrinsic momentum {{\bf M}(t)} of the Euler-Arnold flow {\gamma(t)} by the formula
\displaystyle {\bf M}(t) := \Lambda^{-1} X(t).
From (3), we see that {{\bf M}} evolves by the equation
\displaystyle \frac{d}{dt} {\bf M} := \Lambda^{-1} B( \Lambda {\bf M}, \Lambda {\bf M} ).
But observe from (6), (2), (6), (5) that
\displaystyle ( \Lambda^{-1} B( \Lambda {\bf M}, \Lambda {\bf M} ), Y ) = \langle B( \Lambda {\bf M}, \Lambda {\bf M} ), Y \rangle
\displaystyle = \langle [Y, \Lambda {\bf M}], \Lambda {\bf M} \rangle
\displaystyle = ([Y,\Lambda {\bf M}], {\bf M} )
\displaystyle = ([\Lambda {\bf M},{\bf M}],Y)
for any test vector {Y \in {\mathfrak g}}, which by nondegeneracy implies that
\displaystyle \Lambda^{-1} B( \Lambda {\bf M}, \Lambda {\bf M} ) = [\Lambda {\bf M},{\bf M}]
leading to the Lax pair form
\displaystyle \frac{d}{dt} {\bf M} = [\Lambda {\bf M},{\bf M}]
of the Euler-Arnold equation, known as the Lie-Poisson equation. (The sign conventions here are the opposite of those in Arnold’s paper, I think ultimately because I am assuming right-invariance instead of left-invariance.) In particular, the spectrum of {{\bf M}} is invariant, or equivalently {{\bf M}} evolves along a single coadjoint orbit in {{\mathfrak g} \equiv {\mathfrak g}^*}.
By Noether’s theorem, the right-invariance of the geodesic flow should create a conserved quantity (or moment map); as the right-invariance is an action of the group {G}, the conserved quantity should take place in the adjoint {{\mathfrak g}^*}. If we write {\gamma(t) = g(t) O} for some fixed observer {O}, then this conserved quantity can be computed as the extrinsic momentum
\displaystyle P: Y \mapsto \langle X, g Y g^{-1} \rangle, \ \ \ \ \ (7)
thus {\omega} is the {1}-form associated to {X}, pulled back to extrinsic coordinates. Indeed, from (4) one has
\displaystyle \partial_t g = X g
and thus
\displaystyle \partial_t g^{-1} = - g^{-1} X
and hence for any test vector {Y}
\displaystyle \partial_t P(Y) = \langle X_t, g Y g^{-1} \rangle + \langle X, [X, gYg^{-1}] \rangle
\displaystyle = \langle B(X,X), g Y g^{-1}\rangle - \langle B(X,X), g Y g^{-1}\rangle
\displaystyle = 0
thanks to (2), and the claim follows. Using the Cartan-Killing form, the extrinsic momentum can also be identified with {g^{-1} {\bf M} g}, thus linking the extrinsic and intrinsic momenta to each other.
— 2. Incompressible fluids —
Now consider an incompressible fluid in {{\bf R}^3}, whose initial state is {O} and whose state at any time {t} is given as {\gamma(t)}. One can express {\gamma(t) = g(t) O}, where {g(t) \in \hbox{Sdiff}({\bf R}^3)} is the diffeomorphism from {{\bf R}^3} to itself that maps the location of each particle at {O} to the location of the same particle at {\gamma(t)}. As the fluid is assumed incompressible, the diffeomorphism must be measure-preserving (and orientation preserving); we denote the group of such special diffeomorphisms as {\hbox{Sdiff}({\bf R}^3)}.
The Lie algebra to the group {\hbox{Diff}({\bf R}^3)} of all diffeomorphisms, is the space of all (smooth) vector fields {X: {\bf R}^3 \rightarrow {\bf R}^3}. The Lie algebra of the subgroup {\hbox{Sdiff}({\bf R}^3)} of measure-preserving diffeomorphisms is the space of all divergence-free vector fields; indeed, this is one of the primary motivations of introducing the concept of divergence of a vector field. We give both Lie algebras the usual {L^2} inner product:
\displaystyle \langle u, v\rangle := \int_{{\bf R}^3} u \cdot v.
The Lie bracket on {\hbox{Sdiff}({\bf R}^3)} or {\hbox{Diff}({\bf R}^3)} is the same as the usual Lie bracket of vector fields.
Let {u(t) := \gamma'(t)/\gamma(t) = g' \circ g^{-1}} be the intrinsic velocity vector; then this is a divergence-free vector field, which physically represents the velocity field in Eulerian coordinates. The extrinsic velocity vector {g(t)^{-1} u(t) g(t) = u \circ g(t)} is then the velocity field in Lagrangian coordinates; it is also divergence-free.
If there were no constraint of incompressibility (i.e. if one were working in {\hbox{Diff}({\bf R}^3)} rather than {\hbox{Sdiff}({\bf R}^3)}), then the metric is flat, and the geodesic equation of motion is simply given by Newton’s first law
\displaystyle \frac{d^2}{dt^2} g(t) = 0
or in terms of the intrinsic velocity field {u},
\displaystyle \partial_t u(t) + (u \cdot \nabla) u = 0.
Once we restrict to incompressible fluids, this becomes
\displaystyle \frac{d^2}{dt^2} g(t) \perp T_{g(t)} \hbox{Sdiff}({\bf R}^3)
or, in terms of the intrinsic velocity field,
\displaystyle \partial_t u(t) + (u \cdot \nabla) u \perp \hbox{ divergence free fields}
or equivalently (by Hodge theory)
\displaystyle \partial_t u(t) + (u \cdot \nabla) u = \nabla p
for some {p}; this is precisely the Euler equations of incompressible fluids. This equation can also be deduced from (3), after first calculating using (2) and the formula for Lie bracket of vector fields that {B(X,Y)} is the divergence-free component of {X \neg dY}; we omit the details (which are in Arnold’s paper).
Let us now compute the extrinsic momentum {P}, which is conserved by the Euler equations. Given any divergence-free vector field {v} (in Lagrangian coordinates), we see from (7) that {P} is given by the formula
\displaystyle P(v) := \int_{{\bf R}^3} u \cdot (g_* v),
thus the form {P(v)} is computed by pushing {v} over to Eulerian coordinates to get {g_* v := (Dg \circ g^{-1}) (v \circ g^{-1})} and then taking the inner product with {u}. Let us check that this is indeed conserved. Since
\displaystyle u_t = - (u \cdot \nabla) u + \nabla p
and
\displaystyle \partial_t (g_* v) = - {\mathcal L}_u (g_* v),
where {{\mathcal L}} denotes the Lie derivative along the vector field {u}, we see that
\displaystyle \partial_t P(v) = \int_{{\bf R}^3} (- (u \cdot \nabla) u + \nabla p) \cdot w - u \cdot {\mathcal L}_u w,
where {w := g_* v} is {v} in Eulerian coordinates. The {\nabla p} term vanishes by integration by parts, since {v} (and hence {w}) is divergence-free. The Lie derivative is computed by the formula
\displaystyle {\mathcal L}_u w = (u \cdot \nabla) w - (w \cdot \nabla) u.
As {u \cdot (w \cdot \nabla) u = \frac{1}{2} (w \cdot \nabla) |u|^2} is a total derivative (recall here that {w} is divergence-free), this term vanishes. The other two terms combine to form a total derivative {-(u \cdot \nabla)(u \cdot w)}, which also vanishes, and so the claim follows.
The external momentum is closely related to the vorticity {\omega := \hbox{curl} u}. This is because a divergence-free vector field {v} can (in principle, at least) be written as the divergence {v = \hbox{div} \alpha} of a {2}-vector field {\alpha}. As divergence is diffeomorphism invariant, it commutes with pushforward:
\displaystyle g_*( \hbox{div} \alpha ) = \hbox{div} (g_* \alpha)
and thus
\displaystyle P(\hbox{div} \alpha) = \int_{{\bf R}^3} u \cdot \hbox{div}( g_* \alpha )
\displaystyle = \int_{{\bf R}^3} \omega \cdot g_* \alpha
\displaystyle = \int_{{\bf R}^3} (*\omega) \wedge g_* \alpha
where {*} is the Hodge star. We can pull this back to Lagrangian coordinates to obtain
\displaystyle P(\hbox{div} \alpha) = \int_{{\bf R}^3} g^{-1}_*(*\omega) \wedge \alpha.
As {\alpha} was an arbitrary {2}-form, we thus see that the pullback {g^{-1}_*(*\omega)} of the Hodge star of the vorticity in Lagrangian coordinates is preserved by the flow, or equivalently that {*\omega} is transported by the velocity field {u}. In the two-dimensional case, this is well known ({*\omega} is a scalar in this case); in higher dimensions, this is fact is implicit in the vorticity equation
\displaystyle \partial_t \omega_{ij} + u_k \partial_k \omega_{ij} + \omega_{ik} \partial_k u_j = 0
which can be rewritten as
\displaystyle \partial_t(*\omega) + {\mathcal L}_u (*\omega) = 0.
In principle, the Euler-Arnold formalism allows one to write the Euler equations for incompressible fluids into a Lax pair form. To properly carry this out by the machinery above, though, would require calculating the Cartan-Killing form for the infinite-dimensional Lie group {\hbox{Sdiff}({\bf R}^3)}, which looked quite tricky to me, and I was not able to complete the calculation. However, a Lax pair formulation for this system was found by Friedlander and Vishik, and it is likely that that formulation is essentially equivalent to the Lax pair that one could construct from the Euler-Arnold formalism. In the simpler two-dimensional case, it was observed by Li that the vorticity equation can also be recast into a slightly different Lax pair form. While this formalism does allow for some of the inverse scattering machinery to be brought to bear on the initial value problem for the Euler equations, it does not as yet seem that this machinery can be successfully used for the global regularity problem.
It would, of course, also be very interesting to see what aspects of this formalism carry over to the Navier-Stokes equation. The first naive guess would be to add a friction term, but this seems to basically correspond to adding a damping factor of {-cu} (rather than a viscosity factor of {\nu \Delta u}) to the Euler equations and ends up being rather uninteresting (it basically slows down the time variable but otherwise does not affect the dynamics). More generally, it would be of interest to see how the Hamiltonian formalism can be generalised to incorporate dissipation or viscosity.
{\emph Update}, June 15: Some references added. Thanks to Jerry Marsden for comments.
32 comments
Comments feed for this article
8 June, 2010 at 12:45 am
Igor Carron
Terry,
Typo in LaTeX ?
“..we let [Formula does not parse] be the tangent vector at A defined infinitesimally…”
Igor.
[This seemed to be a transient problem with WordPress’s LaTeX image rendering server; it disappeared by the time I got around to looking at it. -T.]
8 June, 2010 at 6:37 am
David Corfield
Extra ‘to’ in “…a system subject to only to a physical constraint…”.
Section 3 of Hydrodynamics and Infinite-Dimensional Riemannian Geometry gives some insight as to the KdV equation as geodesic flow, but not so straightforwardly as the free motion of a system subject only to a physical constraint.
[Corrected, and thanks for the link! – T.]
8 June, 2010 at 6:48 pm
Christian
There is some interesting recent work by Holm on co-workers on dissipative systems which are still co-adjoint, see for example http://rspa.royalsocietypublishing.org/content/early/2010/04/21/rspa.2010.0043.abstract. They consider kinetic theory, and hence $Diff_{can},ドル but the approach using double Poisson brackets is also applicable to fluid systems, see the earlier article by Bloch et al; http://infoscience.epfl.ch/record/129486.
9 June, 2010 at 3:38 am
HB
Minor typo (second equation):
“… the length is then given in terms of the inner product by the formula”
|v|^2:=
^2 is missing.
[Corrected, thanks – T.]
9 June, 2010 at 5:44 am
John Sidles
Thank you for this fine post, and especially for its tribute to Vladimir Arnold. I am hoping that someday you may write a post that discusses the extension of these geometric/dynamical ideas to the quantum domain, where many of them carry-over without change … and some of them become simpler. Arnold discusses these quantum extensions in (too brief!) Appendix 3 of his Mathematical Methods of Classical Mechanics, but nowadays we appreciate that there is much more to be said.
Nowadays, in quantum systems engineering, the natural pullback is not from (say) classical Cartesian space onto the manifold of rigid body rotations and translations, but from Hilbert space onto the manifold of tensor network states. Just as dissipation and viscosity play central roles in classical systems, Lindbladian dynamics plays a key role in quantum dynamics. And in all of the preceding, the natural pullback of dynamics onto reduced-dimension manifolds is central to achieving a satisfactory mathematical, physical, and engineering understanding.
One challenge that the mathematical, physical science, and engineering communities are all facing is that (typically) our students do not acquire (as undergraduates at least) a mathematically natural toolset for appreciating these ubiquitous “Arnoldian” ideas.
Learning these ideas at the graduate level is not particularly satisfactory either, in that specialization sets-in to a degree that inhibits graduate students in math, physics, chemistry, biology, mechatronics, materials science (etc.) from appreciating that they are all embracing the same “Arnoldian” ideas, but speaking very different languages. Too often, moreover, these languages are not mathematically natural in the sense that Arnold so scrupulously respect.
It is easy to write a long wish-list of “Arnoldian” topics that mathematicians might beneficially explain in ways that are accessible to engineers and scientists. To name two, I wish someone would explain Ran Raz’ STOC2010 presentation Tensor-Rank and Lower Bounds for Arithmetic Formulas, which appears to lie at the boundary of complexity theory, algebraic geometry, and quantum simulation theory … but I have to confess to being (so far) unable to decode Raz’ article itself. And I wish someone would tackle the “Arnoldian” analysis of Lindblad processes. The standard textbooks analyze these processes using vector-space proof technology, but we engineers use these processes in an “Arnoldian” pullback context (for which much of the vector-space toolset is inapplicable); this leaves us engineers with sadly little geometric and/or informatic insight into our own simulation codes.
That being said, I am *very* grateful for the existence of this fine weblog—and for many other outstanding math weblogs too—which so greatly encourage us engineers and scientists, by allowing us to glimpse the many wonderful “Arnoldian” opportunities, both creative and practical, that are open to us all.
9 June, 2010 at 7:35 am
Peter
It looks like there’s a typo in the definition of the Christoffel symbol – the first two terms in the parenthesis are the same.
[Corrected, thanks – T.]
9 June, 2010 at 11:36 am
Michael Nielsen
A connection I’m very fond of is between quantum computing and minimal geodesic paths on right-invariant manifolds. It can be shown that the minimal quantum circuit size for solving a particular problem is equal (modulo polynomial factors and some technical caveats) to the minimal geodesic length between two points on a particular right-invariant manifold.
This seems to me a potentially very useful observation for understanding minimal circuit sizes. Pretty much everything we teach students about minimizing functions relies on working in a smooth space, where the tools of calculus can be used. Unfortunately, the way we ordinarily think about circuits (quantum and non-quantum) that way of thinking can’t be applied to understand minimal circuit size. But by relating circuit size to geometry it becomes possible to use the tools of calculus to think about minimal circuit complexity. Roughly, there’s a correspondence optimal quantum circuit to solve a particular problem and some solution of the Euler-Arnold equation. This seems like a potentially powerful observation, although I was never able to get bounds on circuit size this way. I wrote up some of these ideas in http://arxiv.org/abs/quant-ph/0701004 (joint with Mark Dowling), if anyone’s interested.
23 June, 2010 at 8:09 pm
Chirag Lakhani
Dear Michael,
I know I have come to this post late, but I was wondering if you think there is any advantage if formulating your ideas in terms of this symplectic geometry setting rather than Riemannian geometry. Do things like Euler-Poison reduction work out when it comes to the metric you use for SU(2^n)?
24 June, 2010 at 6:57 am
Michael Nielsen
I didn’t consider this at all – it certainly seems an interesting question. My background in symplectic geometry is minimal.
10 June, 2010 at 8:23 am
Anosh Joseph
Great post, Terry. Here is a recent review on this in a recent arXiv paper :
http://arxiv.org/abs/0906.0184
10 June, 2010 at 11:38 am
John Sidles
Anosh, thank you for that arXiv reference … which was new to me and fun. Perhaps you might similarly enjoy astronaut Michael Foale’s keynote address “Navigating on Mir” (which was published in Mathematica Journal) in which the negative sectional curvature of rigid-body state-space generates high adventure in low Earth orbit.
The fraction of the Earth’s global computing power that is devoted to numerically integrating rigid-body rotational dynamics is surprising large … plausibly as large as 10^(-4)—10(^-3) … for the reason that tumbling water molecules are ubiquitously present in molecular dynamical simulations.
Moreover, for reasons of numerical efficiency, water-molecule dynamical trajectories typically are integrated on quaternionic state-spaces; thus yet another of this blog’s perennial mathematical themes appears naturally … gauge invariance!
16 June, 2010 at 7:08 am
Vladimir Arnold (1937-2010). R.I.P. « Successful Researcher
[…] Vladimir Arnold (1937-2010). R.I.P. One of the world’s greatest and most influential mathematicians, Vladimir Arnold, passed away today in Paris, France. He made major contributions, inter alia, to the fields of dynamical systems (the famous KAM (Kolmogorov-Arnold-Moser) theory) and topology (he is one of the founding fathers of symplectic topology). I hope that the prominent mathematicians like Terence Tao or Timothy Gowers will say more on their blogs about Arnold’s scientific legacy as they did for Israel Gelfand (update: Terence Tao has a recent post on the Euler–Arnold equation). […]
16 June, 2010 at 9:52 am
Artem
When in the beginning you give geodesic flow in coordinates by Hamilton’s equations you say: «In terms of the velocity…» and write the formula with g that is typeseted not in Roman font — possible typo.
[Corrected, thanks – T.]
29 September, 2010 at 10:40 pm
Daniel Geisler
For years I’ve come across meager references that there was another mathematical formalism besides PDE’s that was appropriate for modeling physics, that of continuously iterated functions. But Aldrovandi and Freitas’ article Continuous iteration of dynamical maps is the only relevant work I’ve been able to find. Could you please tell me if Arnold’s 1966 paper is the source or is relevant to using continuously iterated functions to model physics?
Dynamics
In 1986 I had a couple of conversations with Stephen Wolfram regarding dynamical systems. Wolfram was interested in my work in defining tetration for complex numbers. Tetration is iterated exponentiation and the standard notation is ^1 a=a, ^2 a=a^a, ^3a=a^{a^a} , and so on. Wolfram pointed out that the problem of extending tetration to the complex numbers was actually part of the much larger and more important problem of unifying the discreet representation of chaotic systems in mathematics with the continuous representation of chaotic systems in physics, of unifying maps from iterated functions and flows from PDEs. He maintained that the duality prevented the derivation of mathematical solutions for continuous chaotic systems as are found in physics. My understanding of Wolfram’s position is that he was interested in the possibility that there might be a principle at play even deeper than his Principle of Equivalence and that it might be possible to formulate a single mathematical approach to dynamics encompassing iterated functions, cellular automata, PDEs, and recurrence equations. Wolfram suggested at if tetration could be defined for complex numbers then those results might be generalized to unify discrete maps and continuous flows.
Bell Polynomials
Let f(x) and g(x) be differentiable functions, then the Bell polynomials can be constructed using Faa Di Bruno’s formula.
D^nf(g(x))=\sum_{\pi(n)} \frac{n!}{k_1! \cdots k_n!} (D^kf)(g(x)) \left(\frac{Dg(x)}{1!}\right)^{k_1} \cdots \left(\frac{D^ng(x)}{n!}\right)^{k_n}
A partition of n is \pi(n), usually denoted by 1^{k_1}2^{k_2}\cdots n^{k_n} with k_1+2k_2+ \cdots nk_n=k; where k_i is the number of parts of size i. The partition function p(n) is a decategorized version of \pi(n), the function \pi(n) enumerates the integer partitions of n, while p(n) is the cardinality of the enumeration of \pi(n).
Setting g(x) = f^{t-1}(x) results in
D^n f^t(x)=\sum_{\pi(n)} \frac{n!}{k_1! \cdots k_n!} (D^k f)(f^{t-1}(x)) \left(\frac{Df^{t-1}(x)}{1!}\right)^{k_1} \cdots \left(\frac{D^n f^{t-1}(x)}{n!}\right)^{k_n}
The Taylors series of f^t(x) is derived by evaluating the derivatives of the iterated function at a fixed point f^t(0) by setting x=0 and separating out the k_n term of the summation that is dependent on D^n f^{t-1}(0).
D^n f^t(0)=\sum \frac{n!(D^k f)(0)}{k_1! \cdots k_{n-1}!} \left(\frac{Df^{t-1}(0)}{1!}\right)^{k_1} \cdots \left(\frac{D^n f^{t-1}(0)}{(n-1)!}\right)^{k_{n-1}} + (D f)(0) D^n f^{t-1}(0)
and rewriting (D^k f)(0) as f_k where f_1 is the Lyapunov multiplier.
D^n f^t(0)=\sum \frac{n! f_k}{k_1! \cdots k_{n-1}!} \left(\frac{Df^{t-1}(0)}{1!}\right)^{k_1} \cdots\left(\frac{D^{n-1} f^{t-1}(0)}{(n-1)!}\right)^{k_{n-1}} + f_1 D^n f^{t-1}(0)
The remaining p(n)-1 terms of the summation are only dependent on D^k f^{t-1}(0), where 0<k<n.
Flows
Putting the pieces together gives the Taylors series for a dynamical system. Still assuming a fixed point at x=0 such that f(0)=0 results in,
f^t(x)=f_1^t x+\frac{1}{2!}(f_2 \sum_{k_1=0}^{t-1}f_1^{2 t-k_1-2})x^2+\frac{1}{3!}(f_3 \sum_{k_1=0}^{t-1}f_1^{3 t-2 k_1-3}+3 f_2^2 \sum_{k_1=0}^{t-1}\sum_{k_2=0}^{t-k_1-2}f_1^{3 t-2k_1-k_2-5})x^3 + \cdots
As a check, a great deal is known about dynamical systems, so if there is a fundamental flaw in the derivation of Taylors series for f^t(x), then it should appear as it is scrutinized in light of the theorems of dynamics. Note the presence of nested summations of f_1, the Lyapunov multiplier. They occur in the quadratic and all higher terms as the only time dependent components. This demonstrates the dominance of the Lyapunov multiplier in the global dynamics of f^t(x), an important theorem in dynamics.
Consider that the nested summations are unevaluated. Simplifying the sums of geometric progressions depends on whether the multiplier is a root of unity and then which root. For the iteration of complex functions, f: \mathbb{C} \rightarrow \mathbb{C}, the classification of types of solutions for simplifying the sums of geometric progressions is consistent with the classification of fixed points, issues of topological conjugacy and the use of Schroeder’s and Abel’s functional equations.
A final reason for studying the nested summations in unevaluated form is it leaves the underlying combinatorial structure of f^t(x) exposed. My unpublished paper Bell Polynomials of Iterated Functions covers much of the proceeding material in greater detail as well as exploring the combinatorial structure of f^t(x).
17 October, 2010 at 7:57 pm
Tetration Blog · The Euler-Arnold equation
[…] The following was posted as a question to Terry Tao’s posting on The Euler-Arnold equation. […]
28 December, 2010 at 11:43 pm
The Euler-Arnold Equation « Sergeysk's Blog
[…] nice post by wonderful Terry Tao about the Arnold formalism. Sums it up in a concise way, where everything in […]
11 May, 2012 at 10:38 pm
Fluid Flows and Infinite-Dimensional Manifolds (Part 2) « Azimuth
[…] ours, but—as could be expected—assuming a little bit more of a mathematical background: The Euler-Arnold equation. If you are into math, you might like to take a look at […]
25 February, 2014 at 12:08 pm
Conserved quantities for the Euler equations | What's new
[…] fact was also interpreted as conservation of exterior momentum in this previous blog post.) This fact also follows from Kelvin’s circulation theorem, after first applying […]
2 March, 2014 at 11:08 pm
Noether’s theorem, and the conservation laws for the Euler equations | What's new
[…] discussed in this previous post, both the Euler equations for rigid body motion, and the Euler equations for incompressible […]
14 April, 2014 at 7:45 pm
Andrei Ludu
All these are beautiful and a great exercise. However, nature is very far away from these crystal clean mathematical approaches. Euler equation is a very limited situation which rather doesn’t happen in reality. Moreover, there are showers of articles on geometrical aspects of fluids in n-dimensions (R^n manifolds) using concepts like centrifugal force and surface tension. I wonder what is a centrifugal force in more than 3 dimensions? What is the meaning of law of inertia in more than 3 (3+1) dimensions.? There is not knowledge of such a thing. I don’t know of Newton’s laws in more than 3+1 dimensions. But nevertheless, all these are splendid exercises helping the understanding of some phenomena in 3 dimensions.
20 August, 2014 at 7:49 am
Robert Honeybul
Excellent post. Just wondering, should the map Gamma map to the dual of the lie algebra? I’m assuming in this case that Gamma represents The inertia operator
2 March, 2015 at 3:55 am
kosta
Interesting post!
I think there is a typo here:
* (Intrinsic identification) If V \in T_M A
I think should be
* (Intrinsic identification) If V \in T_A M
[Corrected, thanks – T.]
29 June, 2016 at 6:44 am
Finite time blowup for Lagrangian modifications of the three-dimensional Euler equation | What's new
[…] an infinite dimensional Riemannian manifold of volume preserving diffeomorphisms (as discussed in this previous post) extends to the generalised Euler equations (with the operator determining the new Riemannian […]
26 May, 2017 at 7:18 am
Geodesics of left invariant metrics on matrix Lie groups – Part 2 Conservation laws | Open System - Ark's blog
[…] his blog post “The Euler-Arnold equation” Terrence Tao mentions that some people call this equation the “Euler-Arnold” equation, […]
25 July, 2017 at 7:39 pm
On the universality of the incompressible Euler equation on compact manifolds | What's new
[…] flow equation on the infinite-dimensional manifold of volume-preserving diffeomorphisms on (see this previous post for a discussion of this in the flat space […]
14 December, 2017 at 5:04 pm
Embedding the Boussinesq equations in the incompressible Euler equations on a manifold | What's new
[…] flow equation on the infinite-dimensional manifold of volume-preserving diffeomorphisms on (see this previous post for a discussion of this in the flat space […]
17 July, 2018 at 12:36 am
CLASSICAL LAGRANGIAN MECHANICS | zulfahmed
[…] the localized waves. By classical trajectory we will mean the precise definition given above. Terry Tao’s discussion here veers toward the generalization of the Euler-Lagrange equations toward homogeneous descriptions for […]
12 November, 2020 at 8:12 am
Pablo Basteiro
Hi! Thank you very much for this amazing article! I am a little bit late to the party, but I would have a question on the defining equation for the bilinear form B in the Lie algebra. I have encountered many different definitions for this, even in Arnold’s paper and his book on topological methods in hydrodynamics the definition is slightly different. We do not assume this form to be symmetric, but we do assume symmetry of the inner product, correct?
Can we then write = equivalently to =? This simplifies the explicit computation of the B form when using a non-trivial inner-product.
Thanks to anyone that can help me with this little detail!
12 November, 2020 at 8:28 am
Pablo Basteiro
([X,Y],Z)_{inner}=(B(Y,Z),X)_{inner} equivalently to (Z,[X,Y])_{inner}=(X,B(Y,Z)_{inner}
11 May, 2022 at 4:34 pm
Variational form of Euler’s incompressible fluid equations? – GrindSkills
[…] Tao, The Euler-Arnold equation, blogpost […]
14 April, 2024 at 10:38 pm
Vladimir Arnold – featurefreaks
[…] The Euler-Arnold equation […]
12 June, 2025 at 3:50 am
Anonymous
Great post! Today Arnold would have turned 88. He is still influencing many students with his writings.