Posts Tagged ‘Charles Pugh’

The Carnot-Carathéodory metric

March 22, 2011

I remember looking at Gromov’s notes “Carnot-Carathéodory spaces seen from within” a long time ago and didn’t get anywhere. Recently I encountered it again through professor Guth. This time, with his effort in explaining, I did get some ideas. This thing is indeed pretty cool~ So I decided to write an elementary introduction about it here. We will construct a such metric in \mathbb{R}^3.

In general, if we have a Riemannian manifold, the Riemannian distance between two given points p, q is defined as

\inf_\Gamma(p,q) \int_0^1||\gamma'(t)||dt

where \Gamma is the collection of all differentiable curves \gamma connecting the two points.

However, if we have a lower dimensional sub-bundle E(M) of the tangent bundle (depending continuously on the base point). We may attempt to define the metric

d(p,q) = \inf_{\Gamma'} \int_0^1||\gamma'(t)||dt

where \Gamma' is the collection of curves connecting p, q with \gamma'(t) \in E(M) for all t. (i.e. we are only allowed to go along directions in the sub-bundle.

Now if we attempt to do this in \mathbb{R}^3, the first thing we may try is let the sub-bundle be the say, xy-plane at all points. It’s easy to realize that now we are ‘stuck’ in the same height: any two points with different z coordinate will have no curve connecting them (hence the distance is infinite). The resulting metric space is real number many discrete copies of \mathbb{R}^2. Of course that’s no longer homeomorphic to \mathbb{R}^3.

Hence for the metric to be finite, we have to require accessibility of the sub-bundle: Any point is connected to any other point by a curve with derivatives in the E(M).

For the metric to be equivalent to our original Riemannian metric (meaning generate the same topology), we need E(M) to be locally accessible: Any point less than \delta away from the original point p can be connected to p by a curve of length < \varepsilon going along E(M).

At the first glance the existence of a (non-trivial) such metric may not seem obvious. Let’s construct one on \mathbb{R}^3 that generates the same topology:

To start, we first identify our \mathbb{R}^3 with the 3 \times 3 real entry Heisenberg group H^3 (all 3 \times 3 upper triangular matrices with “1”s on the diagonal). i.e. we have homeomorphism

h(x,y,z) \mapsto \left( \begin{array}{ccc}  1 & x & z \\  0 & 1 & y \\  0 & 0 & 1 \end{array} \right)

Let g be a left-invariant metric on H_3.

In the Lie algebra T_e(H_3) (tangent space of the identity element), the elements X = \left( \begin{array}{ccc}  1 & 1 & 0 \\  0 & 1 & 0 \\  0 & 0 & 1 \end{array} \right) , Y = \left( \begin{array}{ccc}  1 & 0 & 0 \\  0 & 1 & 1 \\  0 & 0 & 1 \end{array} \right) and Z = \left( \begin{array}{ccc}  1 & 0 & 1 \\  0 & 1 & 0 \\  0 & 0 & 1 \end{array} \right) form a basis.

At each point, we take the two dimensional sub-bundle E(H_3) of the tangent bundle generated by infinitesimal left translations by X, Y. Since the metric g is left invariant, we are free to restrict the metric to E(M) i.e. we have ||X_p|| = ||Y_p|| = 1 for each p \in M.

The interesting thing about H_3 is that all points are accessible from the origin via curves everywhere tangent to E(H_3). In other words, any points can be obtained by left translating any other point by multiples of elements X and Y.

The “unit grid” in \mathbb{R}^3 under this sub-Riemannian metric looks something like:

Since we have

\left( \begin{array}{ccc}  1 & x & z \\  0 & 1 & y \\  0 & 0 & 1 \end{array} \right) \left( \begin{array}{ccc}  1 & 1 & 0 \\  0 & 1 & 0 \\  0 & 0 & 1 \end{array} \right) = \left( \begin{array}{ccc}  1 & x+1 & z \\  0 & 1 & y \\  0 & 0 & 1 \end{array} \right) ,

the original x-direction stay the same, i.e. a bunch of horizontal lines connecting the original yz planes orthogonally.

However, if we look at a translation by Y, we have

\left( \begin{array}{ccc}  1 & x & z \\  0 & 1 & y \\  0 & 0 & 1 \end{array} \right) \left( \begin{array}{ccc}  1 & 0 & 0 \\  0 & 1 & 1 \\  0 & 0 & 1 \end{array} \right) = \left( \begin{array}{ccc}  1 & x & z+x \\  0 & 1 & y+1 \\  0 & 0 & 1 \end{array} \right)

i.e. a unit length Y-vector not only add a 1 to the y-direction but also adds a height x to z, hence the grid of unit Y vectors in the above three yz planes look like:

We can now try to see the rough shape of balls by only allowing ourselves to go along the unit grid formed by X and Y lines constructed above. This corresponds to accessing all matrices with integer entry by words in X and Y.

The first question to ask is perhaps how to go from (0,0,0) to (0,0,1). –since going along the z axis is disabled. Observe that going through the following loop works:

We conclude that d_C((0,0,0), (0,0,1)) \leq 4 in fact up to a constant going along such loop gives the actual distance.

At this point one might feel that going along z axis in the C-C metric is always takes longer than the ordinary distance. Giving it a bit more thought, we will find this is NOT the case: Imagine what happens if we want to go from (0,0,0) to (0,0,10000)?

One way to do this is to go along X for 100 steps, then along Y for 100 steps (at this point each step in Y will raise 100 in z-coordinate, then Y^{-100} X^{-100}. This gives d_C((0,0,0), (0,0,10000)) \leq 400.

To illustrate, let’s first see the loop from (0,0,0) to (0,0,4):

The loop has length 8. (A lot shorter compare to length 4 for going 1 unit in z-direction)

i.e. for large Z, it’s much more efficient to travel in the C-C metric. d_C( (0,0,0), (0,0,N^2)) = 4N

In fact, we can see the ball of radius N is roughly an rectangle with dimension R \times R \times R^2 (meaning bounded from both inside and outside with a constant factor). Hence the volume of balls grow like R^4.

Balls are very “flat” when they are small and very “long” when they are large.

A survey on ergodicity of Anosov diffeomorphisms

March 7, 2011

This is in part a preparation for my 25-minutes talk in a workshop here at Princeton next week. (Never given a short talk before…I’m super nervous about this >.<) In this little survey post I wish to list some background and historical results which might appear in the talk.

Let me post the (tentative) abstract first:


Title: Volume preserving extensions and ergodicity of Anosov diffeomorphisms

Abstract: Given a C^1 self-diffeomorphism of a compact subset in \mathbb{R}^n, from Whitney’s extension theorem we know exactly when does it C^1 extend to \mathbb{R}^n. How about volume preserving extensions?

It is a classical result that any volume preserving Anosov di ffeomorphism of regularity C^{1+\varepsilon} is ergodic. The question is open for C^1. In 1975 Rufus Bowen constructed an (non-volume-preserving) Anosov map on the 2-torus with an invariant positive measured Cantor set. Various attempts have been made to make the construction volume preserving.

By studying the above extension problem we conclude, in particular the Bowen-type mapping on positive measured Cantor sets can never be volume preservingly extended to the torus. This is joint work with Charles Pugh and Amie Wilkinson.


A diffeomorphism f: M \rightarrow M is said to be Anosov if there is a splitting of the tangent space TM = E^u \oplus E^s that’s invariant under Df, vectors in E^u are uniformly expanding and vectors in E^s are uniformly contracting.

In his thesis, Anosov gave an argument that proves:

Theorem: (Anosov ’67) Any volume preserving Anosov diffeomorphism on compact manifolds with regularity C^2 or higher on is ergodic.

This result is later generalized to Anosov diffeo with regularity C^{1+\varepsilon}. i.e. C^1 with an \varepsilon-holder condition on the derivative.

It is a curious open question whether this is true for maps that’s strictly C^1.

The methods for proving ergodicity for maps with higher regularity, which relies on the stable and unstable foliation being absolutely continuous, certainly does not carry through to the C^1 case:

In 1975, Rufus Bowen gave the first example of an Anosov map that’s only C^1, with non-absolutely continuous stable and unstable foliations. In fact his example is a modification of the classical Smale’s horseshoe on the two-torus, non-volume-preserving but has an invariant Cantor set of positive Lebesgue measure.

A simple observation is that the Bowen map is in fact volume preserving on the Cantor set. Ever since then, it’s been of interest to extend Bowen’s example to the complement of the Cantor set in order to obtain an volume preserving Anosov diffeo that’s not ergodic.

In 1980, Robinson and Young extended the Bowen example to a C^1 Anosov diffeomorphism that preserves a measure that’s absolutely continuous with respect to the Lebesgue measure.

In a recent paper, Artur Avila showed:

Theorem: (Avila ’10) C^\infty volume preserving diffeomorphisms are C^1 dense in C^1 volume preserving diffeomorphisms.

Together with other fact about Anosov diffeomorphisms, this implies the generic C^1 volume preserving diffeomorphism is ergodic. Making the question of whether such example exists even more curious.

In light of this problem, we study the much more elementary question:

Question: Given a compact set K \subseteq \mathbb{R}^2 and a self-map f: K \rightarrow K, when can the map f be extended to an area-preserving C^1 diffeomorphism F: \mathbb{R}^2 \rightarrow \mathbb{R}^2?

Of course, a necessary condition for such extension to exist is that f extends to a C^1 diffeomorphism F (perhaps not volume preserving) and that DF has determent 1 on K. Whitney’s extension theorem gives a necessary and sufficient criteria for this.

Hence the unknown part of our question is just:

Question: Given K \subseteq \mathbb{R}^2, F \in \mbox{Diff}^1(\mathbb{R}^2) s.t. \det(DF_p) = 1 for all p \in K. When is there a G \in \mbox{Diff}^1_\omega(\mathbb{R}^2) with G|_K = F|_K?

There are trivial restrictions on K i.e. if K separates \mathbb{R}^2 and F switches complementary components with different volume, then F|_K can never have volume preserving extension.

A positive result along the line would be the following slight modification of Moser’s theorem:

Theorem: Any C^{r+1} diffeomorphism on S^1 can be extended to a C^r area-preserving diffeomorphism on the unit disc D.

For more details see this pervious post.

Applying methods of generating functions and Whitney’s extension theorem, as in this paper, in fact we can get rid of the loss of one derivative. i.e.

Theorem: (Bonatti, Crovisier, Wilkinson ’08) Any C^1 diffeo on the circle can be extended to a volume-preserving C^1 diffeo on the disc.

With the above theorem, shall we expect the condition of switching complementary components of same volume to be also sufficient?

No. As seen in the pervious post, restricting to the case that F only permute complementary components with the same volume is not enough. In the example, K does not separate the plane, f: K \rightarrow K can be C^1 extended, the extension preserves volume on K, and yet it’s impossible to find an extension preserving the volume on the complement of K.

The problem here is that there are ‘almost enclosed regions’ with different volume that are being switched. One might hope this is true at least for Cantor sets (such as in the Bowen case), however this is still not the case.

Theorem: For any positively measured product Cantor set C = C_1 \times C_2, the Horseshoe map h: C \rightarrow C does not extend to a Holder continuous map preserving area on the torus.

Hence in particular we get that no volume preserving extension of the Bowen map can be possible. (not even Holder continuous)

Recurrence and genericity – a translation from French

January 31, 2011

To commemorate passing the French exam earlier this week (without knowing any French) and also to test this program ‘latex to wordpress‘, I decided to post my French-translation assignment here.

Last year, I went to Paris and heard a French talk by Crovisier. Strangely enough, although I can’t understand a single word he says, just by looking at the slides and pictures, I liked the talk. That’s why when being asked the question ‘so are there any French papers you wanted to look at?’, I immediately came up with this one which the talk was based on.

Here is a translation of selected parts (selected according to my interest) in section 1.2 taken from the paper `Récurrence et Généricité‘ ( Inventiones Mathematicae 158 (2004), 33-104 ) by C. Bonatti and S. Crovisier. In which they proved a connecting lemma for pseudo-orbits.

Interestingly, just in this short section they referred to two results I have discussed in earlier posts of this blog: Conley’s fundamental theorem of dynamical systems and the closing lemma. In any case, I think it’s a cool piece of work to look at! Enjoy~ (Unfortunately, if one wants to see the rest of the paper, one has to read French >.<)

Precise statements of results

1. Statement of the connecting lemma for pseudo-orbits

In all the following work we consider compact manifold {M} equipped with an arbitrary Riemannian metric and sometimes also with a volume form {\omega} (unrelated to the metric). We write {\mbox{Diff}^1(M)} for the set of diffeomorphisms of class {C^1} on {M} with the {C^1} topology and {\mbox{Diff}^1_\omega(M) \subset \mbox{Diff}^1(M)} the subset preserving volume form {\omega}.

Recall that, in any complete metric space, a set is said to be residual if it contains a countable intersection of open and dense sets. A property is said to be generic if it is satisfied on a residual set. By slight abuse of language, we use the term generic diffeomorphisms: the phase ‘generic diffeomorphisms satisfy property P‘ means that property P is generic.

Let f \in \mbox{Diff}^1(M) be a diffeomorphism of M. For all \varepsilon>0, an \varepsilon-pseudo-orbit of f is a sequence (finite or infinite) of points(x_i) such that for all i, d(x_{i+1},f(x_i)) < \varepsilon. We define the following binary relations for pairs of points (x,y) on M:

– For all \varepsilon > 0, we write x \dashv_\varepsilon y if there exists an \varepsilon-pseudo-orbit (x_0, x_1, \cdots, x_k) where x_0 = x and x_k = y for some k \geq 1.

– We write {x \dashv y} if {x \dashv_\varepsilon y} for all {\varepsilon>0}. We sometimes write {x \dashv_f y} to specify the dynamical system in consideration.

– We write {x \prec y} (or {x \prec_f y}) if for all neighborhoods {U, V} of {x} and {y}, respectively, there exists {n \geq 1} such that {f^n(U)} intersects {V}.

Here are a few elementary properties of these relations.

1. The relations {\dashv} and {\dashv_\varepsilon} are, by construction, transitive. The chain recurrent set {\mathcal{R}(f)} is the set of points {x} in {M} such that {x \dashv x}.

2. The relation {x \prec y} is not a-priori transitive. The non-wandering set {\Omega(f)} is the set of points {x} in {M} such that {x \prec x}.

Marie-Claude Arnaud has shown in [Ar] that the relation {\prec} is transitive for generic diffeomorphisms. By using similar methods we show:

Theorem 1: There exists a residual set {\mathcal{G}} in {\mbox{Diff}^1(M)} (or in {\mbox{Diff}^1_\omega(M)}) such that for all diffeomorphisms {f} in {\mathcal{G}} and all pair of points {(x, y)} in {M} we have:

\displaystyle x \dashv_f y \Longleftrightarrow x \prec_f y.

This theorem is a consequence of the following general perturbation result:

Theorem 2: Let {f} be a diffeomorphism on compact manifold {M}, satisfying one of the following two hypotheses:

1. all periodic orbits of {f} are hyperbolic,

2. {M} is a compact surface and all periodic orbits are either hyperbolic or elliptic with irrational rotation number (its derivative has complex eigenvalues, all of modulus {1}, but are not powers of roots of unity).

Let {\mathcal{U}} be a {C^1}-neighborhood of {f} in {\mbox{Diff}^1(M)} (or in {\mbox{Diff}^1_\omega(M)}, if {f} preserves volume form {\omega}). Then for all pairs of points {(x,y)} in {M} such that {x \dashv y}, there exists a diffeomorphism {g} in {\mathcal{U}} and an integer {n>0} such that {g^n(x) = y}.

Remark: In Theorem 2 above, if the diffeomorphism {f} if of class {C^r} with {r \in (\mathbb{N} \backslash \{0\})\cup \{ \infty \}}, then the {C^1}-perturbation {g} can also be chosen in class {C^r}. Indeed the diffeomorphism {g} is obtained thanks to a finite number of {C^1}-perturbations given by the connecting lemma (Theorem 2.1), each of these perturbations is itself of class {C^r}.

Here are a few consequences of these results:

Corollary: There exists a residual set {\mathcal{G}} in {\mbox{Diff}^1(M)} such that for all diffeomorphism {f} in {\mathcal{G}}, the chain recurrent set {\mathcal{R}(f)} coincides with the non-wandering set {\Omega(f)}.

Corollary: Suppose {M} is connected, then there exists a residual set {\mathcal{G}} in {\mbox{Diff}^1(M)} such that if {f \in \mathcal{G}} satisfies {\Omega(f) = M} then it is transitive. Furthermore, {M} is the unique homoclinic class for {f}.

For volume preserving diffeomorphism {f}, the set {\Omega(f)} always coincide with the whole manifold {M}. We therefore find the analogue of this corollary in the conservative case (see section 1.2.4).

2. Dynamical decomposition of generic diffeomorphisms into elementary pieces

Consider the symmetrized relation {\vdash\dashv } of {\dashv} defined by {x \vdash\dashv y} if {x \dashv y} and {y\dashv x}. This relation then induces an equivalence relation on {\mathcal{R}(f)}, where the equivalence classes are called chain recurrence classes.

We say a compact {f}-invariant set {\Lambda} is weakly transitive if for all {x, y \in \Lambda}, we have {x \prec y}. A set {\Lambda} is maximally weakly transitive if it is maximal under the partial order {\subseteq} among the collection of weakly transitive sets.

Since the closure of increasing union of weakly transitive sets is weakly transitive, Zorn’s lemma implies any weakly transitive set is contained in a maximally weakly transitive set. In the case where the relation {\prec_f} is transitive (which is a generic property), the maximally weakly transitive sets are the equivalence classes of the symmetrized relation induced by {\prec} on the set {\Omega(f)}. Hence we obtain, for generic diffeomorphisms:

Corollary: There exists residual set {\mathcal{G}} in {\mbox{Diff}^1(M)} such that for all {f \in \mathcal{G}} the chain recurrence classesare exactly the maximally weakly transitive sets of {f}.

The result of Conley (see posts on fundamental theorem of dynamical systems) on the decomposition of {\mathcal{R}(f)} into chain recurrence classes will therefore apply (for generic diffeomorphisms) to the decomposition of {\Omega(f)} into maximally weakly transitive sets.


3. Chain recurrence classes and periodic orbits

Recall that after the establishment of closing lemma by C. Pugh (see the closing lemma post), it is known that periodic points are dense in {\Omega(f)} for generic diffeomorphisms, we would like to use these periodic orbits to better understand the dynamics of chain recurrence classes.

Recall the homoclinic class {H(p, f)} of a hyperbolic periodic point {p} is the closure of all transversal crossing points of its stable and unstable manifolds. This set is by construction transitive, as we have seen in section 1.2, the results of [CMP] imply that, for generic diffeomorphisms any homoclinic class is maximally weakly transitive. By applying corollary 1.4, we see that:

Remark: For generic diffeomorphisms homoclinic classes are also chain recurrence classes.

However, for generic diffeomorphisms, there are chain recurrence classes which are not homoclinic classes, therefore contains no periodic orbit, we call such chain recurrence class with no periodic points aperiodic class.

Corollary: There exists residual set {\mathcal{G}} in {\mbox{Diff}^1(M)} such that for all {f \in \mathcal{G}}, any connected component with empty interior of {\Omega(f) = \mathcal{R}(f)} is periodic and its orbit is a homoclinic class.

The closing lemma of Pugh and Remark 5 show:

Remark: For generic {f}, any isolated chain recurrence class in {R(f)} is a homoclinic class. In particular this applies to classes that are topological attractors or repellers.

For non-isolated classes, a recent work (see [Cr]) specifies how a chain recurrence class is approximated by periodic orbits:

Theorem: There exists residual set {\mathcal{G}} in {\mbox{Diff}^1(M)} such that for all {f \in \mathcal{G}}, all maximally weakly transitive sets of {f} are Hausdorff limits of sequences of periodic orbits.

More general chain recurrence classes satisfy the upper semi-continuity property: if {(x_i) \subseteq \mathcal{R}(f)} is a sequence of points converging to a point {x} then for large enough {n}, the class of {x_n} is contained in an arbitrary small neighborhood of the class of {x}.

C^1 vs. C^1 volume preserving

January 23, 2011

One of the things I’ve always been interested in is, for a given compact set say in \mathbb{R}^n, what maps defined on the set into \mathbb{R}^n can be extended to a volume preserving map (of certain regularity) on a larger set (for example, some open set containing the original set).

The analogues extension question without requiring the extended map to be volume preserving is answered by the famous Whitney’s extension theorem. It gives a beautiful necessary and sufficient condition on when the map has C^r extension – See this pervious post for more details.

A simple case of this type of question was discussed in my earlier Moser’s theorem post:

Question: Given a diffeomorphism on the circle, when can we extend it to a volume preserving diffeomorphism on the disc?

In the post, we showed that any C^r diffeomorphism on the circle can be extended to a C^{r-1} volume preserving diffeomorphism on the disc. Some time later Amie Wilkinson pointed out to me that, by using generating function methods, in fact one can avoid losing derivative and extend it to a C^r volume preserving.

Anyways, so we know the answer for the circle, what about for sets that looks very different from the circle? Is it true that whenever we can C^r extend the map, we can also so it volume-preserving? (Of course we need to rule out trivial case such as the map is already not volume-preserving on the original set or the map sends, say a larger circle to a smaller circle.)

Question: Is it true that for any compact set K \subseteq \mathbb{R}^n with connected complement, for any function f: K \rightarrow \mathbb{R}^n satisfying the Whitney condition with all candidate derivatives having determent 1, one can always extend f to a volume preserving F: \mathbb{R}^n \rightarrow \mathbb{R}^n.

Note: requiring the set to have connected complement is to avoid the ‘larger circle to small circle’ case and if some candidate derivative does not have determent 1 then the extended map cannot possibly be volume preserving near the point.

After thinking about this for a little bit, we (me, Charles and Amie) came up with the following simple example where the map can only be C^1 extended but not C^1 volume preserving.

Example: Let K \subset \mathbb{R}^2 be the countable union of segments:

K = \{0, 1, 1/2, 1/3, \cdots \} \times [0,1]

As shown below:

Define f: K \rightarrow K be the map that sends the vertical segment above 1/n to the vertical segment above 1/(n+1), preserves the y-coordinate and fixes the segment \{0\} \times [0,1]:

Claim: f can be extended to a C^1 map F: \mathbb{R}^2 \rightarrow \mathbb{R}^2.

Proof: Define g: \mathbb{R} \rightarrow \mathbb{R} s.t.

1) g is the identity on \mathbb{R}^{\leq 0}

2) g(x) = x-1/2 for x>1

3) g: 1/n \mapsto 1/(n+1)

4) g is increasing and differentiable on each [1/n, 1/(n-1)] with derivative no less than (1-1/n)(n^2-n)/(n^2+n) and the one sided derivative at the endpoints being 1.

It’s easy to check such g exists and is continuous:

Since \lim_{n \rightarrow \infty}  (1-1/n)(n^2-n)/(n^2+n) = 1, we deduce g is continuously differentiable with derivative 1 at 0.

Let F = g \otimes \mbox{id}, F: \mathbb{R}^2 \rightarrow \mathbb{R}^2 is a C^1 extension of f.

Establishes the claim.

Hence the pair (K, f) satisfies the Whitney condition for extending to C^1 map. Furthermore, since the F as above has derivative being the identity matrix at all points of K, the determent of candidate derivatives are uniformly 1. In other words, this example satisfies all conditions in the question.

Claim: f cannot be extended to a C^1 volume preserving diffeomorphism of the plane.

Proof: The idea here is to look at rectangles with sides on the set K, if F preserves area, they have to go to regions enclosing the same area as the original rectangles, then apply the isoperimetric inequality to deduce that image of some edges of the rectangle would need to be very long, hence at some point on the edge the derivative of F would need to be large.

Suppose such extension F exists, consider rectangle R_n = [1/n, 1/(n-1)] \times [0,1]. We have

m_2(R_n) = 1/(n^2-n)

m_2(R_n) - m_2(R_{n+1})


Hence in order for F(R_n) to have the same area as R_n, the image of the two segments

s_{n,0} = [1/n, 1/(n-1)] \times \{ 0\} and

s_{n,1}= [1/n, 1/(n-1)] \times \{ 1\}

would need to enclose an area of 2/(n^3-n) \sim n^{-3} outside of the rectangle R_{n+1}.

By isoparametric inequality, the sum of the length of the two curves must be at least \sim n^{-3/2}, while the length of the original segments is 2/(n^2-n) \sim n^{-2}.

Hence somewhere on the segments F needs to have derivative having norm at least

\ell(F(s_{n,0} \cup s_{n,1}) / \ell(s_{n,0} \cup s_{n,1})

\sim n^{-3/2}/n^{-2} = n^{1/2}

We deduce that there exists a sequence of points (p_n) converging to either (0,0) or (0,1) where

|| F'(p_n) || \sim n^{1/2} \rightarrow \infty.

Hence F cannot be C^1 at the limit point of (p_n).

Remark: In fact we have showed the stronger statement that no volume preserving Lipschitz extension could exist and gave an upper bound 1/2 on the best possible Holder exponent.

From this we know the answer to the above question is negative, i.e. not all C^1 extendable map can me extended in a volume preserving fashion. It would be very interesting to give criteria on what map on which sets can be extended. By applying same methods we are also able to produce an example where the set K is a Cantor set on the plane.

A convergence theorem for Riemann maps

January 17, 2011

So~ After 2.5 weeks of wonderful math discussions with Amie and Charles, I finished my winter vacation and got back to Princeton! (and back to my normal blogging Sundays ^^)

One thing I would like to shear here is that we (me and Charles) finally got an answer to the following question that’s been haunting me for a while:

Question: Given Jordan curve C \subseteq \mathbb{C} containing a neighborhood of \bar{0} in its interior. Given parametrizations \gamma_1:S^1 \rightarrow C.

Is it true that for all \varepsilon >0, there exists \delta >0 s.t. any Jordan curve C' with a parametrization \gamma_2:S^1 \rightarrow C_2 so that ||\gamma_1-\gamma_2||<\delta in the uniform norm implies the Riemann maps R, R' from \mathbb{D} to the interiors of C, C' that fixes the origin and have positive real derivatives at \bar{0} would be at most \varepsilon apart?

i.e. Is the projection map from the space of parametrized Jordan curves (with the uniform metric) to the space of unparametrized Jordan curves (with metric given by taking uniform distance between the canonical Riemann maps) continuous?

First, I think the development and problem-solving process for this one is quite interesting and worth mentioning (skip this if you just want to see math):

—Begin story—

The problem was initially of interest because I stated a lemma on our Jordan curves paper which asserts the above projection map is continuous at smooth curves. To my surprise, I was unable to prove this seemingly-direct lemma. I turned to Charles for help, after a day or so of thinking he proved it for smooth curves (via a very clever usage of cross-cuts as in the proof of Carathedory’s theorem) and asked back whether the map is actually continuous at all points.

This seemed to be such a natural question but we couldn’t find it in the literature. For a day or so we were both feeling negative about this since the cross-cut method fails when the Jordan curve has positive measure, which “should” happen a lot. In any case, I posted a question on mathoverflow to see if there is a standard theorem out there implying this. Almost right after I posted the question, during a wonderful lunch-conversation with Charles, I got this wonderful idea of applying extremal length techniques not to the semi-circular crosscut but only to the ‘feet’ of it. Which later that day turned out to be a proof of the continuity.

The next morning, after confirming the steps of the proof and made sure it works, I was thrilled to find that Thurston responded to the post and explained his intuition that the answer is positive. Although having solved the problem already, I am still amazed by his insights ^^ (It’s the second question I asked there, he left an comment again! It just feels great to have your idol giving you ideas, isn’t it? :-P)

Later on, McMullen pointed out to us that in fact a book by Pommerenke contains the result. Nevertheless, it was great fun proving this, hence I decided to sketch the proof here ^^

—End story—

Ingredients of the proof: We quote the following well-known but weaker theorem (one can find this in, for example Goluzin’s classical book, p228)

Theorem: If the Jordan domains converge (in the sense that parametrizations of the boundaries converge uniformly) then the Riemann maps converge uniformly on compact sets.

We also use the following topological lemma:

Lemma: Given Jordan curve C \subseteq \hat{\mathbb{C}}, \gamma: S^1 \rightarrow C be a parametrization. For all \varepsilon > 0, there exists \mu >0 s.t. for all \gamma' : S^1 \rightarrow \hat{\mathbb{C}} with || \gamma - \gamma'|| < \mu ( denote C' = \gamma'(S^1)$) , for all p, q \in C', d(p,q) < \mu \Rightarrow \mbox{diam}(A(p,q)) < \varepsilon

where A(p,q) is the short arc in C' connecting p, q.

The proof of the lemma is left as an exercise

Proof of the Theorem:

Given C and \varepsilon as in the theorem, apply the lemma to (C, \varepsilon/6), we obtain a \mu < \varepsilon / 6 so that all curves \mu-close to C has the property that the arc connecting any two points less than \mu-apart has diameter no more than \varepsilon/100.

By compactness of \partial \mathbb{D}, we can choose finitely many crosscut neignbourhoods \{ H_1, H_2, \cdots, H_N \}, H_i \subseteq \bar{\mathbb{D}} are "semi-discs" around points in \partial \mathbb{D} as shown:

By extremal length, we can choose the cross-cuts C_i bounding H_i with length \ell(R(C_i)) < \mu/4 where R: \bar{\mathbb{D}} \rightarrow \hat{\mathbb{C}} is the canonical Riemann map corresponding to C. Hence by lemma, we also get \mbox{diam}(R(H_i) < \varepsilon/3.

Let \{ f_1, f_2, \cdots, f_{2N} \} be endpoints of \{C_1, \cdots, C_N \}.

Let d = \min \{ d(f_i, f_j) \ | \ 1 \leq i < j \leq 2N \}.

Choose \sigma < \mu d / 40 and \{ B( \bar{0}, 1-2\sigma), H_1, \cdots, H_N \} covers \bar{\mathbb{D}}. Let R = 1-\sigma:

By the above theorem in Goluzin, since B_R = \bar{B(0, R)} is compact, there exists a 0 < \delta < \min \{\mu/4, d/10 \} s.t.

|| \gamma' - \gamma || < \delta \Rightarrow ||R|_{B_R} - R'|_{B_R}|| < \mu/4.

Fix a (C', \gamma') with || \gamma - \gamma'|| < \delta. Let R' be the canonical Riemann map corresponding to C'.

Claim: ||R-R'|| < \varepsilon.

First note that assuming the theorem in Goluzin, it suffice to show ||R|_{\partial \mathbb{D}} - R'|_{\partial \mathbb{D}}|| < \varepsilon.

For any 1 \leq i \leq N, let f_1, f_2 be endpoints of C_i. Apply the extremal length to the set of radial segments in the almost-rectangle [f_1, f_1+d/10] \times [0,\sigma].

We conclude there exists e_1 \in [f_1, f_1+d/10] s.t. the segment s_1 = \{e_1\} \times [0, \sigma] has length

\ell(R'(s_1)) \leq 2 \sigma (d/10) m_2(R'([f_1, f_1+d/10] \times [0,\sigma])).

Since \sigma < \mu d / 40 and m_2(R'([f_1, f_1+d/10] \times [0,\sigma])) \leq 1, we have

\ell(R'(s_1)) \leq \mu/4.

Similarly, find e_2 \in [f_2 - d/10, f_2] where \ell(R'(s_2)\leq \mu/4.

Connect e'_1, e'_2 by a semicircle contained in H_i, denote the enclosed region by V_i \subseteq H_i.

By construction, \{ B_R, V_1, \cdots, V_N \} still covers \bar{\mathbb{D}}.

Hence for all p \in \partial \mathbb{D}, there exists i where p \in latex V_i$.

Since inside V_i \cap B_R the two maps R, R' are less than d/10 apart, we have R(V_i) \cap R'(V_i) \neq \phi.

Hence d(R(p), R'(p)) \leq \mbox{diam}(R(H_i)) + \mbox{diam}(R'(V_i)).

By construction, \mbox{diam}(R(H_i)) < \varepsilon/2.

\mbox{diam}(R'(V_i)) = \mbox{diam}(\partial V_i), we will break \partial V_i into three parts and estimate diameter of each part separately.

Since ||\gamma-\gamma'|| < \delta, \tau = \gamma' \circ \gamma^{-1} \circ R|_{\partial \mathbb{D}} is another parametrization of C' with || \tau - R|_{\partial \mathbb{D}}|| < \delta.

The arc connecting e'_1 to e'_2 is contained in B_R \cap V_i, the arc in C' connecting \tau(e_1), \tau(e_2) is \delta away from R(H_i) hence the union of the two has diameter at most \mbox{diam}(R(V_i)) + \delta < \varepsilon/6 + \varepsilon/6 = \varepsilon/3

Length of the arcs R'(s_1), R'(s_2) are less than \mu/4 < \varepsilon/12.

Hence d(\tau(e_1), R'(e_1)) < \ell(R'(s_1)) + \delta < \mu. By lemma, this implies the arc in C' connecting \tau(e_1), R'(e_1) has length at most \varepsilon/12.

Hence altogether the we have \mbox{diam}(R'(V_i)) \leq \varepsilon/3+\varepsilon/12+\varepsilon/12 = \epsilon/2.

We deduce d(R(p), R'(p)) \leq \mbox{diam}(R(H_i)) + \mbox{diam}(R'(V_i)) < \varepsilon.


The moving needle problem

November 22, 2010

First, let’s clarify that this post has nothing to do with the Kakeya conjecture (except for the word ‘needle’ in it). Anyways, I was asked the following question via an e-mail from Charles this summer: (It turns out that the question was invented by Jonathan King and then communicated to Morris Hirsch and that’s where Charles heard about it from) In any case, I find the problem quite cute:

Problem: Given a smoothly embedded copy of \mathbb{R} in \mathbb{R}^3 containing \{ (x,0,0) \ | \ x \in (-\infty,-C] \cup [C, \infty) \}. Is it always possible to continuously slide a unit length needle lying on the ray (-\infty, -C] to the ray [C, \infty), while keeping the head and tail of the needle on the curve throughout the process?

i.e. the curve is straight once it passes the point (-C, 0) and (C,0), but can be bad between the two points:

We are interested in sliding the needle from the neigative x-axis to the positive x-axis:

Exercise: Try a few examples! It’s quite amusing to see that sometimes both ends of the needle needs to go back and forth along the curve many times, yet it always seem to get through.

One should note that this is not possible if we just require the curve to be eventually straight and goes to infinity at both ends. As we can see on a simple ‘hair clip’ curve:

The curve consists of two parallel rays of distance <1 apart, connected with a semicircle. A unit needle can never get from one ray to the other since the needle would have to rotate 180 degrees and hence it has to be vertical at some point in the process, but no two points on the curve has vertical distance 1.

After some thought, I think I can show for each given curve, 'generic' needle length can pass through:

Claim: For any C^2 embedding as above, there is a full measure and dense G_\delta set \mathcal{L} \subseteq \mathbb{R} of lengths where the needle of any length L \in \mathcal{L} can slide through the curve.

Proof: Let \gamma: \mathbb{R} \rightarrow \mathbb{R}^3 be a smooth parametrization of the curve s.t. \gamma(t) = t for t \in (-\infty, -C] \cup [C, \infty).

Define \varphi: \mathbb{R}^2 \rightarrow \mathbb{R} where \varphi: (s, t) \mapsto d(\gamma(s), \gamma(t)).

Hence \varphi vanishes on the diagonal and takes positive value everywhere else.
Since \gamma(t) = t for t \notin (-C, C), hence \varphi(s,t)=|s-t| for |s|, |t|>C. \varphi^{-1}(L) contains four rays \{ |s-t| = L \ | \ s, t \notin (-C, C) \}:

Observation: a needle of length L can slide through the curve iff there is a continuous path in the level set \varphi(p) = L connecting the two rays above the diagonal. (This is merely projection onto the x and $larex y$-axis.)

We also have \varphi is C^2 other than on the diagonal. (It behaves like the absolute value function near the diagonal). By Sard’s theorem, since \varphi is C^2 on \{s < t\}, the set of critical values is both measure 0 and first category.

Let \mathcal{L} be the set of regular values of \varphi. For any L \in \mathcal{L}, by implicit function theorem, the level set \varphi^{-1}(L) is a C^2 sub-manifold.

Since the arc \gamma([-C, C]) is compact, we can find large R where \gamma([-C, C]) \subseteq B(\bar{0}, R).

Hence for |t| > R + L and s \in [-C, C], we have

\varphi (s, t) = d(\gamma(s), \gamma(t)) > d(\gamma(t), B(\bar{0}, R)) > L

The same holds with |s| > R + L and t \in [-C, C]

i.e. \varphi takes value >L in the shaded region below:

Hence for L \in \mathcal{L}, \varphi^{-1}(L) \cap \{x \leq y\} is a 1 dimensional sub-manifold, outside a bounded region it contains only two rays. We also know that the level set is bounded away from the diagonal since \varphi vanishes on the diagonal. By an non-ending arc argument, one connected component of \varphi^{-1}(L) must be a curve connecting the end points of the two rays. Establishes the claim.

Remarks: This problem happens to come up at the very end (questio/answer part) of Charles’s talk in the midwest dynamics conference last month (where he talked about our joint work about funnel sections). A couple weeks later Michal Misiurewicz e-mailed us a counter-example when the curve is not smooth (only continuous).

Initially I tried to use the above argument to get the length 1 needle. Everything works fine until a point where one has a continuum in the level set connecting the end-points of the two rays. We want the continuum to be path connected. I got stuck on that. In the continuous curve case, Michal’s counter-example corresponds to the continuum containing a \sin(1/x) curve, hence is not path connected.

I believe such thing cannot happen for smooth. The hope would be that the length 1 needle can slide through any C^2 (or C^1) curve. (Note that once the length 1 needle can pass through, then all length can pass through just by rescaling the curve.) In any case, still trying…

On C^1 closing lemma

May 25, 2010

Let f: M \rightarrow M be a diffeomorphism. A point p is non-wandering if for all neighborhood U of p, there is increasing sequence (n_k) \subseteq \mathbb{N} where U \cap f^{n_k}(U) \neq \phi. We write p \in \mathcal{NW}(f).

Closing lemma: For any diffeomorphism f: M \rightarrow M, for any p \in \mathcal{NW}(f). For all \varepsilon>0 there exists diffeomorphism g s.t. ||f-g||_{C^1} < \varepsilon and g^N(p) = p for some N \in \mathbb{N}.

Suppose p \in \mathcal{NW}(f), \overline{\mathcal{O}(p)} is compact, then for any \varepsilon>0, there exists x_0 \in B(p, \varepsilon), k \in \mathbb{N} s.t. f^k(x) \in B(p, \varepsilon).

First we apply a selection process to pick an appropriate almost-orbit for the closing. Set x_i = f^i(x_0), \ 0 \leq i \leq k.

If there exists 0 < j < k where

\min \{ d(x_0, x_j), d(x_j, x_k) \} < \sqrt{\frac{2}{3}}d(x_0, x_k)

then we replace the origional finite sequence by (x_0, x_1, \cdots, x_j) or (x_j, \cdots, x_k). Iterate the above process. since the sequence is at least one term shorter after each shortening, the process stops in finite time. We obtain final sequence (p_0, \cdots, p_n) s.t. for all 0 < i < n,

\min \{ d(p_0, p_i), d(p_i, p_n) \} \geq \sqrt{\frac{2}{3}}d(p_0, p_n).

Since the process is applied at most k times, x_0, x_k \in B(p, \varepsilon), after the first shortening, d(p, x_{i_1}) \leq \max \{d(p, x_0), d(p, x_k) \} + \sqrt{\frac{2}{3}}d(x_0, x_k) \leq \varepsilon +  2 \sqrt{\frac{2}{3}} \varepsilon.

i.e. both initial and final term of the sequence is at most (\frac{1}{2}+ \sqrt{\frac{2}{3}}) 2 \varepsilon. Along the same line, we have, at the i-th shortening, the distance between the initial and final sequence and p is at most (\frac{1}{2} + \sqrt{\frac{2}{3}} + (\sqrt{\frac{2}{3}})^2 + \cdots (\sqrt{\frac{2}{3}})^i) 2 \varepsilon. Hence for the final sequence p_0, p_n \in B(p, 1+2 \sqrt{\frac{2}{3}}/(1-\sqrt{\frac{2}{3}}) \varepsilon) \subseteq B(p, 10 \varepsilon).

There is a rectangle R \subseteq M where p_0, p_n \in \sqrt{\frac{3}{4}}R
(i.e. shrunk R by a factor of \sqrt{\frac{3}{4}} w.r.t. the center) and for all 0 < i < n, \ p_i \notin R.

Next, we perturb f in R i.e. find h: M \rightarrow M with ||h||_{C^1} < \delta and h|_{M \backslash R} = id. Hence ||h \circ f - f ||_{C^1} < \delta.

Suppose R = I_1 \times I_2; L_1, L_2 are the lengths of I_1, I_2, L_1 < L_2.
By main value theorem, for all x \in M, \ d(x, h(x)) < \delta L_1.
On the other hand, since p_0 \in \sqrt{\frac{3}{4}}R, it's at least \frac{1}{2}(1-\sqrt{\frac{3}{4}})L_1 away from the boundary of R. i.e. there exists bump function h satisfying the above condition and d(p_0, h(p_0)) > \frac{\delta}{8}(1-\sqrt{\frac{3}{4}})L_1.

Hence in order to move a point by a distance L_1, we need about 1/ \delta such bump functions, to move a distance L_2, we need about \frac{L_2}{\delta L_1} bumps.

For simplicity, we now suppose M is a surface. By starting with an \varepsilon (and hence R) very small, we have for all 0 \leq i \leq N+M, \ f^i(R) is contained in a small neighbourhood of p_i. Hence on f^i(B), f^i is C^1 close to the linear map p_i + Df^i(p_0)(x-p_0). Hence mod some details we may reduce to the case where f is linear in a neighborhood of \mathcal{O}(p_0).

By choosing appropiate coordinate system in R, we can have f preserving the horizontal and vertical foliations and the horizontal vectors eventually grow more rapidly than the vertical vectors.

It turns out to be possible to choose R to be long and thin such that for all i \leq 40 / \delta, f^i(R) has height greater than width. (note that M = \lfloor 40/ \delta \rfloor bumps will be able to move the point by a distance equal to the width of the original rectangle R. Since horizontal vectors eventually grow more rapidly than the vertical vectors, there exists N s.t. for all N \leq i \leq N+M, f^i(R) has width greater than its height.
For small enough \epsilon, the boxes f^i(R) are disjoint for 0 \leq i \leq N+40/ \delta. Construct h to be identity outside of

\displaystyle \bigsqcup_{i=0}^M f^i(R) \sqcup \bigsqcup_{i=N}^{N + M} f^i(R)

For the first M boxes, we let h preserve the horizontal foliation and move along the width so that g = h \circ f has the property that g^M(p_n) lies on the same vertical fiber as f^M(p_0).

On the boxes f^{N+i}(R), \ 0 \leq i \leq M, we let h pushes along the vertical direction so that

g^{N+M}(p_n) = f^{N+M}(p_0)

Since iterates of the rectangle are disjoint, for N+M \leq i \leq n, \ h(p_i) = p_i, g(p_i) = f(p_i).

Hence g^n(p_n) = g^{n-(N+M)} \circ g^{N+M}(p_n) = g^{n-(N+M)} f^{N+M}(p_0) = g^{n-(N+M)} (p_{N+M}) = p_n.

Therefore we have obtained a periodic point p_n of g.

Since p_n \in B(p, 10 \varepsilon), we may further perturb g to move p_n to p. This takes care of the linear case on surfaces.

On plaque expansiveness

May 5, 2010

This note is mostly based on parts of (RH)^2U (2006) and conversations with R. Ures while he was visiting Northwestern.

Let \mathcal{F} be a foliation of the manifold M, for p \in M, a plaque in of \mathcal{F} through p is a small open neighborhood of p in the leaf \mathcal{F}_p that’s pre-image of a disc via a local foliation chart. (i.e. plaques stuck nicely to make open neighborhoods where the foliation chart is defined.) For \varepsilon small enough, whenever the leaves of \mathcal{F} are C^1, the path component of B(p, \varepsilon) containing p is automatically a plaque, we denote this by \mathcal{F}_\varepsilon(p).

Given a partially hyperbolic diffeomorphism f: M \rightarrow M, suppose the center integrates to foliation \mathcal{F}^c.

Definition: An \varepsilon-pseudo orbit w.r.t. \mathcal{F}^c is a sequence (p_n) where for any n \in \mathbb{Z}, f(x_n) \in \mathcal{F}^c_\varepsilon(x_{n+1}).

i.e. p_{n+1} is the f-image of p_n except we are allowed to move along the center plaque for a distance less than \varepsilon.

Definition: f is plaque expansive at \mathcal{F}^c if there exists \varepsilon>0 s.t. for all \varepsilon-pseudo orbits (p_n),  (q_n) w.r.t. \mathcal{F}^c, d(p_i, q_i)<\varepsilon for all i \in \mathbb{Z} then p_0 \in \mathcal{F}^c_\varepsilon(q_0).

i.e. any two pseudo-orbits in different plagues will eventually (under forward or backward iterates) be separated by a distance \varepsilon.

In the book Invariant Manifolds (Hirsch-Pugh-Shub), it’s proven that

Theorem: If a partially hyperbolic system has plaque expansive center foliation, then the center being integrable and plaque expansiveness are stable under perturbation (in the space of diffeos). Furthermore, the center foliation of the perturbed system g is conjugate to the center foliation of the origional system f in the sense that there exists homeomorphism h: M \rightarrow M where

1) h sends leaves of \mathcal{F}^c_f to leaves of \mathcal{F}^c_g i.e. for all p \in M,

h(\mathcal{F}^c_f(p)) = \mathcal{F}^c_g(p)

2) h conjugates the action of f and g on the set of center leaves i.e. for all p \in M,

h \circ f \ (\mathcal{F}^c_f(p)) = g \circ h \ ( \mathcal{F}^c_f(p))

(both sides produce a \mathcal{F}^c_g leaf)

Morally this means plaque expansiveness implies structurally stable in terms of permuting the center leaves.

It’s open whether or not any partially hyperbolic diffeomorphism with integrable center is plaque expansive w.r.t. its center foliation.

Another problem, stated in HPS about plaque expansiveness is:

Question: If f is partially hyperbolic and plaque expansive w.r.t. center foliation \mathcal{F}_c, then is \mathcal{F}_c the
unique f−invariant foliation tangent to E^c?

(RH)^2U has recently gave a series of super cool examples where the 1-dimensional center bundles of a C^1 partially hyperbolic diffeomorphism 1) does not integrate OR 2) integrates to a foliation but leaves through a given point is not unique (there is other curves through the point that’s everywhere tangent to the bundle). I will say a few words about the examples without spoil the paper (which is still under construction).

Start with the cat map on the 2-torus (matrix with entries ( 2, 1, 1, 1), take the direct product with the source-sink map on the circle, we obtain a diffeo on the 3 torus. For the purpose of our map, we make the expansion in the source-sink map weaker than that of the cat map and the contraction stronger.

Then we perturb the map by adding appropriate small rotations to the system, the perturbation vanish on the \mathbb{t}^2 fibers corresponding to the two fixed points in the source-sink map. This will make our system partially hyperbolic, with center bundles as shown below:

To construct a non-integrable center, we make a perturbation that gives center boundle (inside the unstable direction of the cat map times the circle):

For intergrable but have non-unique center leaves, we simply rotate the upper and bottom half in opposite directions and obtain:

Note that in this case, all center leaves are merely copies of S^1. The example is plaque expansive due to to fact that all centers leaves are compact (and of uniformly bounded length). However, although the curve through any given point tangent to the bundle is non-unique, there is only one possible foliation of the center. Hence this does not give a counter example to the above mentioned question in HPS.

I think there are hopes to modify the example and make one that has similar compact leafs but non-unique center foliation, perhaps by making the unique integrability fail not only on a single line.

On Alexander horned sphere

April 18, 2010

As I was drawing pictures for some stuff that should be done a year ago, I found this part would make a cool blog post, so here it is ^^ (well I admit that I mainly just want t show off the picture)

For kids who doesn’t know, let’s first talk a bit about what this ‘sphere’ is:

This is an embedded topological sphere in \mathbb{R}^3 which has non-simply connected exterior. Also, Since the surface is compact, through inversion about any point bounded away from infinity by the surface, we obtain a ‘sphere’ that bounds a non-simply connected region inside. This shows that the topology of the complement of a compact surface depends on the embedding, which is not true for embeddings of  compact 1-dimensional manifolds in \mathbb{R}^2. (i.e. all Jordan curves separates the plane into two simply connected open sets, via the Jordan curve theorem)

The construction, as shown in the beautiful 2 page article by Alexander, goes as follows:

Take an ordinary sphere (stage 0), stretch and bend it like a banana so that the two ‘end caps’ are supported on a pair of parallel circles such that one lies vertically on top of the other (state 1). Next, on each cap we develop a banana shape, the banana shape on the two caps link though each other and again has their caps supported on a pair of parallel circles (stage 2).  Continue the process to add successively smaller bananas on the caps produced in the immediate preceding stage.

Claim: The limit is a topological sphere.

To see this, we build homeomorphisms from S^2 to each sphere in the intermediate stages. i.e. let

h_n: S^2 \rightarrow S^2_n

be a homeomorphism where S^2_n \subseteq \mathbb{R}^3 is the embedded sphere at stage n.

We may take h_n s.t. h_n^{-1} restricting to the complement of the (n-1)th stage caps (denoted by C_{n-1}) agree with h_{n-1}^{-1}. Hence union the maps h_n|_{C_{n-1}} gives a continuous map on the complement of a Cantor set on the sphere. (Since C_n is increasing and the caps gets smaller) This map can be extended continuously to the whole sphere because any neighborhood of points in the Cantor set contains pre-image of some sufficiently small cap.

The extension h is injective since any two points in the Cantor set will be separated by a pair of disjoint pre-image of small caps. Since the sphere is compact, we conclude h is a homeomorphism. i.e. the limiting surface is a topological sphere.

The exterior of the surface is not simply connected as a loop just outside the ‘equator’ can’t be contracted to a point. In fact, it’s also easy to show that the fundamental group of the exterior is not finately generated.

For some reason, Charles and I wanted to create a diffeotopy of from the standard sphere to an Alexander horned sphere. (with differentiability failing only at time one, and this is necessary since there can be no diffeomorphism from the sphere to the horned sphere, otherwise it would extend to a neighbourhood of the surfaces and hence the whole \mathbb{R}^3, but the exteriors of the two are not homeomorphic.)

The above figure is in fact a particular kind of Alexander horned sphere we needed. i.e. it has the property that each cap in the (n+1)th stage has diameter less than 1/2 of that in the nth stage, and the distance between the parallel circles is also less than 1/2 of that in the previous stage. Spheres at each stage is differentiable.

This would allow us to construct a diffeotopy that achieves stage n at time 1-1/2^n, the diffeotopy is of bounded speed as all horns are half as large as the pervious stage, hence once we get to the first stage with bounded speed, making all points traveling at that maximum speed would get one to the next stage using 1/2 as much time.

However, we do not know if all horned sphere can be achieved by s diffeotopy from the standard sphere. i.e. does the property of being a ‘diffeotopic sphere’ depend on the embedding in \mathbb{R}^3.

Many thanks to Charles Pugh for forcing me to look at this business. It is indeed very fun~

Moser’s theorem with boundary

January 29, 2010

In the process of constructing a diffeo with a uniformly hyperbolic set with intermediate measure, I came across the following problem which I find interesting in its own right:

Given a C^1 diffeo f:S^1 \rightarrow S^1, when does it extend to a volume preserving diffeo of the unit disc to itself?

Some people suggested me to look into Moser’s theorem for volume forms on compact manifolds, I found it pretty cool, so here it is (taken from Moser’s paper):

Theorem: Given two smooth volume forms \tau, \sigma with same total volume on a compact connected C^\infty manifold M, there exists diffeomorphism \phi: M \rightarrow M s.t. \sigma = \phi^* \tau.

However this does not directly apply to the boundary problem as we are dealing with manifolds (disc) with boundary rather than entire compact manifolds…Hence to make it applicable to the case, I would have to get some kind of ‘relative’ version of the theorem.

The expectation is that this can be done by plugging in the case into Moser’s proof and see if it can be modified.. I’ll update this pose when I got around to do that (hopefully in the next few days)

Okay…I think I have finally figured out how to do this. (with a huge amount of hints and directions from Keith Burns). But temporarily I have to lose one degree of regularity when making the extension (i.e. starting with a C^2 f, extend it to a C^1 volume preserving. But at this point I strongly believe that we can in fact do it with f being C^1. (the regularity is lost when I extended the diffeo locally)

Theorem: Given C^2 diffeomorphism f:S^1 \rightarrow S^1, we can find (Lebesgue) volume preserving diffeomorphism \hat{f}: \mathbb{D} \rightarrow \mathbb{D} that extends f. (where \mathbb{D} is the closed unit disc and S^1 is its boundary)


First we extend the diffeo f:S^1 \rightarrow S^1 to a neignbourhood of S^1 in \mathbb{D}.

Let A = \{ \ r e^{i \theta} \ | \ 1/2 < r \leq 1 \} be the half-open annuals with radius 1/2 and 1.

Let C = \mathbb{R} / 2 \pi \mathbb{Z} \times [0, \infty) be the half-cylinder.

Define \phi: A \rightarrow  C s.t. \phi (r e^{i \theta}) = (\theta, 1/2 - r^2/2)

\phi is a volume preserving diffeo from A to \mathbb{R} / 2 \pi \mathbb{Z} \times [0, 3/8)

Consider h = \phi \circ f \circ \phi^{-1}: \mathbb{R} / 2 \pi \mathbb{Z} \times \{0\} \rightarrow \mathbb{R} / 2 \pi \mathbb{Z} \times \{0\}

(h^{-1})' is continuous hence bounded.

Choose \epsilon < 1/(3 \max_\theta | (h^{-1})' (\theta) | )

Define \hat{h}: \mathbb{R} / 2 \pi \mathbb{Z} \times [0, \epsilon) \rightarrow C s.t.

\hat{h}(\theta, y) = ( \pi_1(h(\theta, 0) ), y |(h^{-1})'(\theta) | )

y |(h^{-1})'(\theta) | <  \epsilon \max_\theta | (h^{-1})'(\theta) |  0 \ \mathbb{R} / 2 \pi \mathbb{Z} \times [0, \delta) \subseteq \hat{h}(\mathbb{R} / 2 \pi \mathbb{Z} \times [0, \epsilon/4)).

Let \epsilon'=1-\sqrt{1-2\epsilon}, \ \delta'=1-\sqrt{1-2\delta}. Let g:N_{\epsilon'}(S^1) \rightarrow A, \ g := \phi^{-1} \circ \hat{h} \circ \phi.

Hence g is volume preserving, g|_{S^1} = f and N_{\delta'}(S^1) \subseteq g(N_{\epsilon'/2}(S^1)).

Hence we have successfully extended f to a neighborhood of S^1 in a volume preserving way.

Now we further extend g|_{N_{\epsilon'/2}(S^1)} to a (non-volume-preserving) diffeo of \mathbb{D} to itself.

This can be done by first take f \times I on C, average it with g by a C^\infty bump function that is 1 on \mathbb{D} \backslash N_{\epsilon'}(S^1) and vanishes on N_{\epsilon'/2}(S^1).

Since both functions preserve vertical segments, taking the resulting map back to A will produce a diffeo except for at the point 0. We may smooth out the map at 0. Call the resulting diffeo \hat{g}: \mathbb{D} \rightarrow \mathbb{D}

Define measure \mu on \mathbb{D} by \mu(B) = \lambda(\hat{g}^{-1}(B)) where B is any Lebesgue measurable set and \lambda is the Lebesgue measure.

Since \hat{g}^{-1} is volume preserving on N_{\delta'}(S^1), hence \mu is equal to \lambda on N_{\delta'}(S^1).

Also we have \mu(\mathbb{D}) = \lambda(\hat{g}^{-1}(\mathbb{D}) = \lambda(\mathbb{D}).

Hence we can apply lemma 2 in Moser’s paper:

Lemma: If two C^r volume forms \mu_1, \ \mu_2 agree on an \epsilon neighbourhood of the boundary of the cube (disc in our case) with the same total volume, then there exists C^r diffeo \psi from the cube (disc) to itself s.t. \psi is identity on the \epsilon neighbourhood of the boundary and \mu_1(\psi(B)) = \mu_2(B) for all measurable set B.

We apply the lemma to \mu and \lambda, obtain \psi.

Hence \lambda(B) = \mu(\psi(B)) = \lambda(\hat{g}^{-1} \circ \psi(B))

\hat{g}^{-1} \circ \psi is a C^1 diffeo that preserved Lebesgue volume, so is its inverse \psi^{-1} \circ \hat{g}.

Let \hat{f} = \psi^{-1} \circ \hat{g}, \ \hat{f}|_{S^1} = \hat{g}|_{S^1} = f.

Hence \hat{f} is a volume preserving extension of f.