classical mechanics - Finding generalized coordinates when the implicit function theorem fails

Tuesday, 26 August 2014

classical mechanics - Finding generalized coordinates when the implicit function theorem fails

Given some coordinates $(x_1,\dots, x_N)$ and $h$ holonomic constraints, it should always be possible to reduce the coordinates to $n=N-h$ generalized coordinates $(q_1,\dots, q_n)$. This is guaranteed by the implicit function theorem (its application of reducing coordinates is briefly mentioned here).

For a constraint $F(x, y) = 0$ concerning only two coordinates, the implicit function theorem states that a function $g$ wherefore $y = g(x)$ exists in a neighbourhood of a point $(x_0, y_0)$ satisfying the constraint only if

$F$ is continuously differentiable, and

$\frac{\partial F}{\partial y}(x_0,y_0) \neq 0$.

However, this is not always the case.

Example 1: unit circle

As pointed out by Emilio Pisanty, an easy example is a particle that must move on the unit circle in the $(x, y)$ plane: $F(x, y) = x^2 + y^2 - 1 = 0$ with derivative $\frac{\partial F}{\partial y}(x, y) = 2y$. When trying to eliminate $y$ as a coordinate, there are two local solutions: either $y = \sqrt{1-x^2}$ or $y = -\sqrt{1-x^2}$, thus one can easily describe the motion of the particle locally using the Lagrangian. Unfortunately, there is a problem with two points, $(1, 0)$ and $(-1, 0)$, because the derivative $\frac{\partial F}{\partial y}$ becomes zero, thus the points joining the solutions of $y$ violate the second condition and we're unable to describe the global motion of the particle (the particle might "hop" from one solution to the other)!

Of course the problem in the example above is easily solved by choosing other initial coordinates, namely polar coordinates $(r, \theta)$. The constraint becomes $F(r, \theta) = r - 1 = 0$, which satisfies both conditions.

Example 2: quadrilateral

Imagine two pendulums hanging from a fixed ceiling tied together by a rigid string:

Choosing $\alpha_1$ and $\alpha_2$ as initial coordinates, there is only one constraint: the $d$-string, the $l$-strings and the ceiling should always form a quadrilateral with given lengths for all sides.

With some uninteresting geometry, one can find a relation between $\alpha_1$ and $\alpha_2$, as done here in a revision. The point is that, when given some $\alpha_1$, there are again two possible solutions for $\alpha_2$: $\alpha_2 = \phi_1(\alpha_1) + \phi_2(\alpha_1)$ or $\alpha_2 = \phi_1(\alpha_1) - \phi_2(\alpha_1)$, where $\phi_1$ and $\phi_2$ are defined as in the revision, but are unimportant. Here is an illustration of the two possibilities:

The constraint can be written as $F(\alpha_1, \alpha_2) = |\alpha_2 - \phi_1(\alpha_1)| - \phi_2(\alpha_1) = 0$, which violates the first condition, or it can be written as $F(\alpha_1, \alpha_2) = (\alpha_2 - \phi_1(\alpha_1))^2 - \phi_2(\alpha_1)^2 = 0$, which violates the second condition. Again, I'm not able to describe the global motion of the system.

In the previous example this could be solved by choosing different initial coordinates, but I don't know whether this can be done here. This brings me to my questions.

Questions

Is it always possible to choose initial coordinates in such a way that all constraints satisfy the conditions of the implicit function theorem?

If so, is there a systematic way of finding them?

If not, is it still possible to describe the global motion of the system using the Lagrangian?

Answer

Generally speaking, given a set of coordinates $x_1,\ldots,x_N$ under a set of $h=N-n$ holonomic constraints of the form $F_j(x_1,\ldots,x_N)=0$, you won't be able to find a subset $x_1,\ldots,x_n$ of your original coordinates that will function globally as generalized coordinates: the implicit-function theorem tells you that locally you are guaranteed such a subset, but in general that subset won't work everywhere.

This is kind of the main reason why we work with generalized coordinates $q_1,\ldots,q_n$, because they allow us to re-parametrize the available configuration space in a way that requires fewer coordinate charts. However, even those are not a full solution, because generally speaking the constraints will define the available coordinate space as a manifold which might require two or more charts in its minimal atlas, i.e. you are never guaranteed the existence of a set of coordinates which will work globally. Some examples:

A particle in a plane confined to the unit circle is best parametrized via the polar angle, but even this isn't perfect (it fails to reproduce the periodicity when taken literally).

For something where it's more clear that coordinate patches just won't work, consider e.g. a particle in 3D confined to the surface of a double torus: a single coordinate patch will simply be unable to cope with the nonzero genus of the surface.

This is why analytical mechanics, when done right (such as e.g. in the style of V.I. Arnol'd) works abstractly with the configuration space as a manifold and acknowledges that coordinate patches can only ever be local - and that that's OK. This means that wherever your configuration space is a regular manifold, you can always find a chart that works, and you can solve the dynamics in an explicit coordinate-dependent fashion there; if the solution wanders off the edge of the chart, then you can always use a separate chart where things are just fine.

For your configuration, though, you don't need to get particularly fancy, because the core topology of the problem is generally the same as that of the unit circle mentioned above. This can be seen by rephrasing the constraint in the form $F(\alpha_1,\alpha_2) = d$ where $$ F(\alpha_1,\alpha_2) = \bigg\|(D-l\cos(\alpha_2),-\sin(\alpha_2))-(l\cos(\alpha_1),-l\sin(\alpha_1))\bigg\|. $$ How do you analyze this? Well, have a look at a contour plot of $F(\alpha_1,\alpha_2)$ on the $\alpha_1,\alpha_2$ plane (taking $l=D/2$ for definiteness):

Mathematica graphics

Depending on what $d$ is, your system is confined to one of the contour lines in the plot, and for the most part, these are topological circles of the boringest, most vanilla kind. This means that you can find a single angle-type generalized coordinate that parametrizes the entirety of the configuration-space manifold modulo the same periodicity problems that you get for a particle confined to the unit circle.

This isn't to say that finding that coordinate will be easy, and indeed (i) its analytical form is likely to be quite messy and (ii) this messiness will be reflected and amplified in the Euler-Lagrange equations of motion. In this specific instance I don't think there's anything to be gained by trying to find such a coordinate (as opposed to just using $\alpha_1$ and $\alpha_2$ as independent variables on the charts where they work, obtaining and solving the Euler-Lagrange EOMs on that parametrization, and just being aware of when one might need to switch charts) but ultimately it's a personal choice.

Now, that said, there are some (very specific) situations where the coordinate space might fail to be a manifold, essentially because $F(\alpha_1,\alpha_2)$ fails to have a nonzero gradient. For your problem, this occurs e.g. at the $d=0.9D$ contour line when $l=0.95D$, which looks like this,

Mathematica graphics

i.e. it has a crossing at $\alpha_1=\alpha_2=0$, where the two arms are 'crossed', and you can get out of the configuration by pulling either arm away and making the other one follow, i.e. via two distinct routes. This type of configuration is the worst you'll encounter on the systems you've mentioned ─ but, really, you almost certainly never encounter it, which is why it's generally OK to dismiss the possibility as a pathological corner case for the mathematicians to worry about.

If you do insist on handling it, then what will generally happen from a physicist's perspective is that the system will tend to have some inertia, and that will tend to carry it on the same 'branch' it's going. (Or, in other words, you parametrize the figure-of-eight curve in a regular way, with a self-crossing which you then ignore.) This will work for all cases except those where the system is at the crossing with zero velocity, in which case it will stay there or only arrive asymptotically (but, if you really push things, yes: the lagrangian mechanics are undefined after you reach that kind of a pathological point).

Or, put another way, this kind of problem isn't really a problem.

Blog

Tuesday, 26 August 2014