Two particles with initial positions and velocities r1,v1 and r2,v2 are interacting by the inverse square law (with G=1), so that
d2r1dt2=−m2(r1−r2)|r1−r2|3 d2r2dt2=−m1(r2−r1)|r1−r2|3
(the inverse square law along the line of separation). What is the complete solution of these differential equations? What is the position of the two objects as a function of time?
After reading a lot on Wikipedia, I've come to the definition of center of mass and relative coordinates:
R(t) = m1r1+m2r2m1+m2
¨r(t)r(t)2 = (m1+m2)G
Where R is the center of mass, and r is the displacement between the particles... Is this correct? How do I proceed to solve the differential equation?
Answer
Textbook solution
The first thing to do is to define the center of mass and relative coordinates:
R(t)=m1r1+m2r2m1+m2
r(t)=r2−r1
You invert this to find
r1=R−m2Mr r2=R+m1Mr
The equation of motion for R is trivial, since center of mass is a conservation law:
d2Rdt2=0
and it is solved by R(t)=V0t+R0 where V0 and R0 are the initial center of mass velocity and position respectively (which are calculated from the given initial conditions).
The nontrivial equation is for the relative coordinate: md2rdt2=−m1m2r|r|3 where m is the reduced mass: 1m=1m1+1m2.
Or: d2rdt2=−Mr|r|3
Where M=m1+m2 is the total mass.
The problem is reduced to solving the Kepler motion in a 1/r potential. From now on, I will rescale time to make the mass parameter in the r equation 1.
You can choose the x axis to lie along the initial r, and the y-axis to lie along the component of the initial ˙r perpendicular to the initial r. Another way of saying this is that you rotate the coordinates to make the angular momentum vector r×p where p=m˙r to lie along the z-axis. This rotation reduces the problem to a plane, and the rotation matrix columns are given by the normalized initial r (now along the x-axis), the component of the initial velocity perpendicular to r, normalized (along the y-axis), and normalized L along the z-axis.
You then use units to set the total over reduced mass to 1, and use polar coordinates in the x-y plane of the motion, and note that the angular momentum is constant:
r2dθdt=L
This tells you that equal areas are swept out by the r-vector in equal times. The equation of motion for r(t) (no longer a vector, now a scalar radial coordinate) is:
d2rdt2=L2r3−1r2
Then you change time out for θ, expressing everything in terms of r(θ), which you can do using the equal area law, whenever the anguar momentum is nonzero (if the initial angular momentum is zero, or very close to zero, this is a one-dimensional two-body problem which can be solved directly by more elementary means). The equation of motion for r(θ) simplifies when you make a coordinate transformation to u=1r:
d2udθ2=C−u
Where C is some unimportant constant, and this is solved by
u(θ)=1A(1+acos(θ−θ0))
Where A is the semi-major axis of the ellipse (if the orbit is an ellipse), θ0 determines the orientation in the x-y plane, and a is the eccentricity of the ellipse (if a<1), or determines the angle of the hyperbola (if a>1) or tells you the orbit is a parabola (a=1).
the only result you need is that
r(θ)=A1+acos(θ−θ0)
This gives you the solution of r as a function of θ, which gives the shape of the orbit. This is where textbooks stop.
Finding θ as a function of t
But you then want the solution for θ as a function of time, to get the r and θ as functions of time. This is determined from the area law, conceptually:
r2dθdt=L
dθ(1+acos(θ−θ0))2=LA2dt
and integrating this from time 0 to time t, tells you in principle what θ(t) is. The result can be written as:
F(a,θ)=F(a,θ0)+LA2t
Where F(a,θ) is the special function that gives you the area of a conic section of parameter a, in a wedge from the focus where one half-line is along the major axis, and the other half-line makes an angle θ with the first. This special function is not expressible in terms of elementary functions.
This function is defined by the integral above, and you can calculate it numerically using any numerical integration method. Finding this function, and inverting it, is the only difficult part of this problem. There are three limits which are necessary for perturbations:
- for a=0, F(0,θ)=θ
- for a=1, F(1,θ)=y4+y312 where y=rsinθ=sin(θ)1+cos(θ)
- for a=∞, F(a,θ)≈1atan(θ)
Each of these are elementary degenerations: the first is the circle, the second is the parabola, and the third is a straight line. The important thing is that each of these degenerations gives you x(t) and y(t) which are simple, and further, you can perturb around each of these three limits in a nice way. In the following, the parameter t is rescaled to absorb LA2
- circle: x(t)=cos(t) y(t)=sin(t)
- parabola: y(t)=(√1+36t2+6t)13−(√1+36t2−6t)13 x(t)=12−y22
- line: x(t)=1a, y(t)=t
The line and circle are obvious, the parabola is found by inverting the cubic for y as a function of t using the cubic equation.
Near the circle, time is periodic with the orbital period, which is the area inside the ellipse aA2 divided by the area sweep-rate L/2. So you have once-winding function from a circle to a circle, which can always be written as a Fourier series with a linear term, which is found from the power series of the integrand in a, integrated term by term. Near the straight-line hyperbola, you can similarly perturb in a series, and the only interesting degeneration is the parabola. Near the parabola, the perturbation theory is a little more complicated.
No comments:
Post a Comment