I am looking for a general method to obtain derivative rules of a constrained matrix with respect to its matrix elements.
In the case of a symmetric matrix Sij (with Sij=Sji), one way to do that is the following (see Variation of the metric with respect to the metric). We say that a variation of a matrix element δSij is the same than that of δSji, and thus δSij=δSij+δSji2=δikδjl+δilδjk2δSkl=Sij;klδSkl.
I must admit that why this is the correct procedure is not quite clear to me (that seems to be quite arbitrary, although obviously it works to compute derivatives of a function of a symmetric matrix). This means that it is not clear to me how to generalize that when the constraint is different.
For example, let's take the set of matrices O belonging to the group SO(N). Is there a way to write δOijδOkl in terms of a tensor Bij;kl, with all the same nice properties ?
In the case of SO(2), this seems quite easy, since then Oji=(−1)i+jOij, and one finds in that case δOijδOkl=δikδjl+(−1)i+jδilδjk2,
Already in the case of SO(3), it does not seem to be easy to find the equivalent tensor...
Side note : using the defining property of SO(2), one can massage the formulas to obtain δOijδOkl=−OilOkj.
If anyone knows the standard procedure (if any exists) or a good reference, that would be greatly appreciated. In any case, a nice explanation (maybe a bit formal) in the case of symmetric matrix might also help me to get my head around the problem.
Answer
Setup. Let there be given an m-dimensional manifold M with coordinates (x1,…,xm). Let there be given an n-dimensional physical submanifold N with physical coordinates (y1,…,yn). Let there be given m−n independent constraints χ1(x) ≈ 0,…,χm−n(x) ≈ 0,
which defines the physical submanifold N. [Here the ≈ symbol means weak equality, i.e. equality modulo the constraints.] Assume that (y1,…,yn,χ1,…,χm−n)constitutes a coordinate system for the extended manifold M.Dirac derivative. In analogy with the Dirac bracket, let us introduce a Dirac derivative (∂∂xi)D := ∂∂xi−m−n∑a=1∂χa∂xi(∂∂χa)y = n∑α=1∂yα∂xi(∂∂yα)χ,i ∈ {1,…,m},
that projects onto the physical submanifold (∂∂xi)Dyα = ∂yα∂xi,(∂∂xi)Dχa = 0,i ∈ {1,…,m},α ∈ {1,…,n},a ∈ {1,…,m−n}.Remark. In many important cases it is possible to choose the physical coordinates (y1,…,yn) such that the Dirac derivative (4) can be written as linear combinations of unconstrained partial x-derivatives only, without referring to the (y,χ)-coordinate system (2), cf. eqs. (10) & (14) below.
Does Dirac derivatives commute? Does the commutator [(∂∂xi)D,(∂∂xj)D] = n∑α,β=1∂yα∂xi[(∂∂yα)χ,∂yβ∂xj](∂∂yβ)χ−(i↔j) ?≈ 0
vanishes weakly? Not necessarily. But if the coordinate transformation xi↔(yα,χa) is linear, then the Dirac derivatives commute.Example. Let the physical subspace be the hyperplane N={χ(x)=0} with the constraint χ = m∑i=1xi.
Define physical coordinates yα = xα−1mm∑i=1xi,α ∈ {1,…,n=m−1}.Conversely, xα = yα+1mχ,α ∈ {1,…,n},xm = −n∑β=1yβ+1mχ.The derivatives are related as ∂∂xα = (∂∂yα)χ−1mn∑β=1(∂∂yβ)χ+(∂∂χ)y,α ∈ {1,…,n},∂∂xm = −1mn∑β=1(∂∂yβ)χ+(∂∂χ)y.The Dirac derivative becomes after some algebra (∂∂xi)D = ∂∂xi−(∂∂χ)y = ∂∂xi−1mm∑j=1∂∂xj,i ∈ {1,…,m}.Example. Differentiation wrt. a symmetric matrix can be viewed as a Dirac differentiation (3), where the constraints (1) are given by antisymmetric matrices. Define s(ij) := Mij+Mji2anda(ij) := Mij−Mji2fori > j;andd(i) := Mii.
Conversely, Mij = θij(s(ij)+a(ij))+θji(s(ji)−a(ji))+δijd(i),where the discrete Heaviside step function θij here is assumed to obey θii=0 (no implicit sum). The derivatives are related as ∂∂Mij = θij2(∂∂s(ij)+∂∂a(ij))+θji2(∂∂s(ji)−∂∂a(ji))+δij∂∂d(i).The Dirac derivative becomes after some algebra (∂∂Mij)D = θij2∂∂s(ij)+θji2∂∂s(ji)+δij∂∂d(i) = 12(∂∂Mij+∂∂Mji).Remark. Additional complications arise if the coordinates and/or constraints are not globally defined. For starters, it is actually enough if (2) is a coordinate system in a tubular neighborhood of N.
Reparametrizations of the constraints. Assume that there exists a second coordinate system (˜y1,…,˜yn,˜χ1,…,˜χm−n)
(which we adorn with tildes), such that ˜yα = fα(y),˜χa = ga(y,χ) ≈ 0.This implies that (∂∂χa)y = (∂˜χb∂χa)y(∂∂˜χb)˜y,(∂∂yα)χ ≈ (∂˜yβ∂yα)χ(∂∂˜yβ)˜χ,i.e Δχ := span{(∂∂χ1)y,…,(∂∂χn−m)y} ⊆ TMis an involutive distribution, while Δy := span{(∂∂y1)χ,…,(∂∂yn)χ} ⊆ TMis a weak distribution.One may show that the Dirac derivative and its commutators (∂∂xi)∼D ≈ (∂∂xi)D,[(∂∂xi)∼D,(∂∂xj)∼D] ≈ [(∂∂xi)D,(∂∂xj)D],
[wrt. the tilde and the untilde coordinate systems (15) and (2), respectively] agree weakly. This shows that the Dirac derivative (3) is a geometric construction.Subsubmanifold. Given a p-dimensional physical subsubmanifold P with physical coordinates (z1,…,zp). Let there be given n−p independent constraints ϕ1(y) ≈ 0,…,ϕn−p(y) ≈ 0,
which defines the physical submanifold P. Assume that (z1,…,zp,ϕ1,…,ϕn−p)constitutes a coordinate system for the submanifold N. One may show that (∂∂xi)(P)D = (∂∂xi)(N)D−n−p∑a=1(∂ϕa∂xi)(N)D(∂∂ϕa)z,i ∈ {1,…,m}.This shows that the Dirac derivative construction behaves naturally wrt. further constraints.
No comments:
Post a Comment