Special relativity and the rod/slot paradox – XII – images of x,y-coordinate axes after Lorentz transformation to a diagonally moving frame

My post series about the 2-dimensional rod/slot paradox enters its last phase. We have learned that there is no paradox regarding two distinct scenarios – defined by different initial conditions:

Collision scenario: If we in a special reference frame set up initial conditions such that a collision occurs between the rod and the slot-surrounding plate, we will find a collision in reference frames attached to the two moving objects, too.
Transit scenario: If we set up initial conditions such that the rod can move through the slot at encounter time, we will get the same result in other reference frames, too. In particular in frames attached to the rod and the slot.

You find consistent descriptions of both scenarios in different frames in the previous posts:

However, there are still some open questions. In particular, I have not compared the two scenarios directly in some setup frame. And there are other interesting issues.

Regarding the forthcoming posts you should be familiar with the reference frames A, B, C and Z, which I have introduced to mathematically describe the observations of different observers. Remember also that reference frame B was attached to the slot and frame C was attached to the rod, which approached the slot on a diagonal path with constant relative velocity. You should also keep in mind the necessary steps of Lorentz Transformations [LTs] and how we ensured that events are measured simultaneously in the target frame.

Remaining tasks and challenges

Although we have found consistent descriptions of different observers for our two scenarios, we should nevertheless compare these scenarios a bit more directly. Just to get the differences in the initial conditions right in a selected frame. We have, e.g., not yet compared the conditions of the collision scenario with the conditions of the transit scenario in frame Z that we used to define the transit scenario with frame A. Or at least in a closely related frame. A direct comparison requires a LT of collision event data between frame A and Z (or a related frame).

In addition there are also some more aspects of the Lorentz Transformation which deserve a proper discussion. What we for example have not yet looked at are the images of the spatial coordinate axes after a LT to a diagonally moving reference frame. The rod in frame C was aligned with the x-axis there and appeared rotated from the perspective of the diagonally approaching slot. So this makes us suspect that the coordinate axes may appear rotated in some diagonally approaching frame.

If we compare some elementary conditions of the transit scenario in frame Z with the conditions for the collision scenario in A, we must conclude that the origin of a frame like Z moves diagonally relative to the origin of A. The reason is simply that Z's origin must follow both the horizontal movement of the slot and the vertical movement of the rod in A. So, to be able to compare the collision scenario with the transit scenario we should study diagonally moving frames more closely.

Therefore, another interesting question is how the LT from frame A to a diagonally moving frame such as Z can be described by a sequence of two consecutive LTs (more precisely boosts) along two perpendicular axes – e.g. first along the y-axis (from frame A to frame C) and afterward along the x-axis of C to frame Z. But there is an alternative sequence, namely from A to B along the common x-axes and then from B to Z along the y-axes. Will the alternative sequence of Lorentz boosts produce different results due to the known non-commutativity of non-collinear boosts? If so, what does this difference mean?
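The non-commutativity mentioned here can be checked numerically. The following is only a minimal sketch: it assumes units with c = 1, acts on column vectors (ct, x, y), and uses hypothetical sample values for the two boost parameters (they are not the velocities of our frames B, C or Z). The helper names are my own choice for illustration.

```python
import math

def gamma(beta):
    """Relativistic gamma factor for a dimensionless speed beta = v/c."""
    return 1.0 / math.sqrt(1.0 - beta * beta)

def boost_x(beta):
    """Pure boost along the x-axis, acting on (ct, x, y)."""
    g = gamma(beta)
    return [[g, -g * beta, 0.0],
            [-g * beta, g, 0.0],
            [0.0, 0.0, 1.0]]

def boost_y(beta):
    """Pure boost along the y-axis, acting on (ct, x, y)."""
    g = gamma(beta)
    return [[g, 0.0, -g * beta],
            [0.0, 1.0, 0.0],
            [-g * beta, 0.0, g]]

def matmul(a, b):
    """Product of two 3x3 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

# Hypothetical sample values (assumption, for illustration only).
beta_x, beta_y = 0.6, 0.3

# The matrix applied first stands on the right.
x_then_y = matmul(boost_y(beta_y), boost_x(beta_x))
y_then_x = matmul(boost_x(beta_x), boost_y(beta_y))

# Largest element-wise difference between the two composed transformations.
max_diff = max(abs(x_then_y[i][j] - y_then_x[i][j])
               for i in range(3) for j in range(3))
print("max. difference between the two orders:", max_diff)
```

The two products share the same time-time element (the product of the two γ-factors), but the spatial parts differ. Each product is the transpose of the other, because the single boost matrices are symmetric.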

To cover all these points I want to analyze the collision scenario now in a diagonally moving frame W. We will later see that such a frame corresponds indeed in a certain way to the frame Z, which we used for the analysis of the “transit scenario”. This will later allow for a direct comparison of the collision and the transit scenario in a common frame.

We define W as a frame whose origin moves diagonally with respect to frame A (and the standard Cartesian coordinate system there) used to define the collision scenario. We adjust W's path such that its origin keeps pace with the horizontal motion of the slot's left end and the vertical motion of the rod's left end. So, the origin of W will meet the origin of A when the left ends of rod and slot meet there, too. See the illustrations 1 and 2 below.

Our objectives for the forthcoming posts are:

1. We want to answer the following two questions: a) What happens to images of the x- and y-axes after a Lorentz Transformation to a diagonally moving target frame W? b) According to which criterion can we reasonably define the orientation of the x-axis of a diagonally moving target system? Do the transformed axes span a Cartesian coordinate system at all?
2. We want to find out how the initial conditions of the collision scenario transform from the setup frame A to a frame W. We should find some profound qualitative difference in comparison to the initial conditions of the transit scenario.
3. We have already learned that a Lorentz Transformation along the relative line of motion of our two physical objects is equivalent to a rotation and two boosts along different coordinate axes. On our way to analyzing a transformation from A to a diagonally moving frame W we want to investigate what the choice between the two possible orders of boosts along different coordinate axes means for the interpretation of the probably differing results. We will learn that in our case the difference corresponds to a choice of the orientation of the Cartesian coordinate system in the target frame. This may give you a new perspective on the non-commutativity of non-collinear Lorentz boosts (along different axes).

But let us start with introducing a new frame W moving diagonally versus frame A used to analyze the collision scenario. In this post we will try to find out what kind of inclination angles an observer in W observes for the Lorentz-transformed images of the standard x_A-axis and the y_A-axis of frame A versus the line of relative motion of W vs. A. I.e. we try to find answers for point 1 above.

We use the abbreviations :
CCS : Cartesian coordinate system. LT: Lorentz Transformation.

Introduction of a diagonally moving frame W

The frame W is depicted in the drawing below. In a later post we will see that we can indeed use it to define conditions as in the transit scenario. We assume that W co-moves horizontally with frame B (attached to the slot) versus frame A (velocity component ν_x in negative x-direction). We also assume that W moves vertically downward in parallel to frame C (attached to the rod) such that the origins of C and W are always at the same height.

A diagonally moving frame W to describe the collision scenario

Illustration 1: Introduction of a reference frame W which co-moves horizontally with B and vertically with C. Note that our physical objects – the rod and the slot – move in both frames.

The fact that the rod and the slot are moving in both frames A and W should make us somewhat careful regarding details of the Lorentz Transformation. We cannot apply a simple length contraction or its reverse. We have to take into account the 2-dimensional motion of both objects when determining the right events in A that lead to simultaneously perceived events in W.

Relative velocity between frames W and A and some respective formulas

We first gather some information and formulas which can be derived easily. For the presently given Cartesian coordinate system of frame A (see illustration 1) we find that the relative (diagonal) movement of the origin of frame W is defined by the constant components of the following velocity vector v_rel,w2a = (v_xW, v_yW):

\[ \begin{align} \pmb{v}_{rel, w2a} \,&=\, \left( \, v_{xW}, \, v_{yW} \, \right) \\[8pt] &=\, \left( \, - \,\nu_x, \,\, - \, v_y \, \right) \\[10pt] \Rightarrow v_{rel, w2a} \,&= \, || \pmb{v}_{rel,w2a} || \\[8pt] &=\, \sqrt{ \, \nu_x^2 \,+\, v_y^2\, } \,. \end{align} \]

Below, we often use the alternative and simpler notation

\[ v_{rel}\,=\, v_{rel,w2a} \,\,. \]

Relative velocity between frames W and A

Illustration 2: Relative velocity of the origin of frame W against the origin of frame A as seen in the original coordinate system of A. The vector has two components: –ν_x and –v_y (parallel to the respective axes of A).

This gives us corresponding relativistic β- and γ-factors of

\[ \begin{align} \beta_{rel}^2\,=\, \beta_{w2a}^2 \,&=\, { 1 \over c^2}\, \left(\, \nu_x^2 \,+\, v_y^2 \,\right) \, =\, \beta_x^2 \,+\, \beta_y^2 \,, \\[10pt] \gamma_{rel}^2 \,=\,\gamma_{w2a}^2 \,&=\, {1 \over 1 \,-\ \beta_{w2a}^2 } \,=\, {1 \over 1 \,-\ \beta_x^2 \,-\, \beta_y^2} \,, \end{align} \]

with

\[ \begin{align} \beta_x \,&=\, {\nu_x \over c} \,, \quad \gamma_x \,=\, \left[ \, 1\,-\, \beta_x^2 \, \right]^{-1/2} \,, \\[8pt] \beta_y \,&=\, {v_y \over c} \,, \quad \gamma_y \,=\, \left[ \, 1\,-\, \beta_y^2 \, \right]^{-1/2} \,. \end{align} \]

The constant velocity components in turn define the angle Ψ_WA between the line of relative motion and the x-axis in the original coordinate system of A:

\[\begin{align} \cos\left(\Psi_{WA} \right) \,&=\, {1 \over v_{rel} } * \nu_x \,, \\[8pt] \sin \left( \Psi_{WA} \right) \,&=\, {1 \over v_{rel} } * v_y \,. \end{align}\]
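For readers who like to play with numbers: the relations of this section can be collected in a few lines of Python. This is just a sketch under assumptions of mine: units with c = 1 by default, positive speed values ν_x and v_y (the signs of the components are handled separately in the text), and a hypothetical function name and sample values.

```python
import math

def relative_kinematics(nu_x, v_y, c=1.0):
    """Return v_rel, beta/gamma factors and the angle Psi_WA (in radians).

    nu_x, v_y: positive speeds of the slot (horizontal) and the rod
    (vertical) in frame A. The combined speed must stay below c.
    """
    v_rel = math.hypot(nu_x, v_y)          # sqrt(nu_x^2 + v_y^2)
    if v_rel >= c:
        raise ValueError("relative speed must be smaller than c")
    beta_x, beta_y = nu_x / c, v_y / c
    beta_rel = v_rel / c
    return {
        "v_rel": v_rel,
        "beta_rel": beta_rel,
        "gamma_rel": 1.0 / math.sqrt(1.0 - beta_rel ** 2),
        "gamma_x": 1.0 / math.sqrt(1.0 - beta_x ** 2),
        "gamma_y": 1.0 / math.sqrt(1.0 - beta_y ** 2),
        # cos(Psi_WA) = nu_x / v_rel and sin(Psi_WA) = v_y / v_rel
        "psi_wa": math.atan2(v_y, nu_x),
    }

# Hypothetical sample values (assumption): nu_x = 0.6 c, v_y = 0.3 c.
kin = relative_kinematics(0.6, 0.3)
for name, value in kin.items():
    print(f"{name:10s} = {value:.6f}")
```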

Rotated coordinate systems

In this and the following posts we take the freedom to change the spatial coordinate system for a selected reference frame without changing the name of the frame. This may deviate from conventions in some standard books on Special Relativity [SR]. Here, I always keep the origin and motion of a named frame, but allow for a rotation of the 2-dimensional spatial Cartesian coordinate system associated with it.

In particular, we will soon employ coordinate systems of A and W with the x-axes aligned with the line of relative motion between the origins of W and A. In the case of A this coordinate system will be rotated relative to the original coordinate system, which has an x-axis parallel to the slot’s orientation (and the x-axis of frame B). We use a subscript “rel” to refer to the axes of the rotated CCS whose x-axis is aligned with the line of relative motion of the frames.

Choice of a proper x-axis for the diagonally approaching frame W

We work with 2-dimensional Cartesian coordinate systems in this post series. An interesting aspect that comes into our present investigation is the following:

How do we get or define a proper x-axis of a coordinate system for frame W?

The answer is less trivial than you may think. Firstly, this point is a bit different from what we normally do with the orientation of (x,t)-axes and world lines in so-called Minkowski diagrams. Here we refer to the choice of the orientation of a moving 2-dimensional spatial coordinate system with respect to the Cartesian spatial x,y-axes of other reference frames or – much more meaningful – with respect to given physical and measurable quantities in a scenario of moving objects.

In all our previous considerations the target frames B and C had one coordinate axis aligned with one of the axes of the setup frame A. And the physical objects (rod and slot) moved along one of the coordinate axes of A. So, the orientation of the physical object in the target frame B (slot) or C (rod) determined the orientation of the x-axis of either B or C. We did not even have to think about other choices and restricted our efforts to determining the angle of those physically given x-axes relative to the line of motion between the reference frames.

But what happens if transformations from A to a diagonally moving target frame W lead to a rotation of both of our physical objects and in addition of the images of the coordinate axes of A? Given our experiences so far we cannot exclude different rotations of our physical objects during a transformation from A to W. The resulting angles of the objects with the line of relative motion may also be different from those of the images of the x_A- and the y_A-axis vs. the line of relative motion.

In the situation depicted above we have assumed a coordinate system of frame W where the x_W-axis is displayed as being parallel to the x_A-axis of A. This has a well defined meaning in A. But as all points on A's x_A-axis approach an observer in W with some horizontal and vertical velocity, the x_A-axis may be perceived in W as rotated against the relative line of motion of the frames’ origins by another angle than in A. If you go through post V again, you will see that we have found this before. But at that time we could always cling to the orientation of the physical object attached to the target system. Now, regarding W, it seems that we have an unexpected choice to make.

All in all it feels as if we lose some firm ground now – and as if the choice of the axes of the coordinate system in W is somewhat arbitrary with respect to the initial conditions in A. But, on the other hand, there is always the freedom of choosing the orientation of a 2-dimensional coordinate system in a two-dimensional Euclidean space. The special setup of our scenarios only made the choices easier before.

We will sort out the details later. For the time being let us do the only reasonable thing that the principles of SR allow us to rely on:

The line of motion of the origin of W is physically and mathematically well defined with respect to the origin of A in whatever chosen coordinate system of A. So, in A we can always safely switch to a rotated coordinate system with an x_rel,A -axis along the relative line of motion of W. The initial conditions in the unrotated original coordinate system of A precisely define all vector components in the rotated system. And in W the relative line of motion of A defines a proper orientation for an x_rel,W -axis there – whatever this may mean

concerning the possibly different orientations of the rod and the slot in W
and a maybe different orientation of the LT-image of A's x_A-axis as perceived in W.

I indicate this approach to cling to the line of relative motion schematically below:

Rotated coordinate systems of frames A and W

Illustration 3: Rotated coordinate systems of A and W having x-axes aligned with the line of relative motion.

Vector components in the rotated coordinate system with x-axis along the line of relative motion at t_A = 0

At time t_A = 0, we have the following situation in A:

Vector components in rotated coordinate system of A

Illustration 4: Conditions in frame A at t_A = 0 and components of position vectors and of velocity vectors in the rotated coordinate system aligned with the line of relative motion of W vs. A.

We can now calculate some components of given position vectors by their projection onto the line of relative motion and onto a line vertical to the relative velocity vector. I call the respective distances from the origin of A at time t_A = 0:

Rod: x_r,r,rel,A = x-coordinate of rod’s right endpoint with respect to the x_rel,A-axis,
Rod: y_r,r,rel,A = y-coordinate of rod’s right endpoint with respect to the y_rel,A-axis,
Slot: x_s,r,rel,A = x-coordinate of slot’s right endpoint with respect to the x_rel,A-axis,
Slot: y_s,r,rel,A = y-coordinate of slot’s right endpoint with respect to the y_rel,A-axis.

We get:

\[ \begin{align} x_{r,r,rel,A} \,&=\, +\, \Delta x_{r,A} \,=\, +\, L * \cos \left(|\Psi_{WA}|\right) \,, \\[8pt] y_{r,r,rel,A} \,&=\, - \, \Delta y_{r,A} \,=\,\, - \, L * \sin \left(|\Psi_{WA}|\right) \,, \\[10pt] x_{s,r,rel,A} \,&=\, +\, \Delta x_{s,A} \,=\, +\, {1 \over \gamma_x} * L * \cos \left(|\Psi_{WA}|\right) \,, \\[8pt] y_{s,r,rel,A} \,&=\, - \, \Delta y_{s,A} \,=\,\, - \, {1 \over \gamma_x} * L * \sin \left(|\Psi_{WA}|\right) \,. \end{align}\]

In a similar way we can interpret the velocities of the slot and the rod in A as velocity vectors ν_x and v_y . Then we get the following components of these vectors along and perpendicular to the line of relative motion of W vs. A:

ν_x: ν_x,x,rel,A = component of vector ν_x of slot’s movement along the x_rel,A-axis,
ν_x: ν_x,y,rel,A = component of vector ν_x of slot’s movement along the y_rel,A-axis,
v_y: v_y,x,rel,A = component of vector v_y of rod’s movement along the x_rel,A-axis,
v_y: v_y,y,rel,A = component of vector v_y of rod’s movement along the y_rel,A-axis.

We need these components because we later on have to cover the motion of the rod and the slot in the rotated coordinate systems – both in A and in W. We get:

\[ \begin{align} \nu_{x,x,rel,A} \,&=\, -\, \nu_x * \cos \left(|\Psi_{WA}|\right) \,, \\[8pt] \nu_{x,y,rel,A} \,&=\, - \, \nu_x * \sin \left(|\Psi_{WA}|\right) \,, \\[10pt] v_{y,x,rel,A} \,&=\, - \, v_y * \sin \left(|\Psi_{WA}|\right) \,, \\[8pt] v_{y,y,rel,A} \,&=\, - \, v_y * \cos \left(|\Psi_{WA}|\right) \,. \end{align}\]

Transformation of points on the x_A-axis to frame W and resulting inclination of the image x_A,W of x_A in W with respect to the line of relative motion

After having defined some elementary relations above we now tackle our first objective: We apply the Lorentz Transformation between A and W to selected points on the standard x_A-axis at t_A = t_W = 0. Applied to all points of x_A the LT gives us an image x_A,W of the line x_A in W. As we deal with linear operations, the analysis of angles requires looking at only a few selected points on the line. One of these points can be the origin at t_A = t_W = 0.

The LT will enable us to find the angle between the image x_A,W of the x_A-axis and the x_rel,W -axis in the spatial coordinate system of W. This angle will be different from the respective angle Ψ_WA found in A.

Let us take a special point on the x_A-axis with distance 1 to the origin at t_A = 0.

This point does not move in A. (Its world line in a Minkowski diagram would be a straight vertical line.) Of course, none of the points on the x_A-axis moves in A. Events fixed to these points do not change their spatial coordinates with time. Neither do their projections onto the line of motion of W vs. A and their projections perpendicular to this line. We name the coordinates of our special point in the rotated CCS of A x_xA1,rel,A and y_xA1,rel,A:

Projections of the point (1, 0), i.e. the point on the x_A-axis with distance “1” to the origin of A:

\[ \begin{align} x_{xA1,rel,A}\,&= \, \cos\left( \Psi_{WA} \right) \,, \\[8pt] y_{xA1,rel,A}\,&= \, -\, \sin\left( \Psi_{WA} \right) \,. \end{align}\]

These are the spatial coordinates of respective events on the world line of the selected point. Let us transform them via a LT along the line of relative motion to respective coordinates in W (with respect to the rotated 2-dim CCS there). This LT is controlled by the relative velocity v_rel,w2a and respective relativistic factors β_w2a and γ_w2a.

I jump over intermediate steps of the LT and use a length contraction of the length interval [0, x_xA1,rel,A] (in this case we can use the length contraction due to a non-moving object in A; see previous posts).

\[ \begin{align} x_{xA1, rel, W} \,&=\, {1 \over \gamma_{rel} } \, x_{xA1,rel,A} \\[8pt] &=\, {1 \over \gamma_{rel} } * \cos\left(\Psi_{WA}\right) \,. \end{align} \]
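For completeness, here is a short sketch of the skipped steps. It assumes a standard boost along the x_rel-axis with speed v_rel and uses the requirement that the position is measured at t_W = 0 in W. The upper or lower sign applies depending on the chosen orientation of the x_rel-axis relative to the direction of motion; the result does not depend on it:

\[ \begin{align} x_{xA1,rel,W} \,&=\, \gamma_{rel} \left( \, x_{xA1,rel,A} \,\mp\, v_{rel} \, t_A \, \right) \,, \\[8pt] t_W \,&=\, \gamma_{rel} \left( \, t_A \,\mp\, {v_{rel} \over c^2} \, x_{xA1,rel,A} \, \right) \,=\, 0 \,\, \Rightarrow \,\, t_A \,=\, \pm \, {v_{rel} \over c^2} \, x_{xA1,rel,A} \,, \\[10pt] \Rightarrow \, x_{xA1,rel,W} \,&=\, \gamma_{rel} \left( \, 1 \,-\, \beta_{rel}^2 \, \right) \, x_{xA1,rel,A} \,=\, {1 \over \gamma_{rel} } \, x_{xA1,rel,A} \,. \end{align} \]

As the point does not move in A, the event on its world line selected by t_W = 0 has the same spatial coordinates in A as any other event on this world line. This is why the simple length contraction formula suffices here.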

The component perpendicular to the line of relative motion does not change during a respective LT :

\[ \begin{align} y_{xA1, rel, W} \,&=\, y_{xA1,rel,A} \\[8pt] &=\, - \, \sin\left(\Psi_{WA}\right) \,. \end{align} \]

From this we get the distance L_xA1,W of our selected point from the origin as perceived in W:

\[ \begin{align} L_{xA1,W}^2 \,&=\, \, {1 \over \gamma_{w2a}^2 } * \cos^2 \left(\Psi_{WA}\right) \,+\, \sin^2 \left(\Psi_{WA}\right) \\[8pt] &=\, \left(1 \,-\, \beta_x^2 \,-\, \beta_y^2\right) \, {\nu_x^2 \over \nu_x^2 \,+\, v_y^2} \,+\, {v_y^2 \over \nu_x^2 \,+\, v_y^2} \\[8pt] &=\, 1 \,-\, \beta_x^2 \,\, \Rightarrow \\[8pt] L_{xA1,W} \,&=\, {1\over \gamma_x } \,. \end{align} \]

Not totally unexpected! Can you guess why?

This tells us that in W the angle Ψ_xA,W between the image x_A,W of the original x_A-axis and the line of relative motion is given by:

\[\begin{align} \cos \left( |\Psi_{xA,W}| \right) \, &=\, { x_{xA1, rel, W} \over L_{xA1,W} } \,=\, {\gamma_x \over \gamma_{rel} } * {\beta_x \over \beta_{rel} } \\[8pt] &=\, {\gamma_x \over \gamma_{rel} } * \cos \left(|\Psi_{WA}|\right) \Rightarrow \\[8pt] \cos \left( |\Psi_{xA,W} |\right) \, &\lt \, \cos \left(|\Psi_{WA}|\right) \Rightarrow \\[8pt] |\Psi_{xA,W} |\,&\gt\, |\Psi_{WA} |\,. \end{align} \]
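A quick numerical cross-check of the image of the point (1, 0) may be helpful. The sketch below assumes units with c = 1 and hypothetical sample speeds; the variable names are mine. It starts from the transformed coordinates (contracted x_rel-component, unchanged y_rel-component) and derives the length and the angle from them instead of using the closed formulas.

```python
import math

# Hypothetical sample speeds in units of c (assumption for illustration).
nu_x, v_y = 0.6, 0.3

v_rel = math.hypot(nu_x, v_y)
gamma_rel = 1.0 / math.sqrt(1.0 - v_rel ** 2)
gamma_x = 1.0 / math.sqrt(1.0 - nu_x ** 2)
psi_wa = math.atan2(v_y, nu_x)      # angle of the x_A-axis vs. the line of motion in A

# Coordinates of the point (1, 0) in the rotated CCS of W:
x_w = math.cos(psi_wa) / gamma_rel  # contracted along the line of relative motion
y_w = -math.sin(psi_wa)             # perpendicular component is unchanged

length_w = math.hypot(x_w, y_w)           # distance from the origin as perceived in W
psi_xa_w = math.acos(x_w / length_w)      # |Psi_xA,W|

print("L_xA1,W      =", length_w, " vs. 1/gamma_x =", 1.0 / gamma_x)
print("|Psi_WA|     =", math.degrees(psi_wa), "deg")
print("|Psi_xA,W|   =", math.degrees(psi_xa_w), "deg")
```

For any admissible pair of speeds with v_y > 0 the computed length should agree with 1/γ_x and the angle in W should come out larger than the angle in A.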

Transformation of points on the y_A-axis to frame W and resulting inclination of the image of y_A in W with respect to the line of relative motion

When we perform similar calculations for a point with distance 1 on the y_A-axis we get:

\[\begin{align} x_{yA1, rel, A} \,&=\, - \, \sin\left(\Psi_{WA}\right) \,, \\[8pt] y_{yA1, rel, A} \,&=\, + \, \cos\left(\Psi_{WA}\right) \,, \\[10pt] x_{yA1, rel, W} \,&=\, - \, {1 \over \gamma_{rel}} \, \sin\left(\Psi_{WA}\right) \,, \\[8pt] y_{yA1, rel, W} \,&=\, + \, \cos\left(\Psi_{WA}\right) \,, \\[10pt] L_{yA1,W} \,&=\, {1 \over \gamma_y} \,. \end{align} \]

Calling

the angle between the y_A-axis and the line of relative motion Ψ_y,A in A
and between the image y_A,W of the y_A-axis and the line of relative motion Ψ_yA,W in W,

we get:

\[\begin{align} \cos \left( |\Psi_{yA,W}| \right) \, &={\gamma_y \over \gamma_{rel} } * \sin \left(|\Psi_{WA}|\right)\\[8pt] &=\, {\gamma_y \over \gamma_{rel} } * \cos \left(|\Psi_{y,A}|\right) \Rightarrow \\[10pt] |\Psi_{yA,W}| \,&\gt\, |\Psi_{y,A}| \,. \end{align} \]

All in all we find that for our (diagonal) relative motion of W versus A the angles of the LT-images of the original coordinate axes of A with the line of motion in W are bigger than the angles of the original axes with the line of relative motion in A. And the images of the axes do not enclose an angle of 90° in W, but a larger angle Ψ_xyA,W.

Angle Ψ_xyA,W between the images of the coordinate axes of A in W:

\[ |\Psi_{xyA,W}| \,=\, |\Psi_{xA,W} \,+\, \Psi_{yA,W}| \,\gt\, \pi/2 \,. \]
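The widening of the angle can be made tangible with a small helper that evaluates the two cosine formulas derived above. It is a sketch only: the function name and the sample speeds are hypothetical, c = 1 is assumed, and only absolute values of the angles are computed.

```python
import math

def axis_image_angles(nu_x, v_y, c=1.0):
    """Absolute angles (radians) of the images of the x_A- and y_A-axes
    vs. the line of relative motion in W, plus the angle between the images."""
    beta_x, beta_y = nu_x / c, v_y / c
    gamma_x = 1.0 / math.sqrt(1.0 - beta_x ** 2)
    gamma_y = 1.0 / math.sqrt(1.0 - beta_y ** 2)
    gamma_rel = 1.0 / math.sqrt(1.0 - beta_x ** 2 - beta_y ** 2)
    psi_wa = math.atan2(v_y, nu_x)
    # cos|Psi_xA,W| = (gamma_x / gamma_rel) * cos|Psi_WA|
    psi_xa_w = math.acos(gamma_x / gamma_rel * math.cos(psi_wa))
    # cos|Psi_yA,W| = (gamma_y / gamma_rel) * sin|Psi_WA|
    psi_ya_w = math.acos(gamma_y / gamma_rel * math.sin(psi_wa))
    return psi_xa_w, psi_ya_w, psi_xa_w + psi_ya_w

# Hypothetical sample speeds in units of c (assumption).
psi_x, psi_y, psi_xy = axis_image_angles(0.6, 0.3)
print("angle between the axis images in W [deg]:", math.degrees(psi_xy))

# For small speeds the angle should approach 90 degrees again.
_, _, psi_xy_slow = axis_image_angles(0.006, 0.003)
print("same for 1/100 of the speeds       [deg]:", math.degrees(psi_xy_slow))
```

The second call illustrates that the deviation from 90° is a genuinely relativistic effect: it shrinks quickly when both velocity components become small compared to c.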

Funny, isn’t it?
Note that while our frame W was chosen with respect to physical conditions of our scenario this general qualitative result would hold for other diagonally approaching frames, too.

I have schematically depicted differences in the angles below. The change in angles is exaggerated.

Images of coordinate axes after a Lorentz Transformation

Illustration 5: Schematic illustration of angles in A and the widening of the angles of the images x_A,W and y_A,W of the axes x_A and y_A by the LT from A to W. The change of angles depends on the velocities ν_x and v_y. The differences in the angles are exaggerated.

So, the images of the x_A- and the y_A-axes form no Cartesian coordinate system in W! This makes them a poor choice for the yet to be chosen axes x_W and y_W to cover our 2-dimensional Euclidean space in W.

What can we do? Well, we could choose the image x_A,W of the x_A-axis as our x_W-axis and define a y_W-axis perpendicular to it. This is an interesting approach. But, in my opinion, a better and physically more solid approach is the following:

Calculate the inclination of the slot (or the rod) in W vs. the x_rel,W-axis of the rotated coordinate system of W via the simple LT-boost defined by v_rel,w2a.
Define the x_W -axis as aligned with the orientation of the slot (or the rod) in W.

This does not exclude the possibility that one of the physical objects may have the same orientation as the image x_A,W at t_W = 0 in W. We might think of the slot. But only a full LT along the line of relative motion can prove it …

Conclusion

In this post we have changed our perspective on the rod/slot paradox once again. We introduced a diagonally moving frame W. We hope that we can later use it to define a transit scenario there for a direct comparison of the initial conditions of both scenarios in a common reference frame.

We have started to analyze the collision scenario in W to understand the consequences of the LT to a diagonally moving frame a bit better. A first new insight was that the images of the axes of frame A get different angles in W versus the relative line of motion of W vs. A than in frame A. The images of the axes are not perpendicular to each other. Their relative angle is widened by the LT to a diagonally approaching frame.

We may therefore be forced to determine the orientation of one of the physical objects (rod, slot) in W and maybe define a suitable x_W-axis aligned with the selected object. And define a related perpendicular y_W-axis accordingly. Another very reasonable approach would be to first work with rotated coordinate systems in A and in W which are aligned with the line of relative motion of the origins of both frames versus each other.

This is the topic of the next post.

Special relativity and the rod/slot paradox – XIII – Lorentz transformation of the collision scenario to a diagonally moving frame

Stay tuned …