# PlanetPhysics/Simple Derivation of the Lorentz Transformation

\subsection{Simple Derivation of the Lorentz Transformation (Supplementary to Section 11)} From Relativity: The Special and General Theory by Albert Einstein For the relative orientation of the co-ordinate systems indicated in Fig. 2, the x-axes of both systems pernumently coincide. In the present case we can divide the problem into parts by considering first only events which are localised on the ${\displaystyle x}$-axis. Any such event is represented with respect to the co-ordinate system ${\displaystyle K}$ by the abscissa ${\displaystyle x}$ and the time ${\displaystyle t}$, and with respect to the system ${\displaystyle K^{1}}$ by the abscissa ${\displaystyle x'}$ and the time ${\displaystyle t'}$. We require to find ${\displaystyle x'}$ and ${\displaystyle t'}$ when ${\displaystyle x}$ and ${\displaystyle t}$ are given.

A light-signal, which is proceeding along the positive axis of ${\displaystyle x}$, is transmitted according to the equation

${\displaystyle x=ct}$ or

${\displaystyle (1)x-ct=0}$

Since the same light-signal has to be transmitted relative to ${\displaystyle K^{1}}$ with the velocity ${\displaystyle c}$, the propagation relative to the system ${\displaystyle K^{1}}$ will be represented by the analogous formula

${\displaystyle (2)x'-ct'=0}$

Those space-time points (events) which satisfy (24) must also satisfy (25). Obviously this will be the case when the relation

${\displaystyle (3)(x'-ct')=\lambda (x-ct)}$

\noindent is fulfilled in general, where ${\displaystyle \lambda }$ indicates a constant; for, according to (26), the disappearance of ${\displaystyle (x-ct)}$ involves the disappearance of ${\displaystyle (x'-ct')}$.

If we apply quite similar considerations to light rays which are being transmitted along the negative x-axis, we obtain the condition

${\displaystyle (4)(x'+ct')=\mu (x+ct)}$

By adding (or subtracting) equations (26) and (27), and introducing for convenience the constants ${\displaystyle a}$ and ${\displaystyle b}$ in place of the constants ${\displaystyle \lambda }$ and ${\displaystyle \mu }$, where

${\displaystyle a={\frac {\lambda +\mu }{2}}}$

\noindent and

${\displaystyle a={\frac {\lambda -\mu }{2}}}$

\noindent we obtain the equations

${\displaystyle (5)\left.{\begin{matrix}{rcl}x'&=&ax-bct\\ct'&=&act-bx\end{matrix}}\right\}}$

We should thus have the solution of our problem, if the constants ${\displaystyle a}$ and ${\displaystyle b}$ were known. These result from the following discussion.

For the origin of ${\displaystyle K^{1}}$ we have permanently ${\displaystyle x'=0}$, and hence according to the first of the equations (28)

${\displaystyle x={\frac {bc}{a}}t}$

If we call ${\displaystyle v}$ the velocity with which the origin of ${\displaystyle K^{1}}$ is moving relative to ${\displaystyle K}$, we then have

${\displaystyle (6)v={\frac {bc}{a}}}$

The same value ${\displaystyle v}$ can be obtained from equations (28), if we calculate the velocity of another point of ${\displaystyle K^{1}}$ relative to ${\displaystyle K}$, or the velocity (directed towards the negative ${\displaystyle x}$-axis) of a point of ${\displaystyle K}$ with respect to ${\displaystyle K'}$. In short, we can designate ${\displaystyle v}$ as the relative velocity of the two systems.

Furthermore, the principle of relativity teaches us that, as judged from ${\displaystyle K}$, the length of a unit measuring-rod which is at rest with reference to ${\displaystyle K^{1}}$ must be exactly the same as the length, as judged from ${\displaystyle K'}$, of a unit measuring-rod which is at rest relative to ${\displaystyle K}$. In order to see how the points of the ${\displaystyle x}$-axis appear as viewed from ${\displaystyle K}$, we only require to take a "snapshot" of ${\displaystyle K^{1}}$ from ${\displaystyle K}$; this means that we have to insert a particular value of ${\displaystyle t}$ (time of ${\displaystyle K}$), {\it e.g.} ${\displaystyle t=0}$. For this value of ${\displaystyle t}$ we then obtain from the first of the equations (5)

${\displaystyle x'=ax}$

Two points of the ${\displaystyle x'}$-axis which are separated by the distance ${\displaystyle \Delta x'=I}$ when measured in the ${\displaystyle K^{1}}$ system are thus separated in our instantaneous photograph by the distance

${\displaystyle (7)\Delta x={\frac {I}{a}}}$

\noindent But if the snapshot be taken from ${\displaystyle K'(t'=0)}$, and if we eliminate ${\displaystyle t}$ from the equations (28), taking into account the expression (29), we obtain

${\displaystyle x'=a\left(I-{\frac {v^{2}}{c^{2}}}\right)x}$

\noindent From this we conclude that two points on the ${\displaystyle x}$-axis separated by the distance ${\displaystyle I}$ (relative to ${\displaystyle K}$) will be represented on our snapshot by the distance

${\displaystyle \Delta x'=a\left(I-{\frac {v^{2}}{c^{2}}}\right)\quad .\quad .\quad .\quad {\mbox{(7a)}}}$

But from what has been said, the two snapshots must be identical; hence ${\displaystyle \Delta x}$ in (7) must be equal to ${\displaystyle \Delta x'}$ in (7a), so that we obtain

${\displaystyle a={\frac {I}{I-{\frac {v^{2}}{c^{2}}}}}\quad .\quad .\quad .\quad {\mbox{(7b)}}}$

The equations (29) and (7b) determine the constants ${\displaystyle a}$ and ${\displaystyle b}$. By inserting the values of these constants in (28), we obtain the first and the fourth of the equations given in section 11.

${\displaystyle (8)\left.{\begin{matrix}{rcl}x'&=&{\frac {x-vt}{\sqrt {I-{\frac {v^{2}}{c^{2}}}}}}\\~\\t'&=&{\frac {t-{\frac {v}{c^{2}}}x}{\sqrt {I-{\frac {v^{2}}{c^{2}}}}}}\end{matrix}}\right\}}$

Thus we have obtained The Lorentz transformation for events on the ${\displaystyle x}$-axis. It satisfies the condition

${\displaystyle x'^{2}-c^{2}t'^{2}=x^{2}-c^{2}t^{2}\quad .\quad .\quad .\quad {\mbox{(8a)}}}$

The extension of this result, to include events which take place outside the ${\displaystyle x}$-axis, is obtained by retaining equations (31) and supplementing them by the relations

${\displaystyle (9)\left.{\begin{matrix}{rcl}y'&=&y\\z'&=&z\end{matrix}}\right\}}$

In this way we satisfy the postulate of the constancy of the velocity of light in vacuo for rays of light of arbitrary direction, both for the system ${\displaystyle K}$ and for the system ${\displaystyle K'}$. This may be shown in the following manner.

We suppose a light-signal sent out from the origin of ${\displaystyle K}$ at the time [/itex]t = 0${\displaystyle .Itwillbepropagatedaccordingtotheequation$r={\sqrt {x^{2}+y^{2}+z^{2}}}=ct}$ \noindent or, if we square this equation, according to the equation ${\displaystyle (10)x^{2}+y^{2}+z^{2}=c^{2}t^{2}=0}$ It is required by the law of propagation of light, in conjunction with the postulate of relativity, that the transmission of the signal in question should take place---as judged from ${\displaystyle K^{1}}$---in accordance with the corresponding formula ${\displaystyle r'=ct'}$ \noindent or, ${\displaystyle x'^{2}+y'^{2}+z'^{2}-c^{2}t'^{2}=0\quad .\quad .\quad .\quad {\mbox{(10a)}}}$ In order that equation (10a) may be a consequence of equation (33), we must have ${\displaystyle (11)x'^{2}+y'^{2}+z'^{2}-c^{2}t'^{2}=\sigma (x^{2}+y^{2}+z^{2}-c^{2}t^{2})}$ Since equation (8a) must hold for points on the ${\displaystyle x}$-axis, we thus have$\sigma = I${\displaystyle .ItiseasilyseenthattheLorentztransformationreallysatisfiesequation(34)for[itex]\sigma =I}$; for (34) is a consequence of (8a) and (32), and hence also of (31) and (32). We have thus derived the Lorentz transformation.

The Lorentz transformation represented by (31) and (32) still requires to be generalised. Obviously it is immaterial whether the axes of ${\displaystyle K^{1}}$ be chosen so that they are spatially parallel to those of ${\displaystyle K}$. It is also not essential that the velocity of translation of ${\displaystyle K^{1}}$ with respect to ${\displaystyle K}$ should be in the direction of the ${\displaystyle x}$-axis. A simple consideration shows that we are able to construct the Lorentz transformation in this general sense from two kinds of transformations, {\it viz.} from Lorentz transformations in the special sense and from purely spatial transformations. which corresponds to the replacement of the rectangular co-ordinate system by a new system with its axes pointing in other directions.

Mathematically, we can characterise the generalised Lorentz transformation thus:

It expresses ${\displaystyle x',y',x',t'}$, in terms of linear homogeneous functions of ${\displaystyle x,y,x,t}$, of such a kind that the relation

${\displaystyle x'^{2}+y'^{2}+z'^{2}-c^{2}t'^{2}=x^{2}+y^{2}+z^{2}-c^{2}t^{2}\quad .\quad .\quad .\quad {\mbox{(11a)}}}$

\noindent is satisficd identically. That is to say: If we substitute their expressions in ${\displaystyle x,y,x,t}$, in place of ${\displaystyle x',y',x',t'}$, on the left-hand side, then the left-hand side of (11a) agrees with the right-hand side.

### References

This article is derived from the Einstein Reference Archive (marxists.org) 1999, 2002. Einstein Reference Archive which is under the FDL copyright.