Relatividad especial

En física , la teoría especial de la relatividad , o relatividad especial para abreviar, es una teoría científica de la relación entre el espacio y el tiempo . En el artículo de Albert Einstein de 1905, Sobre la electrodinámica de los cuerpos en movimiento , la teoría se presenta como basada en solo dos postulados : ^{[p 1]}^[1]^[2]

Las leyes de la física son invariantes (idénticas) en todos los marcos de referencia inerciales (es decir, marcos de referencia sin aceleración ). Esto se conoce como el principio de relatividad .
La velocidad de la luz en el vacío es la misma para todos los observadores, independientemente del movimiento de la fuente de luz o del observador. Esto se conoce como el principio de constancia de la luz o el principio de invariancia de la velocidad de la luz.

El primer postulado fue formulado por primera vez por Galileo Galilei (véase Invariancia galileana ).

Orígenes y significado

La relatividad especial fue descrita por Albert Einstein en un artículo publicado el 26 de septiembre de 1905 titulado "Sobre la electrodinámica de los cuerpos en movimiento". ^{[p 1]} Las ecuaciones de Maxwell del electromagnetismo parecían ser incompatibles con la mecánica newtoniana , y el experimento de Michelson-Morley no logró detectar el movimiento de la Tierra frente al hipotético éter luminífero . Esto condujo al desarrollo de las transformaciones de Lorentz , por Hendrik Lorentz , que ajustan las distancias y los tiempos para los objetos en movimiento. La relatividad especial corrige las leyes de la mecánica hasta entonces vigentes para manejar situaciones que involucran todos los movimientos y especialmente aquellos a una velocidad cercana a la de la luz (conocida comovelocidades relativistas ). Hoy en día, se ha demostrado que la relatividad especial es el modelo más preciso del movimiento a cualquier velocidad cuando los efectos gravitacionales y cuánticos son insignificantes.^[3]^[4]Aun así, el modelo newtoniano sigue siendo válido como una aproximación simple y precisa a bajas velocidades (relativas a la velocidad de la luz), por ejemplo, los movimientos cotidianos en la Tierra.

La relatividad especial tiene una amplia gama de consecuencias que han sido verificadas experimentalmente. ^[5] Incluyen la relatividad de la simultaneidad , la contracción de la longitud , la dilatación del tiempo , la fórmula relativista de adición de velocidad, el efecto Doppler relativista , la masa relativista , un límite de velocidad universal , la equivalencia masa-energía , la velocidad de causalidad y la precesión de Thomas . ^[1]^[2] Por ejemplo, ha reemplazado la noción convencional de un tiempo universal absoluto con la noción de un tiempo que depende del marco de referencia y la posición espacial . En lugar de un intervalo de tiempo invariante entre dos eventos, hay un intervalo de espacio-tiempo invariante . Combinados con otras leyes de la física, los dos postulados de la relatividad especial predicen la equivalencia de masa y energía , como se expresa en la fórmula de equivalencia masa-energía , donde es la velocidad de la luz en el vacío. ^[6]^[7] También explica cómo se relacionan los fenómenos de la electricidad y el magnetismo. ^[1]^[2] $E=mc^{2}$ $c$

Una característica definitoria de la relatividad especial es la sustitución de las transformaciones galileanas de la mecánica newtoniana por las transformaciones de Lorentz . El tiempo y el espacio no pueden definirse por separado (como se creía anteriormente). Más bien, el espacio y el tiempo están entrelazados en un único continuo conocido como "espacio-tiempo" . Los acontecimientos que ocurren al mismo tiempo para un observador pueden ocurrir en momentos diferentes para otro.

Hasta varios años después, cuando Einstein desarrolló la relatividad general , que introdujo un espacio-tiempo curvado para incorporar la gravedad, no se utilizó la frase "relatividad especial". Una traducción que a veces se utiliza es "relatividad restringida"; "especial" en realidad significa "caso especial". ^{[p 2]}^{[p 3]}^{[p 4]}^{[nota 1]} Parte del trabajo de Albert Einstein en relatividad especial se basa en el trabajo anterior de Hendrik Lorentz y Henri Poincaré . La teoría se completó esencialmente en 1907, con los artículos de Hermann Minkowski sobre el espacio-tiempo. ^[4]

La teoría es "especial" en el sentido de que sólo se aplica en el caso especial en el que el espacio-tiempo es "plano", es decir, donde la curvatura del espacio-tiempo (una consecuencia del tensor de energía-momento y que representa la gravedad ) es despreciable. ^[8]^{[nota 2]} Para dar cabida correctamente a la gravedad, Einstein formuló la relatividad general en 1915. La relatividad especial, al contrario de algunas descripciones históricas, da cabida tanto a las aceleraciones como a los marcos de referencia acelerados . ^[9]^[10]

Así como ahora se acepta que la relatividad galileana es una aproximación de la relatividad especial que es válida para velocidades bajas, la relatividad especial se considera una aproximación de la relatividad general que es válida para campos gravitatorios débiles , es decir, a una escala suficientemente pequeña (por ejemplo, cuando las fuerzas de marea son despreciables) y en condiciones de caída libre . Pero la relatividad general incorpora geometría no euclidiana para representar los efectos gravitatorios como la curvatura geométrica del espacio-tiempo. La relatividad especial está restringida al espacio-tiempo plano conocido como espacio de Minkowski . Mientras el universo pueda modelarse como una variedad pseudo-riemanniana , se puede definir un marco invariante de Lorentz que respete la relatividad especial para una vecindad suficientemente pequeña de cada punto en este espacio-tiempo curvado .

Galileo Galilei ya había postulado que no existe un estado de reposo absoluto y bien definido (no hay marcos de referencia privilegiados ), un principio que ahora se denomina principio de relatividad de Galileo . Einstein amplió este principio de modo que explicara la velocidad constante de la luz, ^[11] un fenómeno que se había observado en el experimento de Michelson-Morley. También postuló que se cumple para todas las leyes de la física , incluidas tanto las leyes de la mecánica como las de la electrodinámica . ^[12]

El enfoque tradicional de los "dos postulados" para la relatividad especial

"Reflexiones de este tipo me hicieron comprender, ya poco después de 1900, es decir, poco después del trabajo pionero de Planck, que ni la mecánica ni la electrodinámica podían (salvo en casos limitados) reivindicar una validez exacta. Poco a poco fui perdiendo la esperanza de poder descubrir las leyes verdaderas mediante esfuerzos constructivos basados en hechos conocidos. Cuanto más lo intentaba y más desesperadamente, más llegaba a la convicción de que sólo el descubrimiento de un principio formal universal podía llevarnos a resultados seguros... ¿Cómo, entonces, podría encontrarse un principio tan universal?"

Albert Einstein: Notas autobiográficas ^{[p 5]}

Einstein distinguió dos proposiciones fundamentales que parecían ser las más seguras, independientemente de la validez exacta de las leyes (entonces) conocidas de la mecánica o la electrodinámica. Estas proposiciones eran la constancia de la velocidad de la luz en el vacío y la independencia de las leyes físicas (especialmente la constancia de la velocidad de la luz) de la elección del sistema inercial. En su presentación inicial de la relatividad especial en 1905, expresó estos postulados como: ^{[p 1]}

El principio de relatividad : las leyes por las cuales los estados de los sistemas físicos experimentan cambios no se ven afectadas, ya sea que estos cambios de estado se refieran a uno u otro de dos sistemas en movimiento de traslación uniforme uno con respecto al otro. ^{[p 1]}
Principio de invariabilidad de la velocidad de la luz: "... la luz se propaga siempre en el espacio vacío con una velocidad definida [rapidez] c , que es independiente del estado de movimiento del cuerpo emisor" (del prefacio). ^{[p. 1]} Es decir, la luz en el vacío se propaga con la velocidad c (una constante fija, independiente de la dirección) en al menos un sistema de coordenadas inerciales (el "sistema estacionario"), independientemente del estado de movimiento de la fuente de luz.

La constancia de la velocidad de la luz fue motivada por la teoría del electromagnetismo de Maxwell ^[13] y la falta de evidencia del éter luminífero . ^[14] Hay evidencia contradictoria sobre el grado en que Einstein fue influenciado por el resultado nulo del experimento de Michelson-Morley. ^[15]^[16] En cualquier caso, el resultado nulo del experimento de Michelson-Morley ayudó a que la noción de la constancia de la velocidad de la luz ganara una aceptación generalizada y rápida.

La derivación de la relatividad especial depende no sólo de estos dos postulados explícitos, sino también de varias suposiciones tácitas ( hechas en casi todas las teorías de la física ), incluyendo la isotropía y homogeneidad del espacio y la independencia de las varas de medición y los relojes de su historia pasada. ^{[p 6]}

Tras la presentación original de la relatividad especial por parte de Einstein en 1905, se han propuesto muchos conjuntos diferentes de postulados en diversas derivaciones alternativas. ^[17] Pero el conjunto de postulados más común sigue siendo el empleado por Einstein en su artículo original. Una declaración más matemática del principio de relatividad hecha posteriormente por Einstein, que introduce el concepto de simplicidad no mencionado anteriormente, es:

Principio especial de relatividad : Si se elige un sistema de coordenadas K de modo que, en relación con él, las leyes físicas se cumplan en su forma más simple, las mismas leyes se cumplen en relación con cualquier otro sistema de coordenadas K ′ que se mueva en traslación uniforme con respecto a K . ^[18]

Henri Poincaré proporcionó el marco matemático para la teoría de la relatividad al demostrar que las transformaciones de Lorentz son un subconjunto de su grupo de Poincaré de transformaciones de simetría. Einstein derivó posteriormente estas transformaciones a partir de sus axiomas.

Muchos de los artículos de Einstein presentan derivaciones de la transformación de Lorentz basadas en estos dos principios. ^{[p 7]}

Principio de relatividad

Marcos de referencia y movimiento relativo

Los marcos de referencia desempeñan un papel crucial en la teoría de la relatividad. El término marco de referencia tal como se utiliza aquí es una perspectiva de observación en el espacio que no sufre ningún cambio de movimiento (aceleración), desde el que se puede medir una posición a lo largo de tres ejes espaciales (es decir, en reposo o a velocidad constante). Además, un marco de referencia tiene la capacidad de determinar mediciones del tiempo de los eventos utilizando un "reloj" (cualquier dispositivo de referencia con periodicidad uniforme).

Un evento es un suceso al que se le puede asignar un único momento y ubicación en el espacio en relación con un marco de referencia: es un "punto" en el espacio-tiempo . Dado que la velocidad de la luz es constante en relatividad independientemente del marco de referencia, los pulsos de luz se pueden utilizar para medir distancias de forma inequívoca y hacer referencia al reloj a los momentos en que ocurrieron los eventos, aunque la luz tarda en llegar al reloj después de que el evento haya ocurrido.

Por ejemplo, la explosión de un petardo puede considerarse un "acontecimiento". Podemos especificar completamente un acontecimiento mediante sus cuatro coordenadas espacio-temporales: el momento en que se produce y su ubicación espacial tridimensional definen un punto de referencia. Llamemos a este marco de referencia S .

En la teoría de la relatividad, a menudo queremos calcular las coordenadas de un evento a partir de diferentes sistemas de referencia. Las ecuaciones que relacionan las mediciones realizadas en diferentes sistemas se denominan ecuaciones de transformación .

Configuración estándar

Para comprender mejor cómo se comparan entre sí las coordenadas del espacio-tiempo medidas por observadores en diferentes marcos de referencia , es útil trabajar con una configuración simplificada con marcos en una configuración estándar . ^[19]^{: 107} Con cuidado, esto permite simplificar las matemáticas sin pérdida de generalidad en las conclusiones a las que se llega. En la Fig. 2-1, se muestran dos marcos de referencia galileanos (es decir, marcos convencionales de 3 espacios) en movimiento relativo. El marco S pertenece a un primer observador O , y el marco S ′ (pronunciado "S prima" o "S guión") pertenece a un segundo observador O ′ .

Los ejes x , y , z del marco S están orientados paralelos a los respectivos ejes primos del marco S ′ .
El marco S ′ se mueve, para simplificar, en una sola dirección: la dirección x del marco S con una velocidad constante v medida en el marco S .
Los orígenes de los marcos S y S ′ son coincidentes cuando el tiempo t = 0 para el marco S y t ′ = 0 para el marco S ′ .

Como no existe un sistema de referencia absoluto en la teoría de la relatividad, no existe estrictamente el concepto de "movimiento", ya que todo puede estar moviéndose con respecto a algún otro sistema de referencia. En cambio, se dice que dos sistemas cualesquiera que se muevan a la misma velocidad en la misma dirección están en co-movimiento . Por lo tanto, S y S ′ no están en co-movimiento .

Falta de un marco de referencia absoluto

El principio de relatividad , que establece que las leyes físicas tienen la misma forma en cada sistema de referencia inercial , se remonta a Galileo y fue incorporado a la física newtoniana. Pero a finales del siglo XIX, la existencia de ondas electromagnéticas llevó a algunos físicos a sugerir que el universo estaba lleno de una sustancia que llamaron " éter ", que, según postularon, actuaría como el medio a través del cual se propagarían estas ondas o vibraciones (en muchos aspectos de manera similar a la forma en que el sonido se propaga a través del aire). Se pensaba que el éter era un sistema de referencia absoluto contra el cual se podían medir todas las velocidades, y podía considerarse fijo e inmóvil en relación con la Tierra o algún otro punto de referencia fijo. Se suponía que el éter era lo suficientemente elástico como para soportar ondas electromagnéticas, mientras que esas ondas podían interactuar con la materia, pero no ofrecer resistencia a los cuerpos que pasaban a través de él (su única propiedad era que permitía que las ondas electromagnéticas se propagaran). Los resultados de varios experimentos, incluido el experimento de Michelson-Morley en 1887 (posteriormente verificado con experimentos más precisos e innovadores), condujeron a la teoría de la relatividad especial, al demostrar que el éter no existía. ^[20] La solución de Einstein fue descartar la noción de éter y el estado de reposo absoluto. En relatividad, cualquier sistema de referencia que se mueva con un movimiento uniforme observará las mismas leyes de la física. En particular, la velocidad de la luz en el vacío siempre se mide como c , incluso cuando se mide por múltiples sistemas que se mueven a velocidades diferentes (pero constantes).

Relatividad sin el segundo postulado

A partir del principio de relatividad únicamente y sin asumir la constancia de la velocidad de la luz (es decir, utilizando la isotropía del espacio y la simetría implícita en el principio de relatividad especial), se puede demostrar que las transformaciones del espacio-tiempo entre sistemas inerciales son euclidianas, galileanas o lorentzianas. En el caso lorentziano, se puede obtener la conservación del intervalo relativista y una cierta velocidad límite finita. Los experimentos sugieren que esta velocidad es la velocidad de la luz en el vacío. ^{[p 8]}^[21]

La invariancia de Lorentz como núcleo esencial de la relatividad especial

Enfoques alternativos a la relatividad especial

Einstein basó sistemáticamente la derivación de la invariancia de Lorentz (el núcleo esencial de la relatividad especial) únicamente en los dos principios básicos de la relatividad y la invariancia de la velocidad de la luz. Escribió:

La idea fundamental de la teoría especial de la relatividad es la siguiente: los supuestos de relatividad e invariancia de la velocidad de la luz son compatibles si se postulan relaciones de un nuevo tipo ("transformación de Lorentz") para la conversión de coordenadas y tiempos de eventos... El principio universal de la teoría especial de la relatividad está contenido en el postulado: las leyes de la física son invariantes con respecto a las transformaciones de Lorentz (para la transición de un sistema inercial a cualquier otro sistema inercial elegido arbitrariamente). Este es un principio restrictivo para las leyes naturales... ^{[p 5]}

Por ello, muchos tratamientos modernos de la relatividad especial la basan en el postulado único de la covariancia universal de Lorentz o, equivalentemente, en el postulado único del espacio-tiempo de Minkowski . ^{[p 9]}^{[p 10]}

En lugar de considerar la covariancia universal de Lorentz como un principio derivado, este artículo la considera como el postulado fundamental de la relatividad especial. El enfoque tradicional de dos postulados para la relatividad especial se presenta en innumerables libros de texto universitarios y presentaciones populares. ^[22] Los libros de texto que comienzan con el postulado único del espacio-tiempo de Minkowski incluyen los de Taylor y Wheeler ^[11] y los de Callahan. ^[23] Este es también el enfoque seguido por los artículos de Wikipedia Espacio-tiempo y Diagrama de Minkowski .

Transformación de Lorentz y su inversa

Definamos un evento con coordenadas espaciotemporales ( t , x , y , z ) en el sistema S y ( t ′ , x ′ , y ′ , z ′ ) en un marco de referencia que se mueve a una velocidad v en el eje x con respecto a ese marco, S ′ . Luego, la transformación de Lorentz especifica que estas coordenadas están relacionadas de la siguiente manera: donde es el factor de Lorentz y c es la velocidad de la luz en el vacío, y la velocidad v de S ′ , relativa a S , es paralela al eje x . Para simplificar, las coordenadas y y z no se ven afectadas; solo se transforman las coordenadas x y t . Estas transformaciones de Lorentz forman un grupo de un parámetro de aplicaciones lineales , cuyo parámetro se denomina rapidez . ${\begin{aligned}t'&=\gamma \ (t-vx/c^{2})\\x'&=\gamma \ (x-vt)\\y'&=y\\z'&=z,\end{aligned}}$ $\gamma ={\frac {1}{\sqrt {1-v^{2}/c^{2}}}}$

Resolviendo las cuatro ecuaciones de transformación anteriores para las coordenadas no primas se obtiene la transformación de Lorentz inversa: ${\begin{aligned}t&=\gamma (t'+vx'/c^{2})\\x&=\gamma (x'+vt')\\y&=y'\\z&=z'.\end{aligned}}$

Esto demuestra que el marco no preparado se mueve con la velocidad − v , medida en el marco preparado. ^[24]

El eje x no tiene nada de especial . La transformación se puede aplicar al eje y o al eje z , o incluso en cualquier dirección paralela al movimiento (que se deforma por el factor γ ) y perpendicular; consulte el artículo Transformación de Lorentz para obtener más detalles.

Una cantidad invariante bajo las transformaciones de Lorentz se conoce como escalar de Lorentz .

Escribiendo la transformación de Lorentz y su inversa en términos de diferencias de coordenadas, donde un evento tiene coordenadas ( x ₁ , t ₁ ) y ( x ′ ₁ , t ′ ₁ ) , otro evento tiene coordenadas ( x ₂ , t ₂ ) y ( x ′ ₂ , t ′ ₂ ) , y las diferencias se definen como

Ecuación 1: $\Delta x'=x'_{2}-x'_{1}\ ,\ \Delta t'=t'_{2}-t'_{1}\ .$
Ecuación 2: $\Delta x=x_{2}-x_{1}\ ,\ \ \Delta t=t_{2}-t_{1}\ .$

Nosotros conseguimos

Ecuación 3: $\Delta x'=\gamma \ (\Delta x-v\,\Delta t)\ ,\ \$ $\Delta t'=\gamma \ \left(\Delta t-v\ \Delta x/c^{2}\right)\ .$
Ecuación 4: $\Delta x=\gamma \ (\Delta x'+v\,\Delta t')\ ,\$ $\Delta t=\gamma \ \left(\Delta t'+v\ \Delta x'/c^{2}\right)\ .$

Si tomamos diferenciales en lugar de tomar diferencias, obtenemos

Ecuación 5: $dx'=\gamma \ (dx-v\,dt)\ ,\ \$ $dt'=\gamma \ \left(dt-v\ dx/c^{2}\right)\ .$
Ecuación 6: $dx=\gamma \ (dx'+v\,dt')\ ,\$ $dt=\gamma \ \left(dt'+v\ dx'/c^{2}\right)\ .$

Representación gráfica de la transformación de Lorentz

Figura 3-1. Dibujo de un diagrama de espacio-tiempo de Minkowski para ilustrar una transformación de Lorentz.

Los diagramas de espacio-tiempo ( diagramas de Minkowski ) son una ayuda extremadamente útil para visualizar cómo se transforman las coordenadas entre diferentes sistemas de referencia. Aunque no es tan fácil realizar cálculos exactos utilizándolos como invocando directamente las transformaciones de Lorentz, su principal poder es su capacidad de proporcionar una comprensión intuitiva de los resultados de un escenario relativista. ^[21]

Para dibujar un diagrama de espacio-tiempo, comience por considerar dos marcos de referencia galileanos, S y S', en configuración estándar, como se muestra en la figura 2-1. ^[21]^[25]^{: 155–199}

Fig. 3-1a . Dibuje los ejes y del marco S. El eje es horizontal y el eje (en realidad ) es vertical, lo que es lo opuesto a la convención habitual en cinemática. El eje está escalado por un factor de de modo que ambos ejes tienen unidades de longitud comunes. En el diagrama mostrado, las líneas de cuadrícula están espaciadas una unidad de distancia. Las líneas diagonales de 45° representan las líneas de mundo de dos fotones que pasan por el origen en el tiempo La pendiente de estas líneas de mundo es 1 porque los fotones avanzan una unidad en el espacio por unidad de tiempo. Se han trazado dos eventos y en este gráfico de modo que sus coordenadas se puedan comparar en los marcos S y S'. $x$ $t$ $x$ $t$ $ct$ $ct$ $c$ $t=0.$ ${\text{A}}$ ${\text{B}},$

Fig. 3-1b . Dibuje los ejes y del marco S'. El eje representa la línea de mundo del origen del sistema de coordenadas S' medido en el marco S. En esta figura, los ejes y están inclinados con respecto a los ejes no primarios en un ángulo donde Los ejes primarios y no primarios comparten un origen común porque los marcos S y S' se habían configurado en la configuración estándar, de modo que cuando $x'$ $ct'$ $ct'$ $v=c/2.$ $ct'$ $x'$ $\alpha =\tan ^{-1}(\beta ),$ $\beta =v/c.$ $t=0$ $t'=0.$

Fig. 3-1c . Las unidades en los ejes con cebadores tienen una escala diferente a las unidades en los ejes sin cebadores. A partir de las transformaciones de Lorentz, observamos que las coordenadas de en el sistema de coordenadas con cebadores se transforman a en el sistema de coordenadas sin cebadores. Asimismo, las coordenadas de en el sistema de coordenadas con cebadores se transforman a en el sistema sin cebadores. Dibuje líneas de cuadrícula paralelas al eje a través de los puntos medidos en el marco sin cebadores, donde es un entero. Asimismo, dibuje líneas de cuadrícula paralelas al eje a través de los puntos medidos en el marco sin cebadores. Utilizando el teorema de Pitágoras, observamos que el espaciado entre unidades es igual a veces el espaciado entre unidades, medido en el marco S. Esta relación siempre es mayor que 1 y, en última instancia, se acerca al infinito como $(x',ct')$ $(0,1)$ $(\beta \gamma ,\gamma )$ $(x',ct')$ $(1,0)$ $(\gamma ,\beta \gamma )$ $ct'$ $(k\gamma ,k\beta \gamma )$ $k$ $x'$ $(k\beta \gamma ,k\gamma )$ $ct'$ ${\textstyle {\sqrt {(1+\beta ^{2})/(1-\beta ^{2})}}}$ $ct$ $\beta \to 1.$

Fig. 3-1d . Puesto que la velocidad de la luz es invariante, las líneas de mundo de dos fotones que pasan por el origen en el tiempo todavía se trazan como líneas diagonales de 45°. Las coordenadas primadas de y están relacionadas con las coordenadas no primadas a través de las transformaciones de Lorentz y podrían medirse aproximadamente a partir del gráfico (suponiendo que se haya trazado con la suficiente precisión), pero el verdadero mérito de un diagrama de Minkowski es que nos otorga una visión geométrica del escenario. Por ejemplo, en esta figura, observamos que los dos eventos separados en el tiempo que tenían diferentes coordenadas x en el marco no primado ahora están en la misma posición en el espacio. $t'=0$ ${\text{A}}$ ${\text{B}}$

Mientras que el marco sin primar está dibujado con ejes de espacio y tiempo que se encuentran en ángulos rectos, el marco primar está dibujado con ejes que se encuentran en ángulos agudos u obtusos. Esta asimetría se debe a distorsiones inevitables en la forma en que las coordenadas del espacio-tiempo se asignan a un plano cartesiano , pero los marcos son en realidad equivalentes.

Consecuencias derivadas de la transformación de Lorentz

Las consecuencias de la relatividad especial pueden derivarse de las ecuaciones de transformación de Lorentz . ^[26] Estas transformaciones, y por lo tanto la relatividad especial, conducen a predicciones físicas diferentes a las de la mecánica newtoniana en todas las velocidades relativas, y más pronunciadas cuando las velocidades relativas se vuelven comparables a la velocidad de la luz. La velocidad de la luz es mucho mayor que cualquier cosa que la mayoría de los humanos encuentren, por lo que algunos de los efectos predichos por la relatividad son inicialmente contraintuitivos .

Intervalo invariante

En la relatividad galileana, la longitud de un objeto ( ) ^{[nota 3]} y la separación temporal entre dos eventos ( ) son invariantes independientes, cuyos valores no cambian cuando se observan desde diferentes marcos de referencia. ^{[nota 4]}^{[nota 5]} $\Delta r$ $\Delta t$

Sin embargo, en relatividad especial, el entrelazamiento de coordenadas espaciales y temporales genera el concepto de intervalo invariante , denotado como : ^{[nota 6]} $\Delta s^{2}$ $\Delta s^{2}\;{\overset {\text{def}}{=}}\;c^{2}\Delta t^{2}-(\Delta x^{2}+\Delta y^{2}+\Delta z^{2})$

El entrelazamiento del espacio y el tiempo revoca los conceptos implícitos de simultaneidad absoluta y sincronización a través de marcos no comóviles.

La forma de ser la diferencia del cuadrado del lapso de tiempo y la distancia espacial al cuadrado, demuestra una discrepancia fundamental entre las distancias euclidianas y del espacio-tiempo. ^{[nota 7]} La invariancia de este intervalo es una propiedad de la transformada general de Lorentz (también llamada transformación de Poincaré ), lo que la convierte en una isometría del espacio-tiempo. La transformada general de Lorentz extiende la transformada de Lorentz estándar (que trata con traslaciones sin rotación, es decir, impulsos de Lorentz , en la dirección x) con todas las demás traslaciones , reflexiones y rotaciones entre cualquier marco inercial cartesiano. ^[30]^{: 33–34} $\Delta s^{2},$

En el análisis de escenarios simplificados, como los diagramas de espacio-tiempo, a menudo se emplea una forma de dimensionalidad reducida del intervalo invariante: $\Delta s^{2}\,=\,c^{2}\Delta t^{2}-\Delta x^{2}$

Demostrar que el intervalo es invariante es sencillo para el caso de dimensionalidad reducida y con marcos en configuración estándar: ^[21] ${\begin{aligned}c^{2}\Delta t^{2}-\Delta x^{2}&=c^{2}\gamma ^{2}\left(\Delta t'+{\dfrac {v\Delta x'}{c^{2}}}\right)^{2}-\gamma ^{2}\ (\Delta x'+v\Delta t')^{2}\\&=\gamma ^{2}\left(c^{2}\Delta t'^{\,2}+2v\Delta x'\Delta t'+{\dfrac {v^{2}\Delta x'^{\,2}}{c^{2}}}\right)-\gamma ^{2}\ (\Delta x'^{\,2}+2v\Delta x'\Delta t'+v^{2}\Delta t'^{\,2})\\&=\gamma ^{2}c^{2}\Delta t'^{\,2}-\gamma ^{2}v^{2}\Delta t'^{\,2}-\gamma ^{2}\Delta x'^{\,2}+\gamma ^{2}{\dfrac {v^{2}\Delta x'^{\,2}}{c^{2}}}\\&=\gamma ^{2}c^{2}\Delta t'^{\,2}\left(1-{\dfrac {v^{2}}{c^{2}}}\right)-\gamma ^{2}\Delta x'^{\,2}\left(1-{\dfrac {v^{2}}{c^{2}}}\right)\\&=c^{2}\Delta t'^{\,2}-\Delta x'^{\,2}\end{aligned}}$

El valor de es por tanto independiente del marco en el que se mide. $\Delta s^{2}$

Al considerar el significado físico de , hay tres casos a tener en cuenta: ^[21]^[31]^{: 25–39} $\Delta s^{2}$

Δs ² > 0: En este caso, los dos eventos están separados por más tiempo que espacio, y por lo tanto se dice que están separados temporalmente . Esto implica que y dada la transformación de Lorentz es evidente que existe un menor que para el cual (en particular, ). En otras palabras, dados dos eventos que están separados temporalmente, es posible encontrar un marco en el que los dos eventos ocurren en el mismo lugar. En este marco, la separación en el tiempo se llama tiempo propio . $|\Delta x/\Delta t|<c,$ $\Delta x'=\gamma \ (\Delta x-v\,\Delta t),$ $v$ $c$ $\Delta x'=0$ $v=\Delta x/\Delta t$ $\Delta s/c,$
Δs ² < 0: En este caso, los dos eventos están separados por más espacio que tiempo, y por lo tanto se dice que están separados espacialmente . Esto implica que y dada la transformación de Lorentz existe un menor que para el cual (en particular, ). En otras palabras, dados dos eventos que están separados espacialmente, es posible encontrar un marco en el que los dos eventos ocurren al mismo tiempo. En este marco, la separación en el espacio se llama distancia propia o longitud propia . Para valores de mayor que y menor que el signo de cambia, lo que significa que el orden temporal de los eventos separados espacialmente cambia dependiendo del marco en el que se vean los eventos. Pero el orden temporal de los eventos separados temporalmente es absoluto, ya que la única forma en que podría ser mayor que sería si $|\Delta x/\Delta t|>c,$ $\Delta t'=\gamma \ (\Delta t-v\Delta x/c^{2}),$ $v$ $c$ $\Delta t'=0$ $v=c^{2}\Delta t/\Delta x$ ${\textstyle {\sqrt {-\Delta s^{2}}},}$ $v$ $c^{2}\Delta t/\Delta x,$ $\Delta t'$ $v$ $c^{2}\Delta t/\Delta x$ $v>c.$
Δs ² = 0: En este caso, se dice que los dos eventos están separados por la luz . Esto implica que y esta relación es independiente del marco debido a la invariancia de A partir de esto, observamos que la velocidad de la luz es en cada marco inercial. En otras palabras, a partir del supuesto de la covarianza universal de Lorentz, la velocidad constante de la luz es un resultado derivado, en lugar de un postulado como en la formulación de dos postulados de la teoría especial. $|\Delta x/\Delta t|=c,$ $s^{2}.$ $c$

Relatividad de la simultaneidad

Consideremos dos eventos que suceden en dos lugares diferentes y que ocurren simultáneamente en el marco de referencia de un observador inercial. Pueden ocurrir de manera no simultánea en el marco de referencia de otro observador inercial (falta de simultaneidad absoluta ).

De la ecuación 3 (la transformación de Lorentz hacia adelante en términos de diferencias de coordenadas) $\Delta t'=\gamma \left(\Delta t-{\frac {v\,\Delta x}{c^{2}}}\right)$

Es evidente que los dos eventos que son simultáneos en el marco S (satisfaciendo Δ t = 0 ), no son necesariamente simultáneos en otro marco inercial S ′ (satisfaciendo Δ t ′ = 0 ). Sólo si estos eventos son además co-locales en el marco S (satisfaciendo Δ x = 0 ), serán simultáneos en otro marco S ′ .

El efecto Sagnac puede considerarse una manifestación de la relatividad de la simultaneidad. ^[32] Dado que la relatividad de la simultaneidad es un efecto de primer orden en , ^[21] los instrumentos basados en el efecto Sagnac para su funcionamiento, como los giroscopios láser de anillo y los giroscopios de fibra óptica , son capaces de alcanzar niveles extremos de sensibilidad. ^{[p 14]} $v$

Dilatación del tiempo

El lapso de tiempo entre dos eventos no es invariable de un observador a otro, sino que depende de las velocidades relativas de los marcos de referencia de los observadores.

Supongamos que un reloj está en reposo en el sistema no primo S . La posición del reloj en dos tics diferentes se caracteriza entonces por Δ x = 0 . Para encontrar la relación entre los tiempos entre estos tics medidos en ambos sistemas, se puede utilizar la ecuación 3 para encontrar:

\Delta t'=\gamma \,\Delta t

Para eventos que satisfagan

\Delta x=0\ .

Esto demuestra que el tiempo (Δ t ′ ) entre los dos tics, como se ve en el marco en el que se mueve el reloj ( S ′ ), es más largo que el tiempo (Δ t ) entre estos tics, medido en el marco de reposo del reloj ( S ). La dilatación del tiempo explica una serie de fenómenos físicos; por ejemplo, la vida útil de los muones de alta velocidad creados por la colisión de rayos cósmicos con partículas en la atmósfera exterior de la Tierra y que se mueven hacia la superficie es mayor que la vida útil de los muones que se mueven lentamente, creados y decayendo en un laboratorio. ^[33]

Siempre que se oye una afirmación del tipo "los relojes en movimiento van más despacio", hay que imaginarse un sistema de referencia inercial poblado de relojes idénticos y sincronizados. A medida que un reloj en movimiento recorre este sistema, su lectura en cualquier punto concreto se compara con la de un reloj estacionario en el mismo punto. ^[34]^{: 149–152}

Las mediciones que obtendríamos si miráramos un reloj en movimiento no serían, en general, en absoluto las mismas, porque el tiempo que veríamos estaría retrasado por la velocidad finita de la luz, es decir, los tiempos que vemos estarían distorsionados por el efecto Doppler . Las mediciones de los efectos relativistas siempre deben entenderse como realizadas después de que se hayan descartado los efectos de la velocidad finita de la luz. ^[34]^{: 149–152}

El reloj de luz de Langevin

Paul Langevin , uno de los primeros defensores de la teoría de la relatividad, hizo mucho por popularizar la teoría frente a la resistencia de muchos físicos a los conceptos revolucionarios de Einstein. Entre sus numerosas contribuciones a los fundamentos de la relatividad especial se encuentran trabajos independientes sobre la relación masa-energía, un examen exhaustivo de la paradoja de los gemelos e investigaciones sobre sistemas de coordenadas rotatorios. Su nombre se asocia con frecuencia a un concepto hipotético llamado "reloj de luz" (desarrollado originalmente por Lewis y Tolman en 1909 ^[35] ) que utilizó para realizar una novedosa derivación de la transformación de Lorentz. ^[36]

Se considera que un reloj de luz es una caja de paredes perfectamente reflectantes en la que una señal luminosa se refleja de un lado a otro desde caras opuestas. El concepto de dilatación del tiempo se enseña con frecuencia utilizando un reloj de luz que se desplaza en un movimiento inercial uniforme perpendicular a una línea que conecta los dos espejos. ^[37]^[38]^[39]^[40] (El propio Langevin utilizó un reloj de luz orientado en paralelo a su línea de movimiento. ^[36] )

Consideremos el escenario ilustrado en la Fig. 4-3A. El observador A sostiene un reloj de luz de longitud así como un cronómetro electrónico con el que mide cuánto tarda un pulso en hacer un viaje de ida y vuelta a lo largo del reloj de luz. Aunque el observador A viaja rápidamente a lo largo de un tren, desde su punto de vista la emisión y recepción del pulso ocurren en el mismo lugar, y mide el intervalo utilizando un solo reloj ubicado en la posición precisa de estos dos eventos. Para el intervalo entre estos dos eventos, el observador A encuentra Un intervalo de tiempo medido utilizando un solo reloj que está inmóvil en un marco de referencia particular se llama intervalo de tiempo propio . ^[41] $L$ $t_{\text{A}}=2L/c.$

La figura 4-3B ilustra estos mismos dos eventos desde el punto de vista del observador B, que está estacionado junto a las vías mientras el tren pasa a una velocidad de En lugar de hacer movimientos rectos hacia arriba y hacia abajo, el observador B ve los pulsos moviéndose a lo largo de una línea en zigzag. Sin embargo, debido al postulado de la constancia de la velocidad de la luz, la velocidad de los pulsos a lo largo de estas líneas diagonales es la misma que la que el observador A vio para sus pulsos hacia arriba y hacia abajo. B mide la velocidad del componente vertical de estos pulsos como de modo que el tiempo total de ida y vuelta de los pulsos es Nótese que para el observador B, la emisión y recepción del pulso de luz ocurrieron en diferentes lugares, y midió el intervalo utilizando dos relojes estacionarios y sincronizados ubicados en dos posiciones diferentes en su marco de referencia. El intervalo que B midió, por lo tanto, no fue un intervalo de tiempo propio porque no lo midió con un solo reloj en reposo. ^[41] $v.$ $c$ ${\textstyle \pm {\sqrt {c^{2}-v^{2}}},}$ ${\textstyle t_{B}=2L{\big /}{\sqrt {c^{2}-v^{2}}}={}}$ ${\textstyle t_{A}{\big /}{\sqrt {1-v^{2}/c^{2}}}.}$

Dilatación recíproca del tiempo

En la descripción anterior del reloj de luz de Langevin, la clasificación de un observador como estacionario y del otro como en movimiento fue completamente arbitraria. También podría suceder que el observador B llevara el reloj de luz y se moviera a una velocidad de hacia la izquierda, en cuyo caso el observador A percibiría que el reloj de B funcionaba más lento que su reloj local. $v$

No hay ninguna paradoja aquí, porque no hay un observador independiente C que esté de acuerdo con A y B. El observador C necesariamente hace sus mediciones a partir de su propio marco de referencia. Si ese marco de referencia coincide con el marco de referencia de A, entonces C estará de acuerdo con la medición del tiempo de A. Si el marco de referencia de C coincide con el marco de referencia de B, entonces C estará de acuerdo con la medición del tiempo de B. Si el marco de referencia de C no coincide ni con el marco de A ni con el de B, entonces la medición del tiempo de C estará en desacuerdo con la medición del tiempo de A y de B. ^[42]

Paradoja de los gemelos

La reciprocidad de la dilatación del tiempo entre dos observadores en sistemas inerciales separados conduce a la llamada paradoja de los gemelos , articulada en su forma actual por Langevin en 1911. ^[43] Langevin imaginó a un aventurero que desea explorar el futuro de la Tierra. Este viajero se sube a un proyectil capaz de viajar al 99,995% de la velocidad de la luz. Después de hacer un viaje de ida y vuelta hacia y desde una estrella cercana que dura solo dos años de su propia vida, regresa a una Tierra que es doscientos años más vieja.

Este resultado parece desconcertante porque tanto el viajero como un observador terrestre verían al otro en movimiento y, por lo tanto, debido a la reciprocidad de la dilatación del tiempo, uno podría esperar inicialmente que cada uno de ellos hubiera encontrado que el otro había envejecido menos. En realidad, no hay ninguna paradoja, porque para que los dos observadores comparen sus tiempos propios, la simetría de la situación debe romperse: al menos uno de los dos observadores debe cambiar su estado de movimiento para que coincida con el del otro. ^[44]

Sin embargo, conocer la resolución general de la paradoja no permite calcular inmediatamente resultados cuantitativos correctos. En la literatura se han proporcionado muchas soluciones a este problema y se han analizado en el artículo sobre la paradoja de los gemelos . A continuación, examinaremos una de esas soluciones a la paradoja.

Nuestro objetivo básico será demostrar que, después del viaje, ambos gemelos están en perfecto acuerdo sobre quién envejeció cuánto, independientemente de sus diferentes experiencias. La figura 4-4 ilustra un escenario donde el gemelo viajero vuela a 0,6 c hacia y desde una estrella a 3 años luz de distancia. Durante el viaje, cada gemelo envía señales de tiempo anuales (medidas en sus propios tiempos propios) al otro. Después del viaje, se comparan los recuentos acumulativos. En la fase de ida del viaje, cada gemelo recibe las señales del otro a la tasa reducida de Inicialmente, la situación es perfectamente simétrica: observe que cada gemelo recibe la señal de un año del otro a los dos años medidos en su propio reloj. La simetría se rompe cuando el gemelo viajero se da la vuelta en la marca de los cuatro años medidos por su reloj. Durante los cuatro años restantes de su viaje, recibe señales a la tasa mejorada de La situación es bastante diferente con el gemelo estacionario. Debido al retraso de la velocidad de la luz, no ve a su hermana darse la vuelta hasta que hayan pasado ocho años en su propio reloj. Por lo tanto, recibe señales de frecuencia aumentada de su hermana durante un período relativamente breve. Aunque los gemelos no están de acuerdo en sus respectivas medidas del tiempo total, vemos en la siguiente tabla, así como por simple observación del diagrama de Minkowski, que cada gemelo está totalmente de acuerdo con el otro en cuanto al número total de señales enviadas de uno al otro. Por lo tanto, no hay ninguna paradoja. ^[34]^{: 152–159} ${\textstyle f'=f{\sqrt {(1-\beta )/(1+\beta )}}.}$ ${\textstyle f''=f{\sqrt {(1+\beta )/(1-\beta )}}.}$

Contracción de longitud

Las dimensiones (por ejemplo, la longitud) de un objeto medidas por un observador pueden ser menores que los resultados de las mediciones del mismo objeto realizadas por otro observador (por ejemplo, la paradoja de la escalera implica una escalera larga que viaja casi a la velocidad de la luz y está contenida dentro de un garaje más pequeño).

De manera similar, supongamos que una varilla de medición está en reposo y alineada a lo largo del eje x en el sistema no primo S . En este sistema, la longitud de esta varilla se escribe como Δ x . Para medir la longitud de esta varilla en el sistema S ′ , en el que la varilla se está moviendo, las distancias x ′ a los puntos finales de la varilla deben medirse simultáneamente en ese sistema S ′ . En otras palabras, la medición se caracteriza por Δ t ′ = 0 , que se puede combinar con la Ecuación 4 para encontrar la relación entre las longitudes Δ x y Δ x ′ :

\Delta x'={\frac {\Delta x}{\gamma }}

Para eventos que satisfagan

\Delta t'=0\ .

Esto demuestra que la longitud (Δ x ′ ) de la varilla medida en el marco en el que se mueve ( S ′ ), es más corta que su longitud (Δ x ) en su propio marco en reposo ( S ).

La dilatación del tiempo y la contracción de la longitud no son meras apariencias. La dilatación del tiempo está explícitamente relacionada con nuestra forma de medir los intervalos de tiempo entre eventos que ocurren en el mismo lugar en un sistema de coordenadas dado (llamados eventos "co-locales"). Estos intervalos de tiempo (que pueden ser, y son, medidos experimentalmente por observadores relevantes) son diferentes en otro sistema de coordenadas que se mueve con respecto al primero, a menos que los eventos, además de ser co-locales, también sean simultáneos. De manera similar, la contracción de la longitud se relaciona con nuestras distancias medidas entre eventos separados pero simultáneos en un sistema de coordenadas dado de elección. Si estos eventos no son co-locales, sino que están separados por la distancia (espacio), no ocurrirán a la misma distancia espacial entre sí cuando se los observe desde otro sistema de coordenadas en movimiento.

Transformación de velocidades de Lorentz

Consideremos dos marcos S y S ′ en configuración estándar. Una partícula en S se mueve en la dirección x con el vector de velocidad ¿Cuál es su velocidad en el marco S ′ ? $\mathbf {u} .$ $\mathbf {u'}$

Podemos escribir

Sustituyendo expresiones para y de la ecuación 5 en la ecuación 8 , seguido de manipulaciones matemáticas sencillas y sustitución inversa de la ecuación 7, se obtiene la transformación de Lorentz de la velocidad en : $dx'$ $dt'$ $u$ $u'$

La relación inversa se obtiene intercambiando los símbolos primados y no primados y reemplazando con $v$ $-v\ .$

Para no alineado a lo largo del eje x, escribimos: ^[12]^{: 47–49} $\mathbf {u}$

Las transformaciones directa e inversa para este caso son:

La ecuación 10 y la ecuación 14 pueden interpretarse como la resultante de las dos velocidades y y reemplazan la fórmula que es válida en la relatividad galileana. Interpretadas de esa manera, se las conoce comúnmente como fórmulas relativistas de adición (o composición) de velocidades , válidas para los tres ejes de S y S ′ alineados entre sí (aunque no necesariamente en la configuración estándar). ^[12]^{: 47–49} $\mathbf {u}$ $\mathbf {v}$ $\mathbf {u'} ,$ $\mathbf {u=u'+v}$

Tomamos nota de los siguientes puntos:

Si un objeto (por ejemplo, un fotón ) se moviera a la velocidad de la luz en un marco (es decir, u = ± c o u ′ = ± c ) , entonces también se movería a la velocidad de la luz en cualquier otro marco, moviéndose a | v | < c .
La velocidad resultante de dos velocidades con magnitud menor que c es siempre una velocidad con magnitud menor que c .
Si tanto | u | como | v | (y entonces también | u ′ | y | v ′ |) son pequeños con respecto a la velocidad de la luz (es decir, por ejemplo, | ⁠tú/do⁠ | ≪ $1$ ), entonces las transformaciones galileanas intuitivas se recuperan de las ecuaciones de transformación para la relatividad especial
Unir un marco a un fotón ( que viaja sobre un haz de luz , como considera Einstein) requiere un tratamiento especial de las transformaciones.

No hay nada especial en la dirección x en la configuración estándar. El formalismo anterior se aplica a cualquier dirección; y tres direcciones ortogonales permiten tratar con todas las direcciones en el espacio descomponiendo los vectores de velocidad en sus componentes en estas direcciones. Consulte la fórmula de adición de velocidad para obtener más detalles.

Rotación de Thomas

Figura 4-5. Rotación de Thomas-Wigner

La composición de dos impulsos de Lorentz no colineales (es decir, dos transformaciones de Lorentz no colineales, ninguna de las cuales implica rotación) da como resultado una transformación de Lorentz que no es un impulso puro, sino que es la composición de un impulso y una rotación.

La rotación de Thomas resulta de la relatividad de la simultaneidad. En la figura 4-5a, una varilla de longitud en su sistema de referencia en reposo (es decir, que tiene una longitud propia de ) se eleva verticalmente a lo largo del eje y en el sistema de referencia en el suelo. $L$ $L$

En la figura 4-5b, se observa la misma varilla desde el marco de un cohete que se mueve a gran velocidad hacia la derecha. Si imaginamos dos relojes situados en los extremos izquierdo y derecho de la varilla que están sincronizados en el marco de la varilla , la relatividad de la simultaneidad hace que el observador en el marco del cohete observe (no vea) el reloj en el extremo derecho de la varilla como si estuviera adelantado en el tiempo y la varilla se observe correspondientemente como inclinada. ^[31]^{: 98–99} $v$ $Lv/c^{2},$

A diferencia de los efectos relativistas de segundo orden, como la contracción de la longitud o la dilatación del tiempo, este efecto se vuelve bastante significativo incluso a velocidades bastante bajas. Por ejemplo, esto se puede ver en el espín de partículas en movimiento , donde la precesión de Thomas es una corrección relativista que se aplica al espín de una partícula elemental o la rotación de un giroscopio macroscópico , relacionando la velocidad angular del espín de una partícula que sigue una órbita curvilínea con la velocidad angular del movimiento orbital. ^[31]^{: 169–174}

La rotación de Thomas proporciona la solución a la conocida "paradoja del metro y el agujero". ^{[p 15]}^[31]^{: 98–99}

Causalidad y prohibición del movimiento más rápido que la luz

En la figura 4-6, el intervalo de tiempo entre los eventos A (la "causa") y B (el "efecto") es "temporal"; es decir, hay un marco de referencia en el que los eventos A y B ocurren en el mismo lugar en el espacio , separados solo por ocurrir en diferentes momentos. Si A precede a B en ese marco, entonces A precede a B en todos los marcos accesibles por una transformación de Lorentz. Es posible que la materia (o información) viaje (por debajo de la velocidad de la luz) desde el lugar de A, comenzando en el momento de A, hasta el lugar de B, llegando al momento de B, por lo que puede haber una relación causal (siendo A la causa y B el efecto).

El intervalo AC del diagrama es "similar al espacio", es decir, hay un marco de referencia en el que los eventos A y C ocurren simultáneamente, separados solo en el espacio. También hay marcos en los que A precede a C (como se muestra) y marcos en los que C precede a A. Pero no hay marcos accesibles mediante una transformación de Lorentz, en la que los eventos A y C ocurren en el mismo lugar. Si fuera posible que existiera una relación de causa y efecto entre los eventos A y C, se producirían paradojas de causalidad.

Por ejemplo, si las señales pudieran enviarse más rápido que la luz, entonces podrían enviarse señales al pasado del remitente (el observador B en los diagramas). ^[45]^{[p 16]} Se podrían construir entonces diversas paradojas causales.

Figura 4-7. Violación de la causalidad mediante el uso de
"comunicadores instantáneos" ficticios

Considere los diagramas de espacio-tiempo de la figura 4-7. A y B se encuentran junto a una vía férrea cuando pasa un tren de alta velocidad, con C viajando en el último vagón del tren y D viajando en el vagón delantero. Las líneas de universo de A y B son verticales ( ct ), lo que distingue la posición estacionaria de estos observadores en el suelo, mientras que las líneas de universo de C y D están inclinadas hacia delante ( ct ′ ), lo que refleja el movimiento rápido de los observadores C y D estacionarios en su tren, tal como se observa desde el suelo.

Fig. 4-7a. El evento de "B pasando un mensaje a D", mientras el vagón que va delante pasa, está en el origen del marco de D. D envía el mensaje a lo largo del tren a C en el vagón trasero, utilizando un "comunicador instantáneo" ficticio. La línea de tiempo de este mensaje es la flecha roja gruesa a lo largo del eje, que es una línea de simultaneidad en los marcos preparados de C y D. En el marco de tierra (no preparado) la señal llega antes de lo que fue enviada. $-x'$
Fig. 4-7b. El evento de "C pasando el mensaje a A", que está de pie junto a las vías del tren, está en el origen de sus tramas. Ahora A envía el mensaje por las vías a B mediante un "comunicador instantáneo". La línea de tiempo de este mensaje es la flecha gruesa azul, a lo largo del eje, que es una línea de simultaneidad para las tramas de A y B. Como se ve en el diagrama del espacio-tiempo, B recibirá el mensaje antes de haberlo enviado, una violación de la causalidad. ^[46] $+x$

No es necesario que las señales sean instantáneas para violar la causalidad. Incluso si la señal de D a C fuera ligeramente más superficial que el eje (y la señal de A a B ligeramente más inclinada que el eje), aún sería posible que B recibiera su mensaje antes de haberlo enviado. Al aumentar la velocidad del tren a velocidades cercanas a la de la luz, los ejes y pueden comprimirse muy cerca de la línea discontinua que representa la velocidad de la luz. Con esta configuración modificada, se puede demostrar que incluso señales solo ligeramente más rápidas que la velocidad de la luz darán como resultado una violación de la causalidad. ^[47] $x'$ $x$ $ct'$ $x'$

Por lo tanto, si se quiere preservar la causalidad , una de las consecuencias de la relatividad especial es que ninguna señal de información ni ningún objeto material puede viajar más rápido que la luz en el vacío.

Esto no quiere decir que todas las velocidades superiores a la de la luz sean imposibles. Se pueden describir varias situaciones triviales en las que algunas "cosas" (no materia ni energía reales) se mueven más rápido que la luz. ^[48] Por ejemplo, el lugar donde el haz de luz de un reflector incide en la parte inferior de una nube puede moverse más rápido que la luz cuando el reflector se gira rápidamente (aunque esto no viola la causalidad ni ningún otro fenómeno relativista). ^[49]^[50]

Efectos ópticos

Efectos de arrastre

En 1850, Hippolyte Fizeau y Léon Foucault establecieron de forma independiente que la luz viaja más lentamente en el agua que en el aire, validando así una predicción de la teoría ondulatoria de la luz de Fresnel e invalidando la predicción correspondiente de la teoría corpuscular de Newton . ^[51] La velocidad de la luz se midió en agua en calma. ¿Cuál sería la velocidad de la luz en agua en movimiento?

En 1851, Fizeau realizó un experimento para responder a esta pregunta, cuya representación simplificada se ilustra en la figura 5-1. Un divisor de haz divide un haz de luz y los haces divididos pasan en direcciones opuestas a través de un tubo de agua en movimiento. Se recombinan para formar franjas de interferencia, que indican una diferencia en la longitud del recorrido óptico, que un observador puede ver. El experimento demostró que el arrastre de la luz por el agua en movimiento provocó un desplazamiento de las franjas, lo que demuestra que el movimiento del agua había afectado a la velocidad de la luz.

Según las teorías que prevalecían en ese momento, la luz que viaja a través de un medio en movimiento sería una simple suma de su velocidad a través del medio más la velocidad del medio. Contrariamente a lo esperado, Fizeau descubrió que, aunque la luz parecía ser arrastrada por el agua, la magnitud del arrastre era mucho menor de lo esperado. Si es la velocidad de la luz en agua quieta, y es la velocidad del agua, y es la velocidad de la luz transportada por el agua en el marco de referencia del laboratorio con el flujo de agua sumando o restando a la velocidad de la luz, entonces $u'=c/n$ $v$ $u_{\pm }$ $u_{\pm }={\frac {c}{n}}\pm v\left(1-{\frac {1}{n^{2}}}\right)\ .$

Los resultados de Fizeau, aunque consistentes con la hipótesis anterior de Fresnel sobre el arrastre parcial del éter , fueron extremadamente desconcertantes para los físicos de la época. Entre otras cosas, la presencia de un término de índice de refracción significaba que, dado que depende de la longitud de onda, el éter debe ser capaz de sostener diferentes movimientos al mismo tiempo . ^{[nota 8]} Se propusieron diversas explicaciones teóricas para explicar el coeficiente de arrastre de Fresnel , que eran completamente contradictorias entre sí. Incluso antes del experimento de Michelson-Morley, los resultados experimentales de Fizeau se encontraban entre una serie de observaciones que crearon una situación crítica para explicar la óptica de los cuerpos en movimiento. ^[52] $n$

Desde el punto de vista de la relatividad especial, el resultado de Fizeau no es más que una aproximación a la ecuación 10 , la fórmula relativista para la composición de velocidades. ^[30]

u_{\pm }={\frac {u'\pm v}{1\pm u'v/c^{2}}}=

{\frac {c/n\pm v}{1\pm v/cn}}\approx

c\left({\frac {1}{n}}\pm {\frac {v}{c}}\right)\left(1\mp {\frac {v}{cn}}\right)\approx

{\frac {c}{n}}\pm v\left(1-{\frac {1}{n^{2}}}\right)

Aberración relativista de la luz

Debido a la velocidad finita de la luz, si los movimientos relativos de una fuente y un receptor incluyen un componente transversal, entonces la dirección desde la cual la luz llega al receptor se desplazará con respecto a la posición geométrica en el espacio de la fuente con respecto al receptor. El cálculo clásico del desplazamiento adopta dos formas y realiza predicciones diferentes dependiendo de si el receptor, la fuente o ambos están en movimiento con respecto al medio. (1) Si el receptor está en movimiento, el desplazamiento sería consecuencia de la aberración de la luz . El ángulo de incidencia del haz con respecto al receptor se podría calcular a partir de la suma vectorial de los movimientos del receptor y la velocidad de la luz incidente. ^[53] (2) Si la fuente está en movimiento, el desplazamiento sería consecuencia de la corrección del tiempo de luz . El desplazamiento de la posición aparente de la fuente con respecto a su posición geométrica sería el resultado del movimiento de la fuente durante el tiempo que tarda su luz en llegar al receptor. ^[54]

La explicación clásica no superó la prueba experimental. Dado que el ángulo de aberración depende de la relación entre la velocidad del receptor y la velocidad de la luz incidente, el paso de la luz incidente a través de un medio refractivo debería cambiar el ángulo de aberración. En 1810, Arago utilizó este fenómeno esperado en un intento fallido de medir la velocidad de la luz, ^[55] y en 1870, George Airy probó la hipótesis utilizando un telescopio lleno de agua, encontrando que, contra lo esperado, la aberración medida era idéntica a la aberración medida con un telescopio lleno de aire. ^[56] Un intento "engorroso" de explicar estos resultados utilizó la hipótesis del arrastre parcial del éter, ^[57] pero era incompatible con los resultados del experimento de Michelson-Morley, que aparentemente exigía un arrastre completo del éter. ^[58]

Suponiendo sistemas inerciales, la expresión relativista para la aberración de la luz es aplicable tanto a los casos de receptor en movimiento como a los de fuente en movimiento. Se han publicado diversas fórmulas trigonométricamente equivalentes. Expresadas en términos de las variables de la figura 5-2, estas incluyen ^[30]^{: 57–60}

\cos \theta '={\frac {\cos \theta +v/c}{1+(v/c)\cos \theta }}

O O

\sin \theta '={\frac {\sin \theta }{\gamma [1+(v/c)\cos \theta ]}}

\tan {\frac {\theta '}{2}}=\left({\frac {c-v}{c+v}}\right)^{1/2}\tan {\frac {\theta }{2}}

Efecto Doppler relativista

Efecto Doppler longitudinal relativista

El efecto Doppler clásico depende de si la fuente, el receptor o ambos están en movimiento con respecto al medio. El efecto Doppler relativista es independiente de cualquier medio. Sin embargo, el efecto Doppler relativista para el caso longitudinal, con la fuente y el receptor moviéndose directamente uno hacia el otro o alejándose uno del otro, se puede derivar como si fuera el fenómeno clásico, pero modificado mediante la adición de un término de dilatación del tiempo , y ese es el tratamiento que se describe aquí. ^[59]^[60]

Suponga que el receptor y la fuente se alejan entre sí con una velocidad relativa medida por un observador en el receptor o la fuente (la convención de signos adoptada aquí es que es negativa si el receptor y la fuente se mueven uno hacia el otro). Suponga que la fuente está estacionaria en el medio. Entonces, ¿dónde es la velocidad del sonido? $v\,$ $v$ $f_{r}=\left(1-{\frac {v}{c_{s}}}\right)f_{s}$ $c_{s}$

En el caso de la luz, y con el receptor moviéndose a velocidades relativistas, los relojes del receptor están dilatados en el tiempo en relación con los relojes de la fuente. El receptor medirá la frecuencia recibida para que esté donde $f_{r}=\gamma \left(1-\beta \right)f_{s}={\sqrt {\frac {1-\beta }{1+\beta }}}\,f_{s}.$

$\beta =v/c$ y
$\gamma ={\frac {1}{\sqrt {1-\beta ^{2}}}}$ es el factor de Lorentz .

Se obtiene una expresión idéntica para el desplazamiento Doppler relativista cuando se realiza el análisis en el marco de referencia del receptor con una fuente en movimiento. ^[61]^[21]

Efecto Doppler transversal

The transverse Doppler effect is one of the main novel predictions of the special theory of relativity.

Classically, one might expect that if source and receiver are moving transversely with respect to each other with no longitudinal component to their relative motions, that there should be no Doppler shift in the light arriving at the receiver.

Special relativity predicts otherwise. Fig. 5-3 illustrates two common variants of this scenario. Both variants can be analyzed using simple time dilation arguments.^[21] In Fig. 5-3a, the receiver observes light from the source as being blueshifted by a factor of $\gamma$ . In Fig. 5-3b, the light is redshifted by the same factor.

Measurement versus visual appearance

Time dilation and length contraction are not optical illusions, but genuine effects. Measurements of these effects are not an artifact of Doppler shift, nor are they the result of neglecting to take into account the time it takes light to travel from an event to an observer.

Scientists make a fundamental distinction between measurement or observation on the one hand, versus visual appearance, or what one sees. The measured shape of an object is a hypothetical snapshot of all of the object's points as they exist at a single moment in time. But the visual appearance of an object is affected by the varying lengths of time that light takes to travel from different points on the object to one's eye.

For many years, the distinction between the two had not been generally appreciated, and it had generally been thought that a length contracted object passing by an observer would in fact actually be seen as length contracted. In 1959, James Terrell and Roger Penrose independently pointed out that differential time lag effects in signals reaching the observer from the different parts of a moving object result in a fast moving object's visual appearance being quite different from its measured shape. For example, a receding object would appear contracted, an approaching object would appear elongated, and a passing object would have a skew appearance that has been likened to a rotation.^{[p 19]}^{[p 20]}^[62]^[63] A sphere in motion retains the circular outline for all speeds, for any distance, and for all view angles, although the surface of the sphere and the images on it will appear distorted.^[64]^[65]

Both Fig. 5-4 and Fig. 5-5 illustrate objects moving transversely to the line of sight. In Fig. 5-4, a cube is viewed from a distance of four times the length of its sides. At high speeds, the sides of the cube that are perpendicular to the direction of motion appear hyperbolic in shape. The cube is actually not rotated. Rather, light from the rear of the cube takes longer to reach one's eyes compared with light from the front, during which time the cube has moved to the right. At high speeds, the sphere in Fig. 5-5 takes on the appearance of a flattened disk tilted up to 45° from the line of sight. If the objects' motions are not strictly transverse but instead include a longitudinal component, exaggerated distortions in perspective may be seen.^[66] This illusion has come to be known as Terrell rotation or the Terrell–Penrose effect.^{[note 9]}

Another example where visual appearance is at odds with measurement comes from the observation of apparent superluminal motion in various radio galaxies, BL Lac objects, quasars, and other astronomical objects that eject relativistic-speed jets of matter at narrow angles with respect to the viewer. An apparent optical illusion results giving the appearance of faster than light travel.^[67]^[68]^[69] In Fig. 5-6, galaxy M87 streams out a high-speed jet of subatomic particles almost directly towards us, but Penrose–Terrell rotation causes the jet to appear to be moving laterally in the same manner that the appearance of the cube in Fig. 5-4 has been stretched out.^[70]

Dynamics

Section Consequences derived from the Lorentz transformation dealt strictly with kinematics, the study of the motion of points, bodies, and systems of bodies without considering the forces that caused the motion. This section discusses masses, forces, energy and so forth, and as such requires consideration of physical effects beyond those encompassed by the Lorentz transformation itself.

Equivalence of mass and energy

As an object's speed approaches the speed of light from an observer's point of view, its relativistic mass increases thereby making it more and more difficult to accelerate it from within the observer's frame of reference.

The energy content of an object at rest with mass m equals mc². Conservation of energy implies that, in any reaction, a decrease of the sum of the masses of particles must be accompanied by an increase in kinetic energies of the particles after the reaction. Similarly, the mass of an object can be increased by taking in kinetic energies.

In addition to the papers referenced above—which give derivations of the Lorentz transformation and describe the foundations of special relativity—Einstein also wrote at least four papers giving heuristic arguments for the equivalence (and transmutability) of mass and energy, for E = mc².

Mass–energy equivalence is a consequence of special relativity. The energy and momentum, which are separate in Newtonian mechanics, form a four-vector in relativity, and this relates the time component (the energy) to the space components (the momentum) in a non-trivial way. For an object at rest, the energy–momentum four-vector is (E/c, 0, 0, 0): it has a time component which is the energy, and three space components which are zero. By changing frames with a Lorentz transformation in the x direction with a small value of the velocity v, the energy momentum four-vector becomes (E/c, Ev/c², 0, 0). The momentum is equal to the energy multiplied by the velocity divided by c². As such, the Newtonian mass of an object, which is the ratio of the momentum to the velocity for slow velocities, is equal to E/c².

The energy and momentum are properties of matter and radiation, and it is impossible to deduce that they form a four-vector just from the two basic postulates of special relativity by themselves, because these do not talk about matter or radiation, they only talk about space and time. The derivation therefore requires some additional physical reasoning. In his 1905 paper, Einstein used the additional principles that Newtonian mechanics should hold for slow velocities, so that there is one energy scalar and one three-vector momentum at slow velocities, and that the conservation law for energy and momentum is exactly true in relativity. Furthermore, he assumed that the energy of light is transformed by the same Doppler-shift factor as its frequency, which he had previously shown to be true based on Maxwell's equations.^{[p 1]} The first of Einstein's papers on this subject was "Does the Inertia of a Body Depend upon its Energy Content?" in 1905.^{[p 21]} Although Einstein's argument in this paper is nearly universally accepted by physicists as correct, even self-evident, many authors over the years have suggested that it is wrong.^[71] Other authors suggest that the argument was merely inconclusive because it relied on some implicit assumptions.^[72]

Einstein acknowledged the controversy over his derivation in his 1907 survey paper on special relativity. There he notes that it is problematic to rely on Maxwell's equations for the heuristic mass–energy argument. The argument in his 1905 paper can be carried out with the emission of any massless particles, but the Maxwell equations are implicitly used to make it obvious that the emission of light in particular can be achieved only by doing work. To emit electromagnetic waves, all you have to do is shake a charged particle, and this is clearly doing work, so that the emission is of energy.^{[p 22]}^{[note 10]}

Einstein's 1905 demonstration of E = mc2

In his fourth of his 1905 Annus mirabilis papers,^{[p 21]} Einstein presented a heuristic argument for the equivalence of mass and energy. Although, as discussed above, subsequent scholarship has established that his arguments fell short of a broadly definitive proof, the conclusions that he reached in this paper have stood the test of time.

Einstein took as starting assumptions his recently discovered formula for relativistic Doppler shift, the laws of conservation of energy and conservation of momentum, and the relationship between the frequency of light and its energy as implied by Maxwell's equations.

Fig. 6-1 (top). Consider a system of plane waves of light having frequency $f$ traveling in direction $\phi$ relative to the x-axis of reference frame S. The frequency (and hence energy) of the waves as measured in frame S′ that is moving along the x-axis at velocity $v$ is given by the relativistic Doppler shift formula which Einstein had developed in his 1905 paper on special relativity:^{[p 1]}

{\frac {f'}{f}}={\frac {1-(v/c)\cos {\phi }}{\sqrt {1-v^{2}/c^{2}}}}

Fig. 6-1 (bottom). Consider an arbitrary body that is stationary in reference frame S. Let this body emit a pair of equal-energy light-pulses in opposite directions at angle $\phi$ with respect to the x-axis. Each pulse has energy $L/2$ . Because of conservation of momentum, the body remains stationary in S after emission of the two pulses. Let $E_{0}$ be the energy of the body before emission of the two pulses and $E_{1}$ after their emission.

Next, consider the same system observed from frame S′ that is moving along the x-axis at speed $v$ relative to frame S. In this frame, light from the forwards and reverse pulses will be relativistically Doppler-shifted. Let $H_{0}$ be the energy of the body measured in reference frame S′ before emission of the two pulses and $H_{1}$ after their emission. We obtain the following relationships:^{[p 21]}

{\begin{aligned}E_{0}&=E_{1}+{\tfrac {1}{2}}L+{\tfrac {1}{2}}L=E_{1}+L\\[5mu]H_{0}&=H_{1}+{\tfrac {1}{2}}L{\frac {1-(v/c)\cos {\phi }}{\sqrt {1-v^{2}/c^{2}}}}+{\tfrac {1}{2}}L{\frac {1+(v/c)\cos {\phi }}{\sqrt {1-v^{2}/c^{2}}}}=H_{1}+{\frac {L}{\sqrt {1-v^{2}/c^{2}}}}\end{aligned}}

From the above equations, we obtain the following:

The two differences of form $H-E$ seen in the above equation have a straightforward physical interpretation. Since $H$ and $E$ are the energies of the arbitrary body in the moving and stationary frames, $H_{0}-E_{0}$ and $H_{1}-E_{1}$ represents the kinetic energies of the bodies before and after the emission of light (except for an additive constant that fixes the zero point of energy and is conventionally set to zero). Hence,

Taking a Taylor series expansion and neglecting higher order terms, he obtained

Comparing the above expression with the classical expression for kinetic energy, K.E. = ⁠1/2⁠mv², Einstein then noted: "If a body gives off the energy L in the form of radiation, its mass diminishes by L/c²."

Rindler has observed that Einstein's heuristic argument suggested merely that energy contributes to mass. In 1905, Einstein's cautious expression of the mass–energy relationship allowed for the possibility that "dormant" mass might exist that would remain behind after all the energy of a body was removed. By 1907, however, Einstein was ready to assert that all inertial mass represented a reserve of energy. "To equate all mass with energy required an act of aesthetic faith, very characteristic of Einstein."^[12]^: 81–84 Einstein's bold hypothesis has been amply confirmed in the years subsequent to his original proposal.

For a variety of reasons, Einstein's original derivation is currently seldom taught. Besides the vigorous debate that continues until this day as to the formal correctness of his original derivation, the recognition of special relativity as being what Einstein called a "principle theory" has led to a shift away from reliance on electromagnetic phenomena to purely dynamic methods of proof.^[73]

How far can you travel from the Earth?

Since nothing can travel faster than light, one might conclude that a human can never travel farther from Earth than ~100 light years. You would easily think that a traveler would never be able to reach more than the few solar systems which exist within the limit of 100 light years from Earth. However, because of time dilation, a hypothetical spaceship can travel thousands of light years during a passenger's lifetime. If a spaceship could be built that accelerates at a constant 1g, it will, after one year, be travelling at almost the speed of light as seen from Earth. This is described by: $v(t)={\frac {at}{\sqrt {1+a^{2}t^{2}/c^{2}}}}$ where v(t) is the velocity at a time t, a is the acceleration of the spaceship and t is the coordinate time as measured by people on Earth.^{[p 23]} Therefore, after one year of accelerating at 9.81 m/s², the spaceship will be travelling at v = 0.712c and 0.946c after three years, relative to Earth. After three years of this acceleration, with the spaceship achieving a velocity of 94.6% of the speed of light relative to Earth, time dilation will result in each second experienced on the spaceship corresponding to 3.1 seconds back on Earth. During their journey, people on Earth will experience more time than they do - since their clocks (all physical phenomena) would really be ticking 3.1 times faster than those of the spaceship. A 5-year round trip for the traveller will take 6.5 Earth years and cover a distance of over 6 light-years. A 20-year round trip for them (5 years accelerating, 5 decelerating, twice each) will land them back on Earth having travelled for 335 Earth years and a distance of 331 light years.^[74] A full 40-year trip at 1g will appear on Earth to last 58,000 years and cover a distance of 55,000 light years. A 40-year trip at 1.1g will take 148,000 Earth years and cover about 140,000 light years. A one-way 28 year (14 years accelerating, 14 decelerating as measured with the astronaut's clock) trip at 1g acceleration could reach 2,000,000 light-years to the Andromeda Galaxy.^[74] This same time dilation is why a muon travelling close to c is observed to travel much farther than c times its half-life (when at rest).^[75]

Elastic collisions

Examination of the collision products generated by particle accelerators around the world provides scientists evidence of the structure of the subatomic world and the natural laws governing it. Analysis of the collision products, the sum of whose masses may vastly exceed the masses of the incident particles, requires special relativity.^[76]

In Newtonian mechanics, analysis of collisions involves use of the conservation laws for mass, momentum and energy. In relativistic mechanics, mass is not independently conserved, because it has been subsumed into the total relativistic energy. We illustrate the differences that arise between the Newtonian and relativistic treatments of particle collisions by examining the simple case of two perfectly elastic colliding particles of equal mass. (Inelastic collisions are discussed in Spacetime#Conservation laws. Radioactive decay may be considered a sort of time-reversed inelastic collision.^[76])

Elastic scattering of charged elementary particles deviates from ideality due to the production of Bremsstrahlung radiation.^[77]^[78]

Newtonian analysis

Fig. 6-2 provides a demonstration of the result, familiar to billiard players, that if a stationary ball is struck elastically by another one of the same mass (assuming no sidespin, or "English"), then after collision, the diverging paths of the two balls will subtend a right angle. (a) In the stationary frame, an incident sphere traveling at 2v strikes a stationary sphere. (b) In the center of momentum frame, the two spheres approach each other symmetrically at ±v. After elastic collision, the two spheres rebound from each other with equal and opposite velocities ±u. Energy conservation requires that |u| = |v|. (c) Reverting to the stationary frame, the rebound velocities are v ± u. The dot product (v + u) ⋅ (v − u) = v² − u² = 0, indicating that the vectors are orthogonal.^[12]^: 26–27

Relativistic analysis

Consider the elastic collision scenario in Fig. 6-3 between a moving particle colliding with an equal mass stationary particle. Unlike the Newtonian case, the angle between the two particles after collision is less than 90°, is dependent on the angle of scattering, and becomes smaller and smaller as the velocity of the incident particle approaches the speed of light:

The relativistic momentum and total relativistic energy of a particle are given by

Conservation of momentum dictates that the sum of the momenta of the incoming particle and the stationary particle (which initially has momentum = 0) equals the sum of the momenta of the emergent particles:

Likewise, the sum of the total relativistic energies of the incoming particle and the stationary particle (which initially has total energy mc²) equals the sum of the total energies of the emergent particles:

Breaking down (6-5) into its components, replacing $v$ with the dimensionless $\beta$ , and factoring out common terms from (6-5) and (6-6) yields the following:^{[p 24]}

From these we obtain the following relationships:^{[p 24]}

For the symmetrical case in which $\phi =\theta$ and $\beta _{2}=\beta _{3},$ (6-12) takes on the simpler form:^{[p 24]}

Beyond the basics

Rapidity

Figure 7–2. Plot of the three basic Hyperbolic functions: hyperbolic sine (sinh), hyperbolic cosine (cosh) and hyperbolic tangent (tanh). Sinh is red, cosh is blue and tanh is green.

Lorentz transformations relate coordinates of events in one reference frame to those of another frame. Relativistic composition of velocities is used to add two velocities together. The formulas to perform the latter computations are nonlinear, making them more complex than the corresponding Galilean formulas.

This nonlinearity is an artifact of our choice of parameters.^[11]^: 47–59 We have previously noted that in an x–ct spacetime diagram, the points at some constant spacetime interval from the origin form an invariant hyperbola. We have also noted that the coordinate systems of two spacetime reference frames in standard configuration are hyperbolically rotated with respect to each other.

The natural functions for expressing these relationships are the hyperbolic analogs of the trigonometric functions. Fig. 7-1a shows a unit circle with sin(a) and cos(a), the only difference between this diagram and the familiar unit circle of elementary trigonometry being that a is interpreted, not as the angle between the ray and the x-axis, but as twice the area of the sector swept out by the ray from the x-axis. Numerically, the angle and 2 × area measures for the unit circle are identical. Fig. 7-1b shows a unit hyperbola with sinh(a) and cosh(a), where a is likewise interpreted as twice the tinted area.^[79] Fig. 7-2 presents plots of the sinh, cosh, and tanh functions.

For the unit circle, the slope of the ray is given by

{\text{slope}}=\tan a={\frac {\sin a}{\cos a}}.

In the Cartesian plane, rotation of point (x, y) into point (x', y') by angle θ is given by

{\begin{pmatrix}x'\\y'\\\end{pmatrix}}={\begin{pmatrix}\cos \theta &-\sin \theta \\\sin \theta &\cos \theta \\\end{pmatrix}}{\begin{pmatrix}x\\y\\\end{pmatrix}}.

In a spacetime diagram, the velocity parameter $\beta$ is the analog of slope. The rapidity, φ, is defined by^[21]^: 96–99

\beta \equiv \tanh \phi \equiv {\frac {v}{c}},

where

\tanh \phi ={\frac {\sinh \phi }{\cosh \phi }}={\frac {e^{\phi }-e^{-\phi }}{e^{\phi }+e^{-\phi }}}.

The rapidity defined above is very useful in special relativity because many expressions take on a considerably simpler form when expressed in terms of it. For example, rapidity is simply additive in the collinear velocity-addition formula;^[11]^: 47–59

\beta ={\frac {\beta _{1}+\beta _{2}}{1+\beta _{1}\beta _{2}}}=

{\frac {\tanh \phi _{1}+\tanh \phi _{2}}{1+\tanh \phi _{1}\tanh \phi _{2}}}=

\tanh(\phi _{1}+\phi _{2}),

or in other words, $\phi =\phi _{1}+\phi _{2}.$

The Lorentz transformations take a simple form when expressed in terms of rapidity. The γ factor can be written as

\gamma ={\frac {1}{\sqrt {1-\beta ^{2}}}}={\frac {1}{\sqrt {1-\tanh ^{2}\phi }}}

=\cosh \phi ,

\gamma \beta ={\frac {\beta }{\sqrt {1-\beta ^{2}}}}={\frac {\tanh \phi }{\sqrt {1-\tanh ^{2}\phi }}}

=\sinh \phi .

Transformations describing relative motion with uniform velocity and without rotation of the space coordinate axes are called boosts.

Substituting γ and γβ into the transformations as previously presented and rewriting in matrix form, the Lorentz boost in the x-direction may be written as

{\begin{pmatrix}ct'\\x'\end{pmatrix}}={\begin{pmatrix}\cosh \phi &-\sinh \phi \\-\sinh \phi &\cosh \phi \end{pmatrix}}{\begin{pmatrix}ct\\x\end{pmatrix}},

and the inverse Lorentz boost in the x-direction may be written as

{\begin{pmatrix}ct\\x\end{pmatrix}}={\begin{pmatrix}\cosh \phi &\sinh \phi \\\sinh \phi &\cosh \phi \end{pmatrix}}{\begin{pmatrix}ct'\\x'\end{pmatrix}}.

In other words, Lorentz boosts represent hyperbolic rotations in Minkowski spacetime.^[21]^: 96–99

The advantages of using hyperbolic functions are such that some textbooks such as the classic ones by Taylor and Wheeler introduce their use at a very early stage.^[11]^{[note 11]}

4‑vectors

Four‑vectors have been mentioned above in context of the energy–momentum 4‑vector, but without any great emphasis. Indeed, none of the elementary derivations of special relativity require them. But once understood, 4‑vectors, and more generally tensors, greatly simplify the mathematics and conceptual understanding of special relativity. Working exclusively with such objects leads to formulas that are manifestly relativistically invariant, which is a considerable advantage in non-trivial contexts. For instance, demonstrating relativistic invariance of Maxwell's equations in their usual form is not trivial, while it is merely a routine calculation, really no more than an observation, using the field strength tensor formulation.^[80]

On the other hand, general relativity, from the outset, relies heavily on 4‑vectors, and more generally tensors, representing physically relevant entities. Relating these via equations that do not rely on specific coordinates requires tensors, capable of connecting such 4‑vectors even within a curved spacetime, and not just within a flat one as in special relativity. The study of tensors is outside the scope of this article, which provides only a basic discussion of spacetime.

Definition of 4-vectors

A 4-tuple, ⁠ $A=\left(A_{0},A_{1},A_{2},A_{3}\right)$ ⁠ is a "4-vector" if its component A_i transform between frames according to the Lorentz transformation.

If using ⁠ $(ct,x,y,z)$ ⁠ coordinates, A is a 4–vector if it transforms (in the x-direction) according to

{\begin{aligned}A_{0}'&=\gamma \left(A_{0}-(v/c)A_{1}\right)\\A_{1}'&=\gamma \left(A_{1}-(v/c)A_{0}\right)\\A_{2}'&=A_{2}\\A_{3}'&=A_{3}\end{aligned}}

which comes from simply replacing ct with A₀ and x with A₁ in the earlier presentation of the Lorentz transformation.

As usual, when we write x, t, etc. we generally mean Δx, Δt etc.

The last three components of a 4–vector must be a standard vector in three-dimensional space. Therefore, a 4–vector must transform like ⁠ $(c\Delta t,\Delta x,\Delta y,\Delta z)$ ⁠ under Lorentz transformations as well as rotations.^[81]^: 36–59

Properties of 4-vectors

Closure under linear combination: If A and B are 4-vectors, then ⁠ $C=aA+aB$ ⁠ is also a 4-vector.
Inner-product invariance: If A and B are 4-vectors, then their inner product (scalar product) is invariant, i.e. their inner product is independent of the frame in which it is calculated. Note how the calculation of inner product differs from the calculation of the inner product of a 3-vector. In the following, ${\vec {A}}$ and ${\vec {B}}$ are 3-vectors:
$A\cdot B\equiv$ $A_{0}B_{0}-A_{1}B_{1}-A_{2}B_{2}-A_{3}B_{3}\equiv$ $A_{0}B_{0}-{\vec {A}}\cdot {\vec {B}}$

In addition to being invariant under Lorentz transformation, the above inner product is also invariant under rotation in 3-space.

Two vectors are said to be orthogonal if

A\cdot B=0.

Unlike the case with 3-vectors, orthogonal 4-vectors are not necessarily at right angles with each other. The rule is that two 4-vectors are orthogonal if they are offset by equal and opposite angles from the 45° line which is the world line of a light ray. This implies that a lightlike 4-vector is orthogonal with itself.

Invariance of the magnitude of a vector: The magnitude of a vector is the inner product of a 4-vector with itself, and is a frame-independent property. As with intervals, the magnitude may be positive, negative or zero, so that the vectors are referred to as timelike, spacelike or null (lightlike). Note that a null vector is not the same as a zero vector. A null vector is one for which $A\cdot A=0,$ while a zero vector is one whose components are all zero. Special cases illustrating the invariance of the norm include the invariant interval $c^{2}t^{2}-x^{2}$ and the invariant length of the relativistic momentum vector $E^{2}-p^{2}c^{2}.$ ^[21]^{: 178–181}^[81]^: 36–59

Examples of 4-vectors

Displacement 4-vector: Otherwise known as the spacetime separation, this is (Δt, Δx, Δy, Δz), or for infinitesimal separations, (dt, dx, dy, dz).
$dS\equiv (dt,dx,dy,dz)$
Velocity 4-vector: This results when the displacement 4-vector is divided by $d\tau$ , where $d\tau$ is the proper time between the two events that yield dt, dx, dy, and dz.
$V\equiv {\frac {dS}{d\tau }}={\frac {(dt,dx,dy,dz)}{dt/\gamma }}=$ $\gamma \left(1,{\frac {dx}{dt}},{\frac {dy}{dt}},{\frac {dz}{dt}}\right)=$ $(\gamma ,\gamma {\vec {v}})$

The 4-velocity is tangent to the world line of a particle, and has a length equal to one unit of time in the frame of the particle.

An accelerated particle does not have an inertial frame in which it is always at rest. However, an inertial frame can always be found which is momentarily comoving with the particle. This frame, the momentarily comoving reference frame (MCRF), enables application of special relativity to the analysis of accelerated particles.

Since photons move on null lines,

d\tau =0

for a photon, and a 4-velocity cannot be defined. There is no frame in which a photon is at rest, and no MCRF can be established along a photon's path.

Energy–momentum 4-vector:
$P\equiv (E/c,{\vec {p}})=(E/c,p_{x},p_{y},p_{z})$

As indicated before, there are varying treatments for the energy-momentum 4-vector so that one may also see it expressed as

(E,{\vec {p}})

(E,{\vec {p}}c).

The first component is the total energy (including mass) of the particle (or system of particles) in a given frame, while the remaining components are its spatial momentum. The energy-momentum 4-vector is a conserved quantity.

Acceleration 4-vector: This results from taking the derivative of the velocity 4-vector with respect to $\tau .$
$A\equiv {\frac {dV}{d\tau }}=$ ${\frac {d}{d\tau }}(\gamma ,\gamma {\vec {v}})=$ $\gamma \left({\frac {d\gamma }{dt}},{\frac {d(\gamma {\vec {v}})}{dt}}\right)$
Force 4-vector: This is the derivative of the momentum 4-vector with respect to $\tau .$
$F\equiv {\frac {dP}{d\tau }}=$ $\gamma \left({\frac {dE}{dt}},{\frac {d{\vec {p}}}{dt}}\right)=$ $\gamma \left({\frac {dE}{dt}},{\vec {f}}\right)$

As expected, the final components of the above 4-vectors are all standard 3-vectors corresponding to spatial 3-momentum, 3-force etc.^[21]^{: 178–181}^[81]^: 36–59

4-vectors and physical law

The first postulate of special relativity declares the equivalency of all inertial frames. A physical law holding in one frame must apply in all frames, since otherwise it would be possible to differentiate between frames. Newtonian momenta fail to behave properly under Lorentzian transformation, and Einstein preferred to change the definition of momentum to one involving 4-vectors rather than give up on conservation of momentum.

Physical laws must be based on constructs that are frame independent. This means that physical laws may take the form of equations connecting scalars, which are always frame independent. However, equations involving 4-vectors require the use of tensors with appropriate rank, which themselves can be thought of as being built up from 4-vectors.^[21]^: 186

Acceleration

It is a common misconception that special relativity is applicable only to inertial frames, and that it is unable to handle accelerating objects or accelerating reference frames. Actually, accelerating objects can generally be analyzed without needing to deal with accelerating frames at all. It is only when gravitation is significant that general relativity is required.^[82]

Properly handling accelerating frames does require some care, however. The difference between special and general relativity is that (1) In special relativity, all velocities are relative, but acceleration is absolute. (2) In general relativity, all motion is relative, whether inertial, accelerating, or rotating. To accommodate this difference, general relativity uses curved spacetime.^[82]

In this section, we analyze several scenarios involving accelerated reference frames.

Dewan–Beran–Bell spaceship paradox

The Dewan–Beran–Bell spaceship paradox (Bell's spaceship paradox) is a good example of a problem where intuitive reasoning unassisted by the geometric insight of the spacetime approach can lead to issues.

In Fig. 7-4, two identical spaceships float in space and are at rest relative to each other. They are connected by a string which is capable of only a limited amount of stretching before breaking. At a given instant in our frame, the observer frame, both spaceships accelerate in the same direction along the line between them with the same constant proper acceleration.^{[note 12]} Will the string break?

When the paradox was new and relatively unknown, even professional physicists had difficulty working out the solution. Two lines of reasoning lead to opposite conclusions. Both arguments, which are presented below, are flawed even though one of them yields the correct answer.^[21]^{: 106, 120–122}

To observers in the rest frame, the spaceships start a distance L apart and remain the same distance apart during acceleration. During acceleration, L is a length contracted distance of the distance L' = γL in the frame of the accelerating spaceships. After a sufficiently long time, γ will increase to a sufficiently large factor that the string must break.
Let A and B be the rear and front spaceships. In the frame of the spaceships, each spaceship sees the other spaceship doing the same thing that it is doing. A says that B has the same acceleration that he has, and B sees that A matches her every move. So the spaceships stay the same distance apart, and the string does not break.^[21]^{: 106, 120–122}

The problem with the first argument is that there is no "frame of the spaceships." There cannot be, because the two spaceships measure a growing distance between the two. Because there is no common frame of the spaceships, the length of the string is ill-defined. Nevertheless, the conclusion is correct, and the argument is mostly right. The second argument, however, completely ignores the relativity of simultaneity.^[21]^{: 106, 120–122}

A spacetime diagram (Fig. 7-5) makes the correct solution to this paradox almost immediately evident. Two observers in Minkowski spacetime accelerate with constant magnitude $k$ acceleration for proper time $\sigma$ (acceleration and elapsed time measured by the observers themselves, not some inertial observer). They are comoving and inertial before and after this phase. In Minkowski geometry, the length along the line of simultaneity $A'B''$ turns out to be greater than the length along the line of simultaneity $AB$ .

The length increase can be calculated with the help of the Lorentz transformation. If, as illustrated in Fig. 7-5, the acceleration is finished, the ships will remain at a constant offset in some frame $S'.$ If $x_{A}$ and $x_{B}=x_{A}+L$ are the ships' positions in $S,$ the positions in frame $S'$ are:^[83]

{\begin{aligned}x'_{A}&=\gamma \left(x_{A}-vt\right)\\x'_{B}&=\gamma \left(x_{A}+L-vt\right)\\L'&=x'_{B}-x'_{A}=\gamma L\end{aligned}}

The "paradox", as it were, comes from the way that Bell constructed his example. In the usual discussion of Lorentz contraction, the rest length is fixed and the moving length shortens as measured in frame $S$ . As shown in Fig. 7-5, Bell's example asserts the moving lengths $AB$ and $A'B'$ measured in frame $S$ to be fixed, thereby forcing the rest frame length $A'B''$ in frame $S'$ to increase.

Accelerated observer with horizon

Certain special relativity problem setups can lead to insight about phenomena normally associated with general relativity, such as event horizons. In the text accompanying Section "Invariant hyperbola" of the article Spacetime, the magenta hyperbolae represented actual paths that are tracked by a constantly accelerating traveler in spacetime. During periods of positive acceleration, the traveler's velocity just approaches the speed of light, while, measured in our frame, the traveler's acceleration constantly decreases.

Fig. 7-6 details various features of the traveler's motions with more specificity. At any given moment, her space axis is formed by a line passing through the origin and her current position on the hyperbola, while her time axis is the tangent to the hyperbola at her position. The velocity parameter $\beta$ approaches a limit of one as $ct$ increases. Likewise, $\gamma$ approaches infinity.

The shape of the invariant hyperbola corresponds to a path of constant proper acceleration. This is demonstrable as follows:

We remember that $\beta =ct/x.$
Since $c^{2}t^{2}-x^{2}=s^{2},$ we conclude that $\beta (ct)=ct/{\sqrt {c^{2}t^{2}-s^{2}}}.$
$\gamma =1/{\sqrt {1-\beta ^{2}}}=$ ${\sqrt {c^{2}t^{2}-s^{2}}}/s$
From the relativistic force law, $F=dp/dt=$ $dpc/d(ct)=d(\beta \gamma mc^{2})/d(ct).$
Substituting $\beta (ct)$ from step 2 and the expression for $\gamma$ from step 3 yields $F=mc^{2}/s,$ which is a constant expression.^[84]^{: 110–113}

Fig. 7-6 illustrates a specific calculated scenario. Terence (A) and Stella (B) initially stand together 100 light hours from the origin. Stella lifts off at time 0, her spacecraft accelerating at 0.01 c per hour. Every twenty hours, Terence radios updates to Stella about the situation at home (solid green lines). Stella receives these regular transmissions, but the increasing distance (offset in part by time dilation) causes her to receive Terence's communications later and later as measured on her clock, and she never receives any communications from Terence after 100 hours on his clock (dashed green lines).^[84]^{: 110–113}

After 100 hours according to Terence's clock, Stella enters a dark region. She has traveled outside Terence's timelike future. On the other hand, Terence can continue to receive Stella's messages to him indefinitely. He just has to wait long enough. Spacetime has been divided into distinct regions separated by an apparent event horizon. So long as Stella continues to accelerate, she can never know what takes place behind this horizon.^[84]^{: 110–113}

Relativity and unifying electromagnetism

Theoretical investigation in classical electromagnetism led to the discovery of wave propagation. Equations generalizing the electromagnetic effects found that finite propagation speed of the E and B fields required certain behaviors on charged particles. The general study of moving charges forms the Liénard–Wiechert potential, which is a step towards special relativity.

The Lorentz transformation of the electric field of a moving charge into a non-moving observer's reference frame results in the appearance of a mathematical term commonly called the magnetic field. Conversely, the magnetic field generated by a moving charge disappears and becomes a purely electrostatic field in a comoving frame of reference. Maxwell's equations are thus simply an empirical fit to special relativistic effects in a classical model of the Universe. As electric and magnetic fields are reference frame dependent and thus intertwined, one speaks of electromagnetic fields. Special relativity provides the transformation rules for how an electromagnetic field in one inertial frame appears in another inertial frame.

Maxwell's equations in the 3D form are already consistent with the physical content of special relativity, although they are easier to manipulate in a manifestly covariant form, that is, in the language of tensor calculus.^[80]

Theories of relativity and quantum mechanics

Special relativity can be combined with quantum mechanics to form relativistic quantum mechanics and quantum electrodynamics. How general relativity and quantum mechanics can be unified is one of the unsolved problems in physics; quantum gravity and a "theory of everything", which require a unification including general relativity too, are active and ongoing areas in theoretical research.

The early Bohr–Sommerfeld atomic model explained the fine structure of alkali metal atoms using both special relativity and the preliminary knowledge on quantum mechanics of the time.^[85]

In 1928, Paul Dirac constructed an influential relativistic wave equation, now known as the Dirac equation in his honour,^{[p 25]} that is fully compatible both with special relativity and with the final version of quantum theory existing after 1926. This equation not only described the intrinsic angular momentum of the electrons called spin, it also led to the prediction of the antiparticle of the electron (the positron),^{[p 25]}^{[p 26]} and fine structure could only be fully explained with special relativity. It was the first foundation of relativistic quantum mechanics.

On the other hand, the existence of antiparticles leads to the conclusion that relativistic quantum mechanics is not enough for a more accurate and complete theory of particle interactions. Instead, a theory of particles interpreted as quantized fields, called quantum field theory, becomes necessary; in which particles can be created and destroyed throughout space and time.

Status

Special relativity in its Minkowski spacetime is accurate only when the absolute value of the gravitational potential is much less than c² in the region of interest.^[86] In a strong gravitational field, one must use general relativity. General relativity becomes special relativity at the limit of a weak field. At very small scales, such as at the Planck length and below, quantum effects must be taken into consideration resulting in quantum gravity. But at macroscopic scales and in the absence of strong gravitational fields, special relativity is experimentally tested to extremely high degree of accuracy (10⁻²⁰)^[87]and thus accepted by the physics community. Experimental results which appear to contradict it are not reproducible and are thus widely believed to be due to experimental errors.^[88]

Special relativity is mathematically self-consistent, and it is an organic part of all modern physical theories, most notably quantum field theory, string theory, and general relativity (in the limiting case of negligible gravitational fields).

Newtonian mechanics mathematically follows from special relativity at small velocities (compared to the speed of light) – thus Newtonian mechanics can be considered as a special relativity of slow moving bodies. See classical mechanics for a more detailed discussion.

Several experiments predating Einstein's 1905 paper are now interpreted as evidence for relativity. Of these it is known Einstein was aware of the Fizeau experiment before 1905,^[89] and historians have concluded that Einstein was at least aware of the Michelson–Morley experiment as early as 1899 despite claims he made in his later years that it played no role in his development of the theory.^[16]

The Fizeau experiment (1851, repeated by Michelson and Morley in 1886) measured the speed of light in moving media, with results that are consistent with relativistic addition of colinear velocities.
The famous Michelson–Morley experiment (1881, 1887) gave further support to the postulate that detecting an absolute reference velocity was not achievable. It should be stated here that, contrary to many alternative claims, it said little about the invariance of the speed of light with respect to the source and observer's velocity, as both source and observer were travelling together at the same velocity at all times.
The Trouton–Noble experiment (1903) showed that the torque on a capacitor is independent of position and inertial reference frame.
The Experiments of Rayleigh and Brace (1902, 1904) showed that length contraction does not lead to birefringence for a co-moving observer, in accordance with the relativity principle.

Particle accelerators accelerate and measure the properties of particles moving at near the speed of light, where their behavior is consistent with relativity theory and inconsistent with the earlier Newtonian mechanics. These machines would simply not work if they were not engineered according to relativistic principles. In addition, a considerable number of modern experiments have been conducted to test special relativity. Some examples:

Tests of relativistic energy and momentum – testing the limiting speed of particles
Ives–Stilwell experiment – testing relativistic Doppler effect and time dilation
Experimental testing of time dilation – relativistic effects on a fast-moving particle's half-life
Kennedy–Thorndike experiment – time dilation in accordance with Lorentz transformations
Hughes–Drever experiment – testing isotropy of space and mass
Modern searches for Lorentz violation – various modern tests
Experiments to test emission theory demonstrated that the speed of light is independent of the speed of the emitter.
Experiments to test the aether drag hypothesis – no "aether flow obstruction".

Technical discussion of spacetime

Geometry of spacetime

Comparison between flat Euclidean space and Minkowski space

Special relativity uses a "flat" 4-dimensional Minkowski space – an example of a spacetime. Minkowski spacetime appears to be very similar to the standard 3-dimensional Euclidean space, but there is a crucial difference with respect to time.

In 3D space, the differential of distance (line element) ds is defined by $ds^{2}=d\mathbf {x} \cdot d\mathbf {x} =dx_{1}^{2}+dx_{2}^{2}+dx_{3}^{2},$ where dx = (dx₁, dx₂, dx₃) are the differentials of the three spatial dimensions. In Minkowski geometry, there is an extra dimension with coordinate X⁰ derived from time, such that the distance differential fulfills $ds^{2}=-dX_{0}^{2}+dX_{1}^{2}+dX_{2}^{2}+dX_{3}^{2},$ where dX = (dX₀, dX₁, dX₂, dX₃) are the differentials of the four spacetime dimensions. This suggests a deep theoretical insight: special relativity is simply a rotational symmetry of our spacetime, analogous to the rotational symmetry of Euclidean space (see Fig. 10-1).^[91] Just as Euclidean space uses a Euclidean metric, so spacetime uses a Minkowski metric. Basically, special relativity can be stated as the invariance of any spacetime interval (that is the 4D distance between any two events) when viewed from any inertial reference frame. All equations and effects of special relativity can be derived from this rotational symmetry (the Poincaré group) of Minkowski spacetime.

The actual form of ds above depends on the metric and on the choices for the X⁰ coordinate. To make the time coordinate look like the space coordinates, it can be treated as imaginary: X₀ = ict (this is called a Wick rotation). According to Misner, Thorne and Wheeler (1971, §2.3), ultimately the deeper understanding of both special and general relativity will come from the study of the Minkowski metric (described below) and to take X⁰ = ct, rather than a "disguised" Euclidean metric using ict as the time coordinate.

Some authors use X⁰ = t, with factors of c elsewhere to compensate; for instance, spatial coordinates are divided by c or factors of c^±2 are included in the metric tensor.^[92]These numerous conventions can be superseded by using natural units where c = 1. Then space and time have equivalent units, and no factors of c appear anywhere.

3D spacetime

If we reduce the spatial dimensions to 2, so that we can represent the physics in a 3D space $ds^{2}=dx_{1}^{2}+dx_{2}^{2}-c^{2}dt^{2},$ we see that the null geodesics lie along a dual-cone (see Fig. 10-2) defined by the equation; $ds^{2}=0=dx_{1}^{2}+dx_{2}^{2}-c^{2}dt^{2}$ or simply $dx_{1}^{2}+dx_{2}^{2}=c^{2}dt^{2},$ which is the equation of a circle of radius c dt.

4D spacetime

If we extend this to three spatial dimensions, the null geodesics are the 4-dimensional cone: $ds^{2}=0=dx_{1}^{2}+dx_{2}^{2}+dx_{3}^{2}-c^{2}dt^{2}$ so $dx_{1}^{2}+dx_{2}^{2}+dx_{3}^{2}=c^{2}dt^{2}.$

As illustrated in Fig. 10-3, the null geodesics can be visualized as a set of continuous concentric spheres with radii = c dt.

This null dual-cone represents the "line of sight" of a point in space. That is, when we look at the stars and say "The light from that star which I am receiving is X years old", we are looking down this line of sight: a null geodesic. We are looking at an event a distance ${\textstyle d={\sqrt {x_{1}^{2}+x_{2}^{2}+x_{3}^{2}}}}$ away and a time d/c in the past. For this reason the null dual cone is also known as the "light cone". (The point in the lower left of the Fig. 10-2 represents the star, the origin represents the observer, and the line represents the null geodesic "line of sight".)

The cone in the −t region is the information that the point is "receiving", while the cone in the +t section is the information that the point is "sending".

The geometry of Minkowski space can be depicted using Minkowski diagrams, which are useful also in understanding many of the thought experiments in special relativity.

Physics in spacetime

Transformations of physical quantities between reference frames

Above, the Lorentz transformation for the time coordinate and three space coordinates illustrates that they are intertwined. This is true more generally: certain pairs of "timelike" and "spacelike" quantities naturally combine on equal footing under the same Lorentz transformation.

The Lorentz transformation in standard configuration above, that is, for a boost in the x-direction, can be recast into matrix form as follows:

${\begin{pmatrix}ct'\\x'\\y'\\z'\end{pmatrix}}={\begin{pmatrix}\gamma &-\beta \gamma &0&0\\-\beta \gamma &\gamma &0&0\\0&0&1&0\\0&0&0&1\end{pmatrix}}{\begin{pmatrix}ct\\x\\y\\z\end{pmatrix}}={\begin{pmatrix}\gamma ct-\gamma \beta x\\\gamma x-\beta \gamma ct\\y\\z\end{pmatrix}}.$

In Newtonian mechanics, quantities that have magnitude and direction are mathematically described as 3d vectors in Euclidean space, and in general they are parametrized by time. In special relativity, this notion is extended by adding the appropriate timelike quantity to a spacelike vector quantity, and we have 4d vectors, or "four-vectors", in Minkowski spacetime. The components of vectors are written using tensor index notation, as this has numerous advantages. The notation makes it clear the equations are manifestly covariant under the Poincaré group, thus bypassing the tedious calculations to check this fact. In constructing such equations, we often find that equations previously thought to be unrelated are, in fact, closely connected being part of the same tensor equation. Recognizing other physical quantities as tensors simplifies their transformation laws. Throughout, upper indices (superscripts) are contravariant indices rather than exponents except when they indicate a square (this should be clear from the context), and lower indices (subscripts) are covariant indices. For simplicity and consistency with the earlier equations, Cartesian coordinates will be used.

The simplest example of a four-vector is the position of an event in spacetime, which constitutes a timelike component ct and spacelike component x = (x, y, z), in a contravariant position four-vector with components: $X^{\nu }=(X^{0},X^{1},X^{2},X^{3})=(ct,x,y,z)=(ct,\mathbf {x} ).$ where we define X⁰ = ct so that the time coordinate has the same dimension of distance as the other spatial dimensions; so that space and time are treated equally.^[93]^[94]^[95] Now the transformation of the contravariant components of the position 4-vector can be compactly written as: $X^{\mu '}=\Lambda ^{\mu '}{}_{\nu }X^{\nu }$ where there is an implied summation on $\nu$ from 0 to 3, and $\Lambda ^{\mu '}{}_{\nu }$ is a matrix.

More generally, all contravariant components of a four-vector $T^{\nu }$ transform from one frame to another frame by a Lorentz transformation: $T^{\mu '}=\Lambda ^{\mu '}{}_{\nu }T^{\nu }$

Examples of other 4-vectors include the four-velocity $U^{\mu },$ defined as the derivative of the position 4-vector with respect to proper time: $U^{\mu }={\frac {dX^{\mu }}{d\tau }}=\gamma (v)(c,v_{x},v_{y},v_{z})=\gamma (v)(c,\mathbf {v} ).$ where the Lorentz factor is: $\gamma (v)={\frac {1}{\sqrt {1-v^{2}/c^{2}}}}\qquad v^{2}=v_{x}^{2}+v_{y}^{2}+v_{z}^{2}.$

The relativistic energy $E=\gamma (v)mc^{2}$ and relativistic momentum $\mathbf {p} =\gamma (v)m\mathbf {v}$ of an object are respectively the timelike and spacelike components of a contravariant four-momentum vector: $P^{\mu }=mU^{\mu }=m\gamma (v)(c,v_{x},v_{y},v_{z})=\left({\frac {E}{c}},p_{x},p_{y},p_{z}\right)=\left({\frac {E}{c}},\mathbf {p} \right).$ where m is the invariant mass.

The four-acceleration is the proper time derivative of 4-velocity: $A^{\mu }={\frac {dU^{\mu }}{d\tau }}.$

The transformation rules for three-dimensional velocities and accelerations are very awkward; even above in standard configuration the velocity equations are quite complicated owing to their non-linearity. On the other hand, the transformation of four-velocity and four-acceleration are simpler by means of the Lorentz transformation matrix.

The four-gradient of a scalar field φ transforms covariantly rather than contravariantly: ${\begin{pmatrix}{\dfrac {1}{c}}{\dfrac {\partial \phi }{\partial t'}}&{\dfrac {\partial \phi }{\partial x'}}&{\dfrac {\partial \phi }{\partial y'}}&{\dfrac {\partial \phi }{\partial z'}}\end{pmatrix}}={\begin{pmatrix}{\dfrac {1}{c}}{\dfrac {\partial \phi }{\partial t}}&{\dfrac {\partial \phi }{\partial x}}&{\dfrac {\partial \phi }{\partial y}}&{\dfrac {\partial \phi }{\partial z}}\end{pmatrix}}{\begin{pmatrix}\gamma &+\beta \gamma &0&0\\+\beta \gamma &\gamma &0&0\\0&0&1&0\\0&0&0&1\end{pmatrix}}.$ which is the transpose of: $(\partial _{\mu '}\phi )=\Lambda _{\mu '}{}^{\nu }(\partial _{\nu }\phi )\qquad \partial _{\mu }\equiv {\frac {\partial }{\partial x^{\mu }}}.$ only in Cartesian coordinates. It is the covariant derivative which transforms in manifest covariance, in Cartesian coordinates this happens to reduce to the partial derivatives, but not in other coordinates.

More generally, the covariant components of a 4-vector transform according to the inverse Lorentz transformation: $T_{\mu '}=\Lambda _{\mu '}{}^{\nu }T_{\nu },$ where $\Lambda _{\mu '}{}^{\nu }$ is the reciprocal matrix of $\Lambda ^{\mu '}{}_{\nu }$ .

The postulates of special relativity constrain the exact form the Lorentz transformation matrices take.

More generally, most physical quantities are best described as (components of) tensors. So to transform from one frame to another, we use the well-known tensor transformation law^[96] $T_{\theta '\iota '\cdots \kappa '}^{\alpha '\beta '\cdots \zeta '}=\Lambda ^{\alpha '}{}_{\mu }\Lambda ^{\beta '}{}_{\nu }\cdots \Lambda ^{\zeta '}{}_{\rho }\Lambda _{\theta '}{}^{\sigma }\Lambda _{\iota '}{}^{\upsilon }\cdots \Lambda _{\kappa '}{}^{\phi }T_{\sigma \upsilon \cdots \phi }^{\mu \nu \cdots \rho }$ where $\Lambda _{\chi '}{}^{\psi }$ is the reciprocal matrix of $\Lambda ^{\chi '}{}_{\psi }$ . All tensors transform by this rule.

An example of a four-dimensional second order antisymmetric tensor is the relativistic angular momentum, which has six components: three are the classical angular momentum, and the other three are related to the boost of the center of mass of the system. The derivative of the relativistic angular momentum with respect to proper time is the relativistic torque, also second order antisymmetric tensor.

The electromagnetic field tensor is another second order antisymmetric tensor field, with six components: three for the electric field and another three for the magnetic field. There is also the stress–energy tensor for the electromagnetic field, namely the electromagnetic stress–energy tensor.

Metric

The metric tensor allows one to define the inner product of two vectors, which in turn allows one to assign a magnitude to the vector. Given the four-dimensional nature of spacetime the Minkowski metric η has components (valid with suitably chosen coordinates) which can be arranged in a 4 × 4 matrix: $\eta _{\alpha \beta }={\begin{pmatrix}-1&0&0&0\\0&1&0&0\\0&0&1&0\\0&0&0&1\end{pmatrix}}$ which is equal to its reciprocal, $\eta ^{\alpha \beta }$ , in those frames. Throughout we use the signs as above, different authors use different conventions – see Minkowski metric alternative signs.

The Poincaré group is the most general group of transformations which preserves the Minkowski metric: $\eta _{\alpha \beta }=\eta _{\mu '\nu '}\Lambda ^{\mu '}{}_{\alpha }\Lambda ^{\nu '}{}_{\beta }$ and this is the physical symmetry underlying special relativity.

The metric can be used for raising and lowering indices on vectors and tensors. Invariants can be constructed using the metric, the inner product of a 4-vector T with another 4-vector S is: $T^{\alpha }S_{\alpha }=T^{\alpha }\eta _{\alpha \beta }S^{\beta }=T_{\alpha }\eta ^{\alpha \beta }S_{\beta }={\text{invariant scalar}}$

Invariant means that it takes the same value in all inertial frames, because it is a scalar (0 rank tensor), and so no $Λ$ appears in its trivial transformation. The magnitude of the 4-vector T is the positive square root of the inner product with itself: $|\mathbf {T} |={\sqrt {T^{\alpha }T_{\alpha }}}$

One can extend this idea to tensors of higher order, for a second order tensor we can form the invariants: $T^{\alpha }{}_{\alpha },T^{\alpha }{}_{\beta }T^{\beta }{}_{\alpha },T^{\alpha }{}_{\beta }T^{\beta }{}_{\gamma }T^{\gamma }{}_{\alpha }={\text{invariant scalars}},$ similarly for higher order tensors. Invariant expressions, particularly inner products of 4-vectors with themselves, provide equations that are useful for calculations, because one does not need to perform Lorentz transformations to determine the invariants.

Relativistic kinematics and invariance

The coordinate differentials transform also contravariantly: $dX^{\mu '}=\Lambda ^{\mu '}{}_{\nu }dX^{\nu }$ so the squared length of the differential of the position four-vector dX^μ constructed using $d\mathbf {X} ^{2}=dX^{\mu }\,dX_{\mu }=\eta _{\mu \nu }\,dX^{\mu }\,dX^{\nu }=-(c\,dt)^{2}+(dx)^{2}+(dy)^{2}+(dz)^{2}$ is an invariant. Notice that when the line element dX² is negative that $\sqrt - d X 2$ is the differential of proper time, while when dX² is positive, $\sqrt d X 2$ is differential of the proper distance.

The 4-velocity U^μ has an invariant form: $\mathbf {U} ^{2}=\eta _{\nu \mu }U^{\nu }U^{\mu }=-c^{2}\,,$ which means all velocity four-vectors have a magnitude of c. This is an expression of the fact that there is no such thing as being at coordinate rest in relativity: at the least, you are always moving forward through time. Differentiating the above equation by τ produces: $2\eta _{\mu \nu }A^{\mu }U^{\nu }=0.$ So in special relativity, the acceleration four-vector and the velocity four-vector are orthogonal.

Relativistic dynamics and invariance

The invariant magnitude of the momentum 4-vector generates the energy–momentum relation: $\mathbf {P} ^{2}=\eta ^{\mu \nu }P_{\mu }P_{\nu }=-\left({\frac {E}{c}}\right)^{2}+p^{2}.$

We can work out what this invariant is by first arguing that, since it is a scalar, it does not matter in which reference frame we calculate it, and then by transforming to a frame where the total momentum is zero. $\mathbf {P} ^{2}=-\left({\frac {E_{\text{rest}}}{c}}\right)^{2}=-(mc)^{2}.$

We see that the rest energy is an independent invariant. A rest energy can be calculated even for particles and systems in motion, by translating to a frame in which momentum is zero.

The rest energy is related to the mass according to the celebrated equation discussed above: $E_{\text{rest}}=mc^{2}.$

The mass of systems measured in their center of momentum frame (where total momentum is zero) is given by the total energy of the system in this frame. It may not be equal to the sum of individual system masses measured in other frames.

To use Newton's third law of motion, both forces must be defined as the rate of change of momentum with respect to the same time coordinate. That is, it requires the 3D force defined above. Unfortunately, there is no tensor in 4D which contains the components of the 3D force vector among its components.

If a particle is not traveling at c, one can transform the 3D force from the particle's co-moving reference frame into the observer's reference frame. This yields a 4-vector called the four-force. It is the rate of change of the above energy momentum four-vector with respect to proper time. The covariant version of the four-force is: $F_{\nu }={\frac {dP_{\nu }}{d\tau }}=mA_{\nu }$

In the rest frame of the object, the time component of the four-force is zero unless the "invariant mass" of the object is changing (this requires a non-closed system in which energy/mass is being directly added or removed from the object) in which case it is the negative of that rate of change of mass, times c. In general, though, the components of the four-force are not equal to the components of the three-force, because the three force is defined by the rate of change of momentum with respect to coordinate time, that is, dp/dt while the four-force is defined by the rate of change of momentum with respect to proper time, that is, dp/dτ.

In a continuous medium, the 3D density of force combines with the density of power to form a covariant 4-vector. The spatial part is the result of dividing the force on a small cell (in 3-space) by the volume of that cell. The time component is −1/c times the power transferred to that cell divided by the volume of the cell. This will be used below in the section on electromagnetism.

Notes

^ Einstein himself, in The Foundations of the General Theory of Relativity, Ann. Phys. 49 (1916), writes "The word 'special' is meant to intimate that the principle is restricted to the case ...". See p. 111 of The Principle of Relativity, A. Einstein, H. A. Lorentz, H. Weyl, H. Minkowski, Dover reprint of 1923 translation by Methuen and Company.]
^ Wald, General Relativity, p. 60: "... the special theory of relativity asserts that spacetime is the manifold $\mathbb {R} ^{4}$ with a flat metric of Lorentz signature defined on it. Conversely, the entire content of special relativity ... is contained in this statement ..."
^ In a spacetime setting, the length of a moving rigid object is the spatial distance between the ends of the object measured at the same time. In the rest frame of the object the simultaneity is not required.
^ The results of the Michelson–Morley experiment led George Francis FitzGerald and Hendrik Lorentz independently to propose the phenomenon of length contraction. Lorentz believed that length contraction represented a physical contraction of the atoms making up an object. He envisioned no fundamental change in the nature of space and time.^[27]^: 62–68
Lorentz expected that length contraction would result in compressive strains in an object that should result in measurable effects. Such effects would include optical effects in transparent media, such as optical rotation^{[p 11]} and induction of double refraction,^{[p 12]} and the induction of torques on charged condensers moving at an angle with respect to the aether.^{[p 12]} Lorentz was perplexed by experiments such as the Trouton–Noble experiment and the experiments of Rayleigh and Brace which failed to validate his theoretical expectations.^[27]
^ For mathematical consistency, Lorentz proposed a new time variable, the "local time", called that because it depended on the position of a moving body, following the relation t′ = t − vx/c².^{[p 13]} Lorentz considered local time not to be "real"; rather, it represented an ad hoc change of variable.^[28]^: 51, 80
Impressed by Lorentz's "most ingenious idea", Poincaré saw more in local time than a mere mathematical trick. It represented the actual time that would be shown on a moving observer's clocks. On the other hand, Poincaré did not consider this measured time to be the "true time" that would be exhibited by clocks at rest in the aether. Poincaré made no attempt to redefine the concepts of space and time. To Poincaré, Lorentz transformation described the apparent states of the field for a moving observer. True states remained those defined with respect to the ether.^[29]
^ This concept is counterintuitive at least for the fact that, in contrast to usual concepts of distance, it may assume negative values (is not positive definite for non-coinciding events), and that the square-denotation is misleading. This negative square lead to, now not broadly used, concepts of imaginary time. It is immediate that the negative of Δs² is also an invariant, generated by a variant of the metric signature of spacetime.
^ The invariance of Δs² under standard Lorentz transformation in analogous to the invariance of squared distances Δr² under rotations in Euclidean space. Although space and time have an equal footing in relativity, the minus sign in front of the spatial terms marks space and time as being of essentially different character. They are not the same. Because it treats time differently than it treats the 3 spatial dimensions, Minkowski space differs from four-dimensional Euclidean space.
^ The refractive index dependence of the presumed partial aether-drag was eventually confirmed by Pieter Zeeman in 1914–1915, long after special relativity had been accepted by the mainstream. Using a scaled-up version of Michelson's apparatus connected directly to Amsterdam's main water conduit, Zeeman was able to perform extended measurements using monochromatic light ranging from violet (4358 Å) through red (6870 Å).^{[p 17]}^{[p 18]}
^ Even though it has been many decades since Terrell and Penrose published their observations, popular writings continue to conflate measurement versus appearance. For example, Michio Kaku wrote in Einstein's Cosmos (W. W. Norton & Company, 2004. p. 65): "... imagine that the speed of light is only 20 miles per hour. If a car were to go down the street, it might look compressed in the direction of motion, being squeezed like an accordion down to perhaps 1 inch in length."
^ In a letter to Carl Seelig in 1955, Einstein wrote "I had already previously found that Maxwell's theory did not account for the micro-structure of radiation and could therefore have no general validity.", Einstein letter to Carl Seelig, 1955.
^ Rapidity arises naturally as a coordinates on the pure boost generators inside the Lie algebra algebra of the Lorentz group. Likewise, rotation angles arise naturally as coordinates (modulo 2 $π$ ) on the pure rotation generators in the Lie algebra. (Together they coordinatize the whole Lie algebra.) A notable difference is that the resulting rotations are periodic in the rotation angle, while the resulting boosts are not periodic in rapidity (but rather one-to-one). The similarity between boosts and rotations is formal resemblance.
^ In relativity theory, proper acceleration is the physical acceleration (i.e., measurable acceleration as by an accelerometer) experienced by an object. It is thus acceleration relative to a free-fall, or inertial, observer who is momentarily at rest relative to the object being measured.

Primary sources

^ a b c d e f g Albert Einstein (1905) "Zur Elektrodynamik bewegter Körper", Annalen der Physik 17: 891; English translation On the Electrodynamics of Moving Bodies by George Barker Jeffery and Wilfrid Perrett (1923); Another English translation On the Electrodynamics of Moving Bodies by Megh Nad Saha (1920).
^ "Science and Common Sense", P. W. Bridgman, The Scientific Monthly, Vol. 79, No. 1 (Jul. 1954), pp. 32–39.
^ The Electromagnetic Mass and Momentum of a Spinning Electron, G. Breit, Proceedings of the National Academy of Sciences, Vol. 12, p.451, 1926
^ Kinematics of an electron with an axis. Phil. Mag. 3:1-22. L. H. Thomas.]
^ a b Einstein, Autobiographical Notes, 1949.
^ Einstein, "Fundamental Ideas and Methods of the Theory of Relativity", 1920
^ Einstein, On the Relativity Principle and the Conclusions Drawn from It, 1907; "The Principle of Relativity and Its Consequences in Modern Physics", 1910; "The Theory of Relativity", 1911; Manuscript on the Special Theory of Relativity, 1912; Theory of Relativity, 1913; Einstein, Relativity, the Special and General Theory, 1916; The Principal Ideas of the Theory of Relativity, 1916; What Is The Theory of Relativity?, 1919; The Principle of Relativity (Princeton Lectures), 1921; Physics and Reality, 1936; The Theory of Relativity, 1949.
^ Yaakov Friedman (2004). Physical Applications of Homogeneous Balls. Progress in Mathematical Physics. Vol. 40. pp. 1–21. ISBN 978-0-8176-3339-4.
^ Das, A. (1993) The Special Theory of Relativity, A Mathematical Exposition, Springer, ISBN 0-387-94042-1.
^ Schutz, J. (1997) Independent Axioms for Minkowski Spacetime, Addison Wesley Longman Limited, ISBN 0-582-31760-6.
^ Lorentz, H.A. (1902). "The rotation of the plane of polarization in moving media" (PDF). Huygens Institute - Royal Netherlands Academy of Arts and Sciences (KNAW). 4: 669–678. Bibcode:1901KNAB....4..669L. Retrieved 15 November 2018.
^ a b Lorentz, H. A. (1904). "Electromagnetic phenomena in a system moving with any velocity smaller than that of light" (PDF). Huygens Institute - Royal Netherlands Academy of Arts and Sciences (KNAW). 6: 809–831. Bibcode:1903KNAB....6..809L. Retrieved 15 November 2018.
^ Lorentz, Hendrik (1895). "Investigation of oscillations excited by oscillating ions". Attempt at a Theory of Electrical and Optical Phenomena in Moving Bodies (Versuch einer Theorie der electrischen und optischen Erscheinungen in bewegten Körpern). Leiden: E. J. Brill. (subsection § 31).
^ Lin, Shih-Chun; Giallorenzi, Thomas G. (1979). "Sensitivity analysis of the Sagnac-effect optical-fiber ring interferometer". Applied Optics. 18 (6): 915–931. Bibcode:1979ApOpt..18..915L. doi:10.1364/AO.18.000915. PMID 20208844. S2CID 5343180.
^ Shaw, R. (1962). "Length Contraction Paradox". American Journal of Physics. 30 (1): 72. Bibcode:1962AmJPh..30...72S. doi:10.1119/1.1941907. S2CID 119855914.
^ G. A. Benford; D. L. Book & W. A. Newcomb (1970). "The Tachyonic Antitelephone". Physical Review D. 2 (2): 263–265. Bibcode:1970PhRvD...2..263B. doi:10.1103/PhysRevD.2.263. S2CID 121124132.
^ Zeeman, Pieter (1914). "Fresnel's coefficient for light of different colours. (First part)". Proc. Kon. Acad. Van Weten. 17: 445–451. Bibcode:1914KNAB...17..445Z.
^ Zeeman, Pieter (1915). "Fresnel's coefficient for light of different colours. (Second part)". Proc. Kon. Acad. Van Weten. 18: 398–408. Bibcode:1915KNAB...18..398Z.
^ Terrell, James (15 November 1959). "Invisibility of the Lorentz Contraction". Physical Review. 116 (4): 1041–1045. Bibcode:1959PhRv..116.1041T. doi:10.1103/PhysRev.116.1041.
^ Penrose, Roger (24 October 2008). "The Apparent Shape of a Relativistically Moving Sphere". Mathematical Proceedings of the Cambridge Philosophical Society. 55 (1): 137–139. Bibcode:1959PCPS...55..137P. doi:10.1017/S0305004100033776. S2CID 123023118.
^ a b c Does the inertia of a body depend upon its energy content? A. Einstein, Annalen der Physik. 18:639, 1905 (English translation by W. Perrett and G.B. Jeffery)
^ On the Inertia of Energy Required by the Relativity Principle, A. Einstein, Annalen der Physik 23 (1907): 371–384
^ Baglio, Julien (26 May 2007). "Acceleration in special relativity: What is the meaning of "uniformly accelerated movement" ?" (PDF). Physics Department, ENS Cachan. Retrieved 22 January 2016.
^ a b c Champion, Frank Clive (1932). "On some close collisions of fast β-particles with electrons, photographed by the expansion method". Proceedings of the Royal Society of London. Series A, Containing Papers of a Mathematical and Physical Character. 136 (830). The Royal Society Publishing: 630–637. Bibcode:1932RSPSA.136..630C. doi:10.1098/rspa.1932.0108. S2CID 123018629.
^ a b P.A.M. Dirac (1930). "A Theory of Electrons and Protons". Proceedings of the Royal Society. A126 (801): 360–365. Bibcode:1930RSPSA.126..360D. doi:10.1098/rspa.1930.0013. JSTOR 95359.
^ C.D. Anderson (1933). "The Positive Electron". Phys. Rev. 43 (6): 491–494. Bibcode:1933PhRv...43..491A. doi:10.1103/PhysRev.43.491.

References

^ a b c Griffiths, David J. (2013). "Electrodynamics and Relativity". Introduction to Electrodynamics (4th ed.). Pearson. Chapter 12. ISBN 978-0-321-85656-2.
^ a b c Jackson, John D. (1999). "Special Theory of Relativity". Classical Electrodynamics (3rd ed.). John Wiley & Sons. Chapter 11. ISBN 0-471-30932-X.
^ Goldstein, Herbert (1980). "Chapter 7: Special Relativity in Classical Mechanics". Classical Mechanics (2nd ed.). Addison-Wesley Publishing Company. ISBN 0-201-02918-9.
^ a b Lanczos, Cornelius (1970). "Chapter IX: Relativistic Mechanics". The Variational Principles of Mechanics (4th ed.). Dover Publications. ISBN 978-0-486-65067-8.
^ Tom Roberts & Siegmar Schleif (October 2007). "What is the experimental basis of Special Relativity?". Usenet Physics FAQ. Retrieved 2008-09-17.
^ Albert Einstein (2001). Relativity: The Special and the General Theory (Reprint of 1920 translation by Robert W. Lawson ed.). Routledge. p. 48. ISBN 978-0-415-25384-0.
^ The Feynman Lectures on Physics Vol. I Ch. 15-9: Equivalence of mass and energy
^ Sean Carroll, Lecture Notes on General Relativity, ch. 1, "Special relativity and flat spacetime", http://ned.ipac.caltech.edu/level5/March01/Carroll3/Carroll1.html
^ Koks, Don (2006). Explorations in Mathematical Physics: The Concepts Behind an Elegant Language (illustrated ed.). Springer Science & Business Media. p. 234. ISBN 978-0-387-32793-8. Extract of page 234
^ Steane, Andrew M. (2012). Relativity Made Relatively Easy (illustrated ed.). OUP Oxford. p. 226. ISBN 978-0-19-966286-9. Extract of page 226
^ a b c d e Taylor, Edwin F.; Wheeler, John Archibald (1992). Spacetime Physics (2nd ed.). W. H. Freeman. ISBN 0-7167-2327-1.
^ a b c d e Rindler, Wolfgang (1977). Essential Relativity: Special, General, and Cosmological (illustrated ed.). Springer Science & Business Media. p. §1,11 p. 7. ISBN 978-3-540-07970-5.
^ "James Clerk Maxwell: a force for physics". Physics World. 2006-12-01. Retrieved 2024-03-22.
^ "November 1887: Michelson and Morley report their failure to detect the luminiferous ether". www.aps.org. Retrieved 2024-03-22.
^ Michael Polanyi (1974) Personal Knowledge: Towards a Post-Critical Philosophy, ISBN 0-226-67288-3, footnote page 10–11: Einstein reports, via Dr N Balzas in response to Polanyi's query, that "The Michelson–Morley experiment had no role in the foundation of the theory." and "... the theory of relativity was not founded to explain its outcome at all". [1]
^ a b Jeroen van Dongen (2009). "On the role of the Michelson–Morley experiment: Einstein in Chicago". Archive for History of Exact Sciences. 63 (6): 655–663. arXiv:0908.1545. Bibcode:2009arXiv0908.1545V. doi:10.1007/s00407-009-0050-5. S2CID 119220040.
^ For a survey of such derivations, see Lucas and Hodgson, Spacetime and Electromagnetism, 1990
^ Einstein, A., Lorentz, H. A., Minkowski, H., & Weyl, H. (1952). The Principle of Relativity: a collection of original memoirs on the special and general theory of relativity. Courier Dover Publications. p. 111. ISBN 978-0-486-60081-9.{{cite book}}: CS1 maint: multiple names: authors list (link)
^ Collier, Peter (2017). A Most Incomprehensible Thing: Notes Towards a Very Gentle Introduction to the Mathematics of Relativity (3rd ed.). Incomprehensible Books. ISBN 9780957389465.
^ Staley, Richard (2009), "Albert Michelson, the Velocity of Light, and the Ether Drift", Einstein's generation. The origins of the relativity revolution, Chicago: University of Chicago Press, ISBN 0-226-77057-5
^ a b c d e f g h i j k l m n o p David Morin (2007) Introduction to Classical Mechanics, Cambridge University Press, Cambridge, chapter 11, Appendix I, ISBN 1-139-46837-5.
^ Miller, D. J. (2010). "A constructive approach to the special theory of relativity". American Journal of Physics. 78 (6): 633–638. arXiv:0907.0902. Bibcode:2010AmJPh..78..633M. doi:10.1119/1.3298908. S2CID 20444859.
^ Callahan, James J. (2011). The Geometry of Spacetime: An Introduction to Special and General Relativity. New York: Springer. ISBN 9781441931429.
^ P. G. Bergmann (1976) Introduction to the Theory of Relativity, Dover edition, Chapter IV, page 36 ISBN 0-486-63282-2.
^ Mermin, N. David (1968). Space and Time in Special Relativity. McGraw-Hill. ISBN 978-0881334203.
^ Robert Resnick (1968). Introduction to special relativity. Wiley. pp. 62–63. ISBN 9780471717249.
^ a b Miller, Arthur I. (1998). Albert Einstein's Special Theory of Relativity: Emergence (1905) and Early Interpretation (1905–1911). Mew York: Springer-Verlag. ISBN 978-0-387-94870-6.
^ Bernstein, Jeremy (2006). Secrets of the Old One: Einstein, 1905. Copernicus Books (imprint of Springer Science + Business Media). ISBN 978-0387-26005-1.
^ Darrigol, Olivier (2005). "The Genesis of the Theory of Relativity" (PDF). Séminaire Poincaré. 1: 1–22. Bibcode:2006eins.book....1D. Retrieved 15 November 2018.
^ a b c Rindler, Wolfgang (1977). Essential Relativity (2nd ed.). New York: Springer-Verlag. ISBN 978-0-387-10090-6.
^ a b c d Taylor, Edwin F.; Wheeler, John Archibald (1966). Spacetime Physics (1st ed.). San Francisco: W. H. Freeman and Company.
^ Ashby, Neil (2003). "Relativity in the Global Positioning System". Living Reviews in Relativity. 6 (1): 1. Bibcode:2003LRR.....6....1A. doi:10.12942/lrr-2003-1. PMC 5253894. PMID 28163638.
^ Daniel Kleppner & David Kolenkow (1973). An Introduction to Mechanics. McGraw-Hill. pp. 468–70. ISBN 978-0-07-035048-9.
^ a b c French, A. P. (1968). Special Relativity. New York: W. W. Norton & Company. ISBN 0-393-09793-5.
^ Lewis, Gilbert Newton; Tolman, Richard Chase (1909). "The Principle of Relativity, and Non-Newtonian Mechanics". Proceedings of the American Academy of Arts and Sciences. 44 (25): 709–726. doi:10.2307/20022495. JSTOR 20022495. Retrieved 22 August 2023.
^ a b Cuvaj, Camillo (1971). "Paul Langeyin and the Theory of Relativity" (PDF). Japanese Studies in the History of Science. 10: 113–142. Retrieved 12 June 2023.
^ Cassidy, David C.; Holton, Gerald James; Rutherford, Floyd James (2002). Understanding Physics. Springer-Verlag. p. 422. ISBN 978-0-387-98756-9.
^ Cutner, Mark Leslie (2003). Astronomy, A Physical Perspective. Cambridge University Press. p. 128. ISBN 978-0-521-82196-4.
^ Ellis, George F. R.; Williams, Ruth M. (2000). Flat and Curved Space-times (2n ed.). Oxford University Press. pp. 28–29. ISBN 978-0-19-850657-7.
^ Feynman, Richard P.; Leighton, Robert B.; Sands, Matthew (2011). The feynman lectures on physics; vol I: The new millennium edition. Basic Books. p. 15-5. ISBN 978-0-465-02414-8. Retrieved 12 June 2023.
^ a b Halliday, David; Resnick, Robert (1988). Fundamental Physics: Extended Third Edition. New York: John Wiley & sons. pp. 958–959. ISBN 0-471-81995-6.
^ Adams, Steve (1997). Relativity: An introduction to space-time physics. CRC Press. p. 54. ISBN 978-0-7484-0621-0.
^ Langevin, Paul (1911). "L'Évolution de l'espace et du temps". Scientia. 10: 31–54. Retrieved 20 June 2023.
^ Debs, Talal A.; Redhead, Michael L.G. (1996). "The twin "paradox" and the conventionality of simultaneity". American Journal of Physics. 64 (4): 384–392. Bibcode:1996AmJPh..64..384D. doi:10.1119/1.18252.
^ Tolman, Richard C. (1917). The Theory of the Relativity of Motion. Berkeley: University of California Press. p. 54.
^ Takeuchi, Tatsu. "Special Relativity Lecture Notes – Section 10". Virginia Tech. Retrieved 31 October 2018.
^ Morin, David (2017). Special Relativity for the Enthusiastic Beginner. CreateSpace Independent Publishing Platform. pp. 90–92. ISBN 9781542323512.
^ Gibbs, Philip. "Is Faster-Than-Light Travel or Communication Possible?". Physics FAQ. Department of Mathematics, University of California, Riverside. Retrieved 31 October 2018.
^ Ginsburg, David (1989). Applications of Electrodynamics in Theoretical Physics and Astrophysics (illustrated ed.). CRC Press. p. 206. Bibcode:1989aetp.book.....G. ISBN 978-2-88124-719-4. Extract of page 206
^ Wesley C. Salmon (2006). Four Decades of Scientific Explanation. University of Pittsburgh. p. 107. ISBN 978-0-8229-5926-7., Section 3.7 page 107
^ Lauginie, P. (2004). "Measuring Speed of Light: Why? Speed of what?" (PDF). Proceedings of the Fifth International Conference for History of Science in Science Education. Archived from the original (PDF) on 4 July 2015. Retrieved 3 July 2015.
^ Stachel, J. (2005). "Fresnel's (dragging) coefficient as a challenge to 19th century optics of moving bodies". In Kox, A.J.; Eisenstaedt, J (eds.). The universe of general relativity. Boston: Birkhäuser. pp. 1–13. ISBN 978-0-8176-4380-5. Retrieved 17 April 2012.
^ Richard A. Mould (2001). Basic Relativity (2nd ed.). Springer. p. 8. ISBN 978-0-387-95210-9.
^ Seidelmann, P. Kenneth, ed. (1992). Explanatory Supplement to the Astronomical Almanac. ill Valley, Calif.: University Science Books. p. 393. ISBN 978-0-935702-68-2.
^ Ferraro, Rafael; Sforza, Daniel M. (2005). "European Physical Society logo Arago (1810): the first experimental result against the ether". European Journal of Physics. 26 (1): 195–204. arXiv:physics/0412055. Bibcode:2005EJPh...26..195F. doi:10.1088/0143-0807/26/1/020. S2CID 119528074.
^ Dolan, Graham. "Airy's Water Telescope (1870)". The Royal Observatory Greenwich. Retrieved 20 November 2018.
^ Hollis, H. P. (1937). "Airy's water telescope". The Observatory. 60: 103–107. Bibcode:1937Obs....60..103H. Retrieved 20 November 2018.
^ Janssen, Michel; Stachel, John (2004). "The Optics and Electrodynamics of Moving Bodies" (PDF). In Stachel, John (ed.). Going Critical. Springer. ISBN 978-1-4020-1308-9.
^ Sher, D. (1968). "The Relativistic Doppler Effect". Journal of the Royal Astronomical Society of Canada. 62: 105–111. Bibcode:1968JRASC..62..105S. Retrieved 11 October 2018.
^ Gill, T. P. (1965). The Doppler Effect. London: Logos Press Limited. pp. 6–9. OL 5947329M.
^ Feynman, Richard P.; Leighton, Robert B.; Sands, Matthew (February 1977). "Relativistic Effects in Radiation". The Feynman Lectures on Physics: Volume 1. Reading, Massachusetts: Addison-Wesley. pp. 34–7 f. ISBN 9780201021165. LCCN 2010938208.
^ Cook, Helen. "Relativistic Distortion". Mathematics Department, University of British Columbia. Retrieved 12 April 2017.
^ Signell, Peter. "Appearances at Relativistic Speeds" (PDF). Project PHYSNET. Michigan State University, East Lansing, MI. Archived from the original (PDF) on 13 April 2017. Retrieved 12 April 2017.
^ Kraus, Ute. "The Ball is Round". Space Time Travel: Relativity visualized. Institut für Physik Universität Hildesheim. Archived from the original on 12 May 2017. Retrieved 16 April 2017.
^ Boas, Mary L. (1961). "Apparent Shape of Large Objects at Relativistic Speeds". American Journal of Physics. 29 (5): 283. Bibcode:1961AmJPh..29..283B. doi:10.1119/1.1937751.
^ Müller, Thomas; Boblest, Sebastian (2014). "Visual appearance of wireframe objects in special relativity". European Journal of Physics. 35 (6): 065025. arXiv:1410.4583. Bibcode:2014EJPh...35f5025M. doi:10.1088/0143-0807/35/6/065025. S2CID 118498333.
^ Zensus, J. Anton; Pearson, Timothy J. (1987). Superluminal Radio Sources (1st ed.). Cambridge, New York: Cambridge University Press. p. 3. ISBN 9780521345606.
^ Chase, Scott I. "Apparent Superluminal Velocity of Galaxies". The Original Usenet Physics FAQ. Department of Mathematics, University of California, Riverside. Retrieved 12 April 2017.
^ Richmond, Michael. ""Superluminal" motions in astronomical sources". Physics 200 Lecture Notes. School of Physics and Astronomy, Rochester Institute of Technology. Archived from the original on 16 February 2017. Retrieved 20 April 2017.
^ Keel, Bill. "Jets, Superluminal Motion, and Gamma-Ray Bursts". Galaxies and the Universe - WWW Course Notes. Department of Physics and Astronomy, University of Alabama. Archived from the original on 1 March 2017. Retrieved 29 April 2017.
^ Max Jammer (1997). Concepts of Mass in Classical and Modern Physics. Courier Dover Publications. pp. 177–178. ISBN 978-0-486-29998-3.
^ John J. Stachel (2002). Einstein from B to Z. Springer. p. 221. ISBN 978-0-8176-4143-6.
^ Fernflores, Francisco (2018). Einstein's Mass-Energy Equation, Volume I: Early History and Philosophical Foundations. New York: Momentum Pres. ISBN 978-1-60650-857-2.
^ a b Philip Gibbs & Don Koks. "The Relativistic Rocket". Retrieved 30 August 2012.
^ The special theory of relativity shows that time and space are affected by motion Archived 2012-10-21 at the Wayback Machine. Library.thinkquest.org. Retrieved on 2013-04-24.
^ a b Idema, Timon (17 April 2019). "Mechanics and Relativity. Chapter 14: Relativistic Collisions". LibreTexts Physics. California State University Affordable Learning Solutions Program. Retrieved 2 January 2023.
^ Nakel, Werner (1994). "The elementary process of bremsstrahlung". Physics Reports. 243 (6): 317–353. Bibcode:1994PhR...243..317N. doi:10.1016/0370-1573(94)00068-9.
^ Halbert, M.L. (1972). "Review of Experiments on Nucleon-Nucleon Bremsstrahlung". In Austin, S.M.; Crawley, G.M. (eds.). The Two-Body Force in Nuclei. Boston, MA.: Springer.
^ Thomas, George B.; Weir, Maurice D.; Hass, Joel; Giordano, Frank R. (2008). Thomas' Calculus: Early Transcendentals (Eleventh ed.). Boston: Pearson Education, Inc. p. 533. ISBN 978-0-321-49575-4.
^ a b E. J. Post (1962). Formal Structure of Electromagnetics: General Covariance and Electromagnetics. Dover Publications Inc. ISBN 978-0-486-65427-0.
^ a b c Schutz, Bernard F. (1985). A first course in general relativity. Cambridge, UK: Cambridge University Press. p. 26. ISBN 0521277035.
^ a b Gibbs, Philip. "Can Special Relativity Handle Acceleration?". The Physics and Relativity FAQ. math.ucr.edu. Archived from the original on 7 June 2017. Retrieved 28 May 2017.
^ Franklin, Jerrold (2010). "Lorentz contraction, Bell's spaceships, and rigid body motion in special relativity". European Journal of Physics. 31 (2): 291–298. arXiv:0906.1919. Bibcode:2010EJPh...31..291F. doi:10.1088/0143-0807/31/2/006. S2CID 18059490.
^ a b c Bais, Sander (2007). Very Special Relativity: An Illustrated Guide. Cambridge, Massachusetts: Harvard University Press. ISBN 978-0-674-02611-7.
^ R. Resnick; R. Eisberg (1985). Quantum Physics of Atoms, Molecules, Solids, Nuclei and Particles (2nd ed.). John Wiley & Sons. pp. 114–116. ISBN 978-0-471-87373-0.
^ Øyvind Grøn & Sigbjørn Hervik (2007). Einstein's general theory of relativity: with modern applications in cosmology. Springer. p. 195. ISBN 978-0-387-69199-2. Extract of page 195 (with units where c = 1)
^ The number of works is vast, see as example:
Sidney Coleman; Sheldon L. Glashow (1997). "Cosmic Ray and Neutrino Tests of Special Relativity". Physics Letters B. 405 (3–4): 249–252. arXiv:hep-ph/9703240. Bibcode:1997PhLB..405..249C. doi:10.1016/S0370-2693(97)00638-2. S2CID 17286330.
An overview can be found on this page
^ Roberts, Tom; Schleif, Siegmar. "Experiments that Apparently are NOT Consistent with SR/GR". What is the experimental basis of Special Relativity?. University of California at Riverside. Retrieved 10 July 2024.
^ John D. Norton, John D. (2004). "Einstein's Investigations of Galilean Covariant Electrodynamics prior to 1905". Archive for History of Exact Sciences. 59 (1): 45–105. Bibcode:2004AHES...59...45N. doi:10.1007/s00407-004-0085-6. S2CID 17459755.
^ J.A. Wheeler; C. Misner; K.S. Thorne (1973). Gravitation. W.H. Freeman & Co. p. 58. ISBN 978-0-7167-0344-0.
^ J.R. Forshaw; A.G. Smith (2009). Dynamics and Relativity. Wiley. p. 247. ISBN 978-0-470-01460-8.
^ R. Penrose (2007). The Road to Reality. Vintage books. ISBN 978-0-679-77631-4.
^ Jean-Bernard Zuber & Claude Itzykson, Quantum Field Theory, pg 5, ISBN 0-07-032071-3
^ Charles W. Misner, Kip S. Thorne & John A. Wheeler, Gravitation, pg 51, ISBN 0-7167-0344-0
^ George Sterman, An Introduction to Quantum Field Theory, pg 4, ISBN 0-521-31132-2
^ Sean M. Carroll (2004). Spacetime and Geometry: An Introduction to General Relativity. Addison Wesley. p. 22. ISBN 978-0-8053-8732-2.

External links

Wikisource has original text related to this article:

Relativity: The Special and General Theory

Wikisource has original works on the topic: Relativity

Wikibooks has a book on the topic of: Special Relativity

Wikiversity has learning resources about Special Relativity

Look up special relativity in Wiktionary, the free dictionary.

Original works

Zur Elektrodynamik bewegter Körper Einstein's original work in German, Annalen der Physik, Bern 1905
On the Electrodynamics of Moving Bodies English Translation as published in the 1923 book The Principle of Relativity.

Special relativity for a general audience (no mathematical knowledge required)

Einstein Light An award-winning, non-technical introduction (film clips and demonstrations) supported by dozens of pages of further explanations and animations, at levels with or without mathematics.
Einstein Online Archived 2010-02-01 at the Wayback Machine Introduction to relativity theory, from the Max Planck Institute for Gravitational Physics.
Audio: Cain/Gay (2006) – Astronomy Cast. Einstein's Theory of Special Relativity

Special relativity explained (using simple or more advanced mathematics)

Bondi K-Calculus – A simple introduction to the special theory of relativity.
Greg Egan's Foundations Archived 2013-04-25 at the Wayback Machine.
The Hogg Notes on Special Relativity A good introduction to special relativity at the undergraduate level, using calculus.
Relativity Calculator: Special Relativity Archived 2013-03-21 at the Wayback Machine – An algebraic and integral calculus derivation for E = mc².
MathPages – Reflections on Relativity A complete online book on relativity with an extensive bibliography.
Special Relativity An introduction to special relativity at the undergraduate level.
Relativity: the Special and General Theory at Project Gutenberg, by Albert Einstein
Special Relativity Lecture Notes is a standard introduction to special relativity containing illustrative explanations based on drawings and spacetime diagrams from Virginia Polytechnic Institute and State University.
Understanding Special Relativity The theory of special relativity in an easily understandable way.
An Introduction to the Special Theory of Relativity (1964) by Robert Katz, "an introduction ... that is accessible to any student who has had an introduction to general physics and some slight acquaintance with the calculus" (130 pp; pdf format).
Lecture Notes on Special Relativity by J D Cresser Department of Physics Macquarie University.
SpecialRelativity.net – An overview with visualizations and minimal mathematics.
Relativity 4-ever? The problem of superluminal motion is discussed in an entertaining manner.

Visualization

Raytracing Special Relativity Software visualizing several scenarios under the influence of special relativity.
Real Time Relativity Archived 2013-05-08 at the Wayback Machine The Australian National University. Relativistic visual effects experienced through an interactive program.
Spacetime travel A variety of visualizations of relativistic effects, from relativistic motion to black holes.
Through Einstein's Eyes Archived 2013-05-14 at the Wayback Machine The Australian National University. Relativistic visual effects explained with movies and images.
Warp Special Relativity Simulator A computer program to show the effects of traveling close to the speed of light.
Animation clip on YouTube visualizing the Lorentz transformation.
Original interactive FLASH Animations from John de Pillis illustrating Lorentz and Galilean frames, Train and Tunnel Paradox, the Twin Paradox, Wave Propagation, Clock Synchronization, etc.
lightspeed An OpenGL-based program developed to illustrate the effects of special relativity on the appearance of moving objects.
Animation showing the stars near Earth, as seen from a spacecraft accelerating rapidly to light speed.