2

I know this might be a general or well known topic, but please bear with me, since I have a very specific question regarding this and I want to see specifically whether there is a flaw in my reasoning or logic.

We are using the $(+---)$ metric here.

So basically, we know that

$$\Lambda ^{\mu} \ _{\nu} (\Lambda^{-1})^{\nu}\ _{\rho} = \delta^{\mu} \ _{\rho}$$ So that $\Lambda \Lambda^{-1} = 1$.

And we have $$(\Lambda^{-1})^{\nu} \ _{\rho} = (\Lambda)_{\rho} \ ^{\nu}$$ which seems to imply that $\Lambda^{-1} = \Lambda^T$

like an orthogonal transformation.

However, if we plug that in the original equation, we get:

$$\Lambda ^{\mu} \ _{\nu} (\Lambda)_{\rho} \ ^{\nu} = \delta^{\mu} \ _{\rho}$$

which is not matrix multiplication, since the column and row number don't match. This seems to contradict the statement that $$\Lambda \Lambda^{-1} = 1$$

How do I resolve this paradox? Is $\Lambda$ an orthogonal transformation (in Minkowski space) or not? The indices are confusing me here.

Qmechanic
  • 220,844
Stallmp
  • 879

2 Answers2

6

The standard transpose operation ${{}\cdot{}}^T$ (which flips a matrix along its diagonal) is equivalent to the Euclidean metric. If you have defined ${{}\cdot{}}^T,$ then you can define the (Euclidean) dot product as $x\cdot y=x^Ty.$ If you instead have just the dot product but not the transpose, you can define the transpose $x^T$ of any vector $x$ as the unique row vector $r$ that has $ry=x\cdot y$ for all $y.$ Because the standard transpose is tied to the Euclidean metric, it has no physical meaning in relativity. In particular, you are wrong to identify ${\Lambda_\rho}^\nu$ as being the transpose $\Lambda^T$ of a Lorentz transform ${\Lambda^\rho}_\nu.$

In relativity, indices are "raised" or "lowered" by applying the metric. To invent a word, the "relativistic transpose" of a vector $x^\mu$ is the covector $x_\mu=\eta_{\mu\nu}x^\nu.$ ($x_\mu$ is the unique covector $r_\mu$ having $r_\mu y^\mu=\eta_{\nu\mu}x^\nu y^\mu$ for all $y^\mu,$ so the situation is just like with the standard transpose and Euclidean metric.) To turn a covector $k_\mu$ into a vector, you instead apply the inverse of the metric like so: $k^\mu=\eta^{\mu\nu}k_\nu.$ The process extends to any number of indices: contract the metric against any contravariant index you would like to lower and contract the inverse metric against any covariant index you would like to raise. For the case in your question, ${\Lambda_\rho}^\nu=\eta_{\rho\alpha}\eta^{\nu\beta}{\Lambda^\alpha}_\beta.$

All that said, to answer your question, a Lorentz transform $\Lambda$ is orthogonal under the Minkowski inner product. That is, $\eta_{\mu\nu}{\Lambda^\mu}_\alpha{\Lambda^\nu}_\beta=\delta_{\alpha\beta}=\eta_{\beta\rho}{\delta_\alpha}^\rho=\eta_{\alpha\beta}.$ (Replace $\eta$ with the Euclidean metric and you get the more common definition of orthogonal.) $\Lambda$ will generally not be orthogonal in the usual sense because the usual sense of orthogonal is defined with the wrong metric! (The set of all Lorentz transforms is even called $O(1,3),$ generalizing the notation $O(n)$ used to talk about the set of $n$-by-$n$ orthogonal matrices.)

HTNW
  • 5,055
4

As the invariant scalar product is given by $x^T \!\eta y$ with $\eta= {\rm diag}(1,-1,-1,-1)$, a Lorentz transformation matrix $\Lambda$ is characterized by the relation $$\Lambda^T \eta \Lambda=\eta. \tag{1} \label{1}$$ Multiplying \eqref{1} by $\eta^{-1}$ from the left, we find $$\eta^{-1} \Lambda^T \eta \,\Lambda = \eta^{-1} \eta = \mathbb{I}, \tag{2} \label{2}$$ implying $$\Lambda^{-1}= \eta^{-1} \Lambda^T \eta. \tag{3} \label{3}$$ Translating these matrix equations into index notation, we employ the usual conventions $$\Lambda = (\Lambda^\mu_{\;\,\nu}),\quad \eta=(\eta_{\mu \nu}),\quad \eta^{-1}=(\eta^{\mu \nu}), \tag{4}$$ fixing at the same time also the index positions for all other matrices. Eq. \eqref{1} becomes $$(\Lambda^T)_\mu^{\; \, \rho} \,\eta_{\rho \sigma} \Lambda^\sigma_{\; \, \nu}=\eta_{\mu \nu} \quad \Leftrightarrow \quad \Lambda^\rho_{\; \mu}\, \eta_{\rho \sigma} \Lambda^\sigma_{\;\nu} =\eta_{\mu \nu}, \tag{5} \label{5}$$ and eq. \eqref{2} is translated into $$\eta^{\alpha \mu} \Lambda^\rho_{\; \mu}\,\eta_{\rho \sigma} \Lambda^\sigma_{\; \nu} =\eta^{\alpha \mu} \eta_{\mu \nu}=\delta^\alpha_{\; \,\nu}. $$ Finally, the equivalent of eq. \eqref{3} reads $$(\Lambda^{-1})^\alpha_{\; \sigma}=\eta^{\alpha \mu}\Lambda^\rho_{\;\mu }\eta_{\rho \sigma}=:\Lambda_\sigma^{\; \alpha}. \tag{7} $$

Hyperon
  • 10,007