General Relativity, Part 3#

After having explored geodesics, the metric tensor, and the curvature tensors, we are ready to tackle the formidable task of finally deriving Einstein’s equations!

Deriving the Einstein Field Equations#

As with before, we can use the Euler-Lagrange equations and the principle of least action to obtain the Einstein Field Equations.

The action for General Relativity in empty spacetime can be generalized as:

S=κRg d4x

Here, g=det(gμν), d4x=dtdxdydz and κ is simply a proportionality constant. Note that while it describes a vacuum, that spacetime can still be curved. For example, you could say that the spacetime outside of a black hole is a vacuum (because there is no matter), but the spacetime would still be curved (because the black hole warps its surrounding spacetime, even if we only include the spacetime around a black hole and not the black hole itself).

The action can be derived from one of two ways. It can be shown to be correct through dimensional analysis - the units on the left and right side of the equation match up. However, there is also a more intuitive way to illustrate this.

The action must be composed of scalar-valued functions (or scalars), as it is an integral over all spacetime, and multidimensional integrals can only take scalar-valued functions or scalars to integrate over (see for yourself that this must be true). But it must also include information about the curvature of spacetime and spacetime itself. As we know, all the information about the curvature of spacetime is captured in the Riemann tensor. But the Riemann tensor is not a scalar-valued function - it is instead a (rank-4) tensor-valued function. So we have to find a way to get a scalar from the Riemann tensor. We already know of a scalar that can be formed from the Riemann tensor - the Ricci scalar. We want to add an additional proportionality constant in front, which is also a scalar, because we’d expect to see constants in our final field equations as well. We can always set the constant κ=1 if we find it’s not necessary later. Since both the curvature of spacetime and the matter and energy present within spacetime should act on the metric, we add them together. Finally, since spacetime is often curved, we need a factor of g to make sure the volume element d4x is the same size no matter what coordinates or what spacetime we use. So from there, we obtain the action.

From our action, we know that the Lagrangian is:

L=κRg

We will use the Euler-Lagrange field equations, a slight variation of the original Euler-Lagrange equations we derived:

Lφxβ(L(βφ))=0

Here, φ is the field, and in our case, the field is the metric tensor field gμν(xβ), thus φ=gμν, so if we substitute, we have:

Lgμνxβ(L(βgμν))=0

Note that we use the curly L for the Lagrangian because it is not technically the Lagrangian per se, but the field equivalent of the Lagrangian, known as the Lagrangian density. But we’ll just call it the Lagrangian here. The distinction between the Lagrangian density and the Lagrangian isn’t important here; the practical difference here is that the Lagrangian uses the typical Euler-Lagrange equation, while the Lagrangian density uses the Euler-Lagrange field equation.

We notice in the Euler-Lagrange field equations that the second term contains the partial derivative with respect to the derivatives of the metric. But note that in our Lagrangian, there are no terms that take the derivative of the metric as input. So the second term vanishes, and we are left with a comparatively easier equation:

Lgμν=0

Before we take this derivative, let us first rewrite our Lagrangian as:

L=κgμνRμνg

Now, we can finally take the derivative with respect to the metric:

Lgμν=κgμν(gμνRμνg)=0

We immediately run into a hurdle! The Lagrangian has three multiplied functions, the inverse metric, the Ricci tensor, and the square root of the determinant of the metric. How do we differentiate a triple product? We can use the triple product rule:

(fgh)=fgh+fgh+fgh

Another problem! How do we differentiate the inverse metric with respect to the metric? The answer comes from a matrix calculus identity, which, translated to tensor notation, is this:

gμνgμν=gμνgμν

Final problem! How do we differentiate the determinant of the metric with respect to the metric? This answer also comes from a matrix calculus identity, which is this:

det(gμν)gμν=ggμν=ggμν

With all this in mind, we can finally compute the derivatives. The first term of the derivative is just the derivative of the inverse metric, multiplied by the other two terms in the triple product. The derivative of the Ricci tensor with respect to the metric is zero (it doesn’t depend on the metric), so the second term of the derivative of the triple product is zero. In the third term, we need to use the chain rule to differentiate the square root. The final result is this:

κRμνgμνgμνκ12gggμνgμνRμν=0

We can clean this up a bit. First, we can multiply both sides by 1, to get:

κRμνgμνgμν+κ12gggμνgμνRμν=0

Then, we can multiply both sides of the equation by 1g, which results in:

κRμνgμνgμνκ12gμνgμνRμν=0

We remember that R=gμνRμν, so we can substitute it in:

κRμνgμνgμνκ12gμνR=0

We want to get rid of the double gμν terms, so we can multiply both sides of the equation by gμνgμν, to get:

κRμνgμνgμνgμνgμνκ12gμνgμνgμνR=0

The inverse metric contracts with the metric:

gμνgμνgμνgμν=gμνgμν=δμμ=i=031=4

So this entire expression becomes:

κ4Rμνκ2Rgμν=0

But we can divide by 4 right after as the right-hand side is zero, to yield:

κRμνκ12Rgμν=0

We can factor out the constant:

κ(Rμν12Rgμν)=0

The term inside the parentheses is called the Einstein tensor and describes the curvature and characteristics of spacetime:

Gμν=Rμν12Rgμν

In vacuum, the equation we just derived is the Einstein Field Equation:

Gμν=0

Now, there is matter and energy within space, then we use a modified action, where M is the contribution to the action of the gravitating matter and energy:

S=(κRM)g d4x

So the Lagrangian is:

L=(κRM)g

Using the Euler-Lagrange field equations, this becomes:

κRμνgμνgμνgκ12gggμνgμνRμνMgμνg+12gggμνM=0

First, we multiply by 1:

κRμνgμνgμνg+κ12gggμνgμνRμν+Mgμνg12gggμνM=0

Then we multiply by 1g:

κRμνgμνgμνκ12gμνgμνRμν+Mgμνg+12gμνM=0

We use the definition R=gμνRμν:

κRμνgμνgμνκ12gμνR+Mgμνg+12gμνM=0

And by contraction with gμνgμν we have:

κRμνκ12gμνR+Mgμνgμνgμνg+12gμνM=0

We can move the second and third terms, which depend on M to the right of the equation:

κRμνκ12gμνR=Mgμνgμνgμνg12gμνM

And factor the left-hand side of the equation:

κ(Rμν12gμνR)=Mgμνgμνgμνg12gμνM

We recognize our familiar friend, the Einstein tensor, on the left. If we define a tensor Tμν to equal the right-hand side:

Tμν=Mgμνgμνgμνg12gμνM

Then we have the complete field equations:

Gμν=1κTμν

The tensor Tμν on the right is called the stress-energy tensor. There is no one “general formula” for the stress-energy tensor; we can define different expressions for the stress-energy tensor depending on what matter, energy, momentum, and stresses are present within the region of spacetime being analyzed, with the only real rule being that the resulting expression follow tensor algebra conventions (e.g. same number of free indices on both sides of the equation). One of the simplest expressions for a stress-energy tensor is:

Tμν=ρUμUν

Here, Uμ and Uν are four-velocities, as shown before in special relativity, and ρ is the density of the gravitating matter-energy.

But back to the equation:

Gμν=1κTμν

What is the constant κ? We will need to use the Newtonian limit of relativity to answer that question. When gravity is weak, and objects are moving much slower than the speed of light, we expect that we can recover Poisson’s equation from the field equation. We will cover that in the following derivation.

Given that four-velocity is defined as Uμ=(γc,γv), and we defined objects to be moving much slower than the speed of light, the 0th component of four-velocity, so slow that their speeds are effectively zero compared to the speed of light, we can effectively say that γ1 and Uμ(c,0,0,0). Therefore, the component T00 of the stress-energy tensor is just ρc2, and all other components of the stress-energy tensor are zero.

Given a static metric, that is, one that doesn’t change much in time, we can also say that 0gμν=0. And because we expect spacetime to be very close to Minkowski spacetime, we will set:

gμν=ημν+hμν

Where hμν is a tiny bit of metric that accounts for Newtonian gravity. Also, given our low velocity assumtions, we already mentioned that:

Uμ=(dtdτ,dxdτ,dydτ,dzdτ)(c,0,0,0)

We consider the geodesic equations:

d2xμdτ2+Γγσμdxγdτdxσdτ=0

We can simplify given that given that only the dtdτ component matters, because all the other velocities are zero:

d2xμdτ2+Γ00μdtdτdtdτ=0

We can rewrite this as:

d2xμdτ2=Γ00μdtdτdtdτ

And recalling dtdτc, we have:

d2xμdτ2=Γ00μc2

Now, we can compare this to Newton’s equation for gravity:

d2rdt2=ϕ

Therefore:

Γ00μc2=ϕ
Γ00μ=1c2ϕ

Now, given that:

Γkli=12gim(lgmk+kgmlmgkl)

If we substitute iμ,kt,lt, we have:

Γ00μ=12gμm(tgmt+tgmtmg00)

But recall that tgμν=0 given our static metric assumption, so:

Γ00μ=12gμmmg00

Since gμν=ημνhμν, we have:

Γ00μ=12(ημmhμm)(mη00mh00)

We can simplify this by noting that because η00=1, its derivative is zero, so:

Γ00μ=12(ημmhμm)mh00

If we expand this out, we would get:

Γ00μ=12ημmmh00+hμmmh00

But the second term is very tiny, so we can effectively say it is zero, and given ημm=1, we get:

Γ00μ=12mh00

Note

How did we know that ημm=1? This is because ημm is the first column of the Minkowski metric (as m is a dummy index, so you can think of it as η0mη3m), and the only non-zero term in its first column is η00.

The partial derivative with respect to an arbitrary coordinate is just the gradient:

Γ00μ=12h00

We already know what Γ00μ is though, so:

1c2ϕ=12h00

So we find that h00=2ϕc2, and therefore g00=12ϕc2.

Now, we have everything we need to compute the Einstein tensor. Using the definition of the Ricci tensor, we have:

Rij=kΓijkjΓikk+ΓijkΓkmmΓimkΓjkm

Given our knowledge of Γ00μ, and with substitutions kμ,it,jt, we have:

R00=μΓ00μtΓtμμ+Γ00μΓμmmΓtmμΓtμm

But recall that the time derivative of the metric must be zero, so the second term cancels out. And recall that all the Christoffel symbols that are not Γ00μ are zero. If we expand out the dummy summation indices, we find that they all go to zero. Therefore, we just have:

R00=μΓ00μ

Which becomes:

R00=1c22ϕ

Recall the definition of the Einstein tensor:

Gμν=Rμν12gμνgμνRμν

If we substitute, we have:

G00=R00[(12ϕc2)R00]=1c22ϕ+1c22ϕ+2ϕc42ϕ

Now, since c4 is a massive number, 2ϕc40, so:

G00=2c22ϕ

Using the Einstein field equations:

Gμν=1κTμν

We can substitute in our values for the Einstein and stress-energy tensors:

2c22ϕ=1κρc2
2ϕ=12κρc4

Compare this with Poisson’s equation:

2ϕ=4πGρ

This means that:

12κρc4=4πGρ
κ=c48πG

Remember the field equations:

Gμν=1κTμν

Now knowing the value of κ, we need only substitute to get:

Gμν=8πGc4Tμν

This elegant equation is the apotheosis of general relativity, and it rightfully deserves its place as one of the most famous equations in all of physics.

Note that sometimes, there is an alternate form of the Einstein Field Equations that is easier to solve. To do this, we expand out the full equations:

Rμν12Rgμν=8πGc4Tμν

We now multiply both sides by gμν:

gμνRμν12Rgμνgμν=8πGc4Tμνgμν

Using the fact that gμν=gμν=4 and Tμνgμν=T, this becomes:

R12(4R)=8πGc4TR=8πGc4T

So, substituting back into the original EFEs:

Rμν+128πGc4Tgμν=8πGc4Tμν
Rμν=8πGc4(Tμν12gμνT)

This makes the field equations simpler for vacuum solutions, where Tμν=T=0. Thus, the equations just become:

Rμν=0

which is still incredibly hard to solve, but more manageable than the typical case.

Finally, there is one more important fact about the field equations: taking the covariant derivatives of both sides is equal to zero. This means that:

μTμν=0

This expression may look familiar if we recall that the covariant derivative with a repeated index is just the divergence of a field. What this is saying is that the total change in matter-energy flux in all of spacetime is zero - essentially, the conservation of energy.

A recap with intuition#

After doing so much math, it is helpful to reconnect with what the math is actually saying. That is, we want to regain our physical intuition for what the math describes.

Gravity is a fictitious force, caused by the curvature of spacetime. When spacetime isn’t curved, particles undergo no acceleration, and thus feel no gravitational force. But when spacetime is curved, which happens whenever masses are present in spacetime, particles undergo a definite acceleration. Due to the equivalence principle, the effect of gravity is indistinguishable from the effect of an acceleration, so therefore particles that are being accelerated feel like a force is acting on them.

The gravitational field is an object that extends through all space that gives each point a vector proportional to the gravitational force. Masses create and vary the gravitational field, and in turn the field exerts a force on masses within the field.

The gravitational potential is a function whose slope is equal to the gravitational field. It can be thought of as a landscape that masses are placed in. Where that landscape is very steep, the gravitational force is very strong; where that landscape is very flat, the gravitational force is very weak.

The metric tensor is a mathematical description of a spacetime. The classical analogue of it is the gravitational potential. Just as the gravitational potential influences the force of gravity, the metric tensor influences the curvature of spacetime, which particles experience as gravity.

We can measure distances in typical Euclidean space using a fixed grid, where the increments between the grid line are measured by constant basis vectors. In curved spacetime, basis vectors are no longer constant. The Christoffel symbols are a precise measure of how basis vectors changes in spacetime, or essentially, how the Euclidean constant grid gets distorted in spacetime. It is roughly analogous to the gravitational field. Just as a gravitational field disappears in empty space far away from any masses, the Christoffel symbols vanish too.

The Riemann tensor describes how the curvature of spacetime changes a vector as you move it in different directions in spacetime. The Ricci tensor describes how a volume in spacetime located at a given point in spacetime becomes contracted due to the curvature of spacetime. The Einstein tensor is the average value of this contraction of volume across all of the region of spacetime being studied.

The stress-energy tensor describes the matter-energy fluxes within a region of spacetime. The classical analogue would be matter density. Finally, just as matter is the source for gravity in Newtonian mechanics, matter is the source of spacetime curvature in General Relativity, and spacetime curvature is what we feel as gravity. John Archibald Wheeler famously summarized all of these ideas with the succint observation:

“Spacetime tells matter how to move; matter tells spacetime how to curve”