General Relativity, Part 3#
After having explored geodesics, the metric tensor, and the curvature tensors, we are ready to tackle the formidable task of finally deriving Einstein’s equations!
Deriving the Einstein Field Equations#
As with before, we can use the Euler-Lagrange equations and the principle of least action to obtain the Einstein Field Equations.
The action for General Relativity in empty spacetime can be generalized as:
Here,
The action can be derived from one of two ways. It can be shown to be correct through dimensional analysis - the units on the left and right side of the equation match up. However, there is also a more intuitive way to illustrate this.
The action must be composed of scalar-valued functions (or scalars), as it is an integral over all spacetime, and multidimensional integrals can only take scalar-valued functions or scalars to integrate over (see for yourself that this must be true). But it must also include information about the curvature of spacetime and spacetime itself. As we know, all the information about the curvature of spacetime is captured in the Riemann tensor. But the Riemann tensor is not a scalar-valued function - it is instead a (rank-4) tensor-valued function. So we have to find a way to get a scalar from the Riemann tensor. We already know of a scalar that can be formed from the Riemann tensor - the Ricci scalar. We want to add an additional proportionality constant in front, which is also a scalar, because we’d expect to see constants in our final field equations as well. We can always set the constant
From our action, we know that the Lagrangian is:
We will use the Euler-Lagrange field equations, a slight variation of the original Euler-Lagrange equations we derived:
Here,
Note that we use the curly L for the Lagrangian because it is not technically the Lagrangian per se, but the field equivalent of the Lagrangian, known as the Lagrangian density. But we’ll just call it the Lagrangian here. The distinction between the Lagrangian density and the Lagrangian isn’t important here; the practical difference here is that the Lagrangian uses the typical Euler-Lagrange equation, while the Lagrangian density uses the Euler-Lagrange field equation.
We notice in the Euler-Lagrange field equations that the second term contains the partial derivative with respect to the derivatives of the metric. But note that in our Lagrangian, there are no terms that take the derivative of the metric as input. So the second term vanishes, and we are left with a comparatively easier equation:
Before we take this derivative, let us first rewrite our Lagrangian as:
Now, we can finally take the derivative with respect to the metric:
We immediately run into a hurdle! The Lagrangian has three multiplied functions, the inverse metric, the Ricci tensor, and the square root of the determinant of the metric. How do we differentiate a triple product? We can use the triple product rule:
Another problem! How do we differentiate the inverse metric with respect to the metric? The answer comes from a matrix calculus identity, which, translated to tensor notation, is this:
Final problem! How do we differentiate the determinant of the metric with respect to the metric? This answer also comes from a matrix calculus identity, which is this:
With all this in mind, we can finally compute the derivatives. The first term of the derivative is just the derivative of the inverse metric, multiplied by the other two terms in the triple product. The derivative of the Ricci tensor with respect to the metric is zero (it doesn’t depend on the metric), so the second term of the derivative of the triple product is zero. In the third term, we need to use the chain rule to differentiate the square root. The final result is this:
We can clean this up a bit. First, we can multiply both sides by
Then, we can multiply both sides of the equation by
We remember that
We want to get rid of the double
The inverse metric contracts with the metric:
So this entire expression becomes:
But we can divide by 4 right after as the right-hand side is zero, to yield:
We can factor out the constant:
The term inside the parentheses is called the Einstein tensor and describes the curvature and characteristics of spacetime:
In vacuum, the equation we just derived is the Einstein Field Equation:
Now, there is matter and energy within space, then we use a modified action, where
So the Lagrangian is:
Using the Euler-Lagrange field equations, this becomes:
First, we multiply by
Then we multiply by
We use the definition
And by contraction with
We can move the second and third terms, which depend on
And factor the left-hand side of the equation:
We recognize our familiar friend, the Einstein tensor, on the left. If we define a tensor
Then we have the complete field equations:
The tensor
Here,
But back to the equation:
What is the constant
Given that four-velocity is defined as
Given a static metric, that is, one that doesn’t change much in time, we can also say that
Where
We consider the geodesic equations:
We can simplify given that given that only the
We can rewrite this as:
And recalling
Now, we can compare this to Newton’s equation for gravity:
Therefore:
Now, given that:
If we substitute
But recall that
Since
We can simplify this by noting that because
If we expand this out, we would get:
But the second term is very tiny, so we can effectively say it is zero, and given
Note
How did we know that
The partial derivative with respect to an arbitrary coordinate is just the gradient:
We already know what
So we find that
Now, we have everything we need to compute the Einstein tensor. Using the definition of the Ricci tensor, we have:
Given our knowledge of
But recall that the time derivative of the metric must be zero, so the second term cancels out. And recall that all the Christoffel symbols that are not
Which becomes:
Recall the definition of the Einstein tensor:
If we substitute, we have:
Now, since
Using the Einstein field equations:
We can substitute in our values for the Einstein and stress-energy tensors:
Compare this with Poisson’s equation:
This means that:
Remember the field equations:
Now knowing the value of
This elegant equation is the apotheosis of general relativity, and it rightfully deserves its place as one of the most famous equations in all of physics.
Note that sometimes, there is an alternate form of the Einstein Field Equations that is easier to solve. To do this, we expand out the full equations:
We now multiply both sides by
Using the fact that
So, substituting back into the original EFEs:
This makes the field equations simpler for vacuum solutions, where
which is still incredibly hard to solve, but more manageable than the typical case.
Finally, there is one more important fact about the field equations: taking the covariant derivatives of both sides is equal to zero. This means that:
This expression may look familiar if we recall that the covariant derivative with a repeated index is just the divergence of a field. What this is saying is that the total change in matter-energy flux in all of spacetime is zero - essentially, the conservation of energy.
A recap with intuition#
After doing so much math, it is helpful to reconnect with what the math is actually saying. That is, we want to regain our physical intuition for what the math describes.
Gravity is a fictitious force, caused by the curvature of spacetime. When spacetime isn’t curved, particles undergo no acceleration, and thus feel no gravitational force. But when spacetime is curved, which happens whenever masses are present in spacetime, particles undergo a definite acceleration. Due to the equivalence principle, the effect of gravity is indistinguishable from the effect of an acceleration, so therefore particles that are being accelerated feel like a force is acting on them.
The gravitational field is an object that extends through all space that gives each point a vector proportional to the gravitational force. Masses create and vary the gravitational field, and in turn the field exerts a force on masses within the field.
The gravitational potential is a function whose slope is equal to the gravitational field. It can be thought of as a landscape that masses are placed in. Where that landscape is very steep, the gravitational force is very strong; where that landscape is very flat, the gravitational force is very weak.
The metric tensor is a mathematical description of a spacetime. The classical analogue of it is the gravitational potential. Just as the gravitational potential influences the force of gravity, the metric tensor influences the curvature of spacetime, which particles experience as gravity.
We can measure distances in typical Euclidean space using a fixed grid, where the increments between the grid line are measured by constant basis vectors. In curved spacetime, basis vectors are no longer constant. The Christoffel symbols are a precise measure of how basis vectors changes in spacetime, or essentially, how the Euclidean constant grid gets distorted in spacetime. It is roughly analogous to the gravitational field. Just as a gravitational field disappears in empty space far away from any masses, the Christoffel symbols vanish too.
The Riemann tensor describes how the curvature of spacetime changes a vector as you move it in different directions in spacetime. The Ricci tensor describes how a volume in spacetime located at a given point in spacetime becomes contracted due to the curvature of spacetime. The Einstein tensor is the average value of this contraction of volume across all of the region of spacetime being studied.
The stress-energy tensor describes the matter-energy fluxes within a region of spacetime. The classical analogue would be matter density. Finally, just as matter is the source for gravity in Newtonian mechanics, matter is the source of spacetime curvature in General Relativity, and spacetime curvature is what we feel as gravity. John Archibald Wheeler famously summarized all of these ideas with the succint observation:
“Spacetime tells matter how to move; matter tells spacetime how to curve”