Classical field theory — scalar fields and Klein–Gordon

A single real number attached to every spacetime point — how the simplest relativistic wave equation falls out of the action principle.

Opening

In chapter 5 we extended the Lagrangian formalism from particles to fields. The countable set of degrees of freedom {qi(t)}\{q_i(t)\} was replaced by a continuous function ϕ(t,x)\phi(t, \vec x) defined on spacetime, but the skeleton — “make the action stationary and the equation of motion drops out” — survived intact. In this chapter we apply that skeleton to the simplest possible example: a real scalar field. By the end you will be able to derive the Klein–Gordon equation from a one-line action, and to state in a single sentence why this equation is the first stepping stone toward quantum field theory.

Throughout this chapter we use the mostly-minus signature for the spacetime metric: ημν=diag(+1,1,1,1)\eta_{\mu\nu} = \mathrm{diag}(+1, -1, -1, -1). Partway through we also switch to natural units with c=1c = 1.

Main 1 — Four-vector setup

A point in spacetime is written as a four-vector

xμ=(ct, x),μ{0,1,2,3}.x^\mu = (ct,\ \vec x), \qquad \mu \in \{0, 1, 2, 3\}.

The Greek index μ\mu (mu) runs from 0 to 3. The zeroth component is the speed of light times time; components 1–3 are the spatial coordinates.

The metric tensor ημν\eta_{\mu\nu} is a symmetric matrix used to raise and lower a pair of indices. We adopt

ημν=diag(+1,1,1,1)\eta_{\mu\nu} = \mathrm{diag}(+1, -1, -1, -1)

(the mostly-minus convention). The inverse ημν\eta^{\mu\nu} has the same form. Whenever the same index appears once up and once down, we sum it from 0 to 3 — the Einstein summation convention.

Bundling the partial derivative operators into four-vectors,

μ=xμ=(1ct, ),μ=ημνν=(1ct, ).\partial_\mu = \frac{\partial}{\partial x^\mu} = \left( \frac{1}{c}\partial_t,\ \nabla \right), \qquad \partial^\mu = \eta^{\mu\nu}\partial_\nu = \left( \frac{1}{c}\partial_t,\ -\nabla \right).

Contracting the two gives the d’Alembertian:

=μμ=1c2t22.\Box = \partial^\mu \partial_\mu = \frac{1}{c^2}\partial_t^2 - \nabla^2.

This is the four-dimensional cousin of the Laplacian. From here on we set c=1c = 1 to compress notation; time and space are then measured in the same (length) units and 0=t\partial_0 = \partial_t.

Main 2 — Scalar field Lagrangian and Klein–Gordon

A real scalar field ϕ(t,x)\phi(t, \vec x) is a function assigning a single real number to every spacetime point. Its value at a point is unchanged when we rotate or boost the coordinate axes — hence the name scalar.

The simplest Lorentz-invariant Lagrangian density is

L=12μϕμϕ12m2ϕ2,\mathcal{L} = \tfrac{1}{2}\partial^\mu \phi\, \partial_\mu \phi - \tfrac{1}{2} m^2 \phi^2,

where mm is a mass parameter attached to the field. The first term is a kinetic piece built from two spacetime derivatives contracted together; the second is a potential piece.

The Euler–Lagrange equation for a field, derived in chapter 5,

μL(μϕ)=Lϕ,\partial_\mu \frac{\partial \mathcal{L}}{\partial(\partial_\mu \phi)} = \frac{\partial \mathcal{L}}{\partial \phi},

now takes us all the way to the answer. Compute L/(μϕ)=μϕ\partial \mathcal{L}/\partial(\partial_\mu \phi) = \partial^\mu \phi and L/ϕ=m2ϕ\partial \mathcal{L}/\partial \phi = -m^2 \phi. Substituting,

(+m2)ϕ=0.(\Box + m^2)\, \phi = 0.

This is the Klein–Gordon equation.

Plugging in the plane-wave ansatz ϕ=ei(ωtkx)\phi = e^{-i(\omega t - \vec k \cdot \vec x)} yields ω2=k2+m2\omega^2 = |\vec k|^2 + m^2 — the relativistic dispersion relation. For m=0m = 0 this collapses to the lightlike dispersion ω=k\omega = |\vec k|; for m0m \neq 0 the group velocity approaches the speed of light as the wavelength shrinks.

Main 3 — Why Klein–Gordon matters

Klein–Gordon is the lowest-order relativistic wave equation that can describe a spin-0 boson. In the late 1920s Schrödinger himself first wrote it down as a candidate “relativistic Schrödinger equation”. He quickly hit two problems.

First, the equation is second order in time, so the initial data must include both ϕ\phi and tϕ\partial_t \phi. The Schrödinger equation is first order, requiring only ψ\psi. Second, the solutions ω=±k2+m2\omega = \pm\sqrt{|\vec k|^2 + m^2} include negative-energy branches. Forcing a single-particle probability interpretation on top of these solutions makes the probability density turn negative.

The resolution arrived later. Klein–Gordon is not the wave function of one particle; it is a classical field equation that, after quantization, describes a spin-0 boson — the Higgs and pions are real-world examples. Negative-energy solutions are reinterpreted as antiparticles.

We will meet this result again in chapter 9. There we will treat the path integral, in which the action SS enters as a phase eiS/e^{iS/\hbar} summed over all paths. In the limit 0\hbar \to 0 only the stationary point survives, and the Euler–Lagrange equation is recovered. Klein–Gordon will be the cleanest example of that stationary point.

In Python

# Leapfrog integration of the 1+1D Klein-Gordon equation.
# (∂_t^2 - ∂_x^2 + m^2) φ = 0
# Start from a Gaussian and watch the wavepacket disperse.
import numpy as np
import matplotlib.pyplot as plt

Nx, dx = 400, 0.1
x = np.linspace(-20, 20, Nx)
dt = 0.05
m = 1.0

# Initial data: width-1.5 Gaussian, zero initial velocity
phi_prev = np.exp(-(x / 1.5)**2)
phi = phi_prev.copy()  # ∂_t φ = 0 so the previous step equals the current

snapshots = {0.0: phi.copy()}
target_times = [5.0, 10.0, 15.0]

t, step = 0.0, 0
while t < target_times[-1] + dt:
    lap = np.zeros_like(phi)
    lap[1:-1] = (phi[2:] - 2*phi[1:-1] + phi[:-2]) / dx**2
    phi_next = 2*phi - phi_prev + dt**2 * (lap - m**2 * phi)
    phi_prev, phi = phi, phi_next
    t += dt; step += 1
    for tt in target_times:
        if abs(t - tt) < dt/2 and tt not in snapshots:
            snapshots[tt] = phi.copy()

for tt, snap in snapshots.items():
    plt.plot(x, snap, label=f"t = {tt:.0f}")
plt.xlabel("x"); plt.ylabel(r"$\phi(x, t)$")
plt.legend(); plt.title("1+1D Klein-Gordon — a dispersing wavepacket")
plt.show()

The Gaussian splits into two outgoing lumps, but unlike a massless wave that would propagate rigidly at the speed of light, the peaks shrink and the widths broaden. That is dispersion: the m2m^2 term makes each kk-component travel at its own group velocity vg=k/k2+m2v_g = k/\sqrt{k^2 + m^2}.

To the next chapter

Chapter 7: Relativistic mechanics turns the four-vector toolbox we just assembled back onto particles. Starting from the free-particle action S=mdτS = -m \int d\tau, we will read off the four-momentum, four-acceleration, and the equation of motion under an external force — and see how fields and particles snap together into a single picture under the same variational principle.