Spinor field theory

B.4 Spinor field theory

...anything that comes back to itself with a minus sign after a 2 $π$ rotation is always going to be a little strange. — David Tong [76]

So far, we have focused on scalar fields, which live in the trivial representation of the Lorentz group and correspond to spin- $0$ bosons. In this section, we discuss the field theory for spin- $\frac{1}{2}$ particles, or fermions, which constitute all matter in the universe.

B.4.1 The Dirac equation

Like the Klein-Gordon equation, the Dirac equation was also an attempt at a relativistic version of the Schrödinger equation. Before the development of QFT, the quantized KG equation was thought to produce negative probabilities due to its second derivative in time.⁶ Dirac thus sought a relativistic first-order differential equation in space and time.

Legend has it he was staring into a fire in Cambridge when he came up with an equation of the form

(i γ^{μ} \partial_{μ} - m) ψ = 0,

(B.4.1)

where $γ^{μ}$ are constants that will be defined in a moment, and $ψ$ is a complex field. It is difficult to make this equation Lorentz covariant; indeed, it is impossible if $ψ$ is a scalar and each $γ^{μ}$ is simply a number.⁷ Dirac’s brilliant insight, however, was that it can be covariant if $γ_{μ}$ are $4 \times 4$ complex matrices and $ψ$ a four component field.

The key is that $γ^{μ} \partial_{μ}$ is essentially the “square-root” of the d’Alembertian $□$ from the KG-equation:

γ^{μ} \partial_{μ} γ^{ν} \partial_{ν} = □ = \partial_{μ} \partial^{μ},

(B.4.2)

if (and only if) $γ^{μ}$ and $γ^{ν}$ satisfy the Clifford algebra:

{γ^{μ}, γ^{ν}} = 2 η^{μν},

(B.4.3)

where ${A, B} = AB + BA$ is the anticommutator. Dirac found this is possible with $4 \times 4$ matrices such as

γ^{0} = (\begin{matrix} 0 & 𝟙 \\ 𝟙 & 0 \end{matrix}), γ^{i} = (\begin{matrix} 0 & σ^{i} \\ - σ^{i} & 0 \end{matrix}),

(B.4.4)

where $σ^{i}$ are the Pauli matrices (Chapter 2.2). These are called the gamma, or Dirac, matrices, and plugging them into Eq. B.4.1 yields the Dirac equation, which can be written even more compactly by defining $\partial \equiv γ^{μ} \partial_{μ}$ :

(i \partial - m) ψ = 0 .

(B.4.5)

This equation is considered one of the most significant breakthroughs in theoretical physics, “on par with the works of Newton, Maxwell, and Einstein before him” [432]. The insights that followed, as we will outline in this section, provided a theoretical basis for fermion spin, implied the existence of antiparticles, and overall were foundational to the development of the SM.⁸

B.4.2 Spinors

Before discussing solutions and quantization of the Dirac equation, let us examine what kind of object $ψ$ is. A related property of the Clifford algebra is that

Σ_{μν} \equiv \frac{i}{4} [γ^{μ}, γ^{ν}]

(B.4.6)

satisfies the Lorentz algebra (Eq. 2.3.12). This means $Σ_{μν}$ are generators of Lorentz transformations

S [Λ] = e^{\frac{1}{2} ω^{μν} Σ_{μν}},

(B.4.7)

where $Λ$ is a Lorentz transformation with parameters $ω^{μν}$ , and $S [Λ]$ is a particular 4D representation.

It can be shown⁹ that the Dirac equation is only Lorentz covariant if the components of $ψ$ , $ψ_{α}$ , transform under this exact representation:

ψ_{α} \to ψ_{α}^{'} = S {[Λ]}_{α}^{β} ψ_{β} .

(B.4.8)

It is important to note here that $S [Λ]$ is acting on the $ψ$ components — also called the spinor indices – and not on the spacetime coordinates $x^{μ}$ , which transform under the vector representation (Eq. 2.3.2). Explicitly, including the spacetime coordinates, $ψ (x)$ transforms as:

ψ_{α} (x) \to ψ_{α}^{'} (x^{'}) = S {[Λ]}_{α}^{β} ψ_{β} (Λ^{- 1} x),

(B.4.9)

where both $S [Λ]$ and $Λ$ share the same transformation parameters $ω^{μν}$ and thus correspond to the same Lorentz transformation.¹⁰

Dirac and Weyl spinors

What is this representation? Let’s look at the rotation and boost generators individually:

Σ_{0 i} = \frac{i}{2} (\begin{matrix} - σ^{i} & 0 \\ 0 & σ^{i} \end{matrix}), Σ_{ij} = \frac{1}{2} 𝜖_{ijk} (\begin{matrix} σ^{k} & 0 \\ 0 & σ^{k} \end{matrix}) .

(B.4.10)

Comparing this with Eqs. 2.3.8 and 2.3.9, we see that the top left and bottom right blocks are exactly the left- and right-handed Weyl spinor irreps of the generators. The handedness of a spinor is called its chirality, and its physical significance will be discussed in a moment. Thus, we identify $S [Λ]$ with the $(\frac{1}{2}, 0) \oplus (0, \frac{1}{2})$ , or Dirac spinor, representation.

This also means that, in this basis of the gamma matrices (called the Weyl, or chiral, basis), the Dirac spinor $ψ$ can be decomposed into two Weyl spinors:

ψ = (\begin{matrix} ψ_{L} \\ ψ_{R} \end{matrix}),

(B.4.11)

which transform under their respective representations. The two components can be isolated if we consider a fifth gamma matrix:

γ^{5} = i γ^{0} γ^{1} γ^{2} γ^{3} = (\begin{matrix} - 𝟙 & 0 \\ 0 & 𝟙 \end{matrix}) .

(B.4.12)

$γ^{5}$ is similar to our main four matrices in that ${γ^{5}, γ^{μ}} = 0$ and ${(γ^{5})}^{2} = 𝟙$ . Importantly, we see from its form in the Chiral basis that projection operators $P_{L}$ and $P_{R}$ can be defined as:

P_{L} = \frac{1 - γ^{5}}{2}, P_{R} = \frac{1 + γ^{5}}{2},

(B.4.13)

which satisfy the projection property $P_{L ∕ R}^{2} = P_{L ∕ R}$ and project out the left- and right-handed components of a Dirac spinor:

P_{L} (\begin{matrix} ψ_{L} \\ ψ_{R} \end{matrix}) = (\begin{matrix} ψ_{L} \\ 0 \end{matrix}), P_{R} (\begin{matrix} ψ_{L} \\ ψ_{R} \end{matrix}) = (\begin{matrix} 0 \\ ψ_{R} \end{matrix}) .

(B.4.14)

Note that while the specific form depends on the basis, the definitions in Eq. B.4.13 are basis-independent and can be considered to define chirality.

Chirality

The two Weyl spinor representations are related by a complex conjugation, meaning $ψ_{L}^{*}$ is a right-handed Weyl spinor, and vice versa. For a complex scalar field, we interpreted the conjugate as the antiparticle. The same interpretation applies here; hence, if a left-handed spinor describes a particle, its antiparticle is described by its conjugate, right-handed spinor.

The Dirac equation can be rewritten in the Weyl basis as two coupled equations of the Weyl spinors. Let us define $σ^{μ} = (𝟙, σ)$ and ${\bar{σ}}^{μ} = (𝟙, - σ)$ , so that

(i γ^{μ} \partial_{μ} - m) ψ = (\begin{matrix} - m & i σ^{μ} \partial_{μ} \\ i {\bar{σ}}^{μ} \partial_{μ} & - m \end{matrix}) (\begin{matrix} ψ_{L} \\ ψ_{R} \end{matrix}) = 0 .

(B.4.15)

Hence, we see the mass term couples the left- and right-handed components. This is why all massive fermions must exist in pairs of particles and antiparticles. An important special case, however, is for a neutral Majorana fermion, where $ψ$ equals its charge conjugate $ψ^{c}$ (to be defined below). Such a particle is its own antiparticle and can have a left-handed- or right-handed-only mass term. As discussed in Chapter 2.3, the only Majorana candidate in the SM is the right-handed neutrino.

For $m = 0$ , the Dirac equation decouples and leaves us with the Weyl equations describing massless fermions:

i σ^{μ} \partial_{μ} ψ_{R} = 0, i {\bar{σ}}^{μ} \partial_{μ} ψ_{L} = 0 .

(B.4.16)

In Fourier space, these are:

\begin{matrix} \begin{aligned} σ^{μ} p_{μ} ψ_{R} = (E - σ \cdot p) ψ_{R} = 0 \Rightarrow \frac{σ \cdot p}{| p |} ψ_{R} = + ψ_{R}, \\ {\bar{σ}}^{μ} p_{μ} ψ_{L} = (E + σ \cdot p) ψ_{L} = 0 \Rightarrow \frac{σ \cdot p}{| p |} ψ_{L} = - ψ_{L}, \end{aligned} \end{matrix}

(B.4.17)

where we used $E = | p |$ for massless particles. You may recall $\frac{σ \cdot p}{| p |}$ is the helicity operator, projecting the particle spin along its momentum. Thus, in the massless limit, we see that the left- and right-handed Weyl spinors are the $+ 1$ and $- 1$ helicity eigenstates, respectively. This is not the case for massive particles, as helicity is no longer Lorentz invariant: one can always boost into a frame where the momentum is inverted while the spin remains the same, changing the sign of the helicity. Chirality is thus a more abstract concept for massive particles, related only to how they transform under Lorentz transformations.

Theories not symmetric under exchange of left- and right-handed components are called chiral, and symmetric theories vector. QED and QCD are both vector theories, but weak interactions are, surprisingly, chiral. This necessarily means it violates parity and charge conjugation symmetries ( $P$ and $C$ ), which we will discuss soon in Section B.4.6.

B.4.3 The Dirac Lagrangian

Recall that to quantize the scalar theory, we first needed the Lagrangian and the classical solutions of the K-G equation, to then obtain Hamiltonian and canonical fields and Poisson brackets before finally promoting them to quantum commutatation relations. We will proceed in similar (though condensed) fashion for the spinor theory, and first derive the Lagrangian corresponding to the Dirac equation.

Since we are no longer dealing with trivial representation of the Lorentz group, we have to be more careful with the types of terms we put into the Lagrangian; it must be composed of good Lorentz-invariant objects. A first guess at a Lorentz scalar formed of spinors may be $ψ^{†} ψ$ . This is indeed a scalar, but it is not Lorentz invariant: $ψ$ and $ψ^{†}$ transform as $ψ \to S [Λ] ψ$ , $ψ^{†} \to ψ^{†} S {[Λ]}^{†}$ and, hence

ψ^{†} ψ \to ψ^{†} S {[Λ]}^{†} S [Λ] ψ .

(B.4.18)

However, recall from Chapter 2.3 that (finite-dimensional) representations of Lorentz transformations are not unitary. (We can see this as well from the fact that the generators of $S [Λ]$ in Eq. B.4.10 are not anti-Hermitian.) Thus, $S {[Λ]}^{†} S [Λ] \neq 1$ in general and $ψ^{†} ψ$ is not a Lorentz scalar.

Instead, with a bit of matrix algebra¹¹, one can show that

γ^{0} S [Λ] γ^{0} = {(S {[Λ]}^{- 1})}^{†},

(B.4.19)

and hence

ψ^{†} γ^{0} ψ \to ψ^{†} S {[Λ]}^{†} γ^{0} S [Λ] ψ = ψ^{†} γ^{0} S {[Λ]}^{- 1} S [Λ] ψ = ψ^{†} γ^{0} ψ

(B.4.20)

is a Lorentz scalar. Thus, we define $\bar{ψ} \equiv ψ^{†} γ^{0}$ as the “natural” conjugate to $ψ$ , and end up with a nice Lorentz scalar $\bar{ψ} ψ$ for our Lagrangian.

Similarly, one can show that $\bar{ψ} γ^{μ} ψ$ transforms as a Lorentz $4$ -vector and, hence, contracting it with $\partial_{μ}$ as $\bar{ψ} γ^{μ} \partial_{μ} ψ$ yields another scalar. These two terms, which are analogous to the mass and derivative terms a free complex scalar field (Eq. 3.1.15), are enough to build the Dirac Lagrangian:

L = i \bar{ψ} γ^{μ} \partial_{μ} ψ - m \bar{ψ} ψ = \bar{ψ} (i \partial - m) ψ .

(B.4.21)

One can check that the EL equations reproduce the Dirac equation for $ψ$ and $\bar{ψ}$ .

The U(1) conserved current

As with the complex scalar field, observe that the Dirac Lagrangian is invariant under global $U (1)$ symmetry $ψ \to e^{iα} ψ$ . Using Noether’s theorem, we can derive the conserved current and charge associated with this symmetry:

j^{μ} = \bar{ψ} γ^{μ} ψ, Q = \int d^{3} x j^{0} = \int d^{3} x ψ^{†} ψ .

(B.4.22)

As for the complex scalar field, these represent the electromagnetic $4$ -current and charge, respectively — a connection we will explore further in Section 3.3.1.

B.4.4 Quantizing the Dirac field

Solutions to the Dirac equation

Before quantizing, we first need the classical solutions to the Dirac equation. Multiplying both sides of it by $- (i γ^{μ} \partial_{μ} + m)$ gives us:

- (i γ^{μ} \partial_{μ} + m) (i γ^{ν} \partial_{ν} - m) ψ = (□ - m^{2}) ψ = 0,

(B.4.23)

which means each component of $ψ$ individually satisfies the KG-equation. Thus, we can assume similar plane wave solutions:

ψ (x) = \int \frac{d^{3} p}{{(2 π)}^{3}} u (p) e^{- ip \cdot x} + v (p) e^{ip \cdot x},

(B.4.24)

where $u (p)$ and $v (p)$ are now spinors, and again we have positive and negative frequency solutions that correspond to particles and antiparticles, respectively, after quantization.

One can check using Fourier space, as we did for the Weyl equations, that

u (p) = (\begin{matrix} \sqrt{p \cdot σ} ξ \\ \sqrt{p \cdot \bar{σ}} ξ \end{matrix}), v (p) = (\begin{matrix} \sqrt{p \cdot σ} η \\ - \sqrt{p \cdot \bar{σ}} η \end{matrix})

(B.4.25)

are general solutions to the Dirac equation, where $ξ$ and $η$ are the familiar two-component spinors from QM for spin- $\frac{1}{2}$ particles (although technically they do not have this interpretation before quantization). As is conventional, we will use a basis of $σ_{z}$ eigenstates $ξ_{1} = η_{1} = {(1, 0)}^{T}$ and $ξ_{2} = η_{2} = {(0, 1)}^{T}$ , corresponding to spin-up and spin-down, respectively.

For example, in the rest frame $p_{μ} = (m, 0, 0, 0)$ , we have:

u {(p)}_{1} = \sqrt{m} (\begin{matrix} 1 \\ 0 \\ 1 \\ 0 \end{matrix}), u {(p)}_{2} = \sqrt{m} (\begin{matrix} 0 \\ 1 \\ 0 \\ 1 \end{matrix}), v {(p)}_{1} = \sqrt{m} (\begin{matrix} 1 \\ 0 \\ - 1 \\ 0 \end{matrix}), v {(p)}_{2} = \sqrt{m} (\begin{matrix} 0 \\ 1 \\ 0 \\ - 1 \end{matrix}) .

(B.4.26)

More generally, we can always orient a particle’s 3-momentum along the $z$ -axis, in which case:

u {(p)}_{1} = (\begin{matrix} \sqrt{E - p_{z}} \\ 0 \\ \sqrt{E + p_{z}} \\ 0 \end{matrix}), u {(p)}_{2} = (\begin{matrix} 0 \\ \sqrt{E - p_{z}} \\ 0 \\ \sqrt{E + p_{z}} \end{matrix}) v {(p)}_{1} = (\begin{matrix} \sqrt{E + p_{z}} \\ 0 \\ - \sqrt{E - p_{z}} \\ 0 \end{matrix}), v {(p)}_{2} = (\begin{matrix} 0 \\ \sqrt{E + p_{z}} \\ 0 \\ - \sqrt{E - p_{z}} \end{matrix}) .

(B.4.27)

Quantization

Now that we have a sensible Lagrangian and the classical solutions to the Dirac equation, the remaining steps to quantization follow closely that for our complex scalar field in Section B.2.4, but with two notable differences. The first is that we now must sum over the two spin components of $u_{s} (p)$ and $v_{s} (p)$ , in addition to integrating over the momentum:

\begin{matrix} \begin{aligned} ψ (x) & = \sum_{s = 1, 2} \int \frac{d^{3} p}{{(2 π)}^{3}} [{\hat{b}}_{p}^{s} u_{s} (p) e^{- ip \cdot x} + c_{p}^{s †} v_{s} (p) e^{ip \cdot x}], \\ \bar{ψ} (x) & = \sum_{s = 1, 2} \int \frac{d^{3} p}{{(2 π)}^{3}} [{\hat{b}}_{p}^{s †} ū_{s} (p) e^{ip \cdot x} + ĉ_{p}^{s} {\bar{v}}_{s} (p) e^{- ip \cdot x}] . \end{aligned} \end{matrix}

(B.4.28)

As before, we have positive and negative frequency solutions, with the $b ∕ b^{†}$ and $c ∕ c^{†}$ operators associated with particles of the same mass and opposite charge. For spinors, we find that the ${\hat{b}}^{s †} | 0 ⟩$ and $ĉ^{s †} | 0 ⟩$ also have opposite spins, i.e. for the $z$ -axis angular momentum operator $J_{z}$ (which can be derived through Noether’s theorem as we did for the momentum operator in Section 3.1.2):

J_{z} {\hat{b}}^{s †} | 0 ⟩ = \pm \frac{1}{2} {\hat{b}}^{s †} | 0 ⟩, J_{z} ĉ^{s †} | 0 ⟩ = \mp \frac{1}{2} ĉ^{s †} | 0 ⟩ .

(B.4.29)

By convention, we take $b^{s †}$ and $b^{s}$ to be the creation and annihilation operators for the electron, and $c^{s †}$ and $c^{s}$ for its antiparticle, the positron. Thus, ${\bar{ψ}}_{s} (x) | 0 ⟩$ corresponds to an electron at $x$ with spin state $s$ , and $ψ_{s} (x) | 0 ⟩$ to a positron at $x$ with the opposite spin state to $s$ .

Through his equation, Dirac was the first to predict the existence of antimatter in 1930 [434] (although he initially thought the electron’s antiparticle was the proton). This prediction was soon confirmed by the discovery of a particle with the same mass as the electron but opposite charge by Carl Anderson in a bubble chamber in 1932 [435]. Both were awarded the Nobel prize.

The spin-statistics connection

The second, extremely important difference from scalar quantization is that, because spinors are spin- $\frac{1}{2}$ particles, they must obey anticommutation relations:

\begin{matrix} \begin{aligned} {ψ_{α} (x), ψ_{β} (y)} = & {{\bar{ψ}}_{α} (x), {\bar{ψ}}_{β} (y)} = 0, \\ {ψ_{α} (x), {\bar{ψ}}_{β} (y)} & = δ_{αβ} δ^{3} (x - y), \end{aligned} \end{matrix}

(B.4.30)

which also means the creation and annihilation operators satisfy:

{{\hat{b}}_{p}^{s}, {\hat{b}}_{q}^{r †}} = {ĉ_{p}^{s}, ĉ_{q}^{r †}} = {(2 π)}^{3} δ^{3} (p - q) δ_{sr} .

(B.4.31)

Thus, unlike bosons, exchanging two particles yields a minus sign: ${\hat{b}}_{p_{1}}^{r †} {\hat{b}}_{p_{1}}^{s †} | 0 ⟩ = - {\hat{b}}_{p_{2}}^{s †} {\hat{b}}_{p_{1}}^{r †} | 0 ⟩$ , confirming that spinors obey Fermi-Dirac statistics and obey the Paul-Exclusion principle. Were we to try and impose our earlier commutation relations for spinors (or indeed, any half-integer-spin field), we would run into several issues. These include the time-ordered product in the $S$ -matrix not being Lorentz invariant, and antiparticles contributing arbitrarily negative energies, making the theory unstable. They are all related to the deep connection between spin and statistics: the requirement of Lorentz invariance, stability, and causality in a QFT necessitates that half-integer-spin particles obey Fermi-Dirac, and integer-spin particles Bose-Einstein statistics.¹²

B.4.5 Interactions and Feynman rules

Having quantized the free Dirac field, we now discuss interactions, again focusing on small (and renormalizable) perturbations to the free theory. We start by presenting the propagators for the Dirac field and then extending our scalar Yukawa theory from Section 3.2 to spinor “nucleons”.

Propagators

We define the propagator for the Dirac field the same as for scalar fields in Section 3.1.4:

D_{αβ} (x - y) = ⟨ 0 | ψ {(x)}_{α} \bar{ψ} {(y)}_{β} | 0 ⟩ = \int \frac{d^{3} p}{{(2 π)}^{3}} \frac{1}{2 E_{p}} \sum_{s} u_{α}^{s} (p) ū_{β}^{s} (p) e^{- ip \cdot (x - y)},

(B.4.32)

where $α$ and $β$ index the spinor components. Again, we have an extra sum over the spin states. With some more matrix algebra one can show that these kinds of sums simplify nicely to

\sum_{s} u_{α}^{s} (p) ū_{β}^{s} (p) = {(p + m)}_{αβ}, \sum_{s} v_{α}^{s} (p) {\bar{v}}_{β}^{s} (p) = {(p - m)}_{αβ},

(B.4.33)

so that we end up with, in momentum space, the Feynman propagator:

Δ_{F} (p) \equiv ⟨ 0 | Tψ (x) \bar{ψ} (y) | 0 ⟩ = \frac{i (p + m)}{p^{2} - m^{2} + i𝜖} .

(B.4.34)

Note that we have now suppressed the spinor indices; $Δ_{F}$ is still a $4 \times 4$ matrix in spinor space. Note as well the relative minus sign in the time-ordering operator for fermions, due to exchanging the fields:

⟨ 0 | Tψ (x) \bar{ψ} (y) | 0 ⟩ = {\begin{matrix} ⟨ 0 | ψ (x) \bar{ψ} (y) | 0 ⟩ & x^{0} > y^{0}, \\ - ⟨ 0 | \bar{ψ} (y) ψ (x) | 0 ⟩ & x^{0} < y^{0} . \end{matrix}

(B.4.35)

External lines

For scalars, external line terms such as $ϕ | p ⟩$ simply contributed a factor of $1$ to the matrix element, where $| p ⟩$ is again a one-particle meson state with momentum $p$ :

ϕ | p ⟩ \sim \int \frac{d^{3} p^{'}}{{(2 π)}^{3}} \frac{1}{\sqrt{2 E_{p^{'}}}} a_{p^{'}} e^{- i p^{'} \cdot x} \sqrt{2 E_{p}} a_{p}^{†} | 0 ⟩ = e^{- ip \cdot x} | 0 ⟩ .

(B.4.36)

(The $e^{- ip \cdot x}$ factor contributes only to the momentum conservation delta function in the $S$ -matrix element.) For spinors, we instead end up with a spinor factor. For example, for an incoming fermion with momentum $q$ and spin $s$ :

ψ | q, s ⟩ \sim \int \frac{d^{3} q^{'}}{{(2 π)}^{3}} \frac{1}{\sqrt{2 E_{q^{'}}}} \sum_{s}^{'} b_{q^{'}}^{s^{'}} u^{s^{'}} (q^{'}) e^{- i q^{'} \cdot x} \sqrt{2 E_{q}} b_{q}^{s †} | 0 ⟩ = u^{s} (q) e^{- iq \cdot x} | 0 ⟩ .

(B.4.37)

We can see looking at the form of the quantized fields (Eq. B.4.28), and which terms will contribute something non-zero, that incoming (outgoing) external fermions will be associated with a $u$ $(ū$ ) and antifermions with a $\bar{v}$ $(v$ ) factor.¹³

Yukawa theory reloaded

We now revisit Yukawa theory, the simplest possible theory of interactions for spinors. The Lagrangian is the same as in Eq. 3.2.3, but now with $ψ$ a spinor:

L = \frac{1}{2} \partial^{μ} ϕ \partial_{μ} ϕ + i \bar{ψ} \partial ψ - \frac{1}{2} m^{2} ϕ^{2} - M \bar{ψ} ψ - gϕ \bar{ψ} ψ .

(B.4.38)

Note that through dimensional analysis, since $[M \bar{ψ} ψ] = [\bar{ψ} \partial ψ] \overset{!}{=} 4$ we can deduce that $[ψ] = \frac{3}{2}$ . This means that (1) the Yukawa interaction is marginal, with $[ϕ \bar{ψ} ψ] = 4$ and $[g] = 0$ , and (2) importantly, there are no other renormalizable, Lorentz-invariant interactions we can write down for spinors with the fields at our disposal (modulo some $γ^{5}$ ’s thrown in, as we’ll discuss in Section B.4.6). Terms like $ψ ϕ^{2}$ , $\partial ψϕ$ , or $\bar{ψ} ψ ϕ^{2}$ are all either not Lorentz-scalars or of dimension $\geq 5$ . In this sense, because their possible interactions are so heavily constrained by their $\frac{3}{2}$ -dimensionality, spinors in QFT are quite simple! There is only one other spinor interaction in the SM, which we will see in Section 3.3, with gauge bosons.

We again refer to $ϕ$ and $ψ$ as the “meson” and “nucleon” fields, which is slightly more accurate now since nucleons are in reality fermions. The two main features missing from this theory are that the relevant mesons, the pions, are pseudoscalars (to be discussed in the next section) and are a strong isospin triplet (to be described briefly in Chapter. 4.1).

Definition B.4.1. The Feynman rules in momentum space for spinor Yukawa theory are:

1.

Vertices:

= - ig

2.

Internal lines (propagators)
Mesons:

= \frac{i}{p^{2} - m^{2} + i𝜀}

Nucleons:

= \frac{i (q + m)}{q^{2} - M^{2} + i𝜀}

3.

External lines (on-shell particles)

Incoming mesons:

= 1

Outgoing mesons:

= 1

Incoming nucleons:

= u_{s} (q)

Outgoing nucleons:

= ū_{s} (q)

Incoming antinucleons:

= {\bar{v}}_{s} (q)

Outgoing antinucleons:

= v_{s} (q)

4.

Impose momentum conservation at each vertex.

5.

Integrate over the momentum

k

flowing through each loop.

6.

Figure out the sign based on statistics.

Meson decay and the Higgs decay width

ϕ¯uvss12((qq12)) — Figure B.3. Tree-level Feynman diagram for meson decay via a Yukawa interaction.

The matrix element for meson decay into a fermion-antifermion pair with spin and momentum $s_{1}, q_{1}$ and $s_{2}, q_{2}$ , respectively, to first-order can be read off from the Feynman diagram in Figure B.3:

i M = - ig ū_{s_{1}} (q_{1}) v_{s_{2}} (q_{2})

(B.4.39)

We can calculate the decay rate as in Section 3.2.4, except now we have to sum over the spins of the fermions:

d Γ = \sum_{s_{1}, s_{2}}^{2} \frac{1}{2 m} {| M |}^{2} d Π_{LIPS} = \frac{g^{2}}{2 m} \sum_{s_{1}, s_{2}}^{2} {| ū_{s_{1}} (q_{1}) v_{s_{2}} (q_{2}) |}^{2} d Π_{LIPS} .

(B.4.40)

In the COM frame, we can choose $q_{1} = (\frac{m}{2}, 0, 0, q)$ and $q_{2} = (\frac{m}{2}, 0, 0, - q)$ , with $q^{2} = \frac{m^{2}}{4} - M^{2}$ by energy conservation. Using the forms of $ū_{s}$ and $v_{s}$ we found in Eq. B.4.27, we see that the sum over spin states simplifies nicely:

\sum_{s_{1}, s_{2}}^{2} {| ū_{s_{1}} (q_{1}) v_{s_{2}} (q_{2}) |}^{2} = 8 q^{2} = 2 (m^{2} - 4 M^{2}) .

(B.4.41)

Since this is independent of the final state kinematics, the integral of $d Π_{LIPS}$ is the same as for the scalar meson decay, and we obtain an the overall decay rate of:

Γ = \frac{g^{2} m}{16 π} {(1 - \frac{4 M^{2}}{m^{2}})}^{3 ∕ 2} .

(B.4.42)

As we hinted at in Section 3.2.4, this is in fact the decay width of the Higgs boson to fermions at tree level, if we plug in the Higgs Yukawa coupling constant $g_{f} = \frac{\sqrt{2} m_{f}}{v}$ . Here $m_{f}$ is the fermion mass and $v$ is the Higgs vacuum expectation value, $246 GeV$ . For example, for the $H \to μ^{+} μ^{-}$ decay, with $M = m_{μ} = 105.7 MeV$ and $m = m_{H} = 125 GeV$ , we get $Γ \approx 900 eV$ , exactly in line with the predicted value [436]!

One can similarly update our nucleon scattering amplitudes from Section 3.2.3, which simply gain some inner products between the incoming and outgoing spin states (see e.g. Tong QFT [74] Chapter 5.7). Notably, however, the $t$ -channel and $u$ -channel diagrams (Figure B.2) now have a relative minus sign, in accordance with Fermi-Dirac statistics.

B.4.6 CPT Symmetries

In this section, we discuss three important discrete symmetries in QFT. As discussed in Chapter 2.3, the full Lorentz group includes the parity $P$ and time reversal $T$ operators. In the $4$ -vector representation, they have the simple forms $P = diag (1, - 1, - 1, - 1)$ and $T = diag (- 1, 1, 1, 1)$ , meaning

P : (t, x) \to (t, - x), T : (t, x) \to (- t, x) .

(B.4.43)

However, their forms in other representations, such as spinors, are not as straightforward.

Observe also that all our complex Lagrangians so far have been invariant under some form of complex conjugation $ψ \leftrightarrow ψ^{*}$ . This represents another discrete symmetry, and since we know from Eq. 3.1.17 that complex conjugation inverts “charge”, we call this charge conjugation, or $C$ , symmetry.

All local, relativistic QFTs are necessarily invariant under the combined $CPT$ symmetry; this is known as the CPT theorem [437, 438].¹⁴ Whether a theory is individually $C$ , $P$ , or $T$ invariant, however, must be determined by experiment,¹⁵ as we give examples of below. If it is, we must impose the symmetries in our mathematical formulation by carefully defining the actions of the relevant operators; i.e., we have to consider how $ψ$ must transform under $P$ to maintain $P$ -invariance of the Lagrangian, etc.

Such symmetries are crucial handles for understanding QFTs, particularly in the case of the weak and strong interactions for which we have otherwise little classical intuition. By studying them, we often glean important insights into the theory, such as why certain processes are forbidden: for example, we now understand that the pion cannot decay into three photons because this would violate the $C$ -invariance of QED.

$P$ - and $CP$ -violation

Historically, it was thought that parity individually is a universal symmetry of nature. Indeed, this was verified experimentally for electromagnetism and the strong interaction, but, surprisingly, in 1956 an experiment measuring the isotropy of the beta decay of cobalt-60 to nickel-60 by Chien-Shiung Wu showed that the weak interaction in fact violates parity- (and $C$ -) invariance [439]. The two theorists, Yang Chen-Ning and Lee Tsung-Dao, who proposed this experiment won the Nobel prize the year after but, controversially, Wu did not.

It was then proposed by Lev Landau [440] and others that perhaps the combined $CP$ -symmetry is the true symmetry of nature. As we define below, the $CP$ operation transforms a particle into its antiparticle, hence, $CP$ -invariance can be thought of as saying the laws of physics are the same for particles and antiparticles. This indeed appeared to be the case until 1964, when the Fitch-Cronin experiment discovered small, indirect $CP$ -violation by the weak interaction by measuring decays of neutral kaons [441], for which another Nobel prize was awarded to James Cronin and Val Fitch. Since then, several experiments have observed both direct and indirect $CP$ -violation, and quantifying the magnitude of $CP$ -violation in different sectors of the SM remains an active area of research in HEP (see Ref. [442] Chapters 13-14 for a nice comprehensive review).

Interestingly, $CP$ -violation is only possible through the weak interaction if there exist $\geq 3$ generations of fermions, whereas it is expected for the strong interaction but not observed (the so-called “strong $CP$ problem” [102, 103].¹⁶ Furthermore, the experimentally determined magnitude of $CP$ -violation in the weak interaction is about $1000 \times$ smaller than what is allowed [103, 442]. These mysterious “coincidences” — Why did nature “choose” exactly the minimum number of generations needed for $CP$ -violation? Why is there no strong $CP$ -violation? etc. — suggest deeper underlying physics, such as “axions” [443].

Scalar fields

We see from our complex scalar Lagrangian in Eq. 3.1.15 that it can only be invariant under $C$ , $P$ , or $T$ if they transform the field $ϕ$ by at most a complex phase: $ϕ \to e^{iα} ϕ$ . A further physical requirement, however, is that applying any of the operators twice should return the original field, which thus constrains the possible transformations to:

\begin{matrix} \begin{aligned} C : ϕ (t, x) & \to \pm ϕ^{*} (t, x), \\ P : ϕ (t, x) & \to \pm ϕ (t, - x), \\ T : ϕ (t, x) & \to \pm ϕ (- t, x) . \end{aligned} \end{matrix}

(B.4.44)

The time-reversal operation is a bit subtle, as it must be anti-unitary. We will not discuss it much further, although its implications can be fun to think about.

Nomenclature Whether a field transforms with a $+$ or $-$ sign under $P$ is called its intrinsic parity, and similarly under $C$ its intrinsic $C$ -parity. We also refer to them as “even” or “odd” under the transformation, respectively. In particular, an odd-parity scalar, i.e. one which transforms with a minus sign under parity, is called a pseudoscalar. The Higgs field, for example, is a scalar, while the pion is a pseudoscalar (as was determined based on nuclear interactions).

Vector fields

Though we introduce vector fields in detail in the next section, their transformation properties are analogous to scalars and simple enough to describe here:

\begin{matrix} \begin{aligned} C : A^{μ} (t, x) & \to \pm A^{† μ} (t, x), \\ P : A^{μ} (t, x) & \to \pm η_{μν} A^{ν} (t, - x), \\ T : A^{μ} (t, x) & \to \mp η_{μν} A^{ν} (- t, x), \end{aligned} \end{matrix}

(B.4.45)

where $η_{μν}$ is the Minkowski metric (i.e. $P$ and $T$ flip the sign of the first and the last three components of $A^{μ}$ , respectively). We use similar “odd” and “even” nomenclature for vectors, with an odd-parity vector called a pseudovector. Recall for example that the electric and magnetic $3$ -vector fields are vectors and pseudovectors, respectively. Notably, the photon is odd under $C$ while the neutral pion; this explains why the pion can decay into two photons (since the two photons have a combined parity of $(- 1) (- 1) = + 1$ ), but not to three, even though either would be allowed kinematically.

Spinors: parity

Spinors live in a more complicated representation of the Lorentz group, so it takes more work to derive their transformations. On the other hand, this also means their properties and the physical consequences are more interesting.

If $P$ is a true symmetry of the theory, after a parity transformation $ψ^{'} (x^{'}) = Pψ (x) P^{†}$ must satisfy the parity-transformed Dirac equation:

(i γ^{μ} \partial_{μ}^{'} - m) ψ^{'} (x^{'}) = 0,

(B.4.46)

where $x^{μ} \to {x^{'}}^{μ} = (x^{0}, - x)$ and $\partial_{μ}^{'} \equiv \partial ∕ \partial {x^{'}}^{μ}$ under parity. One can see, by multiplying the original Dirac equation by $γ^{0}$ , that this is satisfied if $ψ^{'} (x^{'}) = \pm γ^{0} ψ (x)$ :

γ^{0} (i γ^{μ} \partial_{μ} - m) ψ (x) = (i γ^{μ} \partial_{μ}^{'} - m) γ^{0} ψ (x) = (i γ^{μ} \partial_{μ}^{'} - m) ψ^{'} (x^{'}) = 0 .

(B.4.47)

Again, the sign in the transformation indicates the intrinsic parity of the field.

Looking at the form of $γ^{0}$ and $ψ$ in the Weyl basis (Eqs. B.4.4 and B.4.11), we see that the parity transformation swaps around left- and right-handed spinors:

P ψ_{L} (x) P^{†} = \pm ψ_{R} (x^{'}), P ψ_{R} (x) P^{†} = \pm ψ_{L} (x^{'}) .

(B.4.48)

Chirality being inverted makes sense given its (loose) connection to helicity, which is flipped under parity. Similarly, remembering from Section B.4.4 that particle and anti-particle solutions to the Dirac equation have the form $u (p) \propto {(ξ, ξ)}^{T}$ and $v (p) \propto {(η, - η)}^{T}$ , respectively, we see that fermions and antifermions have even and odd parity, respectively. The weak interaction breaks parity symmetry by interacting only with left-chiral fermions and right-chiral antifermions.

We can also check that the Lorentz scalars and vectors we constructed, $\bar{ψ} ψ$ and $\bar{ψ} γ^{μ} ψ$ , are indeed invariant under parity, e.g.:

P : \bar{ψ} ψ \to {\bar{ψ}}^{'} ψ^{'} = ψ^{†} γ^{0} γ^{0} γ^{0} ψ = ψ^{†} γ^{0} ψ = \bar{ψ} ψ .

(B.4.49)

However, we can also construct pseudoscalars and pseudovectors by throwing in a $γ^{5}$ matrix: $\bar{ψ} γ^{5} ψ$ and $\bar{ψ} γ^{5} γ^{μ} ψ$ . One can confirm this by grinding it out as above, or by simply looking at their form in the Weyl basis, e.g.:

\bar{ψ} γ^{5} ψ = (\begin{matrix} ψ_{L}^{†} & ψ_{R}^{†} \end{matrix}) (\begin{matrix} 0 & 𝟙 \\ 𝟙 & 0 \end{matrix}) (\begin{matrix} - 𝟙 & 0 \\ 0 & 𝟙 \end{matrix}) (\begin{matrix} ψ_{L} \\ ψ_{R} \end{matrix}) = ψ_{L}^{†} ψ_{R} - ψ_{R}^{†} ψ_{L} .

(B.4.50)

We thus see that this will pick up an overall minus sign under $ψ_{L} \leftrightarrow ψ_{R}$ .

Spinors: charge conjugation and $CP$

Under charge conjugation, $ψ \to ψ_{c} = C ψ^{*}$ , where $C$ is a matrix that can mix up the spinor components. We can follow similar reasoning as for parity to show that $ψ_{c}$ satisfies the Dirac equation only if:

C^{- 1} γ^{μ} C = - {(γ^{μ})}^{*}

(B.4.51)

In the Weyl basis, this means $C = \pm i γ^{2}$ and thus

C : ψ \to ψ_{c} = \pm i γ^{2} ψ^{*},

(B.4.52)

where as always the sign in the transformation indicates the intrinsic $C$ -parity of the field. Looking at the individual components:

C : ψ_{L} \to \pm i σ^{2} ψ_{R}^{*}, C : ψ_{R} \to \mp i σ^{2} ψ_{L}^{*} .

(B.4.53)

$γ^{2}$ and complex conjugation both flip chirality, so combined we see that charge conjugation retains it, transforming left-(right-)chiral fermions into left-(right-)chiral antifermions. Thus, the weak interaction violates $C$ -symmetry as well by coupling only to opposite-chirality fermions and antifermions.

Combining parity and charge conjugation gives us, in the Weyl basis:

CP : ψ \to \pm i γ^{2} γ^{0} ψ^{*},

(B.4.54)

or, in terms of the Weyl spinors:

CP : ψ_{L} \to \pm i σ^{2} ψ_{L}^{*}, CP : ψ_{R} \to \mp i σ^{2} ψ_{R}^{*} .

(B.4.55)

The combination thus transforms fermions into their opposite-chirality antifermions, and vice versa. Often, this transformation is considered to define the relation between particles and antiparticles, and is a better symmetry of the weak interaction (and, hence, the sm) than $C$ or $P$ individually. However, as discussed above, it is violated as well, to a lesser extent, through the mixing of the three generations of fermions.

Spinors: time reversal and CPT

The time reversal operation is more subtle, as it is anti-unitary. We will forego a detailed discussion of these subtleties (see e.g. Schwartz [86] Chapter 11.6), and note that the time reversal operator $T$ is defined to transform a Dirac spinor in the Weyl basis as:

T : ψ (t, x) \to \pm i γ^{1} γ^{3} ψ (- t, x) .

(B.4.56)

It flips both the spin and momenta of the fermions, and is violated as well by the weak interaction (as it must be to ensure $CPT$ -invariance, given $CP$ -violation).

Finally, we can combine all these operations to obtain the $CPT$ -transformation of the Dirac spinor:

CPT : ψ (x) \to \pm - i γ^{2} γ^{0} γ^{1} γ^{3} ψ^{*} (- x) = - γ^{5} ψ^{*} (- x) .

(B.4.57)

This transforms a particle into an antiparticle reversed in space and time.

One interesting way of testing $CPT$ -invariance is to measure the rates of a process’ $CP$ - and $T$ -conjugates, and confirm that they are equal. All experimental tests to this date have confirmed $CPT$ -invariance [442].

⁶We now understand that the KG equation describes perfectly good scalar quantum fields, where the field-theoretic analog of the probability density is in fact the conserved charge of Eq. B.2.16, which is allowed to be negative.

⁷Or even two- or three-dimensional.

⁸These insights were so unexpected that Dirac thought “his equation was more intelligent than its author” [433].

⁹See e.g. Ref. [82] Lecture 14.

¹⁰ $x^{'} = Λ^{- 1} x$ as this is an active transformation, in which the field is shifted.

¹¹See e.g. Schwartz [86] Chapter 10.3

¹²For more detailed discussion, see e.g. Peskin and Schroeder [81] Chapter 3.5 and Schwartz [86] Chapter 12.

¹³The “ $\sim$ ” becomes an “ $=$ ” for a Wick contraction, $ϕ | p ⟩$ , which is what we deal with with time-ordered operator products.

¹⁴One way to convince yourself of this is to check that all possible Lorentz scalar terms in the Lagrangian are invariant under $CPT$ , as shown in Peskin and Shroeder [81] Chapter 3.6.

¹⁵And also somewhat by the requirement of anomaly cancellation; see e.g. Tong SM [76] Chapter 4.

¹⁶The difference is a consequence of an ABJ anomaly for the $SU (2)$ gauge group (see e.g. Tong SM [76] Chapter 5.1).