3.4 Spinor field theory
...anything that comes back to itself with a minus sign after a 2 rotation is always going to be a little strange. — David Tong [5]
So far, we have focused on scalar fields, which live in the trivial representation of the Lorentz group and correspond to spin- bosons. In this section, we discuss the field theory for spin- particles, or fermions, which constitute all matter in the universe.
3.4.1 The Dirac equation
Like the Klein-Gordon equation, the Dirac equation was also an attempt at a relativistic version of the Schrödinger equation. Before the development of QFT, the quantized KG equation was thought to produce negative probabilities due to its second derivative in time.12 Dirac thus sought a relativistic first-order differential equation in space and time.
Legend has it he was staring into a fire in Cambridge when he came up with an equation of the form
| (3.101) |
where are constants that will be defined in a moment, and is a complex field. It is difficult to make this equation Lorentz covariant; indeed, it is impossible if is a scalar and each is simply a number.13 Dirac’s brilliant insight, however, was that it can be covariant if are complex matrices and a four component field.
The key is that is essentially the “square-root” of the d’Alembertian from the KG-equation:
| (3.102) |
if (and only if) and satisfy the Clifford algebra:
| (3.103) |
where is the anticommutator. Dirac found this is possible with matrices such as
| (3.104) |
where are the Pauli matrices (Chapter 2.2). These are called the gamma, or Dirac, matrices, and plugging them into Eq. 3.101 yields the Dirac equation, which can be written even more compactly by defining :
| (3.105) |
This equation is considered one of the most significant breakthroughs in theoretical physics, “on par with the works of Newton, Maxwell, and Einstein before him” [36]. The insights that followed, as we will outline in this section, provided a theoretical basis for fermion spin, implied the existence of antiparticles, and overall were foundational to the development of the SM.14
3.4.2 Spinors
Before discussing solutions and quantization of the Dirac equation, let us examine what kind of object is. A related property of the Clifford algebra is that
| (3.106) |
satisfies the Lorentz algebra (Eq. 2.32). This means are generators of Lorentz transformations
| (3.107) |
where is a Lorentz transformation with parameters , and is a particular 4D representation.
It can be shown15 that the Dirac equation is only Lorentz covariant if the components of , , transform under this exact representation:
| (3.108) |
It is important to note here that is acting on the components — also called the spinor indices – and not on the spacetime coordinates , which transform under the vector representation (Eq. 2.22). Explicitly, including the spacetime coordinates, transforms as:
| (3.109) |
where both and share the same transformation parameters and thus correspond to the same Lorentz transformation.16
Dirac and Weyl spinors
What is this representation? Let’s look at the rotation and boost generators individually:
| (3.110) |
Comparing this with Eqs. 2.28 and 2.29, we see that the top left and bottom right blocks are exactly the left- and right-handed Weyl spinor irreps of the generators. The handedness of a spinor is called its chirality, and its physical significance will be discussed in a moment. Thus, we identify with the , or Dirac spinor, representation.
This also means that, in this basis of the gamma matrices (called the Weyl, or chiral, basis), the Dirac spinor can be decomposed into two Weyl spinors:
| (3.111) |
which transform under their respective representations. The two components can be isolated if we consider a fifth gamma matrix:
| (3.112) |
is similar to our main four matrices in that and . Importantly, we see from its form in the Chiral basis that projection operators and can be defined as:
| (3.113) |
which satisfy the projection property and project out the left- and right-handed components of a Dirac spinor:
| (3.114) |
Note that while the specific form depends on the basis, the definitions in Eq. 3.113 are basis-independent and can be considered to define chirality.
Chirality
The two Weyl spinor representations are related by a complex conjugation, meaning is a right-handed Weyl spinor, and vice versa. For a complex scalar field, we interpreted the conjugate as the antiparticle. The same interpretation applies here; hence, if a left-handed spinor describes a particle, its antiparticle is described by its conjugate, right-handed spinor.
The Dirac equation can be rewritten in the Weyl basis as two coupled equations of the Weyl spinors. Let us define and , so that
| (3.115) |
Hence, we see the mass term couples the left- and right-handed components. This is why all massive fermions must exist in pairs of particles and antiparticles. An important special case, however, is for a neutral Majorana fermion, where equals its charge conjugate (to be defined below). Such a particle is its own antiparticle and can have a left-handed- or right-handed-only mass term. As discussed in Chapter 2.3, the only Majorana candidate in the SM is the right-handed neutrino.
For , the Dirac equation decouples and leaves us with the Weyl equations describing massless fermions:
| (3.116) |
In Fourier space, these are:
| (3.117) |
where we used for massless particles. You may recall is the helicity operator, projecting the particle spin along its momentum. Thus, in the massless limit, we see that the left- and right-handed Weyl spinors are the and helicity eigenstates, respectively. This is not the case for massive particles, as helicity is no longer Lorentz invariant: one can always boost into a frame where the momentum is inverted while the spin remains the same, changing the sign of the helicity. Chirality is thus a more abstract concept for massive particles, related only to how they transform under Lorentz transformations.
Theories not symmetric under exchange of left- and right-handed components are called chiral, and symmetric theories vector. QED and QCD are both vector theories, but weak interactions are, surprisingly, chiral. This necessarily means it violates parity and charge conjugation symmetries ( and ), which we will discuss soon in Section 3.4.6.
3.4.3 The Dirac Lagrangian
Recall that to quantize the scalar theory, we first needed the Lagrangian and the classical solutions of the K-G equation, to then obtain Hamiltonian and canonical fields and Poisson brackets before finally promoting them to quantum commutatation relations. We will proceed in similar (though condensed) fashion for the spinor theory, and first derive the Lagrangian corresponding to the Dirac equation.
Since we are no longer dealing with trivial representation of the Lorentz group, we have to be more careful with the types of terms we put into the Lagrangian; it must be composed of good Lorentz-invariant objects. A first guess at a Lorentz scalar formed of spinors may be . This is indeed a scalar, but it is not Lorentz invariant: and transform as , and, hence
| (3.118) |
However, recall from Chapter 2.3 that (finite-dimensional) representations of Lorentz transformations are not unitary. (We can see this as well from the fact that the generators of in Eq. 3.110 are not anti-Hermitian.) Thus, in general and is not a Lorentz scalar.
Instead, with a bit of matrix algebra17, one can show that
| (3.119) |
and hence
| (3.120) |
is a Lorentz scalar. Thus, we define as the “natural” conjugate to , and end up with a nice Lorentz scalar for our Lagrangian.
Similarly, one can show that transforms as a Lorentz -vector and, hence, contracting it with as yields another scalar. These two terms, which are analogous to the mass and derivative terms a free complex scalar field (Eq. 3.26), are enough to build the Dirac lagrangian:
| (3.121) |
One can check that the EL equations reproduce the Dirac equation for and .
The U(1) conserved current
As with the complex scalar field, observe that the Dirac Lagrangian is invariant under global symmetry . Using Noether’s theorem, we can derive the conserved current and charge associated with this symmetry:
| (3.122) |
As for the complex scalar field, these represent the electromagnetic -current and charge, respectively — a connection we will explore further in Section 3.5.2.
3.4.4 Quantizing the Dirac field
Solutions to the Dirac equation
Before quantizing, we first need the classical solutions to the Dirac equation. Multiplying both sides of it by gives us:
| (3.123) |
which means each component of individually satisfies the KG-equation. Thus, we can assume similar plane wave solutions:
| (3.124) |
where and are now spinors, and again we have positive and negative frequency solutions that correspond to particles and antiparticles, respectively, after quantization.
One can check using Fourier space, as we did for the Weyl equations, that
| (3.125) |
are general solutions to the Dirac equation, where and are the familiar two-component spinors from QM for spin- particles (although technically they do not have this interpretation before quantization). As is conventional, we will use a basis of eigenstates and , corresponding to spin-up and spin-down, respectively.
For example, in the rest frame , we have:
| (3.126) |
More generally, we can always orient a particle’s 3-momentum along the -axis, in which case:
| (3.127) |
Quantization
Now that we have a sensible Lagrangian and the classical solutions to the Dirac equation, the remaining steps to quantization follow closely that for our complex scalar field in Section 3.2.4, but with two notable differences. The first is that we now must sum over the two spin components of and , in addition to integrating over the momentum:
| (3.128) |
As before, we have positive and negative frequency solutions, with the and operators associated with particles of the same mass and opposite charge. For spinors, we find that the and also have opposite spins, i.e. for the -axis angular momentum operator (which can be derived through Noether’s theorem as we did for the momentum operator in Section 3.1.3):
| (3.129) |
By convention, we take and to be the creation and annihilation operators for the electron, and and for its antiparticle, the positron. Thus, corresponds to an electron at with spin state , and to a positron at with the opposite spin state to .
Through his equation, Dirac was the first to predict the existence of antimatter in 1930 [38] (although he initially thought the electron’s antiparticle was the proton). This prediction was soon confirmed by the discovery of a particle with the same mass as the electron but opposite charge by Carl Anderson in a bubble chamber in 1932 [39]. Both were awarded the Nobel prize.
The spin-statistics connection
The second, extremely important difference from scalar quantization is that, because spinors are spin- particles, they must obey anticommutation relations:
| (3.130) |
which also means the creation and annihilation operators satisfy:
(3.131) |
Thus, unlike bosons, exchanging two particles yields a minus sign: , confirming that spinors obey Fermi-Dirac statistics and obey the Paul-Exclusion principle. Were we to try and impose our earlier commutation relations for spinors (or indeed, any half-integer-spin field), we would run into several issues. These include the time-ordered product in the -matrix not being Lorentz invariant, and antiparticles contributing arbitrarily negative energies, making the theory unstable. They are all related to the deep connection between spin and statistics: the requirement of Lorentz invariance, stability, and causality in a QFT necessitates that half-integer-spin particles obey Fermi-Dirac, and integer-spin particles Bose-Einstein statistics.18
3.4.5 Interactions and Feynman rules
Having quantized the free Dirac field, we now discuss interactions, again focusing on small (and renormalizable) perturbations to the free theory. We start by presenting the propagators for the Dirac field and then extending our scalar Yukawa theory from Section 3.3 to spinor “nucleons”.
Propagators
We define the propagator for the Dirac field the same as for scalar fields in Section 3.2.5:
| (3.132) |
where and index the spinor components. Again, we have an extra sum over the spin states. With some more matrix algebra one can show that these kinds of sums simplify nicely to
| (3.133) |
so that we end up with, in momentum space, the Feynman propagator:
| (3.134) |
Note that we have now suppressed the spinor indices; is still a matrix in spinor space. Note as well the relative minus sign in the time-ordering operator for fermions, due to exchanging the fields:
| (3.135) |
External lines
For scalars, external line terms such as simply contributed a factor of to the matrix element, where is again a one-particle meson state with momentum :
| (3.136) |
(The factor contributes only to the momentum conservation delta function in the -matrix element.) For spinors, we instead end up with a spinor factor. For example, for an incoming fermion with momentum and spin :
| (3.137) |
We can see looking at the form of the quantized fields (Eq. 3.128), and which terms will contribute something non-zero, that incoming (outgoing) external fermions will be associated with a ) and antifermions with a ) factor.19
Yukawa theory reloaded
We now revisit Yukawa theory, the simplest possible theory of interactions for spinors. The Lagrangian is the same as in Eq. 3.69, but now with a spinor:
| (3.138) |
Note that through dimensional analysis, since we can deduce that . This means that (1) the Yukawa interaction is marginal, with and , and (2) importantly, there are no other renormalizable, Lorentz-invariant interactions we can write down for spinors with the fields at our disposal (modulo some ’s thrown in, as we’ll discuss in Section 3.4.6). Terms like , , or are all either not Lorentz-scalars or of dimension . In this sense, because their possible interactions are so heavily constrained by their -dimensionality, spinors in QFT are quite simple! There is only one other spinor interaction in the SM, which we will see in Section 3.5, with gauge bosons.
We again refer to and as the “meson” and “nucleon” fields, which is slightly more accurate now since nucleons are in reality fermions. The two main features missing from this theory are that the relevant mesons, the pions, are pseudoscalars (to be discussed in the next section) and are a strong isospin triplet (to be described briefly in Chapter. 4.1).
Definition 3.3. The Feynman rules in momentum space for spinor Yukawa theory are:
- 1.
- Vertices:
- 2.
- Internal lines (propagators)
Mesons: Nucleons:
- 3.
- External lines (on-shell particles)
Incoming mesons: Outgoing mesons: Incoming nucleons: Outgoing nucleons: Incoming antinucleons: Outgoing antinucleons: - 4.
- Impose momentum conservation at each vertex.
- 5.
- Integrate over the momentum flowing through each loop.
- 6.
- Figure out the sign based on statistics.
Meson decay and the Higgs decay width
The matrix element for meson decay into a fermion-antifermion pair with spin and momentum and , respectively, to first-order can be read off from the Feynman diagram in Figure 3.7:
| (3.139) |
We can calculate the decay rate as in Section 3.3.4, except now we have to sum over the spins of the fermions:
| (3.140) |
In the COM frame, we can choose and , with by energy conservation. Using the forms of and we found in Eq. 3.127, we see that the sum over spin states simplifies nicely:
| (3.141) |
Since this is independent of the final state kinematics, the integral of is the same as for the scalar meson decay, and we obtain an overall decay rate of:
| (3.142) |
As we hinted at in Section 3.3.4, this is in fact the decay width of the Higgs boson to fermions at tree level, if we plug in the Higgs Yukawa coupling constant . Here is the fermion mass and is the Higgs vacuum expectation value, . For example, for the decay, with and , we get , exactly in line with the predicted value [40]!
One can similarly update our nucleon scattering amplitudes from Section 3.3.3, which simply gain some inner products between the incoming and outgoing spin states (see e.g. Tong QFT [3] Chapter 5.7). Notably, however, the -channel and -channel diagrams (Figure 3.3) now have a relative minus sign, in accordance with Fermi-Dirac statistics.
3.4.6 CPT Symmetries
In this section, we discuss three important discrete symmetries in QFT. As discussed in Chapter 2.3, the full Lorentz group includes the parity and time reversal operators. In the -vector representation, they have the simple forms and , meaning
| (3.143) |
However, their forms in other representations, such as spinors, are not as straightforward.
Observe also that all our complex Lagrangians so far have been invariant under some form of complex conjugation . This represents another discrete symmetry, and since we know from Eq. 3.28 that complex conjugation inverts “charge”, we call this charge conjugation, or , symmetry.
All local, relativistic QFTs are necessarily invariant under the combined symmetry; this is known as the CPT theorem [41, 42].20 Whether a theory is individually , , or invariant, however, must be determined by experiment,21 as we give examples of below. If it is, we must impose the symmetries in our mathematical formulation by carefully defining the actions of the relevant operators; i.e., we have to consider how must transform under to maintain -invariance of the Lagrangian, etc.
Such symmetries are crucial handles for understanding QFTs, particularly in the case of the weak and strong interactions for which we have otherwise little classical intuition. By studying them, we often glean important insights into the theory, such as why certain processes are forbidden: for example, we now understand that the pion cannot decay into three photons because this would violate the -invariance of QED.
- and -violation
Historically, it was thought that parity individually is a universal symmetry of nature. Indeed, this was verified experimentally for electromagnetism and the strong interaction, but, surprisingly, in 1956 an experiment measuring the isotropy of the beta decay of cobalt-60 to nickel-60 by Chien-Shiung Wu showed that the weak interaction in fact violates parity- (and -) invariance [43]. The two theorists, Yang Chen-Ning and Lee Tsung-Dao, who proposed this experiment won the Nobel prize the year after but, controversially, Wu did not.
It was then proposed by Lev Landau [44] and others that perhaps the combined -symmetry is the true symmetry of nature. As we define below, the operation transforms a particle into its antiparticle, hence, -invariance can be thought of as saying the laws of physics are the same for particles and antiparticles. This indeed appeared to be the case until 1964, when the Fitch-Cronin experiment discovered small, indirect -violation by the weak interaction by measuring decays of neutral kaons [45], for which another Nobel prize was awarded to James Cronin and Val Fitch. Since then, several experiments have observed both direct and indirect -violation, and quantifying the magnitude of -violation in different sectors of the SM remains an active area of research in HEP (see Ref. [46] Chapters 13-14 for a nice comprehensive review).
Interestingly, -violation is only possible through the weak interaction if there exist generations of fermions, whereas it is expected for the strong interaction but not observed (the so-called “strong problem” [47, 48].22 Furthermore, the experimentally determined magnitude of -violation in the weak interaction is about smaller than what is allowed [46, 48]. These mysterious “coincidences” — Why did nature “choose” exactly the minimum number of generations needed for -violation? Why is there no strong -violation? etc. — suggest deeper underlying physics, such as “axions” [49].
Scalar fields
We see from our complex scalar Lagrangian in Eq. 3.26 that it can only be invariant under , , or if they transform the field by at most a complex phase: . A further physical requirement, however, is that applying any of the operators twice should return the original field, which thus constrains the possible transformations to:
| (3.144) |
The time-reversal operation is a bit subtle, as it must be anti-unitary. We will not discuss it much further, although its implications can be fun to think about.
Nomenclature Whether a field transforms with a or sign under is called its intrinsic parity, and similarly under its intrinsic -parity. We also refer to them as “even” or “odd” under the transformation, respectively. In particular, an odd-parity scalar, i.e. one which transforms with a minus sign under parity, is called a pseudoscalar. The Higgs field, for example, is a scalar, while the pion is a pseudoscalar (as was determined based on nuclear interactions).
Vector fields
Though we introduce vector fields in detail in the next section, their transformation properties are analogous to scalars and simple enough to describe here:
| (3.145) |
where is the Minkowski metric (i.e. and flip the sign of the first and the last three components of , respectively). We use similar “odd” and “even” nomenclature for vectors, with an odd-parity vector called a pseudovector. Recall for example that the electric and magnetic -vector fields are vectors and pseudovectors, respectively. Notably, the photon is odd under while the neutral pion; this explains why the pion can decay into two photons (since the two photons have a combined parity of ), but not to three, even though either would be allowed kinematically.
Spinors: parity
Spinors live in a more complicated representation of the Lorentz group, so it takes more work to derive their transformations. On the other hand, this also means their properties and the physical consequences are more interesting.
If is a true symmetry of the theory, after a parity transformation must satisfy the parity-transformed Dirac equation:
| (3.146) |
where and under parity. One can see, by multiplying the original Dirac equation by , that this is satisfied if :
| (3.147) |
Again, the sign in the transformation indicates the intrinsic parity of the field.
Looking at the form of and in the Weyl basis (Eqs. 3.104 and 3.111), we see that the parity transformation swaps around left- and right-handed spinors:
| (3.148) |
Chirality being inverted makes sense given its (loose) connection to helicity, which is flipped under parity. Similarly, remembering from Section 3.4.4 that particle and anti-particle solutions to the Dirac equation have the form and , respectively, we see that fermions and antifermions have even and odd parity, respectively. The weak interaction breaks parity symmetry by interacting only with left-chiral fermions and right-chiral antifermions.
We can also check that the Lorentz scalars and vectors we constructed, and , are indeed invariant under parity, e.g.:
| (3.149) |
However, we can also construct pseudoscalars and pseudovectors by throwing in a matrix: and . One can confirm this by grinding it out as above, or by simply looking at their form in the Weyl basis, e.g.:
| (3.150) |
We thus see that this will pick up an overall minus sign under .
Spinors: charge conjugation and
Under charge conjugation, , where is a matrix that can mix up the spinor components. We can follow similar reasoning as for parity to show that satisfies the Dirac equation only if:
| (3.151) |
In the Weyl basis, this means and thus
| (3.152) |
where as always the sign in the transformation indicates the intrinsic -parity of the field. Looking at the individual components:
| (3.153) |
and complex conjugation both flip chirality, so combined we see that charge conjugation retains it, transforming left-(right-)chiral fermions into left-(right-)chiral antifermions. Thus, the weak interaction violates -symmetry as well by coupling only to opposite-chirality fermions and antifermions.
Combining parity and charge conjugation gives us, in the Weyl basis:
| (3.154) |
or, in terms of the Weyl spinors:
| (3.155) |
The combination thus transforms fermions into their opposite-chirality antifermions, and vice versa. Often, this transformation is considered to define the relation between particles and antiparticles, and is a better symmetry of the weak interaction (and, hence, the SM) than or individually. However, as discussed above, it is violated as well, to a lesser extent, through the mixing of the three generations of fermions.
Spinors: time reversal and CPT
The time reversal operation is more subtle, as it is anti-unitary. We will forego a detailed discussion of these subtleties (see e.g. Schwartz [16] Chapter 11.6), and note that the time reversal operator is defined to transform a Dirac spinor in the Weyl basis as:
| (3.156) |
It flips both the spin and momenta of the fermions, and is violated as well by the weak interaction (as it must be to ensure -invariance, given -violation).
Finally, we can combine all these operations to obtain the -transformation of the Dirac spinor:
| (3.157) |
This transforms a particle into an antiparticle reversed in space and time.
One interesting way of testing -invariance is to measure the rates of a process’ - and -conjugates, and confirm that they are equal. All experimental tests to this date have confirmed -invariance [46].
12We now understand that the KG equation describes perfectly good scalar quantum fields, where the field-theoretic analog of the probability density is in fact the conserved charge of Eq. 3.57, which is allowed to be negative.
13Or even two- or three-dimensional.
14These insights were so unexpected that Dirac thought “his equation was more intelligent than its author” [37].
15See e.g. Ref. [12] Lecture 14.
16 as this is an active transformation, in which the field is shifted.
17See e.g. Schwartz [16] Chapter 10.3
18For more detailed discussion, see e.g. Peskin and Schroeder [10] Chapter 3.5 and Schwartz [16] Chapter 12.
19The “” becomes an “” for a Wick contraction, , which is what we deal with with time-ordered operator products.
20One way to convince yourself of this is to check that all possible Lorentz scalar terms in the Lagrangian are invariant under , as shown in Peskin and Shroeder [10] Chapter 3.6.
21And also somewhat by the requirement of anomaly cancellation; see e.g. Tong SM [5] Chapter 4.
22The difference is a consequence of an ABJ anomaly for the gauge group (see e.g. Tong SM [5] Chapter 5.1).