fbpx
Wikipedia

Dirac equation

In particle physics, the Dirac equation is a relativistic wave equation derived by British physicist Paul Dirac in 1928. In its free form, or including electromagnetic interactions, it describes all spin-12 massive particles, called "Dirac particles", such as electrons and quarks for which parity is a symmetry. It is consistent with both the principles of quantum mechanics and the theory of special relativity,[1] and was the first theory to account fully for special relativity in the context of quantum mechanics. It was validated by accounting for the fine structure of the hydrogen spectrum in a completely rigorous way.

The equation also implied the existence of a new form of matter, antimatter, previously unsuspected and unobserved and which was experimentally confirmed several years later. It also provided a theoretical justification for the introduction of several component wave functions in Pauli's phenomenological theory of spin. The wave functions in the Dirac theory are vectors of four complex numbers (known as bispinors), two of which resemble the Pauli wavefunction in the non-relativistic limit, in contrast to the Schrödinger equation which described wave functions of only one complex value. Moreover, in the limit of zero mass, the Dirac equation reduces to the Weyl equation.

Although Dirac did not at first fully appreciate the importance of his results, the entailed explanation of spin as a consequence of the union of quantum mechanics and relativity—and the eventual discovery of the positron—represents one of the great triumphs of theoretical physics. This accomplishment has been described as fully on a par with the works of Newton, Maxwell, and Einstein before him.[2] It has been deemed by some physicists to be the "real seed of modern physics".[3] In the context of quantum field theory, the Dirac equation is reinterpreted to describe quantum fields corresponding to spin-12 particles.

The Dirac equation is inscribed upon a plaque on the floor of Westminster Abbey. Unveiled on 13 November 1995, the plaque commemorates Paul Dirac's life.[4]

Mathematical formulation edit

In its modern formulation for field theory, the Dirac equation is written in terms of a Dirac spinor field   taking values in a complex vector space described concretely as  , defined on flat spacetime (Minkowski space)  . Its expression also contains gamma matrices and a parameter   interpreted as the mass, as well as other physical constants. Dirac first obtained his equation through a factorization of Einstein's energy-momentum-mass equivalence relation assuming a scalar product of momentum vectors determined by the metric tensor and quantized the resulting relation by associating momenta to their respective operators.

In terms of a field  , the Dirac equation is then

Dirac equation

 

and in natural units, with Feynman slash notation,

Dirac equation (natural units)

 

The gamma matrices are a set of four   complex matrices (elements of  ) which satisfy the defining anti-commutation relations:

 
where   is the Minkowski metric element, and the indices   run over 0,1,2 and 3. These matrices can be realized explicitly under a choice of representation. Two common choices are the Dirac representation
 
where   are the Pauli matrices, and the chiral representation: the   are the same, but  

The slash notation is a compact notation for

 
where   is a four-vector (often it is the four-vector differential operator  ). The summation over the index   is implied.

Dirac adjoint and the adjoint equation edit

The Dirac adjoint of the spinor field   is defined as

 
Using the property of gamma matrices (which follows straightforwardly from Hermicity properties of the  ) that
 
one can derive the adjoint Dirac equation by taking the Hermitian conjugate of the Dirac equation and multiplying on the right by  :
 
where the partial derivative   acts from the right on  : written in the usual way in terms of a left action of the derivative, we have
 

Klein–Gordon equation edit

Applying   to the Dirac equation gives

 
That is, each component of the Dirac spinor field satisfies the Klein–Gordon equation.

Conserved current edit

A conserved current of the theory is

 
Proof of conservation from Dirac equation

Adding the Dirac and adjoint Dirac equations gives

 
so by Leibniz rule,
 

Another approach to derive this expression is by variational methods, applying Noether's theorem for the global   symmetry to derive the conserved current  

Proof of conservation from Noether's theorem

Recall the Lagrangian is

 
Under a   symmetry which sends
 
we find the Lagrangian is invariant.

Now considering the variation parameter   to be infinitesimal, we work at first order in   and ignore   terms. From the previous discussion we immediately see the explicit variation in the Lagrangian due to   is vanishing, that is under the variation,

 
where  .

As part of Noether's theorem, we find the implicit variation in the Lagrangian due to variation of fields. If the equation of motion for   are satisfied, then

 

 

 

 

 

(*)

This immediately simplifies as there are no partial derivatives of   in the Lagrangian.   is the infinitesimal variation

 
We evaluate
 
The equation (*) becomes
 
and we're done.

Solutions edit

Since the Dirac operator acts on 4-tuples of square-integrable functions, its solutions should be members of the same Hilbert space. The fact that the energies of the solutions do not have a lower bound is unexpected.

Plane-wave solutions edit

Plane-wave solutions are those arising from an ansatz

 
which models a particle with definite 4-momentum   where  

For this ansatz, the Dirac equation becomes an equation for  :

 
After picking a representation for the gamma matrices  , solving this is a matter of solving a system of linear equations. It is a representation-free property of gamma matrices that the solution space is two-dimensional (see here).

For example, in the chiral representation for  , the solution space is parametrised by a   vector  , with

 
where   and   is the Hermitian matrix square-root.

These plane-wave solutions provide a starting point for canonical quantization.

Lagrangian formulation edit

Both the Dirac equation and the Adjoint Dirac equation can be obtained from (varying) the action with a specific Lagrangian density that is given by:

 

If one varies this with respect to   one gets the adjoint Dirac equation. Meanwhile, if one varies this with respect to   one gets the Dirac equation.

In natural units and with the slash notation, the action is then

Dirac Action

 

For this action, the conserved current   above arises as the conserved current corresponding to the global   symmetry through Noether's theorem for field theory. Gauging this field theory by changing the symmetry to a local, spacetime point dependent one gives gauge symmetry (really, gauge redundancy). The resultant theory is quantum electrodynamics or QED. See below for a more detailed discussion.

Lorentz invariance edit

The Dirac equation is invariant under Lorentz transformations, that is, under the action of the Lorentz group   or strictly  , the component connected to the identity.

For a Dirac spinor viewed concretely as taking values in  , the transformation under a Lorentz transformation   is given by a   complex matrix  . There are some subtleties in defining the corresponding  , as well as a standard abuse of notation.

Most treatments occur at the Lie algebra level. For a more detailed treatment see here. The Lorentz group of   real matrices acting on   is generated by a set of six matrices   with components

 
When both the   indices are raised or lowered, these are simply the 'standard basis' of antisymmetric matrices.

These satisfy the Lorentz algebra commutation relations

 
In the article on the Dirac algebra, it is also found that the spin generators
 
satisfy the Lorentz algebra commutation relations.

A Lorentz transformation   can be written as

 
where the components   are antisymmetric in  .

The corresponding transformation on spin space is

 
This is an abuse of notation, but a standard one. The reason is   is not a well-defined function of  , since there are two different sets of components   (up to equivalence) which give the same   but different  . In practice we implicitly pick one of these   and then   is well defined in terms of  

Under a Lorentz transformation, the Dirac equation

 
becomes
 
Remainder of proof of Lorentz invariance

Multiplying both sides from the left by   and returning the dummy variable to   gives

 
We'll have shown invariance if
 
or equivalently
 
This is most easily shown at the algebra level. Supposing the transformations are parametrised by infinitesimal components  , then at first order in  , on the left-hand side we get
 
while on the right-hand side we get
 
It's a standard exercise to evaluate the commutator on the left-hand side. Writing   in terms of components completes the proof.

Associated to Lorentz invariance is a conserved Noether current, or rather a tensor of conserved Noether currents  . Similarly, since the equation is invariant under translations, there is a tensor of conserved Noether currents  , which can be identified as the stress-energy tensor of the theory. The Lorentz current   can be written in terms of the stress-energy tensor in addition to a tensor representing internal angular momentum.

Historical developments and further mathematical details edit

The Dirac equation was also used (historically) to define a quantum-mechanical theory where   is instead interpreted as a wave-function.

The Dirac equation in the form originally proposed by Dirac is:[5]

 
where ψ(x, t) is the wave function for the electron of rest mass m with spacetime coordinates x, t. The p1, p2, p3 are the components of the momentum, understood to be the momentum operator in the Schrödinger equation. Also, c is the speed of light, and ħ is the reduced Planck constant. These fundamental physical constants reflect special relativity and quantum mechanics, respectively.

Dirac's purpose in casting this equation was to explain the behavior of the relativistically moving electron, and so to allow the atom to be treated in a manner consistent with relativity. His rather modest hope was that the corrections introduced this way might have a bearing on the problem of atomic spectra.

Up until that time, attempts to make the old quantum theory of the atom compatible with the theory of relativity, which were based on discretizing the angular momentum stored in the electron's possibly non-circular orbit of the atomic nucleus, had failed – and the new quantum mechanics of Heisenberg, Pauli, Jordan, Schrödinger, and Dirac himself had not developed sufficiently to treat this problem. Although Dirac's original intentions were satisfied, his equation had far deeper implications for the structure of matter and introduced new mathematical classes of objects that are now essential elements of fundamental physics.

The new elements in this equation are the four 4 × 4 matrices α1, α2, α3 and β, and the four-component wave function ψ. There are four components in ψ because the evaluation of it at any given point in configuration space is a bispinor. It is interpreted as a superposition of a spin-up electron, a spin-down electron, a spin-up positron, and a spin-down positron.

The 4 × 4 matrices αk and β are all Hermitian and are involutory:

 
and they all mutually anticommute:
 

These matrices and the form of the wave function have a deep mathematical significance. The algebraic structure represented by the gamma matrices had been created some 50 years earlier by the English mathematician W. K. Clifford. In turn, Clifford's ideas had emerged from the mid-19th-century work of the German mathematician Hermann Grassmann in his Lineare Ausdehnungslehre (Theory of Linear Extensions). The latter had been regarded as almost incomprehensible by most of his contemporaries. The appearance of something so seemingly abstract, at such a late date, and in such a direct physical manner, is one of the most remarkable chapters in the history of physics.[citation needed] (Even more so, a validation of the exquisite insight displayed by the mathematicians Grassmann and Clifford.)

The single symbolic equation thus unravels into four coupled linear first-order partial differential equations for the four quantities that make up the wave function. The equation can be written more explicitly in Planck units as:[6]

 
which makes it clearer that it is a set of four partial differential equations with four unknown functions.

Making the Schrödinger equation relativistic edit

The Dirac equation is superficially similar to the Schrödinger equation for a massive free particle:

 

The left side represents the square of the momentum operator divided by twice the mass, which is the non-relativistic kinetic energy. Because relativity treats space and time as a whole, a relativistic generalization of this equation requires that space and time derivatives must enter symmetrically as they do in the Maxwell equations that govern the behavior of light — the equations must be differentially of the same order in space and time. In relativity, the momentum and the energies are the space and time parts of a spacetime vector, the four-momentum, and they are related by the relativistically invariant relation

 

which says that the length of this four-vector is proportional to the rest mass m. Substituting the operator equivalents of the energy and momentum from the Schrödinger theory produces the Klein–Gordon equation describing the propagation of waves, constructed from relativistically invariant objects,

 
with the wave function ϕ being a relativistic scalar: a complex number which has the same numerical value in all frames of reference. Space and time derivatives both enter to second order. This has a telling consequence for the interpretation of the equation. Because the equation is second order in the time derivative, one must specify initial values both of the wave function itself and of its first time-derivative in order to solve definite problems. Since both may be specified more or less arbitrarily, the wave function cannot maintain its former role of determining the probability density of finding the electron in a given state of motion. In the Schrödinger theory, the probability density is given by the positive definite expression
 
and this density is convected according to the probability current vector
 
with the conservation of probability current and density following from the continuity equation:
 

The fact that the density is positive definite and convected according to this continuity equation implies that one may integrate the density over a certain domain and set the total to 1, and this condition will be maintained by the conservation law. A proper relativistic theory with a probability density current must also share this feature. To maintain the notion of a convected density, one must generalize the Schrödinger expression of the density and current so that space and time derivatives again enter symmetrically in relation to the scalar wave function. The Schrödinger expression can be kept for the current, but the probability density must be replaced by the symmetrically formed expression[further explanation needed]

 
which now becomes the 4th component of a spacetime vector, and the entire probability 4-current density has the relativistically covariant expression
 

The continuity equation is as before. Everything is compatible with relativity now, but the expression for the density is no longer positive definite; the initial values of both ψ and tψ may be freely chosen, and the density may thus become negative, something that is impossible for a legitimate probability density. Thus, one cannot get a simple generalization of the Schrödinger equation under the naive assumption that the wave function is a relativistic scalar, and the equation it satisfies, second order in time.

Although it is not a successful relativistic generalization of the Schrödinger equation, this equation is resurrected in the context of quantum field theory, where it is known as the Klein–Gordon equation, and describes a spinless particle field (e.g. pi meson or Higgs boson). Historically, Schrödinger himself arrived at this equation before the one that bears his name but soon discarded it. In the context of quantum field theory, the indefinite density is understood to correspond to the charge density, which can be positive or negative, and not the probability density.

Dirac's coup edit

Dirac thus thought to try an equation that was first order in both space and time. One could, for example, formally (i.e. by abuse of notation) take the relativistic expression for the energy

 
replace p by its operator equivalent, expand the square root in an infinite series of derivative operators, set up an eigenvalue problem, then solve the equation formally by iterations. Most physicists had little faith in such a process, even if it were technically possible.

As the story goes, Dirac was staring into the fireplace at Cambridge, pondering this problem, when he hit upon the idea of taking the square root of the wave operator thus:

 

On multiplying out the right side it is apparent that, in order to get all the cross-terms such as xy to vanish, one must assume

 
with
 

Dirac, who had just then been intensely involved with working out the foundations of Heisenberg's matrix mechanics, immediately understood that these conditions could be met if A, B, C and D are matrices, with the implication that the wave function has multiple components. This immediately explained the appearance of two-component wave functions in Pauli's phenomenological theory of spin, something that up until then had been regarded as mysterious, even to Pauli himself. However, one needs at least 4 × 4 matrices to set up a system with the properties required — so the wave function had four components, not two, as in the Pauli theory, or one, as in the bare Schrödinger theory. The four-component wave function represents a new class of mathematical object in physical theories that makes its first appearance here.

Given the factorization in terms of these matrices, one can now write down immediately an equation

 
with   to be determined. Applying again the matrix operator on both sides yields
 

Taking   shows that all the components of the wave function individually satisfy the relativistic energy–momentum relation. Thus the sought-for equation that is first-order in both space and time is

 

Setting

 
and because  , the Dirac equation is produced as written above.

Covariant form and relativistic invariance edit

To demonstrate the relativistic invariance of the equation, it is advantageous to cast it into a form in which the space and time derivatives appear on an equal footing. New matrices are introduced as follows:

 
and the equation takes the form (remembering the definition of the covariant components of the 4-gradient and especially that 0 = 1/ct)
Dirac equation

 

where there is an implied summation over the values of the twice-repeated index μ = 0, 1, 2, 3, and μ is the 4-gradient. In practice one often writes the gamma matrices in terms of 2 × 2 sub-matrices taken from the Pauli matrices and the 2 × 2 identity matrix. Explicitly the standard representation is

 

The complete system is summarized using the Minkowski metric on spacetime in the form

 
where the bracket expression
 
denotes the anticommutator. These are the defining relations of a Clifford algebra over a pseudo-orthogonal 4-dimensional space with metric signature (+ − − −). The specific Clifford algebra employed in the Dirac equation is known today as the Dirac algebra. Although not recognized as such by Dirac at the time the equation was formulated, in hindsight the introduction of this geometric algebra represents an enormous stride forward in the development of quantum theory.

The Dirac equation may now be interpreted as an eigenvalue equation, where the rest mass is proportional to an eigenvalue of the 4-momentum operator, the proportionality constant being the speed of light:

 

Using   (  is pronounced "d-slash"),[7] according to Feynman slash notation, the Dirac equation becomes:

 

In practice, physicists often use units of measure such that ħ = c = 1, known as natural units. The equation then takes the simple form

Dirac equation (natural units)

 

A fundamental theorem states that if two distinct sets of matrices are given that both satisfy the Clifford relations, then they are connected to each other by a similarity transformation:

 

If in addition the matrices are all unitary, as are the Dirac set, then S itself is unitary;

 

The transformation U is unique up to a multiplicative factor of absolute value 1. Let us now imagine a Lorentz transformation to have been performed on the space and time coordinates, and on the derivative operators, which form a covariant vector. For the operator γμμ to remain invariant, the gammas must transform among themselves as a contravariant vector with respect to their spacetime index. These new gammas will themselves satisfy the Clifford relations, because of the orthogonality of the Lorentz transformation. By the fundamental theorem, one may replace the new set by the old set subject to a unitary transformation. In the new frame, remembering that the rest mass is a relativistic scalar, the Dirac equation will then take the form

 

If the transformed spinor is defined as

 
then the transformed Dirac equation is produced in a way that demonstrates manifest relativistic invariance:
 

Thus, settling on any unitary representation of the gammas is final, provided the spinor is transformed according to the unitary transformation that corresponds to the given Lorentz transformation.

The various representations of the Dirac matrices employed will bring into focus particular aspects of the physical content in the Dirac wave function. The representation shown here is known as the standard representation – in it, the wave function's upper two components go over into Pauli's 2 spinor wave function in the limit of low energies and small velocities in comparison to light.

The considerations above reveal the origin of the gammas in geometry, hearkening back to Grassmann's original motivation; they represent a fixed basis of unit vectors in spacetime. Similarly, products of the gammas such as γμγν represent oriented surface elements, and so on. With this in mind, one can find the form of the unit volume element on spacetime in terms of the gammas as follows. By definition, it is

 

For this to be an invariant, the epsilon symbol must be a tensor, and so must contain a factor of g, where g is the determinant of the metric tensor. Since this is negative, that factor is imaginary. Thus

 

This matrix is given the special symbol γ5, owing to its importance when one is considering improper transformations of space-time, that is, those that change the orientation of the basis vectors. In the standard representation, it is

 

This matrix will also be found to anticommute with the other four Dirac matrices:

 

It takes a leading role when questions of parity arise because the volume element as a directed magnitude changes sign under a space-time reflection. Taking the positive square root above thus amounts to choosing a handedness convention on spacetime.

Comparison with related theories edit

Pauli theory edit

The necessity of introducing half-integer spin goes back experimentally to the results of the Stern–Gerlach experiment. A beam of atoms is run through a strong inhomogeneous magnetic field, which then splits into N parts depending on the intrinsic angular momentum of the atoms. It was found that for silver atoms, the beam was split in two; the ground state therefore could not be integer, because even if the intrinsic angular momentum of the atoms were as small as possible, 1, the beam would be split into three parts, corresponding to atoms with Lz = −1, 0, +1. The conclusion is that silver atoms have net intrinsic angular momentum of 12. Pauli set up a theory which explained this splitting by introducing a two-component wave function and a corresponding correction term in the Hamiltonian, representing a semi-classical coupling of this wave function to an applied magnetic field, as so in SI units: (Note that bold faced characters imply Euclidean vectors in 3 dimensions, whereas the Minkowski four-vector Aμ can be defined as  .)

 

Here A and   represent the components of the electromagnetic four-potential in their standard SI units, and the three sigmas are the Pauli matrices. On squaring out the first term, a residual interaction with the magnetic field is found, along with the usual classical Hamiltonian of a charged particle interacting with an applied field in SI units:

 

This Hamiltonian is now a 2 × 2 matrix, so the Schrödinger equation based on it must use a two-component wave function. On introducing the external electromagnetic 4-vector potential into the Dirac equation in a similar way, known as minimal coupling, it takes the form:

 

A second application of the Dirac operator will now reproduce the Pauli term exactly as before, because the spatial Dirac matrices multiplied by i, have the same squaring and commutation properties as the Pauli matrices. What is more, the value of the gyromagnetic ratio of the electron, standing in front of Pauli's new term, is explained from first principles. This was a major achievement of the Dirac equation and gave physicists great faith in its overall correctness. There is more however. The Pauli theory may be seen as the low energy limit of the Dirac theory in the following manner. First the equation is written in the form of coupled equations for 2-spinors with the SI units restored:

 
so
 

Assuming the field is weak and the motion of the electron non-relativistic, the total energy of the electron is approximately equal to its rest energy, and the momentum going over to the classical value,

 
and so the second equation may be written
 

which is of order v/c – thus at typical energies and velocities, the bottom components of the Dirac spinor in the standard representation are much suppressed in comparison to the top components. Substituting this expression into the first equation gives after some rearrangement

 

The operator on the left represents the particle energy reduced by its rest energy, which is just the classical energy, so one can recover Pauli's theory upon identifying his 2-spinor with the top components of the Dirac spinor in the non-relativistic approximation. A further approximation gives the Schrödinger equation as the limit of the Pauli theory. Thus, the Schrödinger equation may be seen as the far non-relativistic approximation of the Dirac equation when one may neglect spin and work only at low energies and velocities. This also was a great triumph for the new equation, as it traced the mysterious i that appears in it, and the necessity of a complex wave function, back to the geometry of spacetime through the Dirac algebra. It also highlights why the Schrödinger equation, although superficially in the form of a diffusion equation, actually represents the propagation of waves.

It should be strongly emphasized that this separation of the Dirac spinor into large and small components depends explicitly on a low-energy approximation. The entire Dirac spinor represents an irreducible whole, and the components just neglected here to arrive at the Pauli theory will bring in new phenomena in the relativistic regime – antimatter and the idea of creation and annihilation of particles.

Weyl theory edit

In the massless case  , the Dirac equation reduces to the Weyl equation, which describes relativistic massless spin-12 particles.[8]

The theory acquires a second   symmetry: see below.

Physical interpretation edit

Identification of observables edit

The critical physical question in a quantum theory is this: what are the physically observable quantities defined by the theory? According to the postulates of quantum mechanics, such quantities are defined by Hermitian operators that act on the Hilbert space of possible states of a system. The eigenvalues of these operators are then the possible results of measuring the corresponding physical quantity. In the Schrödinger theory, the simplest such object is the overall Hamiltonian, which represents the total energy of the system. To maintain this interpretation on passing to the Dirac theory, the Hamiltonian must be taken to be

 
where, as always, there is an implied summation over the twice-repeated index k = 1, 2, 3. This looks promising, because one can see by inspection the rest energy of the particle and, in the case of A = 0, the energy of a charge placed in an electric potential cqA0. What about the term involving the vector potential? In classical electrodynamics, the energy of a charge moving in an applied potential is
 

Thus, the Dirac Hamiltonian is fundamentally distinguished from its classical counterpart, and one must take great care to correctly identify what is observable in this theory. Much of the apparently paradoxical behavior implied by the Dirac equation amounts to a misidentification of these observables.[citation needed]

Hole theory edit

The negative E solutions to the equation are problematic, for it was assumed that the particle has a positive energy. Mathematically speaking, however, there seems to be no reason for us to reject the negative-energy solutions. Since they exist, they cannot simply be ignored, for once the interaction between the electron and the electromagnetic field is included, any electron placed in a positive-energy eigenstate would decay into negative-energy eigenstates of successively lower energy. Real electrons obviously do not behave in this way, or they would disappear by emitting energy in the form of photons.

To cope with this problem, Dirac introduced the hypothesis, known as hole theory, that the vacuum is the many-body quantum state in which all the negative-energy electron eigenstates are occupied. This description of the vacuum as a "sea" of electrons is called the Dirac sea. Since the Pauli exclusion principle forbids electrons from occupying the same state, any additional electron would be forced to occupy a positive-energy eigenstate, and positive-energy electrons would be forbidden from decaying into negative-energy eigenstates.

Dirac further reasoned that if the negative-energy eigenstates are incompletely filled, each unoccupied eigenstate – called a hole – would behave like a positively charged particle. The hole possesses a positive energy because energy is required to create a particle–hole pair from the vacuum. As noted above, Dirac initially thought that the hole might be the proton, but Hermann Weyl pointed out that the hole should behave as if it had the same mass as an electron, whereas the proton is over 1800 times heavier. The hole was eventually identified as the positron, experimentally discovered by Carl Anderson in 1932.[9]

It is not entirely satisfactory to describe the "vacuum" using an infinite sea of negative-energy electrons. The infinitely negative contributions from the sea of negative-energy electrons have to be canceled by an infinite positive "bare" energy and the contribution to the charge density and current coming from the sea of negative-energy electrons is exactly canceled by an infinite positive "jellium" background so that the net electric charge density of the vacuum is zero. In quantum field theory, a Bogoliubov transformation on the creation and annihilation operators (turning an occupied negative-energy electron state into an unoccupied positive energy positron state and an unoccupied negative-energy electron state into an occupied positive energy positron state) allows us to bypass the Dirac sea formalism even though, formally, it is equivalent to it.

In certain applications of condensed matter physics, however, the underlying concepts of "hole theory" are valid. The sea of conduction electrons in an electrical conductor, called a Fermi sea, contains electrons with energies up to the chemical potential of the system. An unfilled state in the Fermi sea behaves like a positively charged electron, and although it too is referred to as an "electron hole", it is distinct from a positron. The negative charge of the Fermi sea is balanced by the positively charged ionic lattice of the material.

In quantum field theory edit

In quantum field theories such as quantum electrodynamics, the Dirac field is subject to a process of second quantization, which resolves some of the paradoxical features of the equation.

Further discussion of Lorentz covariance of the Dirac equation edit

The Dirac equation is Lorentz covariant. Articulating this helps illuminate not only the Dirac equation, but also the Majorana spinor and Elko spinor, which although closely related, have subtle and important differences.

Understanding Lorentz covariance is simplified by keeping in mind the geometric character of the process.[10] Let   be a single, fixed point in the spacetime manifold. Its location can be expressed in multiple coordinate systems. In the physics literature, these are written as   and  , with the understanding that both   and   describe the same point  , but in different local frames of reference (a frame of reference over a small extended patch of spacetime). One can imagine   as having a fiber of different coordinate frames above it. In geometric terms, one says that spacetime can be characterized as a fiber bundle, and specifically, the frame bundle. The difference between two points   and   in the same fiber is a combination of rotations and Lorentz boosts. A choice of coordinate frame is a (local) section through that bundle.

Coupled to the frame bundle is a second bundle, the spinor bundle. A section through the spinor bundle is just the particle field (the Dirac spinor, in the present case). Different points in the spinor fiber correspond to the same physical object (the fermion) but expressed in different Lorentz frames. Clearly, the frame bundle and the spinor bundle must be tied together in a consistent fashion to get consistent results; formally, one says that the spinor bundle is the associated bundle; it is associated to a principal bundle, which in the present case is the frame bundle. Differences between points on the fiber correspond to the symmetries of the system. The spinor bundle has two distinct generators of its symmetries: the total angular momentum and the intrinsic angular momentum. Both correspond to Lorentz transformations, but in different ways.

The presentation here follows that of Itzykson and Zuber.[11] It is very nearly identical to that of Bjorken and Drell.[12] A similar derivation in a general relativistic setting can be found in Weinberg.[13] Here we fix our spacetime to be flat, that is, our spacetime is Minkowski space.

Under a Lorentz transformation   the Dirac spinor to transform as

 
It can be shown that an explicit expression for   is given by
 
where   parameterizes the Lorentz transformation, and   are the six 4×4 matrices satisfying:
 

This matrix can be interpreted as the intrinsic angular momentum of the Dirac field. That it deserves this interpretation arises by contrasting it to the generator   of Lorentz transformations, having the form

 
This can be interpreted as the total angular momentum. It acts on the spinor field as
 
Note the   above does not have a prime on it: the above is obtained by transforming   obtaining the change to   and then returning to the original coordinate system  .

The geometrical interpretation of the above is that the frame field is affine, having no preferred origin. The generator   generates the symmetries of this space: it provides a relabelling of a fixed point   The generator   generates a movement from one point in the fiber to another: a movement from   with both   and   still corresponding to the same spacetime point   These perhaps obtuse remarks can be elucidated with explicit algebra.

Let   be a Lorentz transformation. The Dirac equation is

 
If the Dirac equation is to be covariant, then it should have exactly the same form in all Lorentz frames:
 
The two spinors   and   should both describe the same physical field, and so should be related by a transformation that does not change any physical observables (charge, current, mass, etc.) The transformation should encode only the change of coordinate frame. It can be shown that such a transformation is a 4×4 unitary matrix. Thus, one may presume that the relation between the two frames can be written as
 
Inserting this into the transformed equation, the result is
 
The coordinates related by Lorentz transformation satisfy:
 
The original Dirac equation is then regained if
 
An explicit expression for   (equal to the expression given above) can be obtained by considering a Lorentz transformation of infinitesimal rotation near the identity transformation:
 
where   is the metric tensor :   and is symmetric while   is antisymmetric. After plugging and chugging, one obtains
 
which is the (infinitesimal) form for   above and yields the relation   . To obtain the affine relabelling, write
 

After properly antisymmetrizing, one obtains the generator of symmetries   given earlier. Thus, both   and   can be said to be the "generators of Lorentz transformations", but with a subtle distinction: the first corresponds to a relabelling of points on the affine frame bundle, which forces a translation along the fiber of the spinor on the spin bundle, while the second corresponds to translations along the fiber of the spin bundle (taken as a movement   along the frame bundle, as well as a movement   along the fiber of the spin bundle.) Weinberg provides additional arguments for the physical interpretation of these as total and intrinsic angular momentum.[14]

Other formulations edit

The Dirac equation can be formulated in a number of other ways.

Curved spacetime edit

This article has developed the Dirac equation in flat spacetime according to special relativity. It is possible to formulate the Dirac equation in curved spacetime.

The algebra of physical space edit

This article developed the Dirac equation using four-vectors and Schrödinger operators. The Dirac equation in the algebra of physical space uses a Clifford algebra over the real numbers, a type of geometric algebra.

Coupled Weyl Spinors edit

As mentioned above, the massless Dirac equation immediately reduces to the homogeneous Weyl equation. By using the chiral representation of the gamma matrices, the nonzero-mass equation can also be decomposed into a pair of coupled inhomogeneous Weyl equations acting on the first and last pairs of indices of the original four-component spinor, i.e.  , where   and   are each two-component Weyl spinors. This is because the skew block form of the chiral gamma matrices means that they swap the   and   and apply the two-by-two Pauli matrices to each:

 .

So the Dirac equation

 

becomes

 

which in turn is equivalent to a pair of inhomogeneous Weyl equations for massless left- and right-helicity spinors, where the coupling strength is proportional to the mass:

 

 .[clarification needed]

This has been proposed as an intuitive explanation of Zitterbewegung, as these massless components would propagate at the speed of light and move in opposite directions, since the helicity is the projection of the spin onto the direction of motion.[15] Here the role of the "mass"   is not to make the velocity less than the speed of light, but instead controls the average rate at which these reversals occur; specifically, the reversals can be modeled as a Poisson process.[16]

U(1) symmetry edit

Natural units are used in this section. The coupling constant is labelled by convention with  : this parameter can also be viewed as modelling the electron charge.

Vector symmetry edit

The Dirac equation and action admits a   symmetry where the fields   transform as

 
This is a global symmetry, known as the   vector symmetry (as opposed to the   axial symmetry: see below). By Noether's theorem there is a corresponding conserved current: this has been mentioned previously as
 

Gauging the symmetry edit

If we 'promote' the global symmetry, parametrised by the constant  , to a local symmetry, parametrised by a function  , or equivalently   the Dirac equation is no longer invariant: there is a residual derivative of  .

The fix proceeds as in scalar electrodynamics: the partial derivative is promoted to a covariant derivative

dirac, equation, confused, with, dirac, delta, function, particle, physics, relativistic, wave, equation, derived, british, physicist, paul, dirac, 1928, free, form, including, electromagnetic, interactions, describes, spin, massive, particles, called, dirac, . Not to be confused with Dirac delta function In particle physics the Dirac equation is a relativistic wave equation derived by British physicist Paul Dirac in 1928 In its free form or including electromagnetic interactions it describes all spin 1 2 massive particles called Dirac particles such as electrons and quarks for which parity is a symmetry It is consistent with both the principles of quantum mechanics and the theory of special relativity 1 and was the first theory to account fully for special relativity in the context of quantum mechanics It was validated by accounting for the fine structure of the hydrogen spectrum in a completely rigorous way The equation also implied the existence of a new form of matter antimatter previously unsuspected and unobserved and which was experimentally confirmed several years later It also provided a theoretical justification for the introduction of several component wave functions in Pauli s phenomenological theory of spin The wave functions in the Dirac theory are vectors of four complex numbers known as bispinors two of which resemble the Pauli wavefunction in the non relativistic limit in contrast to the Schrodinger equation which described wave functions of only one complex value Moreover in the limit of zero mass the Dirac equation reduces to the Weyl equation Although Dirac did not at first fully appreciate the importance of his results the entailed explanation of spin as a consequence of the union of quantum mechanics and relativity and the eventual discovery of the positron represents one of the great triumphs of theoretical physics This accomplishment has been described as fully on a par with the works of Newton Maxwell and Einstein before him 2 It has been deemed by some physicists to be the real seed of modern physics 3 In the context of quantum field theory the Dirac equation is reinterpreted to describe quantum fields corresponding to spin 1 2 particles The Dirac equation is inscribed upon a plaque on the floor of Westminster Abbey Unveiled on 13 November 1995 the plaque commemorates Paul Dirac s life 4 Contents 1 Mathematical formulation 1 1 Dirac adjoint and the adjoint equation 1 2 Klein Gordon equation 1 3 Conserved current 1 4 Solutions 1 4 1 Plane wave solutions 1 5 Lagrangian formulation 1 6 Lorentz invariance 2 Historical developments and further mathematical details 2 1 Making the Schrodinger equation relativistic 2 2 Dirac s coup 2 3 Covariant form and relativistic invariance 3 Comparison with related theories 3 1 Pauli theory 3 2 Weyl theory 4 Physical interpretation 4 1 Identification of observables 4 2 Hole theory 4 3 In quantum field theory 5 Further discussion of Lorentz covariance of the Dirac equation 6 Other formulations 6 1 Curved spacetime 6 2 The algebra of physical space 6 3 Coupled Weyl Spinors 7 U 1 symmetry 7 1 Vector symmetry 7 2 Gauging the symmetry 7 3 Axial symmetry 7 4 Extension to color symmetry 7 4 1 Physical applications 7 4 2 Generalisations 8 See also 8 1 Articles on the Dirac equation 8 2 Other equations 8 3 Other topics 9 References 9 1 Citations 9 2 Selected papers 9 3 Textbooks 10 External linksMathematical formulation editIn its modern formulation for field theory the Dirac equation is written in terms of a Dirac spinor field ps displaystyle psi nbsp taking values in a complex vector space described concretely as C 4 displaystyle mathbb C 4 nbsp defined on flat spacetime Minkowski space R 1 3 displaystyle mathbb R 1 3 nbsp Its expression also contains gamma matrices and a parameter m gt 0 displaystyle m gt 0 nbsp interpreted as the mass as well as other physical constants Dirac first obtained his equation through a factorization of Einstein s energy momentum mass equivalence relation assuming a scalar product of momentum vectors determined by the metric tensor and quantized the resulting relation by associating momenta to their respective operators In terms of a field ps R 1 3 C 4 displaystyle psi mathbb R 1 3 rightarrow mathbb C 4 nbsp the Dirac equation is then Dirac equation i ℏ g m m m c ps x 0 displaystyle i hbar gamma mu partial mu mc psi x 0 nbsp and in natural units with Feynman slash notation Dirac equation natural units i m ps x 0 displaystyle i partial m psi x 0 nbsp The gamma matrices are a set of four 4 4 displaystyle 4 times 4 nbsp complex matrices elements of Mat 4 4 C displaystyle text Mat 4 times 4 mathbb C nbsp which satisfy the defining anti commutation relations g m g n 2 h m n I 4 displaystyle gamma mu gamma nu 2 eta mu nu I 4 nbsp where h m n displaystyle eta mu nu nbsp is the Minkowski metric element and the indices m n displaystyle mu nu nbsp run over 0 1 2 and 3 These matrices can be realized explicitly under a choice of representation Two common choices are the Dirac representation g 0 I 2 0 0 I 2 g i 0 s i s i 0 displaystyle gamma 0 begin pmatrix I 2 amp 0 0 amp I 2 end pmatrix quad gamma i begin pmatrix 0 amp sigma i sigma i amp 0 end pmatrix nbsp where s i displaystyle sigma i nbsp are the Pauli matrices and the chiral representation the g i displaystyle gamma i nbsp are the same but g 0 0 I 2 I 2 0 displaystyle gamma 0 begin pmatrix 0 amp I 2 I 2 amp 0 end pmatrix nbsp The slash notation is a compact notation forA g m A m displaystyle A gamma mu A mu nbsp where A displaystyle A nbsp is a four vector often it is the four vector differential operator m displaystyle partial mu nbsp The summation over the index m displaystyle mu nbsp is implied Dirac adjoint and the adjoint equation edit The Dirac adjoint of the spinor field ps x displaystyle psi x nbsp is defined asps x ps x g 0 displaystyle bar psi x psi x dagger gamma 0 nbsp Using the property of gamma matrices which follows straightforwardly from Hermicity properties of the g m displaystyle gamma mu nbsp that g m g 0 g m g 0 displaystyle gamma mu dagger gamma 0 gamma mu gamma 0 nbsp one can derive the adjoint Dirac equation by taking the Hermitian conjugate of the Dirac equation and multiplying on the right by g 0 displaystyle gamma 0 nbsp ps x i g m m m 0 displaystyle bar psi x i gamma mu overleftarrow partial mu m 0 nbsp where the partial derivative m displaystyle overleftarrow partial mu nbsp acts from the right on ps x displaystyle bar psi x nbsp written in the usual way in terms of a left action of the derivative we have i m ps x g m m ps x 0 displaystyle i partial mu bar psi x gamma mu m bar psi x 0 nbsp Klein Gordon equation edit Applying i m displaystyle i partial m nbsp to the Dirac equation gives m m m 2 ps x 0 displaystyle partial mu partial mu m 2 psi x 0 nbsp That is each component of the Dirac spinor field satisfies the Klein Gordon equation Conserved current edit A conserved current of the theory isJ m ps g m ps displaystyle J mu bar psi gamma mu psi nbsp Proof of conservation from Dirac equation Adding the Dirac and adjoint Dirac equations givesi m ps g m ps ps g m m ps 0 displaystyle i partial mu bar psi gamma mu psi bar psi gamma mu partial mu psi 0 nbsp so by Leibniz rule i m ps g m ps 0 displaystyle i partial mu bar psi gamma mu psi 0 nbsp Another approach to derive this expression is by variational methods applying Noether s theorem for the global U 1 displaystyle text U 1 nbsp symmetry to derive the conserved current J m displaystyle J mu nbsp Proof of conservation from Noether s theorem Recall the Lagrangian isL ps i g m m m ps displaystyle mathcal L bar psi i gamma mu partial mu m psi nbsp Under a U 1 displaystyle U 1 nbsp symmetry which sends ps e i a ps ps e i a ps displaystyle begin aligned psi amp mapsto e i alpha psi bar psi amp mapsto e i alpha bar psi end aligned nbsp we find the Lagrangian is invariant Now considering the variation parameter a displaystyle alpha nbsp to be infinitesimal we work at first order in a displaystyle alpha nbsp and ignore O a 2 displaystyle mathcal O alpha 2 nbsp terms From the previous discussion we immediately see the explicit variation in the Lagrangian due to a displaystyle alpha nbsp is vanishing that is under the variation L L d L displaystyle mathcal L mapsto mathcal L delta mathcal L nbsp where d L 0 displaystyle delta mathcal L 0 nbsp As part of Noether s theorem we find the implicit variation in the Lagrangian due to variation of fields If the equation of motion for ps ps displaystyle psi bar psi nbsp are satisfied then d L m L m ps d ps L m ps d ps displaystyle delta mathcal L partial mu left frac partial mathcal L partial partial mu psi delta psi frac partial mathcal L partial partial mu bar psi delta bar psi right nbsp This immediately simplifies as there are no partial derivatives of ps displaystyle bar psi nbsp in the Lagrangian d ps displaystyle delta psi nbsp is the infinitesimal variationd ps x i a ps x displaystyle delta psi x i alpha psi x nbsp We evaluate L m ps i ps g m displaystyle frac partial mathcal L partial partial mu psi i bar psi gamma mu nbsp The equation becomes 0 a m ps g m ps displaystyle 0 alpha partial mu bar psi gamma mu psi nbsp and we re done Solutions edit Further information Dirac spinor and Hole theory Since the Dirac operator acts on 4 tuples of square integrable functions its solutions should be members of the same Hilbert space The fact that the energies of the solutions do not have a lower bound is unexpected Plane wave solutions edit Plane wave solutions are those arising from an ansatzps x u p e i p x displaystyle psi x u mathbf p e ip cdot x nbsp which models a particle with definite 4 momentum p E p p displaystyle p E mathbf p mathbf p nbsp where E p m 2 p 2 textstyle E mathbf p sqrt m 2 mathbf p 2 nbsp For this ansatz the Dirac equation becomes an equation for u p displaystyle u mathbf p nbsp g m p m m u p 0 displaystyle left gamma mu p mu m right u mathbf p 0 nbsp After picking a representation for the gamma matrices g m displaystyle gamma mu nbsp solving this is a matter of solving a system of linear equations It is a representation free property of gamma matrices that the solution space is two dimensional see here For example in the chiral representation for g m displaystyle gamma mu nbsp the solution space is parametrised by a C 2 displaystyle mathbb C 2 nbsp vector 3 displaystyle xi nbsp withu p s m p m 3 s m p m 3 displaystyle u mathbf p begin pmatrix sqrt sigma mu p mu xi sqrt bar sigma mu p mu xi end pmatrix nbsp where s m I 2 s i s m I 2 s i displaystyle sigma mu I 2 sigma i bar sigma mu I 2 sigma i nbsp and displaystyle sqrt cdot nbsp is the Hermitian matrix square root These plane wave solutions provide a starting point for canonical quantization Lagrangian formulation edit Both the Dirac equation and the Adjoint Dirac equation can be obtained from varying the action with a specific Lagrangian density that is given by L i ℏ c ps g m m ps m c 2 ps ps displaystyle mathcal L i hbar c overline psi gamma mu partial mu psi mc 2 overline psi psi nbsp If one varies this with respect to ps displaystyle psi nbsp one gets the adjoint Dirac equation Meanwhile if one varies this with respect to ps displaystyle bar psi nbsp one gets the Dirac equation In natural units and with the slash notation the action is then Dirac Action S d 4 x ps i m ps displaystyle S int d 4 x bar psi i partial big m psi nbsp For this action the conserved current J m displaystyle J mu nbsp above arises as the conserved current corresponding to the global U 1 displaystyle text U 1 nbsp symmetry through Noether s theorem for field theory Gauging this field theory by changing the symmetry to a local spacetime point dependent one gives gauge symmetry really gauge redundancy The resultant theory is quantum electrodynamics or QED See below for a more detailed discussion Lorentz invariance edit The Dirac equation is invariant under Lorentz transformations that is under the action of the Lorentz group SO 1 3 displaystyle text SO 1 3 nbsp or strictly SO 1 3 displaystyle text SO 1 3 nbsp the component connected to the identity For a Dirac spinor viewed concretely as taking values in C 4 displaystyle mathbb C 4 nbsp the transformation under a Lorentz transformation L displaystyle Lambda nbsp is given by a 4 4 displaystyle 4 times 4 nbsp complex matrix S L displaystyle S Lambda nbsp There are some subtleties in defining the corresponding S L displaystyle S Lambda nbsp as well as a standard abuse of notation Most treatments occur at the Lie algebra level For a more detailed treatment see here The Lorentz group of 4 4 displaystyle 4 times 4 nbsp real matrices acting on R 1 3 displaystyle mathbb R 1 3 nbsp is generated by a set of six matrices M m n displaystyle M mu nu nbsp with components M m n r s h m r d n s h n r d m s displaystyle M mu nu rho sigma eta mu rho delta nu sigma eta nu rho delta mu sigma nbsp When both the r s displaystyle rho sigma nbsp indices are raised or lowered these are simply the standard basis of antisymmetric matrices These satisfy the Lorentz algebra commutation relations M m n M r s M m s h n r M n s h m r M n r h m s M m r h n s displaystyle M mu nu M rho sigma M mu sigma eta nu rho M nu sigma eta mu rho M nu rho eta mu sigma M mu rho eta nu sigma nbsp In the article on the Dirac algebra it is also found that the spin generators S m n 1 4 g m g n displaystyle S mu nu frac 1 4 gamma mu gamma nu nbsp satisfy the Lorentz algebra commutation relations A Lorentz transformation L displaystyle Lambda nbsp can be written asL exp 1 2 w m n M m n displaystyle Lambda exp left frac 1 2 omega mu nu M mu nu right nbsp where the components w m n displaystyle omega mu nu nbsp are antisymmetric in m n displaystyle mu nu nbsp The corresponding transformation on spin space isS L exp 1 2 w m n S m n displaystyle S Lambda exp left frac 1 2 omega mu nu S mu nu right nbsp This is an abuse of notation but a standard one The reason is S L displaystyle S Lambda nbsp is not a well defined function of L displaystyle Lambda nbsp since there are two different sets of components w m n displaystyle omega mu nu nbsp up to equivalence which give the same L displaystyle Lambda nbsp but different S L displaystyle S Lambda nbsp In practice we implicitly pick one of these w m n displaystyle omega mu nu nbsp and then S L displaystyle S Lambda nbsp is well defined in terms of w m n displaystyle omega mu nu nbsp Under a Lorentz transformation the Dirac equationi g m m ps x m ps x displaystyle i gamma mu partial mu psi x m psi x nbsp becomes i g m L 1 m n n S L ps L 1 x m S L ps L 1 x 0 displaystyle i gamma mu Lambda 1 mu nu partial nu S Lambda psi Lambda 1 x mS Lambda psi Lambda 1 x 0 nbsp Remainder of proof of Lorentz invariance Multiplying both sides from the left by S 1 L displaystyle S 1 Lambda nbsp and returning the dummy variable to x displaystyle x nbsp givesi S L 1 g m S L L 1 m n n ps x m ps x 0 displaystyle iS Lambda 1 gamma mu S Lambda Lambda 1 mu nu partial nu psi x m psi x 0 nbsp We ll have shown invariance if S L 1 g m S L L 1 n m g n displaystyle S Lambda 1 gamma mu S Lambda Lambda 1 nu mu gamma nu nbsp or equivalently S L 1 g m S L L m n g n displaystyle S Lambda 1 gamma mu S Lambda Lambda mu nu gamma nu nbsp This is most easily shown at the algebra level Supposing the transformations are parametrised by infinitesimal components w m n displaystyle omega mu nu nbsp then at first order in w displaystyle omega nbsp on the left hand side we get 1 2 w r s M r s m n g n displaystyle frac 1 2 omega rho sigma M rho sigma mu nu gamma nu nbsp while on the right hand side we get 1 2 w r s S r s g m 1 2 w r s S r s g m displaystyle left frac 1 2 omega rho sigma S rho sigma gamma mu right frac 1 2 omega rho sigma left S rho sigma gamma mu right nbsp It s a standard exercise to evaluate the commutator on the left hand side Writing M r s displaystyle M rho sigma nbsp in terms of components completes the proof Associated to Lorentz invariance is a conserved Noether current or rather a tensor of conserved Noether currents J r s m displaystyle mathcal J rho sigma mu nbsp Similarly since the equation is invariant under translations there is a tensor of conserved Noether currents T m n displaystyle T mu nu nbsp which can be identified as the stress energy tensor of the theory The Lorentz current J r s m displaystyle mathcal J rho sigma mu nbsp can be written in terms of the stress energy tensor in addition to a tensor representing internal angular momentum Historical developments and further mathematical details editThe Dirac equation was also used historically to define a quantum mechanical theory where ps x displaystyle psi x nbsp is instead interpreted as a wave function The Dirac equation in the form originally proposed by Dirac is 5 b m c 2 c n 1 3 a n p n ps x t i ℏ ps x t t displaystyle left beta mc 2 c sum n 1 3 alpha n p n right psi x t i hbar frac partial psi x t partial t nbsp where ps x t is the wave function for the electron of rest mass m with spacetime coordinates x t The p1 p2 p3 are the components of the momentum understood to be the momentum operator in the Schrodinger equation Also c is the speed of light and ħ is the reduced Planck constant These fundamental physical constants reflect special relativity and quantum mechanics respectively Dirac s purpose in casting this equation was to explain the behavior of the relativistically moving electron and so to allow the atom to be treated in a manner consistent with relativity His rather modest hope was that the corrections introduced this way might have a bearing on the problem of atomic spectra Up until that time attempts to make the old quantum theory of the atom compatible with the theory of relativity which were based on discretizing the angular momentum stored in the electron s possibly non circular orbit of the atomic nucleus had failed and the new quantum mechanics of Heisenberg Pauli Jordan Schrodinger and Dirac himself had not developed sufficiently to treat this problem Although Dirac s original intentions were satisfied his equation had far deeper implications for the structure of matter and introduced new mathematical classes of objects that are now essential elements of fundamental physics The new elements in this equation are the four 4 4 matrices a1 a2 a3 and b and the four component wave function ps There are four components in ps because the evaluation of it at any given point in configuration space is a bispinor It is interpreted as a superposition of a spin up electron a spin down electron a spin up positron and a spin down positron The 4 4 matrices ak and b are all Hermitian and are involutory a i 2 b 2 I 4 displaystyle alpha i 2 beta 2 I 4 nbsp and they all mutually anticommute a i a j a j a i 0 i j a i b b a i 0 displaystyle begin aligned alpha i alpha j alpha j alpha i amp 0 quad i neq j alpha i beta beta alpha i amp 0 end aligned nbsp These matrices and the form of the wave function have a deep mathematical significance The algebraic structure represented by the gamma matrices had been created some 50 years earlier by the English mathematician W K Clifford In turn Clifford s ideas had emerged from the mid 19th century work of the German mathematician Hermann Grassmann in his Lineare Ausdehnungslehre Theory of Linear Extensions The latter had been regarded as almost incomprehensible by most of his contemporaries The appearance of something so seemingly abstract at such a late date and in such a direct physical manner is one of the most remarkable chapters in the history of physics citation needed Even more so a validation of the exquisite insight displayed by the mathematicians Grassmann and Clifford The single symbolic equation thus unravels into four coupled linear first order partial differential equations for the four quantities that make up the wave function The equation can be written more explicitly in Planck units as 6 i x ps 4 ps 3 ps 2 ps 1 y ps 4 ps 3 ps 2 ps 1 i z ps 3 ps 4 ps 1 ps 2 m ps 1 ps 2 ps 3 ps 4 i t ps 1 ps 2 ps 3 ps 4 displaystyle i partial x begin bmatrix psi 4 psi 3 psi 2 psi 1 end bmatrix partial y begin bmatrix psi 4 psi 3 psi 2 psi 1 end bmatrix i partial z begin bmatrix psi 3 psi 4 psi 1 psi 2 end bmatrix m begin bmatrix psi 1 psi 2 psi 3 psi 4 end bmatrix i partial t begin bmatrix psi 1 psi 2 psi 3 psi 4 end bmatrix nbsp which makes it clearer that it is a set of four partial differential equations with four unknown functions Making the Schrodinger equation relativistic edit The Dirac equation is superficially similar to the Schrodinger equation for a massive free particle ℏ 2 2 m 2 ϕ i ℏ t ϕ displaystyle frac hbar 2 2m nabla 2 phi i hbar frac partial partial t phi nbsp The left side represents the square of the momentum operator divided by twice the mass which is the non relativistic kinetic energy Because relativity treats space and time as a whole a relativistic generalization of this equation requires that space and time derivatives must enter symmetrically as they do in the Maxwell equations that govern the behavior of light the equations must be differentially of the same order in space and time In relativity the momentum and the energies are the space and time parts of a spacetime vector the four momentum and they are related by the relativistically invariant relationE 2 m 2 c 4 p 2 c 2 displaystyle E 2 m 2 c 4 p 2 c 2 nbsp which says that the length of this four vector is proportional to the rest mass m Substituting the operator equivalents of the energy and momentum from the Schrodinger theory produces the Klein Gordon equation describing the propagation of waves constructed from relativistically invariant objects 1 c 2 2 t 2 2 ϕ m 2 c 2 ℏ 2 ϕ displaystyle left frac 1 c 2 frac partial 2 partial t 2 nabla 2 right phi frac m 2 c 2 hbar 2 phi nbsp with the wave function ϕ being a relativistic scalar a complex number which has the same numerical value in all frames of reference Space and time derivatives both enter to second order This has a telling consequence for the interpretation of the equation Because the equation is second order in the time derivative one must specify initial values both of the wave function itself and of its first time derivative in order to solve definite problems Since both may be specified more or less arbitrarily the wave function cannot maintain its former role of determining the probability density of finding the electron in a given state of motion In the Schrodinger theory the probability density is given by the positive definite expression r ϕ ϕ displaystyle rho phi phi nbsp and this density is convected according to the probability current vector J i ℏ 2 m ϕ ϕ ϕ ϕ displaystyle J frac i hbar 2m phi nabla phi phi nabla phi nbsp with the conservation of probability current and density following from the continuity equation J r t 0 displaystyle nabla cdot J frac partial rho partial t 0 nbsp The fact that the density is positive definite and convected according to this continuity equation implies that one may integrate the density over a certain domain and set the total to 1 and this condition will be maintained by the conservation law A proper relativistic theory with a probability density current must also share this feature To maintain the notion of a convected density one must generalize the Schrodinger expression of the density and current so that space and time derivatives again enter symmetrically in relation to the scalar wave function The Schrodinger expression can be kept for the current but the probability density must be replaced by the symmetrically formed expression further explanation needed r i ℏ 2 m c 2 ps t ps ps t ps displaystyle rho frac i hbar 2mc 2 left psi partial t psi psi partial t psi right nbsp which now becomes the 4th component of a spacetime vector and the entire probability 4 current density has the relativistically covariant expression J m i ℏ 2 m ps m ps ps m ps displaystyle J mu frac i hbar 2m left psi partial mu psi psi partial mu psi right nbsp The continuity equation is as before Everything is compatible with relativity now but the expression for the density is no longer positive definite the initial values of both ps and tps may be freely chosen and the density may thus become negative something that is impossible for a legitimate probability density Thus one cannot get a simple generalization of the Schrodinger equation under the naive assumption that the wave function is a relativistic scalar and the equation it satisfies second order in time Although it is not a successful relativistic generalization of the Schrodinger equation this equation is resurrected in the context of quantum field theory where it is known as the Klein Gordon equation and describes a spinless particle field e g pi meson or Higgs boson Historically Schrodinger himself arrived at this equation before the one that bears his name but soon discarded it In the context of quantum field theory the indefinite density is understood to correspond to the charge density which can be positive or negative and not the probability density Dirac s coup edit Dirac thus thought to try an equation that was first order in both space and time One could for example formally i e by abuse of notation take the relativistic expression for the energyE c p 2 m 2 c 2 displaystyle E c sqrt p 2 m 2 c 2 nbsp replace p by its operator equivalent expand the square root in an infinite series of derivative operators set up an eigenvalue problem then solve the equation formally by iterations Most physicists had little faith in such a process even if it were technically possible As the story goes Dirac was staring into the fireplace at Cambridge pondering this problem when he hit upon the idea of taking the square root of the wave operator thus 2 1 c 2 2 t 2 A x B y C z i c D t A x B y C z i c D t displaystyle nabla 2 frac 1 c 2 frac partial 2 partial t 2 left A partial x B partial y C partial z frac i c D partial t right left A partial x B partial y C partial z frac i c D partial t right nbsp On multiplying out the right side it is apparent that in order to get all the cross terms such as x y to vanish one must assumeA B B A 0 displaystyle AB BA 0 ldots nbsp with A 2 B 2 1 displaystyle A 2 B 2 dots 1 nbsp Dirac who had just then been intensely involved with working out the foundations of Heisenberg s matrix mechanics immediately understood that these conditions could be met if A B C and D are matrices with the implication that the wave function has multiple components This immediately explained the appearance of two component wave functions in Pauli s phenomenological theory of spin something that up until then had been regarded as mysterious even to Pauli himself However one needs at least 4 4 matrices to set up a system with the properties required so the wave function had four components not two as in the Pauli theory or one as in the bare Schrodinger theory The four component wave function represents a new class of mathematical object in physical theories that makes its first appearance here Given the factorization in terms of these matrices one can now write down immediately an equation A x B y C z i c D t ps k ps displaystyle left A partial x B partial y C partial z frac i c D partial t right psi kappa psi nbsp with k displaystyle kappa nbsp to be determined Applying again the matrix operator on both sides yields 2 1 c 2 t 2 ps k 2 ps displaystyle left nabla 2 frac 1 c 2 partial t 2 right psi kappa 2 psi nbsp Taking k m c ℏ displaystyle kappa tfrac mc hbar nbsp shows that all the components of the wave function individually satisfy the relativistic energy momentum relation Thus the sought for equation that is first order in both space and time is A x B y C z i c D t m c ℏ ps 0 displaystyle left A partial x B partial y C partial z frac i c D partial t frac mc hbar right psi 0 nbsp SettingA i b a 1 B i b a 2 C i b a 3 D b displaystyle A i beta alpha 1 B i beta alpha 2 C i beta alpha 3 D beta nbsp and because D 2 b 2 I 4 displaystyle D 2 beta 2 I 4 nbsp the Dirac equation is produced as written above Covariant form and relativistic invariance edit To demonstrate the relativistic invariance of the equation it is advantageous to cast it into a form in which the space and time derivatives appear on an equal footing New matrices are introduced as follows D g 0 A i g 1 B i g 2 C i g 3 displaystyle begin aligned D amp gamma 0 A amp i gamma 1 quad B i gamma 2 quad C i gamma 3 end aligned nbsp and the equation takes the form remembering the definition of the covariant components of the 4 gradient and especially that 0 1 c t Dirac equation i ℏ g m m m c ps 0 displaystyle i hbar gamma mu partial mu mc psi 0 nbsp where there is an implied summation over the values of the twice repeated index m 0 1 2 3 and m is the 4 gradient In practice one often writes the gamma matrices in terms of 2 2 sub matrices taken from the Pauli matrices and the 2 2 identity matrix Explicitly the standard representation isg 0 I 2 0 0 I 2 g 1 0 s x s x 0 g 2 0 s y s y 0 g 3 0 s z s z 0 displaystyle gamma 0 begin pmatrix I 2 amp 0 0 amp I 2 end pmatrix quad gamma 1 begin pmatrix 0 amp sigma x sigma x amp 0 end pmatrix quad gamma 2 begin pmatrix 0 amp sigma y sigma y amp 0 end pmatrix quad gamma 3 begin pmatrix 0 amp sigma z sigma z amp 0 end pmatrix nbsp The complete system is summarized using the Minkowski metric on spacetime in the form g m g n 2 h m n I 4 displaystyle left gamma mu gamma nu right 2 eta mu nu I 4 nbsp where the bracket expression a b a b b a displaystyle a b ab ba nbsp denotes the anticommutator These are the defining relations of a Clifford algebra over a pseudo orthogonal 4 dimensional space with metric signature The specific Clifford algebra employed in the Dirac equation is known today as the Dirac algebra Although not recognized as such by Dirac at the time the equation was formulated in hindsight the introduction of this geometric algebra represents an enormous stride forward in the development of quantum theory The Dirac equation may now be interpreted as an eigenvalue equation where the rest mass is proportional to an eigenvalue of the 4 momentum operator the proportionality constant being the speed of light P op ps m c ps displaystyle P text op psi mc psi nbsp Using d e f g m m displaystyle partial mathrel stackrel mathrm def gamma mu partial mu nbsp displaystyle partial big nbsp is pronounced d slash 7 according to Feynman slash notation the Dirac equation becomes i ℏ ps m c ps 0 displaystyle i hbar partial big psi mc psi 0 nbsp In practice physicists often use units of measure such that ħ c 1 known as natural units The equation then takes the simple form Dirac equation natural units i m ps 0 displaystyle i partial big m psi 0 nbsp A fundamental theorem states that if two distinct sets of matrices are given that both satisfy the Clifford relations then they are connected to each other by a similarity transformation g m S 1 g m S displaystyle gamma mu prime S 1 gamma mu S nbsp If in addition the matrices are all unitary as are the Dirac set then S itself is unitary g m U g m U displaystyle gamma mu prime U dagger gamma mu U nbsp The transformation U is unique up to a multiplicative factor of absolute value 1 Let us now imagine a Lorentz transformation to have been performed on the space and time coordinates and on the derivative operators which form a covariant vector For the operator gm m to remain invariant the gammas must transform among themselves as a contravariant vector with respect to their spacetime index These new gammas will themselves satisfy the Clifford relations because of the orthogonality of the Lorentz transformation By the fundamental theorem one may replace the new set by the old set subject to a unitary transformation In the new frame remembering that the rest mass is a relativistic scalar the Dirac equation will then take the form i U g m U m m ps x t 0 U i g m m m U ps x t 0 displaystyle begin aligned left iU dagger gamma mu U partial mu prime m right psi left x prime t prime right amp 0 U dagger i gamma mu partial mu prime m U psi left x prime t prime right amp 0 end aligned nbsp If the transformed spinor is defined asps U ps displaystyle psi prime U psi nbsp then the transformed Dirac equation is produced in a way that demonstrates manifest relativistic invariance i g m m m ps x t 0 displaystyle left i gamma mu partial mu prime m right psi prime left x prime t prime right 0 nbsp Thus settling on any unitary representation of the gammas is final provided the spinor is transformed according to the unitary transformation that corresponds to the given Lorentz transformation The various representations of the Dirac matrices employed will bring into focus particular aspects of the physical content in the Dirac wave function The representation shown here is known as the standard representation in it the wave function s upper two components go over into Pauli s 2 spinor wave function in the limit of low energies and small velocities in comparison to light The considerations above reveal the origin of the gammas in geometry hearkening back to Grassmann s original motivation they represent a fixed basis of unit vectors in spacetime Similarly products of the gammas such as gmgn represent oriented surface elements and so on With this in mind one can find the form of the unit volume element on spacetime in terms of the gammas as follows By definition it isV 1 4 ϵ m n a b g m g n g a g b displaystyle V frac 1 4 epsilon mu nu alpha beta gamma mu gamma nu gamma alpha gamma beta nbsp For this to be an invariant the epsilon symbol must be a tensor and so must contain a factor of g where g is the determinant of the metric tensor Since this is negative that factor is imaginary ThusV i g 0 g 1 g 2 g 3 displaystyle V i gamma 0 gamma 1 gamma 2 gamma 3 nbsp This matrix is given the special symbol g5 owing to its importance when one is considering improper transformations of space time that is those that change the orientation of the basis vectors In the standard representation it isg 5 0 I 2 I 2 0 displaystyle gamma 5 begin pmatrix 0 amp I 2 I 2 amp 0 end pmatrix nbsp This matrix will also be found to anticommute with the other four Dirac matrices g 5 g m g m g 5 0 displaystyle gamma 5 gamma mu gamma mu gamma 5 0 nbsp It takes a leading role when questions of parity arise because the volume element as a directed magnitude changes sign under a space time reflection Taking the positive square root above thus amounts to choosing a handedness convention on spacetime Comparison with related theories editPauli theory edit See also Pauli equation The necessity of introducing half integer spin goes back experimentally to the results of the Stern Gerlach experiment A beam of atoms is run through a strong inhomogeneous magnetic field which then splits into N parts depending on the intrinsic angular momentum of the atoms It was found that for silver atoms the beam was split in two the ground state therefore could not be integer because even if the intrinsic angular momentum of the atoms were as small as possible 1 the beam would be split into three parts corresponding to atoms with Lz 1 0 1 The conclusion is that silver atoms have net intrinsic angular momentum of 1 2 Pauli set up a theory which explained this splitting by introducing a two component wave function and a corresponding correction term in the Hamiltonian representing a semi classical coupling of this wave function to an applied magnetic field as so in SI units Note that bold faced characters imply Euclidean vectors in 3 dimensions whereas the Minkowski four vector Am can be defined as A m ϕ c A displaystyle A mu phi c mathbf A nbsp H 1 2 m s p e A 2 e ϕ displaystyle H frac 1 2m left boldsymbol sigma cdot left mathbf p e mathbf A right right 2 e phi nbsp Here A and ϕ displaystyle phi nbsp represent the components of the electromagnetic four potential in their standard SI units and the three sigmas are the Pauli matrices On squaring out the first term a residual interaction with the magnetic field is found along with the usual classical Hamiltonian of a charged particle interacting with an applied field in SI units H 1 2 m p e A 2 e ϕ e ℏ 2 m s B displaystyle H frac 1 2m left mathbf p e mathbf A right 2 e phi frac e hbar 2m boldsymbol sigma cdot mathbf B nbsp This Hamiltonian is now a 2 2 matrix so the Schrodinger equation based on it must use a two component wave function On introducing the external electromagnetic 4 vector potential into the Dirac equation in a similar way known as minimal coupling it takes the form g m i ℏ m e A m m c ps 0 displaystyle left gamma mu i hbar partial mu eA mu mc right psi 0 nbsp A second application of the Dirac operator will now reproduce the Pauli term exactly as before because the spatial Dirac matrices multiplied by i have the same squaring and commutation properties as the Pauli matrices What is more the value of the gyromagnetic ratio of the electron standing in front of Pauli s new term is explained from first principles This was a major achievement of the Dirac equation and gave physicists great faith in its overall correctness There is more however The Pauli theory may be seen as the low energy limit of the Dirac theory in the following manner First the equation is written in the form of coupled equations for 2 spinors with the SI units restored m c 2 E e ϕ c s p e A c s p e A m c 2 E e ϕ ps ps 0 0 displaystyle begin pmatrix mc 2 E e phi amp c boldsymbol sigma cdot left mathbf p e mathbf A right c boldsymbol sigma cdot left mathbf p e mathbf A right amp mc 2 E e phi end pmatrix begin pmatrix psi psi end pmatrix begin pmatrix 0 0 end pmatrix nbsp so E e ϕ ps c s p e A ps m c 2 ps E e ϕ ps c s p e A ps m c 2 ps displaystyle begin aligned E e phi psi c boldsymbol sigma cdot left mathbf p e mathbf A right psi amp mc 2 psi E e phi psi c boldsymbol sigma cdot left mathbf p e mathbf A right psi amp mc 2 psi end aligned nbsp Assuming the field is weak and the motion of the electron non relativistic the total energy of the electron is approximately equal to its rest energy and the momentum going over to the classical value E e ϕ m c 2 p m v displaystyle begin aligned E e phi amp approx mc 2 mathbf p amp approx m mathbf v end aligned nbsp and so the second equation may be written ps 1 2 m c s p e A ps displaystyle psi approx frac 1 2mc boldsymbol sigma cdot left mathbf p e mathbf A right psi nbsp which is of order v c thus at typical energies and velocities the bottom components of the Dirac spinor in the standard representation are much suppressed in comparison to the top components Substituting this expression into the first equation gives after some rearrangement E m c 2 ps 1 2 m s p e A 2 ps e ϕ ps displaystyle left E mc 2 right psi frac 1 2m left boldsymbol sigma cdot left mathbf p e mathbf A right right 2 psi e phi psi nbsp The operator on the left represents the particle energy reduced by its rest energy which is just the classical energy so one can recover Pauli s theory upon identifying his 2 spinor with the top components of the Dirac spinor in the non relativistic approximation A further approximation gives the Schrodinger equation as the limit of the Pauli theory Thus the Schrodinger equation may be seen as the far non relativistic approximation of the Dirac equation when one may neglect spin and work only at low energies and velocities This also was a great triumph for the new equation as it traced the mysterious i that appears in it and the necessity of a complex wave function back to the geometry of spacetime through the Dirac algebra It also highlights why the Schrodinger equation although superficially in the form of a diffusion equation actually represents the propagation of waves It should be strongly emphasized that this separation of the Dirac spinor into large and small components depends explicitly on a low energy approximation The entire Dirac spinor represents an irreducible whole and the components just neglected here to arrive at the Pauli theory will bring in new phenomena in the relativistic regime antimatter and the idea of creation and annihilation of particles Weyl theory edit In the massless case m 0 displaystyle m 0 nbsp the Dirac equation reduces to the Weyl equation which describes relativistic massless spin 1 2 particles 8 The theory acquires a second U 1 displaystyle text U 1 nbsp symmetry see below Physical interpretation editIdentification of observables edit The critical physical question in a quantum theory is this what are the physically observable quantities defined by the theory According to the postulates of quantum mechanics such quantities are defined by Hermitian operators that act on the Hilbert space of possible states of a system The eigenvalues of these operators are then the possible results of measuring the corresponding physical quantity In the Schrodinger theory the simplest such object is the overall Hamiltonian which represents the total energy of the system To maintain this interpretation on passing to the Dirac theory the Hamiltonian must be taken to beH g 0 m c 2 c g k p k q A k c q A 0 displaystyle H gamma 0 left mc 2 c gamma k left p k qA k right right cqA 0 nbsp where as always there is an implied summation over the twice repeated index k 1 2 3 This looks promising because one can see by inspection the rest energy of the particle and in the case of A 0 the energy of a charge placed in an electric potential cqA0 What about the term involving the vector potential In classical electrodynamics the energy of a charge moving in an applied potential is H c p q A 2 m 2 c 2 q A 0 displaystyle H c sqrt left mathbf p q mathbf A right 2 m 2 c 2 qA 0 nbsp Thus the Dirac Hamiltonian is fundamentally distinguished from its classical counterpart and one must take great care to correctly identify what is observable in this theory Much of the apparently paradoxical behavior implied by the Dirac equation amounts to a misidentification of these observables citation needed Hole theory edit The negative E solutions to the equation are problematic for it was assumed that the particle has a positive energy Mathematically speaking however there seems to be no reason for us to reject the negative energy solutions Since they exist they cannot simply be ignored for once the interaction between the electron and the electromagnetic field is included any electron placed in a positive energy eigenstate would decay into negative energy eigenstates of successively lower energy Real electrons obviously do not behave in this way or they would disappear by emitting energy in the form of photons To cope with this problem Dirac introduced the hypothesis known as hole theory that the vacuum is the many body quantum state in which all the negative energy electron eigenstates are occupied This description of the vacuum as a sea of electrons is called the Dirac sea Since the Pauli exclusion principle forbids electrons from occupying the same state any additional electron would be forced to occupy a positive energy eigenstate and positive energy electrons would be forbidden from decaying into negative energy eigenstates Dirac further reasoned that if the negative energy eigenstates are incompletely filled each unoccupied eigenstate called a hole would behave like a positively charged particle The hole possesses a positive energy because energy is required to create a particle hole pair from the vacuum As noted above Dirac initially thought that the hole might be the proton but Hermann Weyl pointed out that the hole should behave as if it had the same mass as an electron whereas the proton is over 1800 times heavier The hole was eventually identified as the positron experimentally discovered by Carl Anderson in 1932 9 It is not entirely satisfactory to describe the vacuum using an infinite sea of negative energy electrons The infinitely negative contributions from the sea of negative energy electrons have to be canceled by an infinite positive bare energy and the contribution to the charge density and current coming from the sea of negative energy electrons is exactly canceled by an infinite positive jellium background so that the net electric charge density of the vacuum is zero In quantum field theory a Bogoliubov transformation on the creation and annihilation operators turning an occupied negative energy electron state into an unoccupied positive energy positron state and an unoccupied negative energy electron state into an occupied positive energy positron state allows us to bypass the Dirac sea formalism even though formally it is equivalent to it In certain applications of condensed matter physics however the underlying concepts of hole theory are valid The sea of conduction electrons in an electrical conductor called a Fermi sea contains electrons with energies up to the chemical potential of the system An unfilled state in the Fermi sea behaves like a positively charged electron and although it too is referred to as an electron hole it is distinct from a positron The negative charge of the Fermi sea is balanced by the positively charged ionic lattice of the material In quantum field theory edit See also Fermionic field In quantum field theories such as quantum electrodynamics the Dirac field is subject to a process of second quantization which resolves some of the paradoxical features of the equation Further discussion of Lorentz covariance of the Dirac equation editThe Dirac equation is Lorentz covariant Articulating this helps illuminate not only the Dirac equation but also the Majorana spinor and Elko spinor which although closely related have subtle and important differences Understanding Lorentz covariance is simplified by keeping in mind the geometric character of the process 10 Let a displaystyle a nbsp be a single fixed point in the spacetime manifold Its location can be expressed in multiple coordinate systems In the physics literature these are written as x displaystyle x nbsp and x displaystyle x nbsp with the understanding that both x displaystyle x nbsp and x displaystyle x nbsp describe the same point a displaystyle a nbsp but in different local frames of reference a frame of reference over a small extended patch of spacetime One can imagine a displaystyle a nbsp as having a fiber of different coordinate frames above it In geometric terms one says that spacetime can be characterized as a fiber bundle and specifically the frame bundle The difference between two points x displaystyle x nbsp and x displaystyle x nbsp in the same fiber is a combination of rotations and Lorentz boosts A choice of coordinate frame is a local section through that bundle Coupled to the frame bundle is a second bundle the spinor bundle A section through the spinor bundle is just the particle field the Dirac spinor in the present case Different points in the spinor fiber correspond to the same physical object the fermion but expressed in different Lorentz frames Clearly the frame bundle and the spinor bundle must be tied together in a consistent fashion to get consistent results formally one says that the spinor bundle is the associated bundle it is associated to a principal bundle which in the present case is the frame bundle Differences between points on the fiber correspond to the symmetries of the system The spinor bundle has two distinct generators of its symmetries the total angular momentum and the intrinsic angular momentum Both correspond to Lorentz transformations but in different ways The presentation here follows that of Itzykson and Zuber 11 It is very nearly identical to that of Bjorken and Drell 12 A similar derivation in a general relativistic setting can be found in Weinberg 13 Here we fix our spacetime to be flat that is our spacetime is Minkowski space Under a Lorentz transformation x x displaystyle x mapsto x nbsp the Dirac spinor to transform asps x S ps x displaystyle psi x S psi x nbsp It can be shown that an explicit expression for S displaystyle S nbsp is given by S exp i 4 w m n s m n displaystyle S exp left frac i 4 omega mu nu sigma mu nu right nbsp where w m n displaystyle omega mu nu nbsp parameterizes the Lorentz transformation and s m n displaystyle sigma mu nu nbsp are the six 4 4 matrices satisfying s m n i 2 g m g n displaystyle sigma mu nu frac i 2 gamma mu gamma nu nbsp This matrix can be interpreted as the intrinsic angular momentum of the Dirac field That it deserves this interpretation arises by contrasting it to the generator J m n displaystyle J mu nu nbsp of Lorentz transformations having the formJ m n 1 2 s m n i x m n x n m displaystyle J mu nu frac 1 2 sigma mu nu i x mu partial nu x nu partial mu nbsp This can be interpreted as the total angular momentum It acts on the spinor field as ps x exp i 2 w m n J m n ps x displaystyle psi prime x exp left frac i 2 omega mu nu J mu nu right psi x nbsp Note the x displaystyle x nbsp above does not have a prime on it the above is obtained by transforming x x displaystyle x mapsto x nbsp obtaining the change to ps x ps x displaystyle psi x mapsto psi x nbsp and then returning to the original coordinate system x x displaystyle x mapsto x nbsp The geometrical interpretation of the above is that the frame field is affine having no preferred origin The generator J m n displaystyle J mu nu nbsp generates the symmetries of this space it provides a relabelling of a fixed point x displaystyle x nbsp The generator s m n displaystyle sigma mu nu nbsp generates a movement from one point in the fiber to another a movement from x x displaystyle x mapsto x nbsp with both x displaystyle x nbsp and x displaystyle x nbsp still corresponding to the same spacetime point a displaystyle a nbsp These perhaps obtuse remarks can be elucidated with explicit algebra Let x L x displaystyle x Lambda x nbsp be a Lorentz transformation The Dirac equation isi g m x m ps x m ps x 0 displaystyle i gamma mu frac partial partial x mu psi x m psi x 0 nbsp If the Dirac equation is to be covariant then it should have exactly the same form in all Lorentz frames i g m x m ps x m ps x 0 displaystyle i gamma mu frac partial partial x prime mu psi prime x prime m psi prime x prime 0 nbsp The two spinors ps displaystyle psi nbsp and ps displaystyle psi prime nbsp should both describe the same physical field and so should be related by a transformation that does not change any physical observables charge current mass etc The transformation should encode only the change of coordinate frame It can be shown that such a transformation is a 4 4 unitary matrix Thus one may presume that the relation between the two frames can be written as ps x S L ps x displaystyle psi prime x prime S Lambda psi x nbsp Inserting this into the transformed equation the result is i g m x n x m x n S L ps x m S L ps x 0 displaystyle i gamma mu frac partial x nu partial x prime mu frac partial partial x nu S Lambda psi x mS Lambda psi x 0 nbsp The coordinates related by Lorentz transformation satisfy x n x m L 1 n m displaystyle frac partial x nu partial x prime mu left Lambda 1 right nu mu nbsp The original Dirac equation is then regained if S L g m S 1 L L 1 m n g n displaystyle S Lambda gamma mu S 1 Lambda left Lambda 1 right mu nu gamma nu nbsp An explicit expression for S L displaystyle S Lambda nbsp equal to the expression given above can be obtained by considering a Lorentz transformation of infinitesimal rotation near the identity transformation L m n g m n w m n L 1 m n g m n w m n displaystyle Lambda mu nu g mu nu omega mu nu Lambda 1 mu nu g mu nu omega mu nu nbsp where g m n displaystyle g mu nu nbsp is the metric tensor g m n g m n g n n d m n displaystyle g mu nu g mu nu g nu nu delta mu nu nbsp and is symmetric while w m n w a n g a m displaystyle omega mu nu omega alpha nu g alpha mu nbsp is antisymmetric After plugging and chugging one obtains S L I i 4 w m n s m n O L 2 displaystyle S Lambda I frac i 4 omega mu nu sigma mu nu mathcal O left Lambda 2 right nbsp which is the infinitesimal form for S displaystyle S nbsp above and yields the relation s m n i 2 g m g n displaystyle sigma mu nu frac i 2 gamma mu gamma nu nbsp To obtain the affine relabelling write ps x I i 4 w m n s m n ps x I i 4 w m n s m n ps x w m n x n I i 4 w m n s m n x m w m n n ps x I i 2 w m n J m n ps x displaystyle begin aligned psi x amp left I frac i 4 omega mu nu sigma mu nu right psi x amp left I frac i 4 omega mu nu sigma mu nu right psi x omega mu nu x prime nu amp left I frac i 4 omega mu nu sigma mu nu x mu prime omega mu nu partial nu right psi x amp left I frac i 2 omega mu nu J mu nu right psi x end aligned nbsp After properly antisymmetrizing one obtains the generator of symmetries J m n displaystyle J mu nu nbsp given earlier Thus both J m n displaystyle J mu nu nbsp and s m n displaystyle sigma mu nu nbsp can be said to be the generators of Lorentz transformations but with a subtle distinction the first corresponds to a relabelling of points on the affine frame bundle which forces a translation along the fiber of the spinor on the spin bundle while the second corresponds to translations along the fiber of the spin bundle taken as a movement x x displaystyle x mapsto x nbsp along the frame bundle as well as a movement ps ps displaystyle psi mapsto psi nbsp along the fiber of the spin bundle Weinberg provides additional arguments for the physical interpretation of these as total and intrinsic angular momentum 14 Other formulations editThe Dirac equation can be formulated in a number of other ways Curved spacetime edit This article has developed the Dirac equation in flat spacetime according to special relativity It is possible to formulate the Dirac equation in curved spacetime The algebra of physical space edit This article developed the Dirac equation using four vectors and Schrodinger operators The Dirac equation in the algebra of physical space uses a Clifford algebra over the real numbers a type of geometric algebra Coupled Weyl Spinors edit As mentioned above the massless Dirac equation immediately reduces to the homogeneous Weyl equation By using the chiral representation of the gamma matrices the nonzero mass equation can also be decomposed into a pair of coupled inhomogeneous Weyl equations acting on the first and last pairs of indices of the original four component spinor i e ps ps L ps R displaystyle psi begin pmatrix psi L psi R end pmatrix nbsp where ps L displaystyle psi L nbsp and ps R displaystyle psi R nbsp are each two component Weyl spinors This is because the skew block form of the chiral gamma matrices means that they swap the ps L displaystyle psi L nbsp and ps R displaystyle psi R nbsp and apply the two by two Pauli matrices to each g m ps L ps R s m ps R s m ps L displaystyle gamma mu begin pmatrix psi L psi R end pmatrix begin pmatrix sigma mu psi R overline sigma mu psi L end pmatrix nbsp So the Dirac equation i g m m m ps L ps R 0 displaystyle i gamma mu partial mu m begin pmatrix psi L psi R end pmatrix 0 nbsp becomesi s m m ps R s m m ps L m ps L ps R displaystyle i begin pmatrix sigma mu partial mu psi R overline sigma mu partial mu psi L end pmatrix m begin pmatrix psi L psi R end pmatrix nbsp which in turn is equivalent to a pair of inhomogeneous Weyl equations for massless left and right helicity spinors where the coupling strength is proportional to the mass i s m m ps R m ps L displaystyle i sigma mu partial mu psi R m psi L nbsp i s m m ps L m ps R displaystyle i overline sigma mu partial mu psi L m psi R nbsp clarification needed This has been proposed as an intuitive explanation of Zitterbewegung as these massless components would propagate at the speed of light and move in opposite directions since the helicity is the projection of the spin onto the direction of motion 15 Here the role of the mass m displaystyle m nbsp is not to make the velocity less than the speed of light but instead controls the average rate at which these reversals occur specifically the reversals can be modeled as a Poisson process 16 U 1 symmetry editNatural units are used in this section The coupling constant is labelled by convention with e displaystyle e nbsp this parameter can also be viewed as modelling the electron charge Vector symmetry edit The Dirac equation and action admits a U 1 displaystyle text U 1 nbsp symmetry where the fields ps ps displaystyle psi bar psi nbsp transform asps x e i a ps x ps x e i a ps x displaystyle begin aligned psi x amp mapsto e i alpha psi x bar psi x amp mapsto e i alpha bar psi x end aligned nbsp This is a global symmetry known as the U 1 displaystyle text U 1 nbsp vector symmetry as opposed to the U 1 displaystyle text U 1 nbsp axial symmetry see below By Noether s theorem there is a corresponding conserved current this has been mentioned previously as J m x ps x g m ps x displaystyle J mu x bar psi x gamma mu psi x nbsp Gauging the symmetry edit See also Quantum electrodynamics If we promote the global symmetry parametrised by the constant a displaystyle alpha nbsp to a local symmetry parametrised by a function a R 1 3 R displaystyle alpha mathbb R 1 3 to mathbb R nbsp or equivalently e i a R 1 3 U 1 displaystyle e i alpha mathbb R 1 3 to text U 1 nbsp the Dirac equation is no longer invariant there is a residual derivative of a x displaystyle alpha x nbsp The fix proceeds as in scalar electrodynamics the partial derivative is promoted to a covariant derivative mstyle displaysty, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.