Nuclear Reactions in Stars
This chapter is divided into three parts:
preliminaries: relating rates to cross sections;
thermal distrubutions; thermally averaged rates; and the
S-factor
application to the pp chain
He burning
2.1 Reactions
2.1.1 Rates and cross sections
We want to consider the reaction
where the four-momentum of particle 1 is given by
, etc.
The rate (events/unit time in some volume V) is
![]()
where
is the number density of particles
of type 1 with four-momentum
(that is, the number of
particles per unit volume). The relative velocity is
defined

where the dot product the the four vector is defined
![]()
To convince yourself that this is a reasonable definition:
1) Evaluate this in the rest frame of particle 2, which mean
. The answer is
.
2) Evaluate this for 1 and 2 being nonrelativistic, so that
. One finds the
result
![]()
Suppose the densities above are constant over the volume of
interest (some region within a star). Then the integral over
is simple, yielding
![]()
Note the factor of
. The rate should be
proportional to the number of pairs of interacting particles
in the volume. If the particles are distinct, that is just
![]()
But if the particles are identical, the sume over distinct
pairs is
![]()
2.1.2 Decay rates
Another process of interest in stars in the decay of particle 1
to possible final states, which we might number 2,3,4, etc.
In this case ``2" might stand for a final nucleus, an electron,
and an antineutrino, in the case of
decay.
The rate can be written
![]()
where each
describes a decay channel and is given
in units of 1/sec. The mean lifetime is defined
![]()
Note that the halflife is defined by
![]()
As the total decay rate is
![]()
it follows
![]()
2.1.3 Thermal distributions
The particles in our stellar plasma have a distribution of
momenta characterized by their temperature. Thus to get total
rates we need to integrate the expressions above over those
distributions.
We consider three distributions:
a) Maxwell Boltzmann distribution
![]()
b) Fermi-Dirac distribution
![]()
c) Bose-Einstein
![]()
The particle energy
above is
.
The Fermi-Dirac distribution describes identical fermions.
We write
![]()
so that as
,

Thus
is often called the Fermi level, as it divides
the low-energy completely occupied levels from the higher energy
completely unoccupied levels. Of course, at finite temperatures,
this demarcation is not sharp.
We can integrate over some finite, uniform volume V to count
the total number of contained fermions
![]()
In this expression (
is the particle four-momentum
and
represents the REMAINING degeneracy of the
quantum level of energy
, e.g., perhaps the spin and
isospin degeneracy. The degeneracy due to momentum
![]()
is included explicitly in the integral. We can rewrite the
above integral as an integral over energy
![]()
Now at kT=0 the exponential goes to zero for
and infinity otherwise. Therefore
![]()
which then defines the Fermi energy
in terms
of the number density

Note for electrons, with two spin states, the second factor
on the RHS would be 1/2 (
= 1/2).
It should be clear that for general kT, one has to solve the
full equation [*] to relate
to
. And
it turns out the
is a slowly varying function
of kT. A picture of
, the number of particles
of energy
, is sketched on the following page. The
region around the Fermi surface gradually ``softens" as kT
is increased, in accordance with the naive expectation that
particles with kT of the Fermi surface ought to be occasionally
excited above the Fermi surface.
The Maxwell-Boltzmann distribution describes the behavior of
identical, distinguishable particles and can be thought of as the
classical limit of Fermi-Dirac statistics, where quantum effects
associated with exchange are unimportant. The common
situation we will encounter is when the density is low (so that
goes to 0) and the particles are nonrelativistic.
Then
is a large number, and the Fermi-Dirac
distribution goes over to the Maxwell-Boltzmann distribution.
We used this result in the big bang discussion, where these two
conditions are met.
Some typical uses of the Maxwell-Boltzmann distribution in
astrophysics:
Describing the occupation of levels in
well-isolated atoms. This is appropriate when quantum effects
due to electrons in theplasma and due to other atoms are
unimportant.
Describing molecular excitations, such as rotations.
As an example, consider a two-level atom, that is, one with a
ground state (which we will take to be 1s
) and an
excited state 1p
. The MB weights are, respectively,
![]()
Thus the population of the excited state is
![]()
The result we will use frequently is the Maxwell-Boltzmann
velocity distribution law, which comes immediately from [*]
![]()
We will not use the Bose-Einstein distribution, which describes
the distribution of identical bosons. But it has astrophysics
applications in matters such as pion and kaon condensation in
dense nuclear matter, etc.
2.1.4 Thermally averaged rates
We now discuss reactions of nonrelativistic charged nuclei
in a stellar plasma, where the nuclei have a distribution of velocities.
The rate formula discussed earlier for 1+2
1'+2'
can be generalized to take care of the velocity distribution
![]()
where
represents a thermal average. Note
that we have written the cross section as a function of the
relative velocity v: this is ok as the total cross section
is invariant under Galilean transformations (as is the rate),
so it must have this form.
Now we use our Maxwell-Boltzmann velocity distribution
![]()
to define this thermal average
![]()
We introduce the center-of-mass and relative velocities
![]()
![]()
so that
![]()
![]()
With these definitions,
![]()
where
is the reduced mass.
Now
![]()
But
![]()
Therefore
![]()
If we make this transformation in our expression for r, the
entire dependence on
is
![]()
So we derive our desired result
![]()
important result:
the relative velocity distribution
is a Maxwellian
based on the reduced mass
This can be written
![]()
In the center of mass
![]()
so that
![]()
Thus
![]()
leading to
![]()
Therefore
and
![]()
2.1.5 Nonresonant reactions
Nuclear reactions of various types can occur in stars. The
first division is between charged reactions and neutron-induced
reactions. The physics distinctions are the Coulomb barrier
suppression of the former, and the need for a neutron source
in the latter.
The charged particle reactions can also be divided in several
classes. First, it is helpful to develop a general physical
picture of the process 1 + 2
1' + 2' as the
merging of 1 and 2 to form a compound nucleus, followed by
the decay of that nucleus into 1' + 2'. The notion of the
compound nucleus is important: a nucleus is formed that is
clear unstable, as it was formed from 1+2 and therefore can
decay at least into the 1+2 channel. Yet it is a long-lived
state in the sense that it exists for a time much much longer
than the transit time of a nucleon to cross the nucleus.
Although the picture is not entirely accurate, it is
nevertheless helpful to envision the following analogy.
Imagine a shallow ashtray, the bottom of which has a
fairly uniform covering of marbles. Now put a marble on the
flat lip of the ashtray and give it a push, so that it rolls
to the bottom of the ashtray with some kinetic energy.
All collisions will be assumed elastic. Thus the system
that one has created is unstable: there is enough energy for
the system to eject the marble back to the lip of the
ashtray and thus off to infinity. But once the marble collides
with the other marbles in the bottom of the ashtray, the
energy is shared among the marbles. It becomes extremely
improbable for one marble to get all of the energy, enabling
it to escape. This is thus the picture of a compound nucleus,
an unstable state that nevertheless is long lived, as it can
only fission by a very improbable circumstance where one
nucleon (or group of nucleons) acquires sufficient energy
to escape.
If one probes a nucleus above its particle breakup threshold -
this would be the intermediate nucleus in the discussion above -
one will observe resonances, states that are not eigenstates
but instead are unstable and thus have some finite spread in
energy. You may be familiar with some examples from quantum
mechanics: the case often first studied is the shape resonances
that occur when scattering particles off a well, such as a
square well. Such states usually carry a large fraction of
the scattering strength and can be thought of as quasistationary
states.
The charged particles break up into two classes, resonant (where
the incident energy corresponds to a resonance) and nonresonant.
The first applications we will make involve nonresonant
reactions, so this is the example we will do in some detail.
A picture of a nonresonant charged particle reaction is shown in
the figure. It depicts barrier penetration: the incident
energy is well below the Coulomb barrier, so the classical
turning point is well outside the region of the strong potential
where fusion can occur. But this energy is not coincident with
any of the resonant quasistates.
Suppose we were interested in the reaction
![]()
where
is the intermediate nucleus formed in the fusion.
To calculate the cross section, it will prove sufficient to
ask the following question: given the nucleus
, what is
the probability for it to decay into the channels
and
? The former will be related by
time reversal to the probability for forming the compound
nucleus.
For definiteness we ask for the rate for decaying into
. This is
![]()
where r is the relative coordinate of the
and
.
This can be written
![]()
![]()
Note that
is a constant for very large r.
We can write this result as follows
![]()
![]()
where
is a strong interaction quantity
that depends on the wave function at the nuclear radius.
The first term above is the penetration factor, the square of
the ratio of the wave function at the nuclear surface to
that an infinity. If the Coulomb barrier is high, this
penetration factor will be very small because the tunneling
probability is low. The simplest estimate of this would
come from treating the wave function as a pure Coulomb
wave function. The Coulomb radial equation is
![]()
where r is the relative
-
coordinate.
Defining
![]()
the outgoing solution corresponds to the following combination
of the standard Coulomb functions
![]()
Thus we find the penetration factor
![]()
And values for the penetration could be obtained by looking
up numerical values.
Now there is a very nice improvement in this approach that
takes into account the effects of the nuclear potential.
The method is described in Clayton and is based on the WKB
approximation, in which the Schroedinger equation is solved
via an expansion in powers of
. Thus this is a
semiclassical approximation. The derivation takes a full
lecture and thus is not appropriate here. So I will just
quote the answer and refer those interested to Clayton.

where
is the Coulomb potential at
the nuclear surface and
is the nuclear radius. This
expression for the penetration factor consists of three terms
The leading Gamow factor, which also comes from the
l=0 Coulomb expression we derived earlier
The effects of the angular momentum barrier,
proportional to l(l+1), which suppresses the contributions
of higher partial waves
The third term shows that the nuclear radius effects
the penetration
If we take some reaction like
, the
theory of compound nucleus reactions gives the cross section
for
(e.g.,
= 1+2 and
= 1'+2') as
![]()
Here
is the energy of the nearest resonance,
is the total
width, and k is the wave number. Widths are related to the
decay rate we have calculated by
and thus have the units of energy: the larger the width, the
faster the decay, in accordance with the uncertainty
principle. And
, so the wave number k has the
dimensions of 1/length. Thus it is clear that the cross
section so defined has the proper units.
Now the definition of a nonresonant reaction is that
is much larger that
, so that one is a long way from
the resonance. The denominator above is then relatively
smooth: it can be quite smooth if there are a number of
contributing distant resonances. Noting
![]()
it follows that
![]()
motivating the definition of the S-factor
![]()
Effectively what one has done is to remove the most rapid
dependence on energy, the dependence that would correspond
to the s-wave interaction of two charged particles. What
remains is a much more gently changing function S(E), which
contains a lot of physics: the effects of finite nuclear
size, high partial waves, etc. The importance of the S(E)
is that it can be fitted to experimental cross section
measurements made at energies higher than those characteristic
of stars. But if S(E) evolves slowly, it can be extrapolated
to lower energies that are relevant to stellar burning.
This limits the need for nuclear theory: one needs to
estimate the shape of S(E) as a function of E, but not its
magnitude, as the magnitude can be pegged to experiment.
This is the strategy followed for the nonresonant reactions
of interest in solar burning.
2.1.6 Thermally averaged cross sections
The leading Coulomb effect - the Gamow penetration factor - is
a sharply rising function of E. The Maxwell-Boltzmann
distribution has an exponentially declining high-energy tail.
Thus one immediately sees that
involves a sharp competition between these two effects,
leading to some compromise most-effective-energy. This is
illustrated in the figure. We can determine this energy:
![]()
Recalling
and defining
![]()
this integral becomes
![]()
Clearly the exponential is small at small E and at large E.
Now S(E) is assumed to be a slowly varying function. The
standard method for estimating such an integral, then, is
to find the energy that maximizes the exponential, and expand
around this peak in the integrand. This corresponds to
solving
![]()
The solution is
![]()
We now expand the argument of the exponential around this peak
energy
![]()
But as
vanishes by definition of
![]()
It follows
![]()
![]()
In deriving this result, we have assume S(E) is slowly varying
in the vicinity of the integrand peak at
, and thus can
be replaced by its value at the peak. Note our formula could
easily be improved by doing a Taylor expansion on S(E)
![]()
Thus our final answer would have an additional contribution
due to
.
But if we just keep
, the integral can be done, yielding
![]()

Now
![]()
![]()
Thus

With a little algebra this can be reexpressed
![]()
Now we define a quantity A by
![]()
where
is the nucleon mass. Substituting this in, evaluating
some constants, and dividing out the dimensions of S (note
S has the units of a cross section times energy) yields
![]()
Note that the overall dimensions are clearly 1/(cm
sec),
as the number densities have units 1/cm
. Also remember
that a barn =
cm
.
Now
defines the peak of the contributions to
. From its definition
![]()
![]()
where the speed of light has been reinserted to make it explicit
that this quantity carries no units. For example, in the center
of our sun
K
1.3 keV. So if we
plug in the appropriate numbers for the
He+
He reaction one finds
![]()
One could compare this to the average energy of a Maxwell-
Boltzmann distribition of paticles of
E
3 kT. Thus, indeed, the reactions are occurring far out on the
Boltzmann tail, where nuclei have a better chance of penetrating
the Coulomb barrier.
It might be helpful at this point to walk through the example
of
C+p going to
N. If we define the zero of energy
as that of the
C nucleus and proton at rest, then
N is bound by 1.943 MeV. Furthermore there is a
resonance in
at 2.367 MeV, 424 keV above the zero
of energy. Thus a
C+p collision at a center-of-mass
energy of 424 keV would be directly on resonance.
In the lab frame, this corresponds to a 460 keV proton incident
on a
C nucleus at rest.
The cross section is
![]()
![]()
One can reexpress the S-factor, then, as
![]()
Now the product of the exponential and
on the right
should be roughly energy independent, as the exponential cancels
the penetration probability buried in
. Thus the
assumption the S(E) is weakly energy dependent requires
that one not be too close to the resonance.
If one examines this system experimentally, the results are as shown in the figures at the back of the previous set of notes. Note that S(E) is quite smooth below above 300 keV. Thus data in the 100-300 keV range can be used to extrapolate the measured cross section to the region of interest for p burning via the CNO cycles. In contrast, the raw cross section varies over 9 orders of magnitude. Note that the theory curve, which takes into account the resonance, does quite well throughout the illustrated region. Thus the success of theory in the 100-400 keV region gives one great confidence that the values extrapolated to 20-50 keV are correct. The sharp steeping of S(E) above 400 keV lab energy is a clear indication of the resonance at 460 keV lab proton energy.
What about resonant reactions?
That is, suppose we had some astrophysical setting where the
relevant value of
was not as above (
keV), but
in fact sat on the resonance at 424 keV center-of-mass energy?
If the resonance is narrow (usually the case) compared to the
typical spred of relevant energies of the colliding nuclei,
=
![]()

The integral is 2
/
, so
![]()
If the only open channels for decay of the compound nucleus are
proton and
emission, then
.
If
greatly exceeds
, then
![]()
That is, the rate depends only on the
width. The
opposite limit, a very small
, which might occur in
a low energy resonance in a high Z target, yields a rate that
depends only on
, which governs the formation
probability of the compound nucleus.
2.1.7 The pp chain and the standard solar model
We now start a discussion of stars that burn hydrogen through the
pp and CNO cycles, such as our sun and similar stars. Almost
all stars lying along the main sequence - perhaps 80% of the
stars we observed - are thought to be hydrogen burning.
The main sequence is a track of stars in the Hertzsprung-Russell
diagram, or HR diagram. The HR diagram is a plot of stars
on a plane where the vertical axis is the luminosity and the
horizontal axis is the surface temperature (as measured by the
color of the star). Stellar luminosities vary from
that of our sun, with surface temperatures
vary from 2000-50000K.
The most obvious properties that one can use to characterize a
star are its surface temperature
, luminosity L, and radius R. The former
two are accessible to observation, but generally the radius is
not. Yet it is easy to see that there is a relationship between
these properties. If we pretend stars radiate as black bodies,
then the energy emitted per unit time per unit surface area is
given by the Stefan-Boltzmann black-body radiation law,
, where
. Thus the star's
luminosity is
![]()
We can normalize things to solar
properties to rewrite this as
![]()
Though this result is true only for a blackbody, it makes it
plausible that a plot of luminosity vs. temperature might yield
a one-dimensional path in the plane parameterized by the radius,
and thus the mass, of the star, provided that the stars have
similar internal structure. For example, if a class of stars radiated as
blackbodies, the trajectory would be as described above.
This was basically the discovery of Hertzsprung and Russell.
The HR diagram on the next page shows a dominant trajectory -
the main sequence - running from high temperature to low
temperature. It also shows other classes of stars that reside well
off the main sequence. The sun is situated on the main
sequence according to its observed surface
temperature of about 6500 K. Stars at the upper left -
on the main sequence
with temperatures 4 times that of the sun and luminosities
6 orders of magnitude larger - would have a radius about
60 times that of our sun.
The red, cool, dwarf stars in the lower right of the main
sequence, with luminosities about 2000 times lower than the sun and
temperatures about half that of the sun, have radii
about 0.1 that of the sun.
Other classes of stars are well separated from the main sequence.
One group has luminosities on the order of
and temperatures again about half that of the sun.
Thus these supergiants would correspond to a radius about 400
times that of the sun. Red giants, which form another patch
off the main sequence, have a radius about 50 times that of
our sun.
White dwarfs - with luminosities about 1/200 of solar and
temperatures twice solar - would correspond to a radius of about
1/50th that of the sun. These sit well below the main sequence.
The sun is our ``test case" for developing a theory of main-sequence
stellar evolution. We know far more about this star - its age,
luminosity, radius, surface composition, and even its neutrino
luminosity and helioseismology - that any other star.
Solar models trace the evolution of the sun over the past
4.6 billion years of main sequence burning, thereby predicting
the present-day temperature and composition profiles of the solar
core that govern energy production. Standard solar models
share four basic assumptions:
The sun evolves in hydrostatic equilibrium, maintaining
a local balance between the gravitational force and the pressure
gradient. To describe this condition in detail, one must
specify the equation of state as a function of temperature,
density, and composition.
Energy is transported by radiation and convection. While
the solar envelope is convective, radiative transport dominates
in the core region where thermonuclear reactions take place.
The opacity depends sensitively on the solar composition, particularly
the abundances of heavier elements.
Thermonuclear reaction chains generate solar energy.
The standard model predicts this energy
is produced from the conversion of four protons into
He.
![]()
About 98% of the time this occurs through the pp chain,
with the CNO cycle contributing the remaining 2%. The sun is
a large but slow reactor: the core temperature,
K,
results in typical center-of-mass energies for reacting particles
of
10 keV, much less than the Coulomb barriers inhibiting
charged particle nuclear reactions. Thus reaction cross
sections are small, and one must go to significantly higher
energies before laboratory measurements are feasible. These
laboratory data must then be extrapolated to the solar energies of
interest, as we discussed previously.
The model is constrained to produce today's solar
radius, mass, and luminosity. An important assumption of
the standard model is that the sun was highly convective,
and therefore uniform in composition, when it first
entered the main sequence. It is furthermore assumed
that the surface abundances of metals (nuclei with A > 5)
were undisturbed by the subsequent evolution, and thus
provide a record of the initial solar metallicity. The
remaining parameter is the initial
He/H ratio, which
is adjusted until the model reproduces the present solar
luminosity after 4.6 billion years of evolution. The resulting
He/H mass fraction ratio is typically 0.27
0.01,
which can be compared to the big-bang value of 0.23
0.01.
(Note that today's surface abundance can differ from this value
due to diffusion of He over the lifetime of the sun.)
Note that the sun was formed from previously processed
material.
The ``standard solar model" is the terminology used to describe
models, such as that of Bahcall and Pinsonneault, that implement
the above physics in a computer code, then evolve the sun
forward from the onset of main sequence burning. Generally
calculations are one-dimensional, which means that the
physics such as convection can not arise dynamically. It
can, and sometimes is, put in phenomenologically, through
approximations such as ``mixing length theory."
The model that emerges is an evolving sun. As the
core's chemical composition changes, the opacity and
core temperature rise, producing a 44% luminosity increase
since the onset of the main sequence. Some other features of the
sun evolve even more rapidly. For example, the
B neutrino
flux, the most temperature-dependent component, proves to
be of relatively recent origin: the predicted flux
increases exponentially with a doubling period of about
0.9 billion years. This is the flux to which Ray Davis's famous
Homestake gold mine experiment is primarily sensitive.
As another example of a time-dependent feature of the sun,
the equilibrium abundance and equilibration time
for
He are both sharply increasing functions of the distance
from the solar center. Thus a steep
He density gradient is
established over time. We will shortly see that the
He
is sort of a ``caltalyst" in the pp chain, being produced and then
consumed as an intermediate step in synthesizing
He.
Such models generally do not model the earliest history of our
sun, when it first formed as a body of gas contracting under
its own gravity, heating and ionizing as the gravitational work
is done. The early contraction of the sun, when it approaches
the main sequence vertically from above in the HR diagram,
is characterized by high luminosity and convection throughout
the star, lasting for a few million years. (Actually, there is
thought to be continued convection in the core of sun for
perhaps 10
years, though this is driven by another mechanism:
the fact that the CNO cycle is burning out of equilibrium due
to initials metals in the sun.) The core of the sun reaches
radiative equilibrium first, and this region then grows
outward.
One of the problems on the homework, solar Li depletion,
illustrates one shortcoming of the standard solar model. Li
had to be burned at some point in the sun's evolution to
account for the fact that the solar surface abundance is
roughly 1/100th that found in meteorites. Perhaps this is
due to the failure to model the sun's early convective stage.
Or perhaps material can be pulled below the convective
envelope by some mechanism, to a depth where the higher
temperature allows
Li(
He)
He to occur.
Now let's turn to the pp chain. The basic equation governing
the nonresonant strong and radiative reactions is, from our
earlier work,
![]()
which, after plugging in for
, can be rewritten
![]()
This tells us that small
is favored, and that rates
are expected to rise as
. In the above,
is the temperature in units of
K.
Now there is clearly no strong or radiative capture reaction
that can initiate
He synthesis in the sun. The p+p, p+
He,
and
He+
He reactions do not release energy and thus do
not form bound states. The driving reaction of the pp chain is
a weak interaction, like those discussed in the big bang,
![]()
This is analogous to neutron or nuclear
decay, except that the
initial state is not a nucleus, but two
protons in the plasma.
If the initiating p+p
decay reaction occurs, we can
see relatively easily how the rest of the burning might
proceed:
![]()
![]()
This is called the ppI cycle and is, indeed, the most robost
part of the pp chain in stars with temperatures like our
sun: about 84% of the
He produced today in the solar core
is predicted to be synthesized in this way. The two reactions
above are of the type we have previously discussed.
Thus we need to derive something like an S-factor for a weak decay.
Qualitatively, one can see that the p+p
decay reaction
will occur if a plasma proton decays into a neutron
![]()
while a second spectator proton is close by, within the range of the
nuclear force (several fermis) so that the final n+p state
can form a bound deuteron. It is the binding energy of the
deuteron that makes this proton decay energetically possible.
The nuclear
decay rate is
![]()
(I treat neutrinos as massive fermions: Don't worry about this.
It is just a choice in normalizing neutrino spinors.)
The invariant amplitude M is, as we discussed in the big bang
section, effectively a contact interaction, because the
momentum transfered between leptons and nucleons is so much
smaller than the mass of the W boson. Thus it can be written
![]()
If the factors involving
were ignored, this would just
be the current-current interaction familiar from electromagnetism.
The factor
projects out the left-hand part of the
interacting fields. That is, the weak interaction is just like
electromagnetism except that only the left-handed components
of particle fields participate. This is correct at the
level of the bare particles taking part in weak interactions,
the quarks, electron/positron, and the neutrino. Note that
the effective interaction for p
n involves the
factor
![]()
The vector coupling is not modified because the total electric
charge is conserved. But the axial-vector coupling has a nontrivial
relation to the underlying quark couplings. Neutron
decay
gives the effective nucleon axial coupling constant of
.
As the momenta for reacting solar protons typical are of order
![]()
and as momenta of nucleons bound in deuterium are also reasonably
small (
100 MeV), the nucleons in our
decay
amplitude can be treated nonrelativistically. In this
approximation the operators in our amplitude become

Thus it is the time-like part of the vector current and the
space-like part of the axial-vector current that survive in the
nonrelativistic limit.
It follows that our
decay invariant amplitude can be
approximated by
![]()
where the
are now tw-component Pauli spinors for the nucleonss.
The above result is written for the
decay
. It is convenient to generalize it for
by introducing the isospin operators
where
n
=
p
and ![]()
p
=
n
, with
all other matrix elements being zero.
With this generalization, we can now finish the calculation.
We square the invariant amplitude, integrate over the outgoing
electron, neutrino, and final nuclear three-momenta, average
over initial nucleon spin, and sum over final nucleon spin,
electron spin, and neutrino spin. The result is
![]()
where f and i are the final and initial nucleon states, and
where m is the electron mass, w is the energy release in the
decay, and
is the outgoing electron energy.
The
operator corresponds to
decay and the
to
decay.
This result easily generalizes to nuclear decay. The operators
are replaced
![]()
![]()
The factor of
before the square of the nuclear
matrix elements is replaced by
where
is the initial nuclear spin. Finally, the Coulomb effects on the
outgoing electron or positron can be approximately accounted for
by including the Coulomb factor
![]()
where
is the electron/positron velocity and
is the s-wave Coulomb wave function in the
field of the daughter nucleus of charge
, evaluated at
the nuclear origin. Note the close relation to the Gamow
factor.
The spin-independent and spin-dependent operators appearing
above are known as the Fermi and Gamow-Teller operators.
The Fermi operator is the isospin raising/lowering operator:
in the limit of good isopsin, which typically is good to 5% or
better in the description of low-lying nuclear states,
it can only connect states in the same isospin multiplet.
That is, it is capable of exciting only one state, the state
identical to the initial state in terms of space and spin,
but with
for
and
decay, respectively.
Now the reaction of interest
![]()
produces a final nuclear state with J = 1 and isospin T=0:
this is an isospin singlet, so there is no corresponding
state in p+p that can be reached by the Fermi operator.
It follows that only the Gamow-Teller operator contributes.
Thus the matrix element that must be calculated is
![]()
Here
is the relative two-nucleon coordinate.
Thus
is the relative two-proton wave function in the
plasma. Note this expresses what we stated qualitatively before:
the
decay of the proton can only occur if there is
another, spectator proton close enough by such that the
result pn pair has a reasonable overlap with the deuteron, a
compact state. We will now see that this is unlikely to occur,
leading to a small p+p S-factor.
2.1.7 The pp chain and the standard solar model (continued)
So now we want to go about calculating the S-factor. As always
we will be a little sloppy, as we want to avoid real calculations.
But in spirit what we will do below is not too different from
the 1938 paper by Bethe and Critchfield, where the pp cross
section was first derived.
The first step is to go back to the rate formula and do the
integral over the outgoing electron energy in
decay,
ignoring the Coulomb correction for the outgoing state.
For a deuteron plus an e
, this is not too bad because
![]()
Since .511 MeV of this is needed to make the electron mass,
the outgoing electron and neutrino share .420 MeV of kinetic
energy. If half of this goes to the electron on average,
then the typical velocity of the electron is 0.7c. Thus
![]()
So the Coulomb effects can be ignored.
Therefore we can integrate the
decay formula over
electron energies to get
![]()
![]()
where
. We can then evaluate this for the
``decay" of p+p to get
and
![]()
Now at this point you should be a bit puzzled because we are
treating the p+p system as a ``nucleus" even though it refers
to collisions within the plasma. We want a cross section, or
better yet, an S-factor. What is the connection?
But this is not too hard. Let's imagine cutting out a ``box"
in our plasma of volume V such that the average number of
contained protons is 2. Our formula
for rate/vol/sec is
![]()
But remember the factor on the right is supposed to be the
number of distinct pairs times
. So for one pair
and multiplying by V, we get the rate for our pair to
interact. This is
![]()
Note v/V has the dimensions of flux.
Now just consider the box to be a big nucleus. The two protons
in the box have a relative wave function normalized to unity
in the box. The wave function at nuclear distance scales
(small compared to our box dimension) is suppressed due
to the Coulomb effects, but at larger scales it is just a plane
wave. (Or a Coulomb wave - the logic is similar.) Thus the
correct normalization for a large box is
![]()
But looking at our
decay formula for our ``nucleus",
the deuteron wave function is compact, nonzero only for r
on the order of the deuteron size. The spin part of the
matrix element could be evaluated if we had a good deuteron
wave function from solving the Schroedinger equation for
some strong potential. We won't do that, but the spin
operator carries no units and thus should be of order unity.
Thus, replacing
![]()
and taking the deuteron wave function to be constant over the
nuclear volume

we find a decay rate
![]()
But this can be written
![]()
Here P(v) is the usually penetration factor, formed from the
square of the ratio of the wave function at
and large r.
But we know the normalization of the wave function at large r,
by our box description, so
![]()
Now we equate our two expressions for
to find
![]()
Plugging in our old expressions for P(v) we get
![]()
Immediately we have the S-factor
![]()
The deuteron is relatively diffuse, and we expect the reaction
to occur on the tail of the wave function, due to the Coulomb
penetration. Thus we make the guess
10 f. It follows
![]()
![]()
![]()
Thus plugging in the numbers
![]()
Interestingly the result of a rigorous calculation is quite
close to our guess
![]()
That is, our crude guess of a unit spin matrix element and a
reaction radius of 10 f seems to be unexpectedly close.
Thus two important points can be made:
We understand the size of
and note it is
very small, 20 orders of magnitude below strong interaction
S-factors. This reaction controls the rate of pp chain
hydrogen burning.
This cross section cannot be measured in the lab:
the initial state is not a stable nucleus, and the rate is
impossible small. Thus our description of the basic process
that powers the majority of stars MUST be taken from a
first principles nuclear theory calculation. It is believed
this can be done to an accuracy of about 1%. This involves
fortunate aspects of the weak interaction, such as the fact
that exchange current contributions to the Gamow-Teller
operator are only of order (v/c)
1%.
Theory also determines the shape of S(E)
![]()
Thus at 10 keV, this slope generates a 10% correction to
the S-factor.
Now that we know the S-factor, we can plug it into our rate
formula to determine the rate of the p+p reaction. Recall

So in the core of the sun, where
,
and
keV, the most effective energy
for the p+p reaction. The rate formula then gives
![]()
Now the number density at the center of the sun is about
/cm
, so
![]()
So the time scale for burning hydrogen is the number
density divided by twice the burning rate (two protons are
consumed per reaction)
![]()
which can be compared to the sun's present age, 4.55 b.y.
Thus it has lived about half its lifetime.
2.1.7 The pp chain and the standard solar model (continued)
The work just completed gives us the basic tools needed to build
a ``network" calculation of the ppI cycle. The contributing
reactions are
![]()
![]()
![]()
Here r represents a rate and
.
From one calculated S-factor (for pp) and two that are measured,
we could calculate the production of He once the composition and
temperature of our volume of interest was specified.
One feature of interest in this simple network is that d and
He
both act as ``catalysts": they are produced and then consumed in
the burning. In a steady state process, this implies they must
reach some equilibrium abundance where the production rate
equals the destruction rate. That is, the general rate
equation
![]()
is satisfied at equilibrium by replacing the LHS by zero. Thus
![]()
But from the S-factors
![]()
and our rate formula
![]()
We can plug in the values
![]()
![]()
to find
![]()
Therefore this ratio is a decreasing function of
:
the higher the temperature, the lower the equilibrium abundance
of deuterium. Therefore in the region of the sun where the
ppI cycle is operating, the deuterium abundance is lowest in
the sun's center. Plugging in the solar core temperature
![]()
There isn't much deuterium about: using
/cm
one finds
/cm
. Remembering
our previous result
![]()
it follows that the typical life time of a deuterium nucleus is
![]()
That is, deuterium is burned instantaneously and thus reaches
equilibrium very, very quickly.
This result then allows us to write the analogous equation for
He as
![]()
where the factor of two in the term on the right comes because
the
He+
He reaction destroys two
He nuclei.
Thus at equilibrium

Using
![]()
we can again do the rate algebra to find

This ratio is clearly a sharply decreasing function of
and thus a sharply increasing function of r. That
is, a sharp gradient in
He is established in the sun.
One can estimate the time required to reach equilibrium in
a simple way: as the burning of
He is quadratic in the
abundance, it will not become significant until one is
rather close to the
He equilibrium value. Thus a
reasonable estimate of the time require to get close (say
a factor of 2) to equilibrium is just the time required to
produce the requisite number of
He nuclei.
At the sun's center, we have found that the
He abundance
is
, where we have assumed 75% of
the matter is protons (as it was when the sun first entered
the main sequence). Thus
![]()
The same calculation at
, where a reasonable
solar density of 36 g/cm
and a 75% proton abundance is
used, gives
![]()
![]()
![]()
It turns out that at temperatures of
, the
equilibration time corresponds to the present age of the sun.
This temperature characterizes a solar radius of about 0.27,
at the very edge of the energy-producing core. The resulting
interesting profile of
He is shown in the figure.
There are some interesting issues connected with this
He
gradient. It was shown by Dilke and Gough that it implies
the sun is overstable to large-amplitude radial oscillations.
If one throws a
He rich volume element towards the core,
the
He will ignite at the higher temperatures, become
bouyant, and return to its original equilibrium position with
a kinetic energy greater than the required for the original
perturbation. This has lead to speculations that the
He
gradient could trigger sudden overturn of the core. Most
of the experts believe there is no large amplitude trigger
that will allow the sun to discover the existence of this
instability.
To be somewhat more complete, the initial step of the ppI cycle
can occur in a different way
![]()
But the electron capture process only accounts for about 1%
of the pp reactions. Thus the full ppI cycle can be written
![]()
![]()
![]()
However there are two other paths through the pp chain that
can occur if
He burns by another path. Thus
determines
ppI vs. ppII+ppIII
The splitting between the ppII and ppIII cycles depends on the
fate of the
Be
determines
ppII vs. ppIII
Thus the two additional cycles are
![]()
![]()
![]()
![]()
![]()
and
![]()
![]()
![]()
![]()
![]()
![]()
The calculations presented below will show that the competition
between the three cycles is quite sensitive to the interior
temperature of the sun, and to core composition. We also note
that, in principle, we can experimentally determine the
relative importance of the three cycles: the cycles are
distinguished by the neutrinos they produce.
![]()
![]()
![]()
Peter Doe will tell you about the exciting progress in measuring
these fluxes in his lecture Monday.
The
He+
He
He+
He branching:
The Coulomb effects for these two reactions are rather similar,
except for small effects proportional to the different
masses. The heavier nucleus moves more slowly, and thus is at
a disadvantage in overcoming Coulomb barriers. The S-factor
for the 3+3 reaction is larger than that for the 3+4 reaction
by almost a factor of
. However we have also seen that
the core ambundance of
He is almost a factor of
less than that of
He. The net result, from our rate
formulas, is
![]()
Noting that
![]()
and using
keV b leads to
![]()
It is clear that higher temperatures favor the ppII+ppIII cycles.
We can solve for the temperature where the ratio becomes 1,
![]()
Thus the 3+3 reaction dominates at solar temperatures, but not
in stars that are
10% hotter. At the center of the sun,
where
1.5, the ratio is 0.6; at
1.2,
corresponding to a radius half way through the energy producing
core, the ratio is 0.12. The figure, from Cameron, shows the
ppI/ppII+ppIII branching as a function of
.