Nuclear Astrophysics
Physics and Astronomy Department at the University of Washington



Nuclear Reactions in Stars

This chapter is divided into three parts:
tex2html_wrap_inline2893 preliminaries: relating rates to cross sections; thermal distrubutions; thermally averaged rates; and the S-factor
tex2html_wrap_inline2893 application to the pp chain
tex2html_wrap_inline2893 He burning

2.1 Reactions

2.1.1 Rates and cross sections
We want to consider the reaction tex2html_wrap_inline2899 where the four-momentum of particle 1 is given by tex2html_wrap_inline2901, etc. The rate (events/unit time in some volume V) is
displaymath2539
where tex2html_wrap_inline2903 is the number density of particles of type 1 with four-momentum tex2html_wrap_inline2901 (that is, the number of particles per unit volume). The relative velocity is defined
displaymath2540
where the dot product the the four vector is defined
displaymath2541
To convince yourself that this is a reasonable definition:
1) Evaluate this in the rest frame of particle 2, which mean tex2html_wrap_inline2907. The answer is tex2html_wrap_inline2909.
2) Evaluate this for 1 and 2 being nonrelativistic, so that tex2html_wrap_inline2911. One finds the result
displaymath2542

Suppose the densities above are constant over the volume of interest (some region within a star). Then the integral over tex2html_wrap_inline2913 is simple, yielding
displaymath2543
Note the factor of tex2html_wrap_inline2915. The rate should be proportional to the number of pairs of interacting particles in the volume. If the particles are distinct, that is just
displaymath2544
But if the particles are identical, the sume over distinct pairs is
displaymath2545

2.1.2 Decay rates
Another process of interest in stars in the decay of particle 1 to possible final states, which we might number 2,3,4, etc. In this case ``2" might stand for a final nucleus, an electron, and an antineutrino, in the case of tex2html_wrap_inline2917 decay.

The rate can be written
displaymath2546
where each tex2html_wrap_inline2919 describes a decay channel and is given in units of 1/sec. The mean lifetime is defined
displaymath2547
Note that the halflife is defined by
displaymath2548
As the total decay rate is
displaymath2549
it follows
displaymath2550

2.1.3 Thermal distributions
The particles in our stellar plasma have a distribution of momenta characterized by their temperature. Thus to get total rates we need to integrate the expressions above over those distributions.

We consider three distributions:
a) Maxwell Boltzmann distribution
displaymath2551
b) Fermi-Dirac distribution
displaymath2552
c) Bose-Einstein
displaymath2553
The particle energy tex2html_wrap_inline2921 above is tex2html_wrap_inline2923.

The Fermi-Dirac distribution describes identical fermions. We write
displaymath2554
so that as tex2html_wrap_inline2925,
displaymath2555
Thus tex2html_wrap_inline2927 is often called the Fermi level, as it divides the low-energy completely occupied levels from the higher energy completely unoccupied levels. Of course, at finite temperatures, this demarcation is not sharp.

We can integrate over some finite, uniform volume V to count the total number of contained fermions
displaymath2556
In this expression (tex2html_wrap_inline2929 is the particle four-momentum and tex2html_wrap_inline2931 represents the REMAINING degeneracy of the quantum level of energy tex2html_wrap_inline2933, e.g., perhaps the spin and isospin degeneracy. The degeneracy due to momentum
displaymath2557
is included explicitly in the integral. We can rewrite the above integral as an integral over energy
displaymath2558

Now at kT=0 the exponential goes to zero for tex2html_wrap_inline2937 tex2html_wrap_inline2927 and infinity otherwise. Therefore
displaymath2559
which then defines the Fermi energy tex2html_wrap_inline2927 in terms of the number density tex2html_wrap_inline2943
displaymath2560
Note for electrons, with two spin states, the second factor on the RHS would be 1/2 (tex2html_wrap_inline2931 = 1/2).

It should be clear that for general kT, one has to solve the full equation [*] to relate tex2html_wrap_inline2927 to tex2html_wrap_inline2951. And it turns out the tex2html_wrap_inline2927 is a slowly varying function of kT. A picture of tex2html_wrap_inline2957, the number of particles of energy tex2html_wrap_inline2933, is sketched on the following page. The region around the Fermi surface gradually ``softens" as kT is increased, in accordance with the naive expectation that particles with kT of the Fermi surface ought to be occasionally excited above the Fermi surface.

The Maxwell-Boltzmann distribution describes the behavior of identical, distinguishable particles and can be thought of as the classical limit of Fermi-Dirac statistics, where quantum effects associated with exchange are unimportant. The common situation we will encounter is when the density is low (so that tex2html_wrap_inline2927 goes to 0) and the particles are nonrelativistic. Then tex2html_wrap_inline2967 is a large number, and the Fermi-Dirac distribution goes over to the Maxwell-Boltzmann distribution. We used this result in the big bang discussion, where these two conditions are met.

Some typical uses of the Maxwell-Boltzmann distribution in astrophysics:
tex2html_wrap_inline2893 Describing the occupation of levels in well-isolated atoms. This is appropriate when quantum effects due to electrons in theplasma and due to other atoms are unimportant.
tex2html_wrap_inline2893 Describing molecular excitations, such as rotations.

As an example, consider a two-level atom, that is, one with a ground state (which we will take to be 1stex2html_wrap_inline2973) and an excited state 1ptex2html_wrap_inline2975. The MB weights are, respectively,
displaymath2561
Thus the population of the excited state is
displaymath2562

The result we will use frequently is the Maxwell-Boltzmann velocity distribution law, which comes immediately from [*]
displaymath2563

We will not use the Bose-Einstein distribution, which describes the distribution of identical bosons. But it has astrophysics applications in matters such as pion and kaon condensation in dense nuclear matter, etc.

2.1.4 Thermally averaged rates
We now discuss reactions of nonrelativistic charged nuclei in a stellar plasma, where the nuclei have a distribution of velocities. The rate formula discussed earlier for 1+2 tex2html_wrap_inline2977 1'+2' can be generalized to take care of the velocity distribution
displaymath2564
where tex2html_wrap_inline2979 represents a thermal average. Note that we have written the cross section as a function of the relative velocity v: this is ok as the total cross section is invariant under Galilean transformations (as is the rate), so it must have this form.

Now we use our Maxwell-Boltzmann velocity distribution
displaymath2565
to define this thermal average
displaymath2566
We introduce the center-of-mass and relative velocities
displaymath2567

displaymath2568
so that
displaymath2569

displaymath2570
With these definitions,
displaymath2571
where tex2html_wrap_inline2983 is the reduced mass.

Now
displaymath2572
But
displaymath2573
Therefore
displaymath2574

If we make this transformation in our expression for r, the entire dependence on tex2html_wrap_inline2987 is
displaymath2575
So we derive our desired result
displaymath2576
important result:
the relative velocity distribution
is a Maxwellian
based on the reduced mass This can be written
displaymath2577

In the center of mass
displaymath2578
so that
displaymath2579
Thus
displaymath2580
leading to
displaymath2581
Therefore tex2html_wrap_inline2989 and
displaymath2582

2.1.5 Nonresonant reactions
Nuclear reactions of various types can occur in stars. The first division is between charged reactions and neutron-induced reactions. The physics distinctions are the Coulomb barrier suppression of the former, and the need for a neutron source in the latter.

The charged particle reactions can also be divided in several classes. First, it is helpful to develop a general physical picture of the process 1 + 2 tex2html_wrap_inline2977 1' + 2' as the merging of 1 and 2 to form a compound nucleus, followed by the decay of that nucleus into 1' + 2'. The notion of the compound nucleus is important: a nucleus is formed that is clear unstable, as it was formed from 1+2 and therefore can decay at least into the 1+2 channel. Yet it is a long-lived state in the sense that it exists for a time much much longer than the transit time of a nucleon to cross the nucleus. Although the picture is not entirely accurate, it is nevertheless helpful to envision the following analogy. Imagine a shallow ashtray, the bottom of which has a fairly uniform covering of marbles. Now put a marble on the flat lip of the ashtray and give it a push, so that it rolls to the bottom of the ashtray with some kinetic energy. All collisions will be assumed elastic. Thus the system that one has created is unstable: there is enough energy for the system to eject the marble back to the lip of the ashtray and thus off to infinity. But once the marble collides with the other marbles in the bottom of the ashtray, the energy is shared among the marbles. It becomes extremely improbable for one marble to get all of the energy, enabling it to escape. This is thus the picture of a compound nucleus, an unstable state that nevertheless is long lived, as it can only fission by a very improbable circumstance where one nucleon (or group of nucleons) acquires sufficient energy to escape.

If one probes a nucleus above its particle breakup threshold - this would be the intermediate nucleus in the discussion above - one will observe resonances, states that are not eigenstates but instead are unstable and thus have some finite spread in energy. You may be familiar with some examples from quantum mechanics: the case often first studied is the shape resonances that occur when scattering particles off a well, such as a square well. Such states usually carry a large fraction of the scattering strength and can be thought of as quasistationary states.

The charged particles break up into two classes, resonant (where the incident energy corresponds to a resonance) and nonresonant. The first applications we will make involve nonresonant reactions, so this is the example we will do in some detail.

A picture of a nonresonant charged particle reaction is shown in the figure. It depicts barrier penetration: the incident energy is well below the Coulomb barrier, so the classical turning point is well outside the region of the strong potential where fusion can occur. But this energy is not coincident with any of the resonant quasistates.

Suppose we were interested in the reaction
displaymath2583
where tex2html_wrap_inline2993 is the intermediate nucleus formed in the fusion. To calculate the cross section, it will prove sufficient to ask the following question: given the nucleus tex2html_wrap_inline2993, what is the probability for it to decay into the channels tex2html_wrap_inline2997 and tex2html_wrap_inline2999? The former will be related by time reversal to the probability for forming the compound nucleus.

For definiteness we ask for the rate for decaying into tex2html_wrap_inline2997. This is
displaymath2584
where r is the relative coordinate of the tex2html_wrap_inline3005 and tex2html_wrap_inline3007. This can be written
displaymath2585

displaymath2586
Note that tex2html_wrap_inline3009 is a constant for very large r. We can write this result as follows
displaymath2587

displaymath2588
where tex2html_wrap_inline3011 is a strong interaction quantity that depends on the wave function at the nuclear radius.

The first term above is the penetration factor, the square of the ratio of the wave function at the nuclear surface to that an infinity. If the Coulomb barrier is high, this penetration factor will be very small because the tunneling probability is low. The simplest estimate of this would come from treating the wave function as a pure Coulomb wave function. The Coulomb radial equation is
displaymath2589
where r is the relative tex2html_wrap_inline3013-tex2html_wrap_inline3007 coordinate. Defining
displaymath2590
the outgoing solution corresponds to the following combination of the standard Coulomb functions
displaymath2591
Thus we find the penetration factor
displaymath2592
And values for the penetration could be obtained by looking up numerical values.

Now there is a very nice improvement in this approach that takes into account the effects of the nuclear potential. The method is described in Clayton and is based on the WKB approximation, in which the Schroedinger equation is solved via an expansion in powers of tex2html_wrap_inline3017. Thus this is a semiclassical approximation. The derivation takes a full lecture and thus is not appropriate here. So I will just quote the answer and refer those interested to Clayton.
displaymath2593
where tex2html_wrap_inline3019 is the Coulomb potential at the nuclear surface and tex2html_wrap_inline3021 is the nuclear radius. This expression for the penetration factor consists of three terms
tex2html_wrap_inline2893 The leading Gamow factor, which also comes from the l=0 Coulomb expression we derived earlier
tex2html_wrap_inline2893 The effects of the angular momentum barrier, proportional to l(l+1), which suppresses the contributions of higher partial waves
tex2html_wrap_inline2893 The third term shows that the nuclear radius effects the penetration

If we take some reaction like tex2html_wrap_inline3031, the theory of compound nucleus reactions gives the cross section for tex2html_wrap_inline3033 (e.g., tex2html_wrap_inline3035 = 1+2 and tex2html_wrap_inline2917 = 1'+2') as
displaymath2594
Here tex2html_wrap_inline3039 is the energy of the nearest resonance, tex2html_wrap_inline3041 is the total width, and k is the wave number. Widths are related to the decay rate we have calculated by tex2html_wrap_inline3043 and thus have the units of energy: the larger the width, the faster the decay, in accordance with the uncertainty principle. And tex2html_wrap_inline3045, so the wave number k has the dimensions of 1/length. Thus it is clear that the cross section so defined has the proper units. Now the definition of a nonresonant reaction is that tex2html_wrap_inline3047 is much larger that tex2html_wrap_inline3049, so that one is a long way from the resonance. The denominator above is then relatively smooth: it can be quite smooth if there are a number of contributing distant resonances. Noting
displaymath2595
it follows that
displaymath2596
motivating the definition of the S-factor
displaymath2597
Effectively what one has done is to remove the most rapid dependence on energy, the dependence that would correspond to the s-wave interaction of two charged particles. What remains is a much more gently changing function S(E), which contains a lot of physics: the effects of finite nuclear size, high partial waves, etc. The importance of the S(E) is that it can be fitted to experimental cross section measurements made at energies higher than those characteristic of stars. But if S(E) evolves slowly, it can be extrapolated to lower energies that are relevant to stellar burning. This limits the need for nuclear theory: one needs to estimate the shape of S(E) as a function of E, but not its magnitude, as the magnitude can be pegged to experiment. This is the strategy followed for the nonresonant reactions of interest in solar burning.

2.1.6 Thermally averaged cross sections
The leading Coulomb effect - the Gamow penetration factor - is a sharply rising function of E. The Maxwell-Boltzmann distribution has an exponentially declining high-energy tail. Thus one immediately sees that tex2html_wrap_inline3051 involves a sharp competition between these two effects, leading to some compromise most-effective-energy. This is illustrated in the figure. We can determine this energy:
displaymath2598
Recalling tex2html_wrap_inline3053 and defining
displaymath2599
this integral becomes
displaymath2600
Clearly the exponential is small at small E and at large E.

Now S(E) is assumed to be a slowly varying function. The standard method for estimating such an integral, then, is to find the energy that maximizes the exponential, and expand around this peak in the integrand. This corresponds to solving
displaymath2601
The solution is
displaymath2602
We now expand the argument of the exponential around this peak energy
displaymath2603
But as tex2html_wrap_inline3055 vanishes by definition of tex2html_wrap_inline3057
displaymath2604

It follows
displaymath2605

displaymath2606
In deriving this result, we have assume S(E) is slowly varying in the vicinity of the integrand peak at tex2html_wrap_inline3057, and thus can be replaced by its value at the peak. Note our formula could easily be improved by doing a Taylor expansion on S(E)
displaymath2607
Thus our final answer would have an additional contribution due to tex2html_wrap_inline3061.

But if we just keep tex2html_wrap_inline3063, the integral can be done, yielding
displaymath2608

displaymath2609
Now
displaymath2610

displaymath2611
Thus
displaymath2612
With a little algebra this can be reexpressed
displaymath2613

Now we define a quantity A by
displaymath2614
where tex2html_wrap_inline3065 is the nucleon mass. Substituting this in, evaluating some constants, and dividing out the dimensions of S (note S has the units of a cross section times energy) yields
displaymath2615
Note that the overall dimensions are clearly 1/(cmtex2html_wrap_inline3071sec), as the number densities have units 1/cmtex2html_wrap_inline3071. Also remember that a barn = tex2html_wrap_inline3075 cmtex2html_wrap_inline3077.

Now tex2html_wrap_inline3057 defines the peak of the contributions to tex2html_wrap_inline3081. From its definition
displaymath2616

displaymath2617
where the speed of light has been reinserted to make it explicit that this quantity carries no units. For example, in the center of our sun tex2html_wrap_inline3083K tex2html_wrap_inline3085 1.3 keV. So if we plug in the appropriate numbers for the tex2html_wrap_inline3071He+tex2html_wrap_inline3071He reaction one finds
displaymath2618
One could compare this to the average energy of a Maxwell- Boltzmann distribition of paticles of tex2html_wrap_inline3091 E tex2html_wrap_inline3093 3 kT. Thus, indeed, the reactions are occurring far out on the Boltzmann tail, where nuclei have a better chance of penetrating the Coulomb barrier.

It might be helpful at this point to walk through the example of tex2html_wrap_inline3095C+p going to tex2html_wrap_inline3097N. If we define the zero of energy as that of the tex2html_wrap_inline3095C nucleus and proton at rest, then tex2html_wrap_inline3097N is bound by 1.943 MeV. Furthermore there is a resonance in tex2html_wrap_inline3103 at 2.367 MeV, 424 keV above the zero of energy. Thus a tex2html_wrap_inline3095C+p collision at a center-of-mass energy of 424 keV would be directly on resonance. In the lab frame, this corresponds to a 460 keV proton incident on a tex2html_wrap_inline3095C nucleus at rest.

The cross section is
displaymath2619

displaymath2620
One can reexpress the S-factor, then, as
displaymath2621
Now the product of the exponential and tex2html_wrap_inline3109 on the right should be roughly energy independent, as the exponential cancels the penetration probability buried in tex2html_wrap_inline3109. Thus the assumption the S(E) is weakly energy dependent requires that one not be too close to the resonance.

If one examines this system experimentally, the results are as shown in the figures at the back of the previous set of notes. Note that S(E) is quite smooth below above 300 keV. Thus data in the 100-300 keV range can be used to extrapolate the measured cross section to the region of interest for p burning via the CNO cycles. In contrast, the raw cross section varies over 9 orders of magnitude. Note that the theory curve, which takes into account the resonance, does quite well throughout the illustrated region. Thus the success of theory in the 100-400 keV region gives one great confidence that the values extrapolated to 20-50 keV are correct. The sharp steeping of S(E) above 400 keV lab energy is a clear indication of the resonance at 460 keV lab proton energy.

What about resonant reactions? That is, suppose we had some astrophysical setting where the relevant value of tex2html_wrap_inline3057 was not as above (tex2html_wrap_inline3117 keV), but in fact sat on the resonance at 424 keV center-of-mass energy? If the resonance is narrow (usually the case) compared to the typical spred of relevant energies of the colliding nuclei, tex2html_wrap_inline3051 =
displaymath2622

displaymath2623
The integral is 2tex2html_wrap_inline3121/tex2html_wrap_inline3049, so
displaymath2624
If the only open channels for decay of the compound nucleus are proton and tex2html_wrap_inline3125 emission, then tex2html_wrap_inline3127. If tex2html_wrap_inline3109 greatly exceeds tex2html_wrap_inline3131, then
displaymath2625
That is, the rate depends only on the tex2html_wrap_inline3125 width. The opposite limit, a very small tex2html_wrap_inline3109, which might occur in a low energy resonance in a high Z target, yields a rate that depends only on tex2html_wrap_inline3109, which governs the formation probability of the compound nucleus.

2.1.7 The pp chain and the standard solar model

We now start a discussion of stars that burn hydrogen through the pp and CNO cycles, such as our sun and similar stars. Almost all stars lying along the main sequence - perhaps 80% of the stars we observed - are thought to be hydrogen burning. The main sequence is a track of stars in the Hertzsprung-Russell diagram, or HR diagram. The HR diagram is a plot of stars on a plane where the vertical axis is the luminosity and the horizontal axis is the surface temperature (as measured by the color of the star). Stellar luminosities vary from tex2html_wrap_inline3139 that of our sun, with surface temperatures vary from 2000-50000K.

The most obvious properties that one can use to characterize a star are its surface temperature tex2html_wrap_inline3141, luminosity L, and radius R. The former two are accessible to observation, but generally the radius is not. Yet it is easy to see that there is a relationship between these properties. If we pretend stars radiate as black bodies, then the energy emitted per unit time per unit surface area is given by the Stefan-Boltzmann black-body radiation law, tex2html_wrap_inline3147, where tex2html_wrap_inline3149. Thus the star's luminosity is
displaymath2626
We can normalize things to solar properties to rewrite this as
displaymath2627
Though this result is true only for a blackbody, it makes it plausible that a plot of luminosity vs. temperature might yield a one-dimensional path in the plane parameterized by the radius, and thus the mass, of the star, provided that the stars have similar internal structure. For example, if a class of stars radiated as blackbodies, the trajectory would be as described above.

This was basically the discovery of Hertzsprung and Russell. The HR diagram on the next page shows a dominant trajectory - the main sequence - running from high temperature to low temperature. It also shows other classes of stars that reside well off the main sequence. The sun is situated on the main sequence according to its observed surface temperature of about 6500 K. Stars at the upper left - on the main sequence with temperatures 4 times that of the sun and luminosities 6 orders of magnitude larger - would have a radius about 60 times that of our sun. The red, cool, dwarf stars in the lower right of the main sequence, with luminosities about 2000 times lower than the sun and temperatures about half that of the sun, have radii about 0.1 that of the sun. Other classes of stars are well separated from the main sequence. One group has luminosities on the order of tex2html_wrap_inline3151 and temperatures again about half that of the sun. Thus these supergiants would correspond to a radius about 400 times that of the sun. Red giants, which form another patch off the main sequence, have a radius about 50 times that of our sun. White dwarfs - with luminosities about 1/200 of solar and temperatures twice solar - would correspond to a radius of about 1/50th that of the sun. These sit well below the main sequence.

The sun is our ``test case" for developing a theory of main-sequence stellar evolution. We know far more about this star - its age, luminosity, radius, surface composition, and even its neutrino luminosity and helioseismology - that any other star. Solar models trace the evolution of the sun over the past 4.6 billion years of main sequence burning, thereby predicting the present-day temperature and composition profiles of the solar core that govern energy production. Standard solar models share four basic assumptions:

tex2html_wrap_inline2893 The sun evolves in hydrostatic equilibrium, maintaining a local balance between the gravitational force and the pressure gradient. To describe this condition in detail, one must specify the equation of state as a function of temperature, density, and composition.

tex2html_wrap_inline2893 Energy is transported by radiation and convection. While the solar envelope is convective, radiative transport dominates in the core region where thermonuclear reactions take place. The opacity depends sensitively on the solar composition, particularly the abundances of heavier elements.

tex2html_wrap_inline2893 Thermonuclear reaction chains generate solar energy. The standard model predicts this energy is produced from the conversion of four protons into tex2html_wrap_inline3159He.
displaymath2628
About 98% of the time this occurs through the pp chain, with the CNO cycle contributing the remaining 2%. The sun is a large but slow reactor: the core temperature, tex2html_wrap_inline3161 K, results in typical center-of-mass energies for reacting particles of tex2html_wrap_inline3085 10 keV, much less than the Coulomb barriers inhibiting charged particle nuclear reactions. Thus reaction cross sections are small, and one must go to significantly higher energies before laboratory measurements are feasible. These laboratory data must then be extrapolated to the solar energies of interest, as we discussed previously.

tex2html_wrap_inline2893 The model is constrained to produce today's solar radius, mass, and luminosity. An important assumption of the standard model is that the sun was highly convective, and therefore uniform in composition, when it first entered the main sequence. It is furthermore assumed that the surface abundances of metals (nuclei with A > 5) were undisturbed by the subsequent evolution, and thus provide a record of the initial solar metallicity. The remaining parameter is the initial tex2html_wrap_inline3159He/H ratio, which is adjusted until the model reproduces the present solar luminosity after 4.6 billion years of evolution. The resulting tex2html_wrap_inline3159He/H mass fraction ratio is typically 0.27 tex2html_wrap_inline3173 0.01, which can be compared to the big-bang value of 0.23 tex2html_wrap_inline3173 0.01. (Note that today's surface abundance can differ from this value due to diffusion of He over the lifetime of the sun.) Note that the sun was formed from previously processed material.

The ``standard solar model" is the terminology used to describe models, such as that of Bahcall and Pinsonneault, that implement the above physics in a computer code, then evolve the sun forward from the onset of main sequence burning. Generally calculations are one-dimensional, which means that the physics such as convection can not arise dynamically. It can, and sometimes is, put in phenomenologically, through approximations such as ``mixing length theory."

The model that emerges is an evolving sun. As the core's chemical composition changes, the opacity and core temperature rise, producing a 44% luminosity increase since the onset of the main sequence. Some other features of the sun evolve even more rapidly. For example, the tex2html_wrap_inline3177B neutrino flux, the most temperature-dependent component, proves to be of relatively recent origin: the predicted flux increases exponentially with a doubling period of about 0.9 billion years. This is the flux to which Ray Davis's famous Homestake gold mine experiment is primarily sensitive. As another example of a time-dependent feature of the sun, the equilibrium abundance and equilibration time for tex2html_wrap_inline3071He are both sharply increasing functions of the distance from the solar center. Thus a steep tex2html_wrap_inline3071He density gradient is established over time. We will shortly see that the tex2html_wrap_inline3071He is sort of a ``caltalyst" in the pp chain, being produced and then consumed as an intermediate step in synthesizing tex2html_wrap_inline3159He.

Such models generally do not model the earliest history of our sun, when it first formed as a body of gas contracting under its own gravity, heating and ionizing as the gravitational work is done. The early contraction of the sun, when it approaches the main sequence vertically from above in the HR diagram, is characterized by high luminosity and convection throughout the star, lasting for a few million years. (Actually, there is thought to be continued convection in the core of sun for perhaps 10tex2html_wrap_inline3177 years, though this is driven by another mechanism: the fact that the CNO cycle is burning out of equilibrium due to initials metals in the sun.) The core of the sun reaches radiative equilibrium first, and this region then grows outward.

One of the problems on the homework, solar Li depletion, illustrates one shortcoming of the standard solar model. Li had to be burned at some point in the sun's evolution to account for the fact that the solar surface abundance is roughly 1/100th that found in meteorites. Perhaps this is due to the failure to model the sun's early convective stage. Or perhaps material can be pulled below the convective envelope by some mechanism, to a depth where the higher temperature allows tex2html_wrap_inline3189Li(tex2html_wrap_inline3191He)tex2html_wrap_inline3159He to occur.

Now let's turn to the pp chain. The basic equation governing the nonresonant strong and radiative reactions is, from our earlier work,
displaymath2629
which, after plugging in for tex2html_wrap_inline3057, can be rewritten
displaymath2630
This tells us that small tex2html_wrap_inline3197 is favored, and that rates are expected to rise as tex2html_wrap_inline3199. In the above, tex2html_wrap_inline3201 is the temperature in units of tex2html_wrap_inline3203K.

Now there is clearly no strong or radiative capture reaction that can initiate tex2html_wrap_inline3159He synthesis in the sun. The p+p, p+tex2html_wrap_inline3159He, and tex2html_wrap_inline3159He+tex2html_wrap_inline3159He reactions do not release energy and thus do not form bound states. The driving reaction of the pp chain is a weak interaction, like those discussed in the big bang,
displaymath2631
This is analogous to neutron or nuclear tex2html_wrap_inline2917 decay, except that the initial state is not a nucleus, but two protons in the plasma.

If the initiating p+p tex2html_wrap_inline2917 decay reaction occurs, we can see relatively easily how the rest of the burning might proceed:
displaymath2632

displaymath2633
This is called the ppI cycle and is, indeed, the most robost part of the pp chain in stars with temperatures like our sun: about 84% of the tex2html_wrap_inline3159He produced today in the solar core is predicted to be synthesized in this way. The two reactions above are of the type we have previously discussed.

Thus we need to derive something like an S-factor for a weak decay. Qualitatively, one can see that the p+p tex2html_wrap_inline2917 decay reaction will occur if a plasma proton decays into a neutron
displaymath2634
while a second spectator proton is close by, within the range of the nuclear force (several fermis) so that the final n+p state can form a bound deuteron. It is the binding energy of the deuteron that makes this proton decay energetically possible.

The nuclear tex2html_wrap_inline2917 decay rate is
displaymath2635
(I treat neutrinos as massive fermions: Don't worry about this. It is just a choice in normalizing neutrino spinors.) The invariant amplitude M is, as we discussed in the big bang section, effectively a contact interaction, because the momentum transfered between leptons and nucleons is so much smaller than the mass of the W boson. Thus it can be written
displaymath2636
If the factors involving tex2html_wrap_inline3223 were ignored, this would just be the current-current interaction familiar from electromagnetism. The factor tex2html_wrap_inline3225 projects out the left-hand part of the interacting fields. That is, the weak interaction is just like electromagnetism except that only the left-handed components of particle fields participate. This is correct at the level of the bare particles taking part in weak interactions, the quarks, electron/positron, and the neutrino. Note that the effective interaction for p tex2html_wrap_inline2977 n involves the factor
displaymath2637
The vector coupling is not modified because the total electric charge is conserved. But the axial-vector coupling has a nontrivial relation to the underlying quark couplings. Neutron tex2html_wrap_inline2917 decay gives the effective nucleon axial coupling constant of tex2html_wrap_inline3231.

As the momenta for reacting solar protons typical are of order
displaymath2638
and as momenta of nucleons bound in deuterium are also reasonably small (tex2html_wrap_inline3085 100 MeV), the nucleons in our tex2html_wrap_inline2917 decay amplitude can be treated nonrelativistically. In this approximation the operators in our amplitude become
displaymath2639
Thus it is the time-like part of the vector current and the space-like part of the axial-vector current that survive in the nonrelativistic limit.

It follows that our tex2html_wrap_inline2917 decay invariant amplitude can be approximated by
displaymath2640
where the tex2html_wrap_inline3239 are now tw-component Pauli spinors for the nucleonss. The above result is written for the tex2html_wrap_inline2917 decay tex2html_wrap_inline3243. It is convenient to generalize it for tex2html_wrap_inline3245 by introducing the isospin operators tex2html_wrap_inline3247 where tex2html_wrap_inline3249 tex2html_wrap_inline3251 ntex2html_wrap_inline3253 = tex2html_wrap_inline3251 ptex2html_wrap_inline3253 and tex2html_wrap_inline3259tex2html_wrap_inline3251 ptex2html_wrap_inline3253 = tex2html_wrap_inline3251 ntex2html_wrap_inline3253, with all other matrix elements being zero.

With this generalization, we can now finish the calculation. We square the invariant amplitude, integrate over the outgoing electron, neutrino, and final nuclear three-momenta, average over initial nucleon spin, and sum over final nucleon spin, electron spin, and neutrino spin. The result is
displaymath2641
where f and i are the final and initial nucleon states, and where m is the electron mass, w is the energy release in the decay, and tex2html_wrap_inline2933 is the outgoing electron energy. The tex2html_wrap_inline3249 operator corresponds to tex2html_wrap_inline3277 decay and the tex2html_wrap_inline3259 to tex2html_wrap_inline3281 decay.

This result easily generalizes to nuclear decay. The operators are replaced
displaymath2642

displaymath2643
The factor of tex2html_wrap_inline3283 before the square of the nuclear matrix elements is replaced by tex2html_wrap_inline3285 where tex2html_wrap_inline3287 is the initial nuclear spin. Finally, the Coulomb effects on the outgoing electron or positron can be approximately accounted for by including the Coulomb factor
displaymath2644
where tex2html_wrap_inline2917 is the electron/positron velocity and tex2html_wrap_inline3291 is the s-wave Coulomb wave function in the field of the daughter nucleus of charge tex2html_wrap_inline3293, evaluated at the nuclear origin. Note the close relation to the Gamow factor.

The spin-independent and spin-dependent operators appearing above are known as the Fermi and Gamow-Teller operators. The Fermi operator is the isospin raising/lowering operator: in the limit of good isopsin, which typically is good to 5% or better in the description of low-lying nuclear states, it can only connect states in the same isospin multiplet. That is, it is capable of exciting only one state, the state identical to the initial state in terms of space and spin, but with tex2html_wrap_inline3295 for tex2html_wrap_inline3277 and tex2html_wrap_inline3281 decay, respectively. Now the reaction of interest
displaymath2631
produces a final nuclear state with J = 1 and isospin T=0: this is an isospin singlet, so there is no corresponding state in p+p that can be reached by the Fermi operator. It follows that only the Gamow-Teller operator contributes. Thus the matrix element that must be calculated is
displaymath2646
Here tex2html_wrap_inline3301 is the relative two-nucleon coordinate. Thus tex2html_wrap_inline3303 is the relative two-proton wave function in the plasma. Note this expresses what we stated qualitatively before: the tex2html_wrap_inline2917 decay of the proton can only occur if there is another, spectator proton close enough by such that the result pn pair has a reasonable overlap with the deuteron, a compact state. We will now see that this is unlikely to occur, leading to a small p+p S-factor.

2.1.7 The pp chain and the standard solar model (continued)

So now we want to go about calculating the S-factor. As always we will be a little sloppy, as we want to avoid real calculations. But in spirit what we will do below is not too different from the 1938 paper by Bethe and Critchfield, where the pp cross section was first derived.

The first step is to go back to the rate formula and do the integral over the outgoing electron energy in tex2html_wrap_inline2917 decay, ignoring the Coulomb correction for the outgoing state. For a deuteron plus an etex2html_wrap_inline3309, this is not too bad because
displaymath2647
Since .511 MeV of this is needed to make the electron mass, the outgoing electron and neutrino share .420 MeV of kinetic energy. If half of this goes to the electron on average, then the typical velocity of the electron is 0.7c. Thus
displaymath2648
So the Coulomb effects can be ignored.

Therefore we can integrate the tex2html_wrap_inline2917 decay formula over electron energies to get
displaymath2649

displaymath2650
where tex2html_wrap_inline3313. We can then evaluate this for the ``decay" of p+p to get tex2html_wrap_inline3315 and
displaymath2651

Now at this point you should be a bit puzzled because we are treating the p+p system as a ``nucleus" even though it refers to collisions within the plasma. We want a cross section, or better yet, an S-factor. What is the connection?

But this is not too hard. Let's imagine cutting out a ``box" in our plasma of volume V such that the average number of contained protons is 2. Our formula for rate/vol/sec is
displaymath2652
But remember the factor on the right is supposed to be the number of distinct pairs times tex2html_wrap_inline3319. So for one pair and multiplying by V, we get the rate for our pair to interact. This is
displaymath2653

Note v/V has the dimensions of flux. Now just consider the box to be a big nucleus. The two protons in the box have a relative wave function normalized to unity in the box. The wave function at nuclear distance scales (small compared to our box dimension) is suppressed due to the Coulomb effects, but at larger scales it is just a plane wave. (Or a Coulomb wave - the logic is similar.) Thus the correct normalization for a large box is
displaymath2654
But looking at our tex2html_wrap_inline2917 decay formula for our ``nucleus", the deuteron wave function is compact, nonzero only for r on the order of the deuteron size. The spin part of the matrix element could be evaluated if we had a good deuteron wave function from solving the Schroedinger equation for some strong potential. We won't do that, but the spin operator carries no units and thus should be of order unity. Thus, replacing
displaymath2655
and taking the deuteron wave function to be constant over the nuclear volume
displaymath2656
we find a decay rate
displaymath2657
But this can be written
displaymath2658
Here P(v) is the usually penetration factor, formed from the square of the ratio of the wave function at tex2html_wrap_inline3327 and large r. But we know the normalization of the wave function at large r, by our box description, so
displaymath2659

Now we equate our two expressions for tex2html_wrap_inline2919 to find
displaymath2660
Plugging in our old expressions for P(v) we get
displaymath2661
Immediately we have the S-factor
displaymath2662
The deuteron is relatively diffuse, and we expect the reaction to occur on the tail of the wave function, due to the Coulomb penetration. Thus we make the guess tex2html_wrap_inline3331 10 f. It follows
displaymath2663

displaymath2664

displaymath2665
Thus plugging in the numbers
displaymath2666

Interestingly the result of a rigorous calculation is quite close to our guess
displaymath2667
That is, our crude guess of a unit spin matrix element and a reaction radius of 10 f seems to be unexpectedly close. Thus two important points can be made:
tex2html_wrap_inline2893 We understand the size of tex2html_wrap_inline3335 and note it is very small, 20 orders of magnitude below strong interaction S-factors. This reaction controls the rate of pp chain hydrogen burning.
tex2html_wrap_inline2893 This cross section cannot be measured in the lab: the initial state is not a stable nucleus, and the rate is impossible small. Thus our description of the basic process that powers the majority of stars MUST be taken from a first principles nuclear theory calculation. It is believed this can be done to an accuracy of about 1%. This involves fortunate aspects of the weak interaction, such as the fact that exchange current contributions to the Gamow-Teller operator are only of order (v/c) tex2html_wrap_inline3085 1%.

Theory also determines the shape of S(E)
displaymath2668
Thus at 10 keV, this slope generates a 10% correction to the S-factor.

Now that we know the S-factor, we can plug it into our rate formula to determine the rate of the p+p reaction. Recall
displaymath2669
So in the core of the sun, where tex2html_wrap_inline3341, tex2html_wrap_inline3343 and tex2html_wrap_inline3345 keV, the most effective energy for the p+p reaction. The rate formula then gives
displaymath2670
Now the number density at the center of the sun is about tex2html_wrap_inline3347/cmtex2html_wrap_inline3071, so
displaymath2671
So the time scale for burning hydrogen is the number density divided by twice the burning rate (two protons are consumed per reaction)
displaymath2672
which can be compared to the sun's present age, 4.55 b.y. Thus it has lived about half its lifetime.

2.1.7 The pp chain and the standard solar model (continued)

The work just completed gives us the basic tools needed to build a ``network" calculation of the ppI cycle. The contributing reactions are
displaymath2673

displaymath2674

displaymath2675
Here r represents a rate and tex2html_wrap_inline3353. From one calculated S-factor (for pp) and two that are measured, we could calculate the production of He once the composition and temperature of our volume of interest was specified.

One feature of interest in this simple network is that d and tex2html_wrap_inline3071He both act as ``catalysts": they are produced and then consumed in the burning. In a steady state process, this implies they must reach some equilibrium abundance where the production rate equals the destruction rate. That is, the general rate equation
displaymath2676
is satisfied at equilibrium by replacing the LHS by zero. Thus
displaymath2677
But from the S-factors
displaymath2678
and our rate formula
displaymath2679
We can plug in the values
displaymath2680

displaymath2681
to find
displaymath2682

Therefore this ratio is a decreasing function of tex2html_wrap_inline3357: the higher the temperature, the lower the equilibrium abundance of deuterium. Therefore in the region of the sun where the ppI cycle is operating, the deuterium abundance is lowest in the sun's center. Plugging in the solar core temperature
displaymath2683
There isn't much deuterium about: using tex2html_wrap_inline3359/cmtex2html_wrap_inline3071 one finds tex2html_wrap_inline3363/cmtex2html_wrap_inline3071. Remembering our previous result
displaymath2671
it follows that the typical life time of a deuterium nucleus is
displaymath2685
That is, deuterium is burned instantaneously and thus reaches equilibrium very, very quickly.

This result then allows us to write the analogous equation for tex2html_wrap_inline3071He as
displaymath2686
where the factor of two in the term on the right comes because the tex2html_wrap_inline3071He+tex2html_wrap_inline3071He reaction destroys two tex2html_wrap_inline3071He nuclei. Thus at equilibrium
displaymath2687
Using
displaymath2688
we can again do the rate algebra to find
displaymath2689
This ratio is clearly a sharply decreasing function of tex2html_wrap_inline3357 and thus a sharply increasing function of r. That is, a sharp gradient in tex2html_wrap_inline3071He is established in the sun.

One can estimate the time required to reach equilibrium in a simple way: as the burning of tex2html_wrap_inline3071He is quadratic in the abundance, it will not become significant until one is rather close to the tex2html_wrap_inline3071He equilibrium value. Thus a reasonable estimate of the time require to get close (say a factor of 2) to equilibrium is just the time required to produce the requisite number of tex2html_wrap_inline3071He nuclei. At the sun's center, we have found that the tex2html_wrap_inline3071He abundance is tex2html_wrap_inline3389, where we have assumed 75% of the matter is protons (as it was when the sun first entered the main sequence). Thus
displaymath2690
The same calculation at tex2html_wrap_inline3391, where a reasonable solar density of 36 g/cmtex2html_wrap_inline3071 and a 75% proton abundance is used, gives
displaymath2691

displaymath2692

displaymath2693
It turns out that at temperatures of tex2html_wrap_inline3395, the equilibration time corresponds to the present age of the sun. This temperature characterizes a solar radius of about 0.27, at the very edge of the energy-producing core. The resulting interesting profile of tex2html_wrap_inline3071He is shown in the figure.

There are some interesting issues connected with this tex2html_wrap_inline3071He gradient. It was shown by Dilke and Gough that it implies the sun is overstable to large-amplitude radial oscillations. If one throws a tex2html_wrap_inline3071He rich volume element towards the core, the tex2html_wrap_inline3071He will ignite at the higher temperatures, become bouyant, and return to its original equilibrium position with a kinetic energy greater than the required for the original perturbation. This has lead to speculations that the tex2html_wrap_inline3071He gradient could trigger sudden overturn of the core. Most of the experts believe there is no large amplitude trigger that will allow the sun to discover the existence of this instability.

To be somewhat more complete, the initial step of the ppI cycle can occur in a different way
displaymath2694
But the electron capture process only accounts for about 1% of the pp reactions. Thus the full ppI cycle can be written
displaymath2695

displaymath2632

displaymath2697
However there are two other paths through the pp chain that can occur if tex2html_wrap_inline3071He burns by another path. Thus tex2html_wrap_inline3409
determines
ppI   vs.  ppII+ppIII The splitting between the ppII and ppIII cycles depends on the fate of the tex2html_wrap_inline3189Be tex2html_wrap_inline3413
determines
ppII   vs.   ppIII Thus the two additional cycles are
displaymath2695

displaymath2632

displaymath2700

displaymath2701

displaymath2702
and
displaymath2695

displaymath2632

displaymath2700

displaymath2706

displaymath2707

displaymath2708

The calculations presented below will show that the competition between the three cycles is quite sensitive to the interior temperature of the sun, and to core composition. We also note that, in principle, we can experimentally determine the relative importance of the three cycles: the cycles are distinguished by the neutrinos they produce.
displaymath2709

displaymath2710

displaymath2711
Peter Doe will tell you about the exciting progress in measuring these fluxes in his lecture Monday.

tex2html_wrap_inline2893 The tex2html_wrap_inline3071He+tex2html_wrap_inline3071He tex2html_wrap_inline3421 tex2html_wrap_inline3071He+tex2html_wrap_inline3159He branching:
The Coulomb effects for these two reactions are rather similar, except for small effects proportional to the different masses. The heavier nucleus moves more slowly, and thus is at a disadvantage in overcoming Coulomb barriers. The S-factor for the 3+3 reaction is larger than that for the 3+4 reaction by almost a factor of tex2html_wrap_inline3151. However we have also seen that the core ambundance of tex2html_wrap_inline3071He is almost a factor of tex2html_wrap_inline3431 less than that of tex2html_wrap_inline3159He. The net result, from our rate formulas, is
displaymath2712
Noting that
displaymath2713
and using tex2html_wrap_inline3435 keV b leads to
displaymath2714
It is clear that higher temperatures favor the ppII+ppIII cycles. We can solve for the temperature where the ratio becomes 1,
displaymath2715
Thus the 3+3 reaction dominates at solar temperatures, but not in stars that are tex2html_wrap_inline3085 10% hotter. At the center of the sun, where tex2html_wrap_inline3439 1.5, the ratio is 0.6; at tex2html_wrap_inline3439 1.2, corresponding to a radius half way through the energy producing core, the ratio is 0.12. The figure, from Cameron, shows the ppI/ppII+ppIII branching as a function of tex2html_wrap_inline3357.





Course Notes |PHYS 554 | UW Physics | UW Astronomy | INT | UW Homepage

Last update: July 10, 1999