IBM®
Skip to main content
    Country/region [change]    Terms of use
 
 
 
    Home    Products    Services & solutions    Support & downloads    My account    

IBM Journal of Research and Development

Soft Errors in Circuits and Systems   Volume 52, Number 3, 2008
Table of contents: HTMLPDF This article: HTML PDFDOI: 10.1147/rd.523.0225Copyright info

Alpha-particle-induced upsets in advanced CMOS circuits and technology

by D. F. Heidel,
K. P. Rodbell,
E. H. Cannon,
C. Cabral, Jr.,
M. S. Gordon,
P. Oldiges,
and H. H. K. Tang

In this paper, we review the current status of single-event upsets caused by alpha-particles in IBM circuits and technology. While both alpha-particles and cosmic radiation can induce upsets, the alpha-particle-induced upset rate has become an increasingly important issue because alpha-particle-induced upsets are no longer limited to memory circuits. Latch circuits have become highly sensitive to alpha-particles. The alpha-particle-induced upset rate of latch circuits is one of the most critical issues for microprocessors requiring both high performance and high reliability.

Introduction

Radiation-induced single-event upsets (SEUs) in logic and memory circuits continue to be a key issue for advanced CMOS (complementary metal-oxide semiconductor) technologies. Recent publications show experimental upset rates in both bulk CMOS [13] and silicon-on-insulator (SOI) CMOS technologies [4]. For high-performance 65-nm and 45-nm circuits, the critical charge (Qcrit) required to upset the circuit can be very small, making latch and memory circuits susceptible to a wide range of ionizing particles.

While alpha-particles have been known to produce SEUs in very-large-scale integrated circuits (ICs) for many years, these upsets were mostly limited to memory circuits in CMOS chips. Specifically, SRAM (static RAM) circuits can be upset with a relatively small amount of charge, so SRAM circuits are the most susceptible to the low level of ionization produced by alpha-particles. With technology scaling, latch circuits have become susceptible to alpha-particles. Currently, the upset rate of latch circuits often determines the overall level of soft errors in high-performance microprocessors.

Three factors have contributed to the increased importance of alpha-particle-induced upsets in latches. First, technology scaling has reduced the Qcrit required to upset the latch circuit; smaller capacitances and lower power-supply voltages also have reduced this Qcrit value. In addition, in advanced transistors, bipolar currents can be induced by the deposited charge [5]. The bipolar current adds to the total charge collected, and therefore, the charge that must be deposited by ionization can be very small.

Second, the increasing number of latches on a chip contributes to the increasing chip fail rate due to latches. While technology scaling reduces the sensitive area of each latch, the number of latches in high-performance microprocessors has increased significantly. The size of the microprocessor chip has also continued to increase with every technology generation. Combining all of these factors leads to an increase in the total sensitive area associated with latch circuits and, therefore, an increase in the SEU rate.

Finally, latch circuits are more important in determining the overall fail rate because SEU mitigation techniques are commonly applied to memory circuits. A number of SEU mitigation techniques are available that can be used for memory with a relatively low impact on total system performance and power. One of the most common approaches is the use of error checking and correction (ECC) codes (typically double-bit-error detection and single-bit-error correction). The impact of double-bit errors can be further reduced by interleaving the bits used in any given ECC word. High-performance systems also typically have a memory hierarchy that provides an efficient means of SEU mitigation. Data is stored in multiple levels of the memory system. As soon as one level detects that an error has occurred, it can recover the data from a higher level of the memory system. These SEU mitigation techniques have proven very effective in controlling the overall soft-error rate (SER) from SEUs in memory circuits.

Because it is much more costly to provide SEU mitigation for latch circuits, the relative SER from SEUs in latches is increasing more rapidly with technology scaling than the SER from SEUs in memory. In addition, the latch fail rate is further increased because the latches can now be upset not only by cosmic particles but also by alpha-particles. The latch fail rate due to alpha-particles is reviewed in this paper.

The next section of this paper reviews recent data on alpha-particle-induced upsets in IBM latch and memory circuits. A comparison with modeling results is also shown with the data. A more complete analysis of the SEU circuit modeling is covered in Reference [6]. The third section of this paper describes techniques for mitigating alpha-particle-induced upsets in both memory and latches. The fourth section covers evolving issues for SEUs in latches. The last section summarizes the results presented and describes the projections for SEUs in future technology generations.

Alpha-particle sensitivity of advanced latches and memory circuits

A series of experiments on the upset rates of IBM 65-nm SOI latch and memory circuits have been completed. The data was taken using a 3-MV Tandem Van de Graaff accelerator that can produce a monoenergetic beam of alpha-particles. The full details of the experiments are provided in Reference [4].

Figure 1(a) shows an example of the data obtained on one of the 65-nm SOI latch chains (chain B), as well as the fail rate as a function of angle for both data states of the level 1 (L1) (or master) latch and the L2 (or slave) latch, using 5.28-MeV alpha-particles. Table 1 shows critical angles and the critical charge derived for two of the latch chains tested in these experiments (latch chains A and B). The critical angle is the angle at which the fail rate per incident particle is 10% of the maximum fail rate per incident particle. The 10% angle criteria is low enough to represent the turn-on angle for fails but high enough to contain a sufficient number of fails to produce consistent results. This turn-on angle can then be converted into a critical charge by calculating the charge deposited within the body of the transistor for each angle.

Figure 1 Figure 1


Table 1 Critical angles (θc) and critical charge (Qcrit) values obtained from data using 5.28-MeV alpha-particles.
Vdd = 1 VLatch chain ALatch chain B

L1, data = 0θc = 29 degrees
Qcrit = 0.5 fC
θc = 57 degrees
Qcrit = 0.9 fC
L1, data = 1θc = 56 degrees
Qcrit = 0.9 fC
θc = 56 degrees
Qcrit = 0.9 fC
L2, data = 0θc = 53 degrees
Qcrit = 0.8 fC
θc = 59 degrees
Qcrit = 1.0 fC
L2, data = 1θc = 48 degrees
Qcrit = 0.7 fC
θc = 42 degrees
Qcrit = 0.6 fC

Figures 1(b) and 1(c) show the data for an SRAM array at normal incidence. Figure 1(b) shows fail rates for both data = 0 and data = 1 as a function of voltage, while Figure 1(c) shows the measured fail cross-section (at 0.8 V) as a function of the alpha-particle energy.

As seen in Table 1, the critical charge for the latches was determined to be approximately 0.5–1.0 fC (femtocoulomb) for states data = 0 and data = 1. This small value of the critical charge can be easily obtained by a wide range of alpha-particle energies and angles.

Simulation methodology and extraction of critical charge

The deposited charge for a given ion incident energy is calculated by accurately accounting for the energy loss (dE/dx) of the incident particle as it travels through the various materials (wiring metals and insulators) of the back-end-of-line (BEOL) interconnects. Figure 2 shows an example of such a calculation for a 5-MeV alpha-particle as a function of the incident angle. Three curves are shown: one using the mean linear energy transfer (LET) for each of the 11 dual damascene levels in the BEOL process, an upper bound using a particle path that traverses mainly metal regions, and a lower bound using a particle path that traverses mainly insulator regions.

Figure 2 Figure 2

FIELDAY modeling

Transistor modeling of single events in small circuits was performed using the mixed-mode capability of the FIELDAY program [5, 7, 8]. As described above, the L1 latch (chain A) was defined and simulated.

The critical charge is determined by assuming that the alpha-particle strikes at normal incidence through the center of the channel. The charge generation model has an adjustable parameter to scale the ionization. A simple binary hunt algorithm is implemented to submit simulation jobs in parallel with different charge-scaling factors. Iterations on this scaling factor are performed to determine the charge generation that only causes the logic state to switch. Table 2 shows an example of these calculations for the L1 latch (chain A).


Table 2 Critical charge for a level-1 master latch (chain A) at Vdd = 1 V calculated using the FIELDAY program (reproduced from Reference [5]; ©IEEE 2006).
DeviceData = 0 (fC)Data = 1 (fC)

n-FET0.530.98
p-FET1.50>10.0

The simulation results compare favorably with the measured hardware data (shown in Table 1). The measured hardware data is typically dominated by the lowest critical charge value. Therefore, the n-FET values shown in Table 2 (0.53 fC and 0.98 fC) should be compared with the measured values from Table 1 (0.5 fC and 0.9 fC). Using these results, the relative contribution of n-FET and p-FET strikes to the SER can be quantified using the SEU simulator SEMM-2 (IBM soft-error Monte Carlo model) [9]. (For details on the first version of this model, see Reference [10].) These simulation results were further used to help develop a Qcrit model using SPICE, through a detailed comparison of the time evolution of the circuit contact voltages and internal potentials, as described in [5]. Additional single-event simulation studies were performed on the effects of non-normal incidence strikes including rotation angle, investigation of technology variants, Qcrit variations in SRAM cells, and Qcrit fluctuations due to process variability. The simulations on the rotational angle show that the SEU cross-section is only weakly dependent on the rotational angle; the results of these simulations are consistent with the small Qcrit differences between modeling two-dimensional (2D) and three-dimensional (3D) devices.

Mitigation of alpha-particle-induced upsets

The accurate determination of the energy spectra from alpha-particle sources in IBM semiconductor chips and packaging components is critical. Having detectors with low background levels is necessary in order to be able to adequately screen these materials in a timely manner. This is especially true as the need for screening low-alpha-particle materials for chip and package applications increases, a need driven by the increased sensitivity of modern circuits (both memory and logic) to alpha-particle-induced soft fails [11]. In many cases, the alpha-particle source is commonly generated in materials adjacent to the chip, in the solders, and in the packaging materials.

Typically, SERs in volatile memory can be ameliorated through software using ECC. This option does not exist in logic, because costly circuit redundancy would be required. Other approaches entail additional layers sandwiched between the traditional last level of metallization on the chip and the solder that is used for attachment to the package (Figure 3). Two approaches should be considered: The first is to extend the current BEOL structure using a copper (Cu) damascene integration approach, and the second is to add a layer of Si (silicon) interposer, containing through-vias for contacts. Two options have been explored for the first approach [12]: 1) Cu in SiO2 and 2) Cu in a photosensitive polyimide (PSPI). In addition to serving as blocking layers, circuitry can be added to these layers, such as passive electronic components (e.g., resistors and capacitors), in order to enhance chip performance. The advantage of the Si interposer approach over the Cu damascene approach is that it can be produced and tested independent of the IC chip, lowering both the production cost and the good die yield loss.

Figure 3 Figure 3

Summary of alpha-particle emissions from interconnect metallization and packaging

The alpha-particle SER is driven by BEOL metallization and packaging material alpha-particle-emitter compositions. Lead (Pb) solder in the form of solder balls is a significant contributor to the SER. The Pb-210 isotope, which leads to alpha-particle emissions, is chemically inseparable from stable Pb isotopes. Therefore, obtaining very low alpha-particle-emitting Pb is a challenge for the semiconductor industry. Lead-free solder does not guarantee low alpha-particle levels, since tin (Sn), for example, can have an appreciable alpha-particle component because of the incorporation of impurities during Sn refining. Samples of Sn have been observed to emit alpha-particles at rates as high as or higher than those of Pb. Within IBM, there are continuous efforts to achieve the lowest alpha-emitting solder for both Pb solder and Pb-free solder; however, solder balls remain a significant source of the alpha-particles. The alpha-particle emission rates for solder are typically in the range of 5–50 counts/cm2-khr.

Another source of alpha-particle emission is the packaging materials. Underfill, overmolds, organic packages, and ceramic packages can all be sources of alpha-particles. While mold compounds are generally in the range of 1–5 counts/cm2-khr, this alpha-particle contribution to the SER cannot be ignored as the industry moves toward technologies that are more sensitive to alpha-particle energy. Organic and ceramic packages can also emit a significant amount of alpha-particles, on the order of 80–100 counts/cm2-khr.

Two paths can be taken to control the alpha-particle component of SERs in new technologies. The first is an effort to drive the alpha-particle levels of the material to even lower emission rates. This requires the development and manufacturing of products using new material compositions, as well as better qualification and monitoring of incoming materials, which in turn requires the development of more-sensitive alpha-particle detection equipment that can isolate subtle changes in the alpha-particle emission rate of materials. The second is an effort to develop an alpha-particle barrier that prevents all alpha-particles from external sources from affecting product circuits. The barrier could reside either on the chip directly under the solder balls or on the packaging above the thin-film wiring levels. Both of these approaches add processing complexity and cost, and they still require a level of internal alpha-particle contamination control by manufacturing lines. However, the flux of alpha-particles reaching the ICs is greatly reduced, effectively switching this problem from one of internally generated alpha-particles to one of solely a cosmic neutron particle flux issue.

Evolving single-event issues

In latches, the increasing rate of SEUs caused by alpha‐particles, as described above, is one of the most difficult SEU issues for advanced high-performance microprocessors. However, numerous additional single-event effects are also important, including multiple-bit upsets (MBUs), single-event transients (SETs), and the upsets caused by direct ionization from protons. As described below, these single-event effects are mostly initiated by inelastic collisions from terrestrial cosmic neutron radiation. Because alpha-particles are among the most prevalent fragments generated in an inelastic collision, the increasing upset sensitivity from alpha-particles also enhances the SEU issues that are due to cosmic neutrons.

MBUs are a key issue for many systems [2]. Most of the published work on MBUs has focused on memory circuits. However, as with single-bit upsets in memory, effective mitigation techniques exist to minimize the impact of MBUs in memory. The more challenging issue is multiple-bit effects in latches. MBUs can prevent parity-checking circuits from detecting latch upsets and can limit the effectiveness of redundant latches.

MBUs can be caused by several mechanisms, the most common of which is charge sharing between two closely spaced nodes. An ionizing particle at a particularly shallow angle will deposit charge in the silicon substrate or well structures of bulk CMOS circuits. The charge generated can be collected and can upset multiple circuits.

Because SOI technology isolates each transistor from the silicon substrate, charge sharing will not create MBUs in SOI circuits. However, MBUs can still be created in SOI transistors. An inelastic collision from a high-energy neutron can often produce multiple fragments and each of these fragments can upset a memory cell or latch. A single fragment that is emitted parallel to the silicon surface can also cross and upset multiple memory cells in either bulk or SOI technologies. Figure 4 shows data on MBUs from a 65-nm SOI memory array produced by 148-MeV protons. The 148-MeV protons will create fragments from inelastic collisions that are similar to fragments created by terrestrial neutrons of the same energy. These data indicate that the MBU rate is relatively low in SOI technology. Even for closely spaced circuits, the MBU rate is much less than 1% of the single-bit error rate. The lower MBU rate is one advantage of SOI technology. Nevertheless, MBUs in all technologies can be important if parity checking and ECC are used to eliminate SEUs.

Figure 4 Figure 4

The issue of SETs is becoming more significant in advanced technologies. For older technology generations, SETs are typically observed only in space applications and are caused by the heavy ions in the space environment. For 90-nm technologies and beyond, SETs can also be produced in terrestrial environments. The largest transients are created by the heavier fragments produced in an elastic collision of a high-energy neutron. The length of the transient pulse created is long enough that it will not be attenuated in passing through stages of logic toward the latch circuit. For static CMOS circuits, the pulse must arrive at the latch simultaneously with the clock capture edge. Therefore, the upset rate due to SETs will scale as the length of the SET divided by the clock period. The increasing importance of SETs is a function of both the longer transients created and the shorter cycle times of high-performance chips.

Upsets due to direct ionization from high-energy protons [13] have also been observed. Figure 5 shows upsets of 65-nm SOI latches that are due to direct ionization resulting from high-energy protons. The energy of the proton was carefully chosen in order to produce the maximum rate of ionization in the body of the transistor. Most high-energy protons create less charge per unit length and are not capable of upsetting these latches. However, the results demonstrate that upsets can be created by the relatively low ionization rate of protons. As latches become more susceptible to protons, these fail rates can become significant because of both the large number of secondary protons created in inelastic collisions and the long distances that protons travel.

Figure 5 Figure 5

Discussion and conclusions

The issue of radiation-induced upsets of latch circuits by both alpha-particles and cosmic radiation is significant for systems requiring high reliability. The alpha-particle-induced upset rate of latches has increased dramatically for several technology generations and will continue to increase in future technologies. This increasing trend of upsets is driven by the lower critical charge required to upset the latches, as well as the rapidly increasing number of latches on a chip.

The increasing upset rate can be mitigated by many techniques. Technology options include using materials with lower emission rates, building barriers to reduce the number of alpha-particles that reach the transistors, and building circuits that are less sensitive to collecting the charge generated. As is described in other papers in this issue, the upset rate can also be mitigated at the circuit level or the system level. With the increasing sensitivity to alpha-particles, choosing latch designs consistent with the overall SER for the system is important.

Acknowledgments

We acknowledge valuable interactions with many of our IBM colleagues including Robert Dennard, AJ KleinOsowski, Scott McAllister, Charles Montrose, Conal Murray, and Tak Ning.

References

Received July 12, 2007; accepted for publication November 19, 2007; Published online February 27, 2008.


    About IBMPrivacyContact