RequestLink
MICRO
Advertiser and
Product
Information

Buyer's Guide
Buyers Guide

tom
Chip Shots blog

Greatest Hits of 2005
Greatest Hits of 2005

Featured Series
Featured Series


Web Sightings

Media Kit

Comments? Suggestions? Send us your feedback.

 

MicroMagazine.com

Advanced Process/Equipment Control

Applying a simple statistical process control approach to defect monitoring

Bruno Scibilia, Altis Semiconductor

Statistical process control (SPC) is used extensively to identify the causes of defect excursions as early as possible. Because of the increasing complexity of semiconductor devices and the steady decrease in feature size, it is necessary to eliminate defect sources and reduce the density of systematic and random defects per wafer in all semiconductor processes to maintain high device yields. Traditionally, IC development has focused on performance and technological improvements. Today, however, more attention is being paid to techniques such as SPC to improve manufacturing efficiency.

Defect data are particularly difficult to monitor using SPC charts, partly because they exhibit erratic behavior and noisiness. While measurement data such as critical dimensions and depths follow a Gaussian, or normal, distribution (the well-known bell-shaped curve), defect data have an asymmetrical distribution. Most defect data are positive and relatively close to zero. This violation of the Gaussian assumption must be taken into account in process control operations.

Defects arise from two types of causes: common causes and assignable causes. Common causes include routine or chronic sources of variations in the fab, such as the failure of environmental cleanroom controls, a lack of repeatability, and measurement errors. By definition, fab personnel cannot eliminate common causes from processes. Deviations from the norm can also have assignable causes. Defects with assignable causes can lead to processes that are out of statistical control. Once a control chart has triggered a true alarm, assignable causes must be discovered and corrected.

Figure 1: Theoretical Poisson distributions with different means.

Traditionally, four types of charts have been used in SPC applications: np and p charts for binomial random distributions, and c or u charts for Poisson random distributions. Figure 1 illustrates theoretical Poisson distributions with different means.

The shape of the random distribution (whether it is a normal, Poisson, or other type of distribution) has an important influence on where control limits are set. SPC limits on defect data are usually set according to the Poisson distribution.1 Control limits are helpful to identify systematic departures from random distributions and are therefore set at the very limits of these distributions. However, statistician engineers have observed that in the semiconductor industry, such limits are often too tight, leading to many false alarms. To minimize the incidence of false alarms, many practitioners resort to limits that are not based on statistical principles, a pragmatic approach that is valid only when the behavior of defect data is well understood.

The objective of this article is to present a simple and effective SPC approach to monitoring defect data. The article begins with a summary of the techniques and concepts used in the IC industry. Then it presents examples of defect distributions and proposes an approach based on exponential distributions to account for the overdispersion of defect data.

Results from a Benchmark Survey

Engineers at Altis Semiconductor (Corbeil-Essonnes, France) conducted an informal benchmark survey covering six firms in the semiconductor industry. The survey concluded that the Poisson model was the most widely used SPC method throughout the industry, although other, alternative approaches are also implemented.

In two firms, control limits were based purely on "engineering judgment." In one of these firms, control limits were set so that a certain percentage of lots (say 10%) would be rechecked. Sometimes, defect-yield correlations and yield-impact calculations were used to identify defect levels at which the impact on yield became noticeable. Actually a specification limit, that level was not considered a control limit.

An expert from an equipment supplier proposed that a logarithmic transformation be used to normalize distributions. Transformed values, however, have little physical significance and are therefore not popular in the industry.

At some companies, exponentially weighted moving average (EWMA) charts were used in addition to standard charts to track gradually increasing particle counts. However, when Altis engineers tested the efficiency of EWMA charts using the fab's own data, they observed that whenever defect levels were out of control, they were so much above the control limits that the statistical results from EWMA charts and standard charts were the same. Hence, EWMA charts did not provide useful information in that context.

Overdispersion of Defect Data

While within-wafer processing conditions are quite homogeneous, wafer-to-wafer conditions can vary extensively. Most of the time, small groups of defects, or even isolated defects, appear on wafers. However, when a common cause (routine run-to-run variations) exposes a wafer to particle contamination, clusters of defects rather than isolated particles appear on the surface. In such cases, the number of defects increases exponentially according to the number and size of the clusters.

A well-known consequence of this clustering phenomenon is the overdispersion of defect data, which results in high false-alarm rates when the effect is not properly taken into account. Because of within-wafer and within-lot correlations, the Poisson distribution assumption is inaccurate in this situation.

Some researchers advocate the use of a square-root transformation of particle counts to account for overdispersion of defect data.2 According to them, particle counts often follow a χ2 distribution. Quantile-quantile plots indicate that after transformation, the data structure is much closer to a normal distribution. However, the tails of the transformed defect data distribution still deviate somewhat from the Gaussian model.

Other researchers recommend that a mixture distribution be used to account for overdispersion and the correlation of within-wafer data.3 In this approach, a Poisson distribution is combined with a gamma distribution to achieve a negative binomial distribution. The shape of the gamma distribution can be adjusted, providing a very high degree of flexibility. Although the flexibility of the mixture distribution model enables it to fit defect data well, the technique is complex. It requires that maximum-likelihood estimations be calculated using quasi-Newtonian methods.

Experiments and Results

Defect Monitoring in the Low-Pressure Chemical Vapor Deposition (LPCVD) Area. The LPCVD poly gate deposition furnace that was selected for this study is used to process batches of 150 wafers at a time. At the time of the experiment, defects were often caused by particles and dust released from the quartz racks that supported the wafers. Because the LPCVD process involves several cooling and heating cycles, the racks gradually deteriorated. (Since the experiment was conducted, the quartz racks have been replaced by more-expensive silicon racks, resulting in a significant reduction in defect levels.)

Figure 2: Histogram of the raw sum of defects gathered over a five-month period.

Control charts were used to determine when the racks needed to be replaced. When measurements indicated that defect levels were out of control, the tool was cleaned. Production resumed if postcleaning measurements were considered acceptable. Otherwise, the tool was shut down and serviced by the maintenance team.

Tool defect data were collected 200 times over a five-month period and divided according to their value into nine bins of uniform width. The results of the data acquisition are presented in the histograms in Figures 2 and 3, in which the smallest sum of defects is represented by the leftmost bar and the largest by the rightmost bar.

Figure 3: Histogram based on the same data as in Figure 2 without outliers.

The data shown in Figure 2 were clearly distorted by outliers. After the outliers were eliminated, the data, presented in Figure 3, seemed to follow an exponential distribution, a theoretical example of which is presented in Figure 4 for comparison. The similarity between the data curve from the LPCVD tool and the theoretical exponential distribution is striking. More importantly, the estimates of the mean and standard deviation values in Figure 3—13.2 and 11.8σ, respectively—are very close to each other, which is typical of exponential distributions.

Figure 4: Theoretical exponential distribution.

A positive random variable is exponentially distributed when f(x) = ß exp(–ßx) for x > 0. The random variable, x, is more likely to have small rather than large positive values. The single parameter, ß, determines the exponential distribution and all of its moments. For example,

 



where E(X) stands for mean, σ(X) stands for standard deviation, and V(X) stands for variance. The mean and standard deviation are therefore equal theoretically and very close to each other in practice.

In an exponential distribution, density function (f(x)), distribution function (F(x)), mean (µ), variance (σ2), and standard deviation (σ) are determined by the following equations:





Figure 5 presents examples of exponential distributions with different means (ß).

Figure 5: Exponential distributions with different means.

In conjunction with exponential distributions, control limits can be set to achieve average run length (ARL) statistics. ARL, a measure of the average number of samples between false alarms (when the system is in statistical control), is determined by the equation:

 

where α = the risk of false alarms (the percentage of samples beyond the control limits).

In an exponential distribution, a 3σ limit (with a mean of +3σ) corresponds to 98.16% of the data. Hence, α = 1.84%, which represents the percentage of samples beyond the control limit. Those percentages correspond to a very low ARL of 54.3, which means that the system is triggering a high rate of false alarms. By setting the control limits to +4σ (99.32% of the data), a higher ARL can be achieved. And at 4.5σ (99.59% of the data), the ARL reaches 245. In comparison, in a Gaussian distribution, an ARL based on a +3σ limit is 370 (99.73% of the data), where α = 0.27%.

Defect Monitoring in the Etch Area. Another experiment was performed based on data from a tool that is used for plasma etching of metal layers in semiconductor devices. To obtain the most accurate measurements, particle counts were measured directly from the wafer surface rather than from the process gases. Whenever a data point was beyond the control limits of a control chart, a second measurement was made. If the second measurement was within the control limits, the tool was allowed to continue processing. If not, the tool was stopped and serviced by the maintenance team.

Figure 6: Histogram of defect data without outliers from an etch-tool chamber. The mean is 26.31 and the standard deviation is 26.83.

After the outliers were eliminated, as reflected in the chart in Figure 6, the mean of the process chamber was 26.31 and the standard deviation was 26.83. As in any exponential distribution, the mean and standard deviation estimates were remarkably similar. The SPC chart in Figure 7, with a control limit of 80, presents the same defect data as in Figure 6 plotted versus time. It indicates that while most values were small, the deviations appeared to be very significant.

Figure 7: SPC chart based on the same data as in Figure 6. The data are plotted versus time.

Figure 8 contains a histogram showing the distribution of defect data from the chamber of another etch tool. The mean was 13.12 and the standard deviation was 13.73. Considering that the maximum value was 82 and the minimum was 0, the mean and standard deviation were very close to each other.

Figure 8: Histogram of the sum of particle defects from a different etch tool than that examined in Figures 6 and 7.

Figure 9 shows the same particle count data as those in Figure 8 after a square-root transformation. Clearly, this figure indicates that a square-root transformation is not adequate to obtain a Gaussian distribution. The tail on the left side is much thicker than that on the right.

Figure 9: Square-root transformation performed on the same particle count data as in Figure 8.

Figure 10 presents an SPC chart based on the same defect data as those in Figure 8 plotted versus time. The chart also shows outliers and out-of-control points. A 4.5σ control limit (represented by the red line) corresponded to a value of 75, where the mean was 13.12 and the standard deviation was 13.73. The control limit alarm was triggered four times during the period of time covered by the test.

Figure 10: SPC chart based on the same data as in Figure 8, with outliers and out-of-control points. Data are plotted versus time.

Defect Monitoring in the Wet-Processing Area. The only area where the exponential distribution method was unsuccessful was the wet-processing area. Since wet processes are generally used to perform wafer-cleaning applications, the number of particles added during processing can be smaller than the number eliminated, occasionally resulting in a negative number. In such cases, distributions are asymmetrical, with a much lower degree of dispersion on the negative side, as highlighted in Figure 11. The particle data in Figure 11 come from a wet process that is used to eliminate organic and metal contaminants from the wafer surface. However, even when wet processes are used to perform etch applications, cleaning still occurs. Consequently, a different approach to SPC monitoring is clearly required in the wet-processing area.

Figure 11: Histogram of defect data without outliers from a wet-process tool. Values range from –18 to 38.

Conclusion

When this study was initiated, the investigators anticipated that defect data would follow complex mixture distributions. However they soon realized that except for the wet-processing area, the exponential assumption works very well in practice, offering useful insights into the behavior of particle count data.

The use of the exponential assumption at Altis has had a major impact on control limits and process control, leading to improved particle counts. For example, the upper control limit of a Poisson process, where the mean µ = v and the standard deviation σ = v0.5, is µ + 3(µ0.5). In contrast, the upper control limit of an exponential process, where the mean µ = v and the standard deviation σ = v, is µ + 3µ, since µ = σ. Therefore, when the mean value is higher than 1, the control limits of Poisson distributions are tighter than those of exponential distributions. This implies that when the Poisson assumption is used for exponential distributions, the rate of false alarms tends to be very high. Table I compares control limits of Poisson distributions with those of exponential distributions for LPCVD and lithography tools.

Tool
Mean
Control Limit
Poisson
Assumption
Exponential
Assumption
(98.16%)

LPCVD tools

Tool 1

Tool 2



12

12



24

24



48

48

Lithography tools

Tool 1

Tool 2

 


11

1.6

 


21

5.4

 


44

6.4

Table I: Comparison between 3σ control limits based on the Poisson assumption and exponential assumption. The difference is often very significant (exponential limits are sometimes twice as large as Poisson-based limits).

An important consequence of using wider exponential control limits is that the decreased number of false alarms enables engineers to assess defect levels generated by common causes more accurately. Therefore, the fab's focus will gradually shift from searching for special causes that do not in fact exist to taking longer-term actions to reduce chronic sources of defects. That shift in emphasis will eventually lead to more-effective strategies to reduce defect levels.

An important advantage of the exponential distribution approach is its simplicity. While the experimental results presented in this article are based solely on an analysis of control charts generated at Altis, they are relevant for the semiconductor industry as a whole.

With shrinking dimensions and increased aspect ratios, semiconductor yields will become increasingly sensitive to defects. Defect monitoring using SPC is likely to play an increasingly crucial role in identifying special causes, enabling engineers to remove them quickly.

Acknowledgments

The author wishes to thank Emmanuelle Hiolle for her useful contributions to this work. He also would like to thank the Technological University of Compiègne, France, for collaborating on this project and acknowledges those colleagues at Altis Semiconductor who participated.

References

1. CJ Spanos, "Statistical Process Control in Semiconductor Manufacturing," Proceedings of the IEEE 80, no. 6 (1992): 819–830.

2. WA Levinson et al., "SPC for Particle Counts," Semiconductor International 24, no. 12 (2001): 83–90.

3. B Cantell, R Collica, and J Ramírez, "Statistical Process Monitoring of Correlated Binary and Count Data Using Mixture Distributions," in Proceedings of the 23rd SAS Users Group International Conference (Nashville, TN: SAS, 1998 [cited 26 January 2004]); available from Internet: http://www2.sas.com/proceedings/sugi23/Stats/p237.pdf.

Bruno Scibilia, PhD, is a statistician engineer at Altis Semiconductor (an IBM-Infineon joint venture) in Corbeil-Essonnes, France, where he works in the fields of SPC (particularly in the photolithography area), Cp/Cpk monitoring, and designs of experiments. He received a PhD in applied statistics from the University of Angers (ISTIA), France, in 2000. (Scibilia can be reached at +33 160 885804 or bruno.scibilia@altissemiconductor.com.)


MicroHome | Search | Current Issue | MicroArchives
Buyers Guide | Media Kit

Questions/comments about MICRO Magazine? E-mail us at cheynman@gmail.com.

© 2007 Tom Cheyney
All rights reserved.