Bruno
Scibilia, Altis Semiconductor
Statistical
process control (SPC) is used extensively to identify the causes of
defect excursions as early as possible. Because of the increasing complexity
of semiconductor devices and the steady decrease in feature size, it
is necessary to eliminate defect sources and reduce the density of systematic
and random defects per wafer in all semiconductor processes to maintain
high device yields. Traditionally, IC development has focused on performance
and technological improvements. Today, however, more attention is being
paid to techniques such as SPC to improve manufacturing efficiency.
Defect
data are particularly difficult to monitor using SPC charts, partly
because they exhibit erratic behavior and noisiness. While measurement
data such as critical dimensions and depths follow a Gaussian, or normal,
distribution (the well-known bell-shaped curve), defect data have an
asymmetrical distribution. Most defect data are positive and relatively
close to zero. This violation of the Gaussian assumption must be taken
into account in process control operations.
Defects
arise from two types of causes: common causes and assignable causes.
Common causes include routine or chronic sources of variations in the
fab, such as the failure of environmental cleanroom controls, a lack
of repeatability, and measurement errors. By definition, fab personnel
cannot eliminate common causes from processes. Deviations from the norm
can also have assignable causes. Defects with assignable causes can
lead to processes that are out of statistical control. Once a control
chart has triggered a true alarm, assignable causes must be discovered
and corrected.
 |
| Figure
1: Theoretical Poisson distributions with different means. |
Traditionally,
four types of charts have been used in SPC applications: np and p charts
for binomial random distributions, and c or u charts for Poisson random
distributions. Figure 1 illustrates theoretical Poisson distributions
with different means.
The
shape of the random distribution (whether it is a normal, Poisson, or
other type of distribution) has an important influence on where control
limits are set. SPC limits on defect data are usually set according
to the Poisson distribution.1 Control limits are helpful
to identify systematic departures from random distributions and are
therefore set at the very limits of these distributions. However, statistician
engineers have observed that in the semiconductor industry, such limits
are often too tight, leading to many false alarms. To minimize the incidence
of false alarms, many practitioners resort to limits that are not based
on statistical principles, a pragmatic approach that is valid only when
the behavior of defect data is well understood.
The
objective of this article is to present a simple and effective SPC approach
to monitoring defect data. The article begins with a summary of the
techniques and concepts used in the IC industry. Then it presents examples
of defect distributions and proposes an approach based on exponential
distributions to account for the overdispersion of defect data.
Results
from a Benchmark Survey
Engineers
at Altis Semiconductor (Corbeil-Essonnes, France) conducted an informal
benchmark survey covering six firms in the semiconductor industry. The
survey concluded that the Poisson model was the most widely used SPC
method throughout the industry, although other, alternative approaches
are also implemented.
In
two firms, control limits were based purely on "engineering judgment."
In one of these firms, control limits were set so that a certain percentage
of lots (say 10%) would be rechecked. Sometimes, defect-yield correlations
and yield-impact calculations were used to identify defect levels at
which the impact on yield became noticeable. Actually a specification
limit, that level was not considered a control limit.
An
expert from an equipment supplier proposed that a logarithmic transformation
be used to normalize distributions. Transformed values, however, have
little physical significance and are therefore not popular in the industry.
At
some companies, exponentially weighted moving average (EWMA) charts
were used in addition to standard charts to track gradually increasing
particle counts. However, when Altis engineers tested the efficiency
of EWMA charts using the fab's own data, they observed that whenever
defect levels were out of control, they were so much above the control
limits that the statistical results from EWMA charts and standard charts
were the same. Hence, EWMA charts did not provide useful information
in that context.
Overdispersion
of Defect Data
While
within-wafer processing conditions are quite homogeneous, wafer-to-wafer
conditions can vary extensively. Most of the time, small groups of defects,
or even isolated defects, appear on wafers. However, when a common cause
(routine run-to-run variations) exposes a wafer to particle contamination,
clusters of defects rather than isolated particles appear on the surface.
In such cases, the number of defects increases exponentially according
to the number and size of the clusters.
A
well-known consequence of this clustering phenomenon is the overdispersion
of defect data, which results in high false-alarm rates when the effect
is not properly taken into account. Because of within-wafer and within-lot
correlations, the Poisson distribution assumption is inaccurate in this
situation.
Some
researchers advocate the use of a square-root transformation of particle
counts to account for overdispersion of defect data.2 According
to them, particle counts often follow a χ2 distribution.
Quantile-quantile plots indicate that after transformation, the data
structure is much closer to a normal distribution. However, the tails
of the transformed defect data distribution still deviate somewhat from
the Gaussian model.
Other
researchers recommend that a mixture distribution be used to account
for overdispersion and the correlation of within-wafer data.3
In this approach, a Poisson distribution is combined with a gamma distribution
to achieve a negative binomial distribution. The shape of the gamma
distribution can be adjusted, providing a very high degree of flexibility.
Although the flexibility of the mixture distribution model enables it
to fit defect data well, the technique is complex. It requires that
maximum-likelihood estimations be calculated using quasi-Newtonian methods.
Experiments
and Results
Defect
Monitoring in the Low-Pressure Chemical Vapor Deposition (LPCVD) Area.
The LPCVD poly gate deposition furnace that was selected for this study
is used to process batches of 150 wafers at a time. At the time of the
experiment, defects were often caused by particles and dust released
from the quartz racks that supported the wafers. Because the LPCVD process
involves several cooling and heating cycles, the racks gradually deteriorated.
(Since the experiment was conducted, the quartz racks have been replaced
by more-expensive silicon racks, resulting in a significant reduction
in defect levels.)
 |
| Figure
2: Histogram of the raw sum of defects gathered over a five-month
period. |
Control
charts were used to determine when the racks needed to be replaced.
When measurements indicated that defect levels were out of control,
the tool was cleaned. Production resumed if postcleaning measurements
were considered acceptable. Otherwise, the tool was shut down and serviced
by the maintenance team.
Tool
defect data were collected 200 times over a five-month period and divided
according to their value into nine bins of uniform width. The results
of the data acquisition are presented in the histograms in Figures 2
and 3, in which the smallest sum of defects is represented by the leftmost
bar and the largest by the rightmost bar.
 |
| Figure
3: Histogram based on the same data as in Figure 2 without outliers. |
The
data shown in Figure 2 were clearly distorted by outliers. After the
outliers were eliminated, the data, presented in Figure 3, seemed to
follow an exponential distribution, a theoretical example of which is
presented in Figure 4 for comparison. The similarity between the data
curve from the LPCVD tool and the theoretical exponential distribution
is striking. More importantly, the estimates of the mean and standard
deviation values in Figure 3—13.2 and 11.8σ, respectively—are
very close to each other, which is typical of exponential distributions.
 |
| Figure
4: Theoretical exponential distribution. |
A
positive random variable is exponentially distributed when f(x)
= ß exp(–ßx) for x > 0. The random
variable, x, is more likely to have small rather than large
positive values. The single parameter, ß, determines the exponential
distribution and all of its moments. For example,
where
E(X) stands for mean, σ(X) stands for
standard deviation, and V(X) stands for variance.
The mean and standard deviation are therefore equal theoretically and
very close to each other in practice.
In
an exponential distribution, density function (f(x)),
distribution function (F(x)), mean (µ), variance
(σ2), and standard deviation (σ) are determined
by the following equations:
Figure
5 presents examples of exponential distributions with different means
(ß).
 |
| Figure
5: Exponential distributions with different means. |
In
conjunction with exponential distributions, control limits can be
set to achieve average run length (ARL) statistics. ARL, a measure
of the average number of samples between false alarms (when the system
is in statistical control), is determined by the equation:
where
α = the risk of false alarms (the percentage of samples beyond the
control limits).
In
an exponential distribution, a 3σ limit (with a mean of +3σ)
corresponds to 98.16% of the data. Hence, α = 1.84%, which represents
the percentage of samples beyond the control limit. Those percentages
correspond to a very low ARL of 54.3, which means that the system
is triggering a high rate of false alarms. By setting the control
limits to +4σ (99.32% of the data), a higher ARL can be achieved.
And at 4.5σ (99.59% of the data), the ARL reaches 245. In comparison,
in a Gaussian distribution, an ARL based on a +3σ limit is 370
(99.73% of the data), where α = 0.27%.
Defect
Monitoring in the Etch Area. Another experiment was performed
based on data from a tool that is used for plasma etching of metal
layers in semiconductor devices. To obtain the most accurate measurements,
particle counts were measured directly from the wafer surface rather
than from the process gases. Whenever a data point was beyond the
control limits of a control chart, a second measurement was made.
If the second measurement was within the control limits, the tool
was allowed to continue processing. If not, the tool was stopped and
serviced by the maintenance team.
 |
| Figure
6: Histogram of defect data without outliers from an etch-tool
chamber. The mean is 26.31 and the standard deviation is 26.83. |
After
the outliers were eliminated, as reflected in the chart in Figure
6, the mean of the process chamber was 26.31 and the standard deviation
was 26.83. As in any exponential distribution, the mean and standard
deviation estimates were remarkably similar. The SPC chart in Figure
7, with a control limit of 80, presents the same defect data as in
Figure 6 plotted versus time. It indicates that while most values
were small, the deviations appeared to be very significant.
 |
| Figure
7: SPC chart based on the same data as in Figure 6. The data are
plotted versus time. |
Figure
8 contains a histogram showing the distribution of defect data from
the chamber of another etch tool. The mean was 13.12 and the standard
deviation was 13.73. Considering that the maximum value was 82 and
the minimum was 0, the mean and standard deviation were very close
to each other.
 |
| Figure
8: Histogram of the sum of particle defects from a different etch
tool than that examined in Figures 6 and 7. |
Figure
9 shows the same particle count data as those in Figure 8 after a
square-root transformation. Clearly, this figure indicates that a
square-root transformation is not adequate to obtain a Gaussian distribution.
The tail on the left side is much thicker than that on the right.
 |
| Figure
9: Square-root transformation performed on the same particle count
data as in Figure 8. |
Figure
10 presents an SPC chart based on the same defect data as those in
Figure 8 plotted versus time. The chart also shows outliers and out-of-control
points. A 4.5σ control limit (represented by the red line) corresponded
to a value of 75, where the mean was 13.12 and the standard deviation
was 13.73. The control limit alarm was triggered four times during
the period of time covered by the test.
 |
| Figure
10: SPC chart based on the same data as in Figure 8, with outliers
and out-of-control points. Data are plotted versus time. |
Defect
Monitoring in the Wet-Processing Area. The only area where
the exponential distribution method was unsuccessful was the wet-processing
area. Since wet processes are generally used to perform wafer-cleaning
applications, the number of particles added during processing can
be smaller than the number eliminated, occasionally resulting in a
negative number. In such cases, distributions are asymmetrical, with
a much lower degree of dispersion on the negative side, as highlighted
in Figure 11. The particle data in Figure 11 come from a wet process
that is used to eliminate organic and metal contaminants from the
wafer surface. However, even when wet processes are used to perform
etch applications, cleaning still occurs. Consequently, a different
approach to SPC monitoring is clearly required in the wet-processing
area.
 |
| Figure
11: Histogram of defect data without outliers from a wet-process
tool. Values range from –18 to 38. |
Conclusion
When
this study was initiated, the investigators anticipated that defect
data would follow complex mixture distributions. However they soon
realized that except for the wet-processing area, the exponential
assumption works very well in practice, offering useful insights into
the behavior of particle count data.
The
use of the exponential assumption at Altis has had a major impact
on control limits and process control, leading to improved particle
counts. For example, the upper control limit of a Poisson process,
where the mean µ = v and the standard deviation σ
= v0.5, is µ + 3(µ0.5). In
contrast, the upper control limit of an exponential process, where
the mean µ = v and the standard deviation σ = v,
is µ + 3µ, since µ = σ. Therefore, when the mean
value is higher than 1, the control limits of Poisson distributions
are tighter than those of exponential distributions. This implies
that when the Poisson assumption is used for exponential distributions,
the rate of false alarms tends to be very high. Table I compares control
limits of Poisson distributions with those of exponential distributions
for LPCVD and lithography tools.
| Tool |
Mean |
Control
Limit |
| Poisson
Assumption |
Exponential
Assumption
(98.16%) |
| LPCVD
tools
Tool
1
Tool
2 |
12
12 |
24
24 |
48
48 |
| Lithography
tools
Tool
1
Tool
2 |
11
1.6 |
21
5.4 |
44
6.4 |
|
| Table
I: Comparison between 3σ control limits based on the Poisson
assumption and exponential assumption. The difference is often
very significant (exponential limits are sometimes twice as large
as Poisson-based limits). |
An
important consequence of using wider exponential control limits is
that the decreased number of false alarms enables engineers to assess
defect levels generated by common causes more accurately. Therefore,
the fab's focus will gradually shift from searching for special causes
that do not in fact exist to taking longer-term actions to reduce
chronic sources of defects. That shift in emphasis will eventually
lead to more-effective strategies to reduce defect levels.
An
important advantage of the exponential distribution approach is its
simplicity. While the experimental results presented in this article
are based solely on an analysis of control charts generated at Altis,
they are relevant for the semiconductor industry as a whole.
With
shrinking dimensions and increased aspect ratios, semiconductor yields
will become increasingly sensitive to defects. Defect monitoring using
SPC is likely to play an increasingly crucial role in identifying
special causes, enabling engineers to remove them quickly.
Acknowledgments
The
author wishes to thank Emmanuelle Hiolle for her useful contributions
to this work. He also would like to thank the Technological University
of Compiègne, France, for collaborating on this project and
acknowledges those colleagues at Altis Semiconductor who participated.
References
1. CJ
Spanos, "Statistical Process Control in Semiconductor Manufacturing,"
Proceedings of the IEEE 80, no. 6 (1992): 819–830.
2. WA
Levinson et al., "SPC for Particle Counts," Semiconductor International
24, no. 12 (2001): 83–90.
3. B
Cantell, R Collica, and J Ramírez, "Statistical Process Monitoring
of Correlated Binary and Count Data Using Mixture Distributions,"
in Proceedings of the 23rd SAS Users Group International Conference
(Nashville, TN: SAS, 1998 [cited 26 January 2004]); available from
Internet: http://www2.sas.com/proceedings/sugi23/Stats/p237.pdf.
Bruno
Scibilia, PhD, is a statistician engineer at Altis Semiconductor
(an IBM-Infineon joint venture) in Corbeil-Essonnes, France, where
he works in the fields of SPC (particularly in the photolithography
area), Cp/Cpk monitoring, and designs of experiments. He received
a PhD in applied statistics from the University of Angers (ISTIA),
France, in 2000. (Scibilia can be reached at +33 160 885804 or
bruno.scibilia@altissemiconductor.com.)