Professional Documents
Culture Documents
Variation is a major cause of quality problems and consequently many process improvement activities
focus on identifying and reducing it. It is thus important to understand as much about variation as
possible, even though its statistical nature can be disconcerting.
This chapter discusses the basic principles of variation, keeping the mathematical content to a
minimum. In keeping with the style of the book, calculations are also minimized, consisting only of
inserting numbers into simple formulae and using illustrated panels for calculations involving lists of
numbers.
There are four main sections in this chapter:
Understanding variation
The continuously variable nature of the universe is at the heart of the science of statistics, and at first
glance can look very complex, particularly if approached from a mathematical viewpoint. This can lead
to it being ignored, which is a pity, as even a simple appreciation of it can result in a reduction in
haphazard attempts to control it, with a consequent saving in wasted time and degraded performance.
What is variation?
When a process is executed repeatedly, its outputs are seldom identical. For example, when a gun is
successively fired at a target, as in Fig. 1, the bullets will not all pass through the same hole.
Causes of variation
Variation in process output is caused by variations within the process. These may be one or more of:
1. Differing actions within the process.
2. Differing effects within the process.
3. Differing inputs to the process.
As an example for each of these conditions, the variation in the placement of the bullet holes in the
target may be affected by:
1. The gun being held or used differently.
2. Wear in the hammer mechanism causing the shell to be struck differently.
3. The bullets being of slightly differing shape or weight.
Thus, even if the first point is eliminated by putting the gun in a clamp and firing it remotely, the bullets
will still not all hit the target in the identical position.
The reasons why variation occurs can be divided into two important classes, known as common and
special causes of variation. These are discussed further below.
Common causes of variation
Within any process there are many variable factors, as indicated above, each of which may vary a
small amount and in a predictable way, but when taken together result in a degree of randomness in
the output, as indicated in the figure below. These seemingly uncontrollable factors are called common
causes of variation.
Common causes of variation can seldom be eliminated by 'tampering' with the process. For example,
consider the effect of simple adjustments to the clamped gun, as in the figure below.
Fig. 2. Tampering
1. The first hole is to the left of center, so the clamp is rotated a little to the right.
2. If the clamp had been left alone, the second bullet would have gone a little to the right of
center, but as it has been moved right, the bullet now goes further to the right. As a reaction to
this, the clamp is rotated somewhat more to the left.
3. The third bullet tends towards the left anyway, so the result is a hole even further to the left.
It can be seen from this that it would have been better not to tinker with the clamp, and that the score
would be more likely to improve if the whole system were understood first and then fundamental
improvements made, such as building a better gun or making better bullets.
Special causes of variation
Special causes of variation are unusual occurrences which come from outside the normal common
causes, for example where a shot goes outside the main grouping, due to someone tripping over the
gunner as the gun is fired, as below:
Measuring variation
Variation is not simple to measure, as by its nature is random and individual events cannot be
predicted. Despite this, a degree of measurement can be achieved by looking at how a number of
measurements group together. Usually these items are selected with sampling methods.
The spread of measurements within a group enables special causes of variation to be distinguished
from common causes of variation. Beyond this, the characteristics of how these random events are
spread out can allow improvements in seemingly random chaos to be simply measured.
Distribution of results
It is common in processes for most measurements to cluster around a central value, with less and less
measurements occurring further away from this center. For example, the distribution of holes across
the target will gradually spread out from a central, most common placement, as below:
With a single die, the distribution is rectangular, as there is one, equally likely way of achieving each
number. With two dice, the distribution becomes triangular, as although there is only one way of
averaging one (two ones), there are six ways of averaging the central value of 3.5 (1-6, 2-5, 3-4, 4-3,
5-2 and 6-1).
With three dice, the distribution becomes curved, and with four dice it is markedly bell-shaped, as
there is still only one way of averaging one, but there are four ways of averaging 1.25 (three 1s and a
2) and so on up to 147 ways of averaging 3.5! A key use of this effect is that a predictable Normal
distribution can be produced by measuring samples in groups of as few as four items at a time.
Measuring distribution
The measurements of a process can vary in two different ways, in terms of their centering and
their spread, as illustrated below:
The centering (also called accuracy or central tendency) of a process, is the degree to which
measurements gather around a target value. The spread (also called dispersion or precision) of the
process is the degree of scatter of its output values.
Measuring centering
To measure the centering of a process requires that the center point of the set of results be identified.
The accuracy of the process can then be determined by comparing it with target values. There are
three ways of measuring this center point: the mean (or average), the median and the mode (see the
figure below).
Measuring spread
There are two main ways of measuring the degree of spread of a set of measurements: the range and
the standard deviation.
Range
The range of a set of measures is simply the difference between the largest and the smallest
measurement value.
Thus, for example, if you have a set of measures (21, 22, 26, 19, 12, 24, 33) then you first find the
highest measure (33) and subtract the lowest measure (12) to give the range (21).
This is easy to calculate, but there can be several problems with using it:
Special causes of variation can cause an unrealistically wide range.
As more measurements are made, it will tend to increase.
It gives no indication of the data between its values.
Standard deviation
The standard deviation is a number which is calculated using a simple mathematical trick (calculating
the square root of the average of squares) to find an 'average' number for the distance of the majority
of measures from the mean.
The standard deviation is of particular value when used with the Normal distribution, where known
proportions of the measurements fall within one, two and three standard deviations of the mean, as
below.
10