300 ◾ Simple Statistical Methods for Software Engineering
IBM’s Peter Norden (see Box 19.4) favored the Weibull equation over the logis-
tic curve to model software development project cost. Lawrence Putnam promoted
the Weibull curve with shape factor = 2 as a Rayleigh distribution. Putnam used
this in his estimation model SLIM to estimate cost and defect. He went further to
call this distribution the software equation.
The Weibull distribution is by far the world’s most popular
statistical model for life data. It is also used in many other
straight line. He repeats this process n times. I require the probability that
after n of these stretches he is at a distance between r and r + δr from his
starting point.
Rayleigh pointed out that, for large values of n, the answer given by
Rayleigh was
2
2
2
2
e r r
r
nl
−
δ
(19.1)
is actually has the shape of a normal distribution, centered at the origin.
In this equation Rayleigh assumed that the drunkard walks in one dimen-
sion. e model suggests that the drunkard will return to the origin after a
random walk. If we allow two additional dimensions and solve the problem,
a new phenomenon called Rayleigh Flight occurs. e distribution now is
Rayleigh. e drunkard will not return to the origin.
Rayleigh missed Smoluchowski’s 1906 paper on the motion of colloidal
particles, in which he introduces the random flight idea.
A one-dimensional walk is Gaussian. A multidimensional walk is Rayleigh.
Brownian motion in one dimension is Gaussian. e vector sum of Brownian
movements in several dimensions is Rayleigh. Simple processes follow
Gaussian. A combination of several simple processes is Rayleigh.
Software development is due to the combined work of several people and
several processes. Even if the individual processes are Gaussian, the com-
bined result can be the skewed Rayleigh. is being the essential case, can we
expect team results to be normally distributed? Relentless pursuit of normal-
ity in software engineering data is futile.