76 ◾ Simple Statistical Methods for Software Engineering
ere is a dierence between goal and need.
Goal: the end toward which eort is directed
Need: a condition requiring supply or relief
Goals are complex; they consist of interconnected layers and are influenced by
“personal goals” and “self-efficacy.” Goals have multiple dimensions, are likely to
drive discussions of metric design into inconclusive divergence, and are correspond-
ingly undesirable to rely on for deriving metrics. Needs have a simple structure and
are well defined with greater degree of objectivity in software development projects.
Meaning of Metrics: Interpreting Metric Data
We define a metric by defining the relationship the metric has with raw data. Metric
definition inheres the meaning of that metric. e defining equation is more signifi-
cant than the name we give to a metric. Names could mislead, but definitions do not.
It is good to recall metric definitions, even if obvious, before beginning interpretation.
Having recalled the metric definition, we now look at metric data. It is better to
work with a data table that shows the raw data and the derived metric in separate
columns. Visibility into basic observations helps in getting a detailed understand-
ing of metrics. In one column, we can have the time stamp of data. If data fall into
categories, it is better to include the category name in one column. Occasionally, we
may have to allocate additional columns to accept further categorization schemes.
Next we should construct a box plot of metric data and analyze the statisti-
cal nature of data. e box, whiskers, and outliers seen in the box plot must be
understood and explained. Questions regarding the stability and the reasonable
dispersion of the metric must be addressed. We can support this enquiry with a
descriptive statistics analysis.
Data have intrinsic meaning that can be seen by applying statistical, engi-
neering, and management perspectives.
Box 5.2 olyMPic RunneR
Time in a school’s final sprint competition is measured using an analog stop-
watch. In interschool competitions at the state level, time is measured more
precisely using a digital stopwatch. In Olympic sprints, time is measured
by laser systems controlled by computers to the precision of a millisecond.
As the capability of running improves, the precision of measurement also
is improved. Likewise, when software engineering practices become more
mature, metric capability also is improved. e quality of metric data also is
improved. Metrics and maturity go together.