Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

Chapter 9
Control Charts for Categorical Processes

Having discussed control charts for count processes in the Sections 8.2 and 8.3, we shall now turn to another type of attributes data process $c09-math-001$ , namely categorical processes, as introduced in Part II. Although now the range (state space) $c09-math-002$ of $c09-math-003$ consists of a finite number $c09-math-004$ of (unordered) categories with $c09-math-005$ , count values will still play an important role since the most obvious way of evaluating categorical data is by counting the occurrences of categories; see also Chapter 6.

For quality-related applications, $c09-math-006$ often describes the result of an inspection of an item, which either leads to the classification $c09-math-007$ for an $c09-math-008$ iff the $c09-math-009$ th item was non-conforming of type $c09-math-010$ , or $c09-math-011$ for a conforming item. A typical example is the one described by Mukhopadhyay (2008), in which a non-conforming ceiling fan cover is classified according to the most predominant type of paint defect, say “poor covering” or “bubbles”. Another field of application is the monitoring of network traffic data with different types of audit events; see Ye et al. (2002) for details. For monitoring such categorical processes, we shall consider two general strategies: if the process evolves too fast to be monitored continuously, then segments are taken from the process at selected times. For each of the resulting samples, a statistic is computed and plotted on a control chart. Here, it is important to carefully consider the serial dependence within the sample; see Section 9.1 for further details. In other cases, it is possible to continuously monitor the process, but then the serial dependence has to be taken into account between the plotted statistics. Control charts for this scenario are presented in Section 9.2. In both cases, we shall first concentrate on the special case of a binary process (that is, $c09-math-012$ ) and then extend our discussion to the general categorical case.

9.1 Sample-based Monitoring of Categorical Processes

In this section, we assume that the categorical process $c09-math-013$ cannot be monitored continuously. Instead, samples are taken as non-overlapping segments¹ from the process at times $c09-math-014$ , each being of a certain length $c09-math-015$ . Note that we restrict ourselves to a constant segment length $c09-math-016$ for simplicity, but at least the Shewhart-type charts could be directly adapted to varying length $c09-math-017$ by using varying control limits (Montgomery, 2009, Section 7.3.2). The time distance $c09-math-018$ is assumed to be sufficiently large such that we do not need to worry about the serial dependence between the samples; just the serial dependence within the samples. After having collected the segment, a certain type of sample statistic is computed and then plotted on an appropriately designed control chart.

9.1.1 Sample-based Monitoring: Binary Case

In view of its practical importance, let us first focus on the special case of a binary process $c09-math-019$ , with the range coded as $c09-math-020$ as in Example 6.3.2. Having available the binary segment $c09-math-021$ , one commonly determines either the sample sum $c09-math-022$ (say, counts of non-conforming items) or the corresponding sample fraction of ‘1’s. Since the sample fraction differs from the count just by a factor $c09-math-023$ , we shall always consider the resulting count process $c09-math-024$ in the sequel. The original binary process $c09-math-025$ is now monitored by monitoring this derived count process $c09-math-026$ . At this point, the fundamental premise of this section should be remembered: although $c09-math-027$ might exhibit serial dependence, due to taking sufficiently distant segments, we shall assume $c09-math-028$ to be serially independent, and hence i.i.d. in its in-control state.

For monitoring $c09-math-029$ (being i.i.d. in its in-control state), any of the concepts discussed in Chapter 8 can be used, it just has to be adapted to the finite range $c09-math-030$ of $c09-math-031$ . This difference sometimes manifests itself in the name of the resulting control charts. If the counts are plotted directly on a Shewhart-type chart, for instance, it is no longer referred to as a $c09-math-032$ chart, but as an $c09-math-033$ chart; see also the discussion in Section 8.2.1, as well as Montgomery (2009). If the sample fractions are plotted, it is called a $c09-math-034$ chart. Despite this different terminology, the $c09-math-035$ chart still has two control limits $c09-math-036$ satisfying $c09-math-037$ , which includes the one-sided charts as boundary cases (upper-sided if $c09-math-038$ ). Also ARLs are computed as before; see (8.3) and (8.4).

Remark 9.1.1.1 (ARL vs. ATS)

The ARL values related to the $c09-math-039$ chart applied to $c09-math-040$ have to be treated with some caution. To illustrate this, let us assume that the underlying process $c09-math-041$ refers to the quality of manufactured items, and that we have fixed sampling intervals $c09-math-042$ ; say, $c09-math-043$ . If the chart triggers its first alarm after plotting the $c09-math-044$ th sample statistic $c09-math-045$ (that is, the run length equals $c09-math-046$ ), then the number of manufactured items until this alarm is much larger, given by $c09-math-047$ . Therefore, in such a situation, it is sometimes preferred to look at the average time to signal (ATS), where “time” refers to the original process $c09-math-048$ , not to the number of plotted statistics. In the given example, we have $c09-math-049$ . Note that the ATS is sometimes referred to as the average number of events (ANE) or the average number of observations to signal (ANOS) instead.

Concerning the distribution of the sample counts, the serial dependence structure of the underlying binary process $c09-math-050$ is of importance. If $c09-math-051$ is i.i.d. with $c09-math-052$ (say, the probability of a non-conforming item), then each sample sum $c09-math-053$ is binomially distributed according to $c09-math-054$ (Example A.2.1). So the statistics $c09-math-055$ constitute themselves as an i.i.d. process of binomial counts. But if $c09-math-056$ exhibits serial dependence, in contrast, the distribution of $c09-math-057$ will deviate from a binomial one.

In Deligonul & Mergen (1987), Bhat & Lal (1990) and Weiß (2009f), the case of $c09-math-058$ being a binary Markov chain with success probability $c09-math-059$ and autocorrelation parameter $c09-math-060$ was considered (Example 7.1.3); that is, with the transition matrix given by (7.6). In this case, $c09-math-061$ follows the so-called Markov binomial distribution $c09-math-062$ (which coincides with $c09-math-063$ iff $c09-math-064$ ). While the mean of $c09-math-065$ is not affected by the serial dependence, the variance in particular changes (extra-binomial variation if $c09-math-066$ , see the discussion in the context of Equation (2.3)):

9.1

The pmf is given by (Kedem, 1980, Corollary 1.1)

9.2

for $c09-math-069$ (zero inflation if $c09-math-070$ ; see the discussion in Appendix A.2). If the time distance $c09-math-071$ between successive segments from $c09-math-072$ is sufficiently large, the resulting process of counts $c09-math-073$ can still be assumed to be approximately i.i.d. (note that the correlation between $c09-math-074$ and $c09-math-075$ decays exponentially with $c09-math-076$ ), but with a marginal distribution different from a binomial one. This difference in the distribution of $c09-math-077$ certainly has to be considered when designing a corresponding control chart (Weiß, 2009f).

In addition, advanced control schemes, such as the EWMA or CUSUM charts discussed in Section 8.3, can be used for monitoring $c09-math-078$ . Assuming the counts $c09-math-079$ to be binomially distributed in their in-control state (that is, $c09-math-080$ is assumed to be i.i.d.), Gan (1993) applied the CUSUM scheme described in Section 8.3.1 for process monitoring, while Gan (1990b) used the modified EWMA chart from Section 8.3.3 (with rounding operation as in (8.20)) for this purpose. The application of such an EWMA chart to the case of $c09-math-081$ being a binary Markov chain – that is, with $c09-math-082$ following the Markov binomial distribution – was considered by Weiß (2009f). The computation of ARLs is done in the same way as described in Sections 8.3.1 and 8.3.3, by just using the pmf of the (Markov) binomial distribution. A completely different approach for a sample-based monitoring of an underlying binary Markov chain was recently proposed by Adnaik et al. (2015), who did not compute the sample sums $c09-math-083$ as the charting statistics, but instead used some kind of likelihood ratio statistic for each of the successive segments. Finally, Höhle (2010) proposed a log-LR CUSUM chart for monitoring the $c09-math-084$ under the assumption that these counts follow a marginal (beta-)binomial logit regression model; see Section 7.4.

Example 9.1.1.2 (Medical diagnoses)

In Weiß & Atzmüller (2010), several binary time series concerning the diagnostic behavior of different examiners were discussed (abdominal ultrasound examinations; time passes according to the arrival of patients). Each of these time series refers to a particular diagnosis and a particular examiner, where the value ‘1’ states that the considered diagnosis cannot be excluded for the current patient; that is, suitable countermeasures are required. As explained in Weiß & Atzmüller (2010), the time series are expected to stem from a serially independent process, since the patients usually arrive independently of each other at the examiner, but the marginal distribution may change over time due to a change in the diagnostic behavior of the examiner (due to, say, learning). The aim is to monitor the diagnostic behavior of the considered examiners in view of detecting such changes.

Obviously, the above processes evolve sufficiently slowly in time that continuous process monitoring is feasible; we shall consider such approaches in Example 9.2.1.2. Here, just as an exercise, we shall take segments from one of these processes to illustrate the application of some of the control charts discussed above. We consider the time series concerning the diagnosis “M165” (fatty liver) of examiner “Mf542a5” (Weiß & Atzmüller, 2010, Example 3.1), and we focus on the data collected between times 251 and 1178, since these were justified to be reasonable Phase-I data.

So, altogether, a binary time series of length $c09-math-085$ is available, which is assumed to stem from an i.i.d. model. We decide on a sampling interval $c09-math-086$ (Remark 9.1.1.1) and sample size $c09-math-087$ , so altogether 32 counts $c09-math-088$ are obtained, which should follow a unique binomial distribution, $c09-math-089$ . In fact, an inspection of a time series plot as well as of an autocorrelation plot give no reason to doubt an i.i.d.-assumption for $c09-math-090$ ; the resulting moment estimate for $c09-math-091$ equals about 0.241.

For this fitted binomial model, we now develop designs for the upper $c09-math-092$ chart and the upper CUSUM chart, to detect a possible future increase in the number of “fatty liver” diagnoses. For the $c09-math-093$ chart, ARLs are computed according to (8.3) and (8.4), and the design $c09-math-094$ (and $c09-math-095$ ) leads to $c09-math-096$ (Remark 9.1.1.1). Applying this chart in Phase I to the sample counts, no alarm is triggered, which confirms the in-control assumption.

For the CUSUM chart, ARLs are computed by the MC approach, as described in Section 8.3.1, and comparable zero-state in-control ATS values are obtained for the designs $c09-math-097$ and $c09-math-098$ , with $c09-math-099$ (no FIR) in both cases: $c09-math-100$ and $c09-math-101$ , respectively. The steady-state out-of-control performance of the charts is analyzed in Figure 9.1. While the $c09-math-102$ chart is difficult to compare with the CUSUM charts, since it is visibly more robust in its in-control state, it is easy to decide between the CUSUMs: the design $c09-math-103$ leads to clearly better $c09-math-104$ performance.

Note that the worse CUSUM, $c09-math-105$ , results in a very similar decision rule to the $c09-math-106$ chart: values for $c09-math-107$ $c09-math-108$ always lead to an alarm, and additionally, any pattern of the form “ $c09-math-109$ ” (with arbitrary number of ‘4’). So this particular CUSUM chart just extends the considered $c09-math-110$ chart by some kind of runs rule. This is illustrated by the simulated Phase-II sample ( $c09-math-111$ shifts to $c09-math-112$ at $c09-math-113$ ) in Figure 9.2. While the CUSUM design $c09-math-114$ clearly performs best in this particular example (not shown; first alarm at sample $c09-math-115$ ), it is interesting to compare the two remaining charts: both trigger alarms at samples $c09-math-116$ caused by sample counts of 6, but the CUSUM in (b) has an additional alarm at $c09-math-117$ . The latter is caused by the sample counts $c09-math-118$ , which corresponds to the above pattern with zero ‘4’s.

Figure 9.1 $c09-math-119$ chart and CUSUM charts of Example 9.1.1.2: $c09-math-120$ against $c09-math-121$ .

Figure 9.2 Simulated sample counts for Example 9.1.1.2: (a) upper-sided $c09-math-122$ chart with $c09-math-123$ and (b) CUSUM chart $c09-math-124$ .

9.1.2 Sample-based Monitoring: Categorical Case

Let us return to the truly categorical case; that is, where the range of $c09-math-125$ consists of more than two states, $c09-math-126$ with $c09-math-127$ . As in Section 6.2, we denote the time-invariant marginal probabilities by $c09-math-128$ . If the number of different states, $c09-math-129$ , is small, it would be feasible to monitor the process with $c09-math-130$ simultaneous binary charts; say, by using the $c09-math-131$ -tree method described in Duran & Albin (2009). However, here we shall concentrate on charting procedures in which the information about the process is comprised in a univariate statistic: After having taken a segment from the process, we first compute the resulting frequency distribution as a summary, which then serves as the base for deriving the statistic to be plotted on the control chart. To keep it consistent with the binary case from Section 9.1.1, we concentrate on absolute frequencies: $c09-math-132$ with $c09-math-133$ being the absolute frequency of the state $c09-math-134$ in the sample $c09-math-135$ , such that $c09-math-136$ . With $c09-math-137$ denoting the binarization of $c09-math-138$ , we may express $c09-math-139$ .

If the underlying categorical process $c09-math-140$ is even serially independent (so altogether i.i.d.), then the distribution of each $c09-math-141$ is a multinomial one; see Example A.3.3. This case was considered by Marcucci (1985) and Mukhopadhyay (2008), among others, who proposed plotting Pearson's $c09-math-142$ -statistic on a control chart,

9.3

where $c09-math-144$ refers to the in-control values of the categorical probabilities. So in the in-control case, the process $c09-math-145$ is i.i.d. with a marginal distribution that might be approximated by a $c09-math-146$ -distribution (see Horn (1977) concerning the goodness of this approximation). As an alternative, Weiß (2012) proposed using a control statistic that measures the relative change of categorical dispersion. As the underlying categorical dispersion measure, the Gini index (6.1) might be used. If $c09-math-147$ is i.i.d., following the in-control model, then

9.4

is approximately normally distributed, with a mean of $c09-math-149$ and variance $c09-math-150$ ; see Section 6.2. These approximate distributions for $c09-math-151$ or $c09-math-152$ may be used during chart design. But since the quality of these approximations is often rather bad (note that $c09-math-153$ is often quite small and that the control limit is usually chosen as an extreme quantile), the final design and evaluation of the ARL performance requires simulations in practice.

Remark 9.1.2.1 (ARL simulation)

The standard approach for ARL simulation is to generate the process for a certain number of replications (usually between 10,000 and 1 million), and to apply the considered chart to each of these process replications. The $c09-math-154$ th process is stopped when the first alarm occurs, and the time until this alarm (the actual run length $c09-math-155$ ) is stored. The mean $c09-math-156$ of all these run lengths is the approximation to the true ARL.

The above charts $c09-math-157$ or $c09-math-158$ , however, are Shewhart charts for i.i.d. statistics. So we just need to know the marginal distribution of $c09-math-159$ or $c09-math-160$ , then ARLs are computed according to (8.3) and (8.4). Therefore, here, we can also simulate just one very long time series and then compute the control limit as an appropriate quantile from the simulated values. The ARL, in turn, is computed from the relative frequency of observations violating the control limits.

Example 9.1.2.2 (Paint defects)

We pick up the application scenario described by Mukhopadhyay (2008), where manufactured ceiling fan covers were checked for possible paint defects. If the fan cover has no paint defect, then it is conforming and categorized as ‘ok’. Otherwise, if non-conforming, it is classified according to the most predominant defect, with the $c09-math-161$ defect categories:

poor covering (‘pc’)
overflow (‘of’)
patty defect (‘pt’)
bubbles (‘bb’)
paint defect (‘pd’)
buffing defect (‘bd’).

In Mukhopadhyay (2008), an underlying i.i.d. process is assumed, such that the $c09-math-162$ are multinomially distributed. The reported data example shows varying sample sizes (varying around $c09-math-163$ ), and the overall observed marginal frequencies are

For illustration, we assume this distribution as the in-control distribution, and we take segments of constant length $c09-math-164$ . Since we do not specify a certain sampling interval, we just look at the ARLs in this example.

In quality-related applications such as this, the probability $c09-math-165$ of a conforming unit is typically large, while the defect probabilities are much smaller. So in the sense of Section 6.2, the in-control distribution $c09-math-166$ is expected to exhibit low dispersion. The aim is to detect a deterioration in the quality, which goes along with a decrease of $c09-math-167$ and hence (often) with an increase in dispersion. Therefore, we use an upper-sided version of the $c09-math-168$ chart, besides the $c09-math-169$ chart, which is upper-sided anyway. For chart design, we first try to use the asymptotic distributions of $c09-math-170$ and $c09-math-171$ , respectively, and invert (8.3) to find the control limit for a specified value of $c09-math-172$ . To check the quality of this asymptotic approximation, we then determine the actual in-control ARL by simulations (10,000 replications; see Remark 9.1.2.1). The results are shown in Table 9.1.

Table 9.1 $c09-math-173$ performance of $c09-math-174$ and $c09-math-175$ chart with approximate chart design

Chart	$c09-math-176$ chart			$c09-math-177$ chart
Design- $c09-math-178$	50	100	200	50	100	200
Simulated $c09-math-179$	39.9	69.5	117.1	56.1	129.4	268.5

Obviously, none of the approximations is satisfactory, since we get only roughly the intended in-control behavior. So a further simulation-based fine-tuning of the control limit is necessary for both charts.

For illustration purposes, let us further investigate the designs $c09-math-180$ for the $c09-math-181$ chart, and $c09-math-182$ for the $c09-math-183$ chart, both leading to $c09-math-184$ . Two out-of-control scenarios are considered:

the probability of poor covering is increased by a factor $c09-math-185$ ; that is, $c09-math-186$ , while all other probabilities are uniformly decreased
the probability of there being no paint defect is decreased by a factor $c09-math-187$ ; that is, $c09-math-188$ , while all other probabilities are uniformly increased.

The resulting simulated ARL performance is shown in Figure 9.3; the non-smoothness is caused by simulation error. In both cases, the Gini chart is much more sensitive to small shifts $c09-math-189$ , while the $c09-math-190$ chart performs equally well (scenario 2) or even better (scenario 1) for larger shifts $c09-math-191$ . This behavior can be explained to some extent by looking at the “true” values of the monitored statistics; that is, at $c09-math-192$ and $c09-math-193$ , respectively, as functions of $c09-math-194$ . Both statistics increase with increasing deviation, but the Pearson in a convex way and the Gini in a concave one.

Figure 9.3 Simulated ARL performances for (a) Scenario 1 and (b) Scenario 2; see Example 9.1.2.2.

Finally, Figure 9.4 shows the $c09-math-195$ chart and $c09-math-196$ charts obtained for a simulated sample of length 100, where the first 20 samples stem from the in-control model, and the remaining 80 samples stem from the out-of-control Scenario 1 with $c09-math-197$ (that is, $c09-math-198$ ). Both charts trigger several alarms after the change point, with the first alarm for the $c09-math-199$ chart at sample 34. Note that the “true” Gini value shifts from 1 to about 1.067 after the change point; that is, the true (Gini) dispersion is increased by about 6.7%.

A sample-based approach is also possible if $c09-math-200$ is serially dependent. But then, certainly, the distributions of $c09-math-201$ and hence of $c09-math-202$ and $c09-math-203$ will deviate from those given above for the i.i.d. case. If, for instance, $c09-math-204$ is an NDARMA process (Section 7.2), then the effect on the distribution can be quantified in terms of the constant $c09-math-205$ from (7.12). Considering the complete vector $c09-math-206$ , the covariance matrix $c09-math-207$ from Example A.3.3 is asymptotically inflated by the factor $c09-math-208$ (Weiß, 2013b). For the Gini statistic $c09-math-209$ , variance and mean change (approximately) according to (7.13) and (7.14), respectively, while Weiß (2013b) showed that $c09-math-210$ is approximately $c09-math-211$ -distributed.

Image described by caption/surrounding text. — **Figure 9.4** (a) $c09-math-220$ chart and (b) $c09-math-221$ chart applied to simulated sample; see Example 9.1.2.2.

Höhle (2010) proposed a log-LR CUSUM chart if the $c09-math-222$ stem from a marginal multinomial logit regression model; see Section 7.4.

9.2 Continuously Monitoring Categorical Processes

If the process evolves sufficiently slowly, then it is possible to implement a continuous monitoring approach of the categorical process $c09-math-223$ . So as a new categorical observation $c09-math-224$ arrives, the next control statistic is computed and plotted on the control chart.

9.2.1 Continuous Monitoring: Binary Case

As in Section 9.1.1, let us first focus on the special case of a binary process $c09-math-225$ . Perhaps the best-known approach for (quasi) continuously monitoring a binary process is by plotting run lengths $c09-math-226$ on an appropriately designed chart:

9.5

As an example,

The monitoring of such runs is a reasonable approach, especially for high-quality processes where $c09-math-228$ is very small. Small $c09-math-229$ implies that long runs are observed, but if $c09-math-230$ increases (deterioration of quality), the runs become shorter (and vice versa). So the detection of a decrease in the run lengths is often particularly relevant. Having fixed a truly two-sided design $c09-math-231$ , we stop monitoring with the $c09-math-232$ th run if either $c09-math-233$ for the first time, or if already $c09-math-234$ zeros have been observed since the last run (because then, $c09-math-235$ will necessarily become larger than $c09-math-236$ , but we do not need to wait until the run is finished).

Concerning performance evaluation, Remark 9.1.1.1 should be remembered. The ARL – that is, the average number of plotted runs until the first alarm – would be quite misleading, since a single run might comprise a rather large number of original observations. Therefore, the ATS is clearly preferable as a measure of chart performance.

If $c09-math-237$ is i.i.d. (Bourke, 1991; Xie et al., 2000; Weiß, 2013c), then $c09-math-238$ is also i.i.d. according to the shifted geometric distribution (Example A.1.5). The ATS can be computed according to (Weiß, 2013c)

9.6

Illustration of ATS performance of runs chart against Π. — **Figure 9.5** ATS performance of runs chart against $c09-math-240$ ; see Example 9.2.1.1.

Example 9.2.1.1 (ATS performance of runs chart)

Let us apply formula (9.6) to evaluate the performance of some runs chart designs. Trying to find (nearly) ATS-unbiased designs (see Example 8.2.1.1 and also Xie et al. (2000)) with an $c09-math-241$ around 600–700 for the in-control values $c09-math-242$ (high-quality processes), we ultimately obtain the designs $c09-math-243$ ( $c09-math-244$ , $c09-math-245$ ), $c09-math-246$ ( $c09-math-247$ , $c09-math-248$ ), and $c09-math-249$ ( $c09-math-250$ , $c09-math-251$ ). The plot of the respective $c09-math-252$ functions against $c09-math-253$ in Figure 9.5 confirms the designs to be quasi unbiased. It should be noted, however, that these designs do not lead to unbiased ARL performances. However, the practical meaning of ARLs for a runs chart (with values ranging between 0 and 25 in the present examples) is questionable anyway.

For illustration, a binary sequence and the respective runs were simulated, where the simulation stopped after having completed the 100th run (which happened after 3454 binary observations). The data leading to the first 20 runs stem from the in-control model; that is, they are i.i.d. with $c09-math-254$ . Afterwards, the defect probability was shifted to $c09-math-255$ (deterioration of quality). The sequence of runs is plotted on a control chart with design $c09-math-256$ , the in-control performance of which is described by $c09-math-257$ and $c09-math-258$ , and the out-of-control performance by $c09-math-259$ and $c09-math-260$ (note that runs are shorter on average for $c09-math-261$ than for $c09-math-262$ , so the ATS is more strongly affected by such a change). The resulting runs chart is plotted in Figure 9.6, using a log scale to improve the readability.

The first alarm is triggered for the 6th run; that is, it is a false alarm (up to this event, altogether 467 binary observations had been generated). The alarm is caused by violating the upper limit ( $c09-math-263$ ); that is, it would indicate an improvement of the quality. With $c09-math-264$ , it is not surprising to observe such a false alarm within the first 20 runs (this in-control period altogether comprises 1281 binary observations).

Figure 9.6 Runs chart applied to simulated data; see Example 9.2.1.1.

The next alarm is a true alarm, which is caused by the 35th run $c09-math-265$ (that is, with the 1621th binary observations), and indicates a deterioration of quality. So the delay in detecting the out-of-control situation is 15 runs ( $c09-math-266$ ) or 340 binary observations ( $c09-math-267$ ), respectively.

The monitoring of runs is straightforwardly extended to the Markov case; see Blatterman & Champ (1992) and Lai et al. (2000). The runs $c09-math-268$ from a binary Markov chain $c09-math-269$ according to Example 7.1.3 are still serially independent, but their distribution is no longer shifted geometric. While $c09-math-270$ has to be treated separately, for $c09-math-271$ with $c09-math-272$ , we obviously have

If ‘1’s are observed more frequently, the runs become quite short on average. In such a case, the CUSUM procedure proposed by Bourke (1991) to monitor the run length in $c09-math-273$ is more appropriate. This geometric CUSUM control chart is essentially equivalent to the Bernoulli CUSUM control chart, which was proposed by Reynolds & Stoumbos (1999) for an i.i.d. binary process $c09-math-274$ , and which was extended to the case of a binary Markov chain, as in Example 7.1.3, by Mousavi & Reynolds (2009). These charts are constructed in an analogous way to (8.15): the CUSUM chart is defined by accumulating the contributions to the log-likelihood ratio (log-LR) at times $c09-math-275$ . The contribution by the $c09-math-276$ th observation equals

where $c09-math-277$ refers to the relevant out-of-control parameter value of $c09-math-278$ , while $c09-math-279$ represents the in-control value. In the Markov case (Example 7.1.3; see also Remark 8.3.2.2), $c09-math-280$ is computed as before, while

An upper-sided CUSUM chart (we restrict to this case, since usually increases in $c09-math-281$ are to be detected) can now be constructed analogously to (8.13) by defining

9.7

In the i.i.d. case, the CUSUM (9.7) might be rewritten in the form

9.8

Note that the plotted statistics of these CUSUM charts go along with the observations, so $c09-math-284$ . To allow for an exact ARL computation with the MC approach (Section 8.2.2), $c09-math-285$ can be required to take the form $c09-math-286$ with an $c09-math-287$ (Reynolds & Stoumbos, 1999); a similar strategy is proposed by Mousavi & Reynolds (2009) for the case of $c09-math-288$ being a binary Markov chain. In this case (with $c09-math-289$ being a multiple of $c09-math-290$ ), the resulting transition matrices $c09-math-291$ for the MC approach are sparse matrices (see Section 8.3.2), since only a few combinations $c09-math-292$ are possible at all for $c09-math-293$ . We have

Note that a lower-sided CUSUM chart can be constructed in an analogous way to (9.8). For a log-LR CUSUM with respect to an underlying logit regression model (Section 7.4), see Höhle (2010).

Example 9.2.1.2 (CUSUM chart for diagnosis data)

Let us return to the binary time series $c09-math-294$ from Example 9.1.1.2, for the diagnosis of “fatty liver” by examiner “Mf542a5” (Weiß & Atzmüller, 2010). Now investigating the full time series, the SACF plot still indicates serial independence, and the overall mean leads to the estimate $c09-math-295$ for the probability of diagnosing a fatty liver. With a corresponding mean run length of only about 4.14, a runs chart is not appropriate. So we shall concentrate on the CUSUM chart (9.8) for the fitted i.i.d. model.

To get an idea of reasonable values for $c09-math-296$ , we compute $c09-math-297$ for $c09-math-298$ and $c09-math-299$ , which leads to about $c09-math-300$ , respectively. So we choose $c09-math-301$ (which should be better for small shifts) and $c09-math-302$ (which should be better for somewhat larger shifts). Trying to obtain in-control zero-state ARLs close to the $c09-math-303$ values from Example 9.1.1.2 (note that for the considered CUSUM, we have $c09-math-304$ ), we ultimately end up with the designs $c09-math-305$ ( $c09-math-306$ ) and $c09-math-307$ ( $c09-math-308$ ). The corresponding transition matrices for the MC approach have only 105 and 30 non-zero entries, respectively. The steady-state ARL performance of these designs is illustrated by Figure 9.7, with the first design indeed being more sensitive to small shifts.

As an illustration, we now apply these two CUSUMs in Phase II. For this purpose, we use the remaining 449 observations from Example 3.1 in Weiß & Atzmüller (2010), which were collected after an eight-month break. Both CUSUM designs do not lead to an alarm (design $c09-math-309$ is shown in Figure 9.8), so the examiner seems to remain in the in-control state that had previously been identified.

Scheme for CUSUM charts. — **Figure 9.7** CUSUM charts of Example 9.2.1.2: $c09-math-310$ against $c09-math-311$ .

**Figure 9.7** CUSUM charts of Example 9.2.1.2: $c09-math-310$ against $c09-math-311$ .

**Figure 9.8** CUSUM chart $c09-math-312$ for fatty liver data; see Example 9.2.1.2.

We conclude by pointing out another approach for continuously monitoring a binary process: the EWMA chart discussed in Section 8.3.3, which was applied to binary processes by Yeh et al. (2008) and Weiß & Atzmüller (2010), among others.

9.2.2 Continuous Monitoring: Categorical Case

Generally, while it is quite natural to check for runs in a binary process, it is more difficult to define a run for the truly categorical case in a reasonable way. One possible solution was discussed in Weiß (2012), where one waits for a segment $c09-math-313$ of length $c09-math-314$ , where $c09-math-315$ is taken from a specified subset $c09-math-316$ . But as pointed out in Weiß (2012), waiting times for completely different types of patterns might also be relevant, depending on the actual application scenario. Because of this ambiguity, we shall not further consider the monitoring of runs in a categorical process here.

Instead, we follow the path of Section 9.2.1 and consider CUSUM charts derived from the log-likelihood ratio approach. So let $c09-math-317$ denote again the in-control value of the marginal distribution $c09-math-318$ , and let $c09-math-319$ be the relevant out-of-control value. A CUSUM scheme for the case of an underlying i.i.d. process was proposed by Ryan et al. (2011). In this case, the contribution to the log-LR at time $c09-math-320$ , $c09-math-321$ , can be expressed as either

9.9

where the latter version uses the binarization $c09-math-323$ of $c09-math-324$ . The log-LR approach also applies to serially dependent categorical processes. As an example, in analogy to the work by Mousavi & Reynolds (2009) concerning a binary Markov chain (see Section 9.2.1), we consider a categorical Markov chain. Then, $c09-math-325$ is computed as before, where

If we are concerned with the particular case of a DAR(1) process as in Examples 7.2.2 and 9.1.2.3, it then follows that

9.10

The log-LR-based CUSUM statistic at time $c09-math-327$ is defined as before, by the recursion

9.11

An alarm is triggered once $c09-math-329$ violates the upper control limit $c09-math-330$ for the first time. For a log-LR CUSUM with respect to an underlying logit regression model (Section 7.4), see Höhle (2010).

Example 9.2.2.1 (CUSUM chart for paint defects)

Let us continue Example 9.1.2.2, where we monitored the quality of manufactured ceiling fan covers as described by Mukhopadhyay (2008). The possible states $c09-math-331$ are either ‘ok’ ( $c09-math-332$ ) or one of the $c09-math-333$ defect categories: ‘pc’, ‘of’, ‘pt’, ‘bb’, ‘pd’, ‘bd’ ( $c09-math-334$ ). The in-control distribution of the resulting i.i.d. process $c09-math-335$ equals

Two out-of-control scenarios were considered in Example 9.1.2.2. Scenario 1 assumes the probability of poor covering ( $c09-math-336$ ) to be increased by the factor $c09-math-337$ ; that is, $c09-math-338$ , while all other probabilities are decreased according to $c09-math-339$ . Hence, (9.9) becomes

and a slightly modified version of (9.11) (see (9.8)) is given by

So Scenario 1 just leads to a type of Bernoulli CUSUM chart (Scenario 2 from Example 9.1.2.2 could be treated in an analogous way by using $c09-math-340$ and $c09-math-341$ otherwise). If we want to compute the ARLs exactly by the MC approach (Section 8.2.2), we can again choose $c09-math-342$ in the form $c09-math-343$ with an $c09-math-344$ (Reynolds & Stoumbos, 1999), otherwise, simulations are required (Remark 9.1.2.1). For more complex out-of-control scenarios $c09-math-345$ , the categorical CUSUM chart (9.11) differs from a simple Bernoulli CUSUM, but following the approach described in Appendix A of Ryan et al. (2011), an adaption of the MC approach is still possible.

Figure 9.9 CUSUM charts of Example 9.2.2.1: $c09-math-346$ against $c09-math-347$ .

For illustration, we continue with Scenario 1. As in Example 9.2.1.2, we compute the value for $c09-math-348$ for some reasonable shift sizes – say $c09-math-349$ – which imply choosing $c09-math-350$ or $c09-math-351$ . Aiming at an in-control zero-state ARL around 1000, we find the designs $c09-math-352$ ( $c09-math-353$ ) and $c09-math-354$ ( $c09-math-355$ ). Both charts show nearly the same steady-state ARL performance with respect to changes according to Scenario 1 (see Figure 9.9), with slightly lower ARLs for the first design. For instance, if $c09-math-356$ , we have $c09-math-357$ vs. $c09-math-358$ . Applied to a simulated sample path, where the first 100 observations follow $c09-math-359$ and the remaining 400 follow $c09-math-360$ with $c09-math-361$ , both charts look very similar, with chart 2 (shown in Figure 9.10) being slightly faster in detecting the process change (the first alarm is triggered at time $c09-math-362$ vs. $c09-math-363$ ).

Figure 9.10 CUSUM chart $c09-math-364$ for simulated sample; see Example 9.2.2.1.

While the designed CUSUM chart does well for a change in the anticipated direction, we may get bad performance in a misspecified shift scenario; see the discussion in Ryan et al. (2011). Imagine, for instance, a modification of Scenario 1, where the positive shift does not occur for poor covering (‘pc’) but for bubbles (‘bb’). In this case, the value of $c09-math-365$ would be decreased compared to $c09-math-366$ ; that is, the designed CUSUM would even become more robust. So if flexibility concerning the out-of-control scenario is required, one has to apply different (Bernoulli) CUSUM charts simultaneously.

An EWMA control chart for the monitoring of a categorical process is proposed by Ye et al. (2002).

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for Chapter 9: Control Charts for Categorical Processes

Create new playlist

Sign In