From Fig. 7.3.2, we can see that the trends of changes on ABS and
ABP are along opposite directions with M—ABS is decreasing while M
increasing, on the contrary, ABP increasing, and eventually, the changes of
ABS and ABP become slight and a balanced state is reached. As indicated
above, we hope to form a better AB with both higher ABS and ABP values to
construct a good searching start for K-centre clustering algorithm. Due to the
inconsistent changes of ABS and ABP with M, we have to choose a tradeoff
between ABS and ABP to ensure the better clustering results. According
to the experimental results, we fi nd the fact that choosing a reasonable M
can guarantee a better AB. The setting of M is an open question in various
applications [19]. In this chapter, we experimentally set M value based on
the size of data set and the cluster number.
7.3.2 Heuristic Clustering Algorithm based on Approximate Backbone
7.3.2.1 Reconstruct the New Searching Space
A new search space, S determined by the AB, is crucial to heuristic clustering
algorithms. Various applications might have different new search space
reconstruction methods. For example, in TSP [16], the new search space is
reconstructed by including all the edges that are not occurred in AB. While
the construction of AB leads to the formation of the fraction of the new search
space, the number of centres within the AB is usually less than the number
of predefi ned clusters in K-centre clustering. In that case, how to select
the rest centres from the local suboptimal solutions is another important
issue. Essentially, a cluster centre that was’nt included in the AB but was
frequently occurred in local suboptimal clustering results is more likely to
be selected to represent a real cluster. Although we can refi ne the new search
space with all the centers occurred in local suboptimal solutions, in order to
reduce the size of the new search space, we therefore select a centre into the
new search space, only if it is a frequent centre. DEFINITION 5 (Frequent
Centre). Given an occurrence threshold β, the local suboptimal clustering
result Z = V
1
, ..., V
M
and the AB a
b
one(Z), for each centre v
i
¢ UZ a
b
one(Z), if
the occurrence frequency of v
i
exceeds β, then v
i
is a frequent centre, where
is a set difference operator and UZ = U
m
= 1
M
V
m
.
We use the example shown in Table 1 to explain the new search space
refi nement. From table 1, we fi nd that there are 14 objects be selected as the
centre of clusters in V
1
, V
2
, V
3
. The AB of this example is a
b
one(V
1
, V
2
, V
3
) =
109, 128, 130, 262, 252, 412, 432, if the occurrence frequency of v
i
exceeds β,
then it is a frequent centre and will be included in S. According to the big
valley phenomenon, here we set the parameter β = M
*
80% = 2.4 (for this
example M=3), and we obtain the three additional centers 128, 262, 412.
Advanced Clustering Analysis 167