246
MacKAY
governing its own^activities according to its experience of their relative
success (judged by some general criterion) and diminishes by a kind of
"natural selection" the probabilities of all except those which have been
successful often, enough.
This is not the place to elaborate on engineering aspects of the
application of this principle to the hierarchy of Figure 3 . We must,
however, satisfy ourselves that a trial-and-error process could converge
quickly enough to be useful in the more general case.
It is first clear that if n independent thresholds must each be
set to one of a levels, then up to n log2 a binary choices may be re
quired to change from one configuration of thresholds to another. Even an
error-signal of m bits could guide such a change only after at least
(n/m) log2 a trials.
Clearly, until the higher-order statistical structure of events
begins to be abstracted, the automaton must begin by searching for adaptive
reactions from a very limited repertoire, if the search-process is to con
verge with any speed. It would seem sensible to begin with most thresholds
low, so that in effect the number of degrees of freedom of response is
small, the elements being coupled together in large groups. Spontaneous
activity would then be initially fast and furious, and would of course, in
a richly-interconnected system, include much random suppression of coupling
elements. This dissociative activity would amount to random augmentation
and alteration of the degrees of freedom of the response-system, and any
changes found useful could be statistically favoured- (See page 2 4 4 .)
As soon as simple adaptive activity began to show regularities,
the second-order mechanism of Figure 3 , on the same principle, could begin
to develop simple organizing routines to Improve the average overall
success. At this stage matching-responses would be crude and perhaps not
highly differentiated in their success-value from others; but as experi
ence of success and failure accumulated, and useful second-and higher-
order organizing routines began to develop, the speed of convergence on a
rough matching-response would increase and leave progressively more time
for exploratory development of finer response-structure before the environ
ment changed sufficiently to demand a new response.
It will be noted that this trial process is not organized for
maximum returns on simple betting principles. The response having maximum
likelihood of success, on the information contained in the error-signal,
is certainly most likely to be tried first, if the feedback from the error-
signal on the threshold-configuration is effective. But it will not always
take precedence; and any of the other responses with finite likelihoods
of success may sometimes be tried first instead, in the spontaneous
sequence of activity guided by the configuration of thresholds.