among the objects, and capture the common characteristics of data
distribution via machine intelligence methods. For example, clustering
analysis is used to partition data objects into various groups unknown
beforehand based on the mutual distance or similarity between them.
The criterion of such partition is to meet the optimal condition that
the objects within the same group are close to each other, while the
objects from different groups should be separated far enough. Topic
modeling is a newly emerging descriptive learning method to detect
the topical coherence with the observations. Through the adjustment
of the statistical model chosen for learning and comparison between
the observation and model derivation, we can identify the hidden topic
distribution underlying the observations and associations between
the topics and the data objects. In this way all the objects are treated
equally and an overall and statistical description is derived from the
machine learning process. As they mainly rely on the computational
power of machines without human interactions, sometimes we also
call them unsupervised approaches.
• Predictive approach: This kind of approach aims at concluding some
operational rules or regulations for prediction. By generalizing the
linkage between the outcome and observed variables, we can induce
some rules or patterns of classifi cations and predictions. These rules
help us to predict the unknown status of new targeted objects or
occurrence of specifi c results. To accomplish this, we have to collect
suffi cient data samples in advance, which have been already labeled
with the specifi c input labels, for example, the positive or negative in
pathological examination or accept and reject decision in bank credit
assessment. These approaches are mainly developed in the domain of
machine learning such as Support Vector Machine (SVM), decision tree
and so on. The learned results from such approaches are represented
as a set of reasoning conditions and stored as rule to guide the future
prediction and judgment. One distinct feature of this kind approaches is
the presence of labeled samples beforehand and the classifi er are trained
upon the training data, so it is also called supervised approaches (i.e.,
with prior knowledge and human supervision). Predictive approaches
account for majority of analytical tasks in real applications due to its
advantage for future prediction.
• Evolutionary approach: The above two kinds of approaches are often
used to deal with the static data, i.e., data collected is restricted within
a specifi c time frame. However, with the huge refl ux of massive data
available in a distributed and networked environment, the dynamics
becomes a challenging characteristic in data mining research. This
calls for evolutionary data mining algorithms to deal with the change
of temporal and spatial data within the database. The representative
Introduction 9