Algorithm 5.1: Hunt’s algorithm
(1) If D
t
contains records that belong the same class y
t
, then t is a leaf node labeled
as y
t
(2) If D
t
is an empty set, then t is a leaf node labeled by the default class, y
d
(3) If D
t
contains records that belong to more than one class, use an attribute test
to split the data into smaller subsets.
5.3 Bayesian Network and Classifi cation
5.3.1 Bayesian Network
Bayesian network theory can be thought of as a fusion of incidence
diagrams and Bayes’ theorem. A Bayesian network, or belief network, shows
conditional probability and causality relationships between variables. For
example, a Bayesian network could represent the probabilistic relationships
between diseases and symptoms. Given the symptoms, the network can
be used to compute the probabilities of the presence of various diseases.
The probability of an event occurring given that another event has already
occurred is called conditional probability. The probabilistic model is described
qualitatively by a directed acyclic graph, or DAG. The vertices of the graph,
which represent variables, are called nodes. The nodes are represented as
circles containing the variable name. The connections between the nodes
are called arcs, or edges. The edges are drawn as arrows between the
nodes, and represent dependence between the variables. Therefore, any
pair of nodes indicates that one node is the parent of the other so there are
no independent assumptions. Independent assumptions are implied in
Bayesian networks by the absence of a link. Figure 5.3.1 shows an example of
DAG. The node where the arc originates is called the parent, while the node
where the arc ends is called the child. In this case, V
0
is a parent of V
1
and
V
2
, V
2
has parents V
0
and V
1
. Nodes that can be reached from other nodes
are called descendants. Nodes that lead a path to a specifi c node are called
ancestors. For example, V
1
and V
2
are descendants of V
0
, and V
1
is ancestors
of V
2
and V
3
. Since no child can be its own ancestor or descendent, there
are no loops in Bayesian networks. Bayesian networks will generally also
include a set of probability tables, stating the probabilities for the true/false
values of the variables. The main point of Bayesian Networks is to allow
for probabilistic inference to be performed. This means that the probability
of each value of a node in the Bayesian network can be computed when
the values of the other variables are known. Also, because independence
among the variables is easy to recognize since conditional relationships are
clearly defi ned by a graph edge, not all joint probabilities in the Bayesian
system need to be calculated in order to make a decision. Classifi cation
with Bayesian network.
Classifi cation 107