Bayesian treatment of neural networks

To set the neural network learning in a Bayesian context, consider the error function Bayesian treatment of neural networks for the regression case. It can be treated as a Gaussian noise term for observing the given dataset conditioned on the weights w. This is precisely the likelihood function that can be written as follows:

Bayesian treatment of neural networks

Here, Bayesian treatment of neural networks is the variance of the noise term given by Bayesian treatment of neural networks and Bayesian treatment of neural networksrepresents a probabilistic model. The regularization term can be considered as the log of the prior probability distribution over the parameters:

Bayesian treatment of neural networks

Here, Bayesian treatment of neural networks is the variance of the prior distribution of weights. It can be easily shown using Bayes' theorem that the objective function M(w) then corresponds to the posterior distribution of parameters w:

Bayesian treatment of neural networks

In the neural network case, we are interested in the local maxima of Bayesian treatment of neural networks. The posterior is then approximated as a Gaussian around each maxima Bayesian treatment of neural networks, as follows:

Bayesian treatment of neural networks

Here, A is a matrix of the second derivative of M(w) with respect to w and represents an inverse of the covariance matrix. It is also known by the name Hessian matrix.

The value of hyper parameters Bayesian treatment of neural networks and Bayesian treatment of neural networks is found using the evidence framework. In this, the probability Bayesian treatment of neural networks is used as a evidence to find the best values of Bayesian treatment of neural networks and Bayesian treatment of neural networks from data D. This is done through the following Bayesian rule:

Bayesian treatment of neural networks

By using the evidence framework and Gaussian approximation of posterior (references 2 and 5 in the References section of this chapter), one can show that the best value of Bayesian treatment of neural networks satisfies the following:

Bayesian treatment of neural networks

Also, the best value of Bayesian treatment of neural networks satisfies the following:

Bayesian treatment of neural networks

In these equations, Bayesian treatment of neural networks is the number of well-determined parameters given by Bayesian treatment of neural networks where k is the length of w.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.119.163.238