The neuron has a bias $b$, which is summed with the weighted inputs to form the net input $n$, which can be expressed by
\[
n = \sum_{j=1}^{R} w_j p_j + b = \mathbf{W}\mathbf{p} + b. \qquad (3.1)
\]
Then the net input $n$ passes through an activation function $f$, which generates the neuron output $a$:
\[
a = f(n). \qquad (3.2)
\]
In this study, the log-sigmoid activation function is adopted. It can be given by the following expression:
\[
f(x) = \frac{1}{1 + e^{-x}}. \qquad (3.3)
\]
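As an illustration, here is a minimal NumPy sketch of the log-sigmoid of Equation (3.3) together with its derivative, which reappears in the sensitivity recurrence derived below; the function names are our own.

```python
import numpy as np

def logsig(x):
    """Log-sigmoid activation function, Equation (3.3)."""
    return 1.0 / (1.0 + np.exp(-x))

def logsig_deriv(a):
    """Derivative of the log-sigmoid expressed through its output:
    if a = logsig(n), then f'(n) = a * (1 - a)."""
    return a * (1.0 - a)
```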
Thus, the multi-input FFNN in Fig. 3.2 implements the following equation:
\[
a^2 = f^2\left( \sum_{i=1}^{S} w^2_{1,i} \, f^1\left( \sum_{j=1}^{R} w^1_{i,j} p_j + b^1_i \right) + b^2 \right), \qquad (3.4)
\]
where $a^2$ denotes the output of the overall network, $R$ is the number of inputs, $S$ is the number of neurons in the hidden layer, and $p_j$ indicates the $j$th input. $f^1$ and $f^2$ are the activation functions of the hidden layer and output layer, respectively. $b^1_i$ represents the bias of the $i$th neuron in the hidden layer, and $b^2$ is the bias of the neuron in the output layer. $w^1_{i,j}$ represents the weight connecting the $j$th input to the $i$th neuron of the hidden layer, and $w^2_{1,i}$ represents the weight connecting the $i$th neuron of the hidden layer to the output-layer neuron.
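To make Equation (3.4) concrete, the following is a minimal sketch of the forward pass of the two-layer network. It reuses logsig from above and assumes that both $f^1$ and $f^2$ are log-sigmoid, consistent with the activation adopted in this study; the function name and array shapes are our own conventions.

```python
def two_layer_forward(p, W1, b1, W2, b2):
    """Evaluate Equation (3.4) for the two-layer FFNN of Fig. 3.2.
    p: (R,) input vector; W1: (S, R) hidden weights; b1: (S,) hidden biases;
    W2: (1, S) output weights; b2: (1,) output bias."""
    a1 = logsig(W1 @ p + b1)   # hidden layer: f1(W1 p + b1)
    a2 = logsig(W2 @ a1 + b2)  # output layer: f2(W2 a1 + b2)
    return a2
```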
3.2 STANDARD BACKPROPAGATION ALGORITHM
In order to train the established FFNN, the backpropagation algorithm can be utilized [67]. Considering a multilayer FFNN, such as the three-layer network shown in Fig. 3.2, its operation can be described using the following equation:
\[
\mathbf{a}^{m+1} = \mathbf{f}^{m+1}\left( \mathbf{W}^{m+1} \mathbf{a}^m + \mathbf{b}^{m+1} \right), \qquad (3.5)
\]
where $\mathbf{a}^m$ and $\mathbf{a}^{m+1}$ are the outputs of the $m$th and $(m+1)$th layers of the network, respectively, and $\mathbf{b}^{m+1}$ is the bias vector of the $(m+1)$th layer. Here $m = 0, 1, \ldots, M-1$, where $M$ is the number of layers of the neural network. The neurons of the first layer receive the inputs:
\[
\mathbf{a}^0 = \mathbf{p}. \qquad (3.6)
\]
Equation (3.6) provides the initial condition for Equation (3.5). The outputs of the neurons in the last layer are taken as the outputs of the overall network:
\[
\mathbf{a} = \mathbf{a}^M. \qquad (3.7)
\]
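A layer-by-layer sketch of Equations (3.5)-(3.7) follows, under our assumption that the weights, biases, and activation functions are stored as Python lists indexed by layer.

```python
def forward(p, weights, biases, activations):
    """Propagate an input through all M layers via Equation (3.5),
    starting from a^0 = p (Equation (3.6)).
    Returns [a^0, a^1, ..., a^M]; a^M is the network output, Equation (3.7)."""
    a = [p]
    for W, b, f in zip(weights, biases, activations):
        a.append(f(W @ a[-1] + b))  # a^{m+1} = f^{m+1}(W^{m+1} a^m + b^{m+1})
    return a
```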
The task is to train the network with associations between a specified set of input-output pairs $\{(\mathbf{p}_1, \mathbf{t}_1), (\mathbf{p}_2, \mathbf{t}_2), \ldots, (\mathbf{p}_Q, \mathbf{t}_Q)\}$, where $\mathbf{p}_q$ is an input to the network, and $\mathbf{t}_q$ is the corresponding target output. As each input is applied to the network, the network output is compared to the target.
The backpropagation algorithm uses the mean square error as the performance index, which is to be minimized by adjusting the network parameters, as shown in Equation (3.8):
\[
F(\mathbf{x}) = E\left[ \mathbf{e}^T \mathbf{e} \right] = E\left[ (\mathbf{t} - \mathbf{a})^T (\mathbf{t} - \mathbf{a}) \right], \qquad (3.8)
\]
where $\mathbf{x}$ is the vector of network weights and biases. Using the approximate steepest descent rule, the performance index $F(\mathbf{x})$ can be approximated by
\[
\hat{F}(\mathbf{x}) = (\mathbf{t}(k) - \mathbf{a}(k))^T (\mathbf{t}(k) - \mathbf{a}(k)) = \mathbf{e}^T(k) \, \mathbf{e}(k), \qquad (3.9)
\]
where the expectation of the squared error in Equation (3.8) has been replaced by the squared error at iteration step $k$.
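For reference, the approximate performance index of Equation (3.9) at a single iteration can be sketched as follows (naming is our own):

```python
def squared_error(t, a):
    """Approximate performance index of Equation (3.9): e^T e, with e = t - a."""
    e = t - a
    return float(e @ e)
```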
The steepest descent algorithm for the approximate mean square error is
\[
w^m_{i,j}(k+1) = w^m_{i,j}(k) - \alpha \frac{\partial \hat{F}}{\partial w^m_{i,j}}, \qquad (3.10)
\]
\[
b^m_i(k+1) = b^m_i(k) - \alpha \frac{\partial \hat{F}}{\partial b^m_i}, \qquad (3.11)
\]
where $\alpha$ is the learning rate.
Based on the chain rule, the derivatives in Equations (3.10) and (3.11) can be calculated as:
\[
\frac{\partial \hat{F}}{\partial w^m_{i,j}} = \frac{\partial \hat{F}}{\partial n^m_i} \frac{\partial n^m_i}{\partial w^m_{i,j}}, \qquad \frac{\partial \hat{F}}{\partial b^m_i} = \frac{\partial \hat{F}}{\partial n^m_i} \frac{\partial n^m_i}{\partial b^m_i}. \qquad (3.12)
\]
We now define $s^m_i$ as the sensitivity of $\hat{F}$ to changes in the $i$th element of the net input at layer $m$:
\[
s^m_i \equiv \frac{\partial \hat{F}}{\partial n^m_i}. \qquad (3.13)
\]
Since the net input to the $i$th neuron of layer $m$ is $n^m_i = \sum_{j} w^m_{i,j} a^{m-1}_j + b^m_i$, it follows that $\partial n^m_i / \partial w^m_{i,j} = a^{m-1}_j$ and $\partial n^m_i / \partial b^m_i = 1$. Using the defined sensitivity, the derivatives in Equation (3.12) can therefore be simplified as
\[
\frac{\partial \hat{F}}{\partial w^m_{i,j}} = s^m_i \, a^{m-1}_j, \qquad (3.14)
\]
\[
\frac{\partial \hat{F}}{\partial b^m_i} = s^m_i. \qquad (3.15)
\]
Then the approximate steepest descent algorithm can be rewritten in matrix form as:
\[
\mathbf{W}^m(k+1) = \mathbf{W}^m(k) - \alpha \, \mathbf{s}^m \left( \mathbf{a}^{m-1} \right)^T, \qquad (3.16)
\]
\[
\mathbf{b}^m(k+1) = \mathbf{b}^m(k) - \alpha \, \mathbf{s}^m, \qquad (3.17)
\]
where
\[
\mathbf{s}^m \equiv \frac{\partial \hat{F}}{\partial \mathbf{n}^m} = \left[ \frac{\partial \hat{F}}{\partial n^m_1}, \frac{\partial \hat{F}}{\partial n^m_2}, \ldots, \frac{\partial \hat{F}}{\partial n^m_{S^m}} \right]^T. \qquad (3.18)
\]
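Given the sensitivity vector $\mathbf{s}^m$ and the previous layer's output $\mathbf{a}^{m-1}$, the matrix-form update of Equations (3.16)-(3.17) can be sketched as follows (function name and argument order are our own):

```python
import numpy as np

def steepest_descent_step(W, b, s, a_prev, alpha):
    """Apply Equations (3.16)-(3.17) to the parameters of layer m.
    s: (S_m,) sensitivity s^m; a_prev: (S_{m-1},) output a^{m-1}."""
    W_new = W - alpha * np.outer(s, a_prev)  # W^m(k+1) = W^m(k) - alpha s^m (a^{m-1})^T
    b_new = b - alpha * s                    # b^m(k+1) = b^m(k) - alpha s^m
    return W_new, b_new
```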
To derive the recurrence relationship for the sensitivities, the following Jacobian matrix is utilized:
\[
\frac{\partial \mathbf{n}^{m+1}}{\partial \mathbf{n}^m} =
\begin{bmatrix}
\dfrac{\partial n^{m+1}_1}{\partial n^m_1} & \dfrac{\partial n^{m+1}_1}{\partial n^m_2} & \cdots & \dfrac{\partial n^{m+1}_1}{\partial n^m_{S^m}} \\
\dfrac{\partial n^{m+1}_2}{\partial n^m_1} & \dfrac{\partial n^{m+1}_2}{\partial n^m_2} & \cdots & \dfrac{\partial n^{m+1}_2}{\partial n^m_{S^m}} \\
\vdots & \vdots & \ddots & \vdots \\
\dfrac{\partial n^{m+1}_{S^{m+1}}}{\partial n^m_1} & \dfrac{\partial n^{m+1}_{S^{m+1}}}{\partial n^m_2} & \cdots & \dfrac{\partial n^{m+1}_{S^{m+1}}}{\partial n^m_{S^m}}
\end{bmatrix}. \qquad (3.19)
\]
Consider the $(i, j)$ element of this matrix:
\[
\frac{\partial n^{m+1}_i}{\partial n^m_j} = w^{m+1}_{i,j} \frac{\partial a^m_j}{\partial n^m_j} = w^{m+1}_{i,j} \, \dot{f}^m\left( n^m_j \right). \qquad (3.20)
\]
Thus, the Jacobian matrix can be rewritten as
\[
\frac{\partial \mathbf{n}^{m+1}}{\partial \mathbf{n}^m} = \mathbf{W}^{m+1} \dot{\mathbf{F}}^m\left( \mathbf{n}^m \right), \qquad (3.21)
\]
where
\[
\dot{\mathbf{F}}^m\left( \mathbf{n}^m \right) =
\begin{bmatrix}
\dot{f}^m\left( n^m_1 \right) & 0 & \cdots & 0 \\
0 & \dot{f}^m\left( n^m_2 \right) & \cdots & 0 \\
\vdots & \vdots & \ddots & \vdots \\
0 & 0 & \cdots & \dot{f}^m\left( n^m_{S^m} \right)
\end{bmatrix}. \qquad (3.22)
\]
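For a log-sigmoid layer, the diagonal matrix of Equation (3.22) can be formed directly from the layer output, since $\dot{f}(n) = a(1-a)$; a minimal sketch with our own naming:

```python
import numpy as np

def f_dot_matrix(a_m):
    """Equation (3.22) for a log-sigmoid layer: diag of f'(n^m_j),
    computed from the layer output a^m because f'(n) = a (1 - a)."""
    return np.diag(a_m * (1.0 - a_m))
```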
Then the recurrence relation for the sensitivity can be obtained by using the chain rule:
\[
\mathbf{s}^m = \frac{\partial \hat{F}}{\partial \mathbf{n}^m} = \left( \frac{\partial \mathbf{n}^{m+1}}{\partial \mathbf{n}^m} \right)^T \frac{\partial \hat{F}}{\partial \mathbf{n}^{m+1}} = \dot{\mathbf{F}}^m\left( \mathbf{n}^m \right) \left( \mathbf{W}^{m+1} \right)^T \mathbf{s}^{m+1}. \qquad (3.23)
\]
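Equation (3.23) propagates the sensitivities backward, layer by layer, from the output toward the input. The excerpt above does not show the starting point of the recurrence, so the sketch below assumes the standard initialization obtained by differentiating Equation (3.9) at the last layer, $\mathbf{s}^M = -2 \dot{\mathbf{F}}^M(\mathbf{n}^M)(\mathbf{t} - \mathbf{a}^M)$; all names are our own, and multiplication by the diagonal $\dot{\mathbf{F}}^m$ is performed elementwise rather than with an explicit matrix.

```python
def backward_sensitivities(weights, a, t):
    """Backward recurrence of Equation (3.23) for log-sigmoid layers.
    weights: [W^1, ..., W^M]; a: [a^0, ..., a^M] from forward(); t: target."""
    M = len(weights)
    # Assumed starting point from differentiating Eq. (3.9):
    # s^M = -2 * Fdot^M(n^M) * (t - a^M)
    s = [-2.0 * logsig_deriv(a[M]) * (t - a[M])]
    # Eq. (3.23): s^m = Fdot^m(n^m) (W^{m+1})^T s^{m+1}, for m = M-1, ..., 1
    for m in range(M - 1, 0, -1):
        s.insert(0, logsig_deriv(a[m]) * (weights[m].T @ s[0]))
    return s  # [s^1, ..., s^M]; with Eqs. (3.16)-(3.17) these give all parameter updates
```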