6.4.5 Environment Sensing

Measurements made in the environment can be used to improve the estimate of the state X (e.g., the location) in this environment. Imagine that we are sleepwalking around the house in the middle of the night. When we wake up, we can figure out where we are by using our senses (sight, touch, etc.).

Mathematically, the initial knowledge about the environment can be described with a probability distribution p(x) (the prior). This distribution can be improved when a new measurement z (with likelihood p(z|x)) becomes available, by determining the probability after the measurement, p(x|z). This is achieved using the Bayes rule

$$p(x|z) = bel(x) = \frac{p(z|x)\,p(x)}{p(z)}$$

The probability p(z|x) represents the statistical model of the sensor, and bel(x) is the state-estimate belief after the measurement is made. The process of perception therefore evaluates the correction step of the Bayesian filter.
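As a minimal sketch (the five-state vectors below are illustrative, not taken from the text), the correction step amounts to an element-wise product of likelihood and prior, followed by normalization with p(z):

% Discrete Bayes correction step: posterior is likelihood.*prior, normalized
prior = [0.2 0.2 0.2 0.2 0.2];       % p(x), uniform prior over five states
likelihood = [0.2 0.2 0.6 0.6 0.2];  % p(z|x), sensor model for measurement z
posterior = likelihood.*prior;       % numerator p(z|x)p(x)
posterior = posterior/sum(posterior) % normalization by p(z)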

Example 6.9

Considering Example 6.8, assume that the mobile system detects a dark cell, Z = dark. The mobile system detects a dark cell correctly with probability 0.6 and makes a mistake with probability 0.2 (it detects a bright cell as dark). Hence,

$$p(Z=dark|X=x_d)=0.6, \quad d\in\{3,4\}$$
$$p(Z=dark|X=x_b)=0.2, \quad b\in\{1,2,5\}$$

where the index b denotes bright cells and the index d denotes dark cells. In the beginning, the mobile system does not know its position. This can be described with a uniform probability distribution P(X = x_i) = bel(x_i) = 0.2, i ∈ {1, …, 5}. Calculate the location probability distribution after a single measurement.

Solution

We would like to determine the conditional probability distribution p(X_1|Z = dark), the state-estimate belief after the measurement is made. The desired probability can be determined using the correction step of the Bayesian filter:

$$p(X_1|Z=dark) = \frac{p(Z=dark|X_1)\ast p(X_1)}{P(Z=dark)} = \frac{[0.2,0.2,0.6,0.6,0.2]^T \ast [0.2,0.2,0.2,0.2,0.2]^T}{P(Z=dark)} = \frac{[0.04,0.04,0.12,0.12,0.04]^T}{P(Z=dark)}$$

where the operator ∗ represents element-wise multiplication of vector elements. We still need to calculate the probability of detecting a dark cell, P(Z = dark). For this the total probability must be evaluated, that is, the probability of detecting a dark cell considering all the cells:

$$P(Z=dark) = \sum_i P(Z=dark|X_1=x_i)\,P(X_1=x_i) = p^T(Z=dark|X_1)\,p(X_1) = [0.2,0.2,0.6,0.6,0.2]\,[0.2,0.2,0.2,0.2,0.2]^T = 0.36$$

The posterior probability distribution is therefore the following:

$$p(X_1|Z=dark) = [0.11, 0.11, 0.33, 0.33, 0.11]^T$$

Hence, we can conclude that the mobile system is three times more likely to be in cell 3 or cell 4 than in any of the remaining three cells. The probability distributions are also shown graphically in Fig. 6.10. The solution of this example is also given in Listing 6.4.

Fig. 6.10 Probability distributions from Example 6.9.

Listing 6.4

Implementation of the solution of Example 6.9

disp('Sensor measurement distribution p(Z=dark|X)')
p_ZdX = [0.2 0.2 0.6 0.6 0.2]
disp('Sensor measurement distribution p(Z=bright|X)')
p_ZbX = 1 - p_ZdX
disp('Initial distribution p(X)')
p_X = ones(1,5)/5

disp('Probability of detecting a dark cell P(Z=dark)')
P_Zd = p_ZdX*p_X.'

disp('Posterior distribution p(X|Z=dark)')
p_XZd = p_ZdX.*p_X/P_Zd

Sensor measurement distribution p(Z=dark|X)
p_ZdX =
    0.2000    0.2000    0.6000    0.6000    0.2000

Sensor measurement distribution p(Z=bright|X)
p_ZbX =
    0.8000    0.8000    0.4000    0.4000    0.8000

Initial distribution p(X)
p_X =
    0.2000    0.2000    0.2000    0.2000    0.2000

Probability of detecting a dark cell P(Z=dark)
P_Zd =
    0.3600

Posterior distribution p(X|Z=dark)
p_XZd =
    0.1111    0.1111    0.3333    0.3333    0.1111

Example 6.10

Answer the following questions about Example 6.9:

1. Can multiple measurements improve the estimated mobile system position (the mobile system does not move between the measurements)?

2. What is the probability distribution of the mobile system position if the mobile system detects a tile as dark twice in a row?

3. What is the probability distribution of the mobile system position if the mobile system detects a tile as dark and then as bright?

4. What is the probability distribution of the mobile system position if the mobile system detects a tile as dark, then as bright, and then again as dark?

Solution

1. Multiple measurements can improve the estimate of the mobile system position if the probability of a correct measurement is higher than the probability of a measurement mistake.

2. The probability distribution if the sensor detects a cell as dark twice in a row is the following:

$$p(X_2|Z_1=dark,Z_2=dark) = p(X_2|z_1,z_2) = \frac{p(z_2|X_2)\ast p(X_2|z_1)}{P(z_2|z_1)} = \frac{[0.2,0.2,0.6,0.6,0.2]^T\ast[0.11,0.11,0.33,0.33,0.11]^T}{P(z_2|z_1)}$$

where the jth element of the probability distribution p(X_2|z_1) is given by P(X_2 = x_j|z_1) = p^T(X_2 = x_j|X_1)p(X_1|z_1) = P(X_1 = x_j|z_1), since we have no influence on the states (see Example 6.5); we only observe the states with measurements. The conditional probability in the denominator is

$$P(z_2|z_1) = \sum_{x_i} P(z_2|X_2=x_i)\,P(X_2=x_i|z_1) = p^T(z_2|X_2)\,p(X_2|z_1) = [0.2,0.2,0.6,0.6,0.2]\,[0.11,0.11,0.33,0.33,0.11]^T = 0.4667$$

Finally, the solution is

$$p(X_2|z_1,z_2) = \frac{[0.2,0.2,0.6,0.6,0.2]^T\ast[0.11,0.11,0.33,0.33,0.11]^T}{[0.2,0.2,0.6,0.6,0.2]\,[0.11,0.11,0.33,0.33,0.11]^T} = [0.0476,0.0476,0.4286,0.4286,0.0476]^T$$

3. A bright cell is detected correctly with probability p(Z = bright|X = bright) = 1 − p(Z = dark|X = bright) = 0.8 and incorrectly with probability p(Z = bright|X = dark) = 1 − p(Z = dark|X = dark) = 0.4. The second measurement update is based on the probability distribution p(X_2|Z_1 = dark):

$$p(X_2|Z_1=dark,Z_2=bright) = p(X_2|z_1,z_2) = \frac{[0.8,0.8,0.4,0.4,0.8]^T\ast[0.11,0.11,0.33,0.33,0.11]^T}{P(Z_2=bright|Z_1=dark)}$$

$$P(Z_2=bright|Z_1=dark) = \sum_{x_i} P(Z_2=bright|X_2=x_i)\,P(X_2=x_i|Z_1=dark) = p^T(Z_2=bright|X_2)\,p(X_2|Z_1=dark) = [0.8,0.8,0.4,0.4,0.8]\,[0.11,0.11,0.33,0.33,0.11]^T = 0.533$$

$$p(X_2|Z_1=dark,Z_2=bright) = [0.167,0.167,0.25,0.25,0.167]^T$$

4. The state probability distribution after three measurements is

$$p(X_3|Z_1=dark,Z_2=bright,Z_3=dark) = [0.083,0.083,0.375,0.375,0.083]^T$$

The implementation of the solution in Matlab is shown in Listing 6.5. The posterior state probability distributions for three time steps are presented graphically in Fig. 6.11.

Fig. 6.11 Posterior state probability distributions in three time steps from the fourth case in Example 6.10.

Listing 6.5

Implementation of the solution of Example 6.10

p_ZdX = [0.2 0.2 0.6 0.6 0.2];
p_ZbX = 1 - p_ZdX;
p_X = ones(1,5)/5;

disp('Probability of detecting a dark tile P(Z1=dark)')
P_z1 = p_ZdX*p_X.'
disp('Posterior distribution p(X1|Z1=dark)')
p_Xz1 = p_ZdX.*p_X/P_z1

disp('Probability of detecting a bright tile P(Z2=bright|Z1=dark)')
P_z2 = p_ZbX*p_Xz1.'
disp('Posterior distribution p(X2|Z1=dark,Z2=bright)')
p_Xz2 = p_ZbX.*p_Xz1/P_z2

disp('Probability of detecting a dark tile P(Z3=dark|Z1=dark,Z2=bright)')
P_z3 = p_ZdX*p_Xz2.'
disp('Posterior distribution p(X3|Z1=dark,Z2=bright,Z3=dark)')
p_Xz3 = p_ZdX.*p_Xz2/P_z3

Probability of detecting a dark tile P(Z1=dark)
P_z1 =
    0.3600

Posterior distribution p(X1|Z1=dark)
p_Xz1 =
    0.1111    0.1111    0.3333    0.3333    0.1111

Probability of detecting a bright tile P(Z2=bright|Z1=dark)
P_z2 =
    0.5333

Posterior distribution p(X2|Z1=dark,Z2=bright)
p_Xz2 =
    0.1667    0.1667    0.2500    0.2500    0.1667

Probability of detecting a dark tile P(Z3=dark|Z1=dark,Z2=bright)
P_z3 =
    0.4000

Posterior distribution p(X3|Z1=dark,Z2=bright,Z3=dark)
p_Xz3 =
    0.0833    0.0833    0.3750    0.3750    0.0833

6.4.6 Motion in the Environment

Mobile systems move through the environment using actuators (e.g., motorized wheels) and a control system. Every movement carries a smaller or larger uncertainty; therefore, the movement of the mobile system through the environment increases the uncertainty about the mobile system state (pose) in the environment.

Consider that we are standing in a well-known environment. We close our eyes and make several steps. After a few steps we still know approximately where we are, since we know how big our steps are and in which direction they were made; therefore, we can imagine where we are. However, the lengths of our steps are not known precisely, and the directions of the steps are also hard to estimate; therefore, our knowledge about our pose in space degrades as more and more steps are made.

In the case of movement without observing the states through measurements, relation (6.28) can be reformulated:

$$p(x_k|u_{0:k-1}) = \int_{-\infty}^{+\infty} p(x_k|x_{k-1},u_{k-1})\,p(x_{k-1}|u_{0:k-2})\,dx_{k-1}$$

The belief in the new state p(x_k|u_{0:k-1}) depends on the belief at the previous time step p(x_{k-1}|u_{0:k-2}) and the conditional state transition probability p(x_k|x_{k-1}, u_{k-1}). The probability distribution p(x_k|u_{0:k-1}) can be determined by integration (or summation in the discrete case) over all possible state transition probabilities p(x_k|x_{k-1}, u_{k-1}) from previous states x_{k-1} into the state x_k, given the known action u_{k-1}.
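In the discrete case this prediction step can be written compactly with a transition matrix; here is a minimal sketch (the cyclic five-cell model and the probabilities 0.8/0.1/0.1 match the movement model of Example 6.11 below, while the variable names are illustrative):

% Discrete Bayes prediction step: belp = T*bel, where element T(i,j)
% holds P(X_k = x_i | X_{k-1} = x_j, u) for the executed action u.
n = 5; u = 1;                       % five cells in a circle, move one cell
bel = [1 0 0 0 0].';                % belief at time k-1 (example values)
T = zeros(n);
for j = 1:n                         % build the cyclic transition matrix
    T(mod(j-1+u,   n)+1, j) = 0.8;  % correct move
    T(mod(j-1+u-1, n)+1, j) = 0.1;  % one cell too short
    T(mod(j-1+u+1, n)+1, j) = 0.1;  % one cell too long
end
belp = T*bel                        % predicted belief p(x_k|u)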

Example 6.11

Consider again Example 6.8 and assume that the initial position of the mobile system is the first cell (X_0 = x_1). The initial state can be written as the probability distribution p(X_0) = [1, 0, 0, 0, 0]. The mobile system can move between cells; the outcome of a movement action is correct in 80% of cases, in 10% of cases the movement is one cell shorter than required, and in 10% of cases the mobile system moves one cell farther than required. This can be described with the following state transition probabilities:

$$P(X_k=x_i|X_{k-1}=x_j,U_{k-1}=u) = 0.8; \quad i=j+u$$
$$P(X_k=x_i|X_{k-1}=x_j,U_{k-1}=u) = 0.1; \quad i=j+u-1$$
$$P(X_k=x_i|X_{k-1}=x_j,U_{k-1}=u) = 0.1; \quad i=j+u+1$$

where the cell indices wrap around the circle (they are taken modulo 5). The mobile system has to make a movement for two cells in the counter-clockwise direction (U_0 = 2). Determine the mobile system position belief after the movement is made.

Solution

The probability distribution (belief) after the movement can be determined by calculating the probability of the mobile system ending up in each cell (total probability). The mobile system can arrive in the first cell only from cell 3 (a move one cell too long), cell 4 (a correct move), or cell 5 (a move one cell too short). This yields the probability distribution of the transition into the first cell p(X_1 = x_1|X_0, U_0 = 2) = [0, 0, 0.1, 0.8, 0.1]^T. After the movement the mobile system is in the first cell with probability

$$P(X_1=x_1|U_0=2) = \sum_{x_i} P(X_1=x_1|X_0=x_i,U_0=2)\,P(X_0=x_i) = p^T(X_1=x_1|X_0,U_0=2)\,p(X_0) = [0,0,0.1,0.8,0.1]\,[1,0,0,0,0]^T = 0$$

The probability that the mobile system is in the second cell after the movement is

$$P(X_1=x_2|U_0=2) = \sum_{x_i} P(X_1=x_2|X_0=x_i,U_0=2)\,P(X_0=x_i) = p^T(X_1=x_2|X_0,U_0=2)\,p(X_0) = [0.1,0,0,0.1,0.8]\,[1,0,0,0,0]^T = 0.1$$

Similarly, the probabilities of all the other cells can be calculated:

$$P(X_1=x_3|U_0=2) = [0.8,0.1,0,0,0.1]\,[1,0,0,0,0]^T = 0.8$$
$$P(X_1=x_4|U_0=2) = [0.1,0.8,0.1,0,0]\,[1,0,0,0,0]^T = 0.1$$
$$P(X_1=x_5|U_0=2) = [0,0.1,0.8,0.1,0]\,[1,0,0,0,0]^T = 0$$

The mobile system position belief after the movement is therefore

$$p(X_1|U_0=2) = [0,0.1,0.8,0.1,0]^T$$

The posterior state probability distributions are presented graphically in Fig. 6.12. The implementation of the solution in Matlab is shown in Listing 6.6.

Fig. 6.12 Posterior state probability distributions in two time steps from Example 6.11.

Listing 6.6

Implementation of the solution of Example 6.11

disp('Initial belief p(X0)')
p_X0 = [1 0 0 0 0]

P_xxu_null = 0.8; % P(X=i|X'=j,U'=u), i = j+u
P_xxu_less = 0.1; % P(X=i|X'=j,U'=u), i = j+u-1
P_xxu_more = 0.1; % P(X=i|X'=j,U'=u), i = j+u+1

disp('Belief p(X1|U0=2)');
p_xXu = [0 0 P_xxu_more P_xxu_null P_xxu_less]; % for U = 2
p_Xu = zeros(1,5);
for i = 1:5
    p_Xu(i) = p_xXu*p_X0.';
    p_xXu = p_xXu([end 1:end-1]);
end
p_X1 = p_Xu

Initial belief p(X0)
p_X0 =
     1     0     0     0     0

Belief p(X1|U0=2)
p_X1 =
         0    0.1000    0.8000    0.1000         0

Example 6.12

What is the belief of the mobile system position if, after the movement made in Example 6.11, the mobile system makes another move in the counter-clockwise direction, but this time only for a single cell (U_1 = 1)?

Solution

The probability distribution (belief) after the movement can again be determined by calculating, for every cell, the probability that the mobile system is in that cell (total probability). In the case of a movement for a single cell, the first cell can be reached from cell 1 (a too short move), cell 4 (a too long move), and cell 5 (a correct move). The mobile system can arrive at the second cell from cells 1, 2, and 5, and so on. After the movement is made, the following probabilities can be calculated:

$$P(X_2=x_1|U_0=2,U_1=1) = [0.1,0,0,0.1,0.8]\,[0,0.1,0.8,0.1,0]^T = 0.01$$
$$P(X_2=x_2|U_0=2,U_1=1) = [0.8,0.1,0,0,0.1]\,[0,0.1,0.8,0.1,0]^T = 0.01$$
$$P(X_2=x_3|U_0=2,U_1=1) = [0.1,0.8,0.1,0,0]\,[0,0.1,0.8,0.1,0]^T = 0.16$$
$$P(X_2=x_4|U_0=2,U_1=1) = [0,0.1,0.8,0.1,0]\,[0,0.1,0.8,0.1,0]^T = 0.66$$
$$P(X_2=x_5|U_0=2,U_1=1) = [0,0,0.1,0.8,0.1]\,[0,0.1,0.8,0.1,0]^T = 0.16$$

The position belief after the second movement is

$$p(X_2|U_0=2,U_1=1) = [0.01,0.01,0.16,0.66,0.16]^T$$

and it is also shown at the bottom of Fig. 6.13. Note that the mobile system is most probably in cell 4. However, the probability distribution does not have as significant a peak as before the second movement action was made (compare the middle and the bottom probability distributions in Fig. 6.13; the peak dropped from 80% to 66%). This observation is in accordance with the statement that every movement (action) increases the level of uncertainty about the states of the environment.

Fig. 6.13 Posterior state probability distributions in three time steps from Example 6.12.

The implementation of the solution in Matlab is shown in Listing 6.7.

Listing 6.7

Implementation of the solution of Example 6.12

disp('Initial belief p(X0)')
p_X0 = [1 0 0 0 0]

P_xxu_null = 0.8; % P(X=i|X'=j,U'=u), i = j+u
P_xxu_less = 0.1; % P(X=i|X'=j,U'=u), i = j+u-1
P_xxu_more = 0.1; % P(X=i|X'=j,U'=u), i = j+u+1

disp('Belief p(X1|U0=2)');
p_xXu = [0 0 P_xxu_more P_xxu_null P_xxu_less]; % for U = 2
p_Xu = zeros(1,5);
for i = 1:5
    p_Xu(i) = p_xXu*p_X0.';
    p_xXu = p_xXu([end 1:end-1]);
end
p_X1 = p_Xu

disp('Belief p(X2|U1=1)');
p_xXu = [P_xxu_less 0 0 P_xxu_more P_xxu_null]; % for U = 1
p_Xu = zeros(1,5);
for i = 1:5
    p_Xu(i) = p_xXu*p_X1.';
    p_xXu = p_xXu([end 1:end-1]);
end
p_X2 = p_Xu

Initial belief p(X0)
p_X0 =
     1     0     0     0     0

Belief p(X1|U0=2)
p_X1 =
         0    0.1000    0.8000    0.1000         0

Belief p(X2|U1=1)
p_X2 =
    0.0100    0.0100    0.1600    0.6600    0.1600

Example 6.13

Consider that the mobile system from Example 6.11 is initially in the first cell, p(X_0) = [1, 0, 0, 0, 0]. In every time step the mobile system makes one step in the counter-clockwise direction.

1. What is the belief about the mobile system position after 10 time steps?

2. To which value does the belief converge after an infinite number of time steps?

Solution

1. The state belief after 10 time steps is

$$p(X_{10}|U_{0:9}) = [0.29,0.22,0.13,0.13,0.22]^T$$

2. After an infinite number of time steps a uniform distribution is obtained, since all the cells become equally likely to be occupied by the mobile system:

$$p(X_\infty|U_{0:\infty}) = [0.2,0.2,0.2,0.2,0.2]^T$$

These results were also validated in Matlab (Listing 6.8) and are shown graphically in Fig. 6.14.

Fig. 6.14 Posterior state probability distributions in three time steps from Example 6.13.

Listing 6.8

Implementation of the solution of Example 6.13

disp('Initial belief p(X0)')
p_X0 = [1 0 0 0 0]

P_xxu_null = 0.8; % P(X=i|X'=j,U'=u), i = j+u
P_xxu_less = 0.1; % P(X=i|X'=j,U'=u), i = j+u-1
P_xxu_more = 0.1; % P(X=i|X'=j,U'=u), i = j+u+1

p_X = p_X0;
for k = 1:1000
    p_xXu = [P_xxu_less 0 0 P_xxu_more P_xxu_null]; % for U = 1
    p_Xu = zeros(1,5);
    for i = 1:5
        p_Xu(i) = p_xXu*p_X.';
        p_xXu = p_xXu([end 1:end-1]);
    end
    p_X = p_Xu;
    if k==10
        disp('Belief p(X10|U9=1)');
        p_X10 = p_X
    elseif k==1000
        disp('Belief p(X1000|U999=1)');
        p_X1000 = p_X
    end
end

Initial belief p(X0)
p_X0 =
     1     0     0     0     0

Belief p(X10|U9=1)
p_X10 =
    0.2949    0.2243    0.1283    0.1283    0.2243

Belief p(X1000|U999=1)
p_X1000 =
    0.2000    0.2000    0.2000    0.2000    0.2000

6.4.7 Localization in the Environment

The mobile system can estimate its location in the environment even if it does not know its initial location, provided it has a map of the environment. The knowledge about the mobile system location is represented with a probability distribution. The process of determining the location in the environment is known as localization. Localization combines the process of observation (measurement) and action (movement). As already mentioned, measurements made in the environment increase the knowledge about the location, while the movement of the mobile system through the environment decreases it.

Localization is a process in which the mobile system repeatedly updates the probability distribution that represents the knowledge about the mobile system’s location in the environment. The peak in the probability distribution (if it exists) represents the most probable mobile system location.

The localization process is a realization of the Bayesian filter (Algorithm 3), which combines the processes of movement and perception.

Example 6.14

Consider the mobile system that moves around the environment shown in Example 6.8. The mobile system first makes a move and then observes the environment. The initial pose of the mobile system is not known. This can be described with a uniform probability distribution p(X_0) = bel(X_0) = [0.2, 0.2, 0.2, 0.2, 0.2].

The movement action for u_{k−1} cells in the counter-clockwise direction is accurate in 80% of cases; in 10% of cases the movement is either a cell shorter or a cell longer than required:

$$p(X_k=x_i|X_{k-1}=x_j,U_{k-1}=u_{k-1}) = 0.8; \quad i=j+u_{k-1}$$
$$p(X_k=x_i|X_{k-1}=x_j,U_{k-1}=u_{k-1}) = 0.1; \quad i=j+u_{k-1}-1$$
$$p(X_k=x_i|X_{k-1}=x_j,U_{k-1}=u_{k-1}) = 0.1; \quad i=j+u_{k-1}+1$$

The mobile system detects a dark cell correctly with probability 0.6, and it detects a bright cell correctly with probability 0.8. This can be written in mathematical form as

$$P(Z=dark|X=dark)=0.6, \quad P(Z=bright|X=dark)=0.4$$
$$P(Z=bright|X=bright)=0.8, \quad P(Z=dark|X=bright)=0.2$$

In every time step, the mobile system receives a command to move for a single cell in the counter-clockwise direction (u_{k−1} = 1). The sequence of the first three measurements is z_{1:3} = (bright, dark, dark).

1. What is the belief in the first time step k = 1?

2. What is the belief in the second time step k = 2?

3. What is the belief in the third time step k = 3?

4. In which cell is the mobile system most likely after the third step?

Solution

After every movement the prediction step of the Bayesian filter (Algorithm 3) is evaluated, and after every measurement the correction step.

1. The prediction step is evaluated based on the known movement action. The small symbol x_i, i ∈ {1, …, 5}, denotes that the location (state) of the mobile system is cell i, and the big symbol X_k denotes the vector of all possible states in time step k.

$$bel_p(X_1=x_1) = \sum_{x_i} P(X_1=x_1|X_0=x_i,u_0)\,bel(X_0=x_i) = p^T(X_1=x_1|X_0,u_0)\,bel(X_0) = [0.1,0,0,0.1,0.8]\,[0.2,0.2,0.2,0.2,0.2]^T = 0.2$$
$$bel_p(X_1=x_2) = [0.8,0.1,0,0,0.1]\,[0.2,0.2,0.2,0.2,0.2]^T = 0.2$$
$$bel_p(X_1=x_3) = [0.1,0.8,0.1,0,0]\,[0.2,0.2,0.2,0.2,0.2]^T = 0.2$$
$$bel_p(X_1=x_4) = [0,0.1,0.8,0.1,0]\,[0.2,0.2,0.2,0.2,0.2]^T = 0.2$$
$$bel_p(X_1=x_5) = [0,0,0.1,0.8,0.1]\,[0.2,0.2,0.2,0.2,0.2]^T = 0.2$$

Therefore, the complete probability distribution (belief) of the prediction step is

$$bel_p(X_1) = [0.2,0.2,0.2,0.2,0.2]^T$$

After the measurement is obtained the correction step of the Bayesian filter is evaluated:

$$bel(X_1=x_1) = \eta\,p(Z_1=bright|x_1)\,bel_p(X_1=x_1) = \eta\cdot 0.8\cdot 0.2 = 0.16\,\eta$$
$$bel(X_1=x_2) = \eta\,p(Z_1=bright|x_2)\,bel_p(X_1=x_2) = \eta\cdot 0.8\cdot 0.2 = 0.16\,\eta$$
$$bel(X_1=x_3) = \eta\,p(Z_1=bright|x_3)\,bel_p(X_1=x_3) = \eta\cdot 0.4\cdot 0.2 = 0.08\,\eta$$
$$bel(X_1=x_4) = \eta\,p(Z_1=bright|x_4)\,bel_p(X_1=x_4) = \eta\cdot 0.4\cdot 0.2 = 0.08\,\eta$$
$$bel(X_1=x_5) = \eta\,p(Z_1=bright|x_5)\,bel_p(X_1=x_5) = \eta\cdot 0.8\cdot 0.2 = 0.16\,\eta$$

After considering the normalization factor,

$$\eta = \frac{1}{0.16+0.16+0.08+0.08+0.16} = 1.56$$

the updated probability distribution (belief) is obtained:

$$bel(X_1) = [0.25,0.25,0.125,0.125,0.25]^T$$

The same result can be obtained from the following:

$$bel(X_1) = \frac{p^T(Z_1=bright|X_1)\ast bel_p^T(X_1)}{p^T(Z_1=bright|X_1)\,bel_p(X_1)} = \frac{[0.8,0.8,0.4,0.4,0.8]\ast[0.2,0.2,0.2,0.2,0.2]}{[0.8,0.8,0.4,0.4,0.8]\,[0.2,0.2,0.2,0.2,0.2]^T} = [0.25,0.25,0.125,0.125,0.25]^T$$

2. The procedure from the first case can be repeated on the last result to obtain the state belief in time step k = 2. First, the prediction step is evaluated:

$$bel_p(X_2=x_1) = [0.1,0,0,0.1,0.8]\,[0.25,0.25,0.125,0.125,0.25]^T = 0.237$$
$$bel_p(X_2=x_2) = [0.8,0.1,0,0,0.1]\,[0.25,0.25,0.125,0.125,0.25]^T = 0.25$$
$$bel_p(X_2=x_3) = [0.1,0.8,0.1,0,0]\,[0.25,0.25,0.125,0.125,0.25]^T = 0.237$$
$$bel_p(X_2=x_4) = [0,0.1,0.8,0.1,0]\,[0.25,0.25,0.125,0.125,0.25]^T = 0.138$$
$$bel_p(X_2=x_5) = [0,0,0.1,0.8,0.1]\,[0.25,0.25,0.125,0.125,0.25]^T = 0.138$$

The complete probability distribution of prediction is

$$bel_p(X_2) = [0.237,0.25,0.237,0.138,0.138]^T$$

The correction step yields

$$bel(X_2) = \frac{[0.2,0.2,0.6,0.6,0.2]^T\ast[0.237,0.25,0.237,0.138,0.138]^T}{[0.2,0.2,0.6,0.6,0.2]\,[0.237,0.25,0.237,0.138,0.138]^T} = [0.136,0.143,0.407,0.236,0.079]^T$$

3. Similarly to the previous two cases, the belief distribution can be obtained for time step k = 3:

$$bel_p(X_3) = [0.1,0.131,0.167,0.363,0.237]^T$$
$$bel(X_3) = [0.048,0.063,0.245,0.528,0.115]^T$$

4. After the third time step, the mobile system is most likely in the fourth cell, with probability 52.8%. The second most likely cell is the third cell, with probability 24.5%.

The state beliefs for all three time steps are presented graphically in Fig. 6.15. The implementation of the solution in Matlab is shown in Listing 6.9.

Fig. 6.15 Posterior state probability distributions in three time steps from Example 6.14.

Listing 6.9

Implementation of the solution of Example 6.14

disp('Initial belief p(X0)')
bel_X0 = ones(1,5)/5

P_xxu_null = 0.8; % P(X=i|X'=j,U'=u), i = j+u
P_xxu_less = 0.1; % P(X=i|X'=j,U'=u), i = j+u-1
P_xxu_more = 0.1; % P(X=i|X'=j,U'=u), i = j+u+1

p_ZdX = [0.2 0.2 0.6 0.6 0.2]; % p(Z=dark|X)
p_ZbX = 1 - p_ZdX;             % p(Z=bright|X)

bel_X = bel_X0;
for k = 1:3
    % Prediction step
    p_xXu = [P_xxu_less 0 0 P_xxu_more P_xxu_null]; % for U = 1
    belp_X = zeros(1,5);
    for i = 1:5
        belp_X(i) = p_xXu*bel_X.';
        p_xXu = p_xXu([end 1:end-1]);
    end

    % Correction step
    if k==1
        bel_X = p_ZbX.*belp_X;
    else
        bel_X = p_ZdX.*belp_X;
    end
    bel_X = bel_X/sum(bel_X);

    if k==1
        disp('beliefs belp_X1 and bel_X1')
        belp_X1 = belp_X
        bel_X1 = bel_X
    elseif k==2
        disp('beliefs belp_X2 and bel_X2')
        belp_X2 = belp_X
        bel_X2 = bel_X
    elseif k==3
        disp('beliefs belp_X3 and bel_X3')
        belp_X3 = belp_X
        bel_X3 = bel_X
        disp('Less likely to most likely position')
        [m, mi] = sort(bel_X)
    end
end

Initial belief p(X0)
bel_X0 =
    0.2000    0.2000    0.2000    0.2000    0.2000

beliefs belp_X1 and bel_X1
belp_X1 =
    0.2000    0.2000    0.2000    0.2000    0.2000
bel_X1 =
    0.2500    0.2500    0.1250    0.1250    0.2500

beliefs belp_X2 and bel_X2
belp_X2 =
    0.2375    0.2500    0.2375    0.1375    0.1375
bel_X2 =
    0.1357    0.1429    0.4071    0.2357    0.0786

beliefs belp_X3 and bel_X3
belp_X3 =
    0.1000    0.1307    0.1686    0.3636    0.2371
bel_X3 =
    0.0484    0.0633    0.2450    0.5284    0.1149

Less likely to most likely position
m =
    0.0484    0.0633    0.1149    0.2450    0.5284
mi =
     1     2     5     3     4

6.5 Kalman Filter

The Kalman filter [6] is one of the most important state estimation and prediction algorithms; it has been applied to a diverse range of applications in various engineering fields, and autonomous mobile systems are no exception. The Kalman filter is designed for state estimation of linear systems whose signals may be corrupted by noise. The algorithm has a typical two-step structure consisting of a prediction and a correction step that are evaluated in every time step. In the prediction step, the latest system state along with the state uncertainties is predicted. Once a new measurement is available, the correction step is evaluated: the stochastic measurement is combined with the predicted state estimate as a weighted average, in a way that gives less uncertain values a greater weight. The algorithm is recursive and allows online estimation of the current system state, taking into account system and measurement uncertainties.

A classical Kalman filter assumes normally distributed noises, that is, the probability distribution of noise is a Gaussian function:

$$p(x) = \frac{1}{\sqrt{2\pi\sigma^2}}\,e^{-\frac{1}{2}\frac{(x-\mu)^2}{\sigma^2}} \qquad (6.30)$$

where μ is the mean value (mathematical expectation) and σ² is the variance. The Gaussian function is unimodal (left and right of the single peak the function monotonically decreases toward zero), while more general distributions are normally multimodal (there are several local peaks). If the probability distributions of the continuous variables are assumed to be unimodal, the Kalman filter can be used for optimum state estimation. In the case that the variables are not all unimodal, the state estimation is suboptimal; furthermore, the convergence of the estimate to the true value is questionable. The Bayesian filter does not have the aforementioned problems, but its applicability is limited to simple continuous problems and to discrete problems with a finite, countable number of states.

In Fig. 6.16 an example of continuous probability distribution, which is not unimodal, is shown. The continuous distribution is approximated with a Gaussian function and with a histogram (domain is divided into discrete intervals). The approximation with a Gaussian function is used in the Kalman filter, and the histogram is used in the Bayesian filter.

Fig. 6.16 An example of probability distribution of a continuous variable x (solid line), approximation with Gaussian function (dashed line), and approximation with a histogram (dotted line).

The essence of the correction step (see Bayesian filter (6.29)) is the fusion of information from two independent sources, that is, sensor measurements and state predictions based on the previous state estimates. Let us use Example 6.15 to demonstrate how two independent estimates of the same variable x can be joined optimally if the value and the variance (belief) of each source are known.

Example 6.15

There are two independent estimates of the variable x. The value of the first estimate is x_1 with variance σ_1², and the value of the second estimate is x_2 with variance σ_2². What is the optimal linear combination of these two estimates that represents the state estimate x̂ with minimal variance?

Solution

The optimal estimate of the variable x is assumed to be a linear combination of the two measurements:

$$\hat{x} = \omega_1 x_1 + \omega_2 x_2$$

where the parameters ω_1 and ω_2 are unknown weights that satisfy the condition ω_1 + ω_2 = 1. The optimum values of the weights should minimize the variance σ² of the optimal estimate x̂. The variance is as follows:

$$\begin{aligned}
\sigma^2 &= E\{(\hat{x}-E\{\hat{x}\})^2\} = E\{(\omega_1 x_1+\omega_2 x_2 - E\{\omega_1 x_1+\omega_2 x_2\})^2\} \\
&= E\{(\omega_1 x_1+\omega_2 x_2-\omega_1 E\{x_1\}-\omega_2 E\{x_2\})^2\} \\
&= E\{(\omega_1(x_1-E\{x_1\})+\omega_2(x_2-E\{x_2\}))^2\} \\
&= \omega_1^2 E\{(x_1-E\{x_1\})^2\} + \omega_2^2 E\{(x_2-E\{x_2\})^2\} + 2\omega_1\omega_2 E\{(x_1-E\{x_1\})(x_2-E\{x_2\})\} \\
&= \omega_1^2\sigma_1^2 + \omega_2^2\sigma_2^2 + 2\omega_1\omega_2 E\{(x_1-E\{x_1\})(x_2-E\{x_2\})\}
\end{aligned}$$

Since the variables x_1 and x_2 are independent, the differences x_1 − E{x_1} and x_2 − E{x_2} are also independent, and therefore E{(x_1 − E{x_1})(x_2 − E{x_2})} = 0. Hence,

$$\sigma^2 = \omega_1^2\sigma_1^2 + \omega_2^2\sigma_2^2$$

or after introducing ω2 = ω and ω1 = 1 − ω,

$$\sigma^2 = (1-\omega)^2\sigma_1^2 + \omega^2\sigma_2^2$$

We are seeking the value of the weight ω that minimizes the variance; it is obtained by setting the derivative of the variance to zero:

$$\frac{\partial \sigma^2}{\partial \omega} = -2(1-\omega)\sigma_1^2 + 2\omega\sigma_2^2 = 0$$

which yields the solution

$$\omega = \frac{\sigma_1^2}{\sigma_1^2+\sigma_2^2}$$

The minimum-variance estimate is therefore

$$\hat{x} = \frac{\sigma_2^2 x_1 + \sigma_1^2 x_2}{\sigma_1^2+\sigma_2^2} \qquad (6.31)$$

and the minimum variance is

$$\sigma^2 = \frac{\sigma_1^2\sigma_2^2}{\sigma_1^2+\sigma_2^2} = \left(\frac{1}{\sigma_1^2}+\frac{1}{\sigma_2^2}\right)^{-1} \qquad (6.32)$$

The obtained results confirm that the source with the lower variance (higher belief) contributes more to the final estimate, and vice versa.

Example 6.16

At a particular moment in time, an initial state estimate x = 2 with variance σ² = 4 is given. Then a sensor is used to measure the value of the state; the measurement is z = 4 with sensor variance σ_z² = 1. The Gaussian probability distributions of the state and the measurement are shown in Fig. 6.17.

Fig. 6.17 Probability distribution of the state (dashed line) and the measurement (dash-dotted line).

What is the value of optimum state estimate that includes information from previous state estimate and current measurement? What is the probability distribution of the updated optimum state estimate?

Solution

Based on Fig. 6.17 we can anticipate that the mean value x′ of the updated state will be closer to the measurement mean value, since the measurement variance (uncertainty) is lower than the variance of the previous estimate. Using Eq. (6.31) the updated state estimate is obtained:

$$x' = \frac{\sigma_z^2 x + \sigma^2 z}{\sigma^2+\sigma_z^2} = 3.6$$

The variance of the updated estimate σ′² is lower than both previous variances, since the fusion of the previous estimate and the measurement information lowers the uncertainty of the updated estimate. The variance of the updated estimate is obtained from Eq. (6.32):

$$\sigma'^2 = \left(\frac{1}{\sigma^2}+\frac{1}{\sigma_z^2}\right)^{-1} = 0.8$$

and the standard deviation is

$$\sigma' = \sqrt{\sigma'^2} = 0.894$$

The updated probability distribution p(x|z) of the state after the measurement-based correction is shown in Fig. 6.18.

Fig. 6.18 Probability distribution of the initial state (dashed line), measurement (dash-dotted line), and updated state (solid line).
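The arithmetic of Eqs. (6.31), (6.32) can be quickly verified in Matlab; a minimal check using the numbers from Example 6.16 (variable names are illustrative):

x = 2;  s2 = 4;    % previous state estimate and its variance
z = 4;  sz2 = 1;   % measurement and measurement variance
xp = (sz2*x + s2*z)/(s2 + sz2)  % updated estimate, Eq. (6.31); yields 3.6
sp2 = 1/(1/s2 + 1/sz2)          % updated variance, Eq. (6.32); yields 0.8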

Let us use the findings from Example 6.15 in the derivation of a recursive state estimation algorithm. In every time step a new state measurement z(k) = x(k) + n(k) is obtained by the sensor, where n(k) is the measurement noise. The measurement variance σ_z²(k) is assumed to be known. The updated optimum state estimate is a combination of the previous estimate x̂(k) and the current measurement z(k), as follows:

$$\hat{x}(k+1) = (1-\omega)\,\hat{x}(k) + \omega\, z(k) = \hat{x}(k) + \omega\,(z(k)-\hat{x}(k))$$

The updated state variance is

$$\sigma^2(k+1) = \frac{\sigma^2(k)\,\sigma_z^2(k)}{\sigma^2(k)+\sigma_z^2(k)} = (1-\omega)\,\sigma^2(k)$$

where

$$\omega = \frac{\sigma^2(k)}{\sigma^2(k)+\sigma_z^2(k)}$$

Therefore, given a known initial state estimate x̂(0) and the corresponding variance σ²(0), the measurements z(1), z(2), … can be integrated optimally, in a way that the current state and the state variance are estimated. This is the basic idea behind the correction step of the Kalman filter.
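A minimal sketch of this recursive measurement fusion on synthetic data (all names and values here are illustrative, not from the text) shows the variance shrinking with every new measurement:

% Recursive fusion of repeated measurements of a constant state
% (correction step only; the state does not change between measurements).
x_true = 1.5;  sz2 = 4;   % true state and measurement variance
xh = 0;  s2 = 100;        % initial estimate with a large variance
for k = 1:20
    z = x_true + sqrt(sz2)*randn;  % noisy measurement z(k)
    w = s2/(s2 + sz2);             % weight omega
    xh = xh + w*(z - xh);          % updated estimate
    s2 = (1 - w)*s2;               % updated variance, monotonically decreasing
end
xh, s2   % estimate near x_true; variance roughly sz2/20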

The prediction step of the Kalman filter provides the state prediction, given a known input action. The initial state estimate x̂(k) has a probability distribution with variance σ²(k). In the same way, the action u(k), which is responsible for the transition of the state x(k) to x(k + 1), has a probability distribution with variance (transition uncertainty) σ_u²(k). Using Example 6.17 let us examine the value of the state and the variance after the action is executed (after the state transition).

Example 6.17

The initial state estimate x̂(k) with variance σ²(k) is known. Then an action u(k) is executed, which represents a direct transition of the state with uncertainty (variance) σ_u²(k). What is the value of the state estimate and the state uncertainty after the transition?

Solution

The updated state estimate after the transition is

$$\hat{x}(k+1) = \hat{x}(k) + u(k)$$

and the uncertainty of this estimation is

$$\begin{aligned}
\sigma^2(k+1) &= E\{(\hat{x}(k+1)-E\{\hat{x}(k+1)\})^2\} = E\{(\hat{x}(k)+u(k)-E\{\hat{x}(k)+u(k)\})^2\} \\
&= E\{((\hat{x}(k)-E\{\hat{x}(k)\})+(u(k)-E\{u(k)\}))^2\} \\
&= \sigma^2(k) + \sigma_u^2(k) + 2E\{(\hat{x}(k)-E\{\hat{x}(k)\})(u(k)-E\{u(k)\})\}
\end{aligned} \qquad (6.33)$$

Since x̂ and u are independent, E{(x̂(k) − E{x̂(k)})(u(k) − E{u(k)})} = 0, and therefore Eq. (6.33) simplifies to

$$\sigma^2(k+1) = \sigma^2(k) + \sigma_u^2(k)$$

Simplified Implementation of the Kalman Filter Algorithm

The Kalman filter algorithm for a simple case with only a single state is given in Algorithm 4, where the variables with subscript (⋅)_{k|k−1} represent the values estimated in the prediction step, and the variables with subscript (⋅)_{k|k} represent the values from the correction step. For improved readability the following notation is used: u(k − 1) = u_{k−1} and z(k) = z_k.

Algorithm 4

Kalman Filter for a Single State

function Kalman_filter(x̂_{k−1|k−1}, u_{k−1}, z_k, σ²_{k−1|k−1}, σ²_{u_{k−1}}, σ²_{z_k})

Prediction step:

$$\hat{x}_{k|k-1} \leftarrow \hat{x}_{k-1|k-1} + u_{k-1}$$
$$\sigma_{k|k-1}^2 \leftarrow \sigma_{k-1|k-1}^2 + \sigma_{u_{k-1}}^2$$

Correction step:

$$\omega_k \leftarrow \frac{\sigma_{k|k-1}^2}{\sigma_{k|k-1}^2+\sigma_{z_k}^2}$$
$$\hat{x}_{k|k} \leftarrow \hat{x}_{k|k-1} + \omega_k\,(z_k - \hat{x}_{k|k-1})$$
$$\sigma_{k|k}^2 \leftarrow (1-\omega_k)\,\sigma_{k|k-1}^2$$

return x̂_{k|k}, σ²_{k|k}

end function

The Kalman filter consists of two steps (prediction and correction) that are executed one after another in a loop. In the prediction step only the known action is used to predict the state in the next time step. From the initial belief a new belief is evaluated, and the uncertainty of the new belief is higher than the initial uncertainty. In the correction step the measurement is used to improve the predicted belief, so that the new (corrected) state estimate has lower uncertainty than the predicted one. In both steps only two inputs are required: in the prediction step the value of the previous belief x̂_{k−1|k−1} and the executed action u_{k−1} need to be known, and in the correction step the predicted belief x̂_{k|k−1} and the measurement z_k are required. The previous state variance σ²_{k−1|k−1}, the input action variance σ²_{u_{k−1}}, and the measurement variance σ²_{z_k} also need to be given.
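A possible Matlab realization of Algorithm 4 (the function name kalman_filter_1d and the argument order are illustrative assumptions, not from the text):

function [xh, s2] = kalman_filter_1d(xh, u, z, s2, su2, sz2)
    % Prediction step: propagate the estimate and inflate the variance
    xh = xh + u;
    s2 = s2 + su2;
    % Correction step: fuse the measurement with the prediction
    w  = s2/(s2 + sz2);    % weight (Kalman gain)
    xh = xh + w*(z - xh);
    s2 = (1 - w)*s2;
end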

Example 6.18

There is a mobile robot that can move in only one dimension. The initial position of the robot is unknown (Fig. 6.19). Let us therefore assume the initial position estimate x̂_0 = 3 with a large variance σ_0² = 100 (the true position x_0 = 0 is not known).

Fig. 6.19 Localization of a mobile robot in one-dimensional space with unknown initial position.

Then, at every time moment k − 1 = 0, …, 4 the mobile robot is moved for u_{0:4} = (2, 3, 2, 1, 1) units, and measurements of the robot position are taken at time moments k = 1, …, 5: z_{1:5} = (2, 5, 7, 8, 9). The movement actions and the measurements are disturbed by white, normally distributed, zero-mean noise that can be described with a constant movement uncertainty σ_u² = 2 and measurement uncertainty σ_z² = 4.

What is the estimated robot position and uncertainty of this estimate?

Solution

Let us apply Algorithm 4 to solve the given mobile robot localization problem. In the first time step (k = 1) the predicted state and variance are calculated first:

$$\hat{x}_{1|0} = \hat{x}_{0|0} + u_0 = 3 + 2 = 5$$
$$\sigma_{1|0}^2 = \sigma_{0|0}^2 + \sigma_u^2 = 100 + 2 = 102$$

and then the correction step of the Kalman filter in the first time step k = 1 can be evaluated:

$$\omega_1 = \frac{\sigma_{1|0}^2}{\sigma_{1|0}^2+\sigma_z^2} = \frac{102}{102+4} = 0.962$$
$$\hat{x}_{1|1} = \hat{x}_{1|0} + \omega_1\,(z_1-\hat{x}_{1|0}) = 5 + 0.962\,(2-5) = 2.113$$
$$\sigma_{1|1}^2 = (1-\omega_1)\,\sigma_{1|0}^2 = (1-0.962)\cdot 102 = 3.849$$

These prediction and correction steps can be evaluated for all the other time steps. The prediction results are therefore

$$\hat{x}_{1:5|0:4} = (5.00, 5.11, 7.05, 8.02, 9.01)$$
$$\sigma_{1:5|0:4}^2 = (102, 5.85, 4.38, 4.09, 4.02)$$

and the correction results are

$$\hat{x}_{1:5|1:5} = (2.11, 5.05, 7.02, 8.01, 9.01)$$
$$\sigma_{1:5|1:5}^2 = (3.85, 2.38, 2.09, 2.02, 2.01)$$

The obtained results show that the position of the mobile robot can be determined in a few time steps with an uncertainty of about 2, which agrees with the fusion of the prediction uncertainty and the measurement uncertainty: (1/σ²_{5|4} + 1/σ_z²)^{−1} = 2.01. The uncertainty of the predicted position estimate converges toward 4, which is in accordance with the correction uncertainty from the previous time step and the movement uncertainty: σ²_{4|4} + σ_u² = 4.02.
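Example 6.18 has no accompanying listing; a short script (assuming the kalman_filter_1d sketch from above is available on the Matlab path) reproduces the sequences:

u = [2 3 2 1 1];  z = [2 5 7 8 9];  % actions u_{0:4} and measurements z_{1:5}
xh = 3;  s2 = 100;                  % initial estimate and (large) variance
su2 = 2;  sz2 = 4;                  % movement and measurement variances
for k = 1:5
    [xh, s2] = kalman_filter_1d(xh, u(k), z(k), s2, su2, sz2);
    fprintf('k=%d: x=%6.2f  var=%6.2f\n', k, xh, s2);
end
% prints the corrected estimates 2.11, 5.05, 7.02, 8.01, 9.01
% and the variances 3.85, 2.38, 2.09, 2.02, 2.01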

6.5.1 Kalman Filter in Matrix Form

Systems with multiple inputs, multiple states, and multiple outputs can be represented in matrix form for improved readability. A general linear system can be expressed in state-space form as

$$x(k+1) = Ax(k) + Bu(k) + Fw(k)$$
$$z(k) = Cx(k) + v(k) \qquad (6.34)$$

where x is the state vector, u is the input (action) vector, and z is the output (measurement); A is the state matrix, B is the input matrix, F is the input noise matrix, and C is the output matrix; w(k) is the process noise and v(k) is the output (measurement) noise. In the case that the noise w is added to the system input u, the relation F = B holds. The process noise w(k) and measurement noise v(k) are assumed to be independent (uncorrelated) zero-mean white noises with covariance matrices Q_k = E{w(k)w^T(k)} and R_k = E{v(k)v^T(k)}.

The probability distribution of the states x that are disturbed by a white Gaussian noise can be written in a matrix form as follows:

$$p(x) = \det(2\pi P)^{-\frac{1}{2}}\,e^{-\frac{1}{2}(x-\mu)^T P^{-1}(x-\mu)}$$

where P is the state-error covariance matrix.

The Kalman filter is an approach for filtering and state estimation of linear systems with a continuous state space that are disturbed by normally distributed noise. The noise distribution is represented by a Gaussian function (Gaussian noise). The input and measurement noises influence the internal system states that we would like to estimate. If the model of the system is linear, Gaussian noise propagated through the model (e.g., from the inputs to the states) remains Gaussian. The system must therefore be linear, since this requirement ensures a Gaussian distribution of the noise on the states, an assumption used in the derivation of the Kalman filter. The Kalman filter state estimate converges to the true value only in the case of linear systems that are disturbed by Gaussian noise.

The Kalman filter for a linear system (6.34) has a prediction step:

$$\hat{x}_{k|k-1} = A\hat{x}_{k-1|k-1} + Bu_{k-1}$$
$$P_{k|k-1} = AP_{k-1|k-1}A^T + FQ_{k-1}F^T \qquad (6.35)$$

and a correction step:

$$K_k = P_{k|k-1}C^T\left(CP_{k|k-1}C^T + R_k\right)^{-1}$$
$$\hat{x}_{k|k} = \hat{x}_{k|k-1} + K_k\left(z_k - C\hat{x}_{k|k-1}\right)$$
$$P_{k|k} = P_{k|k-1} - K_kCP_{k|k-1} \qquad (6.36)$$

In the prediction part of the algorithm the a priori estimate x̂_{k|k−1} is determined, based on the previous estimate x̂_{k−1|k−1} (obtained from measurements up to time moment k − 1) and the input u(k − 1). In the correction part of the Kalman filter the a posteriori estimate x̂_{k|k} is calculated, based on the measurements up to time step k. The state correction is made by computing the difference between the true and the predicted measurement (z_k − Cx̂_{k|k−1}); this difference is also known as the innovation or measurement residual. The state correction is calculated as a product of the Kalman gain K_k and the innovation. The prediction part can be evaluated in advance, while waiting for the new measurement at time step k. Notice the similarity of the matrix notation in Eqs. (6.35), (6.36) with the notation used in Algorithm 4.
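A compact Matlab sketch of one prediction-correction cycle of Eqs. (6.35), (6.36); the scalar model values below are placeholders standing in for a concrete system (with these values the cycle reproduces the first step of Example 6.18):

% One cycle of the matrix-form Kalman filter, Eqs. (6.35) and (6.36)
A = 1; B = 1; F = 1; C = 1;  % placeholder model matrices
Q = 2; R = 4;                % process and measurement noise covariances
xh = 3; P = 100;             % previous estimate and covariance
u = 2; z = 2;                % current action and measurement
% Prediction step, Eq. (6.35)
xh = A*xh + B*u;
P  = A*P*A.' + F*Q*F.';
% Correction step, Eq. (6.36)
K  = (P*C.')/(C*P*C.' + R);  % Kalman gain
xh = xh + K*(z - C*xh);      % yields 2.113 for these values
P  = P - K*C*P;              % yields 3.849 for these values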

Let us derive the equation for the calculation of the state error covariance matrix in the prediction part of the Kalman filter:

$$\begin{aligned}
P_{k|k-1} &= E\{(x_k-\hat{x}_{k|k-1})(x_k-\hat{x}_{k|k-1})^T\} = \mathrm{cov}(x_k-\hat{x}_{k|k-1}) \\
&= \mathrm{cov}(Ax_{k-1}+Bu_{k-1}+Fw_{k-1}-A\hat{x}_{k-1|k-1}-Bu_{k-1}) \\
&= \mathrm{cov}(A(x_{k-1}-\hat{x}_{k-1|k-1})+Fw_{k-1}) \\
&= \mathrm{cov}(A(x_{k-1}-\hat{x}_{k-1|k-1})) + \mathrm{cov}(Fw_{k-1}) \\
&= E\{A(x_{k-1}-\hat{x}_{k-1|k-1})(x_{k-1}-\hat{x}_{k-1|k-1})^TA^T\} + E\{Fw_{k-1}w_{k-1}^TF^T\} \\
&= AP_{k-1|k-1}A^T + FQ_{k-1}F^T
\end{aligned}$$

where in the fourth line of the derivation we took into account that the process noise w_{k−1} is independent of the state estimation error in the previous time step (x_{k−1} − x̂_{k−1|k−1}).

Let us also derive the equation for computation of the state error covariance matrix in the correction part of the Kalman filter:

$$\begin{aligned}
P_{k|k} &= E\{(x_k-\hat{x}_{k|k})(x_k-\hat{x}_{k|k})^T\} = \mathrm{cov}(x_k-\hat{x}_{k|k}) \\
&= \mathrm{cov}(x_k-\hat{x}_{k|k-1}-K_k(z_k-C\hat{x}_{k|k-1})) \\
&= \mathrm{cov}(x_k-\hat{x}_{k|k-1}-K_k(Cx_k+v_k-C\hat{x}_{k|k-1})) \\
&= \mathrm{cov}((I-K_kC)(x_k-\hat{x}_{k|k-1})-K_kv_k) \\
&= \mathrm{cov}((I-K_kC)(x_k-\hat{x}_{k|k-1})) + \mathrm{cov}(K_kv_k) \\
&= (I-K_kC)P_{k|k-1}(I-K_kC)^T + K_kR_kK_k^T
\end{aligned}$$

where in the fifth line of the derivation we took into account that the measurement noise v_k is uncorrelated with the other terms. The obtained relation for the calculation of the covariance matrix P_{k|k} is general and can be used for an arbitrary gain K_k. However, the expression for P_{k|k} in Eq. (6.36) is valid only for the optimum gain (Kalman gain) that minimizes the mean squared correction error E{∥x_k − x̂_{k|k}∥²}; this is equivalent to the minimization of the sum of all the diagonal elements of the correction covariance matrix P_{k|k}.

The general equation for Pk|k can be extended and the terms rearranged:

$$\begin{aligned}
P_{k|k} &= (I-K_kC)P_{k|k-1}(I-K_kC)^T + K_kR_kK_k^T \\
&= P_{k|k-1} - K_kCP_{k|k-1} - P_{k|k-1}C^TK_k^T + K_kCP_{k|k-1}C^TK_k^T + K_kR_kK_k^T \\
&= P_{k|k-1} - K_kCP_{k|k-1} - P_{k|k-1}C^TK_k^T + K_k(CP_{k|k-1}C^T + R_k)K_k^T \\
&= P_{k|k-1} - K_kCP_{k|k-1} - P_{k|k-1}C^TK_k^T + K_kS_kK_k^T
\end{aligned}$$

where S_k = CP_{k|k−1}C^T + R_k represents the innovation covariance matrix (S_k = cov(z_k − Cx̂_{k|k−1})). The sum of the diagonal terms of P_{k|k} is minimal when the derivative of P_{k|k} with respect to K_k is zero:

$$\frac{\partial P_{k|k}}{\partial K_k} = -2(CP_{k|k-1})^T + 2K_kS_k = 0$$

which leads to the optimum gain in Eq. (6.36):

$$K_k = P_{k|k-1}C^TS_k^{-1} = P_{k|k-1}C^T\left(CP_{k|k-1}C^T + R_k\right)^{-1}$$
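A quick numeric sanity check (with arbitrary illustrative matrices) that the general form of P_{k|k} coincides with the short form from Eq. (6.36) exactly at the optimum gain:

% Joseph form vs. short form of the covariance update at the Kalman gain
P = [4 1; 1 2];  C = [1 0];  R = 1;  % illustrative covariance and model
S = C*P*C.' + R;                     % innovation covariance S_k
K = (P*C.')/S;                       % optimum (Kalman) gain
I = eye(2);
P_general = (I - K*C)*P*(I - K*C).' + K*R*K.';
P_short   = P - K*C*P;
disp(P_general - P_short)            % numerically zero at the optimum gain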

The correction covariance matrix at the optimum gain can be derived by postmultiplying the optimum gain by S_kK_k^T and inserting the result into the equation for P_{k|k}:

$$P_{k|k} = P_{k|k-1} - K_kCP_{k|k-1} - P_{k|k-1}C^TK_k^T + P_{k|k-1}C^TK_k^T = P_{k|k-1} - K_kCP_{k|k-1}$$

since K_kS_kK_k^T = P_{k|k−1}C^TK_k^T at the optimum gain, which recovers the expression for P_{k|k} in Eq. (6.36).