Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

Chapter 5

PID Control with Neural Compensation

Abstract

The simplest method to decrease the tracking error of PD control is to add an integral action, i.e., change the neural PD control into neural PID control. A natural question is: Why do we not add an integrator instead of increasing derivative gain in the neural PD control?

In this chapter, the well-known neural PD control of robot manipulators is extended to the neural PID control. It is an industrial linear PID controller adding a neural compensator. The semiglobal asymptotic stability of this novel neural control is proven. Explicit conditions for choosing PID gains are given. When the measurement of velocities it is not available, local asymptotic stability is also proven with a velocity observer. Unlike the other neural controllers of robot manipulators, our neural PID does not need a big derivative and integral gains to assure asymptotic stability. We apply this neural control to our 3-DoF exoskeleton robot in CINVESTA-IPN. Experimental results show that this neural PID control has many advantages over classic PID control, neural PD control, and the other neural PID control.

Keywords

Neural networks; Linear PID control

5.1 Stable neural PID control

Without flexible links and high-frequency joint dynamics, rigid robots can be expressed in the Lagrangian form:

$M (q) \overset{\cdot \cdot}{q} + C (q, \overset{\cdot}{q}) \overset{\cdot}{q} + G (q) + F (\dot{q}) = u$

(5.1)

where $q \in R^{n}$ represents the link positions. $M (q)$ is the inertia matrix, $C (q, \dot{q}) = {c_{k j}}$ represents centrifugal force, $G (q)$ is a vector of gravity torques, and $F (\dot{q})$ is friction. All terms $M (q)$ , $C (q, \overset{\cdot}{q})$ , $G (q)$ , and $F (\dot{q})$ are unknown. $u \in R^{n}$ is the control input. The friction $F (\dot{q})$ is represented by the Coulomb friction model:

$F (\dot{q}) = K_{f 1} \dot{q} + K_{f 2} \tanh (k_{f 3} \dot{q})$

(5.2)

where $k_{f 3}$ is a large positive constant, such that $\tanh (k_{f 3} \dot{q})$ can approximate $s i g n (\dot{q})$ , and $K_{f 1}$ and $K_{f 2}$ are positive coefficients. In this paper, we use a simple model for the friction as in [85] and [71],

$F (\dot{q}) = K_{f 1} \dot{q}$

(5.3)

When $G (q)$ and $F (\dot{q})$ are unknown, we may use a neural network to approximate them as

$\begin{matrix} f (q, \overset{\cdot}{q}) = G (q) + F (\dot{q}) \\ \hat{f} (q, \overset{\cdot}{q}) = \hat{W} σ (q, \overset{\cdot}{q}), f (q, \overset{\cdot}{q}) = W^{⁎} σ (q, \overset{\cdot}{q}) + ϕ (q) \end{matrix}$

(5.4)

where $W^{⁎}$ is an unknown constant weight, $\hat{W}$ is the estimated weight, $ϕ (q, \overset{\cdot}{q})$ is the neural approximation error, σ is a neural activation function. Here, we use the Gaussian function such that $σ (q, \overset{\cdot}{q}) ⩾ 0$ .

Since the joint velocity $\dot{q}$ is not always available, we may use a velocity observer to approximate it. This linear-in-the-parameter net is the simplest neural network. According to the universal function approximation theory, the smooth function $f (q, \overset{\cdot}{q})$ can be approximated by a multilayer neural network with one hidden layer in any desired accuracy provided proper weights and hidden neurons:

$\hat{f} (q, \overset{\cdot}{q}) = \hat{W} σ (\hat{V} [q, \overset{\cdot}{q}]), f (q) = W^{⁎} σ (V^{⁎} [q, \overset{\cdot}{q}]) + ϕ (q, \overset{\cdot}{q})$

(5.5)

where $\hat{W} \in R^{n \times m}$ , $\hat{V} \in R^{m \times n}$ , m is hidden node number, $\hat{V}$ is the weight in the hidden layer. In order to simplify the theory analysis, we first use linear-in-the-parameter net (5.4). Then we will show that the multilayer neural network (5.5) can also be used for the neural control of robot manipulators. The robot dynamics (5.1) have the following standard properties [131], which will be used to prove stability.

P5.1. The inertia matrix $M (q)$ is symmetric positive definite, and

$0 < λ_{m} {M (q)} ⩽ ‖ M ‖ ⩽ λ_{M} {M (q)} ⩽ β, β > 0$

(5.6)

where $λ_{M} {M}$ and $λ_{m} {M}$ are the maximum and minimum eigenvalues of the matrix M.

P5.2. For the centrifugal and Coriolis matrix $C (q, \dot{q})$ , there exists a number $k_{c} > 0$ such that

$‖ C (q, \dot{q}) \dot{q} ‖ ⩽ k_{c} {‖ \dot{q} ‖}^{2}, k_{c} > 0$

(5.7)

and $\dot{M} (q) - 2 C (q, \dot{q})$ is skew symmetric, i.e.,

$x^{T} [\dot{M} (q) - 2 C (q, \dot{q})] x = 0$

(5.8)

also

$\dot{M} (q) = C (q, \dot{q}) + C {(q, \dot{q})}^{T}$

(5.9)

P5.3. The neural approximation error $ϕ (q, \overset{\cdot}{q})$ is Lipschitz over q and $\overset{\cdot}{q}$

$‖ ϕ (x) - ϕ (y) ‖ ⩽ k_{ϕ} ‖ x - y ‖$

(5.10)

From (5.4), we know

$G (q) + F (\dot{q}) = W^{⁎} σ (q, \overset{\cdot}{q}) + ϕ (q, \overset{\cdot}{q})$

(5.11)

Because $G (q)$ and $F (\dot{q})$ satisfy the Lipschitz condition, P5.3 is established.

In order to simplify calculation, we use the simple model for the friction as in (5.3). The lower bound of $\int ϕ (q) d q$ can be estimated as

$\int_{0}^{t} ϕ (q, \overset{\cdot}{q}) d q = \int_{0}^{t} G (q) d q + \int_{0}^{t} F (\dot{q}) d q - \int_{0}^{t} W^{⁎} σ (q) d q$

(5.12)

where $U (q_{t})$ is the potential energy of the robot, $\frac{\partial U}{\partial q} = G (q)$ . Since $σ (\cdot)$ is a Gaussian function, $W^{⁎} σ (q) > 0$ . By $U (q_{t}) > 0$ ,

$\int_{0}^{t} ϕ (q, \overset{\cdot}{q}) d q > K_{f 1} q_{t} - K_{f 1} q_{0} - \frac{1}{2} \sqrt{π} W^{⁎}$

where $\int_{0}^{t} σ (q) = \frac{1}{2} \sqrt{π} \erf (q)$ . Since the work space of a manipulator (the entire set of points reachable by the manipulator) is known, $\min {q_{t}}$ can be estimated. We define the lower bound of $\int_{0}^{t} ϕ (q) d q$ as

$k_{ϕ} = K_{f 1} \min {q_{t}} - K_{f 1} q_{0} - \frac{1}{2} \sqrt{π} W^{⁎}$

(5.13)

Given a desired constant position $q^{d} \in R^{n}$ , the objective of robot control is to design the input torque u in (5.1) such that the regulation error

$\tilde{q} = q^{d} - q$

(5.14)

$\tilde{q} \to 0$ and $\overset{\cdot}{\tilde{q}} \to 0$ when initial conditions are in arbitrary large domain of attraction.

The classic industrial PID law is

$u = K_{p} \tilde{q} + K_{i} \int_{0}^{t} \tilde{q} (τ) d τ + K_{d} \overset{\cdot}{\tilde{q}}$

(5.15)

where $K_{p}$ , $K_{i}$ , and $K_{d}$ are proportional, integral, and derivative gains of the PID controller, respectively.

When the unknown dynamic $‖ f (q, \overset{\cdot}{q}) ‖$ in (5.4) is big, in order to assure asymptotic stability, the integral gain $K_{i}$ has to be increased. This may cause big overshoot, bad stability, and integrator windup. Model-free compensation is an alternative solution, where $f (q)$ is estimated by a neural network as in (5.4). Normal neural PD control is [85]

$u = K_{p} \tilde{q} + K_{d} \overset{\cdot}{\tilde{q}} + \hat{f}$

(5.16)

where $\hat{f} (q, \overset{\cdot}{q}) = \hat{W} σ (q, \overset{\cdot}{q})$ . With the filtered error $r = \tilde{q} + Λ \overset{\cdot}{\tilde{q}}$ , (5.16) becomes

$u = K_{v} r + \hat{f}$

(5.17)

The control (5.17) avoids integrator problems in (5.15). Unlike industrial PID control, they cannot reach asymptotic stability. The stability condition of the neural PD control (5.16) is $‖ r ‖ > \frac{B}{K_{v}}$ , B is a constant [87]. In order to decrease $‖ r ‖$ , $K_{d}$ has to be increased. This causes a long settling time problem. The asymptotic stability ( $r \to 0$ ) requires $K_{v} \to \infty$ .

An integrator is added into the normal neural PD control (5.16). It has a similar form as the industrial PID in (5.15),

$u = K_{p} \tilde{q} + K_{d} \overset{\cdot}{\tilde{q}} + K_{i} \int_{0}^{t} \tilde{q} (τ) d τ + \hat{f}$

(5.18)

Because in the regulation case ${\dot{q}}^{d} = 0$ , $\overset{\cdot}{\tilde{q}} = - \dot{q}$ , the PID control law can be expressed via the following equations:

$\begin{matrix} u = K_{p} \tilde{q} - K_{d} \dot{q} + ξ + \hat{W} σ (q, \overset{\cdot}{q}) \\ \dot{ξ} = K_{i} \tilde{q}, ξ (0) = ξ_{0} \end{matrix}$

(5.19)

We require the PID control part of (5.19) is decoupled, i.e., $K_{p}, K_{i}$ , and $K_{d}$ are positive definite diagonal matrices. The closed-loop system of the robot (5.1) is

$\begin{matrix} M (q) \ddot{q} + C (q, \dot{q}) \dot{q} + \tilde{f} (q, \overset{\cdot}{q}) = K_{p} \tilde{q} - K_{d} \dot{q} + ξ \\ \dot{ξ} = K_{i} \tilde{q} \end{matrix}$

(5.20)

where $\tilde{f} = f - \hat{f}$

$\tilde{f} = W^{⁎} σ (q) + ϕ (q) - \hat{W} σ (q) = \tilde{W} σ (q) + ϕ (q)$

(5.21)

Here, $\tilde{W} = W^{⁎} - \hat{W}$ . In matrix form the closed-loop system is

$\frac{d}{d t} [\begin{matrix} ξ \\ \tilde{q} \\ \overset{\cdot}{\tilde{q}} \end{matrix}] = [\begin{matrix} K_{i} \tilde{q} \\ - \dot{q} \\ {\ddot{q}}^{d} + M^{- 1} (\begin{matrix} C \dot{q} + \tilde{W} σ (q) + ϕ (q, \overset{\cdot}{q}) \\ - K_{p} \tilde{q} + K_{d} \dot{q} - ξ \end{matrix}) \end{matrix}]$

(5.22)

The equilibrium of (5.22) is $[ξ, \tilde{q}, \overset{\cdot}{\tilde{q}}] = [ξ^{⁎}, 0, 0]$ . Since at equilibrium point $q = q^{d}$ and ${\overset{\cdot}{q}}^{d} = 0$ the equilibrium is $[ϕ (q^{d}), 0, 0]$ . We simplify $ϕ (q^{d}, 0)$ as $ϕ (q^{d})$ .

In order to move the equilibrium to the origin, we define

$\tilde{ξ} = ξ - ϕ (q^{d})$

(5.23)

The final closed-loop equation becomes

$\begin{matrix} M (q) \ddot{q} + C (q, \dot{q}) \dot{q} + \tilde{W} σ (q, \overset{\cdot}{q}) + ϕ (q, \overset{\cdot}{q}) = K_{p} \tilde{q} - K_{d} \dot{q} + \tilde{ξ} + ϕ (q^{d}) \\ \overset{\cdot}{\tilde{ξ}} = K_{i} \tilde{q} \end{matrix}$

(5.24)

The following theorem gives the stability analysis of the neural PID control. From this theorem, we can see how to choose the PID gains and how to train the weight of the neural compensator in (5.19). Another important conclusion is that the neural PID control (5.19) can force the error $\tilde{q}$ to zero.

Theorem 5.1

Consider the robot dynamic (5.1) controlled by the neural PID control (5.19). The closed-loop system (5.24) is semiglobally asymptotically stable at the equilibrium $x = {[ξ - ϕ (q^{d}), \tilde{q}, \overset{\cdot}{\tilde{q}}]}^{T} = 0$ , provided that control gains satisfy

$\begin{matrix} λ_{m} (K_{p}) ⩾ \frac{3}{2} k_{ϕ} \\ λ_{M} (K_{i}) ⩽ β \frac{λ_{m} (K_{p})}{λ_{M} (M)} \\ λ_{m} (K_{d}) ⩾ β + λ_{M} (M) \end{matrix}$

(5.25)

where $β = \sqrt{\frac{λ_{m} (M) λ_{m} (K_{p})}{3}}$ , $k_{ϕ}$ satisfies (5.10), and the weight of the neural networks (5.4) is tuned by

$\overset{\cdot}{\hat{W}} = - K_{w} σ (q, \overset{\cdot}{q}) {(\dot{q} + α \tilde{q})}^{T}$

(5.26)

where α is positive design constant, it satisfies

$\frac{\sqrt{\frac{1}{3} λ_{m} (M) λ_{m} (K_{p})}}{λ_{M} (M)} ⩾ α ⩾ \frac{3}{λ_{m} (K_{i}^{- 1}) λ_{m} (K_{p})}$

(5.27)

Proof

We construct a Lyapunov function as

$\begin{matrix} V = \frac{1}{2} {\dot{q}}^{T} M \dot{q} + \frac{1}{2} {\tilde{q}}^{T} K_{p} \tilde{q} + \int_{0}^{t} ϕ (q, \overset{\cdot}{q}) d q - k_{ϕ} + {\tilde{q}}^{T} ϕ (q^{d}) \\ + \frac{3}{2} ϕ {(q^{d})}^{T} K_{p}^{- 1} ϕ (q^{d}) + \frac{α}{2} {\tilde{ξ}}^{T} K_{i}^{- 1} \tilde{ξ} \\ + {\tilde{q}}^{T} \tilde{ξ} - α {\tilde{q}}^{T} M \dot{q} + \frac{α}{2} {\tilde{q}}^{T} K_{d} \tilde{q} + \frac{1}{2} t r ({\tilde{W}}^{T} K_{w}^{- 1} \tilde{W}) \end{matrix}$

(5.28)

where $k_{ϕ}$ is defined in (5.13) such that $V (0) = 0$ . α is a design positive constant. We first prove V is a Lyapunov function, $V ⩾ 0$ . The term $\frac{1}{2} {\tilde{q}}^{T} K_{p} \tilde{q}$ is separated into three parts, and $V = \sum_{i = 1}^{4} V_{i}$ :

$\begin{matrix} V_{1} = \frac{1}{6} {\tilde{q}}^{T} K_{p} \tilde{q} + {\tilde{q}}^{T} ϕ (q^{d}) + \frac{3}{2} ϕ {(q^{d})}^{T} K_{p}^{- 1} ϕ (q^{d}) \\ V_{2} = \frac{1}{6} {\tilde{q}}^{T} K_{p} \tilde{q} + {\tilde{q}}^{T} \tilde{ξ} + \frac{α}{2} {\tilde{ξ}}^{T} K_{i}^{- 1} \tilde{ξ} \\ V_{3} = \frac{1}{6} {\tilde{q}}^{T} K_{p} \tilde{q} - α {\tilde{q}}^{T} M \dot{q} + \frac{1}{2} {\dot{q}}^{T} M \dot{q} \\ V_{4} = \int_{0}^{t} ϕ (q) d τ - k_{ϕ} + \frac{α}{2} {\tilde{q}}^{T} K_{d} \tilde{q} + \frac{1}{2} t r ({\tilde{W}}^{T} K_{w}^{- 1} \tilde{W}) ⩾ 0 \end{matrix}$

(5.29)

It is easy to find

$V_{1} = \frac{1}{2} {[\begin{matrix} \tilde{q} \\ ϕ (q^{d}) \end{matrix}]}^{T} [\begin{matrix} \frac{1}{3} K_{p} & I \\ I & 3 K_{p}^{- 1} \end{matrix}] [\begin{matrix} \tilde{q} \\ ϕ (q^{d}) \end{matrix}]$

(5.30)

Since $K_{p} ⩾ 0$ , $V_{1}$ is a semipositive definite matrix, $V_{1} ⩾ 0$ . When $α ⩾ \frac{3}{λ_{m} (K_{i}^{- 1}) λ_{m} (K_{p})}$ ,

$V_{2} ⩾ \frac{1}{2} {(\sqrt{\frac{1}{3} λ_{m} (K_{p})} ‖ \tilde{q} ‖ - \sqrt{\frac{3}{λ_{m} (K_{p})}} ‖ \tilde{ξ} ‖)}^{2} ⩾ 0$

(5.31)

Because

$y^{T} A x ⩽ ‖ y ‖ ‖ A x ‖ ⩽ ‖ y ‖ ‖ A ‖ ‖ x ‖ ⩽ | λ_{M} (A) | ‖ y ‖ ‖ x ‖$

(5.32)

when $α ⩽ \frac{\sqrt{\frac{1}{3} λ_{m} (M) λ_{m} (K_{p})}}{λ_{M} (M)}$ ,

$V_{3} ⩾ \frac{1}{2} {(\sqrt{λ_{m} (M)} ‖ \dot{q} ‖ - \sqrt{\frac{1}{3} λ_{m} (K_{p})} ‖ \tilde{q} ‖)}^{2} ⩾ 0$

(5.33)

Obviously, if

$\sqrt{\frac{1}{3}} λ_{m} (K_{i}^{- 1}) λ_{m}^{\frac{3}{2}} (K_{p}) λ_{m}^{\frac{1}{2}} (M) ⩾ λ_{M} (M)$

(5.34)

there exists

$\frac{\sqrt{\frac{1}{3} λ_{m} (M) λ_{m} (K_{p})}}{λ_{M} (M)} ⩾ α ⩾ \frac{3}{λ_{m} (K_{i}^{- 1}) λ_{m} (K_{p})}$

(5.35)

This means if $K_{p}$ is sufficiently large or $K_{i}$ is sufficiently small, (5.34) is established, and $V (\dot{q}, \tilde{q}, \tilde{ξ})$ is globally positive definite. Using $\frac{d}{d t} \int_{0}^{t} ϕ (q, \overset{\cdot}{q}) d q = \frac{\partial \int_{0}^{t} ϕ (q, \overset{\cdot}{q}) d q}{\partial q} \frac{\partial q}{\partial t} = {\dot{q}}^{T} ϕ (q, \overset{\cdot}{q})$ , $\frac{d}{d t} ϕ (q^{d}) = 0$ and $\frac{d}{d t} [{\tilde{q}}^{T} ϕ (q^{d})] = {\overset{\cdot}{\tilde{q}}}^{T} ϕ (q^{d})$ , the derivative of V is

$\begin{matrix} \dot{V} = {\dot{q}}^{T} M \ddot{q} + \frac{1}{2} {\dot{q}}^{T} \overset{\cdot}{M} \dot{q} + {\overset{\cdot}{\tilde{q}}}^{T} K_{p} \tilde{q} + ϕ {(q, \overset{\cdot}{q})}^{T} \dot{q} + {\overset{\cdot}{\tilde{q}}}^{T} ϕ (q^{d}) + t r ({\tilde{W}}^{T} K_{w}^{- 1} \overset{\cdot}{\tilde{W}}) \\ + \overset{\cdot}{α {\tilde{ξ}}^{T}} K_{i}^{- 1} \tilde{ξ} + {\overset{\cdot}{\tilde{q}}}^{T} \tilde{ξ} + {\tilde{q}}^{T} \overset{\cdot}{\tilde{ξ}} - α ({\overset{\cdot}{\tilde{q}}}^{T} M \dot{q} + {\tilde{q}}^{T} \overset{\cdot}{M} \dot{q} + {\tilde{q}}^{T} M \ddot{q}) + α {\tilde{q}}^{T} K_{d} \overset{\cdot}{\tilde{q}} \end{matrix}$

(5.36)

Using (5.8) the first three terms of (5.36) become

$- {\dot{q}}^{T} ϕ (q) - {\dot{q}}^{T} K_{d} \dot{q} + {\dot{q}}^{T} \tilde{ξ} + {\dot{q}}^{T} ϕ (q^{d}) + {\dot{q}}^{T} \tilde{W} σ (q, \overset{\cdot}{q})$

(5.37)

And

$\dot{V} ⩽ - [λ_{m} (K_{d}) - α λ_{M} (M) - α k_{c} ‖ \tilde{q} ‖] {‖ \dot{q} ‖}^{2} - [α λ_{m} (K_{p}) - λ_{M} (K_{i}) - α k_{g}] {‖ \tilde{q} ‖}^{2}$

(5.38)

$‖ \tilde{q} ‖ ⩽ \frac{λ_{M} (M)}{α k_{c}}$

(5.39)

and

$\begin{matrix} λ_{m} (K_{d}) ⩾ (1 + α) λ_{M} (M) \\ λ_{m} (K_{p}) ⩾ \frac{1}{α} λ_{M} (K_{i}) + k_{g} \end{matrix}$

(5.40)

then $\dot{V} ⩽ 0$ , $‖ \tilde{q} ‖$ decreases. Then (5.40) is established. Using (5.34) and $λ_{m} (K_{i}^{- 1}) = \frac{1}{λ_{M} (K_{i})}$ , (5.40) is (5.25).

$\dot{V}$ is negative semidefinite. Define a ball Σ of radius $σ > 0$ centered at the origin of the state space, which satisfies these conditions:

$Σ = {\tilde{q} : ‖ \tilde{q} ‖ ⩽ \frac{λ_{M} (M)}{α k_{c}} = σ}$

(5.41)

$\dot{V}$ is negative semidefinite on the ball Σ. There exists a ball Σ of radius $σ > 0$ centered at the origin of the state space on which $\dot{V} ⩽ 0$ . The origin of the closed-loop equation (5.24) is a stable equilibrium. Since the closed-loop equation is autonomous, we use LaSalle's theorem. Define Ω as

$\begin{matrix} Ω = {x (t) = [\tilde{q}, \dot{q}, \tilde{ξ}] \in R^{3 n} : \dot{V} = 0} \\ = {\tilde{ξ} \in R^{n} : \tilde{q} = 0 \in R^{n}, \dot{q} = 0 \in R^{n}} \end{matrix}$

(5.42)

From (5.36), $\dot{V} = 0$ if and only if $\tilde{q} = \dot{q} = 0$ . For a solution $x (t)$ to belong to Ω for all $t ⩾ 0$ , it is necessary and sufficient that $\tilde{q} = \dot{q} = 0$ for all $t ⩾ 0$ . Therefore it must also hold that $\ddot{q} = 0$ for all $t ⩾ 0$ . We conclude from the closed-loop system (5.24) that if $x (t) \in Ω$ for all $t ⩾ 0$ , then

$\begin{matrix} ϕ (q, \overset{\cdot}{q}) = ϕ (q^{d}, 0) = \tilde{ξ} + ϕ (q^{d}, 0) \\ \overset{\cdot}{\tilde{ξ}} = 0 \end{matrix}$

(5.43)

implies that $\tilde{ξ} = 0$ for all $t ⩾ 0$ . So $x (t) = [\tilde{q}, \dot{q}, \tilde{ξ}] = 0 \in R^{3 n}$ is the only initial condition in Ω for which $x (t) \in Ω$ for all $t ⩾ 0$ .

Finally, we conclude from all of this that the origin of the closed-loop system (5.24) is locally asymptotically stable. Because $\frac{1}{α} ⩽ λ_{m} (K_{i}^{- 1}) λ_{m} (K_{p})$ , the upper bound for $‖ \tilde{q} ‖$ can be

$‖ \tilde{q} ‖ ⩽ \frac{λ_{M} (M)}{k_{c}} λ_{M} (K_{i}) λ_{m} (K_{p})$

(5.44)

It establishes the semiglobal stability of our controller, in the sense that the domain of attraction can be arbitrarily enlarged with a suitable choice of the gains. Namely, increasing $K_{p}$ the basin of attraction will grow. □

Remark 5.1

From the above stability analysis, we see that the gain matrices of the neural PID control (5.19) can be chosen directly from the conditions (5.25). The tuning procedure of the PID parameters is more simple than [2,4,71,101,117]. No modeling information is needed. The upper or lower bounds of PID gains need the maximum eigenvalue of M in (5.25); it can be estimated without calculating M. For a robot with only revolute joints [131],

$λ_{M} (M) ⩽ β, β ⩾ n (\max_{i, j} | m_{i j} |)$

(5.45)

where $m_{i j}$ stands the ijth element of M, $M \in R^{n \times n}$ . A β can be selected such that it is much bigger than all elements.

Remark 5.2

The main difference between our neural PID control with the other neural PD controllers is that the stability condition is changed. We require the regulation error:

$‖ \tilde{q} ‖ < k_{1} λ_{M} (K_{i}) λ_{m} (K_{p})$

(5.46)

The other neural PD controllers need

$‖ \tilde{q} ‖ > \frac{k_{2}}{K_{v}}$

(5.47)

where $k_{1}$ and $k_{2}$ are positive constants. Obviously, if the initial condition is not worse and satisfies (5.46), (5.46) is always satisfied, and $‖ \tilde{q} ‖$ will decrease to zero. But (5.47) cannot be satisfied when $‖ \tilde{q} ‖$ becomes small, so $K_{v}$ has to be increased.

Remark 5.3

If the unknown $f (q)$ is estimated by the multilayer neural network (5.5) the modeling error (5.21) becomes

$\begin{matrix} \tilde{f} = f - \hat{f} = W^{⁎} σ (V^{⁎} [q, \overset{\cdot}{q}]) + ϕ (q, \overset{\cdot}{q}) - \hat{W} σ (\hat{V} [q, \overset{\cdot}{q}]) \\ = \tilde{W} σ (\hat{V} [q, \overset{\cdot}{q}]) - W^{⁎} σ (\hat{V} [q, \overset{\cdot}{q}]) + W^{⁎} σ (V^{⁎} [q, \overset{\cdot}{q}]) + ϕ (q, \overset{\cdot}{q}) \\ = \tilde{W} σ (\hat{V} [q, \overset{\cdot}{q}]) + W^{⁎} σ^{'} \tilde{V} [q, \overset{\cdot}{q}] + ϵ_{1} + ϕ (q, \overset{\cdot}{q}) \\ = \tilde{W} [σ (\hat{V} [q, \overset{\cdot}{q}]) + σ^{'} \tilde{V} [q, \overset{\cdot}{q}]] + \hat{W} σ^{'} \tilde{V} [q, \overset{\cdot}{q}] + ϕ_{1} (q, \overset{\cdot}{q}) \end{matrix}$

(5.48)

where $ϕ_{1} (q) = ϵ_{1} + ϕ (q, \overset{\cdot}{q})$ , $ϵ_{1}$ is the Taylor approximation error. The closed-loop equation (5.24) becomes

$\begin{matrix} M (q) \ddot{q} + C (q, \dot{q}) \dot{q} + \tilde{W} {σ (\hat{V} [q, \overset{\cdot}{q}]) + σ^{'} \tilde{V} [q, \overset{\cdot}{q}]} + \hat{W} σ^{'} \tilde{V} [q, \overset{\cdot}{q}] + ϕ_{1} (q, \overset{\cdot}{q}) \\ = K_{p} \tilde{q} - K_{d} \dot{q} + \tilde{ξ} + ϕ (q^{d}) \\ \overset{\cdot}{\tilde{ξ}} = K_{i} \tilde{q} \end{matrix}$

(5.49)

If the Lyapunov function in (5.28) is changed as

$V_{m} = V + \frac{1}{2} t r ({\tilde{V}}^{T} K_{v}^{- 1} \tilde{V})$

(5.50)

then the derivative of (5.50) is

${\overset{\cdot}{V}}_{m} = \dot{V} - {\dot{q}}^{T} \tilde{W} σ (q [q, \overset{\cdot}{q}]) + {\dot{q}}^{T} \tilde{W} [σ (\hat{V} [q, \overset{\cdot}{q}]) + σ^{'} \tilde{V} [q, \overset{\cdot}{q}]] + t r ({\tilde{V}}^{T} K_{v}^{- 1} \overset{\cdot}{\tilde{V}})$

(5.51)

If the training rule (5.26) is changed as

$\begin{matrix} \overset{\cdot}{\hat{W}} = - K_{w} {σ (\hat{V} [q, \overset{\cdot}{q}]) + σ^{'} \tilde{V} [q, \overset{\cdot}{q}]} {(\dot{q} + α \tilde{q})}^{T} \\ \overset{\cdot}{\hat{V}} = - K_{v} \hat{W} σ^{'} q {(\dot{q} + α \tilde{q})}^{T} \end{matrix}$

(5.52)

Theorem 5.1 is also established.

One common problem of the linear PID control (5.18) is integral windup, where the rate of integration is larger than the actual speed of the system. The integrator's output may exceed the saturation limit of the actuator. The actuator will then operate at its limit no matter what the process outputs. This means that the system runs with an open loop instead of a constant feedback loop. The solutions of antiwindup schemes are mainly classified into two types [142]: conditional integration and back-calculation. It has been shown that none of the existed methods is able to provide good performance over a wide range of processes [5]. In this paper, we use the conditional integration algorithm. The integral term is limited to a selected value

$u = K_{p} \tilde{q} + K_{d} \overset{\cdot}{\tilde{q}} + s a t [K_{i} \int_{0}^{t} \tilde{q} (τ) d τ, ν_{\max}] + \hat{f}$

(5.53)

where $s a t [x, ν_{\max}] = {\begin{matrix} x & if ‖ x ‖ < ν_{\max} \\ ν_{\max} & if ‖ x ‖ ⩾ ν_{\max} \end{matrix}$ . $ν_{\max}$ is a prescribed value to the integral term when the controller saturates. This approach is also called preloading [123]. Now the linear PID controller becomes nonlinear PID. The semiglobal asymptotic stability has been analyzed by [2]. When $ν_{\max}$ is the maximum torque of all joint actuators, $ν_{\max} = k_{s} \max_{i} (| u_{i}^{\max} |)$ , $u_{i}^{\max} = \max (| u_{i} |)$ , $k_{s} ⩽ 1$ . A necessary condition is

$ν_{\max} ⩾ 3 \bar{G}, ‖ G (q) ‖ ⩽ \bar{G}$

where $G (q)$ is the gravity torque of the robot (5.1), $\bar{G}$ is the upper bound of $G (q)$ . $k_{s}$ is a design factor in the case that not all PID terms are subjected to saturation. For the controller (5.53), $k_{s}$ can selected as $k_{s} = \frac{1}{4}$ .

Following the process from (5.4) to (5.13), the neural PID with an antiwindup controller (5.53) requires

$\begin{matrix} ν_{\max} ⩾ 3 \bar{ϕ} \\ ‖ W^{⁎} σ (q, \overset{\cdot}{q}) + ϕ (q, \overset{\cdot}{q}) ‖ ⩽ ‖ W^{⁎} σ (q, \overset{\cdot}{q}) ‖ + ‖ ϕ (q, \overset{\cdot}{q}) ‖ ⩽ \bar{ϕ} \end{matrix}$

(5.54)

where $\bar{ϕ}$ is the upper bound of the neural estimator, $W^{⁎}$ , $σ (q, \overset{\cdot}{q})$ and $ϕ (q, \overset{\cdot}{q})$ are defined in (5.4).

We can see that the first additional condition for the neural PID with antiwindup is that the neural estimator must be bounded, while the linear neural PID only requires that the neural estimation error satisfy the Lipschitz condition (5.10).

Since $u_{i}^{\max}$ (or $ν_{\max}$ ) is a physical requirement for the actuator, it is not a design parameter. In order to satisfy the condition (5.54), we should force $\bar{ϕ}$ as small as possible. A good structure of the neural estimator may make the term $‖ W^{⁎} σ (q, \overset{\cdot}{q}) ‖$ smaller, such as the multilayer neural network in Remark 5.3. There are several methods that can be used to find a good neural network, such as the genetic algorithm [7] and pruning [130]. Besides structure optimization, the initial condition for the gradient training algorithm (5.26) also affects $\bar{ϕ}$ . Since the initial conditions for $\hat{W}$ and $\hat{V}$ in (5.52) do not affect the stability property, we design an off-line method to find a better value for $\hat{W} (0)$ and $\hat{V} (0)$ . If we let $\hat{W} (0) = W_{0}$ , $V (0) = V_{0}$ , the algorithm (5.52) can make the identification error convergent, i.e., $\hat{W} (t)$ and $\hat{V} (t)$ will make the identification error smaller than that of $W_{0}$ and $V_{0}$ . $\hat{W} (0)$ and $\hat{V} (0)$ are selected by the following steps:

1. Start from any initial value for $\hat{W} (0) = W_{0}$ , $\hat{V} (0) = V_{0}$ .
2. Do training with (5.52) until $T_{0}$ .
3. If the $‖ \tilde{q} (T_{0}) ‖ < ‖ \tilde{q} (0) ‖$ , let $\hat{W} (T_{0})$ and $\hat{V} (T_{0})$ as a new $\hat{W} (0)$ and $\hat{V} {(0)}^{0}$ , i.e., $\hat{W} (0) = \hat{W} (T_{0})$ , $\hat{V} (0) = \hat{V} (T_{0})$ , go to 2 to repeat the training process.
4. If the $‖ \tilde{q} (T_{0}) ‖ ⩾ ‖ \tilde{q} (0) ‖$ , stop this off-line identification, now $\hat{W} (T_{0})$ and $\hat{V} (T_{0})$ are the final value for $\hat{W} (0)$ and $\hat{V} (0)$ .

5.2 Neural PID control with unmeasurable velocities

The neural PID control (5.19) uses the joint velocities $\dot{q}$ . In contrast to the high precision of the position measurements by the optical encoders, the measurement of velocities by tachometers may be quite mediocre in accuracy, specifically for certain intervals of velocity. The common idea in the design of PID controllers, which requires velocity measurements, has been to propose state observers to estimate the velocity. The simplest observer may be the first-order and zero-relative position filter [131]

$υ_{i} (s) = \frac{b_{i} s}{s + a_{i}} q_{i} (s), i = 1 \dots n$

(5.55)

where $υ_{i} (s)$ is an estimation of ${\dot{q}}_{i}$ , $a_{i}$ , and $b_{i}$ are the elements of diagonal matrices A and B, $A = d i a g {a_{i}}$ , $B = d i a g {b_{i}}$ , $a_{i} > 0$ , $b_{i} > 0$ . The transfer function (5.55) can be realized by

${\begin{matrix} \dot{x} = - A (x + B q) \\ \tilde{\dot{q}} = x + B q \end{matrix}$

(5.56)

The linear PID control (5.19) becomes

$\begin{matrix} u = K_{p} \tilde{q} - K_{d} υ + ξ + \hat{W} σ (q) \\ \dot{ξ} = K_{i} \tilde{q}, ξ (0) = ξ_{0} \\ \dot{x} = - A (x + B q) \\ υ = x + B q \end{matrix}$

(5.57)

where $K_{p}$ , $K_{i}$ , and $K_{d}$ are positive definite diagonal matrices, $a_{i}$ and $b_{i}$ in (5.55) are positive constants. The closed-loop system of the robot (5.1) is

$\frac{d}{d t} [\begin{matrix} ξ \\ υ \\ \dot{q} \end{matrix}] = [\begin{matrix} K_{i} \tilde{q} \\ - A υ + B \dot{q} \\ M^{- 1} [\begin{matrix} - C (q, \dot{q}) \dot{q} - \tilde{W} σ (q) - ϕ (q) \\ + K_{p} \tilde{q} - K_{d} υ + \tilde{ξ} + ϕ (q^{d}) \end{matrix}] \end{matrix}]$

(5.58)

The equilibrium of (5.58) is $[\tilde{ξ}, υ, \dot{q}] = [0, 0, 0]$ .

The following theorem gives the asymptotic stability of the neural PID control with the velocity observer (5.55). This theorem also provides a training algorithm for neural weights, and an explicit selection method of PID gains.

Since the velocities are not available, the input of the neural networks becomes

$\begin{matrix} \hat{f} (q, \overset{\cdot}{q}) = \hat{W} σ (q, v) \\ or \hat{f} (q, \overset{\cdot}{q}) = \hat{W} σ (\hat{V} [q, v]) \end{matrix}$

(5.59)

Theorem 5.2

Consider the robot dynamic (5.1) controlled by the neural PID controller (5.57), if A and B of the velocity observer (5.55) satisfy

$\begin{matrix} \frac{λ_{M} (A)}{λ_{m}^{2} (A)} ⩽ λ_{m} (K_{d}) \frac{λ_{m} (B)}{λ_{M} (B)} \\ λ_{M} (B) ⩽ \frac{1}{4} λ_{m} (K_{d}) \frac{λ_{m} (M)}{λ_{M}^{2} (M)} \\ λ_{m} (B - α I) ⩾ \frac{1}{2} λ_{m} (A) \end{matrix}$

(5.60)

where α is positive design constant, provided that the PID control gains of (5.57) satisfy

$\begin{matrix} λ_{m} (K_{p}) - \frac{1}{2} λ_{M} (K_{p}) ⩾ \frac{1}{α} [\begin{matrix} λ_{M} (K_{i}) + λ_{M} (A^{- 1} B K_{i}) \\ + \frac{1 + 2 α}{2} k_{ϕ} + \frac{α^{2}}{2} λ_{M} (K_{d}) + \frac{α}{2} λ_{M} (A^{- 1} K_{i}) \end{matrix}] \\ λ_{m} (K_{d}) ⩾ \frac{k_{g} + \frac{1}{2 α} λ_{M} (A^{- 1} K_{i}) + \frac{1}{2 α} λ_{M} (K_{p}) + κ (M) λ_{M} (M) λ_{M} (A)}{2 λ_{m} (A B^{- 1} - I) - 1} \\ λ_{M} (K_{i}) ⩽ \frac{α}{3} λ_{m} (K_{p}) \end{matrix}$

(5.61)

where $k_{ϕ}$ satisfies (5.10), $κ (M)$ is the condition number of M, and the weight of neural networks is tuned by

$\overset{\cdot}{\hat{W}} = - K_{w} σ (q, υ) {[α \tilde{q} + υ + B^{- 1} (\dot{υ} + A v)]}^{T}$

(5.62)

then the closed-loop system (5.58) is locally asymptotically stable at the equilibrium:

$x = {[ξ - ϕ (q^{d}), \tilde{q}, \overset{\cdot}{\tilde{q}}]}^{T} = 0$

(5.63)

in the domain of attraction

$‖ \tilde{q} ‖ ⩽ \frac{λ_{m} (M)}{α k_{c}} [λ_{m} (B - α I) - \frac{1}{2} λ_{m} (A)] + \frac{1}{α} ‖ υ ‖$

(5.64)

Proof

We construct a Lyapunov function as

$\begin{matrix} V_{c} = \frac{1}{2} {\dot{q}}^{T} M \dot{q} + \frac{1}{2} {\tilde{q}}^{T} K_{p} \tilde{q} + \int_{0}^{t} ϕ (q) d τ - k_{ϕ} + {\tilde{q}}^{T} ϕ (q^{d}) + \frac{3}{2} ϕ {(q^{d})}^{T} K_{p}^{- 1} ϕ (q^{d}) \\ + \frac{α}{2} {\tilde{ξ}}^{T} K_{i}^{- 1} \tilde{ξ} - α {\tilde{q}}^{T} M \dot{q} + {\tilde{q}}^{T} (I + A^{- 1} B) \tilde{ξ} + \frac{1}{2} υ^{T} B^{- 1} K_{d} υ - υ^{T} M \dot{q} + υ^{T} A^{- 1} \tilde{ξ} \\ + \frac{1}{2} t r ({\tilde{W}}^{T} K_{w}^{- 1} \tilde{W}) \end{matrix}$

(5.65)

where the definition of $k_{ϕ}$ is the same as Theorem 5.1. α is a design positive constant. We first prove V is a Lyapunov function, $V ⩾ 0$ . The term $\frac{1}{2} {\tilde{q}}^{T} K_{p} \tilde{q}$ is separated into three parts, and $V = \sum_{i = 1}^{6} V_{i}$ :

$\begin{matrix} V_{1} = \frac{1}{6} {\tilde{q}}^{T} K_{p} \tilde{q} + {\tilde{q}}^{T} ϕ (q^{d}) + \frac{3}{2} ϕ {(q^{d})}^{T} K_{p}^{- 1} ϕ (q^{d}) \\ V_{2} = \frac{1}{6} {\tilde{q}}^{T} K_{p} \tilde{q} + {\tilde{q}}^{T} \tilde{ξ} + \frac{α}{2} {\tilde{ξ}}^{T} K_{i}^{- 1} \tilde{ξ} \\ V_{3} = \frac{1}{6} {\tilde{q}}^{T} K_{p} \tilde{q} - α {\tilde{q}}^{T} M \dot{q} + \frac{1}{4} {\dot{q}}^{T} M \dot{q} \\ V_{4} = \frac{1}{4} υ^{T} (B^{- 1} K_{d}) υ + υ^{T} A^{- 1} \tilde{ξ} + {\tilde{ξ}}^{T} (A^{- 1} B) \tilde{ξ} \\ V_{5} = \frac{1}{4} υ^{T} (B^{- 1} K_{d}) υ - υ^{T} M \dot{q} + \frac{1}{4} {\dot{q}}^{T} M \dot{q} \\ V_{6} = \int_{0}^{t} ϕ (q) d τ - k_{ϕ} + \frac{1}{2} t r ({\tilde{W}}^{T} K_{w}^{- 1} \tilde{W}) ⩾ 0 \end{matrix}$

(5.66)

Here, $V_{1}$ and $V_{2}$ are the same as (5.29), i.e.,

$λ_{M} (K_{i}) ⩽ \frac{α}{3} λ_{m} (K_{p})$

(5.67)

For $V_{3}$ , if $α ⩽ \frac{\sqrt{\frac{1}{6} λ_{m} (K_{p}) λ_{m} (M)}}{λ_{M} (M)}$

$V_{3} ⩾ \frac{1}{2} {(\sqrt{\frac{1}{2} λ_{m} (M)} ‖ \dot{q} ‖ - \sqrt{\frac{1}{3} λ_{m} (K_{p})} ‖ \tilde{q} ‖)}^{2} ⩾ 0$

(5.68)

Because $λ_{m} (A B) ⩽ λ_{m} (B^{- 1}) λ_{M} (A)$ and $λ_{m} (B^{- 1}) = \frac{1}{λ_{M} (B)}$ , it is easy to find that, if $λ_{M} (A^{- 1}) ⩽ \sqrt{λ_{m} (B^{- 1} K_{d}) λ_{m} ((A^{- 1} B))}$ or $\frac{λ_{M} (A)}{λ_{m}^{2} (A)} ⩽ λ_{m} (K_{d}) \frac{λ_{m} (B)}{λ_{M} (B)}$ ,

$V_{4} ⩾ \frac{1}{2} (\begin{matrix} \frac{1}{2} λ_{m} (B^{- 1} K_{d}) {‖ υ ‖}^{2} - 2 λ_{M} (A^{- 1}) ‖ υ ‖ ‖ \tilde{ξ} ‖ \\ + 2 λ_{m} ((A^{- 1} B)) {‖ \tilde{ξ} ‖}^{2} \end{matrix}) ⩾ 0$

(5.69)

If $λ_{M} (M) ⩽ \frac{1}{2} \sqrt{λ_{m} ((B^{- 1} K_{d})) λ_{m} (M)}$ or $λ_{M} (B) ⩽ \frac{1}{4} λ_{m} (K_{d}) \frac{λ_{m} (M)}{λ_{M}^{2} (M)}$

$V_{5} = \frac{1}{2} [\frac{1}{2} υ^{T} K_{d} B^{- 1} υ + 2 υ^{T} M \dot{q} + \frac{1}{2} {\dot{q}}^{T} M \dot{q}] ⩾ 0$

(5.70)

Because $V_{6} ⩾ 0$ , obviously, there exist α, A, and B such that

$\begin{matrix} α^{2} ⩽ \frac{1}{6} \frac{λ_{m} (K_{p}) λ_{m} (M)}{λ_{M}^{2} (M)} \\ \frac{λ_{M} (A)}{λ_{m}^{2} (A)} ⩽ λ_{m} (K_{d}) \frac{λ_{m} (B)}{λ_{M} (B)} \\ λ_{M} (B) ⩽ \frac{1}{4} λ_{m} (K_{d}) \frac{λ_{m} (M)}{λ_{M}^{2} (M)} \end{matrix}$

(5.71)

This means if $K_{p}$ is sufficiently large or $K_{i}$ is sufficiently small, (5.34) is established, and $V_{c}$ is globally positive definite. Now we compute its derivative. The derivative of $V_{c}$ is

$\begin{matrix} \dot{V} ⩽ - {\dot{q}}^{T} B M \dot{q} - υ^{T} A B^{- 1} K_{d} υ - α {\tilde{q}}^{T} K_{p} \tilde{q} + α k_{g} {‖ \tilde{q} ‖}^{2} + α {\dot{q}}^{T} M \dot{q} + k_{c} ‖ α \tilde{q} - υ ‖ {‖ \dot{q} ‖}^{2} \\ + {\tilde{q}}^{T} K_{i} \tilde{q} + {\tilde{q}}^{T} A^{- 1} B K_{i} \tilde{q} + υ^{T} K_{d} υ + \frac{1}{2} k_{g} {‖ υ ‖}^{2} + \frac{1}{2} k_{g} {‖ \tilde{q} ‖}^{2} \\ + {\tilde{q}}^{T} (α K_{d} - K_{p} - A^{- 1} K_{i}) υ + {\dot{q}}^{T} A M υ \\ + t r [{\tilde{W}}^{T} (K_{w}^{- 1} \overset{\cdot}{\tilde{W}} + σ (q, v) {\dot{q}}^{T} + α σ (q, v) {\tilde{q}}^{T} + σ (q, v) υ^{T})] \end{matrix}$

(5.72)

Because $\dot{υ} = - A υ + B \dot{q}$ and $B = d i a g {b_{i}}$ , the last term is zero if we apply the updating law (5.62). Using (5.32), (5.72) is

$\begin{matrix} \dot{V} ⩽ - {\dot{q}}^{T} {\begin{matrix} λ_{m} (B M - α M) - k_{c} ‖ α \tilde{q} - υ ‖ \\ - \frac{1}{2 κ (M)} λ_{M} (A M) \end{matrix}} \dot{q} \\ - υ^{T} (\begin{matrix} λ_{m} (A B^{- 1} K_{d} - K_{d}) - \frac{1}{2} k_{g} - \frac{1}{2 α} λ_{M} (K_{e}) \\ - \frac{κ (M)}{2} λ_{M} (A M) \end{matrix}) υ \\ - {\tilde{q}}^{T} (\begin{matrix} λ_{m} (α K_{p} - K_{i} - A^{- 1} B K_{i}) - α k_{g} \\ - \frac{1}{2} k_{g} - \frac{α}{2} λ_{M} (K_{e}) \end{matrix}) \tilde{q} \end{matrix}$

(5.73)

Using $λ_{i} (A) λ_{M} (B) ⩾ λ_{i} (A B) ⩾ λ_{i} (A) λ_{m} (B)$ , i can be “m” or “M,” the last condition of (5.71) can be replaced by

$\begin{matrix} ‖ \tilde{q} - \frac{1}{α} υ ‖ \\ ⩽ \frac{1}{α k_{c}} [λ_{m} (B - α I) λ_{m} (M) - \frac{1}{2 κ (M)} λ_{M} (M) λ_{m} (A)] \end{matrix}$

It is the attraction area (5.64).

Using $λ_{i} (A) + λ_{M} (B) ⩾ λ_{i} (A + B) ⩾ λ_{i} (A) + λ_{m} (B)$ , the second condition of (5.71) is

$\begin{matrix} λ_{m} [(A B^{- 1} - I) K_{d}] ⩾ λ_{m} (A B^{- 1} - I) λ_{m} (K_{d}) \\ ⩾ \frac{1}{2} k_{g} + \frac{1}{2 α} λ_{M} (K_{e}) + \frac{κ (M)}{2} λ_{M} (A M) \end{matrix}$

(5.74)

It is the condition for $K_{d}$ in (5.61). Also

$λ_{m} (α K_{p}) ⩾ λ_{M} (K_{i}) + λ_{M} (A^{- 1} B K_{i}) + \frac{1 + 2 α}{2} k_{g} + \frac{α}{2} λ_{M} (K_{e})$

It is the condition for $K_{p}$ in (5.61). The condition for $K_{i}$ in (5.61) is obtained from (5.67). The remaining part of the proof is the same as Theorem 5.1. □

Remark 5.4

The conditions (5.60) and (5.61) decide how to choose the PID gains. The first condition of (5.61) is

$\begin{matrix} λ_{m} (K_{p}) ⩾ \frac{1}{α} λ_{M} (K_{i}) + Ω \\ Ω = \frac{1}{α} [\begin{matrix} λ_{M} (A^{- 1} B K_{i}) + \frac{1 + 2 α}{2} k_{g} \\ + \frac{α}{2} λ_{M} (K_{d}) + \frac{1}{2} λ_{M} (A^{- 1} K_{i}) \end{matrix}] + \frac{1}{2 α} λ_{M} (K_{p}) \end{matrix}$

(5.75)

the third conditions of (5.61) is $λ_{m} (K_{p}) ⩾ \frac{3}{α} λ_{M} (K_{i})$ ; they are compatible. When $K_{i}$ is not big, these conditions can be established. The second condition of (5.61) and the third condition of (5.60) are not directly compatible. We first let α as small as possible, and $K_{p}$ as big as possible. So $K_{i}$ cannot be big. These requirements are reasonable for our real control. If we select $B = β A + α I$ , form the third condition of (5.60), $β ⩾ \frac{1}{2}$ . The second condition of (5.61) requires $λ_{m} (A B^{- 1} - I) > \frac{1}{2}$ , there exists $1 > β ⩾ \frac{1}{2}$ and a small α such that $λ_{m} [A {(β A + α I)}^{- 1} - I] > \frac{1}{2}$ . After A and B are decided, we use the second condition of (5.61) to select $K_{d}$ .

5.3 Neural PID tracking control

We modify the reference signal and add a filter, such that the PID regulation control is extended to PID tracking control. Most important, the global and semiglobal asymptotic stability of this PID tracking control are proved.

Model-based compensation is an alternative method for PID control to overcome the problems of PID control. The intelligent compensation does not need a mathematical model; it is a model-free compensator, such as fuzzy PID with the fuzzy compensator [153] and neural PID with the neural compensator [154]. From the best of our knowledge, theory analysis for neural PID tracking control is still not published. In this paper, we will analyze the neural PID tracking control. We prove that the robot can follow a filtered reference asymptotically.

Given a desired time-varying position $q_{d} (t) \in R^{n}$ , the objective of robot control is to design the input torque u in (5.1) such that the tracking error is

$Δ (t) = q_{d} (t) - q (t)$

(5.76)

$e (t) \to 0$ when initial conditions are in arbitrary large domain of attraction. The classic industrial PID law is (5.15). In order to finish the tracking job, we define a filtered error as

$s = {\dot{q}}_{d} - \dot{q} = \dot{Δ} + K_{a} Δ + K_{b} \int Δ d t$

(5.77)

${\dot{q}}_{d} = \dot{q} + K_{a} Δ + K_{b} \int Δ d t$

(5.78)

We define a matrix $K_{a}$ as

$K_{a} = K_{a}^{T} = K_{d}^{- 1} K_{p}, K_{b} = K_{d}^{- 1} K_{i}$

(5.79)

In order to decouple the PID tracking control (5.19), we choose $K_{p}, K_{i}$ , and $K_{d}$ as positive definite diagonal matrices. The closed-loop system of the robot (5.1) is

$\begin{matrix} M (q) \ddot{q} + C (q, \dot{q}) \dot{q} + G (q) + F (\dot{q}) = K_{p} \tilde{q} - K_{d} \dot{q} + z \\ \dot{z} = K_{i} \tilde{q} \end{matrix}$

Using the filtered error s, the final closed-loop equation becomes

$M \dot{s} = - C s + f - K_{D} s$

(5.80)

where $f (q, \overset{\cdot}{q}) = G (q) + F (\dot{q})$ . The following theorem gives the stability analysis of the PID tracking control.

Theorem 5.3

Consider the robot dynamic (5.1) controlled by the PID tracking control:

$u = K_{p} Δ + K_{i} \int_{0}^{t} Δ (τ) d τ + K_{d} \dot{Δ} + f$

(5.81)

the closed-loop system (5.80) is globally asymptotically stable at the equilibrium, provided that control gains are positive.

Proof

Propose the following Lyapunov function:

$V = \frac{1}{2} s^{T} M s$

(5.82)

Using (5.80) the derivative of the proposed function yields

$\begin{matrix} \dot{V} = s^{T} M \dot{s} + \frac{1}{2} s^{T} \dot{M} s \\ = - s^{T} C s + s^{T} f - s^{T} K_{D} s + \frac{1}{2} s^{T} \dot{M} s \end{matrix}$

(5.83)

Since $x^{T} (\dot{M} - 2 C) x = 0$ ,

$\dot{V} = - s^{T} K_{D} s + s^{T} f$

Using the PID control law (5.81), then

$\dot{V} = - s^{T} K_{D} s$

$\dot{V} ⩽ - λ_{m} (K_{D}) {‖ s ‖}^{2}$

$\dot{V}$ is negative semidefinite on the ball Σ. There exists a ball Σ of radius $σ > 0$ centered at the origin of the state space on which $\dot{V} ⩽ 0$ . The origin of the closed-loop equation (5.80) is a stable equilibrium. Since the closed-loop equation is autonomous, we use LaSalle's theorem. Define Ω as

$\begin{matrix} Ω = {x (t) = [e, \dot{e}, \tilde{f}] \in R^{3 n} : \dot{V} = 0} \\ = {\tilde{f} \in R^{n} : e = 0 \in R^{n}, \dot{e} = 0 \in R^{n}} \end{matrix}$

From (5.83), $\dot{V} = 0$ if and only if $e = \dot{e} = 0$ . For a solution $x (t)$ to belong to Ω for all $t ⩾ 0$ , it is necessary and sufficient that $e = \dot{e} = 0$ for all $t ⩾ 0$ . □

In many cases, in (5.1) the gravity $G (q)$ and the friction $F (\dot{q})$ are not available. We can use a feedforward neural network to model them as (5.4)

$\begin{matrix} N (q, \overset{\cdot}{q}) = G (q) + F (\dot{q}) \\ \hat{N} (q, \overset{\cdot}{q}) = \hat{W} σ (q, \overset{\cdot}{q}), N (q, \overset{\cdot}{q}) = W^{⁎} σ (q, \overset{\cdot}{q}) + η \end{matrix}$

(5.84)

where $W^{⁎}$ is unknown constant weight, $\hat{W}$ is estimated weight, η is the neural approximation error, and σ is a neural activation function. Here, we use the Gaussian function such that $σ (q, \overset{\cdot}{q}) ⩾ 0$ .

This linear-in-the-parameter net is the simplest neural network. According to the universal function approximation theory, the smooth function $N (q, \overset{\cdot}{q})$ can be approximated by a multilayer neural network with one hidden layer in any desired accuracy provided proper weights and hidden neurons:

$\hat{N} (q, \overset{\cdot}{q}) = \hat{W} σ (\hat{V} [q, \overset{\cdot}{q}]), f (q) = W^{⁎} σ (V^{⁎} [q, \overset{\cdot}{q}]) + η$

(5.85)

where $\hat{W} \in R^{n \times m}$ , $\hat{V} \in R^{m \times n}$ , m is hidden node number, $\hat{V}$ is the weight in a hidden layer. In order to simplify the theory analysis, we first use linear-in-the-parameter net (5.84), then we will show that the multilayer neural network (5.85) can also be used for the neural control of robot manipulators. According to universal function approximation property, $f (x)$ can be shown that as long as x is restricted to a compact set $S \subset R^{n}$ . There exist synaptic weights and thresholds such that

$f (x) = W σ (V [x]) + η$

(5.86)

The functional approximation error $ϕ (x)$ of the neural network generally decreases as m increases. In fact, for any positive number $ϕ_{N}$ , we can find a feedforward neural network such that

$‖ η ‖ < η_{N}$

(5.87)

$η^{T} K_{a} η ⩽ \bar{η}, \bar{η} > 0$

(5.88)

When the unknown dynamic $‖ f (q, \overset{\cdot}{q}) ‖$ in (5.84) is big, in order to assure asymptotic stability, the integral gain $K_{i}$ has to be increased. This may cause big overshoot, bad stability, and integrator windup. Model-free compensation is an alternative solution, where $f (q)$ is estimated by a neural network as in (5.84). Normal neural PD control is

$u = K_{p} e + K_{d} \overset{\cdot}{e} + \hat{N}$

(5.89)

where $\hat{N} (q, \overset{\cdot}{q}) = \hat{W} σ (q, \overset{\cdot}{q})$ . With the filtered error,

$r = Δ + Λ \dot{Δ}$

(5.90)

(5.16) becomes

$u = K_{v} r + \hat{N}$

(5.91)

The control (5.91) avoids integrator problems in (5.15). Unlike industrial PID control, they cannot reach asymptotic stability. The stability condition of the neural PD control (5.89) is $‖ r ‖ > \frac{B}{K_{v}}$ , B is a constant. In order to decrease $‖ r ‖$ , $K_{d}$ has to be increased. This causes a long settling time problem. The asymptotic stability ( $r \to 0$ ) requires $K_{v} \to \infty$ .

An integrator is added into the normal neural PD control (5.89); it has a similar form as the industrial PID in (5.15),

$u = K_{p} e Δ + K_{d} \dot{Δ} + K_{i} \int_{0}^{t} Δ (τ) d τ + \hat{N}$

(5.92)

${\dot{q}}_{d} = {\dot{q}}_{d} + K_{a} Δ + K_{b} \int Δ d t$

(5.93)

The closed-loop system of the robot (5.1) is

$\begin{matrix} M (q) \ddot{q} + C (q, \dot{q}) \dot{q} + \tilde{N} = K_{p} \tilde{q} - K_{d} \dot{q} + z \\ \dot{z} = K_{i} \tilde{q} \end{matrix}$

(5.94)

where $\tilde{N} = N - \hat{N}$

$\tilde{N} = W^{⁎} σ (q) + ϕ (q) - \hat{W} σ (q) = \tilde{W} σ (q) + η$

(5.95)

here, $\tilde{W} = W^{⁎} - \hat{W}$ . With the filtered error, the final closed-loop equation becomes

$M \dot{s} = - C s + \tilde{W} σ (q, \dot{q}) + ϕ - K_{D} s$

(5.96)

The following theorem gives the stability analysis of the neural PID control. From this theorem, we can see how to train the weight of the neural compensator in (5.92).

Theorem 5.4

Consider the robot dynamic (5.1) controlled by the neural PID control (5.92). The closed-loop system (5.96) is semiglobally asymptotically stable at the equilibrium, provided that control gains are positive; weight of the neural networks (5.84) is tuned by

$\overset{\cdot}{\hat{W}} = K_{W} σ (q, \dot{q}) s^{T}$

(5.97)

Proof

We propose the following Lyapunov function:

$V = \frac{1}{2} s^{T} M s + \frac{1}{2} t r {{\tilde{W}}^{T} K_{W}^{- 1} \tilde{W}}$

(5.98)

Using (5.96) and $\dot{M} (q) - 2 C (q, \dot{q})$ is skew symmetric, i.e.,

$x^{T} [\dot{M} (q) - 2 C (q, \dot{q})] x = 0$

(5.99)

the derivative of the proposed function yields

$\begin{matrix} \dot{V} = s^{T} M \dot{s} + \frac{1}{2} s^{T} \dot{M} s + t r {{\tilde{W}}^{T} K_{W}^{- 1} \overset{\cdot}{\tilde{W}}} \\ = - s^{T} C s + s^{T} \tilde{W} σ (q, \dot{q}) + s^{T} ϕ - s^{T} K_{D} s + \frac{1}{2} s^{T} \dot{M} s + t r {{\tilde{W}}^{T} K_{W}^{- 1} \overset{\cdot}{\tilde{W}}} \end{matrix}$

(5.100)

Since $x^{T} (\dot{M} - 2 C) x = 0$ ,

$\dot{V} = - s^{T} K_{D} s + s^{T} η + s^{T} {\tilde{W}}^{T} σ (q, \dot{q}) + t r {{\tilde{W}}^{T} K_{W}^{- 1} \overset{\cdot}{\tilde{W}}}$

Using the adaptation law (5.97), then

$\dot{V} = - s^{T} K_{D} s + s^{T} η$

(5.101)

By the inequality,

$y^{T} A x ⩽ ‖ y ‖ ‖ A x ‖ ⩽ ‖ y ‖ ‖ A ‖ ‖ x ‖ ⩽ | λ_{m} (A) | ‖ y ‖ ‖ x ‖$

Using (5.101),

$\dot{V} ⩽ - λ_{m} (K_{D}) {‖ s ‖}^{2} + η_{N} ‖ s ‖$

(5.102)

Since $η_{N}$ is constant, $\dot{V}$ is a negative semidefinite provided that

$‖ s ‖ > \frac{η_{N}}{λ_{m} (K_{D})}$

Define a ball Σ of radius $σ > 0$ centered at the origin of the state space, which satisfies these conditions:

$Σ = {s : ‖ s ‖ ⩽ \frac{η_{N}}{λ_{m} (K_{D})}}$

(5.103)

$\dot{V}$ is negative semidefinite on the ball Σ. There exists a ball Σ of a radius $σ > 0$ centered at the origin of the state space on which $\dot{V} ⩽ 0$ . The origin of the closed-loop equation (5.96) is a stable equilibrium. Since the closed-loop equation is autonomous, we use LaSalle's theorem. Define Ω as

$\begin{matrix} Ω = {x (t) = [e, \dot{e}, \tilde{f}] \in R^{3 n} : \dot{V} = 0} \\ = {\tilde{f} \in R^{n} : e = 0 \in R^{n}, \dot{e} = 0 \in R^{n}} \end{matrix}$

(5.104)

From (5.102), $\dot{V} = 0$ if and only if $e = \dot{e} = 0$ . For a solution $x (t)$ to belong to Ω for all $t ⩾ 0$ , it is necessary and sufficient that $e = \dot{e} = 0$ for all $t ⩾ 0$ . Therefore it must also hold that $\ddot{e} = 0$ for all $t ⩾ 0$ . We conclude that from the closed-loop system (5.96), if $x (t) \in Ω$ for all $t ⩾ 0$ , then

$\begin{matrix} η (q, \overset{\cdot}{q}) = η (q^{d}, 0) = \tilde{N} + η (q^{d}, 0) \\ \frac{d}{d t} \tilde{N} = 0 \end{matrix}$

(5.105)

implies that $\tilde{N} = 0$ for all $t ⩾ 0$ . So $x (t) = [e, \dot{e}, \tilde{f}] = 0 \in R^{3 n}$ is the only initial condition in Ω for which $x (t) \in Ω$ for all $t ⩾ 0$ . Finally, we conclude from all of this that the origin of the closed-loop system (5.96) is locally asymptotically stable. The tracking error s and its derivative $\dot{s}$ are bounded, which implies boundedness of e and $\dot{e}$ . Given that the desired trajectories $q_{d}$ and ${\dot{q}}_{d}$ are also bounded, joint position q and joint velocity $\dot{q}$ are bounded. □

5.4 Experimental results of the neural PID

Recently, we construct the portable exoskeleton robot, shown in Fig. 5.1. It has two servomotors. By our special design, it can move freely in 3D space; see Appendix A. The control computer is a Pentium Core2Duo with 2GB of RAM and a 120GB hard disk. The control software is under the operating system Linux Mint 15 “Olivia.” We use this operating system, because the development software has a GNU General Public License. The development environment is Qt4 with C++, which is less runtime in a normal PC, and improves control performance compared with MATLAB Simulink^®. Another advantage is that our system can generate executable programs, which can run on any PC without the installing development environment.

Figure 5.1 The portable exoskeleton robot.

This upper limb exoskeleton is fixed on the human arm. The behavior of the exoskeleton is the same as the human arm; see Fig. 5.2. The exoskeleton is light, and the height can be adjusted for each user. The users left hand is an enable button, which released the brakes on the device and engaged the motor. The reference signals are generated by admittance control in task space. These references are sent to joint space. The robot in joint space can be regarded as free motion without human constraints. The whole control system is shown in Fig. 5.3. The objective of neural PID control is make the transient performance faster with less overshoot, such that human feel comfortable.

Figure 5.2 The exoskeleton robot vs. human arms.

Figure 5.3 The neural PID control of the exoskeleton robot.

All of the controllers employed a sampling frequency of 1 kHz. The two theorems in this paper give sufficient conditions for the minimal values of proportional and derivative gains and maximal values of integral gains. We use (5.45) to estimate the upper and the lower bounds of the eigenvalues of the inertia matrix $M (q)$ , and $k_{g}$ in (5.10). We select $λ_{M} (M) < 2.7$ , $λ_{m} (M) > 0.5$ , $k_{g} = 4.8$ . We choose $α = \frac{4 λ_{M} (K_{i})}{λ_{m} (K_{p})}$ such that $λ_{M} (K_{i}) ⩽ \frac{α}{3} λ_{m} (K_{p})$ is satisfied. $α = 0.01$ , A is chosen as $A = d i a g (3)$ , $β = \frac{1}{3}$ , so $B = d i a g (2)$ . The joint velocities are estimated by the standard filters:

$\tilde{\dot{q}} (s) = \frac{b s}{s + a} q (s) = \frac{18 s}{s + 30} q (s)$

(5.106)

The PID gains are chosen as

$\begin{matrix} K_{p} = d i a g [12, 1] \\ K_{i} = d i a g [2, 1] \\ K_{d} = d i a g [31, 31] \end{matrix}$

(5.107)

such that the conditions of Theorem 5.2 are satisfied. The initial elements of the weight matrix $W \in R^{7 \times 7}$ are selected randomly from −1 to 1. The active function in (5.26) is a Gaussian function:

$σ = \exp {- {(q_{i} - m_{i})}^{2} / 100}, i = 1, 2$

(5.108)

where $m_{i}$ is selected randomly from 0 to 2. The weights are updated by (5.62) with $K_{w} = 10$ .

The control results of Joint-1 with neural PID control is shown in Fig. 5.4, marked “Neural PID.” We compare our neural PID control with the other popular robot controllers. First, we use the linear PID (5.15). The PID gains are the same as (5.107), and the control result is shown in Fig. 5.4, marked “Linear PID-2.” Because the steady-state error is so big, the integral gains are increased as

$K_{i} = d i a g [50, 20]$

(5.109)

The control result is shown in Fig. 5.4, marked “Linear PID-1,” the transient performance is poor. There still exists regulation error. Further increasing $K_{i}$ causes the closed-loop system to be unstable. Then we use a neural compensator to replace the integrator. It is normal neural PD control (5.16). In order to decrease steady-state error, the derivative gains are increased as

$K_{d} = d i a g [200, 300]$

(5.110)

the control result is shown in Fig. 5.4, marked “Neural PD.” The response becomes very slow.

Figure 5.4 Comparison of several PID controllers for Joint 1.

Now we use Joint-2 to compare our neural PID control with the other two types of neural PID control. The control results of these three neural PID controllers are shown in Fig. 5.5. Here, we use a three-layer neural network with three nodes, which have integral, proportional, and derivative properties. A backpropagation like the training algorithm is used to ensure closed-loop stability [29]; it is marked “Neural Net in PID form.” Then we use a one-hidden layer neural network to tune the linear PID gains as in [117]; it is marked “PID tuning via neural net.” It can be found that the “Neural net in PID form” can assure stability, but the transient performance is not good. The “PID tuning via neural net” is acceptable except its slow response.

Figure 5.5 Comparison of several PID controllers for Joint 2.

Clearly, neural PID control can successfully compensate the uncertainties such as friction, gravity, and the other uncertainties of the robot. Because the linear PID controller has no compensator, it has to increase its integral gain to cancel the uncertainties. The neural PD control does not apply an integrator, its derivative gain is big.

The structure of the neural compensator is very important. The number of hidden nodes m in (5.5) constitutes a structural problem for neural systems. It is well known that increasing the dimension of the hidden layer can cause the “overlap” problem and add to the computational burden. The best dimension to use is still an open problem for the neural control research community. In this application, we did not use a hidden layer, and the control results are satisfied. The learning gain $K_{w}$ in (5.62) will influence the learning speed, so a very large gain can cause unstable learning, while a very small gain produces a slow learning process.

Now we will carry out some experimental tests by using the neural PID controller in a tracking case. To accomplish this, we will define conveniently the neural gain $K_{W} = 0.5$ , and by using [2]. We obtain the following gain matrices:

$\begin{matrix} K_{P} = d i a g [700, 1200] \\ K_{I} = d i a g [20, 20] \\ K_{D} = d i a g [44, 40] \end{matrix}$

The active function in (5.26) is a Gaussian function:

$σ = \exp {- {(q_{i} - m_{i})}^{2} / 100}, i = 1, 2$

The neural PID tracking control for the first and the second joints are shown in Fig. 5.6 and Fig. 5.7. The position error is reasonably small (between −2 and 2 degrees in both cases). The second joint carries out the extension, the neural network compensates the friction and gravity effects. The actual velocity satisfactorily follows the desired velocity.

Figure 5.6 The first joint – horizontal flexion/extension of shoulder.

Figure 5.7 The second joint – vertical flexion/extension of shoulder.

Overall, we can notice that, though we have used a small gain $K_{W}$ , the robot performance is acceptable when using the neural PID. Unfortunately, we cannot increase too much such gain, since the controller tries to compensate every single disturbance that affects the robot. Then it produces a torque that makes the joints to vibrate continuously. In fact, this vibration affects especially the small joints.

The tracking neural PID control proposed in this paper is compared with the other two types of controllers. We first use normal NN with three-layers, and each layer has 3 nodes. The backpropagation is applied to train the NN. Then we use a one-hidden layer NN to tune the linear PID gains. We find that our neural tracking PID is to ensure stability, but the transient performance is not good, while the PID tuning via neural net is acceptable except for its slow response.

5.5 Conclusions

The neural PID proposed in this chapter solves the problems of large integral and derivative gains in the linear PID control and the neural PD control. It keeps good properties of the industrial PID control and neural compensator. Semiglobal asymptotic stability of this neural PID control is proven. When the joint velocities of robot manipulators are not available, local asymptotic stability is assured with filtered positions. The stability conditions give explicit methods to select PID gains. We also apply our neural PID to the exoskeleton robot in CINVESTA-IPN. Theory analysis and experimental study show the validity of the neural PID control.

A novel PID tracking controller is proposed to solve the problem of large integral and derivative gains in the tracking case with linear PID control. Global and semiglobal asymptotic stability of this PID tracking control are proven.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for Chapter 5: PID Control with Neural Compensation

Create new playlist

Sign In

Sign Up

5.1 Stable neural PID control

5.2 Neural PID control with unmeasurable velocities

5.3 Neural PID tracking control

5.4 Experimental results of the neural PID

5.5 Conclusions

Table of Contents for
Chapter 5: PID Control with Neural Compensation