140 ◾ Advances in Communications-Based Train Control Systems
process, the moving nodes communicate with the wayside nodes. e nodes’ mov-
ing speed is controlled by the train control model, and the communication latency
is determined by the communication model in Section 7.4.3. Each node looks up
the policy table to nd out the optimal action corresponding to its current state,
and then executes action. Static variables are collected in the simulations to obtain
the train control performance
norm, train travel trajectory, hando policy, and
the trip time.
We compare the performance of our proposed system with three other schemes.
e rst scheme is an existing scheme [31], where CoMP is not used. e train MT
makes hando decisions based on the immediate reward, and the optimal guid-
ance trajectory is not recalculated when the speed deviation occurs. We denote this
scheme as the existing scheme. For the second scheme, CoMP is used in the system,
but the SMDP model is not used. Instead, the decision maker makes hando deci-
sions by the reward derived from the current channel state. e optimal guidance
trajectory is not recalculated in this scheme. We denote this scheme as the CoMP-
based greedy scheme w/o updated trajectory. For the third scheme, CoMP is used in
the system, and the SMDP model is used to calculate the hando decision policy.
Unlike the proposed scheme, the optimal guidance trajectory is not recalculated in
this scheme. We denote this scheme as the CoMP-based scheme w/o updated trajec-
tory, and our proposed scheme as the proposed scheme.
7.6.1 Train Control Performance Improvement
We rst compare the train control performance
norm in dierent schemes.
Recall that the square root of the linear quadratic cost performance measure in
Equation7.4 is equivalent to the
norm. We refer to the train control perfor-
mance measure as the
norm due to this equivalence. In this simulation scenario,
the three trains depart from a station successively with interval
interval
s
16 and the
deceleration
that denes the ATP service braking curve is set to be
1.2 /
. As we
can see from Figure 7.6, the existing scheme gives the highest
norm compared to
the other three schemes. is is because the existing scheme does not use CoMP in
the system, which can signicantly improve train control performance. Moreover,
it does not consider the dynamic transitions of wireless channels in CBTC systems,
which is very important information for the train Station Adapter (SA) to make
optimal hando decisions to get the optimal performance.
By contrast, the other three schemes can have signicant performance improve-
ment due to the adoption of CoMP. e two SMDP-based schemes give better
performance compared with the greedy scheme. is is because the SMDP model
considers the dynamic transitions of wireless channels in CBTC systems. e direct
optimization objective of the SMDP model is to minimize the linear quadratic cost
of train control. Our proposed scheme gives the lowest
H
norm. is is because
the newly calculated guidance trajectory takes full consideration of the tracking