Processing math: 100%
Stochastic Linear Quadratic Optimal Control of Speed and Position of Multiple Trains on a Single-Track Line | IEEE Journals & Magazine | IEEE Xplore

Stochastic Linear Quadratic Optimal Control of Speed and Position of Multiple Trains on a Single-Track Line


Abstract:

In the European Rail Traffic Management System (ERTMS), the Route Control Centre System (RCCS) supervises the distance between consecutive trains and generates movement a...Show More

Abstract:

In the European Rail Traffic Management System (ERTMS), the Route Control Centre System (RCCS) supervises the distance between consecutive trains and generates movement authorities, i.e. the permission for a train to move to a specific location within the constraints of the infrastructure and with supervision of speed. In this work, a control model aimed at determining the speed and position of a train platoon within a sector of the rail network is presented. The central controller, i.e. the RCCS, receives information about the current position and speed of trains and it sends them decisions about optimal corrective actions for each train. Priorities of trains are handled to respect the planned timetable, taking into account the train dynamics, limitations in divergences of positions, speeds, and tractive effort, as well as minimum distances between consecutive trains. The control approach is based on a quite innovative linear quadratic regulator allowing the definition of stochastic constraints. The validation of the model is based on data collected from a RCCS for a Section of the high-speed Paris-London line.
Published in: IEEE Transactions on Intelligent Transportation Systems ( Volume: 24, Issue: 9, September 2023)
Page(s): 9110 - 9120
Date of Publication: 28 April 2023

ISSN Information:

Figures are not available for this document.

SECTION I.

Introduction

The European Rail Traffic Management System (ERTMS) legislation covers two main aspects: its technical specifications and the process for putting the system into service. The Technical Specifications of Interoperability for Control Command and Signaling (TSI CCS) is an essential document describing the ERTMS. The revised TSI for the onboard and trackside CCS subsystems were adopted by the European Commission Regulation (EU) 2016/919 published in the Official Journal of the European Union on 15th June 2016 [1]. The Interoperability Directive (2008/57/EC) defines several essential requirements to be met for interoperability, which include safety, reliability and availability, health, environmental protection, and technical compatibility along with others specific to certain subsystems [2]. The ERTMS application includes three different levels [3]. While ERTMS levels 1 and 2 are in operation on several railway lines, ERTMS Level 3 represents an important innovation, and it is currently under test. ERTMS Level 3 has significant advantages for railway engineers, since it eliminates some trackside equipment, and it introduces the moving block signaling approach. In moving block signaling, trains are given permission to move to a specific position anywhere on the track. A safe ‘envelope’ of empty track moving with each train is generated in real time. This envelope can be sized according to the braking performance and the speed of that specific train (e.g., low-speed train closer together, and high-speed trains further apart). This is in contrast with the traditional fixed block signaling in which trains are granted permission to move to a pre-defined, fixed position.

The main difference with respect to ERTMS Level 2 is that the train autonomously detects train integrity, rather than requiring the track circuit to perform this task. So, in ERTMS Level 3, the Radio Block Centre (RBC) determines the train location in co-operation with the onboard Train Integrity Monitoring (TIM) System. Train detection equipment can therefore be removed from the track, and the train separation distance is based on these moving block sections. In this way, the RCCS can fluidly monitor distances between trains, and the headway between trains is significantly reduced.

In brief, ERTMS Level 3 operates as follows:

  • the onboard computer determines the train position and verifies that the current speed corresponds to the travelled distance;

  • the train sends its position to the control center via radio signals;

  • the control center receives all trains position and sends the new movement authorities (MAs) to each train;

  • the onboard computer calculates the speed profile and the braking points and displays them to the train driver.

The moving block is defined as an automated control system allowing each train to receive a movement authority info from the control center. The control center has to have a continuous dialogue with all the trains to know their speed and position. Similarly a Communications-Based Train Control (CBTC) is a railway signaling system that makes use of the telecommunications between the train and track equipment for the traffic management and infrastructure control [4], [5]. The CBTC is based on the principle that trains determine their positions themselves and transmit it to wayside equipment. CBTC assures that the space between trains is always safe. However, CBTC is not always interoperable with other suppliers’ products. This issue is a major problem for large-scale mainline networks that use many interoperating products. ERTMS Level 3 is then an alternative solution. Moreover, ERTMS Level 3 enable the adoption of moving block, which means that two trains in succession can run at lower speeds. In ERTMS level 3, the train location and train integrity supervision are performed by the trackside RBC in cooperation with the train which sends position reports and train integrity information to RBC. The track to train communication is bidirectional and based on Euroradio whereas Eurobalises are mainly used as spot transmission devices for location referencing (position reports) [6]. The MAs (i.e., permission for a train to move to a specific location with supervision of speed) are generated trackside and are transmitted to the train via Euroradio (GSM-R). These MAs are displayed on a Driver Machine Interface (DMI) installed in the driver’s cab. The driver shall observe the displayed information on the DMI and shall react as required by the operational rules. More specifically, MA gives the maximum speed at which a train is allowed to reach the end of its Avement Authority. Trackside equipment knows each ERTMS controlled train individually by using the ERTMS identity of the leading ERTMS on board equipment. These informations are then updated according to the track situations and movement authority is renewed. So, ERTMS Level 3 provides a continuous speed supervision system, which also protects against overrun of the authority.

In this paper, we propose a centralized control approach in which all the trains within a particular sector of a single track-line send information about their position and speed to the local RCCS. With reference to the recent review paper [7] on dynamic scheduling, operation control and their integration in high-speed railways, this approach can be classified as operation control of high-speed trains, specifically cooperative control of multiple trains. In our approach, multiple trains are present in the same sector, and they are jointly regulated. Specifically, the RCCS can compute and forward the optimal corrective decisions to each train. In this context, the RCCS objective is to issue control instructions, with different levels of train priority, to bring actual positions closer to scheduled positions, considering the train dynamics, limitations in divergences of positions, speeds, and tractive effort, as well as the headway between consecutive trains in the platoon. It must be noted that in the proposed approach the size of the platoon in a given sector, that is, the number of trains, can vary during time. From a control viewpoint, the main novelty is the adoption of a quite recent control methodology [8] which allows to limit the expected quadratic divergence of the variables related to positions, speeds, and tractive effort for each train. According to the state of the art, the set of multiple trains is hereinafter defined as a train platoon [9] following a safe train distance strategy not focusing on the two-train tracking, but on the whole efficiency of the train platoon. It is worthwhile to underline that as the scheduled headway in high-speed trains is in the order of some minutes, the expected minimum safe distance between two consecutive trains is of several kilometres.

SECTION II.

State of the Art

In [10], the authors proposed a relationship between probabilities of emergency train braking as a function of distance between subsequent trains. They considered the following main parameters: train speed, emergency braking distance, train length, position localization error. They studied a scenario with two successive trains using the same track. The first train automatically checks its integrity and determines its position. Next, the resulting status message is sent through the GSM-R module to the nearest RBC [11]. Transmission errors mean that there is a particular probability value for the message being lost. The likelihood of successful transmission is assumed to follow an exponential distribution. Finally, they compute the probability of a train successfully being stopped by emergency braking as a function of distance between trains, using results obtained from a simulation. In [12], the authors developed models and algorithms for real-time conflict resolution adapted for both fixed block (ERTMS Level 2) and moving block (ERTMS Level 3) signaling safety concepts. The train timetables are currently implemented taking into account possible delays due to disturbances when travelling. In [13], the authors proposed a robust model predictive control (MPC) for train regulation in underground railway transportation. In [14], a decentralized MPC has been developed to manage the position of one train and the headway between two consecutive vehicles by the assumption to receive data related to distance, speed, and brake demand from the preceding train. Also, the speed curve optimization, where the controller must track the optimal speed profile to assure safe and effective operations appear relevant in the tracking control of rail trains. In [15], an optimization model based on a genetic algorithm is presented for the optimal speed curve seeking. Several issues still remain open before the ERTMS Level 3 can become operational. Primarily, the train integrity issue relating to the use of additional complex equipment in the train has to be addressed. To handle situations such as failures of onboard equipment, some redundancy architectures need to be proposed and their dependability parameters (Mean Time To Failure MTTF, availability, etc.) quantified. Then, the safety standard must clearly explain all the design steps and the required dependability parameters for each additional subsystem in the switch from Level 2 to Level 3. In [16], the authors applied a robust team decision theory to control noncritical train distances in moving blocks, such as in ERTMS Level 3. Also, in [17], the authors aimed at scheduling a set of trains from opposite sides along a single track which consisted in intermediate stations. The trains could pass each other at those stations while traversal times of the trains on the blocks between the stations only depended on the block lengths but not on the trains. In [18], the authors proposed a decision-making framework to support a critical decision process by developing a SWOT (strengths, weaknesses, opportunities, and threats) analysis based on external and internal factors that simultaneously enhance and constrain the ERTMS implementation.

In this paper, we propose an approach that could be used in RCCS’s to control speeds among trains on a single-track line, under safe situations, that is when moving blocks are however securely separated. A centralized stochastic optimal control model has been developed considering train’s position, speed, and related tractive force. Section III describes the proposed approach.

SECTION III.

Model Formulation

Consider a platoon of {M} trains travelling on a single-track line in a sector of the rail network. The position of each train in the platoon is identified by a number. The first train in the platoon is identified with number 1, the second one with 2, and the last one with M (Fig. 1).

Fig. 1. - A platoon of M trains on a single-track line. Trains are travelling from left to right and are identified by the numbers 
$1,\ldots $
,M, ascending according to their schedule.
Fig. 1.

A platoon of M trains on a single-track line. Trains are travelling from left to right and are identified by the numbers 1,\ldots ,M, ascending according to their schedule.

A. Information Model

The reference information model is the one proposed for ERTMS Level 3. A communication network allows information exchange among the trains themselves and with an information center, i.e. the RCCS, responsible for one sector (Fig. 2). In the case of a single track, each train can transmit information, e.g. its current GPS position and speed, to the center.

Fig. 2. - Transmission data when using moving blocks.
Fig. 2.

Transmission data when using moving blocks.

Transmission delays and the time required to compute the optimal control law defining the train tractive effort are assumed to be negligible. This assumption obviously implies an efficient communication channel and a fast, efficient control algorithm.

At the operational decisional level, the train movements are represented by the classical time distance graphs which visualize the position of trains in space and time according to constant speed values. The associated train tractive effort in steady state conditions with no acceleration may be computed by the model introduced in the following subsection.

B. Model of the Train Motion

A recent exhaustive description of the train motion equations can be found in [19]. The general equation for the motion of the i -th train, known as Lomonossoff’s equation, can be written as follows:\begin{align*} \begin{cases} \displaystyle \frac {dx_{i,1}(t)}{dt}=x_{i,2}(t) \\ \displaystyle W'_{i}\frac {dx_{i,2}(t)}{dt}=f_{i}(t)-(C_{i}^{a}+C_{i}^{b}x_{i,2}(t)+C_{i}^{c}x^{2}_{i,2}(t)) \\ \displaystyle -W_{i}g\sin \alpha _{i}(t) \quad i=1\ldots M \end{cases} \tag{1}\end{align*} View SourceRight-click on figure for MathML and additional features. where:

  • x_{i,1}(t) [m] and x_{i,2}(t) [m/s] are respectively the position and the speed of the i -th train at time t ;

  • f_{i}(t) [kN] is the tractive effort at time t ;

  • C_{i}^{a} , C_{i}^{b} , C_{i}^{c} are the Davis constants related to resistance, where:

    • C_{i}^{a} [kN] corresponds to mechanical resistance;

    • C_{i}^{b} \left[{\frac {kN}{m/{s}}}\right] corresponds to viscous mechanical resistance;

    • C_{i}^{c} \left[{\frac {kN}{m^{2}/{s}}}\right] corresponds to aerodynamic resistance;

  • W'_{i} [tonnes] is the effective mass including rotary allowance;

  • W_{i} [tonnes] is the train mass;

  • \alpha _{i}(t) [\cdot] is the slope angle of the position of the i -th train at time t .

C. Reference Trajectory

A timetable graph is usually represented by a time/position graph. It can be defined by the planned position \bar {x}_{i,1}(t) at each time t , and, consequently, the desired average speed \bar {x}_{i,2}(t) , which is defined as constant in each time interval. A reference trajectory, as described by a timetable graph, is so planned as a list of desired positions in a given planning interval [t_{p},t_{p+1} ). The desired position is described by the progressive distance related to the specific track. For the i -th train, the planned information is given by a list of planned values. In Fig. 3, an example of timetable graph and related notation are represented for two trains 1 and 2. Specifically, t_{p} is the time in which a train is planned to be in a given position, for example entering a railway section. Each interval [t_{p}, t_{p+1} ) is also further discretized into N_{p} time intervals indexed with k and having the same duration equal to \Delta t_{p} = \frac {\left ({t_{p+1}-t_{p} }\right)}{N_{p}} .

Fig. 3. - Position of two trains, i.e. i = 1..2, in a planned train graph. In each time interval, 
$[t_{p}, t_{p+1}$
) the speed is constant.
Fig. 3.

Position of two trains, i.e. i = 1..2, in a planned train graph. In each time interval, [t_{p}, t_{p+1} ) the speed is constant.

D. Linearization of the Motion Model

Equation (1) is nonlinear, due to a quadratic term function of the speed. A linear approximation is introduced, centered around a working state/control couple (\bar {x}_{i,2} , \bar {f}_{i} ) in steady condition with acceleration equal to zero, where \bar {f}_{i} is given by \begin{equation*} \bar {f_{i}}=(C_{i}^{a}+C_{i}^{b}\bar {x}_{i,2}+C_{i}^{c}\bar {x}^{2}_{i,2})+W_{i}g\sin \alpha _{i} \tag{2}\end{equation*} View SourceRight-click on figure for MathML and additional features.

At instant {t}_{p} the planned position and speed of the i -th train are \bar {x}_{i,1}(t_{p}) and \bar {x}_{i,2}(t_{p}) , corresponding to the tractive effort \bar {f}_{i}(t_{p}) . The system can be linearized for each planned instant \bar {t}_{p} around the state vector given by:\begin{equation*} \bar {\underline {x}}_{i,p}=\left[{\begin{array}{cc} \bar {x}_{i,1}(t_{p}) & \bar {x}_{i,2}(t_{p}) \end{array}}\right]' \tag{3}\end{equation*} View SourceRight-click on figure for MathML and additional features.

At any instant t\,\,\in [t_{p},t_{p+1} ) let \underline {\delta x}_{i,p}(t) and \delta f_{i,p}(t) be the variations around such state and control given by:\begin{align*} \delta \underline {x}_{i,p}(t)&=\left[{\begin{array}{cc} {x}_{i,1}(t)-\bar {x}_{i,1}(t_{p}) & {x}_{i,2}(t)-\bar {x}_{i,2}(t_{p}) \end{array}}\right]' \tag{4}\\ \delta f_{i,p}(t)&=f_{i}(t)-\bar {f}_{i}(t_{p}) \tag{5}\end{align*} View SourceRight-click on figure for MathML and additional features.

The resulting linearized equation in matrix formulation is:\begin{equation*} \frac {d\delta \underline {x}_{i,p}(t)}{dt} =A_{p} \delta \underline {x}_{i,p}(t)+B\delta f_{i,p}(t)+ \underline {w}_{\alpha} (t)+\underline {w}_{p}(t) \tag{6}\end{equation*} View SourceRight-click on figure for MathML and additional features. where \begin{align*} A_{p}&=\begin{bmatrix} 0 &\quad 1 \\ 0 &\quad -\frac {(C_{i}^{b}+2C_{i}^{c}\bar {x}_{i,2}(t_{p}))}{W'_{i}} \end{bmatrix} \tag{7}\\ B&=\begin{bmatrix} 0 \\ \frac {1}{W'_{i}} \end{bmatrix} \tag{8}\end{align*} View SourceRight-click on figure for MathML and additional features. and \begin{align*} \underline {w}_{\alpha} (t)=\begin{bmatrix} 0 \\ -\frac {W_{i}}{W'_{i}}g\sin \alpha _{i}(t) \end{bmatrix} \tag{9}\end{align*} View SourceRight-click on figure for MathML and additional features. and \underline {w}_{p}(t) is a noise given to imperfections of the model as well as by other technical and environmental components. The model given by (4)–​(9) can be so discretized according to the zero-order hold method. The matrices related to the discretized model are indicated as \bar {A}_{p} and \bar {B} .\begin{align*} \delta \underline {x}_{i,p}(k+1)&= \bar {A}_{p}\delta \underline {x}_{i,p}(k)+\bar {B}\delta f_{i,p}(k) + \underline {w}_{\alpha,p} (k)+\underline {w}_{p}(k) \\ k&=0, \ldots, N_{p}-1 \tag{10}\end{align*} View SourceRight-click on figure for MathML and additional features. assuming that k=0 corresponds to t=t_{p} and a generic value of k indicates t=t_{p}+k*\Delta t_{p} . In addition, there are physical constraints that should be introduced with respect to this model, such as the maximum values which the physical variables as speed, acceleration, and force may assume. One specific constraint, which is often given as a characteristic of the specific train, limits the tractive effort to a maximum value which is almost constant at low speeds, while at higher speeds it is inversely proportional to the speed and directly proportional to the train power. This is respectively due to the adherence of the tractive effort and to the power limit. This constraint is qualitatively shown for a generic high-speed train in Fig. 4.

Fig. 4. - Tractive effort vs speed in high-speed trains.
Fig. 4.

Tractive effort vs speed in high-speed trains.

E. Control Model

A control model is proposed to provide the train driver with values of the tractive force to track a desired trajectory in time and space as required by the RCCS. Specifically, the goal of the control model is to keep the state of each train, in terms of desired position and speed, around the planned value. This objective can be achieved by minimizing the quadratic deviation of the state from the desired values. In addition, for safety reasons, the control must also assure that the expected headways between consecutive trains do not vary above a given value.

Without no loss of generalization and for sake of simplicity, hereinafter, it is supposed that the track has no slope, that is (9) is a zero vector.

Based on this model formulation, the control problem hereinafter quoted as \mathcal {P}_{p} defined on time interval [t_{p}, t_{p+1} ), can be expressed as:\begin{align*} J^{*}&=\min _{\delta f_{i,p}(k), k=0,\ldots,N_{p}-1} \sum _{i=1}^{M}\sum _{k=1}^{N_{p}} {\textbf {E}}[\delta \underline {x}'_{i,p}(k)Q\delta \underline {x}_{i,p}(k)] \\ &\quad +\,\sum _{i=1}^{M}\sum _{k=0}^{N_{p}-1} {\textbf {E}}[\delta f'_{i,p}(k)R\delta f_{i,p}(k)] \tag{11}\\ &\qquad \qquad \,\text {s.t.} (6) \text {and} \\ &\hphantom {\qquad \qquad \,\text {s.t.} }{\textbf {E}}[\delta \underline {x}'_{i,p}(k)\delta \underline {x}_{i,p}(k)] \leq \underline {x}^{ MAX}_{i,p}(k) \\ &\hphantom {\qquad \qquad \,\text {s.t.} }i=1,\ldots, M \quad k=1,\ldots,N_{p} \tag{12}\\ &\hphantom {\qquad \qquad \,\text {s.t.} }{\textbf {E}}[\delta f^{2}_{i,p}(k)] \leq {f}^{ MAX}_{i,p}(k) \\ &\hphantom {\qquad \qquad \,\text {s.t.} } i=1,\ldots, M \quad k=0,\ldots,N_{p}-1 \tag{13}\\ &\hphantom {\qquad \qquad \,\text {s.t.} }{\textbf {E}}[(\delta \underline {x}_{i-1,p}(1,k)- \delta \underline {x}_{i,p}(1,k))^{2}] \leq {d}^{ MAX}_{i,p}(k) \\ &\hphantom {\qquad \qquad \,\text {s.t.} }i=2,\ldots, M\quad k=1,\ldots,N_{p} \tag{14}\end{align*} View SourceRight-click on figure for MathML and additional features.

For each train, the objective function (11) minimizes the quadratic divergence, related to the real and planned positions, speeds, and tractive efforts.

Constraints (12), (13), (14), defined as power constraints, limit the variance of the state and of the control within a certain interval specific to each train and for each time instant or interval.

It should be noted that in each time interval t \in [t_{p},t_{p+1} ), each planned train position varies between \bar {x}_{i,1}(t_{p}) and \bar {x}_{i,2}(t_{p+1}) , while the speed is constant, that is \bar {x}_{i,1}(t^{+}_{p})=\bar {x}_{i,1}(t)=\bar {x}_{i,1}(t^{-}_{p+1}) , \forall t \in [t_{p},t_{p+1} ). As a consequence, the reference acceleration is always zero.

So, for the position element x_{i,1}(k) , k=0,1,\ldots,N_{p} , this is not completely adherent with what required by the cost function (11). This problem can be faced at least according to three possible strategies. The first strategy is to modify the cost function considering the deviation from the planned position at each instant k=0,1,\ldots,N_{p} , transforming the problem in a tracking problem. In addition, the constraints (12) should also take into account the squared value of desired position in the first component of the vector on the right hand side \underline {x}^{ MAX_{i,p}} .

A second option could be to penalize with a very weak cost the position deviation in the diagonal elements of matrix {Q} , but, as above, modifying properly the constraints (12).

A third option could be to define a new reference system, where, at each instant k=0,1,\ldots,N_{p} , each position is x_{i,1}(k) is referred to a desired planned position \bar {x}_{i,1}(k) and not to \bar {x}_{i,1}(t_{p}) . This option implies some additional transformation which is omitted here for sake of readability. This third option has been chosen in the case study.

Constraint (14) requires that the difference between the divergence of the positions (with respect to the planned ones) of two consecutive trains i -1 and i must be limited by d^{ MAX}_{i,p}(k) , which is the maximum expected quadratic deviation of the safe distance.

F. Adding Smoothness Penalty

To reduce sudden variations of the resulting tractive force in \mathit {P}^{N} , an additional penalty on the control smoothness has been introduced by minimizing the square of the difference \delta f_{i,p}(k)-\delta f_{i,p}(k-1) . To implement the smoothing penalty, the problem has been modified by augmenting the state with \delta f_{i,p}(k) , as shown below. Let us define:\begin{align*}\hspace {-0.5pc}\delta \delta f_{i,p}(k)=\delta f_{i,p}(k)-\delta f_{i,p}(k-1)& \quad i=1,\ldots,M \\&k=0,\ldots,N_{p}-1 \tag{15}\end{align*} View SourceRight-click on figure for MathML and additional features. with \begin{equation*} \delta f_{i,p}(-1)=\delta f_{i,p-1}(N_{p-1}) \tag{16}\end{equation*} View SourceRight-click on figure for MathML and additional features. where \delta f_{i,p-1}(N_{p-1}) is the value of \delta f_{i,p-1} at the end of the previously linearized time interval, if available, 0 otherwise.

The equation (10) can be so rewritten as:\begin{align*} \begin{bmatrix} \delta \underline {x}_{i,p}(k+1) \\ \delta f_{i,p}(k) \end{bmatrix} &= \begin{bmatrix} \bar {A}_{p} &\quad \bar {B}\\ 0 &\quad 1 \end{bmatrix} \begin{bmatrix} \delta \underline {x}_{i,p}(k) \\ \delta f_{i,p}(k-1) \end{bmatrix} \\ &\quad +\, \begin{bmatrix} \bar {B} \\ 1 \end{bmatrix} \delta \delta f_{i,p}(t) + \begin{bmatrix} \underline {w}_{p}(k) \\ 0 \end{bmatrix} \tag{17}\end{align*} View SourceRight-click on figure for MathML and additional features. Thus, for the i -th train, we can consider to rewrite (10) as \begin{equation*} \delta \underline {\bar {\bar {x}}}_{i,p}(k+1)=\bar {\bar {A}}_{p}\delta \underline {\bar {\bar {x}}}_{i,p}(k)+\bar {\bar {B}}\delta \delta {f}_{i,p}(k)+ \bar {\bar {W}}_{p}(k) \tag{18}\end{equation*} View SourceRight-click on figure for MathML and additional features. where \begin{align*} \delta \underline {\bar {\bar {x}}}_{i,p}(k+1)&=\begin{bmatrix} \delta \underline {x}_{i,p} \\ \delta f_{i,p}(k-1) \end{bmatrix} \\ \bar {\bar {A}}_{p}&=\begin{bmatrix} \bar {A}_{p} & \bar {B} \\ 0 & I \end{bmatrix} \bar {\bar {B}}=\begin{bmatrix} \bar {B} \\ I \end{bmatrix} \bar {\bar {W}}_{p}(k)=\begin{bmatrix} \underline {w}_{p}(k) \\ 0 \end{bmatrix}\end{align*} View SourceRight-click on figure for MathML and additional features. In problem \mathcal {P}_{p} , the cost function (11) can be consequently modified and for a more compact notation can be written in matrix formulation as:\begin{align*} J^{*}&=\min _{\delta \delta \underline {f}} \sum _{k=1}^{N_{p}}\mathbf {E}[\delta \underline {\bar {\bar {x}}}_{p}(k)' Q_{0} \delta \underline {\bar {\bar {x}}}_{p}(k)] \\ &\quad +\, \sum _{k=0}^{N_{p}-1}\mathbf {E}[\delta \delta \underline {f}_{p}(k)' \delta R \delta \delta \underline {f}_{p}(k)] \tag{19}\end{align*} View SourceRight-click on figure for MathML and additional features. where:

  • \delta \underline {\bar {\bar {x}}}_{p} and \delta \delta \underline {f}_{p} are vectors whose components are the ordered sequence of vectors \delta \underline {\bar {\bar {x}}}_{i,p} and \delta \delta f_{i,p}, i=1,\ldots,M , in a specified time interval [t_{p},t_{p+1} );

  • \begin{aligned} Q_{0}=\begin{bmatrix} Q_{0,xx} &\quad 0_{3M,M} \\ 0_{M,3M} &\quad \delta R \end{bmatrix} \in \mathbb {R}^{4M} \end{aligned} is a symmetric positive definite matrix, i.e the state/control cost matrix, where 0_{M,N} is a matrix of zeros with dimension M\text{x}N .

  • \begin{aligned} Q_{0,xx}=\begin{bmatrix} Q_{0,x1} &\quad 0_{M,M} &\quad 0_{M,M} \\ 0_{M,M} &\quad Q_{0,x2} &\quad 0_{M,M}\\ 0_{M,M} &\quad 0_{M,M} &\quad Q_{0,ff} \end{bmatrix} \in \mathbb {R}^{3M} \end{aligned} is a symmetric positive definite\vphantom {\sum ^{R}} matrix, i.e. the state cost matrix including the control cost at the previous instant, where Q_{0,x1}=diag(p_{i}) , Q_{0,x2}=diag(h_{i}) , and Q_{0,ff}=diag(r_{i}) .

  • \delta R \in \mathbb {R}^{M} is a diagonal matrix, i.e. the cost function of the control, minimizing the acceleration changes.

For each train, the system evolves according to the linear dynamics given by (18) and it is subject to the stochastic constraints (12), (13), and (14). In addition, another set of constraints is added to limit the variation of acceleration. Specifically:\begin{align*}&\hspace {-0.5pc}{\textbf {E}}[\delta \delta f^{2}_{i,p}(k)] \leq \delta \delta f^{ MAX}_{i,p}(k) \quad i=1,\ldots,M \quad \\&k=0,\ldots,N_{p}-1 \tag{20}\end{align*} View SourceRight-click on figure for MathML and additional features. These constraints can be put in a generic matrix formulation as shown in (21).\begin{align*} &\hspace {-2pc}{\textbf {E}}\left [{\begin{bmatrix} \delta \underline {\bar {\bar {x}}}_{p}(k) \\ \delta \delta \underline {f}_{p}(k) \end{bmatrix}' Q_{r,p} \begin{bmatrix} \delta \underline {\bar {\bar {x}}}_{p}(k) \\ \delta \delta \underline {f}_{p}(k) \end{bmatrix}}\right] \leq \gamma _{r,p}(k) \\ r&=1,\ldots,R= N_{p}\textrm {x}(4M-1) \tag{21}\end{align*} View SourceRight-click on figure for MathML and additional features. where

  • Q_{r,p} \in \mathbb {R}^{4M} are matrices associated with each constraint given by (12), (13), (14), and (20)

  • \gamma _{r,p}(k) \in \mathbb {R}^{4M-1,1} correspond to the right-hand side of constraints (12), (13), (14), and (20), specifically, for the k -th sample time the following assignments have been given:

    • \gamma _{r,p}(k)=\underline {x}^{ MAX}_{r,p}(k) in (12) r=1,\ldots,M

    • \gamma _{r,p}(k)=f^{ MAX}_{r,p}(k) in (13) r=M+1,\ldots,2M

    • \gamma _{r,p}(k)=\delta \delta {f}^{ MAX}_{r,p}(k) in (20) r=2M+1,\ldots,3M

    • \gamma _{r,p}(k)=d^{ MAX}_{r,p}(k) in (14) r=3M+1,\ldots,4M-1

So, \mathit {P}^{N} can be resumed according to a more formal and compact notation, as reported hereinafter.

Problem \mathcal {P}_{p}

Consider a train platoon subject to the dynamics given for each train by (18), linearized around a planned working state and control at instant {t}_{p} . The problem \mathcal {P}_{p} is to find the closed loop optimal control \delta \delta \underline {f}_{p}(k) function of the state as shown in (22) \begin{equation*} \delta \delta \underline {f}_{p}(k)=\mu (\delta \underline {\bar {\bar {x}}}_{p}(k)) \quad k=0,\ldots, N_{p}-1 \tag{22}\end{equation*} View SourceRight-click on figure for MathML and additional features. that minimizes the cost function (19), subject to constraints (21), on the time horizon [t_{p}, t_{p+1} ) discretized into N_{p} uniform time intervals.

SECTION IV.

Stochastic Linear Quadratic Optimal Control

The problem \mathcal {P}_{p} can be solved by the approach for stochastic linear quadratic control [8], resulting in a linear control law. Its main characteristics, with reference to this case study, are summarized by the following theorem.

theorem In the problem \mathcal {P}_{p} , the optimal control is linear, that is \delta \delta \underline {f}_{p}(k)=-L_{p}(k) \delta \underline {\bar {\bar {x}}}_{p}(k) , where L_{p}(k) is such that:\begin{align*} &\hspace {-2pc}(\bar {\bar {B}}'S_{p}(k+1)\bar {\bar {B}}+Q_{ff}(k))L_{p}(k) \\ &=\bar {\bar {B}}'S_{p}(k+1)\bar {\bar {A}}_{p}+Q'_{xf}(k) \\ k&=0,\ldots,N_{p}-1 \tag{23}\end{align*} View SourceRight-click on figure for MathML and additional features. where S_{p}(N_{p}) =0 , and Q_{ff} , Q'_{xf} , S_{p}(k+1) are defined by the following sequence of problems k=0,\ldots,N_{p}-1 :\begin{equation*} \max _{S_{p}(k),\tau _{r,p} (k) \geqslant 0} {\textbf {Tr}} S_{p}(k) - \sum _{r=1}^{R} \tau _{r,p}(k)\gamma _{p}(k) \tag{24}\end{equation*} View SourceRight-click on figure for MathML and additional features. subject to the following linear matrix inequality:\begin{align*} &\hspace {-2pc}Q(k)+ \begin{bmatrix} \bar {\bar {A}}'_{p}S_{p}(k+1)\bar {\bar {A}}_{p}-S_{p}(k) &\quad \bar {\bar {A}}'_{p}S_{p}(k+1)\bar {\bar {B}}\\ \bar {\bar {B}}'S_{p}(k+1)\bar {\bar {A}}_{p} &\quad \bar {\bar {B}}'S_{p}(k+1)\bar {\bar {B}} \end{bmatrix} \geqslant 0 \tag{25}\\ Q(k)&=Q_{0}+\sum _{r=1}^{R}\tau _{r,p}(k)Q_{r}=\begin{bmatrix} Q_{xx}(k) &\quad Q_{xf}(k) \\ Q'_{xf}(k) &\quad Q_{ff}(k) \end{bmatrix} \tag{26}\end{align*} View SourceRight-click on figure for MathML and additional features.

Proof:

The demonstration and the specification on the iterative procedure to solve this problem are in [8].

Note on the theorem

The above theorem considers a perfect observation of the state. Errors in the observation of the state (e.g. GPS measures) are supposed negligible in relation to other potential sources of error. However, it is worth remarking that where there are other sizable measurement disturbances, it can be shown that the related output feedback problem becomes equivalent to state feedback problems \mathcal {P}_{p} with a proper application of Kalman filter.

SECTION V.

Case Study

The case study refers to the timetable of nine trains over a Section of the high-speed Paris-London line. Fig. 5 shows the track topography on which validation has been achieved. Due to the low altitude of each station and the long distance between two consecutive stations, the grade of the track taken into account in (1) is negligible. These trains are scheduled along a single-track line over a period of about three hours, as shown in the timetable presented in Table I. All trains start from the same position (identified by the progressive length, starting from 0, St. Pancras). The length of the single-track line is about 78 km. Two trains (Train 2, Train 6) stop after 31 km approximately (Singlewell).

TABLE I Timetable for the 9 Trains of the Case Study.All Trains Enter at Position 0
Table I- 
Timetable for the 9 Trains of the Case Study.All Trains Enter at Position 0
Fig. 5. - Track topography on which validation has been achieved. In the time interval 630 - 834 s the three trains T1,T2,T3 are planned to cover a space shown by the related rectangles.
Fig. 5.

Track topography on which validation has been achieved. In the time interval 630 - 834 s the three trains T1,T2,T3 are planned to cover a space shown by the related rectangles.

It can be observed that several platoons form in the network at different time intervals. These platoons are characterized by different numbers of trains. This is evident in Table II showing the trains present in the rail sector in different time periods, according to the scheduled timetable. As instance, in period 2 a platoon of {M} =2 trains is present, whereas in period 3 a platoon of {M} =3 trains is formed. Different computations using the proposed approach are therefore required for each time period. Hereinafter, the period number {p} = 3 is chosen for the application of the proposed approach. As already noted, in this case M=3 and trains 1, 2 and 3 are travelling on the track line. In Fig. 6, the space-time graphs of the selected trains are shown.

TABLE II Presence of the 9 Trains Over the Railway Section
Table II- 
Presence of the 9 Trains Over the Railway Section
Fig. 6. - Progression of the selected trains on the rail line over time.
Fig. 6.

Progression of the selected trains on the rail line over time.

Fig. 7. - Planned tractive efforts of the trains in the selected time window.
Fig. 7.

Planned tractive efforts of the trains in the selected time window.

The case study has been applied on a time window which lasts 204 s. Specifically, in Fig. 6, the light grey area represents the studied time window: train 3 starts its travel at t = 630 s, train 2 concludes its travel at t = 834 s, while train 1 is covering the selected track line during the interval t = 630 s and t = 834 s.

A. Computation of the Control Law

The trains have the same physical characteristics as reported in Table III. The noises contained in matrix \bar {\bar {W}}_{p}(k) are characterized by a Normal distribution N(0,1) . The sample time duration is \Delta t=0.1 s.

TABLE III Parameters Used in the P^{N} for 3 Trains of theCase Study (Period {p} = 3 in Table II)
Table III- 
Parameters Used in the 
$P^{N}$
 for 3 Trains of theCase Study (Period 
${p}$
 = 3 in Table II)

In Table IV, the values \delta \underline {x}_{i,p}(1,0) and \delta \underline {x}_{i,p}(2,0) are respectively the divergence of the initial positions and speeds of the 3 trains with respect to the planned ones.

TABLE IV Initial Relative Position and Speed (Period {p} = 3 in Table II)
Table IV- 
Initial Relative Position and Speed (Period 
${p}$
 = 3 in Table II)

As regards the matrices and parameters required by the {R} constraints defined by (21), they have been set according to three different scenarios, in which some of such constraints have been relaxed. Specifically, the \gamma _{r,p}(k) are constant in time, that is \gamma _{r,p}(k)=\gamma _{r} , and their value in the three scenarios is shown in Table V.

TABLE V Parameter Values Used in the Three Scenarios
Table V- 
Parameter Values Used in the Three Scenarios

In the first scenario (OBJ1), more relevance is given to track the reference trajectory for each train. So the values \gamma _{i,p} for i=1,2,3 are relatively low, while the other thresholds are relaxed.

In the second scenario (OBJ2), the parameters have been set to emphasize the minimization of difference between the real and planned positions for two consecutive trains, while also limiting the variation of acceleration.

The third scenario (trade-off) is constrained with lower thresholds with respect to the previous ones. Values of the parameters are defined to obtain a certain trade-off between OBJ1 and OBJ2 scenarios. Table V contains the values of input parameters used in the different simulations.

SECTION VI.

Results

The optimal control law has been computed for each train with a receding horizon approach for a time horizon of T = 204 s. For the proposed simulations, a comparison between the planned and actual values for the states and controls variables have been carried out. In the scenarios, actual tractive effort f_{i} of the 3 trains must be limited by the following thresholds [19]:\begin{equation*} f_{i}(k)< min \begin{cases} \frac {P_{max}}{x_{i,2}(k)},F_{max} \end{cases}\Big \} \tag{27}\end{equation*} View SourceRight-click on figure for MathML and additional features. where P_{max} and F_{max} are respectively the current maximum train power and the maximum allowed tractive force, and x_{i,2}(k) is the actual speed of the i -th train, at instant k .

A. Scenario I–Tracking Position (OBJ1)

In OBJ1 scenario, the control law generates the best values for the tractive forces to track the desired trajectory for the three different trains.

Fig. 8 shows the optimal positions of the trains with respect to the planned/scheduled ones on a finite horizon of t = 204 s. The figures show a significant tracking of the planned positions more evident for trains 2 and 3 which differ from the planned trajectories respectively for 298 m and 2 m at maximum. However, in the worst case, for train 1, at the end of the time horizon, the real trajectory only diverges for 800 m. This is mainly due to the fact the planned trajectory is hardly feasible for the type of trains used in the case study.

Fig. 8. - Trend for the optimal value of the states for the 3 trains for 
$T$
=203. (Scenario I) The full lines display the real trajectories while dashed lines represent the planned ones.
Fig. 8.

Trend for the optimal value of the states for the 3 trains for T =203. (Scenario I) The full lines display the real trajectories while dashed lines represent the planned ones.

Fig. 9 displays the control variables associated to the tractive forces generated by the trains to carry out their path. It is shown that the constraint (27) related to the maximum allowed value for the tractive effort is respected (F_{max} =360 kN). Finally, in the Fig. 10, the divergence from the planned headway between two consecutive trains is shown. The lower performances are due to the choice of the parameters \gamma _{10} and \gamma _{11} relaxing the constraints related to the position of the train in respect to the previous one.

Fig. 9. - Trend for the tractive effort for the three trains (scenario I).
Fig. 9.

Trend for the tractive effort for the three trains (scenario I).

Fig. 10. - Divergence of the headway between two consecutive trains in respect to the planned positions (scenario I).
Fig. 10.

Divergence of the headway between two consecutive trains in respect to the planned positions (scenario I).

In quantitative values, the first term of objective (19) has a value of 1102, while the second one reaches 33.

B. Scenario II–Limiting Headway Values for Two Consecutive Trains (OBJ2)

In this second scenario, the performances related to the capability of the proposed approach to respect a predefined headway among trains have been tested. In particular, the parameters \gamma _{r} assumed the values shown in the second column of Table V. Fig.11 shows a better performance on the headway with respect to the previous scenario (Fig.10).

Fig. 11. - Divergence of the headway between two consecutive trains in respect to the planned positions (scenario II).
Fig. 11.

Divergence of the headway between two consecutive trains in respect to the planned positions (scenario II).

On the other hand, as expected, the tracking of the position (Fig.12) gets worse for the three trains which, at the end of the time horizon, diverge for about 1000 m from their ideal positions (1030 m for Train 1, 1009 m for Train 2 and 982 m for Train 3).

Fig. 12. - Trend for the real and planned values of the states for the three trains (scenario II). The full lines display the real trajectories while dashed lines represent the planned ones.
Fig. 12.

Trend for the real and planned values of the states for the three trains (scenario II). The full lines display the real trajectories while dashed lines represent the planned ones.

In quantitative values, the first term of objective (19) has a value of 1442, while the second one reaches 4. As expected the terms related the tractive effort have a lower value than the previous scenario, as the right hand part of the constraints defined by \gamma _{7,p} ,\gamma _{8,p} ,\gamma _{9,p} are more restrictive.

Fig.13 shows the related tractive efforts which are smoother with respect to the ones given by scenario I.

Fig. 13. - Tractive efforts of the trains in the scenario II.
Fig. 13.

Tractive efforts of the trains in the scenario II.

C. Scenario III–Tradeoff Between OBJ1 and OBJ2

The third scenario provides a more constrained scenario where the OBJ1 and OBJ2 are balanced. In this case, the three trains converge to the planned trajectories with a deviation respectively of 970 m, 676 m, and 479 m (Fig. 14). According to the imposed parameter values the divergence for the headways is also reduced compared with the scenario I. In this case, the differences related to the real and planned headways (Fig. 15) are, respectively, about 400 m between Train 1 and Train 2, and 200 m between Train 2 and Train 3. In quantitative values, the first term of objective (19) has a value of 820, while the second one reaches 3. As also shown by the graphs, this scenario represents a good trade-off between the previous ones also in the quantitative values of the relative objective.

Fig. 14. - Trend for the real and planned values of the states for the three trains (scenario III). The full lines display the real trajectories while dashed lines represent the planned ones.
Fig. 14.

Trend for the real and planned values of the states for the three trains (scenario III). The full lines display the real trajectories while dashed lines represent the planned ones.

Fig. 15. - Divergence of the headway between two consecutive trains in respect to the planned positions (scenario III).
Fig. 15.

Divergence of the headway between two consecutive trains in respect to the planned positions (scenario III).

Fig. 16 displays the values associated to the actual tractive forces in scenario III.

Fig. 16. - Tractive efforts of the trains in the scenario III.
Fig. 16.

Tractive efforts of the trains in the scenario III.

SECTION VII.

Conclusion

Recent research, e.g [20], [21], aims at contributing to the introduction of specific operating modes in the ERTMS/ETCS standard to push railway infrastructure network capacity to its limits in high traffic corridors, while enhancing safety aspects related to the headway control in a train platoon. This work is partially in line with these works, as it aims to contribute to minimize the deviation from pre-planned train schedules which in turn have been designed to optimise the infrastructure capacity, while respecting safety constraints as affected by stochastic noise. Specifically, this work adopts a linear quadratic optimal control problem with stochastic constraints for managing the movement of consecutive trains sharing a single-track railway. In the proposed model, given the position and speed of a train at each time interval, the movements of the trains are modelled by Lomonossoff’s equation.

The proposed control model provides the train traffic operators with a methodology which can adjust train timetable deviations suggesting trains’ tractive efforts under safe conditions, which is when the headway does not become a critical issue. The introduction of a smoothness penalty of the inputs has been introduced in the control model to reduce sudden variations of the tractive effort.

From a methodological viewpoint, it is worthwhile to underline the innovation introduced by the control strategy that has been applied. Although linear quadratic control approaches, even in a stochastic formulation, are a classic example of the control literature, in this paper, a more recent and innovative generalised formulation proposed by Gattami [8] has been adopted, where the main novelty is the possibility to add quadratic stochastic constraints to the problem formulation, so, limiting the stochastic effects of disturbances on the system. Specifically, the most relevant originality of the paper deals with the possibility to introduce stochastic constraints in a linear quadratic regulator, considering the whole platoon as a system. In fact, the proposed model could be improved under different aspects. For example, the approach could be enhanced taking into account the different dynamic behaviour of tractive and braking forces in the control of the train speed, also considering additional aspects such as energy saving [22]. While in the proposed control model the stochastic constraints are the same both for tractive and braking forces, a more complex behaviour due to perturbated situation can affect braking performances, as shown in previous works [23], [24]. In this context, a future development could be the definition of a switching control approach allowing the combination of different algorithms as recently proposed in a more traditional PID control context [25], applying braking constraints in deceleration contexts and tractive constraints in acceleration ones.

We would expect the method employed to be generalizable to trains that are not physically in a platoon on a single track line, via the concept of virtual platooning that was recently introduced for road vehicles [26]. In conclusion, the proposed linear quadratic optimal control scheme may be a tool suitable for implementation in rail traffic control centres in virtue of its good performance and reduced computational times.

NOTE

Open Access funding provided by ‘Università degli Studi di Genova’ within the CRUI CARE Agreement

References

References is not available for this document.