Introduction
There is no contest regarding the safety of autonomous vehicles being of the utmost importance within the industry. The inherent nature of autonomous vehicles brings forth the issue that even a minor vehicle fault may lead to serious, possibly fatal, accidents. Hence, studies on fail-safe systems (also known as fallback systems) are particularly important for autonomous vehicle research. Autonomous vehicles are equipped with various equipment such as sensors, computers, and actuators. While the various parts are communicating with each other, monitoring the condition of each part is essential; Fail-safe systems form decisions based on these conditions. For autonomous vehicle sensors, abnormal conditions can be identified with relative ease as manufacturers provide essential information regarding the sensor conditions. However, a general solution to the autonomous vehicle fail-safe system still proves to be challenging as the hardware and software configurations differ depending on the vehicle [1]. Each vehicle system requires a corresponding and independent fail-safe structure. Because numerous companies and research institutes independently develop their own autonomous vehicles, the challenge of developing a universal fail-safe system persists.
Prior studies have shown that several vehicle companies have developed a dual-system and fallback system for safety. For example, a fallback system can be found in Google Waymo’s autonomous vehicles. Waymo’s fallback system is composed of 4 backup systems: Backup computing, backup braking, backup steering, and backup power [2]. GM CRUISE, on the other hand, has a backup computer, backup actuators, and redundant signal communication [3]. Baidu’s Apollo autonomous vehicle has a guardian module and a fail-safe system. The open-source nature of the Apollo software structure was applied to all control commands through the guardian module [4]. The studies mentioned so far introduce the various independent fail-safe methodologies applied to different autonomous vehicles.
Other areas of studies apart from autonomous vehicles have also shown progress in fail-safe systems through the use of fault diagnosis and countermeasure methods for system malfunction or performance degradation. Of such, fault detection and diagnosis methods have been used in fail-safe structures to develop the stability and reliability of their fail-safe systems. Fault diagnosis usually utilizes the concept of hardware and software redundancy. The hardware redundancy concept relies on the usage of 2 pieces of the same hardware with the same input. The output signals of both pieces of hardware are compared to detect faults. However, the downside to this method is cost and limitations on mounting locations; Environment sensors such as LiDARs, RADARs, and Cameras cannot be installed on the exact same locations. Consequently, the progression of fault diagnosis has led to software redundancy (analytical redundancy) methods becoming the mainstream of fault detection and diagnosis study since the 1980s [5]. Some examples of software redundancy implementation are given below. Fault reconstruction and detection were achieved through the use of a sliding mode observer with parameters obtained from the LMI technique [6]. Fault diagnosis on applications for satellite formation flight was done through a robust unknown input observer. In this case, the isolation of faulty actuators was done while taking model uncertainty, input, and output environmental disturbance into account [7]. Fault estimation and fault-tolerant control problems of a class of switched stochastic systems were characterized using an independently designed observer; sensor faults were estimated and used for fault diagnosis. The dimensions of the designed observer were reduced when compared to existing results. [8] The advantages and practical implications of artificial intelligence (AI) were reviewed in rotating machinery. [9] Motor bearing condition monitoring and fault diagnosis via a vibrational signal were achieved using a wireless sensor network’s amplitude and frequency data [10]. Real-time monitoring and fault detection for motor arrays through vibration, current, and temperature data was achieved via a wireless sensor network, in consideration of time delays [11]. A fault diagnosis system for autonomous vehicles based on neural networks using the internet-of-things (IoT) was developed to collect various types of information [12]. Fail-safe motion planning and verification were conducted to generate vehicle trajectories that could allow vehicles to maintain desired maneuvers with enhanced safety [41]–[43]. A fail-safe priority-based intersection approach was developed to manage signal-less intersections autonomously [44]. A fault detection, isolation, and identification architecture for multi-faults in multi-sensor systems has been introduced in [39]. Algorithms designed to diagnose faults in functional parts of autonomous vehicles were developed for fail-safe systems [38], [45]–[47]. The following studies have placed greater focus on designing and analyzing environmental sensor faults. For example, vehicle chassis sensor faults [47], [48] handled faults related to vehicle chassis sensors, whereas faults in environment sensors such as LiDARs and Radars were tackled in [38], [45]–[47], [49]–[52]. [53]–[55], on the other hand, focused on actuator faults. All these studies have performed validations through simulations and vehicle tests. Various methodologies regarding fault diagnosis and fault-tolerant control were proposed [45], [46], [48], [52]–[54].
A thorough review of the studies mentioned above has shown that aspects of fail-safe systems such as fault detection and reconstruction have been studied through several methods, including observer-approaches, statistical methods, sensor monitoring, and artificial networks. However, most methods rely on the observer or estimator assumption that fault is bounded by parameter design.
This paper proposes two fault detection algorithms for longitudinal sensor fault detection. Other studies have shown the occasional use of the sliding mode observer method to detect faults. However, the conventional sliding mode observer requires an output error boundary to design the injection term parameters. Designing this observer still proves to be challenging due to the time-variant nature of the output error boundary. Therefore, in this paper, an adaptive sliding mode observer was developed to update the injection term parameters in accordance with output errors affected by acceleration faults. Methodologies with the acceleration fault can be reconstructed using the Lyapunov stability concept, also applicable in detecting acceleration sensor faults through a threshold approach. Moreover, a linear model-prediction algorithm has been designed for faults in environmental sensors as well.
In their current state, environmental sensors utilize clearance data and relative velocity to make predictions based on the discretized system model. Two predicted states can be computed from the minimum and maximum acceleration usage. The feasible boundaries are then configured from each predicted state. Together with past accumulated states, comparisons are made with the current state. The fault index can be calculated by counting values sitting outside the configured boundaries. Finally, an environmental sensor fault is determined based on a defined threshold after averaging the count. Three representative fault types are classified as follows: Power off, Holding Signal, Offset Signal. Of the type of faults mentioned so far, this paper will only consider and demonstrate the offset fault. This is because the detection of offset faults results in easy identification of the other types.
This paper also contains an emergency control portion essential for an autonomous vehicle to be in accordance with level 4 autonomous driving. This research utilizes an SMC method that takes adaptive convergence time into account. Because the convergence time of the control error was determined mathematically, the corresponding control values were designed to change this predetermined convergence time. The method incorporating convergence time was applied to the test vehicle under the fail-safe control portion. Later sections describe the method’s proof of stability in greater detail. Shown below are some of the relevant literature surveys regarding SMCs and the usage of such a method.
Jun Hu et al. proposed a survey on SMC for networked control systems [56]. This survey was able to provide a greater understanding of the current application trends of the SMC. Jun Hu et al. also investigated the uncertain nonlinear systems of actuator faults [57]. In this research, the team designed various uncertain actuator faults and validated a newly proposed controller, the ISMC scheme, to ensure stability. Validations were carried out with the rocket fairing structure model. SMC for Networked control systems (NCSs) shows great promise in fail-safe research in that methodologies used here can also be effectively utilized in fail-safe systems as well. Moving on, the SMC was effectively utilized in reducing the effects of actuator attack failures [58]. The proposed approach was able to guarantee a type-2 fuzzy system and the input-to-state stabilization of sliding motion. Among the studies showing various SMC methodologies, their approach to time delay could be categorized into 4 types: constant time-delay, time-varying delay, distributed delay, and probabilistic interval time delay [56]. The constant time-delay type could be further divided into present-based SMCs and time-shift SMCs. A related study proposed a predictive SMC for a networked control system with time delay and packet dropout [59]. A chattering-free law was also designed and utilized in the predictive SMC. Within the field of time-varying delay, recent trends have leaned towards delay-fraction SMC research.
D. Ao et al. proposed a super-twisting sliding mode control algorithm (STA), based on the Lyapunov theory, to enhance the robustness and attenuation of the chattering issue [60]. S. Wang et al. suggested a new sliding mode control strategy based on an RBF (Radial Basis Function) neural network to solve tracking errors and substantial chattering phenomena [61]. A mathematical nonlinear longitudinal model was proposed, and their algorithm was validated through a simulation study under typical driving conditions. B. Peng et al. proposed an improved sliding mode control strategy for vehicle platoons, capable of forming a platoon in shorter time periods in simulated environments with better stability [62]. The proposed strategy was able to reach a stable and controllable area rapidly. P.Wang et al. proposed a robust automatic control strategy based on the back-stepping sliding mode control theory [63]. The study conducted a co-simulation with CarSim/Simulink under various scenarios. Y. Han et al. proposed a sliding mode control method based on the improved reaching law [63]. The method was developed to control the attitude of an automated guided vehicle, where an outer loop controller outputs the desired attitude. Validations were conducted through simulations.
So far, a thorough review of the numerous related studies has shown current trends and achievements made with the SMO and SMC. However, some issues are yet to be addressed. Firstly, methodologies proposed by most studies are developed under the assumption that faults are bounded by parameter design. Secondly, there is still a lack of research on the usage of the SMO and SMC in emergency controls. Additionally, a “PC (Upper controller) Shutdown” situation, where algorithms may only work with the last available values, is a genuine possibility that cannot be ignored. This paper aims to address all these issues with the algorithms proposed by our research team. The main contributions of this work can be summarized as follows:
A newly designed framework for fail-safe modules in Autonomous Vehicles has been proposed and implemented in an actual autonomous vehicle.
An MIT rule-based adaptive SMO methodology has been proposed for the detection of longitudinal faults in chassis and environment sensors (lidar, radar) and for the reconstruction of unknown faults in real driving scenarios.
The proposed adaptive SMC (In consideration of convergence time) has been successfully implemented and tested in fault scenarios involving actual vehicle experiments. Possible fault types of AVs have been defined through these experiments as well.
Overview of a Fail-Safe System in Autonomous Vehicles
This section introduces the fail-safe system applied to the autonomous vehicle used in this paper. While the system has only been applied and tested on autonomous vehicles used in university laboratories, its uses are not limited to select test vehicles. The methods proposed in this paper are still extensively applicable in other areas. The overall controller is composed of an upper controller and a lower controller: A commercial, industrial PC (IPC) as the upper controller, and a MicroAutobox (dSPACE) as a lower controller. It is to be noted that the lower controller is more robust than the upper controller. The test vehicle’s sensor configuration is depicted in Fig. 1. In this research, the focus is primarily placed on developing a fail-safe module for an automated driving system. Fig. 2 depicts a diagram of the module that contains Fault Detection and Fail-Safe Control.
The Fault Detection portion is made up of a Hardware Fault Detection part and an Algorithm-Based Detection part. Fault detection in this paper will mainly refer to algorithm-based environmental sensor fault detection and chassis sensor reconstruction methodologies. Section III will introduce the algorithm-based fault detection methodology in greater detail.
The next part introduces the adaptive sliding mode observer-based fault reconstruction and detection. On the other hand, fail-safe control is comprised of Tolerance Control and Emergency Control parts. Due to differences in the longitudinal controller, Tolerant Control could not be effectively applied to the test vehicle. Hence it was only tested via simulations. This is further described in [13]. Within the Emergency Control portion, a reference target building block and a control block, one that follows the reference target model, can be found.
This paper also utilizes a longitudinal control algorithm encompassing the fault detection algorithms. The following three consecutive parts make up the control algorithm. First, a decision part determines the vehicle’s current state through the CAN bus signal. Next, confirmation regarding a fault where the CAN communication gets stuck occurs. Indexes based on the best reference target model are determined. Finally, the lower-level controller determines the desired control input: The steering angle is calculated through a dead reckoning method, whereas the longitudinal acceleration control input is determined based on a sliding mode control method that tracks the reference target model. Table 1 describes the overall algorithm used in this paper for the fail-safe system in autonomous vehicles. In Table 1, the fault detection method corresponding to each hardware module can be found. An appropriate maneuver strategy for each module is also proposed.
In Table 1, a single asterisk indicates that the method was applied to an actual autonomous test vehicle. A double asterisk indicates that the method was only verified through simulations.
Adaptive Sliding Mode Observer and Linear Prediction Based Sensor Fault Detection
The following driving condition was proposed to test for various fault scenarios. The test vehicle was to follow another leading vehicle located in its longitudinal direction. The environmental sensors were used to measure and obtain vehicle clearance and relative velocity information. The acceleration of the leading vehicle was measured through a wireless communication sensor. The observer’s kinematic model defined for the longitudinal direction is given in the following form: \begin{align*} \dot {x}=&Ax+Bu\tag{1a}\\ y=&Cx\tag{1b}\\ A=&\left [{ {{\begin{array}{cc} 0 &\quad 1 \\ 0 &\quad 0 \\ \end{array}}} }\right]\!,\quad B=\left [{ {{\begin{array}{c} 0 \\ 1 \\ \end{array}}} }\right]\!,~C=\left [{ {{\begin{array}{cc} 1 &\quad 1 \\ \end{array}}} }\right]\tag{1c}\end{align*}
A. Adaptive Sliding Mode Observer for Acceleration Sensor Fault Reconstruction and Detection
The adaptive sliding mode observer is designed to reconstruct the acceleration fault: see model schematics in Fig. 3. The description of Fig. 3. is as follows.
The environment, communication, and acceleration sensors transmit various sensor values to the observer. The SMO then calculates an estimated error,
The kinematic model inclusive of the fault term \begin{align*} \dot {x}=&Ax+Bu+Ff_{a}\tag{2a}\\ F=&\left [{0\quad -1}\right]^{T}\tag{2b}\end{align*}
\begin{equation*} T=\left [{Null(C)\quad C }\right]^{T}\tag{2c}\end{equation*}
When observer error is zero, acceleration faults can exist after convergence.
When convergence time is extremely small, the observer performs appropriately.







B. Stability Analysis of Sliding Mode Observer
The convergence stability of the output error can be ensured through the use of the injection term \begin{align*} V=&\frac {1}{2}e_{y}^{2}\tag{5a}\\ \dot {V} < &0,\quad \forall e_{y} \ne 0 \\ \mathop {lim}\limits _{\left |{ {e_{y}} }\right |\to \infty }~V=&\infty\tag{5b}\end{align*}
\begin{equation*} \dot {V}\le -\alpha V^{1/2}\tag{5c}\end{equation*}
\begin{equation*} t_{f} \le -\frac {2V^{1/2}(0)}{\alpha }\tag{5d}\end{equation*}
\begin{equation*} \dot {V}\le -e_{y} \dot {e}_{y}\tag{5e}\end{equation*}
\begin{align*} \dot {V}\le&-e_{y} (A_{t21} e_{1} +A_{t22} e_{y})-\rho \left |{ {e_{y}} }\right | \tag{5f}\\ < &-\left |{ {e_{y}} }\right |(\rho -\left |{ {A_{t21} e_{1} +A_{t22} e_{y}} }\right |) \\ \dot {V}\le&-\frac {\alpha }{\sqrt {2}}\left |{ {e_{y}} }\right |\tag{5g}\end{align*}
\begin{equation*} \left |{ {A_{t21} e_{1} +A_{t22} e_{y}} }\right |\le L_{b}\tag{5h}\end{equation*}
\begin{equation*} \rho =L_{b} +\frac {\alpha }{\sqrt {2}}\tag{5i}\end{equation*}
C. Adaptation Algorithm Based on the MIT Rule
This subsection introduces the utilization of an MIT adaptive rule that does not require system model parameters. Here, estimated coefficients and MIT rules were used to update feedback gain. The parameter
The equations for the cost function \begin{align*} \min \,J(\hat {\theta }(k),k)=&\frac {1}{2}\sum \limits _{i=1}^{k} {\lambda ^{k-i}(y(i)-\phi (i)\hat {\theta }(k))^{2}}\tag{6a}\\ \hat {\theta }(k)=&\hat {\theta }(k-1)+L_{g} (k)(y(k)-\phi (k)\hat {\theta }(k-1)) \\ \tag{6b}\\ L_{g} (k)=&P(k-1)\phi (k)(\lambda +\phi ^{T}(k)P(k-1)\phi (k))^{-1} \\ \tag{6c}\\ P(k)=&(I-L_{g} (k)\phi ^{T}(k))P(k-1)/\lambda\tag{6d}\end{align*}
\begin{align*} J_{e}=&\frac {1}{2}e_{y}^{2},\quad J_{p} =\frac {1}{2}e_{p}^{2}\tag{7a}\\ \frac {d\rho }{dt}=&-\gamma _{e} \frac {dJ_{e}}{d\rho }=-\gamma _{e} e_{y} \frac {\partial e_{y}}{\partial \rho }=-\gamma _{e} e_{y} \hat {c}_{e} \\ \frac {d\rho }{dt}=&-\gamma _{p} \frac {dJ_{p}}{d\rho }=-\gamma _{p} e_{p} \frac {\partial e_{p}}{\partial \rho }=-\gamma _{p} e_{p} \hat {c}_{p}\tag{7b}\end{align*}
\begin{align*} \frac {d\rho }{dt}=&-w\gamma _{e} e_{y} \hat {c}_{e} -(1-w)\gamma _{p} e_{p} \hat {c}_{p}\tag{8a}\\ \rho=&\int _{0}^{t} \left ({{-w\gamma _{e} e_{y} \hat {c}_{e} -(1-w)\gamma _{p} e_{p} \hat {c}_{p}} }\right)dt\tag{8b}\\ w=&\begin{cases} \displaystyle 1, & \left |{ {e_{y}} }\right |\ge \varepsilon \\ \displaystyle 0, & \left |{ {e_{y}} }\right | < \varepsilon \\ \displaystyle \end{cases}\tag{8c}\end{align*}
D. Linear Model Prediction-Based Environment Sensor Fault Detection
This subsection introduces the linear model prediction algorithm for environmental sensor fault detection. A predictive and accumulative method is used to derive the feasible boundary of the current measured state. The same longitudinal driving kinematic model in equations (1a) – (1c) has been used. The discretized kinematic model is used for prediction and is given as follows: \begin{align*} x(k+1)=&A_{d} x(k)+B_{d} u(k)\tag{9a}\\ A_{d}=&\left [{ {{\begin{array}{cc} 0 &\quad {\Delta t} \\ 0 &\quad 0 \\ \end{array}}} }\right]\!,\quad B_{d} =\left [{ {{\begin{array}{c} 0 \\ {\Delta t} \\ \end{array}}} }\right]\tag{9b}\end{align*}
\begin{equation*} x(k+N)=A_{d}^{N}x(k)+\sum \limits _{i=0}^{N-1} {A_{d}^{i}B_{d} u(k+N-1-i)}\tag{9c}\end{equation*}
In order to consider the predicted input term \begin{align*} x_{u} (k+N)=&A_{d}^{N}x(k)+\sum \limits _{i=0}^{N-1} {A_{d}^{i}B_{d} u(k)} \\&+\,\sum \limits _{i=0}^{N-2}{(N-i-1)A_{d}^{i}B_{d} \Delta u_{u}} \tag{9d}\\ x_{l} (k+N)=&A_{d}^{N}x(k)+\sum \limits _{i=0}^{N-1} {A_{d}^{i}B_{d} u(k)} \\&+\,\sum \limits _{i=0}^{N-2} {(N-i-1)A_{d}^{i}B_{d} \Delta u_{l}} \tag{9e}\\ \Delta u_{u}=&u(k)+Jerk\cdot \Delta t, \\ \Delta u_{l}=&u(k)-Jerk\cdot \Delta t\tag{9f}\end{align*}
\begin{align*} F_{1,i}=&\begin{cases} \displaystyle 1, & x_{1,k-i,u} (k) < x_{1,k} \,\,or\,\,x_{1,k-i,l} (k) > x_{1,k} \\ \displaystyle 0, & x_{1,k-i,u} (k)\ge x_{1,k} \ge x_{1,k-i,l} \end{cases} \\ \tag{10a}\\ F_{2,i}=&\begin{cases} \displaystyle 1, & x_{2,k-i,u} (k) < x_{2,k} \,\,or\,\,x_{2,k-i,l} (k) > x_{2,k} \\ \displaystyle 0, & x_{2,k-i,u} (k)\ge x_{2,k} \ge x_{2,k-i,l} \end{cases} \\ \tag{10b}\\ F_{s}=&\left ({{\sum \limits _{i=1}^{N} {(F_{1,i} +F_{2,i})}} }\right)/N\tag{10c}\end{align*}
Concept of linear model prediction algorithm & Environment sensor fault detection method using predicted state.
The final fault index is derived using the threshold approach as follows:\begin{align*} F_{x} =\begin{cases} \displaystyle 1 & (F_{s} \ge Th_{s}) \\ \displaystyle 0 & (F_{s} < Th_{s}) \\ \displaystyle \end{cases}\tag{10d}\end{align*}
E. Vehicle Test Results of Fault Detection Algorithm
This subsection introduces vehicle test scenarios, the test environment, and test results. Table 2 summarizes the proposed fault detection algorithm for the environment sensor and the vehicle chassis sensor. The designed fault scenario is distinguished as follows: Three scenarios were designed as shown in Table 2. A square wave fault was used with different magnitudes for acceleration sensors. Faults with the environmental sensors are designed as fault for
A vehicle test situation is designed as shown Fig. 5 to utilize equation (1c) in the kinematic model. The parameters for adaptive sliding mode observer and linear prediction algorithm are described in Table 3.
In Table 2, some results are not depicted in this paper. Some results [14], [38], [39] are not in this paper but get similar results as shown here. The omitted graphs are replaced by the resulting graph in the reference paper.
Two small and large fault signals are applied to acceleration information to evaluate the performance of the adaptation algorithm for the sliding mode observer. The results from applying a large fault are shown in Fig. 6 for comparison. The results from applying a small fault are shown in reference papers [14], [39]. Fig. 6 (a) shows comparison of results of applied and reconstructed fault between the timestamps 25 sec ~ 35 sec. From the applied fault plot, it can be observed that output error results are large. A switch in the weighting factor to 1 from 0 can be seen as well. The parameter
reconstruction and detection was checked and is secure (Fig. 6 (a) and (g)). A large value is seen because it is the initial convergence of observer output error. A high adaptive gain can be defined to ensure initial convergence performance, but the results are over a relatively large adjustment of parameters and can lead to unreasonable results. Fig. 7 shows the fault detection results for environment sensors based on the linear prediction method. Three test scenarios were considered in Table 2 for reasonable performance evaluation: clearance fault, relative velocity fault, and simultaneous faults in both sensors. The square wave fault was induced at the 20 second mark. The fault-sum-average index
Vehicle Actuator System Model
The test vehicle performs longitudinal control by applying the desired longitudinal acceleration commands to the vehicle gateway. The gateway connects the lower controller and the vehicle system through a CAN communication platform to achieve vehicle behavior that follows the desired command. As depicted in Fig. 8, the vehicle actuator system can be defined as a dynamic system consisting of a gateway and a vehicle system. The dotted line in Fig. 8 represents a schematic diagram of an actual vehicle actuator system. For the actuator system, the desired acceleration command is the input, and the actual acceleration of the vehicle is the output. Consider the case of an emergency braking scenario. The emergency control corresponds to a command is continuously applied to the gateway. e.g., negative acceleration. The vehicle then decelerates while tracking the braking input command.
From a vehicle control perspective, immediate responses from the actuator system are critical in ensuring that the desired motion planning gives an acceptable performance. An ideal actuator system outputs a vehicle acceleration response instantly when an acceleration input command is given. However, response delays exist in realistic vehicle actuator systems. This is due to the vehicle’s inherent characteristics, such as body weight and powertrain performance. These actuator delays lead to an accumulation of error within the control objective, resulting in an overshoot response to compensate for the error. Therefore, actuator response characteristics need to be accounted for to reduce the disparity between the desired and actual performance.
The system’s response characteristics can be captured in a simple dynamic model. The First Order Plus Dead Time (FOPDT) model can aptly define the vehicle actuator system as follows:\begin{equation*} \dot {a}_{x} (t)=-\frac {1}{\tau }a_{x} (t)+\frac {1}{\tau }a_{x,des} (t-t_{d})\tag{11}\end{equation*}
\begin{equation*} G(s)=\frac {a_{x} (s)}{a_{x,des} (s)}=\frac {e^{-t_{d} s}}{1+\tau s}\tag{12}\end{equation*}
Fig. 9 shows a comparison of the step input responses between the actual vehicle and a simulation based on the FOPDT model with the estimated parameters. The comparison shows that the FOPDT model has a satisfactory characterization of the vehicle response with the estimated parameters. When a step command input was given, the Root Mean Squared Error (RMSE) value between the actual and the simulated acceleration results was less than 0.12 m/s2. Fig. 9 shows that the FOPDT model with the appropriate parameters can accurately describe the actuator response’s overall characteristics. The corresponding parameters for the acceleration and braking scenario were found to be
Fail-Safe Emergency Control – Adaptive Sliding Mode Control Based (Longitudinal, Module in Vehicle)
A. Overall Hardware Structure
The hardware concept schematic of the autonomous vehicle controllers (PC, Autobox), inclusive of fail-safe algorithms, is depicted in Fig. 10. The fail-safe module and the perception, decision and control algorithms in the upper controller, were configured under typical environmental circumstances, in consideration of the autonomous vehicle’s hardware structure. If the fault detection module detects a fault and no driver intervention is given, the last available information is used to predict and control the system. The drive-able path information received from the upper controller is utilized in two ways: in the lateral direction and the longitudinal direction. In the lateral direction, the dead reckoning algorithm utilizes the distance information to calculate the appropriate steering angle. Lateral control only utilizes the vehicle chassis information. This algorithm uses the last information (desired path) available from the upper controller to follow the desired path using the DR method. In the longitudinal direction, reference target building and sliding mode control-based deceleration algorithms are executed. Longitudinal control works by calculating the reference model based on the received drive-able path information. Information is delivered to the lower controller in real-time, and the control constantly operates in the vehicle. The entire module is composed of a fault detection portion that detects fault within the total module. The module is comprised of a fault detection part that classifies the fault and a control model that controls deceleration with limited information.
B. Reference Deceleration Model Rebuilding and Filtering
The typical reference deceleration model is made from general driver deceleration data [24]. The typical model is described in Fig. 10 under the fail-safe control portion. The model considers driver safety and ride comfort. A first-integrated velocity model and a second-integrated station model are used to construct an algorithm for stopping at safe distances. Images and formulas for the longitudinal acceleration, the longitudinal velocity, and the longitudinal distance models are illustrated in Fig. 11. Here,
This paper has adopted a new method to rebuild the reference model offline, according to the vehicle’s velocity. A reference model for each velocity was first created. Next, the Carsim simulator was used to evaluate the reference model’s three main index types. In this paper, the proposed indexes regarding safety and comfort are described in two parts. Under safety, the indexes utilized are the time to collision inverse (TTC inv) index and longitudinal warning index. Under comfort, the indexes utilized are pitch, pitch rate, and vertical acceleration. More detail regarding the safety indexes can be found in [25], [26]. The index proposed here is shown in Fig. 12. For Comfort indexes, this study used the passenger’s ride comfort data and indexes from a related paper [27]. In [27], jerk value and vertical acceleration are proposed as riding comfort indexes. Other indexes utilized in [27] are shown in Fig. 13. Table 4 summarizes the indexes regarding safety and comfort. For this paper, a new index was developed through a combination of the indexes investigated.
C. New Index-Based Reference Model Rebuilding and Filtering
This section introduces a methodology for filtering out improper reference models using the newly proposed indexes. Fig. 11 shows an example of an improper reference model [24]. Reference target models were first generated according to the velocity at the moment of vehicle failure. Filtering of these models using safety and omfort indexes was carried out to generate a final filtered reference model.
Through a Carsim simulation test, the pitch, vertical acceleration, and jerk value could be derived. In the Carsim simulation, each test vehicle was driven with the velocity and station of each reference model. A total of 78 derived cases were carried out. The Safety and Comfort indexes used and referred to are presented in Table 4, and the derived results are shown in Figs. 14 to 16. The guidelines for filtering are as follows: 1. overshot of indexes occurs when its value is not within a reasonable range and the passenger is deemed to be uncomfortable [27]. The filtered referenced model used for the vehicle tests is shown in Fig. 17. The model was constructed for vehicle tests of velocities between 30-50 kph, shown in Fig. 17
D. Sliding Mode Control - Adaptive Converge Time Gain and Stability
This subsection proposes a control methodology applicable for the fail-safe control portion in an autonomous vehicle. Reasons for the usage of the sliding mode control method and the adaptive convergence time method are detailed in this section. To ensure a strict level of safety with the designed conditions, knowledge regarding response time to a failure is vital. Therefore, the convergence time of the control error was determined mathematically, and the corresponding control values were designed to be changed based on the predetermined convergence time. Thus, this particular method was applied to the test vehicle within the fail-safe control portion. This subsection demonstrates the stability of the longitudinal emergency braking system. The proof of stability is also shown in this subsection. In the fail-safe control part, the longitudinal model is a nonlinear system without any disturbance and uncertainties. The model can be described as follows: \begin{equation*} \dot {x}(t)=f(x,t)+g(x,t)\cdot u(t)\tag{13a}\end{equation*}
From section IV (vehicle actuator module system identification), it was determined that the actuator system followed a first-order delay system [35]–[37]. We define the longitudinal model and error dynamics as follows:\begin{align*} \dot {x}_{long} (t)&=Ax(t)+Bu_{long} (t)\tag{13b}\\ A&=\left [{ {{\begin{array}{ccc} 0 &\quad 1 &\quad 0 \\ 0 &\quad 0 &\quad 1 \\ 0 &\quad 0 &\quad {-1/\tau _{ax}} \\ \end{array}}} }\right]\!,\quad B=\left [{ {{\begin{array}{c} 0 \\ 0 \\ {1/\tau _{ax}} \\ \end{array}}} }\right]\tag{13c}\\ e_{1} (t)&=x_{1,ref} (t)-x_{1} (t)=s_{x,ref} (t)-s_{x} (t)~ \\ e_{2} (t)&=x_{2,ref} (t)-x_{2} (t)=v_{x,ref} (t)-v_{x} (t)~ \\ e_{3} (t)&=x_{3,ref} (t)-x_{3} (t)=a_{x,ref} (t)-a_{x} (t)~\tag{14}\end{align*}
\begin{align*} \dot {e}_{1} (t)=&\dot {x}_{1,ref} (t)-\dot {x}_{1} (t)=e_{2} (t)+\dot {x}_{1,ref} (t)-x_{2,ref} (t)~ \\ \dot {e}_{2} (t)=&\dot {x}_{2,ref} (t)-\dot {x}_{2} (t)=e_{3} (t)+\dot {x}_{2,ref} (t)-x_{3,ref} (t)~ \\ \dot {e}_{3} (t)=&-\frac {e_{3} (t)}{\tau _{ax}}-\frac {k}{\tau _{ax} }u_{long} +\dot {x}_{3,ref} (t)-\left({-\frac {x_{3,ref} (t)}{\tau _{ax}}}\right) \\\tag{15}\end{align*}
\begin{align*} s(t)=&e_{1} (t)-\lambda _{1} e_{3} (t)~ \\ \dot {s}(t)=&\dot {e}_{1} (t)-\lambda _{1} \dot {e}_{3} (t)\tag{16}\end{align*}
\begin{align*} \dot {s}(t)=&\dot {e}_{1} (t)+\lambda _{1} \dot {e}_{3} (t)~ \\=&\underbrace {e_{2} (t)+\dot {x}_{1,ref} (t)-\dot {x}_{3,ref} (t)}_{\dot {e}_{1} (t)} \\&+\,\lambda _{1} \cdot \underbrace {\left [{ {\begin{array}{l} -\displaystyle \frac {1}{\tau _{ax}}\cdot e_{3} (t)-\displaystyle \frac {k}{\tau _{ax}}\cdot u_{long} (t)~\\ +x_{3,ref} (t)+\displaystyle \frac {1}{\tau _{ax}}\cdot x_{3,ref} (t)~\\ \end{array}} }\right]}_{\dot {e}_{3} (t)}\tag{17}\end{align*}
\begin{equation*} V(t)=\frac {1}{2}s(t)^{2}\tag{18}\end{equation*}
\begin{align*} \dot {V}(t)=&s(t)\dot {s}(t)=s(t)\left [{ {\dot {e}_{1} (t)+\lambda _{1} \dot {e}_{3} (t)} }\right] \\=&\left |{ {s(t)} }\right |\cdot \left [{ {\begin{array}{l} \underbrace {\begin{array}{l} e_{2} (t)+\dot {x}_{1,ref} (t)-\dot {x}_{2,ref} (t)-\displaystyle \frac {\lambda _{1} }{\tau _{ax}}\cdot \\ e_{3} (t)+\lambda _{1} x_{3,ref} (t)+\displaystyle \frac {\lambda _{1}}{\tau _{ax}}\cdot x_{3,ref} (t)~\\ \end{array}}_{R_{0} \left [{ {x(t)} }\right]} \\ +\left ({{\underbrace {-\frac {k\lambda _{1}}{\tau _{ax}}}_{Q(\lambda _{1})}\cdot u_{long} (t)} }\right) \\ \end{array}} }\right] \\\tag{19}\end{align*}
\begin{align*} \underbrace {\left |{ {\begin{array}{l} e_{2} (t)+\dot {x}_{1,ref} (t)-\dot {x}_{2,ref} (t)-\displaystyle \frac {\lambda _{1} }{\tau _{ax}}\cdot \\ e_{3} (t)+\lambda _{1} x_{3,ref} (t)+\displaystyle \frac {\lambda _{1}}{\tau _{ax}}\cdot x_{3,ref} (t)~\\ \end{array}} }\right |}_{R_{0} [x(t)] }\le R\left [{ {x(t)} }\right]\tag{20}\end{align*}
\begin{align*} \dot {V}(t)\le&\left |{ {s(t)} }\right |\cdot R\left [{ {x(t)} }\right]-\left |{ {s(t)} }\right |\cdot \left ({{-\frac {k\lambda _{1}}{\tau _{ax}}\cdot u_{long} (t)} }\right) \\=&\left |{ {s(t)} }\right |\cdot Q(\lambda _{1})\cdot \left [{ {\frac {R\left [{ {x(t)} }\right]}{Q(\lambda _{1})}} }\right]+Q(\lambda _{1})\cdot s(t)\cdot u_{long} (t) \\\tag{21}\end{align*}
From equation (19) and (20), one can obtain a longitudinal control input and an equivalent control input as follows:\begin{align*} u_{long} (t)=&-\rho _{m} sgn(s(t)) \\ u_{eq} (t)=&-\rho _{m} sgn(s(t))-K\cdot sgn(s(t)) \\ \rho _{m}=&\left [{ {\frac {R\left [{ {x(t)} }\right]}{Q(\lambda _{1})}+\alpha } }\right]\!,\quad K > 0,~\alpha > 0\tag{22}\end{align*}
\begin{align*} \dot {V}(t)\le&\left |{ {s(t)} }\right |\cdot R\left [{ {x(t)} }\right]-s(t)[-Q(\lambda _{1})]\cdot u_{eq} (t)~ \\=&\left |{ {s(t)} }\right |\cdot R\left [{ {x(t)} }\right]-s(t)\cdot R\left [{ {x(t)} }\right]\cdot sgn[s(t)] \\&-\,Q(\lambda _{1})\cdot K\cdot \left |{ {s(t)} }\right |-\alpha \cdot K\cdot \left |{ {s(t)} }\right | \\=&-Q(\lambda _{1})K\sqrt {2V(t)} -\alpha K\sqrt {2V(t)}\tag{23}\end{align*}
\begin{equation*} W(t):=\sqrt {2V\{s[e(t)]\}}\tag{24}\end{equation*}
\begin{align*} \frac {dW(t)}{dt}=&\frac {2\dot {V}(t)}{2\sqrt {2V(t)}}\le \frac {-2\sqrt {2V(t)} Q(\lambda _{1})\cdot [K+\alpha]}{2\sqrt {2V(t)}} \\=&-Q(\lambda _{1})\cdot [K+\alpha]\tag{25}\end{align*}
\begin{equation*} \dot {V}(t) < -\alpha V^{1/2}(t),\quad \alpha > 0\tag{26}\end{equation*}
\begin{align*} t_{converge}\le&\frac {2V^{1/2}(t_{0})}{\alpha }=t_{bound}\tag{27a}\\ V^{1/2}(t_{0})=&\frac {1}{2}s(t_{0})=\frac {1}{2}[e_{1} (t_{0})+\lambda _{1} e_{3} (t_{0})]\tag{27b}\end{align*}
\begin{align*} u_{eq} (t)=&\underbrace {\left [{ {R\left [{ {x(t)} }\right]+\frac {\alpha }{\sqrt {2}}} }\right]}_{\rho _{m}}\cdot sgn[s(t)]-K\cdot sgn[s(t)] \\ \tag{28a}\\ R_{0} [x(t)]=&\left ({{\begin{array}{l} e_{2} (t)+\dot {x}_{1,ref} (t)-\dot {x}_{2,ref} (t)-\displaystyle \frac {\lambda _{1} }{\tau _{ax}}\cdot \\ e_{3} (t)+\lambda _{1} x_{3,ref} (t)+\displaystyle \frac {\lambda _{1}}{\tau _{ax}}\cdot x_{3,ref} (t)~\\ \end{array}} }\right)\tag{28b}\end{align*}
\begin{align*} R_{0} [x(t_{0})]=&\left ({{\begin{array}{l} e_{2} (t_{0})+\dot {x}_{1,ref} (t_{0})-\dot {x}_{2,ref} (t_{0})-\displaystyle \frac {\lambda _{1}}{\tau _{ax}}\cdot \\ e_{3} (t_{0})+\lambda _{1} x_{3,ref} (t_{0})+\displaystyle \frac {\lambda _{1}}{\tau _{ax}}\cdot x_{3,ref} (t_{0}) \\ \end{array}} }\right) \\\le&R[x(t)]\tag{28c}\end{align*}
\begin{equation*} t_{converge} =\frac {2V^{1/2}(t_{0})}{\alpha }=t_{bound}\tag{29}\end{equation*}
\begin{equation*} \rho _{m} \ge R\left [{ {x_{t} (t)} }\right]+\frac {\alpha }{\sqrt {2}}+\eta\tag{30}\end{equation*}
E. Vehicle Validation
The proposed algorithm has been validated through actual vehicle tests on two different testbeds. The first testbed is located in Seoul National University (SNU), Gwanak-gu Seoul-si. A straight road test at the SNU campus beltway testbed was validated. The second testbed is located in the Future Mobility Technology Center (FMTC), Gyeonggi-do Siheung-si Seoul National University. On the FMTC testbed, we validated the system with test scenarios inclusive of curved roads. The testbed environment is depicted in Fig. 18 (Regions highlighted in orange show the main test area).
All velocity-specific (30 - 50 kph) test results are shown simultaneously in Fig. 19. The results for station and acceleration errors from designed velocity, ranging from 30 kph to 50 kph, are shown in Fig. 19. Of the two kinds of errors show, most have values within an acceptable range. Detailed error analysis is tabulated in Table 4. However, a high degree of variance in acceleration error can be observed. The right axis of Fig. 19 represents the gear state of the vehicle. A high degree of variance in acceleration error can be observed during a change in gear number. Further detailed and specific changes were obtained through additional experiments, shown in Fig. 20. The three error values exist between the range of −0.1445 and 0.9815. From Fig. 21, it can be observed that these values are within an acceptable range. The value of the lateral axis in Fig. 21 includes numerically meaningful but physically meaningless units. However, the station error has the highest value at around 0.3 m. These values mainly occurred upon switching off the autonomous driving mode. When this fail-safe control methodology was applied to the autonomous vehicle, the control input value changed to a constant negative value when vehicle velocity was close to zero. It is to be noted that the error values beyond 10 seconds in Fig. 20 and Fig. 21 do not hold any meaning. Overall, the vehicle experiments have shown that errors occur within reasonable boundaries.
Vehicle test result with gear value (15 cases, 5 error occurred final vehicle velocity-based case: 30~50 kph case; autonomous driving finished at orange color line).
The RoA figure shows a locally and asymptotically stable value that could be calculated using the Lyapunov function [5]. Fig. 22 contains two figures. The first background figure represents a contour, indicated mainly in yellow, which is a part of the set and satisfies
Error phase portrait of vehicle result in Lyapunov function value considered background phase.
Throughout the course of this study, our research team faced 2 major challenges with regards to obtaining data. The first challenge was in acquiring real AV fault data. This issue stands with many other previous studies as well. While these studies made assumptions regarding failures and their sizes, our research team overcame this problem through continuous vehicle testing. A large number of tests allowed us to obtain real fault data and determine failure types. The second major challenge was in obtaining vehicle data for SMO research. Obtaining accurate and precise vehicle data for the leading vehicle and the test vehicle proved to be difficult. This problem was mitigated through extensive use of data from the RT-Range.
Conclusion
In this paper, a newly designed fail-safe structure for autonomous vehicles has been proposed. The main contributions of this paper can be summarized into three parts. First, the framework of the fail-safe module for autonomous vehicles was introduced and implemented to an actual autonomous vehicle. Second, an adaptive sliding mode observer methodology for longitudinal fault detection about chassis sensors and environment sensors (lidar, radar) in autonomous vehicles was presented. An MIT rule-based adaptation rule to determine the magnitude of the observer injection term was proposed for the reconstruction of unknown faults. Third, fault types of autonomous vehicles are defined through vehicle experimentation. The designed logic has been applied to vehicles and operated to manage a fault situation. In the fail-safe control portion, fault was defined as any situation where a shutdown of the upper controller occurs, be it from continuous vibrations or large physical shocks. The lower controller conducts safety control using the last available scene information from the upper controller. The stability of the system was proved, and the proposed algorithm was investigated via actual vehicle tests. The test results have shown that the proposed algorithm was able to perform the desired vehicle actions successfully. Additionally, error variance has shown to converge within acceptable ranges. The vehicle tests have led to the discovery of a fault in autonomous vehicles’ upper controls.
Fault detection can occur in a number of cases, including, but not limited to, sensor performance degradations and adverse weather conditions. Decision tree-based pull-over controls will be the topic of future research in this field for our research team.
For future research, our research them plans to further develop the fail-safe module our automated vehicle. We plan to utilize a LiDAR point-wise de-noising approach to classify and filter out noise in adverse weather conditions. Our team also plans to develop a tolerant control scheme to cover fatal cases of autonomous vehicle faults, excluding actuator faults. Furthermore, an emergency pull-over algorithm for fail-safe systems corresponding to level-4 autonomous driving is currently being investigated, with emphasis on the control method for achieving a safe emergency pull-over maneuver.