Reconfiguration Control Design of UAV against Actuator Faults Based on Control Allocation Method

The paper concentrates on the problem of fault-tolerant control of UAV against actuator faults from the perspective of ﬂ ight control system architecture. Using backstepping control method and inverse optimization theory, the design of backstepping optimal control law was constructed. Based on the fault monitoring mechanism of vehicle management computer in the distributed ﬂ ight control and control allocation system, a fault-tolerant control design method was established in the case of multiple failure modes of the actuators, which compensates the in ﬂ uence caused by the failures. Finally, the e ﬀ ectiveness of the proposed strategy was veri ﬁ ed by numerical simulation.


Introduction
In recent years, XQ-58A "Valkyrie" developed by the United States has started to verify its combat technology with manned fighters F-15X and F-35. While emphasizing joint operations, it also puts forward low-cost development requirements. This medium-size UAV with low-cost attritable aircraft technology (LCAAT) is more affordable than the X47B stealth UAV. In 2020, Russia demonstrated a highspeed, unmanned loyal wingman, named "Thunder". Similar to XQ-58A, it is capable of partnering with manned fighters and serving as a wingman in a front-line attack and detecting and destroying enemy and air defense targets, as discussed elsewhere [1][2][3].
With the worldwide research and development of all kinds of unmanned systems, the application of manned/ unmanned aerial vehicles and unmanned swarm systems has been promoted. Unmanned systems are no longer synonymous with low cost and high risk but increasingly emphasize the following two aspects of technical requirements: (i) Low cost and affordable economic cost (ii) High fault tolerance and affordable security risks For the fighter or large reconnaissance and combat UAVs, its cost can be increased. Therefore, the architecture of multiredundant hardware design is often used to improve the fault-tolerant ability of the entire aircraft system, especially for the flight control system with high safety and reliability. However, for consumable and low-cost unmanned systems such as loyal wingman, the architecture based on redundant hardware design brings high cost, large volume, large weight, and small load, which is often not acceptable. However, it is still expected to minimize the impact of failure, that is, to have higher fault tolerance.
There are many fault diagnosis and fault-tolerant control methods for sensors and actuators of flight control system. As early as 1990, as discussed by Professor Frank [4], international authority on fault diagnosis of control system divided fault diagnosis methods into three categories: analytical model-based methods, signal process-based methods, and knowledge-based methods, which have been accepted by many scholars [5,6].
For the application of UAV flight control system, modelbased fault detection, isolation, and adjustment methods can be used to reconstruct low redundancy/no redundancy signals for sensor faults, and the fault tolerance problem of sensor signals can be solved by means of analytical redundancy, as discussed elsewhere [7][8][9][10][11].
But when the faults occur in the aircraft surface because of actuators, the detection and isolation method based on model cannot work, and it must be reconstructed through the control compensation of other surfaces to achieve fault tolerance. At present, many scholars have proposed adaptive control algorithms to solve various adaptive control problems in the case of surface failure, so as to achieve flight control, but the real-time performance of these algorithms is generally doubted, and a large number of algorithms need accurate mathematical models [12][13][14][15].
At present, the fault tolerant control towards actuator's faults is based on the results of fault detection [16]. Firstly, the faulty actuator should be isolated to avoid its fault spreading. Then, by adjusting the parameters of the controller or changing the structure of the controller, the stability and control performance of the system can be guaranteed. However, this method must be offline calculation of control law parameters required under various faults, and these parameters are stored in the flight control computer in advance. During the flight, according to the fault information obtained by fault detection, the matched control law parameters are selected to obtain the reconstructed flight control law. Since the parameters of the readjusted control law are designed off-line, the reconstructed flight control law can only tolerate the fault modes considered in advance, which limits the scalability of this method.
At the same time, we consider the possible failure modes of UAV actuators. At present, for small and medium-sized UAVs, electric actuators are basically used as the driving mechanism; and for large UAVs, with the continuous maturity of high-power electric actuators, the trend of replacing hydraulic actuators is also accelerating, due to the maintenance, pipeline, weight, leakage, and other problems caused by hydraulic actuators. Consider that there are two main failure modes of the electric actuators: nonoutput force and the output shaft stuck. From the perspective of surface control, the influence of the former is relatively small. And the most important thing is to realize fault detection, so that the fault can be found quickly. The second failure mode is trickier because it introduces additional unwanted moments that need to be balanced first in flight control.
In view of the multiple failure modes of UAV actuators, the strategy proposed in the paper adopts backstepping optimal control law and reconfiguration design based on a control allocation method to compensate the influence of faulty actuators, so as to achieve the goal of fault-tolerant control. The main contributions of the research are summarized as follows: (1) Different from the adaptive control method, as referred to References [12][13][14][15], based on the optimal control law and the fault diagnosis result, control allocation methods carry out the mode switch. It does not need to carry out complex online calculation through the optimization algorithm, so it will not affect the real-time performance (2) Compared with constructing off-line databases and adjusting the parameters or structure of controller, the reconfiguration control methods based on control allocation can compensate the influence of fault surfaces without adjusting the control law, thus extending its application scope (3) More importantly, the paper presents a systematic solution, not just for the control algorithms. Through the modular control structure, the fault detection, control law, and control allocation algorithm are organically integrated, and a relatively comprehensive solution is proposed for the reconfiguration control against actuator faults

System Description and Preliminaries
A UAV adopts a conventional layout, and the independently controlled surfaces include the following: left fully moving elevator, right fully moving elevator, aileron, and V-shaped rudder. Because the left and right fully moving elevator can be controlled independently; that is to say, in addition to the pitching moment produced by the same deflection, the rolling moment can also be produced by the differential deflection, which provides additional control moment in the lateral direction. At the same time, the application of V-shaped rudder also provides a supplement for the generation of pitching moment. Therefore, the characteristics of multicontrol surfaces of the UAV provide conditions for the compensation control of actuators faults. Firstly, the aircraft attitude dynamic equation in the body axial coordinate system is as follows: where α, β, and μ are, respectively, the angle of attack, sideslip angle, and roll angle and p, q, and r are, respectively, the pitch angle rate, yaw angle, rate and roll angle rate.
International Journal of Aerospace Engineering where m is mass, V is flight speed, γ is flight path angle, T is engine thrust, dT is thrust coefficient, and D, L, and Y are resistance force, lift force, and lateral force, respectively, as described below.
where S is wing area, q is kinetic pressure, and c d , c l , and c y are drag coefficient, lift coefficient, and lateral force coefficient, respectively.
Remark 1. The flight control system architecture of UAV can support the integrated design of actuator fault detection and flight control law. The following distributed system architecture in Figure 1 can be adopted [17][18][19]. The integrated VMC (vehicle management computer) realizes the fault detection of actuators, the calculation of flight control law, and the calculation of control allocation algorithm and finally forms the control instruction for each surface and transmits them to the SMART actuator located near the surface.
Remark 2. As shown in Figure 2, the integrated VMC will perform the fault detection, flight control law, and control allocation algorithm continually during the whole flight mission. The distributed system architecture, depending on the integrated design of VMC, can realize the reconfiguration control based on the control allocation method more conveniently, without affecting the servo control of the back-end actuator control loop in SMART. Meanwhile, it can be seen that after the introduction of control allocation module, the design of control law module in VMC does not need to be changed when some of actuators are faulty, which provides convenience for its application. 3 International Journal of Aerospace Engineering is cut off, and at this time, the actuator is in a state of nonoutput force. The paper does not make a comprehensive study of the specific fault detection strategy, only as the input conditions of the research.

Backstepping Optimal Controller Design
As we all know, the design of backstepping control law only considers the stability and convergence of the system and does not consider the optimal performance of the system [20,21]. For the optimal control of the system, it is usually to find an admissible control with a given performance index, and the target functional takes the minimum value. This kind of problem ultimately comes down to the solution of Hamilton-Jacobi-Bellman partial differential equation. Compared with linear systems, the difficulty of nonlinear systems lies in that the HJB equation to be solved is often nonexistent or nonunique, which becomes the main obstacle to optimal control of nonlinear systems. This problem can be solved by introducing the inverse optimization into the design of uncertain nonlinear control systems through the robust control Lyapunov function [22]. The inverse optimization problem does not minimize the control law of a given target functional, but with the Lyapunov function for the robust stability of the system to make the controller is designed to minimize a backlog of objective functional, that is to say, first of all, get the control law, and then determine its optimal value function, so as to determine the optimal controller.
3.1. Derive the Control Law. For second order nonlinear systems, Step 1) For the subsystem x 1 in equation (4), virtual control law is designed with x 2 as the control input.
Choose the following form of control law: Construct the following control Lyapunov function: Substituting into x 2 = x des 2 , its differential is When ðϕðx 1 Þ − ψðx 1 ÞÞx 1 < 0 is satisfied, its negative definite can be guaranteed. (Step 2) Introduce an error variable: So the system equation (4) is equal to Construct the following control Lyapunov function: where Fðx 1 Þ is any effective control Lyapunov function of subsystem x 1 , which means that when x 2 = x des 2 is satisfied, where Uðx 1 Þ is positive definite.
the item x 1 in the second items can be cancelled out.
And then, the Lyapunov function of the system is In order to make _ V negative definite, the control law can be selected finally: When

Determine Its Optimal Value Function
Lemma 4 (see [23,24]). Consider the nonlinear system: where x ∈ R n is the state variable and u ∈ R m is the control input.

International Journal of Aerospace Engineering
And assume gðxÞ ≠ 0. For a given feedback control uðxÞ, define the optimal value function: where QðxÞ is positive definite and RðxÞ is symmetric positive definite.
The optimal value function JðxÞ can be selected as the Lyapunov function VðxÞ. Then, the optimal control can be obtained: Using Hamilton-Jacobi-Bellman equation it can be obtained inversely: Based on Lemma 4, comparing the error system shown in equation (8) with the second order system shown in equation (4), the following can be obtained: And Therefore, the optimal control performance parameters that can be achieved by the control law (17) are obtained by substituting formulas (20) and (21): If virtual control law x des 2 = −ψðx 1 Þ = −k 1 x 1 is selected, then, the linear control law of the system (4) is In order to QðxÞ be positive definite and the cost function to be a meaningful optimal performance, it must be guaranteed that Therefore, combining (12), (14), and (23), we can get: when 0 < 2k 1 < k 2 is satisfied; the origin of the system can be given global asymptotic stability.
And minimize the following optimal value function: Similarly, if virtual control law x des 2 = −ψðx 1 − rÞ = −k 1 ðx 1 − rÞ is selected, then, the linear control law of the system (4) is Defining x − r = e, the tracking problem of the system at y = r can be transformed into the global asymptotic stability problem of the system at x − r, and the results of equations (23) and (24) above can be applied. Then, the system obtains global asymptotic stability at y = r.
And minimize the following optimal value function: Therefore, with the angular acceleration after coordinate transformation as the control input, a three-axis decoupled 5 International Journal of Aerospace Engineering second-order nonlinear system can be obtained, as shown below: It can be found that the structures of equations (30), (31), (32), and (4) are exactly the same.
Comparing equations (4) and (30), the following can be found: Therefore, it is easy to obtain the backstepping optimal controller to track control instructions r = α cmd : At the same time, the following optimal value function can be minimized: Similarly, comparing equations (4) and (31), the following can be found: Therefore, it is easy to obtain the backstepping optimal controller to track control instructions r = β cmd : At the same time, the following optimal value function can be minimized: Similarly, comparing equations (4) and (32), the following can be found: Therefore, it is easy to obtain the backstepping optimal controller to track control instructions r = μ cmd : At the same time, the following optimal value function can be minimized: Through formulas (34), (38), and (42), the angular acceleration ω d ′ des = ½p dcmd ′ q dcmd ′ r dcmd ′ T required to achieve attitude maneuver can be obtained. After integral operation and inverse coordinate transformation, the following formula can be obtained: To this end, according to the expected flight attitude requirements, the angular velocity required to complete the maneuver can be obtained by using formula (45), and then, 6 International Journal of Aerospace Engineering through the following formula, it can be converted into the torque coefficient required by the maneuver, which is also the input of control allocation problem in the next section.
where T bd is the coordinate transformation matrix converted to the body shafting, I is the rotational inertia matrix of the aircraft, b is the wingspan length, and c is the average aerodynamic chord length.

Fault Tolerant Control Design Based on Control Allocation
As mentioned above, as UAV plays an increasingly important role in the air combat system, the tasks it undertakes become more and more complex, and it needs to bear certain impact of faults. For the flight control system, the UAV has the characteristics of multicontrol surfaces, especially the introduction of multiredundant control surfaces, which provides a prerequisite for the reconfiguration control of UAV against actuator failures. As shown in Figure 2, when the fault detection module in VMC finds that there is any fault mode toward actuators, the control allocation module in VMC will switch from the normal state to the specific fault state. Then, the fault tolerant control strategy based on the control allocation works, and the control instructions solved by the control allocation module will reflect the influence of the actuator fault mode.

Nonlinear Control Allocation Design Considerations. For the nonlinear system, assume its motion equation is
where f : R n × R k ↦ R n and g : R n × R m ↦ R k are nonlinear and k < m. It can be written as where B v ∈ R n×k , f , and g have the same mapping form as above. Introduce virtual controls: where v ∈ R k ; the state equation of the system can be rewritten as Thus, nonlinear system (48) can be converted into equations (49) and (50), which can also use the standard form of the control allocation problem. It can be seen that with linear control allocation problem the difference is that mapping is a nonlinear form.
In the flight control system, the control allocation strategy is required to be solved in real time, but the nonlinear control allocation problem cannot be solved in real time. Therefore, one of the methods to solve this problem is to use the method of local approximation mapping, using linear mapping to approximate nonlinear mapping.
Through local Taylor form expansion, linearization at point u 0 can be obtained: So introduce a linear mapping BðxÞ = ð∂g/∂uÞðx, u 0 Þ; the nonlinear control allocation problem (49) can be converted to the linear control allocation problem: When a nonlinear control allocation problem is transformed into a linear problem, it can be solved in many ways. Optimization algorithms based on linear programming, with its lower operation cost and simplex method, have been widely studied in the process of solving control allocation problems [25,26]. According to the authors' previous research results [27], the allocation algorithm based on linear programming is directly applied here.
Combined with formulas (52) and (53), the control allocation problem is described as follows.
For the known control efficiency matrix B and the given virtual control quantity vðtÞ, the feasible instruction uðtÞ of the control surface is determined by considering the position limit and rate limit uðtÞ ≤ uðtÞ ≤ uðtÞ of the control surfaces, so that BuðtÞ = vðtÞ is satisfied.
Thus, the linear programming problem can be solved with the following matrix: The corresponding optimization objectives are In the three-dimensional control allocation problem, the matrix M has only two rows, i.e., and the coefficient matrix A = M ⋅ B of constraint conditions 7 International Journal of Aerospace Engineering also has only two rows, whose rank is 2. Therefore, the solution process is relatively simple.
The application of the method in the flight control system is toward to v = ðc l c m c n Þ T = ð1/S qÞ diag ðb, c, bÞ −1 T, and the specific process can be summarized as the following steps: (Step 1) According to the theory of nonlinear control allocation, the control efficiency matrix under nonlinear condition should be calculated from the expression of aerodynamic derivative C M = ðC l C m C n Þ T : δ 0 can be selected as the control input uðt − TÞ of previous sampling time or a fixed point, such as δ 0 = 0.

(Step 2) Then the virtual control input after Taylor linear expansion is calculated:
Step 3) Using the control allocation method of formula (54) to solve the above formula, the solution of formula (58) can be obtained.

Fault Tolerant Control Design for Actuators.
This section describes the fault-tolerant control scheme according to the control allocation design method described in Section 4.1 for the two possible fault modes of the electric actuators described in Figure 1. Fault mode 1 is nonoutput force of the actuator, and the other fault mode is output shaft stuck of the actuator.
The nonlinear system equations of an aircraft under normal conditions are described as follows: where gðx, uÞ represents the control efficiency of surfaces, which will change when the surfaces fails due to actuators. The following equation describes the nonlinear system equation of the aircraft in the case of failure: where g f ðx, u f Þ represents the control efficiency matrix in the case of failure and u f ∈ R n represents the control input. For different fault modes, equation (60) corresponds to different function forms.
According to the requirements of reconfiguration control, control allocation needs to achieve: For the control allocation problem under normal conditions, the allocation objective is The nonlinear control allocation problem (62) can be converted to the linear control allocation problem: Similarly, for the control allocation problem under faulty conditions, the allocation objective is After linearization, The control efficiency matrix B f ðxÞ = ð∂g f /∂uÞðx, u 0 Þ is introduced, and the linear control allocation problem similarly is as follows: When the fault modes of the actuators occur, the change of aerodynamic coefficient, weight, and center of gravity of the aircraft can be ignored. On this premise, the change of control capability caused by the faulty actuators only comes from the change of control input.
Meanwhile, when the linearized equilibrium points are u 0 = 0, the following formula can be obtained: Thus, compare formulas (64) and (67), and the requirements of reconfiguration control (61) can be converted into 4.2.1. Fault Mode 1: Output Shaft Stuck. The output shaft stuck of actuator refers to the fault mode in which the actuator is in a fixed position due to motor blocking or other mechanical reasons, which can be realized by the fault detection module of VMC. In this fault mode, the control surface connected to the actuator will also be stuck in a certain position. In this case, its deflection will not only fail to produce the desired control effect but also to produce unwanted additional forces and unwanted additional torques. Therefore, 8 International Journal of Aerospace Engineering the influence of the stuck actuator must be offset in the control allocation. When the shaft of actuator is stuck, the motion equation of the aircraft does not change, as shown in equation (60).
According to formula (66), the solution of control allocation for reconstruction control can be obtained: where assuming that the aircraft has m surfaces, u * f = ðδ 1 , f is the remaining control efficiency matrix after removing the stuck surface in B f , b j is the control efficiency coefficient of the stuck surface, and δ f is the stuck position of the faulty surface.
After further derivation, the following can be obtained: Among them, the first item redistributes the torque required by the original surface to the remaining effective surfaces, and the second item is used to offset the additional influence caused by the stuck surface, so that the actual output of the stuck rudder surface is 4.2.2. Fault Mode 2: Nonoutput Force. Nonoutput force of actuator refers to the failure mode in which the clutch cuts off the current control output due to various reasons, and then, the actuator has no output force. It can be realized by the fault detection module of VMC. In this fault mode, the surface connected with the actuator will also be in a loosely floating state, that is, the control efficiency of the surface is zero. Then, it can be assumed that the loosely floating surface will not produce any aerodynamic and aerodynamic torque; that is, the effect of the control surface on the control input of the aircraft is zero. Similar to fault mode 1, when the actuator has no output force, the state equation of the aircraft does not change. That is, only the control input of the loosely floating surface is zero, and the loss of control capability caused by the loss of control capability needs to be compensated by the remaining effective control surfaces.
Similarly, assuming u * f is the input of the remaining surface after removing the loosely-floating one, then, where, assuming that the aircraft has m surfaces, u * f = ðδ 1 , δ 2 , ⋯δ j ⋯ δ m−1 Þ ∈ R m−1 , B * f is the remaining control efficiency matrix after removing the loosely floating surface in Further derivation can be obtained: Then, the actual output of the surfaces when the actuator has no output force can be obtained:

International Journal of Aerospace Engineering
In order to prove the correctness of the control law, the numerical simulation is carried out under the comparison of the two cases. The first set is three-axis maneuver without actuator failures, and the control objectives are α cmd = 25 ∘ , β cmd = 0 ∘ , μ cmd = 25 ∘ .
The fault tolerant control of three-axis maneuver under the condition of actuator faults is the second set. As above, the same simulated UAV object is used, and the initial trim conditions are the same. In the simulation, the parameters of backstepping optimal control law do not change, but the allocation algorithm switches from normal modes to the fault modes in control allocation. Also consider the threeaxis maneuver; the control objectives are α cmd = 25°, β cmd = 0°, μ cmd = 25°.
Firstly, the deflection characteristics of the four surfaces of the UAV, namely, left elevator, right elevator, aileron, and rudder, are limited as follows: It should be added here that the surface is stuck beyond the deflection limit due to mechanical reasons, and VMC cannot give the correct diagnosis through the fault detection module. The control allocation method may invalidate the input saturation strategy in these fault modes. And this fail-ure condition is not covered in the paper because it cannot be correctly diagnosed.

Simulation
Result. It can be seen from the simulation results that the backstepping optimal controller can achieve a good control effect. In the control channels of the three axes, the controlled quantity can meet the requirements of the control instruction within 5 s, and the response has no overshoot and no steady-state error, refer to Figure 3. The black dotted lines in Figure 4 are the upper and lower limits of surface deflection. This proves that it is feasible and correct to design flight control system using backstepping optimal controller. At the same time, it also shows that the control allocation method can realize the correct distribution of control instructions on each controllable surface.

Simulation of Fault Mode 1.
In the simulation, the electric actuator attached to the rudder is set to be stuck at 5°. Then, the rudder will also be stuck at 5°, and its deflection

12
International Journal of Aerospace Engineering will fail not only to produce the desired control effect but also to produce unwanted control effect.

Simulation of Fault Mode 2.
In the simulation, the electric actuator attached to the rudder is set to nonoutput force. Then, the rudder will also be in a loosely floating state. Then, it can be assumed that the loosely floating rudder will not produce any aerodynamic and aerodynamic torque. In Figures 5 and 6, the red curve is the deflection angle of surfaces without the actuator failure, and the blue curve is the deflection angle of surfaces in the corresponding fault mode, and the black dotted lines are the upper and lower limits of surface deflection. It can be seen that when actuator failures occur, the reconfiguration based on the control allocation method ensures remaining surfaces can compensate the influence caused by the faulty actuator. The angle of attack and the angle of roll track the control objectives quickly with stable performance; refer to Figures 7 and 8. Therefore, the design of control allocation completes the task of reconfiguration control well, obtains good control effect, and achieves the goal of fault-tolerant control.

Conclusion
The fault-tolerant control of UAV in the case of actuator failure in the flight control system is studied. Based on the backstepping control method derived from Lyapunov function, the nonlinear optimal control law was constructed by introducing inverse optimization strategy. And reconfiguration design based on control allocation is proposed. Through linearized nonlinear mapping, a fault-tolerant control scheme is designed for the actuators in two typical fault modes: stuck and nonoutput force. Through rigorous mathematical analysis and numerical simulation, the effectiveness of the control strategy is verified.
The integrated VMC in the FCS can realize online monitor of limited fault modes of actuators, which ensures the scheme proposed achieves better fault-tolerant control after the occurrence of known fault modes. However, mechanical transmission from the actuators to the surfaces, or the surfaces itself, cannot be directly dealt with by using the strategy proposed in the paper due to the lack of monitor results, such as the loss of part of the surface. It is worth further study.

Data Availability
No data were used to support to this study.

Conflicts of Interest
The authors declare that there are no conflicts of interest regarding publication of this article.