Neural-Fuzzy Digital Strategy of Continuous-Time Nonlinear Systems Using Adaptive Prediction and Random-Local-Optimization Design

A tracking problem, time delay, uncertainty, and stability analysis of a predictive control system are considered. The predictive control design is based on the input and output of a neural plant model (NPM), and a recursive fuzzy predictive tracker uses scaling factors that limit the range of the measured data and cause the tuned parameters to converge, yielding robust control performance. To further improve control performance, the proposed random-local-optimization (RLO) design for the model/controller uses offline initialization to obtain a near-globally optimal model/controller. Other issues considered are modeling error, input delay, sampling distortion, cost, flexibility, and the reliability of digital implementations of the model-based controller for the continuous-time (CT) nonlinear system. They are addressed by a recommended two-stage control design with a first-stage (offline) RLO step and a second-stage (online) adaptive step. A theorizing method is then put forward to replace the sensitivity calculation, which reduces the computation of the Jacobian matrices in the back-propagation (BP) method. Finally, the feedforward input of reference signals helps the digital fuzzy controller improve the control performance, and the technique controls the CT system precisely.


Introduction
During the past decade, many fuzzy theories [1-7] and delay analyses [8-13] have attracted great attention from both the academic and industrial communities, and there have been many successful applications. Despite this success, it has become evident that many basic and important issues remain to be addressed. Stability analysis and systematic design are among the most important issues for robust control systems [1], and there has been significant research on them (see [2, 14-16]). However, the problems of modeling error and prediction still merit further study. Therefore, the two-stage control method in this paper is proposed to suppress the modeling error and to guarantee the stability of the predictive control system in the presence of this modeling error.
In addition, neural-network-based (NN-based) modeling has become an active research field because of its unique merits in solving complex nonlinear system identification and control problems (see [15]). Neural networks (NNs) or NARMAX/NARX neural networks [17] are composed of simple elements operating in parallel, inspired by biological nervous systems. A neural network can be trained to represent a particular function by adjusting the weights between elements. Because discrete-time (DT) controllers (microcontrollers) are cheaper and more flexible than continuous-time (CT) controllers, the DT control problem for a CT plant is worth studying. In modern control engineering, controllers are commonly implemented directly in the hardware or software of digital computers. One important issue has to be faced, however: the DT-CT design proposed in this paper constitutes a new type of application.
The study of CT control of CT time-delay systems has received considerable attention in recent years, since delay is a major cause of poor performance in many important engineering systems [18, 19]. As is known, delay is an important and complicating factor in the stability and performance of CT nonlinear systems. In general, a delayed signal arises in long-distance signal transmission or heat transfer. However, the amount of delay has different impacts on the various approaches [1, 3]. In this study, the treatment of CT time-delay control systems needs to involve the DT control problem. Moreover, no adaptive NN-model-based design method has yet been developed to adjust the parameters of a discrete-time (DT) adaptive fuzzy controller such that the original continuous-time (CT) system, with time delays and uncertainties, is uniformly ultimately bounded (UUB) stable, as formulated in Section 2.
Mathematical Problems in Engineering
Based on the timer of the microcontroller, the effect of delay in neural system identification can be approximated by many tape-delay terms. This reduces the difficulty of delay identification. The DT NARMAX model is generally sufficient to approximate an unknown, nonlinear, dynamical, and delayed CT system when an appropriate sampling time is selected. Moreover, the measured modeling error between the model and the physical system is considered in the theorems through Lyapunov functions. In Remark 1, this modeling error is used to estimate the proposed corner of sampling period (CSP), which determines the sampling time in support of the intersample method.
The feedforward term in [20] is derived indirectly by assuming some constraints. Because of the overfitting problem and the local-optimum problem of NNs, the method of [20] is not suitable for real applications, since it needs a lengthy convergence time. These neural techniques [20, 21] have usually been demonstrated in nonlinear control because of their powerful nonlinear modeling capability [22] and adaptability. However, they easily fall into local minima when trained by the back-propagation (BP) [23] method. To alleviate the requirement for accurate modeling of the plant, the proposed NARMAX plant and control models are first trained by novel offline methods with the RLO algorithm to overcome this drawback. The RLO algorithm not only guards the gradient descent method against local optimal solutions but also speeds up the convergence of the PSO [24].
Conventional optimization methods assume that all parameters and goals of a model are precisely known [25, 26]. However, in many practical problems, incomplete and unreliable information exists [27]. Therefore, the two-stage scheme is proposed to treat this unknown system as a reliability problem.
Inspired by the DT neural controller of [20], which applies only to a DT system, a digital fuzzy control design for a CT system is proposed, and an approximate inverse of the delayed plant dynamics is used to act as the NARMAX fuzzy controller. Under the recommended two-stage scheme, the adaptive controller and NARMAX models converge more easily than those of [20, 21]. Moreover, this paper concludes with a simulation example and experimental data to demonstrate these techniques.
The remainder of this paper is organized as follows. In Section 2, the system description and its problem formulation are provided. In Section 3, robust controllers and stability analysis in the presence of modeling errors are introduced for a series of control design algorithms. In Section 4, the effectiveness of the proposed approach is illustrated by a numerical example with three cases. Finally, conclusions are drawn in Section 5.

System Description and Problem Formulation
First, the conventional PWM buck converter is modeled, using the AM-OTS-DS [28] methodology, as the equivalent circuit plant (1a) and (1b). In this paper, the robustness of the above control system is emphasized, so uncertainty Δ and delay  are added to the original control system (1a), where  =  + Δ;  = 6; V_in = 30 is a DC voltage source;  = 202.5 × 10^−6; Δ = 0.52 sin(2⋅/3);  = 0.1;  = 10^−2 (1 + sin(10));  = 98.58 × 10^−6;   = 48.5 × 10^−3 and   = 162 × 10^−3 are the parasitic resistances of the inductor and capacitor, respectively. The element   = 0.27 is the static drain-to-source resistance of the power MOSFET, and   = 0.82 is the forward voltage of the power diode. () is the duty ratio of the conventional PWM buck converter. The state () =  = [  (), V  ()]  and the output () = V  () of this power system are defined. The nonlinearity, uncertainty, heating of circuit components, time delay, and digital control issues of the PWM buck converter CT system render the tracking control problem difficult to analyze. A simulation system based on (1a) and (1b) is built with uncertainty. In this study, it is assumed that the parameters of the circuit's components are not ideal and that the capacity of the digital controller is limited by the use of a lower-cost chip. Here, the sampling period   = 10^−4 is chosen for the power system (1a) and (1b). Hence, the delay  is very large for this system.
Then, consider a general nonlinear system with delays described as follows: where the bounded uncertainties Δ() represent the variation of the system parameters, which correspond to the electrical elements of the power system; the zero-order-hold control input is () = ( ⋅   ) = () =   ,  =  ⋅   ;   is the appropriate sample time of the DT-CT design;  is the index of the discrete result   of (), referring to the NN model of (1a) and (1b);  is the time delay; and (⋅) is the relational function of the state () and the system output ().
Remark 1. Given that normal physical systems behave as low-pass filters with smooth nonlinearities, the upper bound, UBSP (shown in Figure 1), of the test sampling period   is determined by a test intersample method, using a plot with a logarithmic x-axis for   and a logarithmic y-axis for ∑  | mod ()|, where  mod () = ŷ −   . In Figure 1, an obvious corner of sampling period (CSP) can easily be found to support the intersample method.
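The corner described in Remark 1 is read off the log-log plot visually. As a rough numerical counterpart, the sketch below picks the candidate sampling period at which the slope of the log-log error curve changes most. The function name `find_corner`, the slope-change criterion, and the sample data are all assumptions made for illustration and are not taken from the paper.

```python
import math

def find_corner(periods, summed_errors):
    """Return the sampling period at which the log-log error curve bends most.

    periods: increasing candidate sampling periods (all > 0).
    summed_errors: summed absolute modeling error measured for each period (all > 0).
    Assumption: the corner is the interior point with the largest slope change.
    """
    if len(periods) != len(summed_errors) or len(periods) < 3:
        raise ValueError("need at least three (period, error) pairs")
    lx = [math.log10(p) for p in periods]
    ly = [math.log10(e) for e in summed_errors]
    best_index, best_bend = 1, -math.inf
    for i in range(1, len(lx) - 1):
        slope_left = (ly[i] - ly[i - 1]) / (lx[i] - lx[i - 1])
        slope_right = (ly[i + 1] - ly[i]) / (lx[i + 1] - lx[i])
        bend = abs(slope_right - slope_left)
        if bend > best_bend:
            best_index, best_bend = i, bend
    return periods[best_index]

# Hypothetical data: the error stays flat up to 1e-4, then grows with the period.
periods = [1e-6, 1e-5, 1e-4, 1e-3, 1e-2]
errors = [0.10, 0.10, 0.10, 1.0, 10.0]
corner = find_corner(periods, errors)
```

With noisy measurements the slope-change criterion may pick a spurious point, so a visual check of the plot, as in Figure 1, remains the reference.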

Stability Analysis with Neural-Fuzzy Design.
In this paper, an adaptive prediction control structure is proposed, as shown in Figure 2, where the FRP controller   is designed as follows: where the switch index where = (),   = (). The controller has two working phases: () is the data vector of the testing phase, and ẑ() is the data vector of the training phase. The fuzzy controller is as follows: The tuned parameter vector of the controller is as follows: The proposed digital fuzzy controller   has feedforward terms and a recursive structure. It can use a NARMAX neural model or the inverse of the plant dynamics to aid control precision in the face of a delayed plant. Owing to the adaptive fuzzy controller, the uncertainty of the plant can be suppressed. Although the structure of the fuzzy controller has been chosen, the parameter vector (  )  is not yet specified; therefore, the design of the fuzzy controller is not yet complete.  ⋅   is the tape-delay time, and  is a positive integer.
The idea of the inverse-model-based fuzzy controller is proposed by the following simplified relation: where P(⋅) is the adaptive NARMAX neural model of the plant;   (⋅) is the adaptive NARMAX fuzzy controller; and   is the desired output. According to the idea of (9), the recursive structure P(⋅) can be designed with tape delays as follows: but the weights and biases (  )  of this model are trained by way of the feedforward-structure neural network (NN) as follows: Because the plant model has a recursive structure, the convergence conditions for its parameters may be much stricter, where ,  + 1 are the numbers of tape delays of ŷ, , respectively.
The plant output is compared with the desired output to create a system error signal   =   −   . The errors ê =   − ŷ and   are used by the adaptation algorithm to update the parameters of P and   . Next, the performance index is designed as follows: is a cost function to be minimized by the proposed algorithm.
Then, the BP algorithm adapts the control parameters (  )  .
That is, the change in control parameters (Δ  )  is calculated as where the small positive   () can be selected as a stable learning rate via the following theorems.
Proof of Theorems. First, consider the following ideal Lyapunov candidate [29] of the model part, where (Δ  )  =   ( + 1) −   ();   () = (  )  . Then, the following Lyapunov candidate of the controller part is designed: Thus, the change in the Lyapunov function is obtained by Finally, the update law of the control parameters of the controller is obtained as follows: This study develops some convergence theorems to select appropriate stable learning rates. First, the difference of modeling error   () = ŷ −   can be represented by be satisfied. Then, the nonlinear systems (1a) and (1b) are UUB stable, and the tracking errors () =   −   are bounded via the controller.
Hence, the dynamic response of the system  can be controlled using   , as shown in Figure 2. This   needs the plant model P to adjust the control parameters via the sensitivity function  ŷ / − .
The digital feedback controller includes a delay block , as shown in Figure 2. Here, the error ẽ+1 is used to estimate   , and the proposed predictor of the delayed system allows some complex computations of the sensitivity function  ŷ / − in the BP algorithm to be replaced by a difference approximation, such as  ŷ /  ≈ ( ŷ − ŷ−1 )/Δ  = Δŷ  /Δ  = Δŷ  /  (). Hence, the following theorem is proposed to update the control parameters of the FRP, under the assumption that a model is available which yields a lower prediction error and a more correct   . The prediction error ê+1 =  +1 − ŷ+1 is bounded at any time, because the previous error ê =   − ŷ is bounded. Hence, the prediction error ê+1 will be bounded by using Theorems 2 and 3. Furthermore, the following theorem is obtained for the convergence of the adaptive prediction controller by the same procedure as Theorem 2.
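The difference approximation above replaces an analytic sensitivity with a ratio of successive changes. A minimal sketch follows, assuming the derivative is taken with respect to the control input (the symbol is not recoverable from the text) and adding a small guard against division by zero. The function name `sensitivity_estimate` and the parameter `min_step` are hypothetical.

```python
def sensitivity_estimate(y_hat_k, y_hat_prev, u_k, u_prev, min_step=1e-6):
    """Approximate the model sensitivity by a backward difference.

    Returns (y_hat_k - y_hat_prev) / (u_k - u_prev).
    Assumption: the denominator is the change in control input between samples.
    min_step is an assumed guard that keeps the ratio finite when the input
    barely changes; it is not part of the original method.
    """
    delta_u = u_k - u_prev
    if abs(delta_u) < min_step:
        delta_u = min_step if delta_u >= 0 else -min_step
    return (y_hat_k - y_hat_prev) / delta_u

# Example: the model output rises by 0.5 while the input rises by 0.2.
estimate = sensitivity_estimate(2.0, 1.5, 0.4, 0.2)
```

The estimate is only as good as the model prediction, which is why the text conditions its use on a low prediction error.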

Two-Stage Scheme in the Presence of Stability.
Figure 2 shows a block diagram of an adaptive recursive control system. The system to be controlled is labeled as the plant , which is subject to modeling errors, uncertainties, and delays. Gradient-descent-based training algorithms may let the model/controller converge to local minima, which can destroy the stability of the control system by enlarging the modeling error. Hence, we propose the two-stage training algorithm to reduce this modeling error as follows.
In the first stage, the measured data are used to train the global optimal NARMAX plant model and the fuzzy controller by the training-data-shuffle method. This method shuffles the training data to avoid most of the local optimal solutions obtained by the offline training procedure in the next section. The measured data are divided into training data and testing data. The testing data are not used for training the NN. However, the final performance of the NN is judged on both the testing data and the training data.
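The shuffle-and-split step can be sketched as follows. The function name `shuffle_and_split`, the default 70/30 split, and the fixed seed are assumptions chosen for illustration; the paper does not state the split ratio.

```python
import random

def shuffle_and_split(inputs, outputs, train_fraction=0.7, seed=0):
    """Shuffle input/output pairs, then split them into training and testing sets.

    Each pair is kept together, so shuffling changes only the order in which
    samples are presented to the training algorithm.
    """
    if len(inputs) != len(outputs):
        raise ValueError("inputs and outputs must have the same length")
    pairs = list(zip(inputs, outputs))
    rng = random.Random(seed)          # seeded for repeatable experiments
    rng.shuffle(pairs)
    n_train = int(len(pairs) * train_fraction)
    return pairs[:n_train], pairs[n_train:]

# Hypothetical measured data: eight input samples and their outputs.
train, test = shuffle_and_split(list(range(8)), [10 * v for v in range(8)],
                                train_fraction=0.5)
```

For a NARMAX model, each "input" here would be a full regressor row of delayed samples, so that shuffling does not break the time ordering inside a row.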
In the second stage, the global optimal NARMAX plant model and fuzzy controller are adapted. The two stages are divided into the following five steps.
Step 1. First, the reference signal () =   is designed. With white noise applied as the plant input () =   , the output data () =   are collected, and a training-data-shuffle method is used to shuffle the input/output data pairs. These shuffled data are then ready to train the NARMAX model/controller. Here, the following reasonable conditions need to be taken into account: need be satisfied, where   is the upper bound of   , and   is the lower bound of   . According to (27), much of the excessive control effort   can be avoided. If (27) is satisfied, then go to Step 2.
Step 2. The feedforward-structure model P is trained/tested offline via the shuffled input/output data pairs. After system identification is performed, P is obtained, and the digital fuzzy controller   for the CT system can be built by using the inverse NARMAX plant model P−1 in the next step.
Step 3. In practice, according to the exchanged output/input data pairs from Step 2, the offline stage to train/test the fuzzy controller can be passed through If (28) and (29) work, go to Step 4.
Step 4. Update the online weights and biases   of the recursive-structure model P: ŷ () = P (    ,    −1 , . . .,    ( −   ) ,   ŷ−1 , to approximate the CT nonlinear system by using Theorem 2. Given the adaptation laws for (28) and (30), a switch between the two can be designed, shown as the switch in Figure 2, to swap them into the system when the absolute approximation error of (30) is too large. If (28) and (30) work, go to Step 5.
Step 5. Adapt the digital fuzzy controller for the modeling error and tracking error by using Theorems 2 and 3. Finally, update the online parameters of the fuzzy controller to minimize the tracking error, and finish the above two stages: the offline stage and online stage.
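Steps 1 to 4 train a model on rows of current and delayed samples. The sketch below shows one plausible way to build such rows for the offline (feedforward-structure) case, where measured outputs rather than model outputs fill the delayed-output slots. The function name `build_regressors`, the argument names, and the row layout are assumptions, since the model equations are not recoverable from the text.

```python
def build_regressors(u, y, input_delays, output_delays):
    """Build NARX-style regressor rows from sampled input u and output y.

    Row for sample k: [u[k], u[k-1], ..., u[k-input_delays],
                       y[k-1], ..., y[k-output_delays]], with target y[k].
    Assumption: this layout mirrors a tape-delay model structure.
    """
    if len(u) != len(y):
        raise ValueError("u and y must have the same length")
    start = max(input_delays, output_delays)
    rows, targets = [], []
    for k in range(start, len(y)):
        past_inputs = [u[k - i] for i in range(input_delays + 1)]
        past_outputs = [y[k - j] for j in range(1, output_delays + 1)]
        rows.append(past_inputs + past_outputs)
        targets.append(y[k])
    return rows, targets

# Hypothetical sampled data with two input delays and one output delay.
rows, targets = build_regressors([0, 1, 2, 3, 4], [10, 11, 12, 13, 14], 2, 1)
```

Swapping the roles of `u` and `y` gives the exchanged output/input pairs that Step 3 uses to train the inverse-model controller.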
To ensure the robustness of the control system, the convergence of the parameters of the model/controller to the global optimal solution has to be guaranteed. Hence, some random initial weights and biases of the model, together with the parameters of the controller, are first generated by particle swarm optimization (PSO) [24]. The PSO algorithm consists of the velocity and position updates, where  = 1, 2, . . .,  is the particle index;  = 1, 2, . . .,  is the iteration index; V  is the velocity of the th particle;   is the position of the th particle;   is the best position found by the th particle (personal best);  is the best position found by the swarm (global best, best of the personal bests); and  1 ,  2 are random numbers on the interval [0, 1] applied to the th particle.
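The velocity and position updates can be illustrated with the widely used textbook form of PSO. This is a sketch under stated assumptions, not the paper's exact equations: the inertia weight `inertia` and the acceleration coefficients `c1` and `c2` are hypothetical parameters, and the two random numbers are drawn once per particle, as the description above suggests.

```python
import random

def pso_step(positions, velocities, personal_best, global_best,
             inertia=0.7, c1=1.5, c2=1.5, rng=random):
    """Apply one PSO velocity and position update in place.

    positions, velocities, personal_best: lists of per-particle lists.
    global_best: a single list (best position found by the swarm).
    Assumed update: v <- inertia*v + c1*r1*(pbest - x) + c2*r2*(gbest - x),
    followed by x <- x + v.
    """
    for i in range(len(positions)):
        r1, r2 = rng.random(), rng.random()   # one draw per particle
        for d in range(len(positions[i])):
            velocities[i][d] = (inertia * velocities[i][d]
                                + c1 * r1 * (personal_best[i][d] - positions[i][d])
                                + c2 * r2 * (global_best[d] - positions[i][d]))
            positions[i][d] += velocities[i][d]
    return positions, velocities

# A particle already sitting at both best positions with zero velocity stays put.
pos, vel = pso_step([[1.0, 2.0]], [[0.0, 0.0]], [[1.0, 2.0]], [1.0, 2.0])
```

Updating the personal and global bests after each step requires evaluating the fitness of every particle and is left out of this sketch.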
The PSO supplies random initial parameters; hence, it acts as a generator of initial parameters. These initial parameters are then converged locally by the BP method, and the best solution for the initial model/controller is chosen. Finally, the global optimal solution of the parameters can be found every time. Hence, this idea has been named the random-local-optimization (RLO) algorithm. The RLO algorithm is a composite of the BP algorithm and a random initialization procedure that evaluates the fitness value 1/(Ξ + 0.01), where Ξ =  ⋅ Ξ 1 + (1 − ) ⋅ Ξ 2 ,  ∈ [0, 1]. The total absolute training error Ξ 1 is obtained by BP via the training data, and Ξ 2 is the total absolute testing error of the model/controller output via the testing data input. In this paper, offline RLO is used as the learning algorithm for the feedforward-structure model (28), because the online-tuned parameters of the recursive-structure plant model may not converge. After the offline training stage, in order to tune the online parameters of the plant model (30) recursively,  ŷ /(  )  of (14) needs to be calculated as follows: Similarly, in order to tune the online parameters of the controller (31) recursively,  ŷ /(  )  of (24) needs to be calculated as follows: where Hence, the following algorithm adapts a NARMAX neural controller for a NARMAX neural model of the plant.
Step  Step then go to Step 1.
To clarify this method, in [20], a robust and adaptive method was used to allow learning to occur online, tuning performance as the system runs. However, [20] did not consider prediction, modeling error, global optimal initialization of the control parameters, the lengthy convergence time of online control, delayed terms, uncertainties in the plant, or DT-CT problems. Moreover, its method of choosing the initial parameters of the online controller cannot overcome the overfitting problem of the controller. Hence, the offline stage is proposed, in which an RLO learning algorithm chooses the initial weights and biases of the online neural controller in the simulation example of the power plant, as shown in the following case study.
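The RLO idea (random initial parameters, local gradient convergence of each candidate, and selection by the fitness 1/(Ξ + 0.01)) can be sketched on a toy problem. Several simplifications are assumed here: uniform random draws stand in for the PSO-supplied initial parameters, a two-parameter model y = a·tanh(w·x) stands in for the NARMAX network, and all function names, learning rates, and data are hypothetical.

```python
import math
import random

def predict(params, x):
    a, w = params
    return a * math.tanh(w * x)

def total_abs_error(params, data):
    """Total absolute error of the model over (x, y) pairs."""
    return sum(abs(predict(params, x) - y) for x, y in data)

def local_descent(params, train, rate=0.05, epochs=200):
    """Plain gradient descent on 0.5 * error^2, one sample at a time."""
    a, w = params
    for _ in range(epochs):
        for x, y in train:
            h = math.tanh(w * x)
            err = a * h - y
            grad_a = err * h
            grad_w = err * a * (1.0 - h * h) * x
            a -= rate * grad_a
            w -= rate * grad_w
    return (a, w)

def rlo(train, test, restarts=10, lam=0.5, seed=0):
    """Random starts + local descent; keep the candidate with the best fitness."""
    rng = random.Random(seed)
    best_params, best_fitness = None, -1.0
    for _ in range(restarts):
        start = (rng.uniform(-2.0, 2.0), rng.uniform(-2.0, 2.0))
        params = local_descent(start, train)
        xi = (lam * total_abs_error(params, train)
              + (1.0 - lam) * total_abs_error(params, test))
        fitness = 1.0 / (xi + 0.01)   # fitness form taken from the text
        if fitness > best_fitness:
            best_params, best_fitness = params, fitness
    return best_params, best_fitness

# Hypothetical data generated from a = 1.5, w = 0.8.
train = [(x, 1.5 * math.tanh(0.8 * x)) for x in (-2.0, -1.0, -0.5, 0.5, 1.0, 2.0)]
test = [(x, 1.5 * math.tanh(0.8 * x)) for x in (-1.5, 1.5)]
best_params, best_fitness = rlo(train, test)
```

Because the fitness mixes training and testing error, a candidate that overfits the training data scores lower than one that generalizes, which is the motivation given for weighting both terms.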

Case Study
First, the conventional PWM buck converter is modeled, using the AM-OTS-DS [28] methodology, as the equivalent circuit plant. In this paper, the robustness of this control system is emphasized, so uncertainty and delay are considered in (1b). The state is () =  = [  (), V  ()]  , and the output is () = V  () in this power system.
Referring to Figure 2 and the above sections on the two-stage control structure, it can be seen how to model the CT plant dynamics from Figure 2(a) by considering the modeling error, and how to use the DT neural model of the plant to adapt a DT fuzzy controller and predict the DT compensation of the total control input   from Figure 2(b).
To compare with other methods, the following cases are introduced.
Case 1. This case is from [28], and its digital controller is a fuzzy controller with an integral term, shown in (38).
Case 2. This case is from [30], and its controller is a single neuron, shown in (39), without a multilayer structure; this CT controller is discretized to a DT controller for comparison with Case 3.
Case 3. This is the control method presented here, and the proposed neural-model-based fuzzy controller is adaptive, predictive, and globally optimal.
The detailed designs of Cases 1-3 are as follows.
Case 1 is the LMI control method of the original example for this power plant. The control parameters of Case 1 are solved by the LMI toolbox as follows: where ∑      is the integral term.
Case 2 is an adaptive control method. This neural controller is designed as where  =   is the net input of the single neuron.
shows the LMI control performance of Case 1. From the control results of Cases 1 and 3 shown in Figure 5(d), the proposed adaptive and predictive laws do let the proposed controllers adapt and predict themselves to overcome robustness problems such as delay, uncertainty, and nonlinearity.
It is clear that the two-stage scheme, Case 3, for the proposed adaptive controller and predictive controller has excellent tracking performance when compared with Cases 1 and 2.

Conclusion
The proposed two-stage adaptive prediction control converges quickly and is highly effective and precise. It simplifies the complex model-based adaptive control design and works for nonlinear delayed plants with uncertainty. The proposed recursive and feedforward control scheme is partitioned into two stages that can be independently optimized. First, an offline neural model of the continuous-time (CT) nonlinear power plant is made. Second, a constrained offline digital fuzzy controller is generated; then, an adaptive plant model is made, and an adaptive NARMAX prediction tracker is generated. Finally, all processes may continue concurrently, and the robustness and adaptive prediction design with DT-CT problems for a power plant are solved. Although this power system is only a simulation, the control strategy can be extended to LED dimmer systems and to time-delay robot systems based on visual servoing, which is planned as future research.

Figure 2: Two-stage adaptive prediction structure of DT-CT control system.(a) Two-stage block diagram and (b) online adaptive prediction block diagram.

Figure 3: (a) The learning curve of the summation of   , (b) the learning curve of the summation of   , (c) the tracking control performance, (d) the modeling error ŷ −   =   −   , (e) the trajectories of predictive input signal   , and (f) total input signal  of Case 3.

Theorem 2. If the number of neurons of the neural model is sufficient and the appropriate sampling time   is selected to let ‖  −   ‖ ≤  and the following condition be satisfied, where   is an output of the optimal model, then the trajectory ŷ converges to the plant output   as a uniformly ultimately bounded (UUB) approximation with bounded error ŷ −   .
an actual Lyapunov candidate of reachable and assumptive trajectory   ; the bounded approximation error  ≈   . The next task is to train the model such that  2 is minimized,
−       2 + ( ŷ −   )  (  −   ) (16)
and the number of neurons of the neural model are sufficient, and the appropriate sampling time   is selected to let (20)
 )  ]    ()( ŷ /(  )  )) < 1 and ‖  −   ‖ ≤ , then  2 ( + 1) <  2 (), that is,  2 () → 0 or ŷ →   makes the UUB approximation of this model on the bounded ŷ −   . The proof is thereby completed.
Furthermore, the following theorem for the convergence of the controller is obtained by the same procedure as the above proof. If Theorem 2 in (14) is satisfied, the function  ŷ /(  )  in (20) is computed to let the following condition
0 <   () < 2      ŷ /(  )
thus, the change in the Lyapunov function is obtained by
2 ( + 1) −  2 () 2 ≤   ,
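The role of an upper bound on the learning rate can be illustrated on a model that is linear in its parameters, where the effect is easy to verify by hand. The sketch assumes the condition takes the familiar form 0 < rate < 2/‖gradient‖², with the gradient being the sensitivity of the model output to its parameters. That reading is an assumption about the condition above, and the function names are hypothetical.

```python
def stable_rate_upper_bound(gradient):
    """Return 2 / ||gradient||^2, the assumed upper bound on the learning rate."""
    norm_sq = sum(g * g for g in gradient)
    if norm_sq == 0.0:
        raise ValueError("gradient must be nonzero")
    return 2.0 / norm_sq

def gradient_step(weights, features, target, rate):
    """One gradient step for the linear model y_hat = weights . features.

    For this model the sensitivity of y_hat to the weights is `features`,
    and the new error equals (1 - rate * ||features||^2) times the old one.
    Returns (new_weights, old_error, new_error).
    """
    old_error = target - sum(w * f for w, f in zip(weights, features))
    new_weights = [w + rate * old_error * f for w, f in zip(weights, features)]
    new_error = target - sum(w * f for w, f in zip(new_weights, features))
    return new_weights, old_error, new_error

features = [1.0, 2.0]
bound = stable_rate_upper_bound(features)          # 2 / 5 = 0.4
_, e_old, e_in = gradient_step([0.0, 0.0], features, 5.0, rate=0.2)
_, _, e_out = gradient_step([0.0, 0.0], features, 5.0, rate=0.5)
```

With a rate inside the bound the error shrinks, and with a rate above it the error grows, which is the behaviour the Lyapunov argument is meant to rule out for the nonlinear model.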