Sampled Characteristic Modeling and Forgetting Gradient Learning Algorithm for Robot Servo Systems

Servo systems of robotic exhibit nonlinear coupling with multidimensional characteristics, which poses a challenge to existing modeling and identification techniques. According to a kind of robot servo system which runs repetitively operations over a prespecified finite time interval, a low-order sampling characteristic modeling method is derived in this work. Characteristic parameters are allowed to vary from both time axis and iteration one; the forgetting gradient learning algorithm is utilized to estimate characteristic parameters. Furthermore, the effectiveness of the proposed algorithms is proved via theoretical analyses and numerical simulations.


Introduction
As an important part of the industrial Internet system, robots are attracting more and more attention. Existing robot control methods mostly depend on models. Because of the dynamic characteristics of controlled objects, the complexity of control tasks and operating environment, it is often di cult to establish accurate mathematical models. Nonnegligible unmodeled dynamics have higher requirements on system models and the robust performance of closed-loop systems needs to be improved. In order to reduce the impact of unmodeled dynamics on the performance of control systems, most existing methods build highorder models of systems. Modeling and control technology for high-order systems is extremely complex and di cult to implement. e feature modeling method provides a predictable way to model complex systems. It does not need to establish a precise dynamic model of the system nor does it use a system reduction or linearization method. Instead, it expresses the model as a low-order time-varying di erence equation, taking into account factors, such as system state, external environment, and control variables, and compresses the dynamic information into feature parameters. e Euler approximation is often used to discretize the continuous system, and the equivalent characteristic model [1][2][3][4][5][6] is given. e feature model can also be performed with exact discretization to give a sampling feature model of the system. Without limiting the sampling interval and control tasks, a feature model with rapidly changing (even abrupt) characteristic parameters may be obtained [7,8].
Stochastic gradient algorithm and its derivative algorithm are popular for training network weights and avoid the matrix operation compared with the recurrence least squares algorithm and the small online computation, which has attracted much attention [9][10][11][12][13][14][15][16][17]. e consistent convergence results of the parameters are still available for the stochastic gradient algorithm under weaker incentives. e martingale theory and stochastic process theory are powerful tools to analyze the convergence of recurrence algorithms. From the published results, most methods are targeted for stationary systems, where the estimated parameters are stationary.
When dealing with time-varying systems, it is found that the stochastic gradient algorithms do not have the ability to track the time-varying parameters. In the eld of timevarying system identi cation, more consideration is given to how to correct the recursive algorithms, that is, how to construct the correction algorithm of the recursive algorithms, so as to achieve e ective tracking of the time-varying parameters and improve the convergence rate of the parameters. It is well known that if the time-varying parameter change law is not known, there is no consistent parameter estimation convergence [18][19][20][21]. Based on this conclusion, one abandons the attempt to achieve complete tracking of time-varying parameters and instead works on a correction algorithm of how to construct and analyze recursive algorithms, expecting to give lower upper bounds on the estimation error of time-varying parameters. Common correction algorithms such as weighted recursive algorithms include rectangular windows, exponential windows (forgetting factors), etc. For irreducible nonstationary processes, these results can more accurately evaluate the parameter estimation accuracy, which has important applications in practical engineering applications.
When the system parameters are repetitive and independent, the instant change system runs repeatedly on a finite interval, the system parameters change over time, but the next time, it repeats this change, the instant change parameter to a repetition invariant. "Recursive" algorithms are constructed along the repetition axis, and the learning algorithm gives its complete estimates regardless of parameter slow change, fast change, and even mutation [22,23]. However, in practical situations, there are often system parameters that change not only along the time domain but also with the number of iterations, that is, iteration dependence. If the time-varying parameter changes with the number of iterations are unknown, then similar to the recurrence algorithm, the iterative learning identification algorithm based on the principle of repeated invariant cannot track the system parameters effectively. We consider introducing the forgetting factor in the iterative learning algorithm and propose the forgetting gradient learning algorithm to estimate the iteration-dependent time-dependent system parameters. en, under the premise of satisfying the repeated continuous incentive conditions, the performance analysis of the proposed algorithm is used and the simulation examples are completed to illustrate the effectiveness of the forgetting gradient learning algorithm.

Description of the Problem
Actual robot servo systems are mostly continuous nonlinear time-varying coupling processes, considering the following systems: where u stands for input and y for output, and ξ(t) represents external disturbances.
With the development of computer control technology, the analysis and synthesis process of the robot servo system needs a sampling model. e following discrete nonlinear time-varying coupling processes are also considered: (2) Here, we consider relaxing the initial condition that f(0, 0, · · · , 0, t) ∧ f(0, 0, · · · , 0, k) can be arbitrary values. e goal of this paper is to build a sampling feature model for a robot servo system. For iteration-dependent and time-varying feature parameters in the model, the amnestic gradient learning algorithm is used to estimate them. To further validate the effectiveness of the proposed learning algorithm, the random process theory is used to analyze the model and a numerical example is given to verify the effectiveness of the proposed algorithm. e rest of this paper is arranged as follows: the third part establishes the sampling characteristic model of the servo system, the fourth part puts forward the forgotten gradient learning algorithm, the fifth part gives the convergence analysis process of the learning algorithm, the sixth part verifies by numerical examples, and the seventh part gives the conclusion of this paper.

Characteristic Modeling for the Servo System
We consider the following discrete nonlinear system: where in the same way, we can obtain the following: e characteristic modeling method establishes a characteristic model for a nonlinear higher-order system by compressing the characteristic information to the characteristic parameters. Obviously, the lower the order of the characteristic model, the faster the change of the feature parameters. We consider building a first-order sampling characteristic model for the controlled system [7], and we can see that In general, we can also set up second-order and thirdorder sampling feature models of the controlled system, that is, formula (5) can be written into formula (3).

Forgetting Gradient Learning Algorithm.
We consider the following single-input single-output (SISO) discrete time-varying system repetitively operates over a prespecified finite time interval: where t � (0, 1, · · · , N) denotes time domain, and k � (1, 2, · · ·) denotes iteration domain. u k (t) and y k (t) represent the input and output of the system, respectively Equation (6) can be rewritten into the following regression model: Similar to the stochastic gradient algorithm, a forgetting gradient learning algorithm for identifying iteratively dependent time-varying systems is presented:

Convergence Analysis of Forgetting Gradient
Algorithm. e learning algorithm obtains parameter estimation based on the input and output data obtained when the system runs repeatedly in the operating interval. Because the operating interval is limited, we cannot obtain the convergence analysis results in the conventional sense. Only repetitive convergence results can be obtained. at is, for t ∈ 0, 1, { · · · , N − 1}, the corresponding convergence classification is as follows Repetitive consistency: Repetitive boundedness: When the system parameters are iteratively independent, using the learning algorithm, we can obtain the repeated consistency convergence results of the parameters. When the system parameters are iteratively dependent, we analyze the convergence performance of the forgetting gradient learning algorithm represented by equations (9) and (10).
For fixed time t ∈ 0, 1, · · · , N − 1 { }, F k (t) is denoted as a σ algebra consisting of the input and output data obtained by k repeated operations. In order to analyze the convergence of the proposed learning algorithm, the following assumptions are derived.

Mobile Information Systems
where Hypothesis 3 is satisfied, i.e., Take limit of both sides of this inequality For system (6), we define the transition matrix as follows: e upper bound of λ max [L T k+N,k (t)L k+N,k (t)] is then solved, and this bound is denoted as A(t). Let x k (t) be the unit eigenvector corresponding to the maximum eigenvalue λ max (t) of the matrix L T k+N,k (t)L k+N,k (t), and we construct the difference equation Taking norm on both sides of the abovementioned equation, and it follows that ‖x k+N (t) We transpose both sides of this inequality and that for any i ∈ [0, N − 1], Taking trace to repetitive excitation condition A3), we obtain as follows: In the condition (A3), we multiply x T k (t) to the left and x k (t) to the right and use the formulas (15), (19)- (22) to obtain as follows: It is derived as follows: Lemma 1. Let the nonnegative sequence x k (t), a k (t), b k (t) satisfy the following relation: where where the right limit is assumed to exist.
Suppose the observed noise v k (t) and the parameter iteration-dependent system parameter change rate ω k (t) � θ k (t) − θ k−1 (t) are zero-mean random noise sequences unrelated to the input u k (t) and the following relationships are satisfied: If PE condition (A1) is satisfied, the parameter estimation error θ k (t) − θ k (t) given by the forgotten gradient learning algorithm is repetitively bounded, which is solved below.
Taking norms on both sides of formula (34)

Mobile Information Systems
Taking expectation on both sides of formula (35), From Lemma 1 and A(t) < 1, it is derived as follows: Here, we give the convergence analysis results of the forget gradient learning algorithm in the case of parameter iteration dependence. According to formula (37), we can obtain the bounded convergence effect of the algorithm, that is, the parameter bounds converge to the true values and we give the bounds of the convergence bounds. From formula (37), when σ 2 ω � 0, which is equal to the system parameters iterate independently, we take λ � 1. A random gradient learning algorithm is obtained.
In this case, the consistent convergence result of parameters can be obtained according to formula (37), that is, the parameter estimation converges completely to the parameter truth value.

Numerical Results
is section completes numerical examples to demonstrate that the learning identification algorithm can be used to estimate time-varying parameters in dynamic systems as shown in Figures 1 and 2.
Example 1. Consider the following nonlinear system: e expected trajectory is y d (t) � 10(10t 3 − 15t 4 + 6t 5 ). Using the sampling feature modeling method provided in this paper and the adaptive learning control method provided in reference [7], a first-order sampling feature model is established. Where the sampling time T � 0.005, the initial Mobile Information Systems value r � 1; the forgetting factor λ � 0.65, we can obtain as follows: Tracking performance is shown in Figure 3. From the simulation results, using the sampled characteristic modeling method provided in this paper, although fast time-varying characteristic parameters are obtained, the output of the system enables efficient tracking of desired trajectories.
Example 2. Consider the following finite interval timevarying system. y k (t + 1) + a 1,k (t)y k (t) + a 2,k (t)y k (t − 1) where In the simulation, we set finite interval length N � 1000. For t � 1, 2, · · · , N, k � 1, 2, · · · , 4000, u k (t) as uniformly distributed random variables on [-0.5, 0.5], forgetting factor λ � 0.7, v k (t) � 0.01 randn. Here, randn is the production function of random variables that obey (0, 1) normal distribution. In the random dependence learning algorithms (8) and (9), we set the initial value r 0 (t) � 1, θ 0 (t) � 0. To examine the convergence performance, we define e simulation results are shown in Figure 4-6. e prediction error is shown in Figure 4, and the parameter estimation error is shown in Figure 5, and the parameters estimate values in Figure 6. It can be seen from Figure 4 that the prediction error decreases rapidly with the increase of the number of iterations. In Figure 5, the parameter estimation error asymptotically converges to zero in a small field, and the simulation results show the uniform convergence of the parameter estimation, which can almost converge to real values. To demonstrate the effectiveness of the proposed algorithm, the simulation results are compared with stochastic gradient method appeared in reference [17], where the finite interval length N � 1000 * 4000 and the other conditions are same to which of the abovementioned simulation. e errors with respect to every interval are shown in Figure 7.
Parameter estimation errors after last recursive process and parameter estimations after last recursive process are shown in Figures 8 and 9.
From the simulation results, the bounded convergence result is guaranteed, but the identification result is weaker than the result of the method presented in this paper.

Conclusions
As modeling technology is of great importance to robots for industrial Internet systems, the proposed sampled characteristic model and forgetting gradient learning identification method can be used to solve the parameter estimation problem of time-varying systems running round-trip over finite intervals. e forgetting gradient learning algorithm is derived for time-varying systems under finite-interval repeat operations. We prove the repetition boundedness of the learning algorithm under repeated continuous excitation conditions and give the estimation error bounds given by the proposed algorithm. e completed simulations also verify the effectiveness of the learning algorithm. Further, we present the convergence analysis results of the stochastic gradient learning algorithm, which can obtain consistent convergence results for the parameters when parameter iterations are independent. e main purpose of this paper is to propose this learning identification method and to clarify the connection and difference between the learning identification and the existing recursive identification algorithms. For the completeness of the theory and the expression simplicity, for the consistency analysis of the learning algorithm, we learn from the mature results of the learning algorithm. However, there are still differences between the two algorithms, such as the recurrence algorithm requires the PE condition along the time domain, while the learning algorithm requires the repeated PE condition; the assumption of the convergence consistency of the learning algorithm and the estimation of the obtained convergence rate are allowed to depend on time. Systematic results for recursive identification are presented in literature [17], from which we can learn for follow-up studies, including the case of system interference such as colored noise, continuous excitation, improvements of SPR conditions, and convergence rate estimation.

Data Availability
e data used to support the findings of this study are included within the article.

Conflicts of Interest
e authors declare that they have no conflicts of interest.