Adjusting LQG with Noise Signals, PID Controller for Acrobot System

Acrobot is a robotic system that is actuated to achieve a certain degree through a controller and/or motion motors. In order to control acrobot in a given position, the designers have to come up with appropriate algorithms. In fact, input signals into the system are aﬀected by noise signals. Therefore, it is necessary to remove noisy signals. In this paper, a method to control the balance under the inﬂuence of noise signals aﬀecting the system (LQG) is proposed. Through acrobot’s equations of motion, the system is linearized to facilitate the use of the above regulator for positions around the equilibrium point. The role of a Kalman ﬁlter of an LQG regulator is also depicted. Besides, a PID controller was used to survey this system as a comprehensive assessment of the eﬀectiveness of control methods for acrobot. As acrobot stability surveys are rare in recent years according to the data referenced below, this paper is useful for acrobot control purposes and can serve as an important database for the application of acrobots in practice. Simulations are performed on Matlab.


Introduction
Nowadays, controllers such as PID, fuzzy, and sliding mode are applied to control all kinds of robots. is is a strong step toward helping robots achieve the requirements set out. For the acrobot system, the algorithms to control the balance are always focused on the selection of controllers. Modern robots are controlled with classical algorithms and genetic algorithms [1] and intelligent algorithms [2] where noise signals are not taken into account. In [1,2], it is possible that the noise signals considered are negligible. Algorithms can be widely applied in the future such as selfdriving subways, self-driving fighter planes, and so on. In this paper, a method to control the balance under the influence of noise signals affecting the system (LQG) is proposed. In the section presented below, the role of Kalman filter to form an LQG regulator is depicted. is method will optimally control a system in the presence of unwanted signals.
Other systems have similar motion characteristics like acrobot, including pendulum, inverted pendulum [3], and reaction wheel inverted pendulum [4], which are nonlinear systems such as single-input multi-output (SIMO). In SIMO systems, the designers have a lot of difficulty in determining the exact position that has been placed for the pendulum according to the requirements. e flexible structured models using the algorithms in [5][6][7] are interesting experiences. Robustness in the optimal control method based on multivariable control design [8] is a new topic. Some extensions of loop transfer recovery [9] for flexible structures are regarded as a promising application. Loop recovery and robust state estimate feedback designs [10] for robotic systems are always welcome. Loop transfer recovery with nonminimum phase zeros [11] for electric motors is also interesting work. e LQG/LTE procedure for multivariable feedback design [12] is necessary to experience the application of any particular model. Multivariable feedback design concepts for a classical/ modern synthesis [13] are useful as a reference for setting up newer algorithms. Feedback property of multivariable systems [14] is an interesting subject for its application to a system with a flexible structure. It is necessary to develop algorithms [15][16][17][18][19] so that they are more in line with today's trend. Based on [20][21][22][23], the author applied algorithms [20][21][22][23] for robot models. e author used modern algorithms such as neural control, adaptive fuzzy control, and sliding fuzzy control to control the model [24][25][26][27], and efficiency levels of the above control methods are presented in detail. is paper focuses on the process of investigating the effectiveness of using two control methods as mentioned in the problem. e purpose of this is to cater to practical requirements and considerations in the selection of control methods. Acrobot is a new automated system, which has many utilities in life. e aforementioned studies have not dealt with acrobots in detail, so it is always an interesting experience to investigate the properties of acrobot systems. e acrobot system is described by a link between link 1 and link 2 as shown below (Figure 1).

Dynamic Equation of Model System
It is necessary to establish a mathematical equation that describes the motion of any model. rough these equations of motion, controllers can be installed inside the system. ey will be connected to the system via wireless devices or wired devices. Usually, control systems have one or more controllers connected to the systems by wired devices. With the successful establishment of the mathematical equations, the author calculated the values of K in LQG regulator. For example, acrobot or an inverted pendulum is described by mathematical equations. Acrobot has a similar structure to Pendubot [3], but the difference between the two lies in their control joint positions. e two-dimensional coordinate system: Ox and Oy, is depicted in Figure 1 [28].
Parameters of the acrobot system are listed in Table 1. Variables are q 1 , q 2 , and τ 2 , where q 1 and q 2 are output signals and τ 2 is input signal. e selection of variables was based on the structure of the system as shown in Figure 1. Variables have unknown values. In the process of designing controllers for these systems, readers can better understand the nature of the relationships between input signals and output signals through voltage signals and the torque of an electric motor. at is, these values do not have a specific value during the analysis, but their influence on the system is important.
In Figure 1, the x-axis of the Cartesian coordinate system was chosen to be the reference level of zero potential energy. Letting X i � [X x i , X y i ] T ∈ R 2 be the absolute position of the COM of the i th link gives [24] X 1 � Lc 1 sin q 1 Lc 1 cos q 1 , X 2 � L 1 sin q 1 + Lc 2 sin q 1 + q 2 L 1 cos q 1 + Lc 2 cos q 1 + q 2 .
(1) e kinetic energy is K(q, _ q), and the potential energy is V(q): where M q 2 � a 1 + a 2 + 2a 3 cos q 2 a 2 + a 3 cos q 2 a 2 + a 3 cos q 2 a 2 , Considering that friction is very small, the author used Lagrangian of acrobot; then, the dynamic equation of the mechanical system is d dt where L(q, _ q) � K(q, _ q) − V(q) and τ 1 � 0. Equation (5) is equivalent to  where State-space equations are created as Let Dynamic equation (6) can be described as where P(i, j) and Z(i, j) are matrices having the i th row and j th column of P and Z. erefore, P and Z are determined as e establishment of acrobot's equation of state is based on the values of Table 2.
e state variable equations of the system are described as follows: Distance from active joint to the center of mass of link 1 Gravitational acceleration τ 2 Torque applied to active joint Table 2: Parameters of the acrobot system.

Parameters
Values Mathematical Problems in Engineering With the above parameters and matrices A and B of the state-space model, G 1 (s), G 2 (s), G 3 (s), and G 4 (s) are calculated: Transfer function of acrobot:

LQG Regulator
LQG Regulator (linear quadratic Gaussian) is used for optimal control of a system while noise signals "affect" the system. e author considered the following diagram ( Figure 2). e goal of adjusting is to stabilize the output at zero; under the influence of input white noise signals (w) and noise signals due to (v), the control signal is "u," and the equation of state is LQG Regulator consists of using a state feedback unit and a state estimator (Kalman filter) (Figure 3). e quality indicator (J) is used to find the state feedback matrix K � lqr(A, B, Q, R, N).
e control signal is u � − K × x, and xis inferred as where "L" is the Kalman gain and "L" is calculated by the following commands: where Q n , R n , and T s represent the variance of the noise signal.E(ww T ) � Q n , E(vv T ) � R n , E(wv T ) � N n , and k est is the model of the Kalman filter. Kalman filter calculates x to minimize the variance: e equation of state for a regulator (LQG) is where u � − Kx is created by the following commands: where rlqg is the state-space model of a set of LQG, the output is u, the input is y, and the state is x. Finally, the command is requested: The object The regulator noise signals y + + w noise signals u v Figure 2: Diagram of a regulator for an object with noise signals ("v": the white noise and "w": the color noise signal).
e author established a closed system with the value of "sys" as the object model, and the value of "K" is the state feedback matrix, "L" is the state estimation matrix, "sensors" represent a subset of outputs, the value of "y" returns the estimator, "known" represents the inputs, "u d " affects the estimator, and "controls" are inputs of "sys" that is used for control.

Simulation Results and Discussion
Simulation results are shown in Figures 4-35. Diagram of the system using the LQG regulator is shown in Figures 36-39. Diagram of the system using the PID controller is shown in Figures 40-43.
In Figures 36-39, the value of "d" is a color noise signal with a spectral density of less than 10 rad/s, and the value of "n" is a white noise signal E(n 2 ) � 0.01. e quality indicator is 'J': e impulse response for the closed system (highlighted in red in Figure 6) is better than that for the open system (highlighted in green in Figure 7). e value of amplitude of the oscillation of the closed system in this case is zero, and the closed system reaches a steady state. For this type of regulator, the closed system responds well. e value of amplitude of the oscillation of the open system in this case is large, and the open system does not reach a steady state. e step response for the closed system (highlighted in blue in Figure 4) is better than that for the open system (highlighted in red in Figure 5). e value of amplitude of the oscillation of the closed system in this case is − 0.6, and the closed system reaches a steady state. For this type of regulator, the closed system responds well. Meanwhile, the open system cannot respond well. In general, for a regulator of this type, the system responds well to the presence of noisy signals. e impulse response for the closed system (highlighted in red in Figure 10) is better than that for the open system (highlighted in blue in Figure 11). e value of amplitude of the oscillation of the closed system in this case is zero, and the closed system reaches a steady state. For this type of regulator, the closed system responds well. e value of amplitude of the oscillation of the open system in this case is large, and the open system does not reach a steady state. e step response for the closed system (highlighted in green in Figure 8) is better than that for the open system (highlighted in green in Figure 9). e value of amplitude of the oscillation of the closed system in this case is − 30, and the closed system reaches a steady state. For this type of regulator, the closed system responds well. Meanwhile, the open system cannot respond well. In general, for a regulator of this type, the system responds well to the presence of noisy signals. e impulse response for the closed system (highlighted in blue in Figure 14) is better than that for the open system (highlighted in green in Figure 15). e value of amplitude of the oscillation of the closed system in this case is zero, and the closed system reaches a steady state. For this type of regulator, the closed system responds well. e value of amplitude of the oscillation of the open system in this case is large, and the open system does not reach a steady state. e step response for the closed system (highlighted in red in Figure 12) is better than that for the open system (highlighted in red in Figure 13). e value of amplitude of the oscillation of the closed system in this case is zero, and the closed system reaches a steady state. For this type of regulator, the closed system responds well. Meanwhile, the open system cannot respond well. In general, for a regulator of this type, the system responds well to the presence of noisy signals. Step Response Step response of LQG regulator for the closed system "G 2 (s)." Step Response 0. Step Response Figure 12: Step response of LQG regulator for the closed system "G 3 (s)." Step Response Step Response Amplitude ×10 8 Step Response Step Response Step Response Step Response Figure 26: Step response of PID controller for the closed system "G 3 (s)."  10 Step Response Step Response e impulse response for the closed system (highlighted in blue in Figure 18) is better than that for the open system (highlighted in green in Figure 19). e value of amplitude of the oscillation of the closed system in this case is zero, and the closed system reaches a steady state. For this type of regulator, the closed system responds well. e value of amplitude of the oscillation of the open system in this case is large, and the open system does not reach a steady state. e step response for the closed system (highlighted in green in Figure 16) is better than that for the open system (highlighted in red in Figure 17). e value of amplitude of the oscillation of the closed system in this case is − 200, and the closed system reaches a steady state. For this type of regulator, the closed system responds well. Meanwhile, the open system cannot respond well. In general, for a regulator of this type, the system responds well to the presence of noisy signals. e impulse response for the closed system (highlighted in green in Figure 20) is worse than that for the open system (highlighted in blue in Figure 21). e value of amplitude of the oscillation of the closed system in this case is large, and the closed system does not reach a steady state. For this type of regulator, the closed system does not respond well. e value of amplitude of the oscillation of the open system in this case is large, and the open system does not reach a steady state. e step response for the closed system (highlighted in red in Figure 22) is better than that for the open system (highlighted in green in Figure 23). e value of amplitude of the oscillation of the closed system in this case is large, and the closed system does not reach a steady state. For this type of regulator, the closed system does not respond well.
Meanwhile, the open system cannot respond well. In general, for a regulator of this type, the system does not respond well. e impulse response for the closed system (highlighted in green in Figure 28) is better than that for the open system (highlighted in blue in Figure 29). e value of amplitude of the oscillation of the closed system in this case is zero, and the closed system reaches a steady state. For this type of regulator, the closed system responds well. e value of amplitude of the oscillation of the open system in this case is large, and the open system does not reach a steady state. e step response for the closed system (highlighted in red in Figure 30) is better than that for the open system (highlighted in green in Figure 31). e value of amplitude of the oscillation of the closed system in this case is 1.0, and the closed system reaches a steady state. For this type of regulator, the closed system responds well. Meanwhile, the open system cannot respond well. In general, for a regulator of this type, the system responds well. e impulse response for the closed system (highlighted in red in Figure 24) is better than that for the open system (highlighted in green in Figure 25). e value of amplitude of the oscillation of the closed system in this case is zero, and the closed system reaches a steady state. For this type of regulator, the closed system responds well. e value of amplitude of the oscillation of the open system in this case is large, and the open system does not reach a steady state. e step response for the closed system (highlighted in green in Figure 26) is better than that for the open system (highlighted in red in Figure 27). e value of amplitude of the oscillation of the closed system in this case is 0.992, and the closed system reaches a steady state. For this type of regulator, the closed system responds well. Meanwhile, the open system cannot respond well. In general, for a regulator of this type, the system responds well. e impulse response for the closed system (highlighted in green in Figure 32) is better than that for the open system (highlighted in blue in Figure 33). e value of amplitude of the oscillation of the closed system in this case is large, and the closed system does not reach a steady state. For this type of regulator, the closed system does not respond well. e value of amplitude of the oscillation of the open system in this case is large, and the open system does not reach a steady state. e step response for the closed system (highlighted in blue in Figure 34) is better than that for the open system (highlighted in red in Figure 35). e value of amplitude of the oscillation of the closed system in this case is large, and the closed system does not reach a steady state. For this type of regulator, the closed system does not respond well. Meanwhile, the open system cannot respond well. In general, for a regulator of this type, the system does not respond well. e efficiency of the above control methods is in the following descending order: (A) LQG regulator (B) PID controller

Conclusions
LQG regulator in the case of interference signals affecting this system has been proposed in this study. Simulation shows positive results. is allows the closed system to reach steady state in a short time. At the same time, it also helps the open system to reach steady state in half the time compared to the closed system at the beginning of the operating cycle of the system. Although the steady state is achieved, simulation results of this type of regulator are only true in the case where the link angle between link 1 and link 2 is small compared to the equilibrium point. To widen the link angle between link 1 and link 2, further studies can be conducted in the future.

Data Availability
e data used to support the findings of this study are included within the article.

Conflicts of Interest
e author declares that there are no conflicts of interest.