A Novel Closed-Loop Structure for Drag-Free Control Systems with ESKF and LQR

Ye, Xiaorong; Lian, Junxiang; Zhao, Guoying; Zhang, Dexuan

doi:10.3390/s23156766

Open AccessArticle

A Novel Closed-Loop Structure for Drag-Free Control Systems with ESKF and LQR

by

Xiaorong Ye

^1,2,

Junxiang Lian

^1,2,*,

Guoying Zhao

^1,2 and

Dexuan Zhang

^1,2

¹

TianQin Research Center for Gravitational Physics, Sun Yat-sen University, Zhuhai 519082, China

²

School of Physics and Astronomy, Sun Yat-sen University, Zhuhai 519082, China

^*

Author to whom correspondence should be addressed.

Sensors 2023, 23(15), 6766; https://doi.org/10.3390/s23156766

Submission received: 4 July 2023 / Revised: 25 July 2023 / Accepted: 27 July 2023 / Published: 28 July 2023

(This article belongs to the Special Issue Sensing and Modern Control Techniques for Aerospace Systems)

Download

Browse Figures

Versions Notes

Abstract

:

Space-borne gravitational wave detection satellite confronts many uncertain perturbations, such as solar pressure, dilute atmospheric drag, etc. To realize an ultra-static and ultra-stable inertial benchmark achieved by a test-mass (TM) being free to move inside a spacecraft (S/C), the drag-free control system of S/C requires super high steady-state accuracies and dynamic performances. The Active Disturbance Rejection Control (ADRC) technique has a certain capability in solving problems with common perturbations, while there is still room for optimization in dealing with the complicated drag-free control problem. When faced with complex noises, the steady-state accuracy of the traditional control method is not good enough and the convergence speed of regulating process is not fast enough. In this paper, the optimized Active Disturbance Rejection Control technique is applied. With the extended state Kalman filter (ESKF) estimating the states and disturbances in real time, a novel closed-loop control structure is designed by combining the linear quadratic regulator (LQR) and ESKF, which can satisfy the design targets competently. The comparative analysis and simulation results show that the LQR controller designed in this paper has a faster response and a higher accuracy compared with the traditional nonlinear state error feedback (NSEF), which uses a deformation of weighting components of classical PID. The new drag-free control structure proposed in the paper can be used in future gravitational wave detection satellites.

Keywords:

extended state Kalman filter; linear quadratic regulator; drag-free control

1. Introduction

Gravitational wave astronomy provides a new tool to explore black holes, dark matter, the early universe, and the evolution of the universe. To detect gravitational waves in space, a strategy involves deploying multiple satellites in mega-satellite formations to measure tiny changes in the relative distances between satellites when the gravitational wave goes through. However, the challenge lies in the weak characteristics (to the order of

10^{- 21}

) in the changes caused by gravitational waves, which can be easily influenced by extraneous perturbations and noise. To address this issue, the typical approach is to employ drag-free satellites: an inner test mass is shielded and free-falls along the geodesic of spacetime, and the outer satellite counteracts non-conservative forces and tracks the test mass in a sensitive axis. This setup creates an ultra-static and ultra-stable platform, with the test mass serving as an inertial reference for the measurement of relative distances in space. Using this approach, it is possible to obtain accurate and reliable measurements of distance variation. And the variation represents the effect of gravitational waves.

In the drag-free control loop, there are many factors that can impact the effect of the controller, such as external environmental disturbances, sensor measurement noise, process noise, and other inevitable disturbances and noises. Models of these disturbances and noises are hard to be built precisely, which makes it difficult to determine the appropriate models for compensation. To address this challenge, the Extended State Observer (ESO) method has emerged as an effective solution for modeling, estimating, and identifying disturbances in the drag-free control loop. Using the ESO method, it is possible to obtain estimates of the disturbances, which can help the design of the controller. This method has significant implications for the development of advanced control systems for drag-free satellites, thereby improving performance and reliability.

The Active Disturbance Rejection Control (ADRC) technique, proposed by Han [1,2], combines the “anti-disturbance” and “model independence” of PID control with the idea of the state observer. The Extended State Observer (ESO) is the core of ADRC, providing a way to estimate and compensate for disturbances and uncertainties. Huang [3,4] demonstrated the design method and proof of convergence for nonlinear ESO of second- and third-order systems, showing that it can achieve fast convergence without oscillation, even in the presence of model uncertainty and disturbances. However, the complexity of nonlinear ESO increases with the growth of the number of parameters, making tuning more challenging. Despite this drawback, the effectiveness of nonlinear ESO in mitigating disturbances and uncertainties makes it a promising technique for advanced control systems in various applications. Gao [5] proposed a parameter design method for linearized ESOs based on bandwidth, which effectively reduces the design threshold and improves the convenience of the application. Yang [6] analyzed the observation error of the ESO for different forms of disturbances and concludes that when the disturbance is bounded or its derivative is bounded, ESO can effectively estimate it, and the observation error is bounded. In addition, Jin [7], Chen [8], and Gan [9] analyzed the stability of the ESO using different methods, and Shao [10] analyzed high-order ESOs by adding higher-order derivative of disturbance as the extended state. Although increasing the extended order can effectively reduce the estimation error of each state, increasing the order and bandwidth simultaneously also affects the high-frequency noise suppression effect. Therefore, a trade-off between the expansion order and bandwidth is needed to balance the estimation accuracy and the high-frequency noise suppression effect. The Extended State Kalman filter (ESKF) proposed by Xue [11] combines the advantages of both extended state observer and Kalman filter to filter the noise and estimate the system state and disturbance in dealing with nonlinear systems with strong nonlinearity, large initial estimation error, and severe noise. The Extended State Kalman filter provides a potential solution to the problem of disturbance identification for the drag-free control of gravitational wave detection satellites, when the conventional filtering methods are not sufficient to estimate the disturbance.

The Linear Quadratic Regulator (LQR) is a widely used engineering tool in the aerospace industry due to its ability to achieve optimal control under specific performance requirements. Its simplicity in design has made it a popular choice for various applications, such as quadrotors [12,13,14,15,16], hypersonic vehicles [17,18], airborne remote gimbal [19], and satellite formation problems [20,21] etc.

In the field of control engineering, the Linear Quadratic Regulator (LQR) is widely used for linear problems. To ensure the effectiveness of the controller, an accurate linear model must be established or a nonlinear model should be linearized prior to the LQR design. In cases where low control performance requirements are sufficient, typically disturbances and noise are not handled directly but are instead compensated for through control. In order to improve the robustness and disturbance resistance of the traditional LQR controller, Lu [12] introduced the Extended State Observer (ESO) to estimate random low-frequency disturbances and the estimation of ESO is used by LQR. Attitude control of spacecraft with low precision requirements, disturbances, and noise are often not preprocessed and are only compensated for through LQR controllers. For systems with higher performance requirements, such as the six-degree-of-freedom attitude control system described in Ref. [17], a combination of ESO and LQR is used to achieve higher control accuracy and stronger disturbance resistance compared to using LQR control alone. In Lin’s research [19], a standard nonlinear ESO was employed in combination with LQR to estimate and compensate for multi-source perturbations, ultimately improving the control of LQR for uncertain systems. While these studies successfully combined an extended state observer with LQR control and achieved some improvement, they focused solely on the estimation and compensation of perturbations without considering the suppression of noise.

The accuracy required for drag-free and attitude control in gravitational wave detection is crucial, so the impact of noise must be considered. Previous methods are insufficient in dealing with the noise affecting gravitational wave detection satellites, and cannot estimate the perturbations effectively. Those methods also suffer from longer setting times. To design a successful control system, it is necessary to develop effective strategies to reduce the noise impact on control performance. In this study, we propose a novel approach that combines ESKF with LQR control. We use the state and disturbance estimated by ESKF as the input information for the controller, ensuring optimized control. Our analysis and simulations show that this new approach outperforms traditional solutions. It effectively shortens the adjustment time, reduces the number of oscillations, compensates for disturbances, and suppresses noise, ultimately achieving the desired design specifications.

The paper is organized as follows: in Section 2, a dynamic model of a single test mass drag-free satellite is established. In Section 3 and Section 4, the design process and calculation methods of the ESKF and LQR are presented, respectively. In Section 5, the performance of the system using the LQR controller and NSEF controller is compared through numerical simulations, indicating that the overall performance of the LQR controller is superior to that of the NSEF controller when using ESKF as the estimation method. The conclusion is given in Section 6.

2. Dynamics Modeling

This paper takes a single test mass, a drag-free satellite in geocentric orbit as the research subject, as is shown in Figure 1, where C indicates the center of mass of an object, h represents the sensitive cavity, which is fixed to the satellite, then the position vector

r_{h}

from the center of the sensitive cavity to the center of mass (CoM) of the satellite is constant, and the position vector of test mass relative to the satellite is

r = r_{h} + r_{r e l}

. The relative translation equations of motion in the inertial system are first transformed into the satellite body coordinate system, similarly the relative attitude equations of motion are projected into the TM body coordinate system, which is illustrated in Figure 2. Then a comprehensive drag-free satellite dynamics model can be established as follows.

{\ddot{φ}}_{sc} = I_{sc}^{- 1} (T_{Csc} + w_{T_{Csc}} + T_{D s c}),

(1)

\begin{matrix} {\ddot{r}}_{r e l}^{h} & = \frac{1}{m_{t m}} (F_{G t m}^{h} + F_{D t m}^{h} + F_{S C t m}^{h}) - \frac{1}{m_{s c}} (F_{G s c}^{h} + F_{C s c}^{h} + F_{D s c}^{h} + F_{T M s c}^{h}) \\ - 2 ω_{s c}^{h} \times {\dot{r}}_{r e l}^{h} - ω_{s c}^{h} \times (ω_{s c}^{h} \times (r_{h}^{h} + r_{r e l}^{h})) - {\dot{ω}}_{s c}^{h} \times (r_{h}^{h} + r_{r e l}^{h}), \end{matrix}

(2)

\begin{matrix} {\ddot{φ}}_{r e l}^{t m} & = {(I_{t m})}^{- 1} [- (ω_{r e l}^{t m} ω_{s c}^{t m}) \times (I_{t m} (ω_{r e l}^{t m} + ω_{s c}^{t m}))] + {(I_{t m})}^{- 1} [T_{G t m}^{t m} + T_{D t m}^{t m} + T_{S C t m}^{t m}] \\ - A_{T S} {\dot{ω}}_{s c}^{s c} - A_{T S} ω_{s c}^{s c} \times ω_{r e l}^{t m} \end{matrix}

(3)

where

w_{T_{Csc}}

indicates input noise, here assume that all input forces and moments acting on the satellite, as well as the measurement output of the sensor, are subject to noise,

T_{D s c}

denotes the disturbance moment to the satellite,

F_{D t m}

and

F_{D s c}

denote the test mass and the disturbance force on the satellite, respectively, and

T_{D t m}

indicates the disturbance moment to the test mass. The

s c

,

t m

,

r e l

, C, D, and G subscripts indicate the S/C, Test Mass, measurements RELated to the sensitive cavity, Control command, Disturbance, and Gravity. The h superscript indicates the components in the sensitive cavity frame, and

A_{T S}

is the coordinate transformation from the satellite frame to the test mass frame.

The dynamic model is expressed in the form of state space equations which are presented as follows:

\begin{matrix} {\dot{X}}_{0} & = A_{0} X_{0} + B_{0} (u + w + f) \\ Y & = C_{0} X_{0} + d \end{matrix}

(4)

where

\begin{matrix} A_{0} = [\begin{matrix} 0_{3} & I_{3} & 0_{3} & 0_{3} & 0_{3} & 0_{3} \\ 0_{3} & 0_{3} & 0_{3} & 0_{3} & 0_{3} & 0_{3} \\ 0_{3} & 0_{3} & 0_{3} & I_{3} & 0_{3} & 0_{3} \\ 0_{3} & 0_{3} & - \frac{K_{t r a n s}}{m_{t m}} & - \frac{D_{t r a n s}}{m_{t m}} & 0_{3} & 0_{3} \\ 0_{3} & 0_{3} & 0_{3} & 0_{3} & 0_{3} & 0_{3} \\ 0_{3} & 0_{3} & 0_{3} & 0_{3} & I_{t m}^{- 1} K_{r o t} & I_{t m}^{- 1} D_{r o t} \end{matrix}], B_{0} = [\begin{matrix} 0_{3} & 0_{3} & 0_{3} \\ I_{s c}^{- 1} & 0_{3} & 0_{3} \\ 0_{3} & 0_{3} & 0_{3} \\ 0_{3} & - \frac{I_{3}}{m_{s c}} & 0_{3} \\ 0_{3} & 0_{3} & 0_{3} \\ - I_{s c}^{- 1} & 0_{3} & I_{t m}^{- 1} \end{matrix}], \\ C_{0} = [\begin{matrix} I_{3} & 0_{3} & 0_{3} & 0_{3} & 0_{3} & 0_{3} \\ 0_{3} & 0_{3} & I_{3} & 0_{3} & 0_{3} & 0_{3} \\ 0_{3} & 0_{3} & 0_{3} & 0_{3} & I_{3} & 0_{3} \end{matrix}] \end{matrix}

where

K_{t r a n s}

,

D_{t r a n s}

,

K_{r o t}

,

D_{r o t}

are the coupling coefficient matrices for translation and rotation, respectively. u is the system control variable, w is input noise, d is measurement noise, and f represents the total perturbation affecting the system, including the known part and the unmodeled part.

The Gravitational Wave Detector-TianQin requires detection satellites in deep-space orbit. The main disturbance on the satellite comes from solar pressure. To ensure a steady power supply and minimize fluctuations in the satellite’s internal thermal environment for ultra-stability, the drag-free satellite uses a body-attached battery array.

The expression of this perturbation is shown below:

{\vec{F}}_{r} = - k C_{R} ρ_{S R} (\frac{S_{R}}{m}) {\vec{r}}_{s}

(5)

where

C_{R}

indicates surface reflection coefficient, normally 1–1.44,

ρ_{S R}

indicates the solar pressure near the Earth,

4.56 \times 10^{- 6} N / m^{2}

,

(\frac{S_{R}}{m})

is the surface-to-mass ratio of the spacecraft,

S_{R}

is the projected area of the spacecraft facing the sun,

{\vec{r}}_{s}

is the unit vector indicating the direction from the center of the Earth to the sun. The sun exposure factor, denoted as k, is assumed to be 1 for the light area and 0 for the ground shadow area. The amplitude spectral density of the solar pressure on the satellite is shown in Figure 3.

In a drag-free satellite with a single test mass, the displacement measurements between the CoM of the TM and the CoM of the satellite, as well as the attitude measurements of the TM relative to the satellite, are obtained by an inertial sensor. The attitude of the satellite is determined through a star sensor, and the micro-propulsion provides the necessary control forces and moments to maintain the desired position and attitude of the satellite. At present, the typical micro-propulsion systems have a noise power spectral density of

1 \times 10^{- 6} N / \sqrt{Hz}

under open-loop condition [22]. Their corresponding power spectral densities are presented in Figure 4, Figure 5 and Figure 6.

Based on the data of Ref. [23], it can be inferred that electrostatic actuation noise can be treated as white noise in the frequency band needed for the detection of gravitational waves by the TianQin detector.

In the case of capacitive displacement sensors, the noise levels for displacement measurements are equal in the x, y, and z directions, while the noise levels for angle measurements in the

θ

direction are one order of magnitude lower than those in the

η

and

ϕ

directions [23]. Figure 6 displays the measurement noise in each direction.

The spectral density curves of perturbation and noise are given in Figure 3, Figure 4, Figure 5 and Figure 6, which will be used as the basis for the modeling and simulation calculations later.

3. Extended State Kalman Filter Design

Achieving high accuracy of relative displacement and relative attitude control in a noisy and disturbed environment requires multiple steps, including disturbance estimation, noise suppression, and state control.

The Extended State Kalman Filter (ESKF) can estimate nonlinear uncertainty. In cases of initial error, uncertain dynamics (perturbation), and bounded noise, the perturbation is estimated and compensated for by the extended state, and noise effects can be suppressed. This paper uses ESKF to estimate disturbance forces and moments, such as solar pressure on the satellite and anomalous electromagnetic forces and moments on the test mass. First, we present the design scheme of ESKF, then apply it to uncertain disturbance estimation in drag-free control.

3.1. Extended State Kalman Filter

For the following discrete system containing uncertain perturbations

\{\begin{matrix} W_{k + 1} = A_{k} W_{k} + B_{k} f (W_{k}, k) + w_{k} \\ Y_{k} = C_{k} W_{k} + n_{k} \end{matrix}, k = 0, 1, 2, \dots,

(6)

where

W_{k}

is the system state,

A_{k}, B_{k}

are system matrices,

C_{k}

is the measurement matrix,

f (W_{k}, k)

is the nonlinear uncertain part in the system (6),

w_{k}, n_{k}

are the process noise and measurement noise, respectively, and

Y_{k}

is the system measurement output. Treating

f (W_{k}, k)

as an additional state variable

f_{k}

, which is then estimated and compensated for. The extended system is described as

\{\begin{matrix} [\begin{matrix} W_{k + 1} \\ f_{k + 1} \end{matrix}] = {A_{k}}^{'} [\begin{matrix} W_{k} \\ f_{k} \end{matrix}] + {B_{k}}^{'} G_{k} + [\begin{matrix} w_{k} \\ 0 \end{matrix}] \\ Y_{k} = {C_{k}}^{'} [\begin{matrix} W_{k} \\ f_{k} \end{matrix}] + n_{k} \end{matrix}

(7)

where

A_{k}^{'} = [\begin{matrix} A_{k} & B_{k} \\ 0 & I \end{matrix}]

,

B_{k}^{'} = [\begin{matrix} 0 \\ I \end{matrix}]

,

C_{k}^{'} = [\begin{matrix} C_{k} & 0 \end{matrix}]

,

f_{k} = f (W_{k}, k)

,

G_{k} \overset{Δ}{=} f_{k + 1} - f_{k}

, assume

w_{k}

,

n_{k}

are unrelated zero-mean Gaussian random series and

E (n_{k} {n_{k}}^{T}) \leq R_{k}

,

E (w_{k} {w_{k}}^{T}) \leq S_{k}

,

E ([\begin{matrix} W_{0} - {\hat{W}}_{0} \\ f_{0} - {\hat{f}}_{0} \end{matrix}] {[\begin{matrix} W_{0} - {\hat{W}}_{0} \\ f_{0} - {\hat{f}}_{0} \end{matrix}]}^{T}) \leq P_{0}

,

{\hat{W}}_{0}

is the estimation of

W_{0}

,

{\hat{f}}_{0}

is the initial value of the nominal part of

f (W_{k}, k)

,

P_{0}

is a known constant matrix, and

E (G_{i}^{2}) \leq {\bar{q}}_{i}, i = 1, 2, \dots

,

q_{i}

is bounded.

According to the classical state observer theory, the extended state observer for the extended state Equation (7) is shown below, where

X_{k + 1} = [\begin{matrix} W_{k + 1} \\ f_{k + 1} \end{matrix}]

{\hat{X}}_{k + 1} = A_{k} {\hat{X}}_{k} + B_{k} {\hat{G}}_{k} - K_{k} (Y_{k} - C_{k} {\hat{X}}_{k})

(8)

Based on the given initial estimation

{\hat{X}}_{0}

and initial value of covariance matrix

P_{0}

, we obtain the middle value of the estimated quantity

{\hat{X}}_{k}^{-}

and update the value of the covariance matrix

{P_{k}}^{-}

{\hat{X}}_{k}^{-} = A_{k}^{'} {\hat{X}}_{k - 1} + B_{k}^{'} u_{k - 1} + B_{e} {\hat{G}}_{k}

(9)

\begin{matrix} {P_{k}}^{-} & = (1 + θ) A_{k}^{'} P_{k - 1} {A_{k}^{'}}^{T} + (1 + \frac{1}{θ}) Q_{1, k - 1} + Q_{2, k - 1} \end{matrix}

(10)

where

θ = \sqrt{\frac{tr (Q_{1, 0})}{tr (P_{0})}}

is used to decouple the cross terms of estimation error and uncertainty,

Q_{1, 0} = 4 B_{e} Q_{0} {B_{e}}^{T}

,

Q_{2, 0} = B_{k}^{'} S_{k} {B_{k}^{'}}^{T}

,

S_{k}

is the variance of

w_{k}

,

R_{k}

is the variance of

n_{k}

,

{\hat{G}}_{k}

is the estimation of

G_{k}

, whose value is calculated by

{\hat{G}}_{k} = s a t ({\bar{G}}_{k}, \sqrt{q_{i}})

, where

s a t (f, b) = {\begin{matrix} b & f > b \\ \begin{matrix} f \\ - b \end{matrix} & \begin{matrix} b > f > - b \\ - b > f \end{matrix} \end{matrix}

(11)

by calculating

K_{k}

, update the estimation

{\hat{X}}_{k}

and the covariance matrix

P_{k}

K_{k} = {P_{k}}^{-} {C_{k}^{'}}^{T} {({C^{'}}_{k} {P_{k}}^{-} {C^{'}}_{k}^{T} + R_{k})}^{- 1}

(12)

{\hat{X}}_{k} = {\hat{X}}_{k}^{-} + K_{k} (Y_{k} - {C^{'}}_{k} {\hat{X}}_{k}^{-})

(13)

P_{k} = (I - K_{k} C_{k}^{'}) {P_{k}}^{-} {(I - K_{k} C_{k}^{'})}^{T} + K_{k} R_{k} {K_{k}}^{T}

(14)

After that, calculate the control variable

u_{0}

based on the error between the state estimate and the reference, then the calculated control variable

u_{0}

and the estimated value

\hat{f}

of the disturbance are used to calculate the final control variable u, and the flow chart is shown in Figure 7.

3.2. Extended State Design of Drag-Free Control System

The complex space environment presents a challenge in accurately modeling and describing perturbations affecting on satellite and test masses. Conventional control methods are model-dependent, and their performance can be severely affected by the inaccuracies of the perturbation model. To address this issue, Section 3.1 proposes a method to expand the perturbations into new states and create a new filter model. Then the perturbation can be compensated and the noise can be suppressed.

The uncertain disturbance term is

f = {[T_{D s c}, F_{D s c}, T_{D t m}]}^{T}

.

T_{D s c}

represents the disturbance moments that the satellite is subjected to,

F_{D s c}

represents the disturbance forces that the satellite is subjected to, and

T_{D t m}

represents the disturbance moment that TM is subjected to. Those are treated as extended states. The state vector is taken as

X = {[φ_{s c}, {\dot{φ}}_{s c}, r_{r e l}, {\dot{r}}_{r e l}, φ_{r e l}, {\dot{φ}}_{r e l}, T_{D s c}, F_{D s c}, T_{D t m}]}^{T}

,

Y = {[φ_{s c}, r_{r e l}, φ_{r e l}]}^{T}

indicates related attitude and displacement that can be measured.

According to Section 3.1, the extended state differential equation is obtained as

\begin{matrix} \dot{X} = A X + B (u + w) + B_{e} \dot{f} \\ Y = C X + d \end{matrix}

(15)

where

A = [\begin{matrix} A_{0} & B_{0} \\ 0 & I \end{matrix}], B = [\begin{matrix} B_{0} \\ 0 \end{matrix}], C = [\begin{matrix} C_{0} & 0 \end{matrix}]

,

B_{e} = {[0, 0, 1]}^{T}

, symbols has the same physical meaning as in Equation (4).

The extended state Equation (15) is discretized to obtain the discrete difference equation model (16)

\begin{matrix} X_{k + 1} & = A_{d} X_{k} + B_{d} (u + w) + B_{d} G_{k} \\ Y_{k} & = C_{d} X_{k} + d_{k} \end{matrix}

(16)

where

G_{k} \overset{Δ}{=} f_{k + 1} - f_{k}

.

The ESKF is designed according to the flowchart shown in Figure 7 to estimate the kinematic state parameters and the unknown disturbances, while implementing feedback control. Where

q_{i} = {(\max |f_{i + 1} - f_{i}|)}^{2}

,

Q_{0} = i diag (q_{i})

, i is the number of state vector.

4. Feedback Controller Design

Once the estimation of states and perturbations has been successfully obtained, the next step is the selection of an appropriate controller. The Linear Quadratic Regulator (LQR) is a control strategy designed to minimize a cost function. The optimal control law is obtained through the design of the state feedback controller, which allows for the completion of closed-loop optimal control in a fast, stable, and accurate manner.

The performance index for LQR control reflects the requirements for both state and control quantities, and the cost function used in this paper is

J = \sum_{0}^{n} (x {(k)}^{T} Q x (k) + u {(k)}^{T} R u (k))

(17)

The weighting matrix Q is semi-positive definite and R is positive definite, which are set as a diagonal matrix in the subsequent simulation. For the first term in the cost function J, each component is required to be small in the control process. The larger the weight in Q means the stricter the constraint on the components; while the second term in the cost function indicates the requirement for the control output, which is weighted according to the different characteristics of each component.

The Ricatti equation

P A + A^{T} P - P B R^{- 1} B^{T} P + Q = 0

is used to calculate P, then based on

K = R^{- 1} B^{T} P

, the feedback gain matrix is calculated, the control law of LQR is chosen as

u (k) = - K \hat{x} (k)

.

In order to maximize the effective time of gravitational wave detection and minimize the output of thrusters (actuators), it is necessary to ensure that the detection satellite is maintained in a free and stable flight for as long a time as possible. To achieve this goal, the adjustment transition process of drag-free control needs to be as fast as possible, allowing the detection system to quickly reach an ultra-static and ultra-stable state. Our research shows that the LQR control strategy can satisfy the optimal control requirements, enabling a rapid transition to a steady state and achieving the required control accuracy. By appropriately designing the weights of each component, in addition to the values of Q and R, it is possible to achieve a more refined control.

Nonlinear error feedback is used in classical Active Disturbance Rejection Control [1], mainly by rewriting the weighting of classical PID control into a nonlinear combination, as shown in Equation (18)

f a l (e, α, δ) = {\begin{matrix} \frac{e}{δ^{α - 1}}, |e| \leq δ \\ sgn (e) {|e|}^{α}, |e| > δ \end{matrix}

(18)

It is a continuous power series function with linear segments near the origin,

δ > 0

denotes the length of the interval of the linear segment, and e indicates the amount of error.

This function

f a l ()

is characterized by increasing the gain when the error is small and using a small gain when the error is large, which prevents high-frequency chattering due to excessive gain calculated when the error is small [24,25]. The control law for nonlinear error feedback is

\begin{matrix} u & = γ_{1} f a l (e_{1}, α_{1}, δ) + γ_{2} f a l (e_{2}, α_{2}, δ) + γ_{3} f a l (e_{3}, α_{3}, δ) \end{matrix}

(19)

where

e_{1} (k) = r - \hat{x} (k), e_{2} (k) = \sum_{0}^{k} (r - \hat{x} (k)), e_{3} = \dot{r} - \hat{\dot{x}} (k)

. Parameter triples

γ_{1}, γ_{2}, γ_{3}

determine the final control variable, the value of

δ

is generally selected as the sampling time for discrete systems, the value of

α

satisfies

α \in (0, 1)

.

A simulation program using the combination of ESKF and LQR is developed, then the comparison results of the combination of ESKF+NSEF and the combination of ESKF+LQR is presented and analyzed in the next section.

5. Simulation Analysis

According to the discrete model (16), a block diagram of the drag-free satellite control system is designed and presented in Figure 8. The measurement mechanism provides information on the attitude angle of the satellite, the displacement, and the attitude of the test mass relative to the center of the cavity. The output command of actuators and output measurement information of the sensors are both inputs of ESKF. The ESKF estimates the attitude, angular velocity, displacement, velocity, disturbance forces, and disturbance moments. By selecting an appropriate controller with the ESKF, high-accuracy anti-disturbance control of the drag-free satellite/test mass dynamic system is achieved.

First, we consider the LQR controller. The ESKF filter design discussed in Section 3 was simulated in MATLAB with a time step of 0.01 s. The ESKF consists of 27 states, which are divided into three groups of nine states each. After the ESKF output was stable and tracking accurately. The LQR control algorithm was then employed for designing the control law, and the corresponding simulation results were obtained.

Initial conditions of simulation:

The perturbation forces and moments of the satellite are modeled as constant superposition sinusoidal perturbations with phase differences in each axial direction. Specifically:

F_{D s c} = [\begin{matrix} - 12.8 + 7.7 \times sin (ω_{d} t) \\ - 12.8 + 7.7 \times sin (ω_{d} t + \frac{2 π}{3}) \\ - 12.8 + 7.7 \times sin (ω_{d} t + \frac{4 π}{3}) \end{matrix}] \times 10^{- 7} (N)

, the disturbance moment to the satellite is modeled as

T_{D s c} = [\begin{matrix} - 1.2 + 6.6 \times sin (ω_{d} t) \\ - 1.2 + 6.6 \times sin (ω_{d} t + \frac{2 π}{3}) \\ - 1.2 + 6.6 \times sin (ω_{d} t + \frac{4 π}{3}) \end{matrix}] \times 10^{- 6} (N \cdot m)

, moment of disturbance to the test mass is

T_{D tm} = [\begin{matrix} - 1.2 + 7.7 \times sin (ω_{d} t) \\ - 1.2 + 7.7 \times sin (ω_{d} t + \frac{2 π}{3}) \\ - 1.2 + 7.7 \times sin (ω_{d} t + \frac{4 π}{3}) \end{matrix}] \times 10^{- 12} (N \cdot m)

, where

ω_{d} = 1.2 \times 10^{- 3} Hz

. The simulation program will achieve real-time estimation of the extended state for the above perturbations, and the results of the error analysis of the estimation are given later. The expectation of the input noise of the thrusters providing force and moment to the satellite are

1 \times 10^{- 9} N / \sqrt{Hz}

and

1 \times 10^{- 9} N \cdot m / \sqrt{Hz}

, respectively. The input noise expectation of the electrostatic actuator providing the test mass control torque is

1 \times 10^{- 15} N \cdot m / \sqrt{Hz}

, the expectation of the measurement noise of the star sensor providing satellite attitude measurement is set as

1 \times 10^{- 7} rad / \sqrt{Hz}

. And the expectation of displacement measurement noise of inertial sensor is

1 \times 10^{- 9} m / \sqrt{Hz}

, the expectation of attitude measurement noise is

1 \times 10^{- 8} rad / \sqrt{Hz}

. To meet the requirements of gravitational wave detection, the control loop’s design objective is set as follows: the setting time should be less than 1 min, and the amplitude spectral density of the relative displacement between test mass and satellite should both be less than

10^{- 8} m / \sqrt{Hz}

within the detection frequency band.

The performance of the combination of LQR and ESKF is evaluated initially. Subsequently, the performance of using the combination of LQR and ESKF is compared with the performance of the combination of NSEF and ESKF in terms of control accuracy and setting time. The intrinsic reasons for any differences observed are analyzed, and recommendations for engineering design are provided.

5.1. ESFK+LQR

By utilizing the ESKF-estimated states as input, we designed an LQR controller based on Equation (17). The initial value of state estimation was

{\hat{X}}_{0} = {[0, 0, 0, 0, 0, 0, 0, 0, 0]}^{T}

, and the values of Q and R were selected as

R = 1 \times 10^{- 4} diag (I_{3}, 10 I_{3}, I_{3})

. It should be noted that a larger value of Q can facilitate faster convergence of the states for the LQR controller.

5.1.1. Satellite Attitude

Figure 9 demonstrates the effectiveness of the ESKF and LQR controllers in controlling the attitude angle of the satellite. The results show that the attitude angle was successfully controlled from the initial

φ_{s c} = {[7 \times 10^{- 5}, 0, 0]}^{T} rad

to

\pm 3 \times 10^{- 8} rad

for all three attitude angles with a setting time of about 7 s, achieving the desired control target. The estimation error of the attitude angle was measured to be

\pm 2 \times 10^{- 8} rad

, while the estimation error of the attitude angular velocity and disturbance moment were

\pm 2 \times 10^{- 10} rad / s

and

\pm 1 \times 10^{- 5} N \cdot m

, respectively.

The amplitude spectral density curves for each attitude angle control error are presented in Figure 10. Based on the results shown in Figure 10, it can be observed that within the measurement bandwidth, the amplitude spectral density of the satellite’s attitude control error conforms to the design requirements of

10^{- 7} rad / \sqrt{Hz}

.

5.1.2. Test Mass and Satellite Relative Displacement

The results in Figure 11 show that the relative displacement was successfully controlled from

r_{r e l} = {[0.0005, 0.0009, - 0.0006]}^{T} m

to

\pm 2 \times 10^{- 9} m

for all three axes within an adjustment time of 20 s, achieving the desired control target. The position estimation error was measured to be

\pm 2.2 \times 10^{- 9} m

, while the velocity estimation error and disturbance estimation error were

\pm 4 \times 10^{- 10} m / s

and

\pm 4 \times 10^{- 8} N

, respectively.

The amplitude spectral density curves for each axial direction are presented in Figure 12. Based on the observations from Figure 12, it can be concluded that the kinematic indexes of the translation within the frequency band satisfy the design requirements and achieve

10^{- 8} m / \sqrt{Hz}

.

5.1.3. Relative Attitude between Test Mass and Satellite

Figure 13 shows the simulation results of ESKF and LQR controller dealing with the relative attitude between test mass and satellite. Based on the results from Figure 13, it can be concluded that the ESKF and LQR controllers designed in this paper were capable of controlling the relative attitude between the test mass and the satellite from

φ_{r e l} = {[0, 4 \times 10^{- 5}, 0]}^{T} rad

to

\pm 5 \times 10^{- 8} rad

for all three axes. The control result was found to be essentially oscillation-free, with an adjustment time of about 6s, achieving the desired control target. The estimation error of the attitude angle was measured to be

\pm 5 \times 10^{- 8} rad

, while the estimation errors of the attitude angular velocity and disturbance moment were

\pm 6 \times 10^{- 8} rad / s

and

\pm 2 \times 10^{- 11} N \cdot m

, respectively.

The amplitude spectral density curves for each attitude angle are presented in Figure 14. The amplitude spectral densities are lower than

5 \times 10^{- 8} rad / \sqrt{Hz}

in the whole frequency band.

5.2. Comparison Evaluation

To demonstrate the efficacy of the designed method, we present a simulation to compare ESKF+LQR with ESKF+NSEF. This allows us to illustrate the functions of our objectives and evaluate their performance against an established method. Simulation results is given in Figure 15, Figure 16 and Figure 17

Since the convergence facility of the

f a l ()

function used in NSEF is mainly based on the value of

α

, while the proof of certain physical relation is extremely complicated, the value of

α

is determined by rule of thumb. Based on the method in Ref. [26], we take

α_{1} = α_{2} = α_{3} = 0.5

in simulation. The simulation results of relative displacement of the x-axis are shown in Figure 16, while the control accuracy is satisfied, and the convergence process of ESKF+LQR was found to be much faster than that of ESKF+NSEF.

The RMS of the two approaches is listed in Table 1. Additionally, the LQR controller was observed to produce negligible oscillations in the system, which can be advantageous in terms of reducing energy consumption and extending the effective time of gravitational wave signal detection.

6. Conclusions

Integrating perturbations into states vector, the ESKF method demonstrates the capability to accurately estimate the state and disturbance of the drag-free satellite dynamics, and effectively suppressing handling noise. This approach lays the foundation for the controller to achieve accurate adjustment of the system state, and it is recommended to consider the ESKF method as a viable alternative for estimating uncertain disturbances in future drag-free engineering designs.

The LQR controller’s feedback parameter matrix can be rigorously derived using the generalized index to ensure optimal control performance. By combining the reasonable values of the state covariance matrix, the designed gain matrix guarantees the convergence speed of the system state, while the accurate estimation of ESKF enhances the relative attitude control accuracy by an order of magnitude.

For the relative displacement, the adjustment time of NSEF fails to reach the control task requirements. This is mainly due to the relatively slow control process of the relative kinematic parameters between the satellite and the test mass in NSEF, and the lack of a complete theoretical method to adjust and optimize the nonlinear error feedback parameters.

In summary, the ESKF+LQR control approach enables the system to reach a steady state rapidly and smoothly, thereby increasing the free flight time available for gravitational wave observation. This is more in line with the desired observation duration for gravitational wave detection. In comparison to the combination of ESKF+NSEF, the combination of ESKF+LQR is capable of adjusting to the reference target swiftly and reducing oscillations, leading to a reduction in thruster mass consumption.

Author Contributions

Conceptualization, X.Y. and J.L.; methodology, X.Y. and J.L.; software, X.Y.; validation, X.Y.; formal analysis, X.Y.; data curation, X.Y.; writing—original draft preparation, X.Y.; writing—review and editing, J.L., X.Y., G.Z. and D.Z.; visualization, X.Y.; supervision, J.L.; project administration, J.L.; funding acquisition, J.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by The National Key Basic Research and Development Program (2020YFC2200701) and Zhujiang Talent Plan (2021QN02Z097).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors would like to thank the editors and reviewers for their constructive comments to improve the paper.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

ADRC	Active Disturbance Rejection Control
NSEF	Nonlinear State Error Feedback
LQR	Linear Quadratic Regulator
ESO	Extended State Observer

References

Han, J. From PID technology to “Active Disturbance Rejection control” technology. Control Eng. China 2002, 9, 13–18. [Google Scholar]
Han, J. From PID to Active Disturbance Rejection Control. IEEE Trans. Ind. Electron. 2009, 56, 900–906. [Google Scholar] [CrossRef]
Huang, Y.; Han, J. Analysis and design for the second order nonlinear continuous extended states observer. Chin. Sci. Bull. 2000, 45, 1938–1944. [Google Scholar] [CrossRef]
Huang, Y.; Wan, H.; Song, J. Analysis and Design for Third Order Nonlinear Continuous Extended States Observer. In Proceedings of the 19th Chinese Control Conference, Hongkong, China, 6–8 December 2000. [Google Scholar]
Gao, Z. Scaling and bandwidth-parameterization based controller tuning. In Proceedings of the American Control Conference, Denver, CO, USA, 4–6 June 2003. [Google Scholar]
Yang, X.; Huang, Y. Capabilities of extended state observer for estimating uncertainties. In Proceedings of the American Control Conference, Hyatt Regency Riverfront, St. Louis, MO, USA, 10–12 June 2009. [Google Scholar]
Jin, H.; Liu, L.; Lan, W. Stability conditions for linear Active Disturbance Rejection control of second-order systems. Acta Autom. Sin. 2018, 44, 1725–1728. [Google Scholar]
Chen, Z.; Sun, M.; Yang, R. Stability Study of Linear Active Disturbance Rejection Controller. Acta Autom. Sin. 2013, 39, 574–580. [Google Scholar]
Gan, Z.; Han, J. Lyapunov function construction for second-order ESO. In Proceedings of the 21th Chinese Control Conference, Hangzhou, China, 12–16 August 2002. [Google Scholar]
Shao, X.; Wang, H. Performance analysis of linearly expanding state observers and their higher-order forms. Control Decis. 2015, 30, 815–822. [Google Scholar]
Bai, W.; Xue, W.; Huang, Y. On extended state based Kalman filter design for a class of nonlinear time-varying uncertain systems. Sci. China Inf. Sci. 2018, 61, 042201. [Google Scholar] [CrossRef]
Pan, J.; Liu, C. ESO-based LQR controller for UAV attitude control. J. Syst. Simul. 2018, 30, 753–759. [Google Scholar]
Lu, K.; Yang, Z.; Xu, C. Study on the flight control of tiltable quadcopter LQR based on nonlinear separation. J. Nanjing Univ. Inf. Sci. Technol. (Nat. Sci. Ed.) 2019, 11, 390–397. [Google Scholar]
Mu, H. Research on Quadcopter Attitude Control Method Based on Flight Simulator. Master’s Thesis, Jiangxi University of Science and Technology, Ganzhou, China, 2021. [Google Scholar]
Chen, H. UAV Attitude Control Based on LQR and Sliding Mode Controller Design. Master’s Thesis, Guilin University of Electronic Technology, Guilin, China, 2020. [Google Scholar]
Liu, L.; Zuo, J.; Wu, J. LQR control method of quadcopter based on RBF-ARX model. Comput. Digit. Eng. 2017, 45, 659–664. [Google Scholar]
Gao, K.; Song, J.; Ai, S. Design of LQR self-disturbance control method for re-entry stage of hypersonic aircraft. J. Astronaut. 2020, 41, 1418–1423. [Google Scholar]
Li, M. Hypersonic Vehicle Controller Design and Performance Evaluation. Master’s Thesis, Huazhong University of Science and Technology, Wuhan, China, 2017. [Google Scholar]
Lin, F.; Ma, H.; Lu, Y. Composite control method of ESO LQR for airborne three-axis gimbal. J. Shenyang Univ. Aeronaut. Astronaut. 2021, 38, 47–53. [Google Scholar]
Wang, L.; Zhang, Q.; Chen, H. Optimal control method for swarm systems formation achievement problem with LQR performance index. Acta Aeronaut. Astronaut. Sin. 2022, 43, 1–10. [Google Scholar]
Xing, J.; Yu, Y.; Wang, Y. Robust control of NEO formation satellites based on improved linear quadratic regulator. J. Natl. Univ. Def. Technol. 2016, 38, 100–106. [Google Scholar]
Yu, D.; Cui, K.; Liu, H.; Zeng, M.; Jiang, W. Micro-newton hall electric propulsion technology for gravitational wave detection. J. Harbin Inst. Technol. 2020, 52, 171–181. [Google Scholar]
Virdis, M. A Meteoroid Impact Recovery Control System for the LISA Gravitational Wave Observatory. Ph.D. Thesis, Politecnico di Torino, Torino, Italy, 2021. [Google Scholar]
Gao, Z.; Huang, Y.; Han, J. An alternative paradigm for control system design. In Proceedings of the 40th IEEE Conference on Decision and Control, Orlando, FL, USA, 4–7 December 2001. [Google Scholar]
Pu, M.; Liu, P.; Xiong, A. Improvement of Fal function and 3 new nonlinear expansion state observers. Control Decis. 2021, 36, 1655–1662. [Google Scholar]
Han, J. Active Disturbance Rejection Control Technology: Control Technology for Estimating Compensation of Uncertainties; Defense Industry Press: Beijing, China, 2008. [Google Scholar]

Figure 1. Relative positions of CoM of satellite, CoM of test mass, and center of cavity.

Figure 2. Coordinate system diagram.

Figure 3. Solar pressure amplitude spectrum density.

Figure 4. Electrostatic suspension actuation noise.

Figure 5. Thruster force noise.

Figure 6. Inertial sensor measurement noise.

Figure 7. ESKF calculation flow chart.

Figure 8. ESKF-based ADRC system design.

Figure 9. Simulation results of satellite attitude. (a) Estimation error of the attitude angle. (b) Estimation error of the attitude angular velocity. (c) Estimation error of disturbance moment. (d) Attitude control results.

Figure 10. Amplitude spectral density of attitude of the satellite.

Figure 11. Simulation results of relative displacement between test mass and satellite. (a) Estimation error of the relative displacement. (b) Estimation error of the velocity. (c) Estimation error of disturbance. (d) Control results of relative displacement.

Figure 12. Amplitude spectral density of relative displacement.

Figure 13. Simulation results of relative attitude between test mass and satellite. (a) Estimation error of the relative attitude. (b) Estimation error of the attitude angular velocity. (c) Estimation error of disturbance moment. (d) Relative attitude control results.

Figure 14. Amplitude Spectral Density of the related attitude between test mass and satellite.

Figure 15. Simulation result of satellite attitude (pitch angle) between LQR and NSEF. (a) Control result in time domain. (b) Comparison of Amplitude Spectral Density.

Figure 16. Simulation result of relative displacement (x-axis) between LQR and NSEF. (a) Control result in the time domain. (b) Comparison of Amplitude Spectral Density.

Figure 17. Simulation result of relative attitude (yaw angle) between LQR and NSEF. (a) Control result in time domain. (b) Comparison of Amplitude Spectral Density.

Table 1. RMS of two different control approaches.

Axis	ESKF+LQR	ESKF+NSEF
satellite attitude (pitch)	6.3076 × $10^{- 9}$ rad	7.4918 × $10^{- 9}$ rad
satellite attitude (yaw)	7.2104 × $10^{- 9}$ rad	5.9713 × $10^{- 9}$ rad
satellite attitude (roll)	7.4293 × $10^{- 9}$ rad	4.7966 × $10^{- 9}$ rad
relative displacement (x)	4.5918 × $10^{- 10}$ m	8.9468 × $10^{- 10}$ m
relative displacement (y)	4.5422 × $10^{- 10}$ m	9.3517 × $10^{- 10}$ m
relative displacement (z)	4.7442 × $10^{- 10}$ m	9.4479 × $10^{- 10}$ m
relative attitude (pitch)	1.0859 × $10^{- 10}$ rad	2.7201 × $10^{- 8}$ rad
relative attitude (yaw)	1.0480 × $10^{- 10}$ rad	3.1758 × $10^{- 8}$ rad
relative attitude (roll)	1.0299 × $10^{- 10}$ rad	3.2608 × $10^{- 8}$ rad

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Ye, X.; Lian, J.; Zhao, G.; Zhang, D. A Novel Closed-Loop Structure for Drag-Free Control Systems with ESKF and LQR. Sensors 2023, 23, 6766. https://doi.org/10.3390/s23156766

AMA Style

Ye X, Lian J, Zhao G, Zhang D. A Novel Closed-Loop Structure for Drag-Free Control Systems with ESKF and LQR. Sensors. 2023; 23(15):6766. https://doi.org/10.3390/s23156766

Chicago/Turabian Style

Ye, Xiaorong, Junxiang Lian, Guoying Zhao, and Dexuan Zhang. 2023. "A Novel Closed-Loop Structure for Drag-Free Control Systems with ESKF and LQR" Sensors 23, no. 15: 6766. https://doi.org/10.3390/s23156766

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Novel Closed-Loop Structure for Drag-Free Control Systems with ESKF and LQR

Abstract

1. Introduction

2. Dynamics Modeling

3. Extended State Kalman Filter Design

3.1. Extended State Kalman Filter

3.2. Extended State Design of Drag-Free Control System

4. Feedback Controller Design

5. Simulation Analysis

5.1. ESFK+LQR

5.1.1. Satellite Attitude

5.1.2. Test Mass and Satellite Relative Displacement

5.1.3. Relative Attitude between Test Mass and Satellite

5.2. Comparison Evaluation

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI