1. Introduction
Gravitational wave astronomy provides a new tool to explore black holes, dark matter, the early universe, and the evolution of the universe. To detect gravitational waves in space, a strategy involves deploying multiple satellites in mega-satellite formations to measure tiny changes in the relative distances between satellites when the gravitational wave goes through. However, the challenge lies in the weak characteristics (to the order of ) in the changes caused by gravitational waves, which can be easily influenced by extraneous perturbations and noise. To address this issue, the typical approach is to employ drag-free satellites: an inner test mass is shielded and free-falls along the geodesic of spacetime, and the outer satellite counteracts non-conservative forces and tracks the test mass in a sensitive axis. This setup creates an ultra-static and ultra-stable platform, with the test mass serving as an inertial reference for the measurement of relative distances in space. Using this approach, it is possible to obtain accurate and reliable measurements of distance variation. And the variation represents the effect of gravitational waves.
In the drag-free control loop, there are many factors that can impact the effect of the controller, such as external environmental disturbances, sensor measurement noise, process noise, and other inevitable disturbances and noises. Models of these disturbances and noises are hard to be built precisely, which makes it difficult to determine the appropriate models for compensation. To address this challenge, the Extended State Observer (ESO) method has emerged as an effective solution for modeling, estimating, and identifying disturbances in the drag-free control loop. Using the ESO method, it is possible to obtain estimates of the disturbances, which can help the design of the controller. This method has significant implications for the development of advanced control systems for drag-free satellites, thereby improving performance and reliability.
The Active Disturbance Rejection Control (ADRC) technique, proposed by Han [
1,
2], combines the “anti-disturbance” and “model independence” of PID control with the idea of the state observer. The Extended State Observer (ESO) is the core of ADRC, providing a way to estimate and compensate for disturbances and uncertainties. Huang [
3,
4] demonstrated the design method and proof of convergence for nonlinear ESO of second- and third-order systems, showing that it can achieve fast convergence without oscillation, even in the presence of model uncertainty and disturbances. However, the complexity of nonlinear ESO increases with the growth of the number of parameters, making tuning more challenging. Despite this drawback, the effectiveness of nonlinear ESO in mitigating disturbances and uncertainties makes it a promising technique for advanced control systems in various applications. Gao [
5] proposed a parameter design method for linearized ESOs based on bandwidth, which effectively reduces the design threshold and improves the convenience of the application. Yang [
6] analyzed the observation error of the ESO for different forms of disturbances and concludes that when the disturbance is bounded or its derivative is bounded, ESO can effectively estimate it, and the observation error is bounded. In addition, Jin [
7], Chen [
8], and Gan [
9] analyzed the stability of the ESO using different methods, and Shao [
10] analyzed high-order ESOs by adding higher-order derivative of disturbance as the extended state. Although increasing the extended order can effectively reduce the estimation error of each state, increasing the order and bandwidth simultaneously also affects the high-frequency noise suppression effect. Therefore, a trade-off between the expansion order and bandwidth is needed to balance the estimation accuracy and the high-frequency noise suppression effect. The Extended State Kalman filter (ESKF) proposed by Xue [
11] combines the advantages of both extended state observer and Kalman filter to filter the noise and estimate the system state and disturbance in dealing with nonlinear systems with strong nonlinearity, large initial estimation error, and severe noise. The Extended State Kalman filter provides a potential solution to the problem of disturbance identification for the drag-free control of gravitational wave detection satellites, when the conventional filtering methods are not sufficient to estimate the disturbance.
The Linear Quadratic Regulator (LQR) is a widely used engineering tool in the aerospace industry due to its ability to achieve optimal control under specific performance requirements. Its simplicity in design has made it a popular choice for various applications, such as quadrotors [
12,
13,
14,
15,
16], hypersonic vehicles [
17,
18], airborne remote gimbal [
19], and satellite formation problems [
20,
21] etc.
In the field of control engineering, the Linear Quadratic Regulator (LQR) is widely used for linear problems. To ensure the effectiveness of the controller, an accurate linear model must be established or a nonlinear model should be linearized prior to the LQR design. In cases where low control performance requirements are sufficient, typically disturbances and noise are not handled directly but are instead compensated for through control. In order to improve the robustness and disturbance resistance of the traditional LQR controller, Lu [
12] introduced the Extended State Observer (ESO) to estimate random low-frequency disturbances and the estimation of ESO is used by LQR. Attitude control of spacecraft with low precision requirements, disturbances, and noise are often not preprocessed and are only compensated for through LQR controllers. For systems with higher performance requirements, such as the six-degree-of-freedom attitude control system described in Ref. [
17], a combination of ESO and LQR is used to achieve higher control accuracy and stronger disturbance resistance compared to using LQR control alone. In Lin’s research [
19], a standard nonlinear ESO was employed in combination with LQR to estimate and compensate for multi-source perturbations, ultimately improving the control of LQR for uncertain systems. While these studies successfully combined an extended state observer with LQR control and achieved some improvement, they focused solely on the estimation and compensation of perturbations without considering the suppression of noise.
The accuracy required for drag-free and attitude control in gravitational wave detection is crucial, so the impact of noise must be considered. Previous methods are insufficient in dealing with the noise affecting gravitational wave detection satellites, and cannot estimate the perturbations effectively. Those methods also suffer from longer setting times. To design a successful control system, it is necessary to develop effective strategies to reduce the noise impact on control performance. In this study, we propose a novel approach that combines ESKF with LQR control. We use the state and disturbance estimated by ESKF as the input information for the controller, ensuring optimized control. Our analysis and simulations show that this new approach outperforms traditional solutions. It effectively shortens the adjustment time, reduces the number of oscillations, compensates for disturbances, and suppresses noise, ultimately achieving the desired design specifications.
The paper is organized as follows: in
Section 2, a dynamic model of a single test mass drag-free satellite is established. In
Section 3 and
Section 4, the design process and calculation methods of the ESKF and LQR are presented, respectively. In
Section 5, the performance of the system using the LQR controller and NSEF controller is compared through numerical simulations, indicating that the overall performance of the LQR controller is superior to that of the NSEF controller when using ESKF as the estimation method. The conclusion is given in
Section 6.
2. Dynamics Modeling
This paper takes a single test mass, a drag-free satellite in geocentric orbit as the research subject, as is shown in
Figure 1, where
C indicates the center of mass of an object,
h represents the sensitive cavity, which is fixed to the satellite, then the position vector
from the center of the sensitive cavity to the center of mass (CoM) of the satellite is constant, and the position vector of test mass relative to the satellite is
. The relative translation equations of motion in the inertial system are first transformed into the satellite body coordinate system, similarly the relative attitude equations of motion are projected into the TM body coordinate system, which is illustrated in
Figure 2. Then a comprehensive drag-free satellite dynamics model can be established as follows.
where
indicates input noise, here assume that all input forces and moments acting on the satellite, as well as the measurement output of the sensor, are subject to noise,
denotes the disturbance moment to the satellite,
and
denote the test mass and the disturbance force on the satellite, respectively, and
indicates the disturbance moment to the test mass. The
,
,
,
C,
D, and
G subscripts indicate the S/C, Test Mass, measurements RELated to the sensitive cavity, Control command, Disturbance, and Gravity. The
h superscript indicates the components in the sensitive cavity frame, and
is the coordinate transformation from the satellite frame to the test mass frame.
The dynamic model is expressed in the form of state space equations which are presented as follows:
where
where
,
,
,
are the coupling coefficient matrices for translation and rotation, respectively.
u is the system control variable,
w is input noise,
d is measurement noise, and
f represents the total perturbation affecting the system, including the known part and the unmodeled part.
The Gravitational Wave Detector-TianQin requires detection satellites in deep-space orbit. The main disturbance on the satellite comes from solar pressure. To ensure a steady power supply and minimize fluctuations in the satellite’s internal thermal environment for ultra-stability, the drag-free satellite uses a body-attached battery array.
The expression of this perturbation is shown below:
where
indicates surface reflection coefficient, normally 1–1.44,
indicates the solar pressure near the Earth,
,
is the surface-to-mass ratio of the spacecraft,
is the projected area of the spacecraft facing the sun,
is the unit vector indicating the direction from the center of the Earth to the sun. The sun exposure factor, denoted as
k, is assumed to be 1 for the light area and 0 for the ground shadow area. The amplitude spectral density of the solar pressure on the satellite is shown in
Figure 3.
In a drag-free satellite with a single test mass, the displacement measurements between the CoM of the TM and the CoM of the satellite, as well as the attitude measurements of the TM relative to the satellite, are obtained by an inertial sensor. The attitude of the satellite is determined through a star sensor, and the micro-propulsion provides the necessary control forces and moments to maintain the desired position and attitude of the satellite. At present, the typical micro-propulsion systems have a noise power spectral density of
under open-loop condition [
22]. Their corresponding power spectral densities are presented in
Figure 4,
Figure 5 and
Figure 6.
Based on the data of Ref. [
23], it can be inferred that electrostatic actuation noise can be treated as white noise in the frequency band needed for the detection of gravitational waves by the TianQin detector.
In the case of capacitive displacement sensors, the noise levels for displacement measurements are equal in the x, y, and z directions, while the noise levels for angle measurements in the
direction are one order of magnitude lower than those in the
and
directions [
23].
Figure 6 displays the measurement noise in each direction.
The spectral density curves of perturbation and noise are given in
Figure 3,
Figure 4,
Figure 5 and
Figure 6, which will be used as the basis for the modeling and simulation calculations later.
3. Extended State Kalman Filter Design
Achieving high accuracy of relative displacement and relative attitude control in a noisy and disturbed environment requires multiple steps, including disturbance estimation, noise suppression, and state control.
The Extended State Kalman Filter (ESKF) can estimate nonlinear uncertainty. In cases of initial error, uncertain dynamics (perturbation), and bounded noise, the perturbation is estimated and compensated for by the extended state, and noise effects can be suppressed. This paper uses ESKF to estimate disturbance forces and moments, such as solar pressure on the satellite and anomalous electromagnetic forces and moments on the test mass. First, we present the design scheme of ESKF, then apply it to uncertain disturbance estimation in drag-free control.
3.1. Extended State Kalman Filter
For the following discrete system containing uncertain perturbations
where
is the system state,
are system matrices,
is the measurement matrix,
is the nonlinear uncertain part in the system (
6),
are the process noise and measurement noise, respectively, and
is the system measurement output. Treating
as an additional state variable
, which is then estimated and compensated for. The extended system is described as
where
,
,
,
,
, assume
,
are unrelated zero-mean Gaussian random series and
,
,
,
is the estimation of
,
is the initial value of the nominal part of
,
is a known constant matrix, and
,
is bounded.
According to the classical state observer theory, the extended state observer for the extended state Equation (
7) is shown below, where
Based on the given initial estimation
and initial value of covariance matrix
, we obtain the middle value of the estimated quantity
and update the value of the covariance matrix
where
is used to decouple the cross terms of estimation error and uncertainty,
,
,
is the variance of
,
is the variance of
,
is the estimation of
, whose value is calculated by
, where
by calculating
, update the estimation
and the covariance matrix
After that, calculate the control variable
based on the error between the state estimate and the reference, then the calculated control variable
and the estimated value
of the disturbance are used to calculate the final control variable
u, and the flow chart is shown in
Figure 7.
3.2. Extended State Design of Drag-Free Control System
The complex space environment presents a challenge in accurately modeling and describing perturbations affecting on satellite and test masses. Conventional control methods are model-dependent, and their performance can be severely affected by the inaccuracies of the perturbation model. To address this issue,
Section 3.1 proposes a method to expand the perturbations into new states and create a new filter model. Then the perturbation can be compensated and the noise can be suppressed.
The uncertain disturbance term is . represents the disturbance moments that the satellite is subjected to, represents the disturbance forces that the satellite is subjected to, and represents the disturbance moment that TM is subjected to. Those are treated as extended states. The state vector is taken as , indicates related attitude and displacement that can be measured.
According to
Section 3.1, the extended state differential equation is obtained as
where
,
, symbols has the same physical meaning as in Equation (
4).
The extended state Equation (
15) is discretized to obtain the discrete difference equation model (
16)
where
.
The ESKF is designed according to the flowchart shown in
Figure 7 to estimate the kinematic state parameters and the unknown disturbances, while implementing feedback control. Where
,
,
i is the number of state vector.
4. Feedback Controller Design
Once the estimation of states and perturbations has been successfully obtained, the next step is the selection of an appropriate controller. The Linear Quadratic Regulator (LQR) is a control strategy designed to minimize a cost function. The optimal control law is obtained through the design of the state feedback controller, which allows for the completion of closed-loop optimal control in a fast, stable, and accurate manner.
The performance index for LQR control reflects the requirements for both state and control quantities, and the cost function used in this paper is
The weighting matrix Q is semi-positive definite and R is positive definite, which are set as a diagonal matrix in the subsequent simulation. For the first term in the cost function J, each component is required to be small in the control process. The larger the weight in Q means the stricter the constraint on the components; while the second term in the cost function indicates the requirement for the control output, which is weighted according to the different characteristics of each component.
The Ricatti equation is used to calculate P, then based on , the feedback gain matrix is calculated, the control law of LQR is chosen as .
In order to maximize the effective time of gravitational wave detection and minimize the output of thrusters (actuators), it is necessary to ensure that the detection satellite is maintained in a free and stable flight for as long a time as possible. To achieve this goal, the adjustment transition process of drag-free control needs to be as fast as possible, allowing the detection system to quickly reach an ultra-static and ultra-stable state. Our research shows that the LQR control strategy can satisfy the optimal control requirements, enabling a rapid transition to a steady state and achieving the required control accuracy. By appropriately designing the weights of each component, in addition to the values of Q and R, it is possible to achieve a more refined control.
Nonlinear error feedback is used in classical Active Disturbance Rejection Control [
1], mainly by rewriting the weighting of classical PID control into a nonlinear combination, as shown in Equation (
18)
It is a continuous power series function with linear segments near the origin, denotes the length of the interval of the linear segment, and e indicates the amount of error.
This function
is characterized by increasing the gain when the error is small and using a small gain when the error is large, which prevents high-frequency chattering due to excessive gain calculated when the error is small [
24,
25]. The control law for nonlinear error feedback is
where
. Parameter triples
determine the final control variable, the value of
is generally selected as the sampling time for discrete systems, the value of
satisfies
.
A simulation program using the combination of ESKF and LQR is developed, then the comparison results of the combination of ESKF+NSEF and the combination of ESKF+LQR is presented and analyzed in the next section.
5. Simulation Analysis
According to the discrete model (
16), a block diagram of the drag-free satellite control system is designed and presented in
Figure 8. The measurement mechanism provides information on the attitude angle of the satellite, the displacement, and the attitude of the test mass relative to the center of the cavity. The output command of actuators and output measurement information of the sensors are both inputs of ESKF. The ESKF estimates the attitude, angular velocity, displacement, velocity, disturbance forces, and disturbance moments. By selecting an appropriate controller with the ESKF, high-accuracy anti-disturbance control of the drag-free satellite/test mass dynamic system is achieved.
First, we consider the LQR controller. The ESKF filter design discussed in
Section 3 was simulated in MATLAB with a time step of 0.01 s. The ESKF consists of 27 states, which are divided into three groups of nine states each. After the ESKF output was stable and tracking accurately. The LQR control algorithm was then employed for designing the control law, and the corresponding simulation results were obtained.
Initial conditions of simulation:
The perturbation forces and moments of the satellite are modeled as constant superposition sinusoidal perturbations with phase differences in each axial direction. Specifically: , the disturbance moment to the satellite is modeled as , moment of disturbance to the test mass is , where . The simulation program will achieve real-time estimation of the extended state for the above perturbations, and the results of the error analysis of the estimation are given later. The expectation of the input noise of the thrusters providing force and moment to the satellite are and , respectively. The input noise expectation of the electrostatic actuator providing the test mass control torque is , the expectation of the measurement noise of the star sensor providing satellite attitude measurement is set as . And the expectation of displacement measurement noise of inertial sensor is , the expectation of attitude measurement noise is . To meet the requirements of gravitational wave detection, the control loop’s design objective is set as follows: the setting time should be less than 1 min, and the amplitude spectral density of the relative displacement between test mass and satellite should both be less than within the detection frequency band.
The performance of the combination of LQR and ESKF is evaluated initially. Subsequently, the performance of using the combination of LQR and ESKF is compared with the performance of the combination of NSEF and ESKF in terms of control accuracy and setting time. The intrinsic reasons for any differences observed are analyzed, and recommendations for engineering design are provided.
5.1. ESFK+LQR
By utilizing the ESKF-estimated states as input, we designed an LQR controller based on Equation (
17). The initial value of state estimation was
, and the values of
Q and
R were selected as
. It should be noted that a larger value of
Q can facilitate faster convergence of the states for the LQR controller.
5.1.1. Satellite Attitude
Figure 9 demonstrates the effectiveness of the ESKF and LQR controllers in controlling the attitude angle of the satellite. The results show that the attitude angle was successfully controlled from the initial
to
for all three attitude angles with a setting time of about 7 s, achieving the desired control target. The estimation error of the attitude angle was measured to be
, while the estimation error of the attitude angular velocity and disturbance moment were
and
, respectively.
The amplitude spectral density curves for each attitude angle control error are presented in
Figure 10. Based on the results shown in
Figure 10, it can be observed that within the measurement bandwidth, the amplitude spectral density of the satellite’s attitude control error conforms to the design requirements of
.
5.1.2. Test Mass and Satellite Relative Displacement
The results in
Figure 11 show that the relative displacement was successfully controlled from
to
for all three axes within an adjustment time of 20 s, achieving the desired control target. The position estimation error was measured to be
, while the velocity estimation error and disturbance estimation error were
and
, respectively.
The amplitude spectral density curves for each axial direction are presented in
Figure 12. Based on the observations from
Figure 12, it can be concluded that the kinematic indexes of the translation within the frequency band satisfy the design requirements and achieve
.
5.1.3. Relative Attitude between Test Mass and Satellite
Figure 13 shows the simulation results of ESKF and LQR controller dealing with the relative attitude between test mass and satellite. Based on the results from
Figure 13, it can be concluded that the ESKF and LQR controllers designed in this paper were capable of controlling the relative attitude between the test mass and the satellite from
to
for all three axes. The control result was found to be essentially oscillation-free, with an adjustment time of about 6s, achieving the desired control target. The estimation error of the attitude angle was measured to be
, while the estimation errors of the attitude angular velocity and disturbance moment were
and
, respectively.
The amplitude spectral density curves for each attitude angle are presented in
Figure 14. The amplitude spectral densities are lower than
in the whole frequency band.
5.2. Comparison Evaluation
To demonstrate the efficacy of the designed method, we present a simulation to compare ESKF+LQR with ESKF+NSEF. This allows us to illustrate the functions of our objectives and evaluate their performance against an established method. Simulation results is given in
Figure 15,
Figure 16 and
Figure 17Since the convergence facility of the
function used in NSEF is mainly based on the value of
, while the proof of certain physical relation is extremely complicated, the value of
is determined by rule of thumb. Based on the method in Ref. [
26], we take
in simulation. The simulation results of relative displacement of the x-axis are shown in
Figure 16, while the control accuracy is satisfied, and the convergence process of ESKF+LQR was found to be much faster than that of ESKF+NSEF.
The RMS of the two approaches is listed in
Table 1. Additionally, the LQR controller was observed to produce negligible oscillations in the system, which can be advantageous in terms of reducing energy consumption and extending the effective time of gravitational wave signal detection.
6. Conclusions
Integrating perturbations into states vector, the ESKF method demonstrates the capability to accurately estimate the state and disturbance of the drag-free satellite dynamics, and effectively suppressing handling noise. This approach lays the foundation for the controller to achieve accurate adjustment of the system state, and it is recommended to consider the ESKF method as a viable alternative for estimating uncertain disturbances in future drag-free engineering designs.
The LQR controller’s feedback parameter matrix can be rigorously derived using the generalized index to ensure optimal control performance. By combining the reasonable values of the state covariance matrix, the designed gain matrix guarantees the convergence speed of the system state, while the accurate estimation of ESKF enhances the relative attitude control accuracy by an order of magnitude.
For the relative displacement, the adjustment time of NSEF fails to reach the control task requirements. This is mainly due to the relatively slow control process of the relative kinematic parameters between the satellite and the test mass in NSEF, and the lack of a complete theoretical method to adjust and optimize the nonlinear error feedback parameters.
In summary, the ESKF+LQR control approach enables the system to reach a steady state rapidly and smoothly, thereby increasing the free flight time available for gravitational wave observation. This is more in line with the desired observation duration for gravitational wave detection. In comparison to the combination of ESKF+NSEF, the combination of ESKF+LQR is capable of adjusting to the reference target swiftly and reducing oscillations, leading to a reduction in thruster mass consumption.