1. Introduction
Since the late 19th and early 20th centuries, when space technology experienced rapid advancement, human exploration of space has been ceaseless. Over more than a century of progress, space-based optoelectronic systems have evolved into crucial components of modern defense systems. They play a crucial role in predicting near-Earth object threats, disaster forecasting, and post-disaster search and rescue activities [
1,
2]. Additionally, space-based optoelectronic detection systems are a key method for tracking deep space targets, with major advantages including long-range detection, wide coverage, high measurement accuracy, and strong stealth capabilities [
3,
4].
Space-based multi-target tracking is an important research area in the application of space-based detection, playing a crucial role in space surveillance, early warning, and spacecraft safety [
5,
6]. The term space target refers to all outer space objects, including nonfunctional space craft, spent upper stages, and space debris. The escalating problem of space debris poses a significant threat to the safe operation of spacecraft in orbit. Consequently, accurate tracking of debris is of paramount importance for early warning and mitigation measures [
7,
8]. There are many challenges in deep space target tracking based on space-based platforms, including the following: (1) The imaging distance is far, and targets often appear as weak points, lines, or jagged shapes in the image that lack obvious contours, texture, and other features [
9]. (2) In scenarios where stars are densely packed, it can be challenging to distinguish real targets from stars, as they often appear only as one or two pixels. This can lead to point targets being easily confused and misidentified. (3) The stability of tracking algorithms is compromised due to interference from atmospheric conditions and various intertwined noises. Therefore, how to accurately track multiple targets under the premise of overcoming the influence of these factors is a key problem to be solved in the field of space-based optoelectronic tracking [
10].
In the field of multi-target tracking, Goodman et al. [
11] proposed a mathematical framework based on random finite set theory to address multi-target tracking problems. This framework effectively deals with the randomness, uncertainty, and correlation of targets. Mahler proposed the probability hypothesis density (PHD) [
12] method based on the theory of random finite sets. This method estimates the target state by recursively updating the posterior probability of the target within the Bayesian framework. The PHD filter is particularly effective in addressing complex challenges such as unknown target quantities, dynamic target variations, and sensor inaccuracies, making it widely applicable. However, the computational complexity of the PHD algorithm arises from the use of multidimensional integrals. To tackle this issue, Vo et al., introduced the Gaussian mixture probability hypothesis density (GM-PHD) [
13] and sequential Monte Carlo probability hypothesis density (SMC-PHD) [
14] methods. Zeyu Fu proposed a multi-level cooperative fusion approach to address the online multiple human tracking problem in a GM-PHD filter framework [
15]. Xiaolong Zhou developed a multi-target visual tracking system that combines the GM-PHD filter with object detection; the experimental results illustrate the significant improvement of the algorithm tracking performance when combined with GM-PHD [
16]. Gonzales proposed a multi-target tracking algorithm based on the GM-PHD filter and the unscented Kalman filter (UKF); the experimental results show that the proposed algorithm achieves appreciable tracking performance while requiring fewer assumptions and parameters [
17]. These advancements have offered new perspectives and techniques for resolving multi-target tracking issues [
18,
19]. In the field of space-based background, the authors of [
20] proposed an improved PHD filter that incorporates Doppler information. This method applies PHD filtering to space-based radar systems by combining radar cross-section (RCS) distribution models and Doppler likelihood models into the update step of PHD filtering. This approach addresses issues caused by inaccurate dynamic RCS distributions when tracking space targets using a space-based radar. In the field of optoelectronic systems, the authors of [
21] introduced the application of a CPHD filter to estimate the multi-target state of both known and unknown targets. This approach emphasizes the efficient identification and tracking of these targets in complex environments through an improved birth model. In contrast to the aforementioned methods, this paper presents an enhanced algorithm based on spatio-temporal pipeline filtering, which further integrates classical GM-PHD filtering techniques. The focus is on mitigating the tracking bias associated with conventional GM-PHD due to stellar interference and other factors specific to space-based environments.
In deep space backgrounds, an abundance of densely clustered stars exists, and these stars do not show discernible differences from targets in sensor data. Consequently, employing the classic GM-PHD method may result in a struggle to effectively differentiate between background stars and targets, potentially resulting in deviations in target tracking. To address the above problems, this paper considers improving the traditional GM-PHD filtering method by employing the spatio-temporal pipeline method [
22,
23]. The spatio-temporal pipeline filtering method integrates spatial and temporal features by leveraging the target’s motion trajectory continuity against a stochastic noise distribution [
24]. This approach effectively removes data noise, smooths the dataset, and extracts target characteristics. Its key strengths lie in its real-time processing capability, ease of implementation, and efficient feature extraction. With extensive research by numerous scholars, the spatio-temporal pipeline filtering method has been widely applied in various fields. For instance, the authors of [
25] applied this method in the field of video surveillance, reducing the transmission of large data streams while enabling more precise data analysis. The authors of [
26] improved the traditional spatio-temporal pipeline filtering method by fully utilizing the spatio-temporal correlation between targets and local backgrounds, proposing a highly accurate and real-time infrared weak-target tracking method. The authors of [
27] utilized the continuity of target trajectories in adjacent frames to estimate target motion direction, employing pipeline filtering to suppress noise interference from non-target motion directions outside the pipeline. These studies have enriched the application scenarios of the spatio-temporal pipeline filtering method and provided valuable references for its further development. The traditional spatio-temporal pipeline filtering algorithm requires that the motion characteristics of the target satisfy trajectory continuity and smoothness, as well as small variations in target speed. Therefore, it has the following shortcomings [
28,
29]: (1) When there are continuous strong noise points in a fixed position area of the pipeline, the algorithm cannot distinguish the difference between the target and the noise, resulting in the noise being treated as a target point and affecting the accuracy of subsequent target tracking. (2) When the target is occluded or submerged, the algorithm has no information compensation capability, resulting in tracking failure. (3) When the target speed is fast, the pipeline cannot capture the target position, leading to target tracking failure. The main issue with traditional algorithms is their sole reliance on counting the occurrences of targets and noise within the pipeline without fully utilizing the characteristics of target motion and noise distribution. Consequently, when the target energy is weak or temporarily lost, the algorithm becomes ineffective. To address these limitations, this paper proposes an adaptive spatio-temporal pipeline filtering algorithm, taking into account the distinguishing features between targets and noise in the context of celestial map backgrounds. This approach aims to enhance the performance of traditional spatio-temporal pipeline filtering methods. Firstly, the recursive expression in the prediction and updating steps of the GM-PHD algorithm is used to predict the target motion information. Secondly, the center position of the pipeline is adaptively adjusted based on prediction and updating steps. Finally, the target information is extracted by combining the difference in the trajectories between the star and the target. Compared with the existing literature, this paper addresses the multi-target tracking problem in a space-based context, combining the advantages of GM-PHD in terms of target number and state estimation and introducing spatio-temporal pipeline filtering into the GM-PHD multi-target tracking algorithm, with the main contributions including the following: (1) Distinguishing between stars and targets. In a space-based background, if a target is gradually approaching a star, the prediction weight of the star may exceed the pruning threshold, causing the algorithm to misclassify the star as the real target. To address this problem, this paper introduces trajectory similarity into spatio-temporal pipeline filtering, so that if a star appears in the pipeline, the target and the star can be distinguished by the results of the trajectory similarity function calculation, thus improving the target tracking accuracy. (2) Suppressing clutter and noise. Since the spatio-temporal pipeline filtering does not only rely on a single observation but considers the entire spatio-temporal trajectory of the target, the spatio-temporal pipeline filtering improves the accuracy and robustness of the measurements in the presence of observation noise, transient noise, or when there is a loss of observations, which helps to improve the estimation of the target state by the GM-PHD filter, thus improving the tracking performance. (3) Improved target estimation accuracy. Spatio-temporal pipeline filtering builds a more accurate target state prediction model by constructing a target state pipeline using the time series and correlating multi-frame data, which helps the GM-PHD algorithm to maintain the ability to track the target stably in complex environments and achieve more effective target state estimation. (4) Reduced algorithm complexity. Spatio-temporal pipeline filtering only needs to process the data within the pipeline, not all the data in the whole measurement space. This reduces the computational overhead of the GM-PHD filter, thus making it suitable for environments with limited space-based computational resources.
Section 2 delineates the recursive steps of GM-PHD filtering and presents the overarching framework of the algorithmic flow in this paper. In
Section 3, the adaptive adjustment of the pipeline center position is integrated with the characteristics of GM-PHD, leading to an enhanced spatio-temporal pipeline filtering method that incorporates the discrepancy between stars and targets. Finally,
Section 4 validates the proposed algorithms through simulation experiments and real-world data analysis.
2. GM-PHD Multi-Target Tracking
In this study, the GM-PHD filtering algorithm served as the core framework, with the improved spatio-temporal pipeline filtering method integrated into the GM-PHD filtering algorithm to enhance its performance in tracking targets in the space-based observation mode. The GM-PHD filtering algorithm models the multi-target probability hypothesis density using a Gaussian mixture and iteratively updates the individual Gaussian components within the PHD through mean, weight, and covariance initialization, prediction, and update, thereby estimating the target state. As depicted in
Figure 1, the GM-PHD method accounts for various scenarios, such as target state transitions, target births, target derivations, and target miss detections, enabling a detailed description of target motion states to achieve multi-target tracking in diverse environments.
The Gaussian mixture form in the GM-PHD algorithm is a linear cumulative representation of multiple weighted Gaussian components used to represent a probability distribution. Each Gaussian component is described by a mean and a covariance matrix. Specifically, the linear cumulative representation of the multi-objective PHD can be expressed as follows:
where
is the probability density function of the multi-objective PHD at state
x and time
k,
denotes the number of Gaussian components,
is the Gaussian component mean,
is the Gaussian component weight value, and
is the covariance matrix. The GM-PHD condensed representation takes the following form:
The GM-PHD filtering algorithm for target state estimation consists of five main steps:
The probability hypothesis density for multiple targets at the initial moment 0 is
The density of multi-target probability assumptions at moment
k − 1 is expressed as follows:
The expression for the predicted multi-objective PHD is as follows:
where
is the PHD of the nascent target at moment
,
is the predicted PHD of the surviving target, and
is the predicted PHD of the derived target.
The Gaussian mixture of
is of the form
With a given multi-objective PHD at moment , the weights, means, and covariance matrices of each of the predicted PHD can be calculated.
The updated multi-objective PHD expression is as follows:
where
is the posterior density, which represents the probability density of the target state at time
k after the updating step;
denotes the probability of target detection at time
k;
is the prior density, which represents the probability density of the predicted state at time
k;
denotes the set of observations at time
k; and
denotes the probability density of the target state after updating using the observation data
z.
where
responds to the probability of observing
z in state
x. A higher
increases the weight of the observations in the state update, making the algorithm more reliant on the observations for state estimation, and conversely, a lower
decreases the weight of the observations, making the algorithm more reliant on a priori prediction.
denotes the clutter density function.
denotes the probability density of all possible states at time
k corrected by the detection and observation models, which are used to ensure that the probability densities sum to 1. The role of the
is to normalize the probability densities of the observations after updating and to ensure the stability and correctness of the updating process.
where
is the observation matrix and
is the observation noise covariance matrix.
The Gaussian mixture of
takes the following form:
At a given time, the updating step computes the mean, weight, and covariance matrix of the updated multi-object PHD using a closed-form solution based on the predicted multi-object PHD at that time and the new observation set.
As the iterations of multi-object PHD prediction and updating progress, the number of Gaussian components continues to increase, resulting in a growing recursive operation for the Gaussian mixture PHD. The number of Gaussian components in the updated PHD
for the Gaussian mixture filter at moment
k is as follows:
where
is the number of observations at moment
k,
is the number of Gaussian components of the multi-target PHD
at moment
, and
and
stand for the number of Gaussian components of the derived and de novo target PHDs at moment
k, respectively. This shows that the number of Gaussian components grows uncapped over time.
To ensure the precision of GM-PHD filtering while reducing computational resources, a pruning strategy is employed to address the issue of increasing Gaussian component numbers. Typically, the GM-PHD filter prunes Gaussian components with weights lower than a predefined threshold, retaining only those with relatively higher weights. Pruning serves to control the number of Gaussian components after each iteration update and reduces the interference of low-weight Gaussian components on state estimation.
In space-based multi-object tracking, the dense concentration of stars can cause multiple Gaussian components to be closely positioned in space, and targets may not be distinguishable from stars. This can lead to the misclassification of stars as targets by traditional GM-PHD filters. Therefore, traditional methods often fail to prune the Gaussian components corresponding to stars, resulting in biased target estimation. To address this issue, this paper proposes a spatio-temporal pipeline filtering method that effectively distinguishes between targets and stars, allowing for the extraction of accurate target information. The specific implementation details will be discussed in
Section 3.
After processing through the spatio-temporal pipeline filtering, the resulting pruned and merged multi-object PHD allows for the estimation of the multi-object states from the weight and mean information of the Gaussian components. The estimated number of targets can be determined using Equation (9).
Based on the output of the PHD filter, which is the processed, pruned, and merged multi-target PHD, target states can be estimated from the weight
and mean
information of the Gaussian components
. Specifically, for Gaussian components with weights exceeding a predetermined threshold
, their means are considered as estimates of the target states. Additionally, for Gaussian components meeting the condition
, their means
are repeatedly counted
times to provide estimation results for the states. Through this step, estimations of the target states and the number of targets can be obtained.
The overall flow of the algorithm proposed in this paper is shown in
Figure 2.
As illustrated in
Figure 2, the core framework of the multi-target tracking algorithm presented in this paper is based on the GM-PHD filtering method and introduces significant enhancements. The improvements are primarily focused on two aspects:
Adaptive spatio-temporal pipeline filtering method: After obtaining the predicted update state of the target using the GM-PHD algorithm, the velocity and position information of the target is determined. This information is utilized to initialize or update the state of the spatio-temporal pipeline filter, thereby capturing the target’s spatial position changes and dynamic information over time. This approach leverages the strengths of both GM-PHD and spatio-temporal pipeline filtering, enabling the algorithm to adapt to dynamic target changes while effectively mitigating noise interference, thus enhancing the continuity and robustness of target tracking.
Trajectory similarity method: In star-dense scenes, the traditional GM-PHD filtering algorithm occasionally misjudges stars as real targets due to their weights exceeding the pruning threshold, resulting in tracking deviations. In this study, a trajectory similarity method is employed to distinguish between stars and targets. This method combines the advantages of continuous detection and multi-frame verification inherent in spatio-temporal pipeline filtering. By analyzing the differences in the motion characteristics of targets and stars, stellar weights are pruned effectively, thereby mitigating stellar interference and improving tracking accuracy.
3. Target State Estimation Based on Spatio-Temporal Pipeline Filtering
This paper addresses the problem of target tracking in a space-based environment, particularly in the presence of a large number of stars and noise in the sky background. The traditional GM-PHD algorithm often fails to effectively distinguish targets from background stars when dealing with multiple point targets in the space-based background. To address this issue, this paper proposes a spatio-temporal pipeline filtering method. This method establishes a circular pipeline region around the candidate targets and only considers the pixels within the pipeline when processing the sequence images, instead of processing the entire image as in the traditional GM-PHD algorithm. This approach effectively reduces the interference of background stars and also reduces the computational complexity of the algorithm.
The principle of spatio-temporal pipeline filtering is depicted in
Figure 3. The pipeline length comprises L frames of images, and the pipeline window represents a neighborhood centered around the candidate target, which is typically set to be circular. Following the first-in-first-out principle, images entering the pipeline’s top are prioritized for output. The primary rationale for employing this method lies in the clear distributional disparities between targets and noise. A target’s motion trajectory is continuous, and with a judiciously set sampling frame rate and pipeline size, the target’s position in the video sequence image remains relatively stable. Conversely, noise in the video image undergoes random fluctuations and is independent of the target’s trajectory. Hence, the established pipeline enables us to monitor the occurrences of each pixel within the sequence image. Through statistical analysis, points exceeding the predetermined occurrence threshold within the pipeline are identified as candidate targets and retained, while those falling below the threshold are classified as noise and removed accordingly.
The three main parameters of the pipeline filtering algorithm are length, radius, and center position. The pipeline length indicates the number of frames required for detection, which is determined by the range of the target
s. If
s varies in the interval [0, L], the length of the pipeline is L. The pipeline radius indicates the size of the pixel range containing the target field, which is usually set as a constant. The position of the center of the pipeline is determined by the center coordinates of the spatial target in the current image:
where
denotes the center position of the pipe in the
kth-frame image target, and
and
denote the coordinates of the pipe in two dimensions.
3.1. Pipe Position Adaptation
The classical pipeline filtering method typically sets a fixed pipeline radius, denoted as r, which determines the neighborhood range of the target. If the value of r is too small, the pipeline cannot adapt to the target’s motion range, leading to the target appearing outside the pipeline. If r is too large, it results in more noise and interference within the pipeline. Therefore, under the condition of a pipeline radius r, the accuracy of the pipeline center directly determines whether the target can be successfully detected.
In response to the aforementioned issues, this paper proposes improvements to the classical pipeline filtering algorithm. Since the target’s motion trend is continuously smooth, it is possible to combine target state and measurement information to predict the target’s position and velocity, thereby adaptively adjusting the pipeline center and radius. This approach allows better adaptation to the target’s motion range and improves the accuracy of the pipeline center, thereby enhancing the effectiveness of target detection and tracking.
The target state includes the target position as well as target velocity information and obeys a Gaussian linear model:
where
denotes the state of the target at moment
k;
and
denote the position and velocity information of the target, respectively;
denotes the process noise; and
t is the sampling period, set to 1 s.
The linear quantitative model is as follows:
where
is the set of target position observations at moment
k, and
denotes the measurement noise. The measurement noise is Gaussian white noise,
, and the variance
.
By substituting the target state equation and measurement equation into the prediction and update equations of GM-PHD, the position and velocity of the target can be obtained. Adaptive adjustments to the pipeline center are made based on the predicted target position information. The formula is as follows:
where
and
denote the coordinates of the center of the pipe in the
x-axis and
y-axis of the
kth frame, respectively;
and
denote the coordinates of the position of the center of the pipe in the
k − 1 frame, respectively; and
and
denote the error correction terms.
where
denotes the predicted coordinates of the
kth-frame target in the
x-axis, and
denotes the predicted coordinates of the (
k-1)th-frame target in the
x-axis.
denotes the predicted coordinates of the
kth-frame target in the
y-axis, and
denotes the predicted coordinates of the (
k − 1)th-frame target in the
y-axis.
To obtain the error correction terms and in Equations (17) and (18), we follow these steps:
We start by determining the target’s velocity and position at moment using the Gaussian linear model as described by Equation (14), .
The term
, as represented in Equation (6), describes the linear state transition model for the evolution of the target state between moments
and
. The expression is given by
where
denotes the state transition matrix,
denotes the target position and velocity values for the
ith target at moment
,
denotes the control input matrix, and
denotes the control input. This model provides the predicted values of the target position and velocity at moment
,
.
The value of the error correction terms and can be obtained from steps 1 and 2. However, the exact values are determined by the difference in the target’s position coordinates between the two frames. The spatio-temporal pipeline filtering center position coordinates , , , and can be obtained from the GM-PHD filter update model described in Equation (10).
The error correction term ensures adaptive adjustment of the pipeline center position. Since the target is in continuous motion, the difference in the target’s center coordinates between two frames is determined by the target’s velocity. Therefore, the error correction term is set as the product of the target velocity, and the sampling period is set as the threshold. If the inter-frame pipeline position difference is less than the error correction threshold, residual compensation is applied to the pipeline center position. If the inter-frame position difference exceeds the error correction threshold, indicating that the position here deviates from the pipeline center due to strong noise interference, the error correction term will remove the current frame position coordinates to avoid pipeline center deviation. Instead, it retains the position coordinates from the previous frame.
3.2. Trajectory Similarity
Although spatio-temporal pipeline filtering methods can effectively distinguish noise from targets, there are challenges when applied to celestial maps due to the presence of a large number of stars. On the one hand, an excessive number of stars on the map leads to the generation of numerous redundant pipelines, thereby wasting computational resources. On the other hand, when multiple stars appear within a pipeline, the algorithm struggles to effectively differentiate between stars and targets. Hence, there is a need to enhance traditional spatio-temporal pipeline filtering methods by incorporating the motion characteristics of celestial objects and background stars, thereby devising an algorithm suitable for celestial systems.
In the star tracking model, as depicted in
Figure 4, within each circular pipeline frame, although background stars exhibit random grayscale variations, their overall positional changes across successive frames are not pronounced. Conversely, spatial targets demonstrate distinct motion characteristics, leading to noticeable differences in their positions across frames. Consequently, accurately delineating the characteristic disparities between stars and targets to classify them effectively is pivotal for refining the algorithm.
To address the issue of stellar interference in celestial backgrounds, we employ a trajectory similarity metric [
30,
31,
32] to distinguish between stars and targets. Trajectory similarity refers to the degree of similarity between two or more trajectories in space or time. From
Figure 4, it can be observed that as the number of detection frames increases, the distance between stars remains relatively stable, with the white connecting lines between stars hardly changing, indicating strong trajectory similarity. In contrast, due to their motion characteristics, targets exhibit continuous changes in their distances to stars, resulting in the lengths of the blue connecting lines between targets and stars constantly changing. Therefore, we can differentiate between stars and targets based on the differences in their distances. The following describes the steps involved: