Article

A Novel Real-Time Autonomous Localization Algorithm Based on Weighted Loosely Coupled Visual–Inertial Data of the Velocity Layer

1 State Key Laboratory of Explosion Science and Safety Protection, Beijing Institute of Technology, Beijing 100081, China
2 Beijing Institute of Technology Chongqing Innovation Center, Chongqing 401100, China
3 School of Information Science and Engineering, Chongqing Jiaotong University, Chongqing 401100, China
* Authors to whom correspondence should be addressed.
Appl. Sci. 2025, 15(2), 989; https://doi.org/10.3390/app15020989
Submission received: 10 December 2024 / Revised: 14 January 2025 / Accepted: 17 January 2025 / Published: 20 January 2025
(This article belongs to the Section Robotics and Automation)

Abstract

IMUs (inertial measurement units) and cameras are widely utilized and combined to autonomously measure the motion states of mobile robots. This paper presents a loosely coupled algorithm for autonomous localization, the ICEKF (IMU-aided camera extended Kalman filter), for the weighted data fusion of the IMU and visual measurement. The algorithm fuses motion information on the velocity layer, thereby mitigating the excessive accumulation of IMU errors caused by direct subtraction on the positional layer after quadratic integration. Furthermore, by incorporating a weighting mechanism, the algorithm allows for a flexible adjustment of the emphasis placed on IMU data versus visual information, which augments the robustness and adaptability of autonomous motion estimation for robots. The simulation and dataset experiments demonstrate that the ICEKF can provide reliable estimates for robot motion trajectories.

1. Introduction

1.1. Motivation

As the demand for autonomous mobility of robots continues to increase, localization technology continues to evolve. When robots perform motion tasks in environments such as extraterrestrial planets, rugged ground terrains, aerial spaces, or maritime environments [1,2,3], they face various challenges, including unstable satellite signals, impacts and vibrations acting on sensors, limited visual features, and unpredictable lighting conditions. Furthermore, autonomous localization is a prerequisite for aerial [4], ground [5,6], or maritime [3,7] autonomous robots to accomplish challenging local planning tasks for collision-free navigation.
Integrated visual–inertial sensors do not require active signal emission or depend on external preset references, which makes them well suited to data fusion for autonomous odometry. Two primary strategies exist for this fusion: loosely coupled methods and tightly coupled methods [8].
Loosely coupled methods are computationally efficient and maintain a higher update rate than tightly coupled methods, thereby enabling low-cost and compact implementations. This approach also allows the simultaneous integration of localization information from multiple sources, such as visual measurement, GNSS (global navigation satellite system), and LIDAR (light detection and ranging) [9,10]. Especially in multiagent applications [11], loosely coupled methods are naturally suited to processing simultaneous multi-source information. Although their estimation accuracy is relatively low owing to the lack of the optimization process that tightly coupled methods [12,13] usually perform, integrating multiple measurements into a loosely coupled framework is indispensable for addressing environmental uncertainty in practical use. Therefore, a loosely coupled data fusion approach that can adjust the weights of the measurements from individual sensors or agents is necessary to achieve a balanced output according to how much each information source can be trusted.

1.2. Related Work

Loosely coupled visual–inertial odometry methods are usually constructed based on filter frameworks such as the EKF [14,15]. High-confidence visual keyframe measurements are then periodically utilized to correct the high-frequency integration results from IMUs. This basic structure effectively balances the updating rate and accuracy.
Kelly and Sukhatme [16] proposed a data fusion algorithm for the self-calibration of a monocular visual–inertial system with proven observability. With a similar filtering structure, Weiss and Siegwart [17] incorporated the drift of the visual world coordinate system, as well as the spatial relationship between the IMU and the monocular camera, into the EKF framework. This approach enables failure detection and scale-drift estimation by fusing measurement data on the position and attitude layers.
Achtelik and Weiss [18] constructed a filter-based framework to recover the relative configuration of two drones carrying out IMU and monocular visual measurements. To address error amplification during the error propagation process in EKFs due to possibly inaccurate state modeling, Brossard et al. [19] proposed the visual–inertial invariant EKF algorithm, which is based on Lie algebra rules. Furthermore, researchers have designed visual–inertial odometry that embraces the characteristics of novel filters such as the MEKF [20] and the equivariant filter [21,22] to broaden the application of filter-based techniques in motion estimation.
The aforementioned loosely coupled strategies demonstrate remarkable performance while focusing mainly on fusion at the position layer. However, since the high-frequency updates and intrinsic drift of IMUs inevitably result in fast error accumulation, the correction on the position and attitude layers becomes suboptimal when the visual measurement update frequency drops or its accuracy deteriorates. Furthermore, these methods cannot adjust the emphasis placed on either source when encountering external visual interference.

1.3. Our Approach

This paper presents a loosely coupled algorithm for autonomous localization with weighted data fusion on the linear velocity and angular velocity layers, namely the ICEKF. The weight of the IMU measurement can be adjusted according to whether the external conditions are favorable for visual measurement, thus increasing the overall accuracy of autonomous localization. The real-time performance is promising since the time complexity is O(n). The local weak observability of the ICEKF is also demonstrated.
The organization of this paper is as follows: the state vector of the ICEKF is established in the second section, the design of the propagation and update of the filter is discussed in the third section, the observability analysis is presented in the fourth section, and the fifth section carries out numerical studies of simulations and dataset experiments. Finally, a summary is provided.

2. Design of the State Vector

This section discusses the construction of the state vector and analyzes the weighted coupling process on the velocity layer. The form of the ICEKF state vector is then derived.

2.1. Definition of Variables in the ICEKF

The descriptions of the main coordinate frames and variables used in this study are listed in Table 1. The relationships between the coordinate frames of the IMU-aided camera integration system are established, as shown in Figure 1, where $p_c^i$ and $\bar{q}_c^i$ represent the relative linear translation and rotation between the IMU and the camera, which are considered constant once calibrated. The IMU outputs the linear acceleration $a_m^i$ and the angular velocity $\omega_m^i$ in its rigid-body coordinate frame. This paper considers a visual black-box measurement that outputs the rotation $\bar{q}_w^c$ and the unscaled linear translation $p_w^c$ in the world frame. The fusion result is attached to the IMU coordinate frame, with the coupled linear translation and rotation denoted as $p_w^{ic}$ and $\bar{q}_w^{ic}$, respectively; their derivatives are $v_w^{ic}$ and $\dot{\bar{q}}_w^{ic}$. SI units are used throughout this paper, such as acceleration in m/s² and angular velocity in rad/s.

2.2. Construction of the State Vector

Given the necessity of information fusion on the velocity layer, together with the corresponding first-order derivatives and integrals, the state of the ICEKF is defined as a column vector consisting of 30 elements, as follows:
$X = \left[\, (p_w^{ic})^T \;\; (v_w^c)^T \;\; (v_w^i)^T \;\; (\bar{q}_w^{ic})^T \;\; (\bar{q}_w^i)^T \;\; (\omega_c^c)^T \;\; (\omega_i^i)^T \;\; (b_a^i)^T \;\; (b_\omega^i)^T \;\; \lambda \,\right]^T, \quad (1)$
where $v_w^c$ is the linear velocity of the visual measurement derived from $p_w^c$; $\omega_c^c$ is the equivalent body angular velocity of the camera measured in the IMU frame; $\bar{q}_w^i$ and $\omega_i^i$ are the rotation and the body angular velocity of the IMU, respectively; $b_a^i$ and $b_\omega^i$ are the biases of $a_m^i$ and $\omega_m^i$, respectively; $\lambda$ is the scale coefficient of the monocular visual translation. The remaining variables are described in the previous subsection. Let $a_w^i$ be the linear acceleration of the IMU measured in the world frame, where $a_w^i = R_w^i\left(a_m^i - b_a^i - n_{a_m^i}\right)$. Additionally, $\omega_i^i = \omega_m^i - b_\omega^i - n_{\omega_m^i}$.
The relationships among the variables in the ICEKF state are shown in Figure 2.
When the visual measurement is treated as a black box, $v_w^c = \dot{p}_w^c$. To obtain $\omega_c^c$, $\bar{q}_w^c$ should first be transformed into $\bar{q}_w^{c\prime}$, the visual rotation described in the IMU frame. Then, $\omega_c^c$ can be obtained from the derivatives of the Euler angles recovered from $\bar{q}_w^{c\prime}$. The coupling coefficients of the linear and angular velocities, $\mu_v, \mu_\omega \in [0, 1]$, are independent of the state update. This design guarantees that the coupling weights are always adjustable, which increases the accuracy of motion estimation without impacting observability.
To focus on the filter construction in this paper, the following is assumed:
1. The drift of the visual world frame in the visual black-box measurement is negligible due to its slow variation.
2. The outcomes acquired through first-order differentiation of the measurement results, including $\omega_c^c$ and $v_w^c$, are modeled as random walks.

2.3. Coupling Process

The coupling process of the linear velocities of the IMU-camera system requires a weighted summation as follows:
$v_w^{ic} = \mu_v\, v_w^i + \lambda\,(1-\mu_v)\, R_c^i\, v_w^c, \quad (2)$
The angular velocity, which covaries with the rotation of the coordinate frames, cannot be directly summed. According to Appendix A and the lemmas proposed in Appendix B, with $\mu_\omega \in [0, 1]$, the following equations hold:
$\frac{d}{dt}\left(\bar{q}_w^i\right)^{\mu_\omega} = \frac{1}{2}\left(\bar{q}_w^i\right)^{\mu_\omega} \otimes \mu_\omega\,\bar{\omega}_i^i, \quad (3)$
$\frac{d}{dt}\left(\bar{q}_w^{c\prime}\right)^{1-\mu_\omega} = \frac{1}{2}\left(\bar{q}_w^{c\prime}\right)^{1-\mu_\omega} \otimes (1-\mu_\omega)\,\bar{\omega}_c^c, \quad (4)$
where $\bar{q}_w^{c\prime}$ is the equivalent rotation of the visual measurement described in the IMU frame.
After further derivation according to Appendix A, the derivative of $\bar{q}_w^{ic}$ is written as follows:
$\dot{\bar{q}}_w^{ic} = \frac{d}{dt}\!\left[(\bar{q}_w^i)^{\mu_\omega}\right] \otimes (\bar{q}_w^{c\prime})^{1-\mu_\omega} + (\bar{q}_w^i)^{\mu_\omega} \otimes \frac{d}{dt}\!\left[(\bar{q}_w^{c\prime})^{1-\mu_\omega}\right] = \frac{\mu_\omega}{2}\,(\bar{q}_w^i)^{\mu_\omega} \otimes \bar{\omega}_i^i \otimes (\bar{q}_w^{c\prime})^{1-\mu_\omega} + \frac{1-\mu_\omega}{2}\,\bar{q}_w^{ic} \otimes \bar{\omega}_c^c, \quad (5)$
where $\bar{\omega} = [0, \omega^T]^T$ is the pure-quaternion form of the body angular velocity $\omega$.
Angular velocity fusion according to Equation (5) is performed under the assumption that $\bar{q}_w^{ic} = \bar{q}_w^{c\prime} = \bar{q}_w^i$, all attached to the IMU frame. Thus, potential errors can be caused by the assembly and measurement noise of a real IMU-camera integrated rig. However, after careful calibration and the filtering process, the influence of this possible inconsistency is negligible.
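For concreteness, the following is a minimal numerical sketch of the velocity-layer coupling in Equations (2) and (5), written in Python with NumPy. The function names and the axis-angle implementation of the fractional quaternion power are illustrative assumptions, not the exact implementation used in this paper.

```python
import numpy as np

def quat_mult(q, p):
    """Hamilton product of two quaternions stored as [q0, qx, qy, qz]."""
    q0, qv = q[0], q[1:]
    p0, pv = p[0], p[1:]
    return np.concatenate(([q0 * p0 - qv @ pv],
                           q0 * pv + p0 * qv + np.cross(qv, pv)))

def quat_pow(q, mu):
    """mu-th power of a unit quaternion: scale the rotation angle by mu."""
    q = q / np.linalg.norm(q)
    angle = 2.0 * np.arccos(np.clip(q[0], -1.0, 1.0))
    if angle < 1e-12:                      # identity rotation
        return np.array([1.0, 0.0, 0.0, 0.0])
    axis = q[1:] / np.sin(angle / 2.0)
    half = mu * angle / 2.0
    return np.concatenate(([np.cos(half)], axis * np.sin(half)))

def couple_velocity(v_i, v_c, R_ci, lam, mu_v):
    """Weighted linear-velocity fusion, Equation (2)."""
    return mu_v * v_i + lam * (1.0 - mu_v) * R_ci @ v_c

def couple_attitude(q_i, q_c_imu, mu_w):
    """Weighted attitude fusion q_ic = q_i^mu ⊗ q_c'^(1-mu) (Lemma A1)."""
    return quat_mult(quat_pow(q_i, mu_w), quat_pow(q_c_imu, 1.0 - mu_w))
```

With $\mu_v = \mu_\omega = 1$ the coupled output follows the IMU alone, and with $\mu_v = \mu_\omega = 0$ it follows the visual measurement, which is how the emphasis between the two sources can be shifted.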

2.4. Simplification of the State Vector

To simplify further discussion, the subscripts of the variables described in the world coordinate system are omitted; thus, the ICEKF state vector in Equation (1) is rewritten as follows:
$X = \left[\, p_{ic}^T \;\; v_c^T \;\; v_i^T \;\; \bar{q}_{ic}^T \;\; \bar{q}_i^T \;\; \omega_c^T \;\; \omega_i^T \;\; b_a^T \;\; b_\omega^T \;\; \lambda \,\right]^T, \quad (6)$
The derivatives of the variables in the vector are as follows:
$\dot{p}_{ic} = v_{ic} = \mu_v v_i + \lambda(1-\mu_v)R_c^i v_c, \quad \dot{v}_c = n_{v_c}, \quad \dot{v}_i = R_i\left(a_m^i - b_a^i - n_{a_m^i}\right) - g,$
$\dot{\bar{q}}_{ic} = \frac{\mu_\omega}{2}\,\bar{q}_i^{\mu_\omega}\otimes\bar{\omega}_i\otimes\bar{q}_c^{1-\mu_\omega} + \frac{1-\mu_\omega}{2}\,\bar{q}_{ic}\otimes\bar{\omega}_c, \quad \dot{\bar{q}}_i = \frac{1}{2}\,\bar{q}_i\otimes\bar{\omega}_i, \quad \dot{\omega}_c = n_{\omega_c}, \quad \dot{\omega}_i = \frac{d}{dt}\left(\omega_m^i - b_\omega^i - n_{\omega_m^i}\right),$
$\dot{b}_a^i = n_{b_a^i}, \quad \dot{b}_\omega^i = n_{b_\omega^i}, \quad \dot{\lambda} = n_\lambda, \quad (7)$
where $v_c$, $\omega_c$, and $\lambda$ are modeled as random walks, and notably $\bar{q}_c = \bar{q}_c^i \otimes \bar{q}_w^c = \bar{q}_w^{c\prime}$.

2.5. Error of the State Vector

Let $\hat{X}$ denote the expectation of $X$, and let $\tilde{X}$ denote the error between the state and its expectation, written as $\tilde{X} = X - \hat{X}$. $\tilde{X}$, with 28 elements, is described as follows:
$\tilde{X} = \left[\, \Delta p_{ic}^T \;\; \Delta v_c^T \;\; \Delta v_i^T \;\; \delta\theta_{ic}^T \;\; \delta\theta_i^T \;\; \Delta\omega_c^T \;\; \Delta\omega_i^T \;\; \Delta b_a^{i\,T} \;\; \Delta b_\omega^{i\,T} \;\; \Delta\lambda \,\right]^T, \quad (8)$
With the small-angle assumption, when the rotation angle corresponding to a unit quaternion $\bar{q}$ is very small, the error of $\bar{q}$ is written as $\delta\bar{q} = [q_0, \delta q^T]^T \approx [1, \tfrac{1}{2}\delta\theta^T]^T$ [23]. Because the algorithm operates at a high update rate, high-order terms that yield negligible contributions, such as $\delta q\,\Delta\omega$, $\delta q\,\delta q$, and $\delta q\,n$, are disregarded. The derivative $\dot{\tilde{X}}$ can then be inspected term by term.
According to Equation (2), the derivative of the coupled translation error is as follows:
$\Delta\dot{p}_{ic} = \mu_v\,\Delta v_i + \lambda(1-\mu_v)\,R_c^i\,\Delta v_c, \quad (9)$
The expectation of the linear acceleration of the IMU is $\hat{a}_i = a_m^i - \hat{b}_a^i$, and $\Delta b_a^i = b_a^i - \hat{b}_a^i$. According to Appendix A, the rotation matrix can be rewritten as $R_i = R(\bar{q}_i) \approx \hat{R}_i\left(I_3 + \lfloor\delta\theta_i\times\rfloor\right)$ under the small-angle assumption. After neglecting the high-order terms, the error of the linear velocity of the IMU is written as follows:
$\Delta\dot{v}_i = a_i - \hat{a}_i = R_i\left(a_m^i - b_a^i - n_{a_m^i}\right) - g - \hat{R}_i\left(a_m^i - \hat{b}_a^i\right) + g \approx -\hat{R}_i\lfloor\hat{a}_i\times\rfloor\delta\theta_i - \hat{R}_i\,\Delta b_a^i - \hat{R}_i\, n_{a_m^i}, \quad (10)$
For general quaternions and angular velocities, $\bar{q} = \hat{\bar{q}}\otimes\delta\bar{q}$ and $\bar{\omega} = \hat{\bar{\omega}} + \Delta\bar{\omega}$. Substituting $\bar{q}_{ic}^{*} = \bar{q}_c^{*(1-\mu_\omega)}\otimes\bar{q}_i^{*\mu_\omega}$ and $\bar{q}_i^{\mu_\omega}\otimes\bar{q}_i^{*\mu_\omega} = 1$ into Equation (5), the error of the coupled rotation can be derived as follows:
$\dot{\delta\bar{q}}_{ic} = \hat{\bar{q}}_{ic}^{*}\otimes\left(\frac{\mu_\omega}{2}\,\bar{q}_i^{\mu_\omega}\otimes\bar{\omega}_i\otimes\bar{q}_c^{1-\mu_\omega} + \frac{1-\mu_\omega}{2}\,\bar{q}_{ic}\otimes\bar{\omega}_c - \frac{\mu_\omega}{2}\,\hat{\bar{q}}_i^{\mu_\omega}\otimes\hat{\bar{\omega}}_i\otimes\hat{\bar{q}}_c^{1-\mu_\omega}\otimes\delta\bar{q}_{ic} - \frac{1-\mu_\omega}{2}\,\hat{\bar{q}}_{ic}\otimes\hat{\bar{\omega}}_c\otimes\delta\bar{q}_{ic}\right)$
$= \frac{\mu_\omega}{2}\,\delta\bar{q}_{ic}\otimes\bar{q}_{ic}^{*}\otimes\bar{q}_i^{\mu_\omega}\otimes\left(\hat{\bar{\omega}}_i + \Delta\bar{\omega}_i\right)\otimes\bar{q}_c^{1-\mu_\omega} - \frac{\mu_\omega}{2}\,\hat{\bar{q}}_c^{*(1-\mu_\omega)}\otimes\hat{\bar{q}}_i^{*\mu_\omega}\otimes\hat{\bar{q}}_i^{\mu_\omega}\otimes\hat{\bar{\omega}}_i\otimes\hat{\bar{q}}_c^{1-\mu_\omega}\otimes\delta\bar{q}_{ic} + \frac{1-\mu_\omega}{2}\,\delta\bar{q}_{ic}\otimes\left(\hat{\bar{\omega}}_c + \Delta\bar{\omega}_c\right) - \frac{1-\mu_\omega}{2}\,\hat{\bar{\omega}}_c\otimes\delta\bar{q}_{ic}$
$= \frac{\mu_\omega}{2}\,\delta\bar{q}_{ic}\otimes\bar{q}_c^{*(1-\mu_\omega)}\otimes\hat{\bar{\omega}}_i\otimes\bar{q}_c^{1-\mu_\omega} + \frac{\mu_\omega}{2}\,\delta\bar{q}_{ic}\otimes\bar{q}_c^{*(1-\mu_\omega)}\otimes\Delta\bar{\omega}_i\otimes\bar{q}_c^{1-\mu_\omega} - \frac{\mu_\omega}{2}\,\hat{\bar{q}}_c^{*(1-\mu_\omega)}\otimes\hat{\bar{\omega}}_i\otimes\hat{\bar{q}}_c^{1-\mu_\omega}\otimes\delta\bar{q}_{ic} + \frac{1-\mu_\omega}{2}\,\delta\bar{q}_{ic}\otimes\hat{\bar{\omega}}_c + \frac{1-\mu_\omega}{2}\,\delta\bar{q}_{ic}\otimes\Delta\bar{\omega}_c - \frac{1-\mu_\omega}{2}\,\hat{\bar{\omega}}_c\otimes\delta\bar{q}_{ic}, \quad (11)$
According to Appendix A, the rotation of a vector $p_{ic}$ by $\bar{q}_c^{1-\mu_\omega}$ is written as follows:
$\hat{\bar{q}}_c^{*(1-\mu_\omega)}\otimes\bar{p}_{ic}\otimes\hat{\bar{q}}_c^{(1-\mu_\omega)} \Leftrightarrow \hat{R}\!\left(\hat{\bar{q}}_c^{1-\mu_\omega}\right)^T p_{ic} = \hat{R}_{\mu_\omega c}^T\, p_{ic}, \quad (12)$
where $R_{\mu_\omega c}$ is the rotation matrix converted from $\bar{q}_c^{1-\mu_\omega}$.
By substituting Equation (12) into Equation (11) and disregarding the high-order terms, the following is obtained:
$\dot{\delta\bar{q}}_{ic} = \frac{\mu_\omega}{2}\left(\delta\bar{q}_{ic}\otimes\begin{bmatrix}0\\ \hat{R}_{\mu_\omega c}^T\hat{\omega}_i\end{bmatrix} - \begin{bmatrix}0\\ \hat{R}_{\mu_\omega c}^T\hat{\omega}_i\end{bmatrix}\otimes\delta\bar{q}_{ic} + \delta\bar{q}_{ic}\otimes\begin{bmatrix}0\\ \hat{R}_{\mu_\omega c}^T\Delta\omega_i\end{bmatrix}\right) + \frac{1-\mu_\omega}{2}\left(\delta\bar{q}_{ic}\otimes\hat{\bar{\omega}}_c - \hat{\bar{\omega}}_c\otimes\delta\bar{q}_{ic} + \delta\bar{q}_{ic}\otimes\Delta\bar{\omega}_c\right)$
$\approx \frac{\mu_\omega}{2}\left(\begin{bmatrix}0\\ -2\lfloor\hat{R}_{\mu_\omega c}^T\hat{\omega}_i\times\rfloor\delta q_{ic}\end{bmatrix} + \begin{bmatrix}0\\ q_0\,\hat{R}_{\mu_\omega c}^T\Delta\omega_i\end{bmatrix}\right) + \frac{1-\mu_\omega}{2}\left(\begin{bmatrix}0\\ -2\lfloor\hat{\omega}_c\times\rfloor\delta q_{ic}\end{bmatrix} + \begin{bmatrix}0\\ q_0\,\Delta\omega_c\end{bmatrix}\right), \quad (13)$
where each quaternion product has been expanded with the product formula in Appendix A and the second-order error terms have been dropped.
Then, with the small-angle assumption $\delta\bar{q}_{ic} = [q_0, \delta q_{ic}^T]^T \approx [1, \tfrac{1}{2}\delta\theta_{ic}^T]^T$, the error of the coupled equivalent rotation angle is simplified as follows:
$\dot{\delta\theta}_{ic} = -\mu_\omega\left(\lfloor\hat{R}_{\mu_\omega c}^T\hat{\omega}_i\times\rfloor\delta\theta_{ic} - \hat{R}_{\mu_\omega c}^T\Delta\omega_i\right) - (1-\mu_\omega)\left(\lfloor\hat{\omega}_c\times\rfloor\delta\theta_{ic} - \Delta\omega_c\right), \quad (14)$
The expectation of the body angular velocity of the IMU is $\hat{\omega}_i = \omega_m^i - \hat{b}_\omega^i$, and $\Delta b_\omega^i = b_\omega^i - \hat{b}_\omega^i$. Following the same process as above, the error of the rotation of the IMU is written as follows:
$\dot{\delta\bar{q}}_i = \hat{\bar{q}}_i^{*}\otimes\left(\dot{\bar{q}}_i - \dot{\hat{\bar{q}}}_i\otimes\delta\bar{q}_i\right) = \frac{1}{2}\,\delta\bar{q}_i\otimes\left(\hat{\bar{\omega}}_i + \Delta\bar{\omega}_i\right) - \frac{1}{2}\,\hat{\bar{\omega}}_i\otimes\delta\bar{q}_i \approx \begin{bmatrix}0\\ -\lfloor\hat{\omega}_i\times\rfloor\delta q_i\end{bmatrix} + \frac{1}{2}\begin{bmatrix}0\\ q_{i0}\,\Delta\omega_i\end{bmatrix}, \quad (15)$
Equation (15) can be simplified with the small angle assumption as follows:
$\dot{\delta\theta}_i = -\lfloor\hat{\omega}_i\times\rfloor\delta\theta_i + \Delta\omega_i = -\lfloor\hat{\omega}_i\times\rfloor\delta\theta_i - \Delta b_\omega^i - n_{\omega_m^i}, \quad (16)$
The derivatives of the remaining terms in X ˜ are as follows:
$\Delta\dot{v}_c = n_{v_c}, \quad \Delta\dot{\omega}_c = n_{\omega_c}, \quad \Delta\dot{\omega}_i = \frac{d}{dt}\Delta b_\omega^i + n_{\omega_m^i} = n_{\omega_i}, \quad \Delta\dot{b}_a^i = n_{b_a^i}, \quad \Delta\dot{b}_\omega^i = n_{b_\omega^i}, \quad \Delta\dot{\lambda} = n_\lambda, \quad (17)$

3. Propagation and Update of the ICEKF

The propagation and update of the ICEKF are described in detail in this section. The key matrices in the propagation step determine the internal transition process of the filter. The update step rectifies the filtered outcomes in reference to the measurement results.

3.1. Propagation

For the linearized continuous-time errors of an ICEKF state, the following equation exists [24]:
$\dot{\tilde{X}} = F_c\,\tilde{X} + G_c\, n, \quad (18)$
where $n = \left[\, n_{v_c}^T \;\ n_{a_m^i}^T \;\ n_{\omega_c}^T \;\ n_{\omega_m^i}^T \;\ n_{b_a^i}^T \;\ n_{b_\omega^i}^T \,\right]^T$ is the propagation noise vector of the ICEKF, following a Gaussian distribution. $F_c$ and $G_c$ are considered constant during every iteration.
To discretize Equation (18) during the period Δ t , we write the description of the discrete state transition matrix F d and the discrete noise covariance matrix Q d as follows [24]:
$F_d = \exp\left(F_c\,\Delta t\right) = I + F_c\,\Delta t + \frac{1}{2!}F_c^2\,\Delta t^2 + \cdots$
$Q_d = \int_t^{t+\Delta t} F_d(\tau)\, G_c\, Q_c\, G_c^T\, F_d(\tau)^T\, d\tau, \quad (19)$
where $Q_c = \mathrm{diag}\left(\sigma_{n_{v_c}}^2,\ \sigma_{n_{a}^i}^2,\ \sigma_{n_{\omega_c}}^2,\ \sigma_{n_{\omega}^i}^2,\ \sigma_{n_{b_a}^i}^2,\ \sigma_{n_{b_\omega}^i}^2\right)$ is the diagonal covariance matrix of the Gaussian process noise.
Section 2.5 presents the specific expression of X ˙ ˜ . Considering that the algorithm operates at a high frequency, only the first-order expansion in Equation (19) is considered. The complete expression of F d is derived as follows:
$F_d = \begin{bmatrix} I_3 & A_1 & A_2 & 0_3 & 0_3 & 0_3 & 0_3 & 0_3 & 0_3 & 0_{3\times1}\\ 0_3 & I_3 & 0_3 & 0_3 & 0_3 & 0_3 & 0_3 & 0_3 & 0_3 & 0_{3\times1}\\ 0_3 & 0_3 & I_3 & 0_3 & A_3 & 0_3 & 0_3 & A_4 & 0_3 & 0_{3\times1}\\ 0_3 & 0_3 & 0_3 & A_5 & 0_3 & A_6 & A_7 & 0_3 & 0_3 & 0_{3\times1}\\ 0_3 & 0_3 & 0_3 & 0_3 & A_8 & 0_3 & 0_3 & 0_3 & A_9 & 0_{3\times1}\\ \multicolumn{5}{c}{0_{13\times15}} & \multicolumn{5}{c}{I_{13\times13}} \end{bmatrix}_{28\times28}, \quad (20)$
where $A_1 = \lambda(1-\mu_v)R_c^i\Delta t$, $A_2 = \mu_v I_3\Delta t$, $A_3 = -\hat{R}_i\lfloor\hat{a}_i\times\rfloor\Delta t$, $A_4 = -\hat{R}_i\Delta t$, $A_5 = I_3 - \left(\mu_\omega\lfloor\hat{R}_{\mu_\omega c}^T\hat{\omega}_i\times\rfloor + (1-\mu_\omega)\lfloor\hat{\omega}_c\times\rfloor\right)\Delta t$, $A_6 = (1-\mu_\omega)I_3\Delta t$, $A_7 = \mu_\omega\hat{R}_{\mu_\omega c}^T\Delta t$, $A_8 = I_3 - \lfloor\hat{\omega}_i\times\rfloor\Delta t$, and $A_9 = -I_3\Delta t$.
Q d can be further derived by combining Equation (20) and G c , and the explicit form of G c is recovered according to Equation (18) as follows:
$G_c = \begin{bmatrix} 0_3 & 0_3 & 0_3 & 0_3 & 0_3 & 0_3\\ I_3 & 0_3 & 0_3 & 0_3 & 0_3 & 0_3\\ 0_3 & \hat{R}_i & 0_3 & 0_3 & 0_3 & 0_3\\ 0_3 & 0_3 & 0_3 & 0_3 & 0_3 & 0_3\\ 0_3 & 0_3 & 0_3 & I_3 & 0_3 & 0_3\\ 0_3 & 0_3 & I_3 & 0_3 & 0_3 & 0_3\\ 0_3 & 0_3 & 0_3 & 0_3 & I_3 & 0_3\\ \multicolumn{6}{c}{0_{7\times18}} \end{bmatrix}_{28\times18}. \quad (21)$
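As a small illustration of the first-order discretization in Equation (19), the following Python sketch builds the discrete matrices from the continuous-time ones; it assumes $F_c$ is constant over the step and uses a zero-order hold on the noise integrand, which is a simplification rather than the paper's exact integration scheme.

```python
import numpy as np

def discretize(F_c, G_c, Q_c, dt):
    """First-order discretization of the error-state model, Equation (19).

    F_c: 28x28 continuous-time error Jacobian, G_c: 28x18 noise Jacobian,
    Q_c: 18x18 diagonal process-noise covariance."""
    n = F_c.shape[0]
    F_d = np.eye(n) + F_c * dt                       # exp(F_c dt) truncated to first order
    Q_d = F_d @ G_c @ Q_c @ G_c.T @ F_d.T * dt       # zero-order hold on the integrand
    return F_d, Q_d
```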

3.2. Measurement

Let the measurement vector at the $k$-th step of the ICEKF be $z_k$. After the measured linear translation $z_p$, the attitude quaternion $\bar{z}_q$, the linear velocity $z_v$, and the angular velocity $z_\omega$ are obtained by comparing the keyframe results of the visual measurement with the expectation of the ICEKF, the measurement residual $\tilde{z}_k$ is obtained as follows:
$\tilde{z}_k = \left[\, \tilde{z}_p^T \;\ \tilde{\bar{z}}_q^T \;\ \tilde{z}_v^T \;\ \tilde{z}_\omega^T \,\right]^T, \quad (22)$
For the error of the linear translation, the following holds:
$\tilde{z}_p = \left(p_{ic} + R_{ic}\,p_c^i\right)\lambda + n_p - \left(\hat{p}_{ic} + \hat{R}_{ic}\,p_c^i\right)\hat{\lambda} \approx \lambda\,\Delta p_{ic} - \lfloor\hat{R}_{ic}\,p_c^i\,\lambda\times\rfloor\,\delta\theta_{ic} + \left(\hat{p}_{ic} + \hat{R}_{ic}\,p_c^i\right)\Delta\lambda + n_p, \quad (23)$
For the error of the attitude quaternion considering the small angle assumption, the following equation is used:
$\tilde{\bar{z}}_q = \hat{\bar{q}}_{ic}^{*}\otimes\bar{q}_c^i\otimes\bar{q}_c = \delta\bar{q}_{ic} \approx \begin{bmatrix}1\\ \tfrac{1}{2}\delta\theta_{ic}\end{bmatrix} + n_\theta, \quad (24)$
For the error of the linear velocity, the following holds:
$\tilde{z}_v = v_{ic} - \hat{v}_{ic} = \mu_v\,\Delta\dot{v}_i\,\Delta t + \lambda(1-\mu_v)R_c^i\Delta v_c = -\mu_v\Delta t\,\hat{R}_i\lfloor\hat{a}_i\times\rfloor\delta\theta_i - \mu_v\Delta t\,\hat{R}_i\Delta b_a + \lambda(1-\mu_v)R_c^i\Delta v_c + n_v, \quad (25)$
Since the angular velocities in different coordinate frames cannot be directly subtracted, the measurement error can be indirectly obtained according to Equation (14) as follows:
$\tilde{z}_\omega = \dot{\delta\theta}_{ic}\,\Delta t + n_\omega = -\Delta t\left(\mu_\omega\lfloor\hat{R}_{\mu_\omega c}^T\hat{\omega}_i\times\rfloor + (1-\mu_\omega)\lfloor\hat{\omega}_c\times\rfloor\right)\delta\theta_{ic} + (1-\mu_\omega)\Delta t\,\Delta\omega_c + \mu_\omega\Delta t\,\hat{R}_{\mu_\omega c}^T\Delta\omega_i + n_\omega, \quad (26)$
Based on the description of X ˜ in Equation (8) and the above expansion of the measurement process, under the small angle assumption, the measurement error is reformulated as follows:
$\tilde{z}_k \approx H_k\,\tilde{X}_k + n_m, \quad (27)$
where $H_k$ is the measurement matrix and the measurement noise is written compactly as $n_m = \left[\, n_p^T \;\ n_\theta^T \;\ n_v^T \;\ n_\omega^T \,\right]^T$. Thus, $H_k$ can be recovered from Equations (22)-(26) as follows:
$H_k = \begin{bmatrix} \lambda I_3 & 0_{3\times6} & B_1 & 0_{3\times15} & B_2\\ 0_{3\times9} & \tfrac{1}{2}I_3 & 0_{3\times16} & & \\ 0_{3\times3} & B_3 & 0_{3\times6} & B_4 & 0_{3\times6} & -\mu_v\Delta t\,\hat{R}_i & 0_{3\times4}\\ 0_{3\times9} & B_5 & 0_{3\times3} & (1-\mu_\omega)\Delta t\,I_3 & \mu_\omega\Delta t\,\hat{R}_{\mu_\omega c}^T & 0_{3\times7} \end{bmatrix}_{12\times28}, \quad (28)$
where $B_1 = -\lfloor\hat{R}_{ic}\,p_c^i\,\lambda\times\rfloor$, $B_2 = \hat{p}_{ic} + \hat{R}_{ic}\,p_c^i$, $B_3 = \lambda\Delta t(1-\mu_v)R_c^i$, $B_4 = -\mu_v\Delta t\,\hat{R}_i\lfloor\hat{a}_i\times\rfloor$, and $B_5 = -\Delta t\left(\mu_\omega\lfloor\hat{R}_{\mu_\omega c}^T\hat{\omega}_i\times\rfloor + (1-\mu_\omega)\lfloor\hat{\omega}_c\times\rfloor\right)$.
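The sketch below assembles $H_k$ from the blocks $B_1$ to $B_5$ in Python. The column indexing follows the error-state ordering of Equation (8), and the block signs follow the reconstruction in Equations (23)-(26); both are assumptions of this illustration rather than a verbatim transcription of the authors' implementation.

```python
import numpy as np

def skew(v):
    return np.array([[0, -v[2], v[1]], [v[2], 0, -v[0]], [-v[1], v[0], 0]])

def build_H(lam, mu_v, mu_w, dt, R_i_hat, R_ci, R_mu_c,
            a_i_hat, w_i_hat, w_c_hat, p_ci, R_ic_hat, p_ic_hat):
    """Assemble the 12x28 measurement matrix H_k of Equation (28)."""
    H = np.zeros((12, 28))
    # translation residual: depends on dp_ic, dtheta_ic, dlambda
    H[0:3, 0:3] = lam * np.eye(3)
    H[0:3, 9:12] = -lam * skew(R_ic_hat @ p_ci)                       # B1
    H[0:3, 27] = p_ic_hat + R_ic_hat @ p_ci                           # B2
    # attitude residual: depends on dtheta_ic
    H[3:6, 9:12] = 0.5 * np.eye(3)
    # velocity residual: depends on dv_c, dtheta_i, db_a
    H[6:9, 3:6] = lam * dt * (1 - mu_v) * R_ci                        # B3
    H[6:9, 12:15] = -mu_v * dt * R_i_hat @ skew(a_i_hat)              # B4
    H[6:9, 21:24] = -mu_v * dt * R_i_hat
    # angular-velocity residual: depends on dtheta_ic, dw_c, dw_i
    H[9:12, 9:12] = -dt * (mu_w * skew(R_mu_c.T @ w_i_hat)
                           + (1 - mu_w) * skew(w_c_hat))              # B5
    H[9:12, 15:18] = (1 - mu_w) * dt * np.eye(3)
    H[9:12, 18:21] = mu_w * dt * R_mu_c.T
    return H
```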

3.3. Entire ICEKF Process

The entire process of the ICEKF in the k-th iteration is presented as follows:
Step 1. By calculating F d and Q d according to Equation (19), the prior covariance matrix of errors P k + 1 | k can be obtained from the following:
$P_{k+1|k} = F_d\,P_{k|k}\,F_d^T + Q_d, \quad (29)$
Step 2. The Kalman gain matrix K k is updated as follows:
$S_k = H_k\,P_{k+1|k}\,H_k^T + R_k$
$K_k = P_{k+1|k}\,H_k^T\,S_k^{-1}, \quad (30)$
where R k is the measurement noise matrix.
Step 3. The current state $\hat{X}_{k+1|k+1} = \hat{X}_{k+1|k} + \hat{\tilde{X}}_k$ is calculated according to $\hat{\tilde{X}}_k = K_k\,\tilde{z}_k$, where $\tilde{z}_k$ is obtained via Equation (27).
Step 4. The posterior covariance matrix of errors P k + 1 | k + 1 is updated, and Step 1 is carried out again according to the following:
$P_{k+1|k+1} = \left(I_{28\times28} - K_k H_k\right)P_{k+1|k}\left(I_{28\times28} - K_k H_k\right)^T + K_k R_k K_k^T, \quad (31)$
During Step 2, the measurement error model of the visual measurement can be preestablished as described in [25,26]. While updating the attitude during Step 3, the angular error of rotation in X ˜ ^ k is achieved with the small angle assumption. Therefore, the quaternion form of the attitude expectation q ¯ ^ k + 1 must be recovered as follows [23]:
$\delta\hat{q}_k \approx \tfrac{1}{2}\delta\hat{\theta}_k, \qquad \hat{\bar{q}}_{k+1} = \begin{cases} \left[\sqrt{1 - \delta\hat{q}_k^T\delta\hat{q}_k},\ \delta\hat{q}_k^T\right]^T & \text{if } \delta\hat{q}_k^T\delta\hat{q}_k \le 1\\[4pt] \dfrac{1}{\sqrt{1 + \delta\hat{q}_k^T\delta\hat{q}_k}}\left[1,\ \delta\hat{q}_k^T\right]^T & \text{if } \delta\hat{q}_k^T\delta\hat{q}_k > 1 \end{cases}, \quad (32)$
For a clearer view of the entire process, the data flow during the k-th iteration is illustrated by Figure 3.
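The following is a minimal Python sketch of one ICEKF iteration (Steps 1-4 above) together with the quaternion recovery of Equation (32). It returns the error-state correction and the posterior covariance; injecting the correction into the 30-element state (additively for vector quantities, by quaternion composition for the attitude blocks) is left to the caller, since that bookkeeping depends on the state layout.

```python
import numpy as np

def recover_quat(dtheta):
    """Recover the error quaternion from a small-angle correction, Equation (32)."""
    dq = 0.5 * np.asarray(dtheta, dtype=float)
    s = float(dq @ dq)
    if s <= 1.0:
        return np.concatenate(([np.sqrt(1.0 - s)], dq))
    return np.concatenate(([1.0], dq)) / np.sqrt(1.0 + s)

def icekf_step(P, F_d, Q_d, H_k, R_k, z_tilde):
    """One ICEKF iteration: returns the 28-element correction and posterior covariance."""
    P_prior = F_d @ P @ F_d.T + Q_d                     # Step 1, Equation (29)
    S = H_k @ P_prior @ H_k.T + R_k                     # Step 2, Equation (30)
    K = P_prior @ H_k.T @ np.linalg.inv(S)
    dx = K @ z_tilde                                    # Step 3, error-state correction
    I_KH = np.eye(P.shape[0]) - K @ H_k                 # Step 4, Joseph form, Equation (31)
    P_post = I_KH @ P_prior @ I_KH.T + K @ R_k @ K.T
    return dx, P_post
```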

4. Nonlinear Observability Analysis

The nonlinear system with an ICEKF can function properly if it has local weak observability, as described in [27]. To simplify the observability analysis with angular-velocity-layer fusion, a pair of virtual coupled measurement variables are defined: the angular velocity $\omega_m^{ic}$ with bias $b_\omega^{ic}$, which can be regarded as the result of correcting $\omega_m^i$ and $b_\omega^i$ with the visual measurement. Thus, referring to the observation system modeling in [16], the nonlinear system representing the fused measurement results can be expressed as follows:
$\dot{X} = \begin{bmatrix}\dot{p}_{ic}\\ \dot{v}_c\\ \dot{v}_i\\ \dot{\bar{q}}_{ic}\\ \dot{\bar{q}}_i\\ \dot{\omega}_c\\ \dot{\omega}_i\\ \dot{b}_a^i\\ \dot{b}_\omega^i\\ \dot{\lambda}\end{bmatrix} = \underbrace{\begin{bmatrix}\mu_v v_i + \lambda(1-\mu_v)R_c^i v_c\\ 0_{3\times1}\\ -R_i b_a^i - g\\ -\tfrac{1}{2}\Xi(\bar{q}_{ic})\,b_\omega^{ic}\\ -\tfrac{1}{2}\Xi(\bar{q}_i)\,b_\omega^i\\ 0_{3\times1}\\ 0_{3\times1}\\ 0_{3\times1}\\ 0_{3\times1}\\ 0\end{bmatrix}}_{f_0} + \underbrace{\begin{bmatrix}0_{3\times3}\\ 0_{3\times3}\\ 0_{3\times3}\\ 0_{4\times3}\\ \tfrac{1}{2}\Xi(\bar{q}_i)\\ 0_{3\times3}\\ 0_{3\times3}\\ 0_{3\times3}\\ 0_{3\times3}\\ 0\end{bmatrix}}_{f_1}\omega_m^i + \underbrace{\begin{bmatrix}0_{3\times3}\\ 0_{3\times3}\\ 0_{3\times3}\\ \tfrac{1}{2}\Xi(\bar{q}_{ic})\\ 0_{4\times3}\\ 0_{3\times3}\\ 0_{3\times3}\\ 0_{3\times3}\\ 0_{3\times3}\\ 0\end{bmatrix}}_{f_2}\omega_m^{ic} + \underbrace{\begin{bmatrix}0_{3\times3}\\ 0_{3\times3}\\ R_i\\ 0_{4\times3}\\ 0_{4\times3}\\ 0_{3\times3}\\ 0_{3\times3}\\ 0_{3\times3}\\ 0_{3\times3}\\ 0\end{bmatrix}}_{f_3}a_m^i, \quad (33)$
where for a general unit quaternion q ¯ , there is the following:
$\dot{\bar{q}} = 0.5\,\Xi(\bar{q})\,\omega, \qquad \Xi(\bar{q}) = \begin{bmatrix} -q^T\\ q_0 I_3 + \lfloor q\times\rfloor \end{bmatrix}, \quad (34)$
The measurement functions are designed as $h(X) = \left[h_1^T, \ldots, h_8^T\right]^T$, with $h_1 = \left(p_{ic} + R_{ic}\,p_c^i\right)\lambda$, $h_2 = \bar{q}_{ic}$, $h_3 = \bar{q}_i$, $h_4 = \bar{q}_{ic}^T\bar{q}_{ic}$, $h_5 = \bar{q}_i^T\bar{q}_i$, $h_6 = v_c$, $h_7 = \omega_c$, and $h_8 = \omega_i$.
According to the detailed deduction in Appendix C, the observability matrix of the system is constructed with the Lie derivative as follows:
$\Omega = \nabla\begin{bmatrix} L^0 h_1\\ L^0 h_2\\ L^0 h_3\\ L^0 h_4\\ L^0 h_5\\ L^0 h_6\\ L^0 h_7\\ L^0 h_8\\ L_{f_0}^1 h_1\\ L_{f_0}^1 h_3\\ L_{f_0}^2 h_1\\ L_{f_3}^1 L_{f_0}^1 h_1 \end{bmatrix} = \begin{bmatrix} U_1 & 0 & 0 & U_2 & 0 & 0 & 0 & 0 & 0 & U_3\\ 0 & 0 & 0 & U_4 & 0 & 0 & 0 & 0 & 0 & 0\\ 0 & 0 & 0 & 0 & U_5 & 0 & 0 & 0 & 0 & 0\\ 0 & 0 & 0 & U_6 & 0 & 0 & 0 & 0 & 0 & 0\\ 0 & 0 & 0 & 0 & U_7 & 0 & 0 & 0 & 0 & 0\\ 0 & U_8 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0\\ 0 & 0 & 0 & 0 & 0 & U_9 & 0 & 0 & 0 & 0\\ 0 & 0 & 0 & 0 & 0 & 0 & U_{10} & 0 & 0 & 0\\ 0 & U_{11} & U_{12} & G_{[9,4]} & 0 & 0 & 0 & G_{[9,8]} & 0 & G_{[9,10]}\\ 0 & 0 & 0 & 0 & 0 & U_{13} & 0 & 0 & U_{14} & 0\\ 0 & 0 & 0 & G_{[11,4]} & G_{[11,5]} & 0 & 0 & U_{15} & G_{[11,9]} & G_{[11,10]}\\ 0 & 0 & 0 & G_{[12,4]} & G_{[12,5]} & 0 & 0 & 0 & 0 & U_{16} \end{bmatrix}_{38\times30}, \quad (35)$
where each block column corresponds to an entry of the ICEKF state vector, in the order $p_{ic}$, $v_c$, $v_i$, $\bar{q}_{ic}$, $\bar{q}_i$, $\omega_c$, $\omega_i$, $b_a$, $b_\omega$, $\lambda$; the matrices $G$ with indices represent blocks that are irrelevant to the rank analysis, and the matrices $U$ with subscripts are blocks contributing to the column rank of $\Omega$, as follows: $U_1 = \lambda I_3$; $U_4, U_5 = I_4$; $U_8, U_9, U_{10} = I_3$; $U_{12} = \lambda\mu_v I_3$; $U_{14} = 0.5\,\Xi(\bar{q}_i)$; $U_{15} = \lambda\mu_v R_i$; and $U_{16}$ is given explicitly below.
To prove that $\Omega$ has full column rank, block Gaussian elimination is applied. The block rows can be rearranged so that a single block in each row determines whether the corresponding block column has full column rank. Following this process, all the block columns of $\Omega$, with the exception of the last one corresponding to $U_{16}$, have full column rank when $\lambda$ and $\mu_v$ are nonzero.
Expanding the rotation matrix R i as three columns yields the following:
$L_{f_3}^1 L_{f_0}^1 h_1 = \lambda\mu_v R_i = \lambda\mu_v\left[\, r_x \;\ r_y \;\ r_z \,\right], \quad (36)$
An explicit form is obtained by inspecting the Lie derivative in Equation (36) as $U_{16} = \mu_v\left[\, r_x^T \;\ r_y^T \;\ r_z^T \,\right]^T \in \mathbb{R}^{9\times1}$. As described in [16], if the linear acceleration is excited on at least one axis, one of $r_x$, $r_y$, or $r_z$ is a nonzero vector.
In accordance with the analysis above, it can be demonstrated that for the visual–inertial system with the ICEKF described in this paper, when λ , μ v are nonzero and the IMU is excited in any direction, the observability matrix has full column rank, which means that the system has local weak observability [27]. The above conditions are readily fulfilled during practical applications.

5. Simulation and Experiments

Three-dimensional motion simulation and dataset experiments are conducted in this section to analyze the performance of the ICEKF.

5.1. Simulation

Assuming that the coordinate frames of the IMU and the camera are aligned, the pose transformation between them can be omitted. To observe the convergence process, we deliberately assign random initial values to the prior covariance matrix $P_{k|k}$. The starting point is [1, 0, 0]. The covariance matrix of the measurement error $R_k$ is designed as a symmetric matrix with small random values. Gaussian noise is added to all the virtual measurement values. The coefficients are set as $\mu_v = 0.5$, $\mu_\omega = 0.5$, and $\lambda = 1$. The virtual IMU-camera rig is directed to move along a virtual helical trajectory, as described by the following equation:
$x = \cos\left(\tfrac{1}{2}\pi t\right), \quad y = \sin\left(\tfrac{1}{2}\pi t\right), \quad z = t, \quad (37)$
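A short Python sketch of how such a virtual trajectory and its noisy visual keyframes can be generated is shown below; the sampling rates and noise level are illustrative assumptions, not the exact simulation settings of this paper.

```python
import numpy as np

t = np.arange(0.0, 20.0, 0.005)                   # assumed 200 Hz virtual IMU rate
truth = np.stack([np.cos(0.5 * np.pi * t),
                  np.sin(0.5 * np.pi * t),
                  t], axis=1)                      # helical ground truth, starts at [1, 0, 0]

rng = np.random.default_rng(0)
visual_meas = truth[::10] + rng.normal(scale=0.01, size=truth[::10].shape)  # 20 Hz keyframes
```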
The three-dimensional position curves of the ICEKF and the ground truth are shown in Figure 4, which reveals that the estimated results align closely with the true values. The position errors of the ICEKF estimation against the ground truth of the IMU-camera rig along the three axes are depicted in Figure 5, and Figure 6 shows the orientation errors. The curve of λ is shown in Figure 7.
The norm errors between the ground truth and the simulation results, including the statistical values of the RMSE (root mean square error), mean error, and STD (standard deviation) for both coupled translation p i c and attitude q ¯ i c across all three axes, are detailed in Table 2. The errors of the attitude are obtained by converting q ¯ i c to roll-pitch-yaw.
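The statistics reported in Table 2 can be reproduced from a pair of aligned trajectories with a few lines of Python; the helper below is an illustration of the definitions (RMSE, mean, and standard deviation of the per-sample norm error), not the authors' evaluation script.

```python
import numpy as np

def norm_error_stats(estimate, truth):
    """RMSE, mean error, and STD of the per-sample norm error (N x 3 arrays)."""
    err = np.linalg.norm(estimate - truth, axis=1)
    return np.sqrt(np.mean(err ** 2)), err.mean(), err.std()
```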
The curves and numerical statistics show that, despite the randomized $P_{k|k}$ and $R_k$ together with a high yaw-angle velocity disturbing the initial convergence, the ICEKF ultimately converges without much specific parameter tuning. This indicates that the ICEKF possesses strong robustness while yielding reliable motion estimates.
To further test the filter's ability to converge, a set of initial state estimates is designed that starts the virtual rig from different three-dimensional points rather than the actual starting point [1, 0, 0]. The initial estimate set, varying from [0, −1, −1] to [1, 0, 0], and the convergence process are illustrated in Figure 8, which shows that even when facing uncertain initial estimates, the filter converges with decent performance.

5.2. Dataset Experiment

To inspect the effectiveness of the algorithm in real-world scenarios, this paper employs the EuRoC ROOM01 and MainHall02 datasets [28] for validation. Since this study investigates the performance of the loosely coupled visual–inertial algorithm itself instead of visual odometry, one of the state-of-the-art visual SLAM algorithms, monocular ORB-SLAM V3 [29], serves as the visual black-box approach. Given that this algorithm has been confirmed to exhibit exceptional ATE performance, this paper focuses on RPE comparisons of different algorithms by employing EVO tools [30] to analyze trajectories. The ICEKF operates with the IMU data at 200 Hz, and meanwhile the visual measurement runs at 20 Hz. To facilitate the comparison, each frame from the visual measurement with ORB-SLAM is considered a keyframe.
The coefficients are μ v = 0.9 and μ ω = 0.5 , and the initial scale factor is determined based on the prior results of the visual algorithm as λ = 2.04 . Given that the hardware setup and scenario are fixed in the dataset, the initial covariance matrix P k | k of the ICEKF can be easily adjusted via multiple runs. In the first run, the initial P k | k is selected randomly, and eventually stabilizes as the algorithm progresses. The final updated P k | k is utilized to run the scenario test again, which greatly increases the convergence speed and provides high resilience against noise. For applications in unknown environments, initial covariance estimation can be conducted in a similar scenario to further improve convergence efficiency.
Dataset experiments with monocular ORB-SLAM, its IMU variant, and the monocular MSCKF with an IMU were performed. The entire experiment processes are shown in Figure 9 and Figure 10.
The contrast between the three-dimensional position output of the ICEKF and the ground truth is shown in Figure 11 and Figure 12. The three-dimensional position errors and the orientation errors between the output of the ICEKF and the ground truth after applying alignment using Umeyama’s method [30] from the two dataset tests are shown in Figure 13, Figure 14, Figure 15 and Figure 16.
Trajectory analysis using EVO is conducted, in which the ICEKF with the results from ORB-SLAM and its IMU variant, as well as the data obtained from the monocular MSCKF [31], are compared against the ground truth, with the statistical RPE results presented in Table 3 and Table 4.
The comparative results indicate that the ICEKF ensures real-time measurement through high-frequency IMU updates and achieves an accuracy comparable to that of the monocular ORB-SLAM and its IMU variant while outperforming the monocular MSCKF. Although the trajectory does not reach the high accuracy of the ORB-SLAM-based methods owing to the lack of posterior batch optimization, the computational complexity of the coupled part of the algorithm is only O(n), which benefits localization applications. Moreover, the algorithm framework can be directly combined with other localization algorithms to improve their measurement stability in complex environments.
To assess the robustness of the ICEKF with regard to leap noise from visual measurements, this study injects random perturbations with an amplitude of 0.5 m into the visual measurement results to imitate harsh visual measurement failures. Figure 17 shows a comparison of the partial trajectory from the ICEKF against the corrupted visual measurement. It indicates that the ICEKF can significantly mitigate the impact of sporadic instability in visual measurements.
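A minimal sketch of this corruption procedure is given below; the fraction of corrupted keyframes and the random jump directions are assumptions of the illustration, with only the 0.5 m amplitude taken from the test described above.

```python
import numpy as np

def add_leap_noise(visual_pos, rate=0.02, amplitude=0.5, seed=1):
    """Corrupt sporadic visual keyframes with 0.5 m jumps to imitate measurement failure."""
    rng = np.random.default_rng(seed)
    corrupted = visual_pos.copy()
    hits = rng.random(len(corrupted)) < rate            # sporadic keyframes to corrupt
    directions = rng.normal(size=(hits.sum(), 3))
    directions /= np.linalg.norm(directions, axis=1, keepdims=True)
    corrupted[hits] += amplitude * directions
    return corrupted
```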

6. Conclusions

This study introduces a novel weighted loosely coupled algorithm for fusing data from IMUs and visual measurements and demonstrates its observability. The algorithm, whose coupled part has a computational complexity of O(n), retains the high-frequency update capability inherent to the EKF framework, thus enabling high-accuracy active localization. Both the simulation and dataset tests reveal that the ICEKF achieves deep integration of the IMU and the visual data on the velocity layer throughout the updating process, which benefits the localization performance. By conveniently tuning the weights assigned to the different data sources, the framework improves both the fusion accuracy and the resilience to abrupt noise. In future studies, we will aim to apply the approach across multiple devices by analyzing the sensitivity of the coupling coefficients in challenging real-life scenarios.

Author Contributions

Conceptualization, Z.L. and P.T.; methodology, C.L.; software, Z.L. and T.W.; validation, C.L.; formal analysis, C.L.; investigation, Z.L. and P.T.; resources, Z.L. and P.T.; data curation, C.L. and T.W.; writing—original draft preparation, Z.L. and C.L.; writing—review and editing, T.W. and P.T.; visualization, Z.L. and C.L.; supervision, P.T.; project administration, Z.L. and P.T.; funding acquisition, T.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was sponsored by the National Key Research and Development Program of China [Grant number 2022YFC3320503] and the Foundation for Innovative Research Groups of the National Natural Science Foundation of China [Grant number 12221002], funded by Tao Wang.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The source code presented in this study is available upon request from the corresponding author.

Acknowledgments

The authors acknowledge the anonymous reviewers for their helpful comments on the manuscript.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Assuming that the rotation $\bar{q}$ is accomplished during a certain period and that the corresponding body angular velocity is $\omega$, $\dot{\bar{q}} = \frac{1}{2}\bar{q}\otimes\bar{\omega}$ [30], where $\bar{\omega} = [0, \omega^T]^T$. The $\mu$-th power of $\bar{q}$ is written as $\bar{q}^\mu$, which is a unit quaternion that denotes scaling the rotation angle around the same axis by $\mu \in [0, 1]$ [32].
A three-element vector $p$ rotating according to $\bar{q}$ is written as $\bar{q}\otimes\bar{p}\otimes\bar{q}^{*} \Leftrightarrow R(\bar{q})\,p$, with $\bar{p} = [0, p^T]^T$.
For general quaternions, the error between the measurement and the expectation is defined as $\delta\bar{q}$ from $\bar{q} = \hat{\bar{q}}\otimes\delta\bar{q}$. Its derivative form is $\dot{\delta\bar{q}} = \hat{\bar{q}}^{*}\otimes\left(\dot{\bar{q}} - \dot{\hat{\bar{q}}}\otimes\delta\bar{q}\right)$ [16].
According to $R(\delta\bar{q}) \approx I_3 + \lfloor\delta\theta\times\rfloor$ [23], there is $R(\bar{q}) = \hat{R}(\hat{\bar{q}})\,R(\delta\bar{q}) \approx \hat{R}(\hat{\bar{q}})\left(I_3 + \lfloor\delta\theta\times\rfloor\right)$.
Assuming that there are two quaternions, $\bar{q} = [q_0, q^T]^T$ and $\bar{p} = [p_0, p^T]^T$, the following equation exists:
$\bar{q}\otimes\bar{p} = \begin{bmatrix} q_0 & -q^T\\ q & q_0 I_3 + \lfloor q\times\rfloor \end{bmatrix}\bar{p} = \begin{bmatrix} p_0 & -p^T\\ p & p_0 I_3 - \lfloor p\times\rfloor \end{bmatrix}\bar{q}. \quad (A1)$
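The two matrix forms of the Hamilton product in Equation (A1) can be checked numerically with the short Python sketch below; the function names are illustrative.

```python
import numpy as np

def left_matrix(q):
    """Left product matrix: quat_mult(q, p) == left_matrix(q) @ p."""
    q0, x, y, z = q
    return np.array([[q0, -x, -y, -z],
                     [x,  q0, -z,  y],
                     [y,  z,  q0, -x],
                     [z, -y,  x,  q0]])

def right_matrix(p):
    """Right product matrix: quat_mult(q, p) == right_matrix(p) @ q."""
    p0, x, y, z = p
    return np.array([[p0, -x, -y, -z],
                     [x,  p0,  z, -y],
                     [y, -z,  p0,  x],
                     [z,  y, -x,  p0]])
```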

Appendix B

The following lemmas are proposed in this paper for the coupling process on the velocity layer.
Lemma A1.
For the IMU-camera measurement system in Figure 1, let $\bar{q}_w^{c\prime}$ be the equivalent rotation of $\bar{q}_w^c$ described in the IMU frame. Assuming that $\bar{q}_w^{ic}$, $\bar{q}_w^i$, and $\bar{q}_w^{c\prime}$ are unit quaternions with $\bar{q}_w^{ic} = \bar{q}_w^{c\prime} = \bar{q}_w^i$, then $\bar{q}_w^{ic} = \left(\bar{q}_w^i\right)^{\mu_\omega}\otimes\left(\bar{q}_w^{c\prime}\right)^{1-\mu_\omega}$, where $\mu_\omega \in [0, 1]$.
Proof of Lemma A1.
Rewriting the quaternions into axis-angle form [32] yields $\bar{q}_w^{ic} = \left[\cos\theta_{ic},\ r_{ic}\sin\theta_{ic}\right]$, $\bar{q}_w^{c\prime} = \left[\cos\theta_c,\ r_c\sin\theta_c\right]$, and $\bar{q}_w^i = \left[\cos\theta_i,\ r_i\sin\theta_i\right]$, with $\theta_c = \theta_i = \theta_{ic} = \theta$ and unit vectors $r_{ic} = r_c = r_i = r$.
Inspecting the exponential form of quaternions [33], we have the following:
$\left(\bar{q}_w^{c\prime}\right)^{1-\mu_\omega} = \exp\left((1-\mu_\omega)\log\bar{q}_w^{c\prime}\right) = \exp\left((1-\mu_\omega)\left[0,\ \theta r\right]\right) = \left[\cos\left((1-\mu_\omega)\theta\right),\ r\sin\left((1-\mu_\omega)\theta\right)\right], \quad (A2)$
$\left(\bar{q}_w^i\right)^{\mu_\omega} = \exp\left(\mu_\omega\log\bar{q}_w^i\right) = \exp\left(\mu_\omega\left[0,\ \theta r\right]\right) = \left[\cos\left(\mu_\omega\theta\right),\ r\sin\left(\mu_\omega\theta\right)\right]. \quad (A3)$
Three conditions are discussed as follows:
  • When $\mu_\omega = 1$, $\left(\bar{q}_w^{c\prime}\right)^0 = [1,\ 0]$ and $\left(\bar{q}_w^i\right)^1 = \left[\cos\theta,\ r\sin\theta\right]$, so $\left(\bar{q}_w^i\right)^1\otimes\left(\bar{q}_w^{c\prime}\right)^0 = \left[\cos\theta,\ r\sin\theta\right] = \bar{q}_w^{ic}$.
  • When $\mu_\omega = 0$, similarly, $\left(\bar{q}_w^i\right)^0\otimes\left(\bar{q}_w^{c\prime}\right)^1 = \bar{q}_w^{ic}$.
  • When μ ω ( 0 , 1 ) , there is the following:
$\left(\bar{q}_w^i\right)^{\mu_\omega}\otimes\left(\bar{q}_w^{c\prime}\right)^{1-\mu_\omega} = \left[\cos\mu_\omega\theta,\ r\sin\mu_\omega\theta\right]\otimes\left[\cos\left((1-\mu_\omega)\theta\right),\ r\sin\left((1-\mu_\omega)\theta\right)\right]$
$= \left[\cos\mu_\omega\theta\cos\left((1-\mu_\omega)\theta\right) - \sin\mu_\omega\theta\sin\left((1-\mu_\omega)\theta\right)\, r\cdot r,\ \ r\cos\left((1-\mu_\omega)\theta\right)\sin\mu_\omega\theta + r\cos\mu_\omega\theta\sin\left((1-\mu_\omega)\theta\right) + r\times r\,\sin\mu_\omega\theta\sin\left((1-\mu_\omega)\theta\right)\right]$
$= \left[\cos\theta,\ r\sin\theta\right] = \bar{q}_w^{ic}. \quad (A4)$
Therefore, for all three conditions, q ¯ w i c = q ¯ w i μ ω q ¯ w c 1 μ ω holds. □
Lemma A2.
Letting $\mu \in [0, 1]$, for the derivative of the $\mu$-th power of a general unit quaternion $\bar{q}$, $\frac{d}{dt}\bar{q}^\mu = \frac{1}{2}\bar{q}^\mu\otimes\mu\bar{\omega}$.
Proof of Lemma A2.
Referring to Equation (314) in [32] (p. 483), for a general quaternion in the space-referenced inertial axes, namely the body frame in this paper, $\frac{d}{dt}\bar{q} = \frac{1}{2}\bar{q}\otimes\bar{\omega}$.
Referring to Equation (15) in [34] (p. 168), for a unit quaternion $\bar{q}$ denoting a rotation $\alpha$ around a unit vector $r$, $\frac{d}{dt}\alpha = \omega\cdot r$ holds.
To scale the rotation angle with a coefficient $\mu \in [0, 1]$, since the vector $r$ remains unchanged, $\frac{d}{dt}(\mu\alpha) = \mu\,\omega\cdot r$. Thus, $\frac{d}{dt}\bar{q}^\mu = \frac{1}{2}\bar{q}^\mu\otimes\mu\bar{\omega}$ holds. □
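Lemma A1 can also be checked numerically. The small self-contained sketch below uses SciPy's rotation utilities and takes the fractional quaternion power by scaling the rotation vector; it is an illustration under the lemma's assumption of a common rotation, not the paper's own code.

```python
import numpy as np
from scipy.spatial.transform import Rotation as R

# Lemma A1: for a common rotation (angle theta about unit axis r), the weighted
# composition q^mu ⊗ q^(1-mu) reproduces q for any mu in [0, 1].
rotvec = 0.8 * np.array([1.0, 2.0, 2.0]) / 3.0      # theta = 0.8 rad about a unit axis
q = R.from_rotvec(rotvec)

for mu in (0.0, 0.3, 0.7, 1.0):
    q_coupled = R.from_rotvec(mu * rotvec) * R.from_rotvec((1.0 - mu) * rotvec)
    assert np.allclose(q_coupled.as_matrix(), q.as_matrix(), atol=1e-12)
```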

Appendix C

The Lie derivative of the measurement function h x with respect to the vector f x is written as follows:
$L_f h(x) = \nabla_f h(x) = \frac{\partial h(x)}{\partial x}\,f(x). \quad (A5)$
The k -th order Lie derivative of h x with respect to f x is written as follows:
$L_f^k h(x) = \frac{\partial L_f^{k-1} h(x)}{\partial x}\,f(x). \quad (A6)$
Specifically, the zeroth-order Lie derivative of $h(x)$ is the measurement function itself, namely $L^0 h(x) \triangleq h(x)$.
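As a small symbolic illustration of these definitions, the following Python sketch computes first- and second-order Lie derivatives for a toy two-state system with SymPy; the toy drift field and measurement function are illustrative only and unrelated to the ICEKF dynamics.

```python
import sympy as sp

x1, x2 = sp.symbols('x1 x2')
x = sp.Matrix([x1, x2])
f = sp.Matrix([x2, -x1])          # toy drift field
h = sp.Matrix([x1**2 + x2])       # toy measurement function

L1 = h.jacobian(x) * f            # first-order Lie derivative, Equation (A5)
L2 = L1.jacobian(x) * f           # second-order Lie derivative, Equation (A6)
print(sp.simplify(L1), sp.simplify(L2))
```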
By taking the gradient of the zeroth-order Lie derivatives of the measurement functions in $h(X) = \left[h_1^T, \ldots, h_8^T\right]^T$, we obtain the following:
$\nabla L^0 h_1 = \left[\, \lambda I_3 \;\ 0_{3\times3} \;\ 0_{3\times3} \;\ \lambda\Gamma\!\left(\bar{q}_{ic},\ p_c^i\right) \;\ 0_{3\times4} \;\ 0_{3\times3} \;\ 0_{3\times3} \;\ 0_{3\times3} \;\ 0_{3\times3} \;\ p_{ic} + R_{ic}\,p_c^i \,\right]$
$\nabla L^0 h_2 = \left[\, 0_{4\times3} \;\ 0_{4\times3} \;\ 0_{4\times3} \;\ I_4 \;\ 0_{4\times4} \;\ 0_{4\times3} \;\ 0_{4\times3} \;\ 0_{4\times3} \;\ 0_{4\times3} \;\ 0_{4\times1} \,\right]$
$\nabla L^0 h_3 = \left[\, 0_{4\times3} \;\ 0_{4\times3} \;\ 0_{4\times3} \;\ 0_{4\times4} \;\ I_4 \;\ 0_{4\times3} \;\ 0_{4\times3} \;\ 0_{4\times3} \;\ 0_{4\times3} \;\ 0_{4\times1} \,\right]$
$\nabla L^0 h_4 = \left[\, 0_{1\times3} \;\ 0_{1\times3} \;\ 0_{1\times3} \;\ 2\bar{q}_{ic}^T \;\ 0_{1\times4} \;\ 0_{1\times3} \;\ 0_{1\times3} \;\ 0_{1\times3} \;\ 0_{1\times3} \;\ 0 \,\right]$
$\nabla L^0 h_5 = \left[\, 0_{1\times3} \;\ 0_{1\times3} \;\ 0_{1\times3} \;\ 0_{1\times4} \;\ 2\bar{q}_i^T \;\ 0_{1\times3} \;\ 0_{1\times3} \;\ 0_{1\times3} \;\ 0_{1\times3} \;\ 0 \,\right]$
$\nabla L^0 h_6 = \left[\, 0_{3\times3} \;\ I_3 \;\ 0_{3\times3} \;\ 0_{3\times4} \;\ 0_{3\times4} \;\ 0_{3\times3} \;\ 0_{3\times3} \;\ 0_{3\times3} \;\ 0_{3\times3} \;\ 0_{3\times1} \,\right]$
$\nabla L^0 h_7 = \left[\, 0_{3\times3} \;\ 0_{3\times3} \;\ 0_{3\times3} \;\ 0_{3\times4} \;\ 0_{3\times4} \;\ I_3 \;\ 0_{3\times3} \;\ 0_{3\times3} \;\ 0_{3\times3} \;\ 0_{3\times1} \,\right]$
$\nabla L^0 h_8 = \left[\, 0_{3\times3} \;\ 0_{3\times3} \;\ 0_{3\times3} \;\ 0_{3\times4} \;\ 0_{3\times4} \;\ 0_{3\times3} \;\ I_3 \;\ 0_{3\times3} \;\ 0_{3\times3} \;\ 0_{3\times1} \,\right]. \quad (A7)$
The first-order Lie derivative of h 1 with respect to f 0 and its gradient are as follows:
$L_{f_0}^1 h_1 = \nabla L^0 h_1\, f_0 = \lambda\left[\mu_v v_i + \lambda(1-\mu_v)R_c^i v_c\right] - \frac{1}{2}\lambda\,\Gamma\!\left(\bar{q}_{ic},\ p_c^i\right)\Xi(\bar{q}_{ic})\,b_\omega^{ic}. \quad (A8)$
$\nabla L_{f_0}^1 h_1 = \left[\, 0_{3\times3} \;\ \lambda^2(1-\mu_v)R_c^i \;\ \lambda\mu_v I_3 \;\ G_{[9,4]} \;\ 0_{3\times4} \;\ 0_{3\times3} \;\ 0_{3\times3} \;\ G_{[9,8]} \;\ 0_{3\times3} \;\ G_{[9,10]} \,\right] \quad (A9)$
The first-order Lie derivative of h 3 with respect to f 0 and its gradient are as follows:
$L_{f_0}^1 h_3 = \nabla L^0 h_3\, f_0 = -\frac{1}{2}\,\Xi(\bar{q}_i)\,b_\omega^i. \quad (A10)$
$\nabla L_{f_0}^1 h_3 = \left[\, 0_{4\times3} \;\ 0_{4\times3} \;\ 0_{4\times3} \;\ 0_{4\times4} \;\ 0_{4\times4} \;\ I_3 \;\ 0_{4\times3} \;\ 0_{4\times3} \;\ 0.5\,\Xi(\bar{q}_i) \;\ 0_{4\times1} \,\right] \quad (A11)$
The second-order Lie derivative of h 1 with respect to f 0 and its gradient are as follows:
$L_{f_0}^2 h_1 = \nabla L_{f_0}^1 h_1\, f_0 = \lambda\mu_v\left(-R_i\, b_a^i - g\right) - \frac{1}{2}\,G_{[9,4]}\,\Xi(\bar{q}_{ic})\,b_\omega^{ic}. \quad (A12)$
$\nabla L_{f_0}^2 h_1 = \left[\, 0_{3\times3} \;\ 0_{3\times3} \;\ 0_{3\times3} \;\ G_{[11,4]} \;\ G_{[11,5]} \;\ 0_{3\times3} \;\ 0_{3\times3} \;\ \lambda\mu_v R_i \;\ G_{[11,9]} \;\ G_{[11,10]} \,\right] \quad (A13)$
The second-order Lie derivative of h 1 with respect to f 0 as well as f 3 and its gradient are as follows:
$L_{f_3}^1 L_{f_0}^1 h_1 = \nabla L_{f_0}^1 h_1\, f_3 = \lambda\mu_v R_i = \left[\, L_{f_{3,1}}^1 L_{f_0}^1 h_1 \;\ L_{f_{3,2}}^1 L_{f_0}^1 h_1 \;\ L_{f_{3,3}}^1 L_{f_0}^1 h_1 \,\right]_{3\times3}. \quad (A14)$
$\nabla L_{f_3}^1 L_{f_0}^1 h_1 = \nabla\begin{bmatrix} L_{f_{3,1}}^1 L_{f_0}^1 h_1\\ L_{f_{3,2}}^1 L_{f_0}^1 h_1\\ L_{f_{3,3}}^1 L_{f_0}^1 h_1 \end{bmatrix} = \left[\, 0_{9\times3} \;\ 0_{9\times3} \;\ 0_{9\times3} \;\ G_{[12,4]} \;\ G_{[12,5]} \;\ 0_{9\times3} \;\ 0_{9\times3} \;\ 0_{9\times3} \;\ 0_{9\times3} \;\ U_{16} \,\right]_{9\times30} \quad (A15)$

References

  1. Servières, M.; Renaudin, V.; Dupuis, A.; Antigny, N. Visual and Visual-Inertial SLAM: State of the Art, Classification, and Experimental Benchmarking. J. Sens. 2021, 2021, 2054828. [Google Scholar] [CrossRef]
  2. He, M.; Zhu, C.; Huang, Q.; Ren, B.; Liu, J. A review of monocular visual odometry. Vis. Comput. 2020, 36, 1053–1065. [Google Scholar] [CrossRef]
  3. Namgung, H.; Kim, J.S. Collision risk inference system for maritime autonomous surface ships using COLREGs rules compliant collision avoidance. IEEE Access 2019, 9, 7823–7835. [Google Scholar] [CrossRef]
  4. Lin, Y.; Gao, F.; Qin, T.; Gao, W.; Liu, T.; Wu, W.; Yang, Z.; Shen, S. Autonomous aerial navigation using monocular visual-inertial fusion. J. Field Robot. 2018, 35, 23–51. [Google Scholar] [CrossRef]
  5. Nemec, D.; Šimák, V.; Janota, A.; Hruboš, M.; Bubeníková, E. Precise localization of the mobile wheeled robot using sensor fusion of odometry, visual artificial landmarks and inertial sensors. Robot. Auton. Syst. 2019, 112, 168–177. [Google Scholar] [CrossRef]
  6. Li, Z.; You, B.; Ding, L.; Gao, H.; Huang, F. Trajectory Tracking Control for WMRs with the Time-Varying Longitudinal Slippage Based on a New Adaptive SMC Method. Int. J. Aerosp. Eng. 2019, 2019, 4951538. [Google Scholar] [CrossRef]
  7. Namgung, H. Local route planning for collision avoidance of maritime autonomous surface ships in compliance with COLREGs rules. Sustainability 2021, 14, 198. [Google Scholar] [CrossRef]
  8. Alatise, M.B.; Hancke, G.P. A review on challenges of autonomous mobile robot and sensor fusion methods. IEEE Access 2020, 8, 39830–39846. [Google Scholar] [CrossRef]
  9. Tonini, A.; Castelli, M.; Bates, J.S.; Lin, N.N.N.; Painho, M. Visual-Inertial Method for Localizing Aerial Vehicles in GNSS-Denied Environments. Appl. Sci. 2024, 14, 9493. [Google Scholar] [CrossRef]
  10. Hou, Z.; Wang, R. A Loosely-Coupled GNSS-Visual-Inertial Fusion for State Estimation Based on Optimation. In Proceedings of the 2021 IEEE 3rd International Conference on Frontiers Technology of Information and Computer (ICFTIC), Greenville, SC, USA, 12–14 November 2021; pp. 163–168. [Google Scholar]
  11. Talebi, S.P.; Mandic, D.P. On the Dynamics of Multiagent Nonlinear Filtering and Learning. In Proceedings of the 2024 IEEE 34th International Workshop on Machine Learning for Signal Processing (MLSP), London, UK, 22–25 September 2024; pp. 1–6. [Google Scholar]
  12. He, X.; Li, B.; Qiu, S.; Liu, K. Visual–Inertial Odometry of Structured and Unstructured Lines Based on Vanishing Points in Indoor Environments. Appl. Sci. 2024, 14, 1990. [Google Scholar] [CrossRef]
  13. Sun, Z.; Gao, W.; Tao, X.; Pan, S.; Wu, P.; Huang, H. Semi-Tightly Coupled Robust Model for GNSS/UWB/INS Integrated Positioning in Challenging Environments. Remote Sens. 2024, 16, 2108. [Google Scholar] [CrossRef]
  14. Gopaul, N.S.; Wang, J.; Hu, B. Loosely coupled visual odometry aided inertial navigation system using discrete extended Kalman filter with pairwise time correlated measurements. In Proceedings of the 2017 Forum on Cooperative Positioning and Service (CPGPS), Harbin, China, 19–21 May 2017; pp. 283–288. [Google Scholar]
  15. Weiss, S.M. Vision Based Navigation for Micro Helicopters. Ph.D. Thesis, ETH Zurich, Zurich, Switzerland, 2012. [Google Scholar]
  16. Kelly, J.; Sukhatme, G.S. Visual-inertial sensor fusion: Localization, mapping and sensor-to-sensor self-calibration. Int. J. Robot. Res. 2011, 30, 56–79. [Google Scholar] [CrossRef]
  17. Weiss, S.; Siegwart, R. Real-time metric state estimation for modular vision-inertial systems. In Proceedings of the 2011 IEEE International Conference on Robotics and Automation, Shanghai, China, 9–13 May 2011; pp. 4531–4537. [Google Scholar]
  18. Achtelik, M.W.; Weiss, S.; Chli, M.; Dellaerty, F.; Siegwart, R. Collaborative stereo. In Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, San Francisco, CA, USA, 25–30 September 2011; pp. 2242–2248. [Google Scholar]
  19. Brossard, M.; Bonnabel, S.; Barrau, A. Invariant Kalman filtering for visual inertial SLAM. In Proceedings of the 2018 21st International Conference on Information Fusion (FUSION), Cambridge, UK, 10–13 July 2018; pp. 2021–2028. [Google Scholar]
  20. Sun, W.; Li, Y.; Ding, W.; Zhao, J. A Novel Visual Inertial Odometry Based on Interactive Multiple Model and Multi-state Constrained Kalman Filter. IEEE Trans. Instrum. Meas. 2023, 73, 5000110. [Google Scholar] [CrossRef]
  21. Fornasier, A.; Ng, Y.; Mahony, R.; Weiss, S. Equivariant filter design for inertial navigation systems with input measurement biases. In Proceedings of the 2022 International Conference on Robotics and Automation (ICRA), Philadelphia, PA, USA, 23–27 May 2022; pp. 4333–4339. [Google Scholar]
  22. van Goor, P.; Mahony, R. An equivariant filter for visual inertial odometry. In Proceedings of the 2021 IEEE International Conference on Robotics and Automation (ICRA), Xi’an, China, 30 May–5 June 2021; pp. 14432–14438. [Google Scholar]
  23. Trawny, N.; Roumeliotis, S.I. Indirect Kalman Filter for 3D Attitude Estimation; Technical Report; University of Minnesota, Department of Computer Science and Engineering: Minneapolis, MN, USA, 2005; Volume 2. [Google Scholar]
  24. Maybeck, P. Stochastic Models, Estimation, and Control; Academic Press: Cambridge, MA, USA, 1982. [Google Scholar]
  25. Beder, C.; Steffen, R. Determining an initial image pair for fixing the scale of a 3d reconstruction from an image sequence. In Proceedings of the Joint Pattern Recognition Symposium, Berlin, Germany, 12–14 September 2006; pp. 657–666. [Google Scholar]
  26. Eudes, A.; Lhuillier, M. Error propagations for local bundle adjustment. In Proceedings of the 2009 IEEE Conference On Computer Vision and Pattern Recognition, Miami, FL, USA, 20–25 June 2009; pp. 2411–2418. [Google Scholar]
  27. Hermann, R.; Krener, A. Nonlinear controllability and observability. IEEE Trans. Autom. Control 1977, 22, 728–740. [Google Scholar] [CrossRef]
  28. Burri, M.; Nikolic, J.; Gohl, P.; Schneider, T.; Rehder, J.; Omari, S.; Achtelik, M.W.; Siegwart, R. The EuRoC micro aerial vehicle datasets. Int. J. Robot. Res. 2016, 35, 1157–1163. [Google Scholar] [CrossRef]
  29. Campos, C.; Elvira, R.; Rodríguez, J.J.G.; Montiel, J.M.; Tardós, J.D. Orb-slam3: An accurate open-source library for visual, visual–inertial, and multimap slam. IEEE Trans. Robot. 2021, 37, 1874–1890. [Google Scholar] [CrossRef]
  30. Grupp, M. Evo: Python Package for the Evaluation of Odometry and SLAM. Available online: https://github.com/MichaelGrupp/evo (accessed on 9 December 2024).
  31. Li, M.; Mourikis, A.I. High-precision, consistent EKF-based visual-inertial odometry. Int. J. Robot. Res. 2013, 32, 690–711. [Google Scholar] [CrossRef]
  32. Shuster, M.D. A survey of attitude representations. Navigation 1993, 8, 439–517. [Google Scholar]
  33. Dam, E.B.; Koch, M.; Lillholm, M. Quaternions, Interpolation and Animation; Datalogisk Institut, Københavns Universitet: Copenhagen, Denmark, 1998; Volume 2. [Google Scholar]
  34. Gelman, H. A note on the time dependence of the effective axis and angle of rotation. J. Res. Natl. Bur. Stand. 1971, 75, 165–171. [Google Scholar] [CrossRef]
Figure 1. Relationships of the coordinate frames in the ICEKF.
Figure 2. Relationships among the variables in the ICEKF state vector.
Figure 3. The data flow of the variables in the ICEKF.
Figure 4. Three-dimensional position curves of the simulation.
Figure 5. Position error of the simulation.
Figure 6. Orientation error of the simulation.
Figure 7. Visual scale of the simulation.
Figure 8. The initial estimate set and the converging process.
Figure 9. Dataset experiments (ROOM01): (a) monocular ORB-SLAM, (b) monocular ORB-SLAM with IMU, and (c) monocular MSCKF.
Figure 10. Dataset experiments (MainHall02): (a) monocular ORB-SLAM, (b) monocular ORB-SLAM with IMU, and (c) monocular MSCKF.
Figure 11. Position output of the ICEKF and the ground truth (ROOM01).
Figure 12. Position output of the ICEKF and the ground truth (MainHall02).
Figure 13. Position error between the ICEKF and the ground truth (ROOM01).
Figure 14. Orientation error between the ICEKF and the ground truth (ROOM01).
Figure 15. Position error between the ICEKF and the ground truth (MainHall02).
Figure 16. Orientation error between the ICEKF and the ground truth (MainHall02).
Figure 17. Comparison of the partial trajectory from the ICEKF against the visual measurement with leap noise.
Table 1. Coordinate frames and notations in the ICEKF.

Symbol                      Description
w                           fixed world coordinate frame
i                           coordinate frame attached to the IMU
c                           coordinate frame attached to the camera
ic                          coordinate frame attached to the IMU-aided camera system
$x_B^A$                     $x$ represents a general variable vector, $A$ is the coordinate frame attached to the vector, and $B$ is the reference frame; for example, $p_w^c$ denotes the linear translation of the camera frame c measured with respect to the world frame w
$p$                         translation vector of rigid bodies along the 3 axes, of which the quasi-quaternion description is $\bar{p} = [0, p^T]^T$
$\bar{q}$                   unit quaternion according to the Hamilton notation [16], written as $\bar{q} = [q_0, q_1, q_2, q_3]^T = [q_0, q^T]^T$
$\bar{q}^*$                 conjugate quaternion of $\bar{q}$, with $\bar{q}\otimes\bar{q}^* = 1$
$R$                         rotation matrix converted from $\bar{q}$, such as $R_w^i = R(\bar{q}_w^i)$
$\lfloor x\times\rfloor$    skew-symmetric matrix of $x$, with $\lfloor x\times\rfloor y = x\times y$ [23]
$n$                         white Gaussian noise vector with zero mean and covariance $\sigma^2$
$g$                         gravity vector in the world frame
Table 2. Norm errors of the translation and attitude between the ground truth and the simulation results.

Translation RMSE   Translation Mean Error   Translation STD   Attitude RMSE   Attitude Mean Error   Attitude STD
0.207 m            0.1447 m                 0.1408 m          0.1684 rad      0.1348 rad            0.1008 rad
Table 3. Norm errors of the translations between the ground truth and the results of the dataset experiments (ROOM01).

Method                           Translation RMSE   Translation Mean Error   Translation STD
ICEKF                            0.04153 m          0.00574 m                0.04112 m
Monocular ORB-SLAM V3            0.0393 m           0.0355 m                 0.01678 m
Monocular ORB-SLAM V3 with IMU   0.003841 m         0.002973 m               0.002433 m
Monocular MSCKF                  0.1305 m           0.05366 m                0.119 m
Table 4. Norm errors of the translations between the ground truth and the results of the dataset experiments (MainHall02).

Method                           Translation RMSE   Translation Mean Error   Translation STD
ICEKF                            0.08922 m          0.009765 m               0.08869 m
Monocular ORB-SLAM V3            0.735 m            0.533 m                  0.506 m
Monocular ORB-SLAM V3 with IMU   0.001366 m         0.004981 m               0.01272 m
Monocular MSCKF                  0.2689 m           0.09905 m                0.25 m
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
