Remote Tube-based MPC for Tracking Over Lossy Networks

David Umsonst† and Fernando S. Barbosa†
† Ericsson Research, Stockholm, Sweden.
{david.umsonst, fernando.dos.santos.barbosa}@ericsson.com

Abstract

This paper addresses the problem of controlling constrained systems subject to disturbances in the case where controller and system are connected over a lossy network. To do so, we propose a novel framework that splits the concept of tube-based model predictive control into two parts. One runs locally on the system and is responsible for disturbance rejection, while the other runs remotely and provides optimal input trajectories that satisfy the system’s state and input constraints. Key to our approach is the presence of a nominal model and an ancillary controller on the local system. Theoretical guarantees regarding the recursive feasibility and the tracking capabilities in the presence of disturbances and packet losses in both directions are provided. To test the efficacy of the proposed approach, we compare it to a state-of-the-art solution in the case of controlling a cartpole system. Extensive simulations are carried out with both linearized and nonlinear system dynamics, as well as different packet loss probabilities and disturbances. The code for this work is available at https://github.com/EricssonResearch/Robust-Tracking-MPC-over-Lossy-Networks

I Introduction

Wireless communication has evolved to enable faster, higher-volume data transfer, with 5G envisioned as a key enabler of Industry 4.0 [1, 2] and of mass digitalization. In control systems and robotics, faster and more reliable wireless communication enables plants and systems to be controlled remotely using edge and cloud computing, in so-called offloaded control [3]. Running processing-heavy components remotely allows industries to save on cabling and on-plant processing power, to integrate autonomous mobile agents more easily on the industrial floor, and to reduce the energy consumption of battery-powered agents.

However, any wireless network is subject to imperfections and constraints. The former means that it can present delays, packet drops, and even longer outages. The latter implies that its resources, such as throughput and load, are constrained. These two factors are especially precarious for time- and safety-critical systems, such as unstable plants, mobile robots and autonomous cars [4].

A popular approach to address the problem of stabilization under safety and actuator constraints is Model Predictive Control (MPC) [5], since such constraints can be explicitly accounted for in its formulation. Several approaches have been proposed to make MPC robust to network imperfections. Looking specifically into the stabilization problem, [6] considers a bound on the number of consecutively lost packets, while [7] considers bounded delay. Moving to trajectory tracking problems, [8] assumes Bernoulli distributed packet loss, while [9] only assumes that from time to time there are consecutive successful packet deliveries from the plant to the controller and back. In addition to network imperfections, [7] considers a bounded disturbance and [8] considers an unbounded zero mean stochastic disturbance acting on the plant.

Extensive research has been carried out on MPC that can handle local disturbances but disregards the effects of imperfect communication, either because the controller runs onboard or because perfect communication is assumed. Limon et al. [10] propose a robust tracking MPC that keeps the plant state in a bounded neighborhood of the nominal plant state, while tracking a constant reference. Here, the nominal plant represents the plant dynamics without a disturbance present. Roque et al. [11] combine control barrier functions with the nominal system to guarantee that the continuous system is within a bounded neighborhood of the desired reference in between discrete controller updates. Neither [10] nor [11] can handle network imperfections.

In our work, we combine the mild network assumptions of [9] with the disturbance rejection of [10] to develop a novel remote tracking MPC framework. This framework guarantees the satisfaction of state and actuator constraints in the presence of a local disturbance and a lossy network. The key idea is to use a nominal model on the local plant to simulate the nominal plant state in case of packet losses. This nominal plant state allows us to reduce the bandwidth by sending only control input trajectories over the network and it is used in an ancillary controller to reject the disturbance. This allows us to handle both packet losses and local disturbances. Furthermore, the code for our approach is available online (https://github.com/EricssonResearch/Robust-Tracking-MPC-over-Lossy-Networks).

Notation: Let $x\in\mathbb{R}^{n}$ and $A\in\mathbb{R}^{n\times m}$ be a real-valued $n$ -dimensional column vector and matrix with $n$ rows and $m$ columns, respectively. The transpose of a vector $x$ and matrix $A$ are $x^{\top}$ and $A^{\top}$ , respectively. The spectral radius and matrix square root of a square matrix $A$ are denoted by $\rho(A)$ and $A^{\frac{1}{2}}$ , respectively. The $n$ dimensional identity matrix is denoted by $I_{n}$ , while $0$ denotes a scalar, vector, or matrix with zero elements of appropriate dimensions. A symmetric and square positive (semi-)definite matrix $A$ is denoted by $A>0(A\geq 0)$ and we use $\|x\|_{A}^{2}=x^{\top}Ax$ . For a set $\mathbb{P}$ and a matrix $A$ of appropriate dimension, we define $A\mathbb{P}=\{Ap\ |\ p\in\mathbb{P}\}$ . For two sets $\mathbb{P}$ and $\mathbb{Q}$ , the Minkowski sum and the Pontryagin difference are denoted as $\mathbb{P}\oplus\mathbb{Q}$ and $\mathbb{P}\ominus\mathbb{Q}$ , respectively. The probability of an event $E$ is denoted by $\mathrm{Prob}(E)$ .

II Problem Definition

A block diagram summarizing the components involved in our setup is presented in Figure 1. In what follows in this section, we describe such components and formulate the problem addressed in this paper.

Figure 1: Block diagram of the problem setup

II-1 Network

Local plant and remote controller communicate via a potentially lossy network, in which network packets can be lost in both directions. Reasons for a lost packet include a large transmission delay, a packet drop in the network, reordering, or a short network outage. To model these packet losses, we introduce two variables: $\theta_{k}$ and $\gamma_{k}$ . The variable $\theta_{k}\in\{0,1\}$ indicates whether the local plant has received the packet $U_{k}$ or not, i.e., $\theta_{k}=1$ if $U_{k}$ , sent from the remote controller at time step $k$ , has been received at the local plant, and $\theta_{k}=0$ otherwise. Similarly, the variable $\gamma_{k}\in\{0,1\}$ indicates whether the packet $X_{k}$ sent from the local plant has been received at the remote controller ( $\gamma_{k}=1$ ) or not ( $\gamma_{k}=0$ ).

Assumption 1.

Over time, there are infinitely many instances of two consecutive successful transmissions, first from plant to controller and then from controller to plant, i.e.,

\mathrm{Prob}(\cap_{t\geq k}\{\gamma_{t-1}\theta_{t}=0\})=0\ \forall\ k\geq 0.

(1)

This assumption is as in [9], and does not put any major restrictions on the reasons for the packet loss, such as a fixed distribution or a maximum amount of lost packets in a row.
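The loss indicators can be sketched in a few lines. The example below assumes independent Bernoulli losses purely for illustration (Assumption 1 does not require any particular distribution); the function names and loss probabilities are hypothetical and not taken from the accompanying repository.

```python
import random


def simulate_losses(steps, p_loss_down, p_loss_up, seed=0):
    """Draw theta_k (controller -> plant) and gamma_k (plant -> controller)."""
    rng = random.Random(seed)
    theta = [int(rng.random() >= p_loss_down) for _ in range(steps)]
    gamma = [int(rng.random() >= p_loss_up) for _ in range(steps)]
    return theta, gamma


def consecutive_successes(theta, gamma):
    """Time steps t with gamma_{t-1} * theta_t = 1, the event behind Assumption 1."""
    return [t for t in range(1, len(theta)) if gamma[t - 1] * theta[t] == 1]


theta, gamma = simulate_losses(steps=1000, p_loss_down=0.3, p_loss_up=0.3)
hits = consecutive_successes(theta, gamma)
print(f"{len(hits)} of {len(theta) - 1} steps had gamma_(t-1) * theta_t = 1")
```

Assumption 1 asks only that such time steps keep occurring, however the individual losses are distributed.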

II-2 Local plant

Consider a linear time-invariant discrete-time plant with additive disturbance given by

\begin{aligned}x(k+1)&=Ax(k)+Bu(k)+w(k),\\ y(k)&=Cx(k),\end{aligned}

(2)

where $x(k)\in\mathbb{R}^{n_{x}}$ , $u(k)\in\mathbb{R}^{n_{u}}$ , $y(k)\in\mathbb{R}^{n_{y}}$ and ${w(k)\in\mathbb{R}^{n_{x}}}$ are the plant’s state, control input, output, and disturbance at time step $k\in\mathbb{N}_{\geq 0}$ , respectively. Here, ${A\in\mathbb{R}^{n_{x}\times n_{x}}}$ , ${B\in\mathbb{R}^{n_{x}\times n_{u}}}$ , and $C\in\mathbb{R}^{n_{y}\times n_{x}}$ are the system, input, and output matrices, respectively.

Assumption 2.

The system $(A,B)$ is stabilizable.

This assumption is necessary to be able to design a controller that stabilizes the plant (2).

Assumption 3.

The disturbance is bounded by a compact set $\mathbb{W}$ , such that $w(k)\in\mathbb{W}$ for all $k$ , where

\mathbb{W}=\{w\in\mathbb{R}^{n_{x}}\ |\ H_{w}w\leq h_{w}\},

(3)

and $\mathbb{W}$ contains the origin in its interior.

This assumption confines the disturbance to a bounded set, which could, for example, depend on the modelling errors.

Furthermore, we consider constraints on the state, $x(k)\in\mathbb{X}$ , and on the input, $u(k)\in\mathbb{U}$ . These sets represent, for example, a safe set of states in which the plant should evolve and actuator saturation limits. If $x(k)\in\mathbb{X}$ and $u(k)\in\mathbb{U}$ , then $x(k)$ and $u(k)$ are called admissible.

Assumption 4.

The sets $\mathbb{X}$ and $\mathbb{U}$ are bounded sets containing the origin in their interior and are defined as

\mathbb{X}=\{x\in\mathbb{R}^{n_{x}}\ |\ H_{x}x\leq h_{x}\},

(4)

\mathbb{U}=\{u\in\mathbb{R}^{n_{u}}\ |\ H_{u}u\leq h_{u}\}.

(5)
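Checking admissibility against such half-space descriptions amounts to evaluating the inequalities row by row. A minimal sketch, in which the box constraints and the helper name `is_admissible` are hypothetical:

```python
def is_admissible(H, h, v, tol=1e-9):
    """Return True if H v <= h holds row by row."""
    return all(
        sum(H_ij * v_j for H_ij, v_j in zip(row, v)) <= h_i + tol
        for row, h_i in zip(H, h)
    )


# Hypothetical box |x_1| <= 1, |x_2| <= 2 written as H_x x <= h_x.
H_x = [[1, 0], [-1, 0], [0, 1], [0, -1]]
h_x = [1, 1, 2, 2]

print(is_admissible(H_x, h_x, [0.5, -1.5]))  # True
print(is_admissible(H_x, h_x, [1.2, 0.0]))   # False
```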

The control input is determined as ${u(k)=f(x(k),\{U_{i}\}_{i=0}^{k},\{\theta_{i}\}_{i=0}^{k})}$ , where $\{U_{i}\}_{i=0}^{k}$ and $\{\theta_{i}\}_{i=0}^{k}$ are the sequence of packets sent from the remote controller to the local plant and the binary sequence indicating the successful transmission of them, respectively. Note that this function can make use of all previously received packets.

II-3 Remote Controller

The remote controller is used to determine the controller packet $U_{k}$ based on the received packets $X_{i}$ and the desired reference $x_{r}$ . More formally the controller is defined as $g(\{U_{i}\}_{i=0}^{k-1},\{X_{i}\}_{i=0}^{k-1},\{\gamma_{i}\}_{i=0}^{k-1},x_{r})$ , which has access to all previous controller packets and can make use of all previously received plant packets. Here, $\{X_{i}\}_{i=0}^{k-1}$ and $\{\gamma_{i}\}_{i=0}^{k-1}$ are defined similarly as $\{U_{i}\}_{i=0}^{k}$ and $\{\theta_{i}\}_{i=0}^{k}$ above.

II-4 Problem Formulation

Now that all the components are defined, let us formulate the problem we want to solve.

Problem 1.

Given a local plant (2), design $f(\cdot)$ and $g(\cdot)$ such that i) state and input constraints are respected, i.e. $x(k)\in\mathbb{X}$ and $u(k)\in\mathbb{U}$ for $k\geq 0$ , and ii) $x(k)$ converges to a bounded neighborhood of reference $x_{r}(k)\in\mathbb{R}^{n_{x}}$ , despite the lossy network and the disturbance $w(k)$ .

III Preliminaries

In the previous section, we set up our problem; we now present several preliminaries, found, e.g., in [9, 10, 12], that are necessary for our proposed approach. This section introduces the nominal plant dynamics, i.e. the plant dynamics without an additive disturbance, the error between the actual and the nominal plant state, as well as the steady-state behaviour of the nominal plant.

III-1 Nominal plant

The nominal plant [12] is given by

\begin{aligned}x_{\text{n}}(k+1)&=Ax_{\text{n}}(k)+Bu_{\text{n}}(k),\\ y_{\text{n}}(k)&=Cx_{\text{n}}(k),\end{aligned}

(6)

where $x_{\text{n}}(k)\in\mathbb{R}^{n_{x}}$ , $u_{\text{n}}(k)\in\mathbb{R}^{n_{u}}$ , and $y_{\text{n}}(k)\in\mathbb{R}^{n_{y}}$ are the nominal state, the nominal control input, and the nominal output, respectively. Due to the disturbance $w(k)$ in (2), the plant state differs from the nominal state and subsequently we want to show how close the plant state is to the nominal state. To do so, we introduce the error $e(k)=x(k)-x_{\text{n}}(k)$ .

If $A$ is unstable, then the error will diverge such that the plant state is not close to the nominal state. To prevent that, we introduce an ancillary controller, which will be used by the plant to track the nominal state. The ancillary controller is given by

u(k)=u_{\text{n}}(k)-K\left(x(k)-x_{\text{n}}(k)\right),

(7)

where $K\in\mathbb{R}^{n_{u}\times n_{x}}$ is a linear state feedback controller chosen such that $\rho(A-BK)<1$ , which is possible due to Assumption 2. Note that if the system matrix $A$ is stable, i.e., $\rho(A)<1$ , then we could choose $K=0$ .

When the plant uses the ancillary controller (7), we obtain the following error dynamics

e(k+1)=(A-BK)e(k)+w(k).

(8)

The evolution of $e(k)$ is bounded, because $\mathbb{W}$ is a compact set and $A-BK$ is stable [13].

We introduce the minimal robust positively invariant set [14] to determine the bounded set in which $e(k)$ evolves as

\mathbb{Z}_{K}=\bigoplus_{i=0}^{\infty}(A-BK)^{i}\mathbb{W}.

(9)

It is guaranteed that $(A-BK)\mathbb{Z}_{K}\oplus\mathbb{W}\subseteq\mathbb{Z}_{K}$ , i.e., if ${e(k_{0})\in\mathbb{Z}_{K}}$ , then $e(k)\in\mathbb{Z}_{K}$ for all $k>k_{0}$ . Since $0\in\mathbb{W}$ , we have $0\in\mathbb{Z}_{K}$ [14]. The set $\mathbb{Z}_{K}$ can be overapproximated with, for example, the methods proposed in [14] and [15].
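As a worked example, consider a scalar plant with a symmetric disturbance set $\mathbb{W}=[-\bar{w},\bar{w}]$ and closed-loop factor $a_{K}=A-BK$ with $|a_{K}|<1$ ; the symbols $\bar{w}$ and $a_{K}$ are introduced here only for illustration. The Minkowski sum in (9) then reduces to a geometric series:

```latex
\mathbb{Z}_{K}=\bigoplus_{i=0}^{\infty}a_{K}^{i}[-\bar{w},\bar{w}]
=\left[-\frac{\bar{w}}{1-|a_{K}|},\ \frac{\bar{w}}{1-|a_{K}|}\right].
```

For instance, $a_{K}=0.5$ and $\bar{w}=0.1$ give $\mathbb{Z}_{K}=[-0.2,0.2]$ , and invariance can be checked directly: $a_{K}\mathbb{Z}_{K}\oplus\mathbb{W}=[-0.1,0.1]\oplus[-0.1,0.1]=[-0.2,0.2]\subseteq\mathbb{Z}_{K}$ .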

With $\mathbb{Z}_{K}$ defined, it is known that [12]

x(k)\in\{x_{\text{n}}(k)\}\oplus\mathbb{Z}_{K}\ \forall k>0,

(10)

given that $x(0)\in\{x_{\text{n}}(0)\}\oplus\mathbb{Z}_{K}$ . This means that the plant state evolves in a bounded neighborhood $\mathbb{Z}_{K}$ around the nominal state. This bounded neighborhood is often called a tube. The size of $\mathbb{Z}_{K}$ depends on the ancillary controller $K$ , so that the ancillary controller determines how closely the plant state will track the nominal state. Similarly, we obtain

u(k)\in\{u_{\text{n}}(k)\}\oplus(-K)\mathbb{Z}_{K},

(11)

which means that the control input also evolves in a bounded neighborhood around the nominal control input.

Therefore, we will introduce tightened constraint sets [12] in which the nominal state and input trajectory should evolve, i.e., ${x_{\text{n}}(k)\in\mathbb{X}_{\text{c}}}$ and ${u_{\text{n}}(k)\in\mathbb{U}_{\text{c}}}$ , which guarantee that the plant state and input trajectories evolve in the sets $\mathbb{X}$ and $\mathbb{U}$ , respectively. We define the tightened sets $\mathbb{X}_{\text{c}}=\mathbb{X}\ominus\mathbb{Z}_{K}$ and ${\mathbb{U}_{\text{c}}=\mathbb{U}\ominus(-K)\mathbb{Z}_{K}}$ , which guarantee that $\mathbb{X}_{\text{c}}\oplus\mathbb{Z}_{K}\subseteq\mathbb{X}$ and $\mathbb{U}_{\text{c}}\oplus(-K)\mathbb{Z}_{K}\subseteq\mathbb{U}$ .
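For interval-valued sets the tightening can be computed in closed form, which makes the effect of the tube easy to see. The sketch below assumes a scalar plant; the numerical values and the helper names `pontryagin_diff` and `scale` are hypothetical.

```python
def pontryagin_diff(outer, inner):
    """Interval shrunk so that adding any point of `inner` stays in `outer`."""
    (lo, hi), (i_lo, i_hi) = outer, inner
    tight = (lo - i_lo, hi - i_hi)
    if tight[0] > tight[1]:
        raise ValueError("tightened set is empty; the tube is too large")
    return tight


def scale(interval, factor):
    """Image of an interval under multiplication by a scalar such as -K."""
    lo, hi = interval
    return (min(factor * lo, factor * hi), max(factor * lo, factor * hi))


# Hypothetical scalar data: X = [-1, 1], U = [-2, 2], Z_K = [-0.2, 0.2], K = 0.7.
X, U, Z_K, K = (-1.0, 1.0), (-2.0, 2.0), (-0.2, 0.2), 0.7

X_c = pontryagin_diff(X, Z_K)             # X minus Z_K
U_c = pontryagin_diff(U, scale(Z_K, -K))  # U minus (-K) Z_K
print(X_c, U_c)
```

With these numbers the state bound shrinks from 1 to 0.8 and the input bound from 2 to 1.86, so a smaller tube leaves more room for the nominal trajectory.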

III-2 Steady-state behavior

Next, we look into the steady-states of the nominal plant [10, 9] and how to control the nominal plant towards a steady state while guaranteeing that the nominal state and input remain in $\mathbb{X}_{\text{c}}$ and $\mathbb{U}_{\text{c}}$ , respectively.

The steady-state equations of (6) are given by

\displaystyle\begin{bmatrix}A-I_{n_{x}}&B\end{bmatrix}\begin{bmatrix}\bar{x}\\ \bar{u}\end{bmatrix}=0,

(12)

which have a solution due to Assumption 2. Here, $\bar{x}\in\mathbb{R}^{n_{x}}$ and $\bar{u}\in\mathbb{R}^{n_{u}}$ are a steady state and steady-state input, respectively. To control the nominal system towards the steady state, we introduce the state feedback controller ${\bar{K}\in\mathbb{R}^{n_{u}\times n_{x}}}$ for the nominal plant

u_{\text{n}}(k)=\bar{u}-\bar{K}(x_{\text{n}}(k)-\bar{x}),

(13)

where $\bar{K}$ is chosen such that $\rho(A-B\bar{K})<1$ . However, we want to guarantee that $x_{\text{n}}\in\mathbb{X}_{\text{c}}$ and $u_{\text{n}}\in\mathbb{U}_{\text{c}}$ . Thus, we define the augmented state ${x_{\text{a}}(k)=[x_{\text{n}}^{\top}(k),\ \bar{x}^{\top},\ \bar{u}^{\top}]^{\top}}$ , whose dynamics under the controller (13) are given by

x_{\text{a}}(k+1)=A_{\text{a}}x_{\text{a}}(k)\ \mathrm{with}\ A_{\text{a}}=\begin{bmatrix}A-B\bar{K}&B\bar{K}&B\\ 0&I_{n_{x}}&0\\ 0&0&I_{n_{u}}\end{bmatrix}.

Next, we define the maximum admissible set [13]

X_{f,\bar{K}}=\{x_{\text{a}}\ |\ A_{\text{a}}^{k}x_{\text{a}}\in\mathbb{X}_{\text{a},\bar{K}}\ \forall\,k\in\mathbb{N}_{\geq 0}\},

(14)

where ${\mathbb{X}_{\text{a},\bar{K}}=\{x_{\text{a}}\ |\ x_{\text{n}}\in\mathbb{X}_{\text{c}},\ \bar{u}-\bar{K}(x_{\text{n}}-\bar{x})\in\mathbb{U}_{\text{c}}\}}$ . If ${[x_{\text{n}}(0)^{\top},\bar{x}^{\top},\bar{u}^{\top}]^{\top}\in X_{f,\bar{K}}}$ , then the nominal plant (6) using the control law (13) guarantees that ${[x_{\text{n}}(k)^{\top},\bar{x}^{\top},\bar{u}^{\top}]^{\top}\in X_{f,\bar{K}}}$ for all $k>0$ and that $x_{\text{n}}(k)$ converges to the steady state $\bar{x}$ . We can compute $X_{f,\bar{K}}$ as described in [13]. Since $X_{f,\bar{K}}$ might not be finitely determined, i.e., the polytope $X_{f,\bar{K}}$ cannot be described by a finite number of inequalities, we introduce

X_{f,\bar{K}}^{\lambda}=X_{f,\bar{K}}\cap\{\bar{x},\ \bar{u}\ |\ \bar{x}\in\lambda\mathbb{X}_{\text{c}},\ \bar{u}\in\lambda\mathbb{U}_{\text{c}}\},

(15)

with $\lambda\in(0,1)$ . This is a finitely determined set that approximates $X_{f,\bar{K}}$ arbitrarily well as $\lambda\rightarrow 1$ [13].
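Membership in the maximal admissible set can be probed numerically by rolling out the closed loop (6), (13) and checking the tightened constraints along the way. The scalar sketch below does this over a finite horizon only, so it approximates the condition in (14) and is not the algorithm of [13]; the function name, the horizon, and all numerical values are assumptions made for illustration.

```python
def stays_admissible(x0, x_bar, a, b, k_bar, x_c, u_c, horizon=200):
    """Finite-horizon check that x_n(k) and u_n(k) stay in X_c and U_c."""
    u_bar = (1 - a) * x_bar / b          # scalar version of (12)
    x = x0
    for _ in range(horizon):
        u = u_bar - k_bar * (x - x_bar)  # control law (13)
        if not (x_c[0] <= x <= x_c[1] and u_c[0] <= u <= u_c[1]):
            return False
        x = a * x + b * u                # nominal dynamics (6)
    return True


# Hypothetical data: a = 1.2, b = 1, K_bar = 0.7, steady state x_bar = 0.5.
print(stays_admissible(0.0, 0.5, 1.2, 1.0, 0.7, (-0.8, 0.8), (-1.86, 1.86)))
print(stays_admissible(0.0, 0.5, 1.2, 1.0, 0.7, (-0.8, 0.8), (-0.2, 0.2)))
```

The second call uses a much tighter input set, for which the first input of the steady-state controller already violates the bound.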

IV Remote Tube-Based Tracking MPC over Lossy Networks

In this section, we describe in more detail the Remote Tube-based Tracking MPC over Lossy Networks approach that we propose to solve Problem 1, together with its theoretical guarantees. As mentioned earlier, the proposed approach is an extension of those presented in [9] and [10] that enables remote tracking of references even in the presence of disturbances acting on the plant and lossy networks.

Figure 2 presents the architecture of our proposed approach. It is composed of five parts: two are placed remotely representing $g(\cdot)$ , namely the MPC controller and the state estimator, and three are placed together with the local plant representing $f(\cdot)$ , namely the consistent actuator, the nominal plant, and the ancillary controller.

IV-A Remote Model Predictive Controller For Tracking

To track the reference $x_{r}$ , we will use a model predictive controller on the remote controller-side, which is inspired by [9]. The cost function optimized in the MPC is given by

c(\mathbf{u},\mathbf{x},\bar{x},\bar{u},x_{r})=\sum_{i=0}^{N-1}\left(c_{i}(\mathbf{u},\mathbf{x},\bar{x},\bar{u})\right)+\bar{c}(\mathbf{x},\bar{x},x_{r}),

(16)

where $\mathbf{u}=\{\mathbf{u}(0),\ldots,\mathbf{u}(N-1)\}$ , $\mathbf{x}=\{\mathbf{x}(0),\ldots,\mathbf{x}(N)\}$ ,

c_{i}(\mathbf{u},\mathbf{x},\bar{x},\bar{u})=\|\mathbf{x}(i)-\bar{x}\|_{Q}^{2}+\|\mathbf{u}(i)-\bar{u}\|_{R}^{2},

(17)

\bar{c}(\mathbf{x},\bar{x},x_{r})=\|\mathbf{x}(N)-\bar{x}\|_{P}^{2}+\|\bar{x}-x_{r}\|_{T}^{2},

(18)

and $Q\geq 0$ , $R>0$ , and $T>0$ are the symmetric cost matrices for the state, input, and the tracking output, and $P$ is the solution of $P=(A-B\bar{K})^{\top}P(A-B\bar{K})+Q+\bar{K}^{\top}R\bar{K}.$ Given the cost function, a state estimate $\hat{x}(k|k-1)$ and a reference signal $x_{r}$ , the optimization problem of the MPC is formulated as follows


\begin{align}
\min_{\mathbf{u},\bar{x},\bar{u}}\ & c(\mathbf{u},\mathbf{x},\bar{x},\bar{u},x_{r}) \tag{19a}\\
\mathrm{s.t.}\ & \mathbf{x}(i+1)=A\mathbf{x}(i)+B\mathbf{u}(i), \tag{19b}\\
& \mathbf{x}(i)\in\mathbb{X}_{\text{c}},\ \mathbf{u}(i)\in\mathbb{U}_{\text{c}},\ i\in\{0,\ldots,N-1\}, \tag{19c}\\
& \mathbf{x}(0)=\hat{x}(k|k-1), \tag{19d}\\
& (\mathbf{x}(N),\,\bar{x},\,\bar{u})\in X_{f,\bar{K}}^{\lambda}, \tag{19e}\\
& \begin{bmatrix}A-I_{n_{x}}&B\end{bmatrix}\begin{bmatrix}\bar{x}\\ \bar{u}\end{bmatrix}=0, \tag{19f}
\end{align}

where the sets $\mathbb{X}_{\text{c}}$ , $\mathbb{U}_{\text{c}}$ , and $X_{f,\bar{K}}^{\lambda}$ are as in Section III.
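The terminal weight $P$ in (18) solves a discrete Lyapunov equation. For a scalar plant it can be obtained by a simple fixed-point iteration, sketched below with hypothetical numbers; for matrices one would typically rely on a dedicated Lyapunov solver instead.

```python
def terminal_weight(a, b, k_bar, q, r, iters=500):
    """Fixed-point iteration for the scalar discrete Lyapunov equation."""
    a_cl = a - b * k_bar
    if abs(a_cl) >= 1:
        raise ValueError("k_bar must stabilise the nominal plant")
    p = 0.0
    for _ in range(iters):
        p = a_cl * p * a_cl + q + k_bar * r * k_bar
    return p


# Hypothetical scalar data; the closed form follows from solving for p directly.
a, b, k_bar, q, r = 1.2, 1.0, 0.7, 1.0, 0.1
a_cl = a - b * k_bar
p = terminal_weight(a, b, k_bar, q, r)
print(p, (q + k_bar**2 * r) / (1 - a_cl**2))
```

The iteration converges because $\rho(A-B\bar{K})<1$ , which is the same condition that makes the Lyapunov equation uniquely solvable.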

Compared to the remote MPC formulated in [9], the MPC (19) generates trajectories for the nominal plant by using the tightened sets $\mathbb{X}_{\text{c}}$ , $\mathbb{U}_{\text{c}}$ , and $X_{f,\bar{K}}^{\lambda}$ . This difference is inspired by [10] and we make use of it in Section IV-C to generate inputs $u(k)\in\mathbb{U}$ , which guarantee $x(k)\in\mathbb{X}$ for all ${k\in\mathbb{N}_{\geq 0}}$ . Since the communication from the plant to the remote controller is lossy, we are not guaranteed to have the plant state $x(k)$ available at time $k$ . Therefore, we use the estimate ${\hat{x}(k|k-1)}$ based on the previously received packets as in [9] (see Section IV-D) instead of the true state.

Let the optimal solution of (19) at time step $k$ be $\mathbf{u}_{k}^{*},\bar{u}_{k}^{*}$ , and $\bar{x}_{k}^{*}$ . With that, the packet $U_{k}$ is, similar to [9], constructed as follows

U_{k}=\{\mathbf{u}_{k}^{*},\bar{u}_{k}^{*}+\bar{K}\bar{x}_{k}^{*},q_{k}\},

(20)

where $q_{k}$ is the time instance when the remote estimator has last received a packet from the local plant. The packet contains the optimal nominal input trajectory at time step $k$ and the steady-state control input for the nominal plant.

IV-B Consistent Actuator

The consistent actuator is located at the local plant and is responsible for deciding the next nominal control input $u_{\text{n}}(k)$ . It has the same functionality as the Smart Actuator in [9]. When a packet $U_{k}$ is received, the consistent actuator needs to decide whether $U_{k}$ will be used or discarded; in the latter case, the packet already in use continues to be applied.

The consistent actuator might discard a received packet because the estimated state on the remote controller side is inconsistent with the actual state on the plant. This means that the control inputs have been calculated based on an incorrectly estimated state. To determine consistency, we use a variable $\Theta_{k}$ as in [9], which is calculated as follows

\Theta_{k}=\begin{cases}\prod_{i=q_{k}+1}^{k}\theta_{i}&\mathrm{if}\ \theta_{k}=1,\\ 0&\mathrm{otherwise.}\end{cases}

(21)

We observe that if $\theta_{k}=1$ , i.e., the packet is received at time step $k$ , then we can calculate the product and otherwise ${\Theta_{k}=0}$ . Once $\Theta_{k}$ is determined the consistent actuator updates its internal state $s_{k}$ as follows

s_{k}=\Theta_{k}k+(1-\Theta_{k})s_{k-1}.

(22)

This internal state keeps track of which packet $U_{s_{k}}$ should be used by the consistent actuator at time step $k$ . Note that if $\Theta_{k}=1$ then $s_{k}=k$ and the latest packet $U_{k}$ will be used. Once $s_{k}$ has been determined, the packet $X_{k}$ is sent from the plant to the controller with the following content,

X_{k}=\{x_{\text{n}}(k),s_{k}\}.

(23)

While in [9] the packet $X_{k}$ contains $x(k)$ and $s_{k}$ , our proposed solution sends the nominal state $x_{\text{n}}(k)$ to the remote controller, which is obtained as described in Section IV-C.

The consistent actuator determines $u_{\text{n}}(k)$ as

u_{\text{n}}(k)=\begin{cases}\mathbf{u}_{s_{k}}^{*}(k-s_{k})&\mathrm{if}\ k-s_{k}<N,\\ \bar{u}_{s_{k}}^{*}+\bar{K}\bar{x}_{s_{k}}^{*}-\bar{K}x_{\text{n}}(k)&\mathrm{otherwise}.\end{cases}

(24)

In a nutshell, the consistent actuator uses all predicted control inputs in a packet $U_{k}$ if no new consistent packet has been received and once there are no more predicted inputs available it uses the controller $\bar{K}$ in (13) to control the nominal plant around the steady state $\bar{x}_{s_{k}}^{*}$ .

Note that since $u_{\text{n}}(k)$ is determined from an optimal trajectory coming from the MPC, it is guaranteed that ${u_{\text{n}}(k)\in\mathbb{U}_{\text{c}}}$ .
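The logic of (21), (22), and (24) can be sketched as a small state machine. The version below assumes a scalar input and keeps the full reception history in memory; the class names, field names, and numbers are hypothetical and do not mirror the accompanying repository.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ControllerPacket:
    """Scalar-input version of the packet U_k in (20)."""
    u_traj: List[float]  # u*_k(0), ..., u*_k(N-1)
    u_offset: float      # u_bar*_k + K_bar * x_bar*_k
    q: int               # q_k, last time the controller heard from the plant


class ConsistentActuator:
    def __init__(self, k_bar: float):
        self.k_bar = k_bar
        self.received = {}                    # theta_i for every past step i
        self.s: Optional[int] = None          # s_k, index of the packet in use
        self.active: Optional[ControllerPacket] = None

    def step(self, k: int, packet: Optional[ControllerPacket], x_n: float) -> float:
        """Return u_n(k); `packet` is None when U_k was lost."""
        self.received[k] = packet is not None
        # Theta_k from (21): every packet after q_k, up to and including k, arrived.
        consistent = packet is not None and all(
            self.received.get(i, False) for i in range(packet.q + 1, k + 1)
        )
        if consistent:                        # s_k update (22)
            self.s, self.active = k, packet
        if self.active is None:
            raise RuntimeError("no consistent packet received yet")
        age = k - self.s
        if age < len(self.active.u_traj):     # first case of (24)
            return self.active.u_traj[age]
        return self.active.u_offset - self.k_bar * x_n  # second case of (24)


actuator = ConsistentActuator(k_bar=0.7)
schedule = [
    ControllerPacket([0.3, 0.2], 0.25, q=-1),  # k = 0: accepted
    None,                                      # k = 1: lost
    ControllerPacket([0.9, 0.8], 0.50, q=0),   # k = 2: discarded, U_1 was lost
    ControllerPacket([0.6, 0.5], 0.40, q=2),   # k = 3: accepted
]
inputs = [actuator.step(k, pkt, x_n=0.5) for k, pkt in enumerate(schedule)]
print(inputs)
```

At $k=2$ the buffered trajectory of length two is exhausted, so the actuator falls back to the steady-state controller until the consistent packet at $k=3$ arrives.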

IV-C Nominal Plant and Ancillary Controller

The main idea of our proposed approach is that a model of the nominal plant runs on the local plant to determine the nominal plant state $x_{\text{n}}(k)$ . Here, $x_{\text{n}}(k)$ together with $u_{\text{n}}(k)$ coming from (24) are used to determine the control input $u(k)$ for the plant via the ancillary controller $K$ in (7).

The nominal control input coming from the consistent actuator is then applied to the model of the nominal plant, which evolves as described in (6). Since the nominal control inputs are determined by the MPC problem, they guarantee that $x_{\text{n}}(k)\in\mathbb{X}_{\text{c}}$ for all $k\geq 0$ .

As described in Section III, if $x_{\text{n}}(k)\in\mathbb{X}_{\text{c}}$ and $u_{\text{n}}(k)\in\mathbb{U}_{\text{c}}$ for all $k\in\mathbb{N}_{\geq 0}$ , the ancillary controller guarantees that $x(k)\in\{x_{\text{n}}(k)\}\oplus\mathbb{Z}_{K}\subseteq\mathbb{X}$ and $u(k)\in\{u_{\text{n}}(k)\}\oplus(-K)\mathbb{Z}_{K}\subseteq\mathbb{U}$ for all $k\in\mathbb{N}_{\geq 0}$ .

The nominal plant and ancillary controller on the local plant are key to making our approach work because they enable us to track a reference $x_{r}$ in the presence of a disturbance $w(k)$ , and they are the main architectural difference from [9]. Furthermore, running a nominal model is computationally cheaper than running a robust MPC as in [10] on the local plant. This makes our proposed approach more applicable to lightweight devices controlled over a lossy network without sacrificing robustness.
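A short simulation illustrates how cheap the local computation is: one nominal model update and one feedback law per time step. The sketch assumes a scalar plant with a uniformly distributed disturbance and uses the steady-state controller (13) as a stand-in for the inputs that would come from the remote MPC; all numbers are hypothetical.

```python
import random

a, b = 1.2, 1.0            # hypothetical unstable scalar plant
K, K_bar = 0.7, 0.7        # ancillary and steady-state gains
w_max = 0.1                # W = [-w_max, w_max]
z_max = w_max / (1 - abs(a - b * K))   # half-width of Z_K in the scalar case

x_bar = 0.5
u_bar = (1 - a) * x_bar / b            # scalar version of (12)

rng = random.Random(0)
x = x_n = 0.0
max_dev = 0.0
for k in range(500):
    u_n = u_bar - K_bar * (x_n - x_bar)              # stand-in for the MPC inputs
    u = u_n - K * (x - x_n)                          # ancillary controller (7)
    x = a * x + b * u + rng.uniform(-w_max, w_max)   # disturbed plant (2)
    x_n = a * x_n + b * u_n                          # local nominal model (6)
    max_dev = max(max_dev, abs(x - x_n))

print(f"max |x - x_n| = {max_dev:.3f}, tube half-width = {z_max:.3f}")
```

The deviation between plant and nominal state never leaves the tube, while the nominal state itself converges to the steady state undisturbed.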

Remark 1.

The ancillary controller $K$ and the steady-state controller $\bar{K}$ are not necessarily the same. This enables us to tune $K$ to minimize $\mathbb{Z}_{K}$ , while $\bar{K}$ can be tuned to increase the size of $X_{f,\bar{K}}^{\lambda}$ . For the former, Section 7 in [10] proposed a semi-definite program to design $K$ , which minimizes $\mathbb{Z}_{K}$ while guaranteeing that $\mathbb{X}_{\text{c}}$ and $\mathbb{U}_{\text{c}}$ are non-empty. For the latter, a common choice in the literature is to choose $\bar{K}$ as the optimal LQR gain.

IV-D Estimator

The estimator, similar to [9], is used to estimate the state of the nominal plant at time step $k+1$ as $\hat{x}(k+1|k)$ . Based on the reception of $X_{k}$ , it estimates the nominal plant state

\hat{x}(k+1|k)=A\hat{x}(k|k)+B\hat{u}(k|k),

(25)

where

\hat{x}(k|k)=\gamma_{k}x_{\text{n}}(k)+(1-\gamma_{k})\hat{x}(k|k-1),

(26)

\hat{u}(k|k)=\gamma_{k}u_{\text{n}}(k)+(1-\gamma_{k})\mathbf{u}_{k}^{*}(0).

(27)

Since only $x_{\text{n}}(k)$ and $s_{k}$ are sent to the remote controller, the remote controller also needs to run a consistent actuator (24) to determine $u_{\text{n}}(k)$ . Furthermore, $q_{k}$ is updated as follows

q_{k+1}=\gamma_{k}k+(1-\gamma_{k})q_{k}

(28)

to keep track of which packet $X_{k}$ has been received last at the remote controller.

Unlike [9], we estimate the nominal plant state in the estimator rather than the plant state. This guarantees that $\hat{x}(k|k-1)\in\mathbb{X}_{\text{c}}$ , such that the constraints $\mathbf{x}(0)=\hat{x}(k|k-1)$ and $\mathbf{x}(0)\in\mathbb{X}_{\text{c}}$ in the optimization problem (19) will not lead to an infeasible optimization problem.
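The estimator update (25) to (28) can be sketched for a scalar plant as follows. The class and argument names are hypothetical, and the reconstruction of $u_{\text{n}}(k)$ via the remote copy of the consistent actuator is assumed to happen outside this class.

```python
class RemoteEstimator:
    """Scalar sketch of the estimator (25)-(28)."""

    def __init__(self, a, b, x_hat0):
        self.a, self.b = a, b
        self.x_pred = x_hat0   # x_hat(k|k-1)
        self.q = -1            # q_k: time of the last received plant packet

    def update(self, k, received, x_n, u_n, u_planned):
        """Return x_hat(k+1|k).

        received  -- gamma_k, whether X_k arrived
        x_n, u_n  -- nominal state from X_k and input rebuilt via (24);
                     ignored when the packet was lost
        u_planned -- first input u*_k(0) of the MPC solution at time k
        """
        x_filt = x_n if received else self.x_pred        # (26)
        u_filt = u_n if received else u_planned          # (27)
        if received:
            self.q = k                                   # (28), becomes q_{k+1}
        self.x_pred = self.a * x_filt + self.b * u_filt  # (25)
        return self.x_pred


est = RemoteEstimator(a=1.2, b=1.0, x_hat0=0.0)
first = est.update(0, True, x_n=0.1, u_n=0.3, u_planned=0.9)     # X_0 received
second = est.update(1, False, x_n=None, u_n=None, u_planned=0.2)  # X_1 lost
print(first, second, est.q)
```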

IV-E Theoretical Guarantees

In this section, we provide theoretical guarantees for our proposed MPC. The key insight for our theoretical guarantees is that the closed-loop system involving the MPC in Figure 2 acts on the nominal plant and not the plant itself. This means that inside this closed-loop system there is no disturbance, such that it represents the disturbance-free system assumed in [9]. Hence, the theoretical guarantees of [9] will hold for the closed-loop system involving the MPC in our proposed approach given Assumption 5 below.

Assumption 5.

In addition to Assumptions 1 – 4, the following conditions hold:

1. $Q$ , $R$ , and $T$ are positive definite.
2. The system $(Q^{\frac{1}{2}},A)$ is observable.
3. The gains $K$ and $\bar{K}$ are such that $\rho(A-BK)<1$ and $\rho(A-B\bar{K})<1$ , respectively.
4. The matrix $P$ satisfies $P=(A-B\bar{K})^{\top}P(A-B\bar{K})+Q+\bar{K}^{\top}R\bar{K}$ .

We begin by showing that the plant state is in a bounded neighbourhood around the estimated state if $\Theta_{k}=1$ .

Proposition 1.

If $\Theta_{k}=1$ , then $x(k)\in\{\hat{x}(k|k-1)\}\oplus\mathbb{Z}_{K}$ .

Proof.

Since the closed-loop system of our proposed approach acts on the disturbance-free nominal plant (see Figure 2), we can use Proposition 1 of [9] to show that if $\Theta_{k}=1$ then $\hat{x}(k|k-1)=x_{\text{n}}(k)$ . Due to the ancillary controller, we know that $x(k)\in\{x_{\text{n}}(k)\}\oplus\mathbb{Z}_{K}$ holds. ∎

This shows that when the estimate is consistent with the nominal plant state, i.e. $\Theta_{k}=1$ , then we know that the plant state is in a tube around the estimated state.

Next, we show recursive feasibility of our proposed remote MPC and that the plant will always evolve in the constraints regardless of the network quality.

Proposition 2.

Let Assumption 5 hold, and assume there exists a $k_{0}$ such that $\gamma_{k_{0}-1}=1$ , $\theta_{k_{0}}=1$ , ${x(k_{0})-x_{\text{n}}(k_{0})\in\mathbb{Z}_{K}}$ , and the optimization problem (19) is feasible at time step $k_{0}$ . If the consistent actuator (24) and the ancillary controller (7) are used, then the optimization problem (19) remains feasible, and $x(k)\in\mathbb{X}$ and $u(k)\in\mathbb{U}$ , for all $k\geq k_{0}$ .

Proof.

Given the conditions above, Proposition 2 of [9] shows us that optimization problem (19) is feasible and $x_{\text{n}}(k)\in\mathbb{X}_{\text{c}}$ and $u_{\text{n}}(k)\in\mathbb{U}_{\text{c}}$ for all $k\geq k_{0}$ . The constraint satisfaction of $x(k)\in\mathbb{X}$ and $u(k)\in\mathbb{U}$ is guaranteed since the ancillary controller (7) guarantees that $x(k)\in\{x_{\text{n}}(k)\}\oplus\mathbb{Z}_{K}\subseteq\mathbb{X}$ and $u(k)\in\{u_{\text{n}}(k)\}\oplus(-K)\mathbb{Z}_{K}\subseteq\mathbb{U}$ . ∎

Note that the feasibility of the MPC does not depend on the value of $x_{r}$ , such that for all reference values our solution is recursively feasible according to Proposition 2.

Finally, the following theorem states the tracking capabilities of our approach given a constant reference $x_{r}$ .

Theorem 1.

Let Assumption 5 hold and $[x_{r}^{\top},\ \tilde{u}^{\top}]^{\top}$ fulfil the steady-state equation (12). If the consistent actuator (24) and the ancillary controller (7) are used, then almost surely $\lim_{k\rightarrow\infty}x(k)\in\{\tilde{x}_{r}\}\oplus\mathbb{Z}_{K}$ , where $\tilde{x}_{r}=x_{r}$ if $x_{r}\in\lambda\mathbb{X}_{\text{c}}$ and $\tilde{u}\in\lambda\mathbb{U}_{\text{c}}$ , and $\tilde{x}_{r}=\arg\min_{x\in\lambda\mathbb{X}_{\text{c}}}\|x-x_{r}\|_{T}^{2}$ otherwise.

Proof.

From Proposition 3 of [9] we obtain that $\lim_{k\rightarrow\infty}x_{\text{n}}(k)=\tilde{x}_{r}$ almost surely, while Theorem 1 of [10] states that $\tilde{x}_{r}=\arg\min_{x\in\lambda\mathbb{X}_{\text{c}}}\|x-x_{r}\|_{T}^{2}$ such that $\tilde{x}_{r}=x_{r}$ if $x_{r}\in\lambda\mathbb{X}_{\text{c}}$ and $\tilde{u}\in\lambda\mathbb{U}_{\text{c}}$ . The ancillary controller guarantees that $x(k)\in\{x_{\text{n}}(k)\}\oplus\mathbb{Z}_{K}$ for all $k$ , so that $\lim_{k\rightarrow\infty}x(k)\in\{\tilde{x}_{r}\}\oplus\mathbb{Z}_{K}$ almost surely. ∎

Corollary 1.

Theorem 1 and Proposition 2 show us that by choosing $f(\cdot)$ and $g(\cdot)$ as in our approach, we have solved Problem 1 for constant references.

IV-F Extension to include state feedback

While our proposed approach does not require feedback from $x(k)$ , it is common to send the state also to the remote controller, for example, for anomaly detection purposes. Therefore, we will now propose an extension to our approach, which includes state feedback, while inheriting the theoretical guarantees of our previously described approach.

To include the state, we change the content of the plant packet (23) as follows

X_{k}=\{x(k),x_{\text{n}}(k),s_{k}\}.

(29)

With the new packet (29), the estimator in (25) uses

\hat{x}(k|k)=\gamma_{k}x(k)+(1-\gamma_{k})\mathbf{x}_{k}^{*}(0),

(30)

\hat{u}(k|k)=\gamma_{k}u(k)+(1-\gamma_{k})\mathbf{u}_{k}^{*}(0).

(31)

Note that we use the state $x(k)$ and control input $u(k)$ when $\gamma_{k}=1$ , where $u(k)$ can be calculated according to (7). This leads to $x(k+1)\in\{\hat{x}(k+1|k)\}\oplus\mathbb{W}$ , which gives us a better estimate than with the estimator of Section IV-D, where ${x(k+1)\in\{\hat{x}(k+1|k)\}\oplus\mathbb{Z}_{K}}$ . Otherwise, the estimator will use the last optimal trajectory of the MPC to estimate the next state, which gives us again an estimate of the nominal plant. However, this new estimate does not guarantee that $\hat{x}(k+1|k)\in\mathbb{X}_{\text{c}}$ when $\gamma_{k}=1$ , which requires us to change the constraint (19d) in our MPC described in Section IV-A to guarantee feasibility. Thus, we replace constraint (19d) with

$\displaystyle\{\hat{x}(k|k-1)\}\oplus\mathbb{W}\subseteq\{\mathbf{x}_{k}(0)\}\oplus\mathbb{Z}_{K},$ (32)

when $\gamma_{k-1}=1$, and otherwise we keep (19d). Hence, the MPC algorithm is now made aware of whether packets have been received. Furthermore, the constraint (32) allows the MPC to reset the nominal state trajectory, since it is no longer necessarily true that $\mathbf{x}_{k}(0)=\hat{x}(k|k-1)$, as is the case for (19d). This can improve the convergence, as discussed in Chapter 3.5 of [5].
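To illustrate why (32) is a linear constraint in $\mathbf{x}_{k}(0)$, the following Python sketch checks the set inclusion for the special case where $\mathbb{W}$ is an axis-aligned box $\{w:|w_{i}|\leq\bar{w}_{i}\}$ and $\mathbb{Z}_{K}$ is a polytope $\{z:Hz\leq h\}$. Both representations, as well as the names `constraint_32_holds`, `w_bar`, `H`, and `h`, are assumptions made for this example. Under them, the inclusion holds if and only if $H(\hat{x}-\mathbf{x}_{k}(0))+|H|\bar{w}\leq h$ row by row.

```python
import numpy as np

def constraint_32_holds(x_hat, x0, w_bar, H, h, tol=1e-9):
    """Check {x_hat} (+) W  subset of  {x0} (+) Z.

    Assumes W = {w : |w_i| <= w_bar_i} (box) and Z = {z : H z <= h} (polytope).
    """
    x_hat = np.asarray(x_hat, dtype=float)
    x0 = np.asarray(x0, dtype=float)
    w_bar = np.asarray(w_bar, dtype=float)
    H = np.asarray(H, dtype=float)
    h = np.asarray(h, dtype=float)
    # Support function of the box W along each row of H: max_w H_i w = |H_i| w_bar.
    support_w = np.abs(H) @ w_bar
    return bool(np.all(H @ (x_hat - x0) + support_w <= h + tol))
```

Because the left-hand side is affine in `x0`, the same expression can be handed to a QP solver as a set of linear inequality constraints on the initial nominal state.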

Since the MPC can change the optimal trajectory of the nominal plant, we need to update the trajectory on the nominal plant if a consistent packet has been received. This is done by changing the controller packet (20) to

$\displaystyle U_{k}=\{\mathbf{u}_{k}^{*},\bar{u}_{k}^{*}+\bar{K}\bar{x}_{k}^{*},\mathbf{x}_{k}^{*}(0),q_{k}\},$ (33)

and setting $x_{\text{n}}(k)=\mathbf{x}_{k}^{*}(0)$ if $\Theta_{k}=1$.

Proposition 3.

Let Assumption 5 hold, and assume there exists a $k_{0}$ such that $\gamma_{k_{0}-1}=1$, $\theta_{k_{0}}=1$, ${x(k_{0})-x_{\text{n}}(k_{0})\in\mathbb{Z}_{K}}$, and that the optimization problem (19) is feasible with the new constraint (32). If the consistent actuator (24) with the nominal state update and the ancillary controller (7) are used, the optimization problem (19) with the new constraint (32) is feasible, and $x(k)\in\mathbb{X}$ and $u(k)\in\mathbb{U}$ for all $k\geq k_{0}$.

Proof.

If $\gamma_{k}=0$, the problem is feasible, since the nominal state is used in the estimator. If $\gamma_{k}=1$, we can show that

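$\displaystyle\hat{x}(k|k-1)\in\{x_{\text{n}}(k)\}\oplus(A-BK)\mathbb{Z}_{K}$ (34)

holds. Since $\mathbb{Z}_{K}$ is a robust positively invariant set for the error dynamics, i.e., $(A-BK)\mathbb{Z}_{K}\oplus\mathbb{W}\subseteq\mathbb{Z}_{K}$, this leads to

$\displaystyle x(k)\in\{\hat{x}(k|k-1)\}\oplus\mathbb{W}\subseteq\{x_{\text{n}}(k)\}\oplus\mathbb{Z}_{K}.$ (35)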

Hence, the constraints $\{\hat{x}(k|k-1)\}\oplus\mathbb{W}\subseteq\{\mathbf{x}_{k}(0)\}\oplus\mathbb{Z}_{K}$ and $\mathbf{x}_{k}(0)\in\mathbb{X}_{\text{c}}$ are feasible with the choice of ${\mathbf{x}_{k}(0)=x_{\text{n}}(k)}$. The optimal solution of our original MPC (19) is therefore a feasible solution of the extended MPC with constraint (32). Thus, the extended MPC with state feedback is recursively feasible for all $k\geq k_{0}$, since the original MPC is recursively feasible as shown in Proposition 2. Since $\mathbf{x}_{k}^{*}(0)\in\mathbb{X}_{\text{c}}$, the nominal state update, when $\Theta_{k}=1$, will not change the guarantees given by the ancillary controller, such that $x(k)\in\mathbb{X}$ and $u(k)\in\mathbb{U}$ for all $k\geq k_{0}$. ∎

Corollary 2.

The tracking guarantees of Theorem 1 hold for the extended MPC with state feedback as well.

Proof.

Since Proposition 3 shows that the solution of the original MPC is a feasible solution of the extended MPC, we can deduce that the tracking guarantees of the original MPC also hold for the extended MPC. ∎

In summary, this extension includes state feedback from the plant, which allows the MPC to change the optimal trajectory of the nominal plant and thereby improve performance, while retaining the theoretical guarantees of the previous approach. However, this approach requires more bandwidth and might change the execution times of the MPC.

V Numerical Examples

To demonstrate the efficacy of our proposed approach, henceforth called RT-MPC and ERT-MPC for the extended version with state feedback (see Section IV-F), we use it to track a position reference of a cartpole system, where the pole is in the upright unstable configuration. We compare our approach with the approach of [9], subsequently called R-MPC. Scripts to reproduce the results presented are included in our open-source code.

In order to design our nominal plant, we linearize the nonlinear dynamics around the unstable equilibrium point, where the pole is pointing up. The resulting continuous-time matrices are defined as follows for the state $x=\begin{bmatrix}p&\dot{p}&\phi&\dot{\phi}\end{bmatrix}^{\top}$:

A_{c}=\begin{bmatrix}0&1&0&0\\ 0&\frac{-(I+ml^{2})b}{r}&\frac{-m^{2}gl^{2}}{r}&0\\ 0&0&0&1\\ 0&\frac{-(mlb)}{r}&\frac{mgl(M+m)}{r}&0\end{bmatrix}\;B_{c}=\begin{bmatrix}0\\ \frac{I+ml^{2}}{r}\\ 0\\ \frac{-ml}{r}\end{bmatrix},

where $p$ is the position of the cart, $\phi$ the angle of the pole, and ${r=I(M+m)+Mml^{2}}$, with the remaining parameters and their values defined in Table I. The system is then discretized with a zero-order hold and a sampling time of $T_{s}=20\,\mathrm{ms}$ in order to obtain (6). The controllers $K$ and $\bar{K}$ are designed as discrete LQR controllers with cost matrices ${Q=\mathrm{diag}(100,10,100,10)}$ and $R=0.1$. Furthermore, we choose $|p|\leq 5\,\mathrm{m}$, $|\dot{p}|\leq 5\,\mathrm{m/s}$, $|\phi|\leq 0.3\,\mathrm{rad}$, $|\dot{\phi}|\leq 2\,\mathrm{rad/s}$, and $|u|\leq 10\,\mathrm{N}$ to define $\mathbb{X}$ and $\mathbb{U}$. The constraints on $\phi$ and $\dot{\phi}$ guarantee that the LQR controller stabilizes the system. Finally, we choose $N=20$ as the horizon for the MPC.
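The design steps above can be reproduced with NumPy and SciPy as in the following sketch. It uses the parameter values of Table I and the matrices $A_{c}$ and $B_{c}$ given above; it is not the script from the accompanying repository, and the gain convention $u=-Kx$ (so that the closed loop is $A-BK$) is an assumption chosen to match the notation of this paper.

```python
import numpy as np
from scipy.linalg import solve_discrete_are
from scipy.signal import cont2discrete

# Parameters from Table I.
I, l, m, M, b, g, Ts = 0.001, 0.5, 0.1, 1.0, 0.0, 9.8, 0.02
r = I * (M + m) + M * m * l**2

# Continuous-time linearization around the upright equilibrium.
A_c = np.array([
    [0.0, 1.0, 0.0, 0.0],
    [0.0, -(I + m * l**2) * b / r, -(m**2) * g * l**2 / r, 0.0],
    [0.0, 0.0, 0.0, 1.0],
    [0.0, -(m * l * b) / r, m * g * l * (M + m) / r, 0.0],
])
B_c = np.array([[0.0], [(I + m * l**2) / r], [0.0], [-m * l / r]])

# Zero-order-hold discretization with sampling time Ts.
A, B, _, _, _ = cont2discrete(
    (A_c, B_c, np.eye(4), np.zeros((4, 1))), Ts, method="zoh"
)

# Discrete-time LQR gain from the solution of the Riccati equation.
Q = np.diag([100.0, 10.0, 100.0, 10.0])
R = np.array([[0.1]])
P = solve_discrete_are(A, B, Q, R)
K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)

# The closed loop A - B K should have all eigenvalues inside the unit circle.
spectral_radius = max(abs(np.linalg.eigvals(A - B @ K)))
```

Checking `spectral_radius < 1` is a quick sanity test that the linearized closed loop is stable before any tube or MPC computation is attempted.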

V-A Disturbance set $\mathbb{W}$

The linearized model will inherently differ from the nonlinear one, and this model error is represented as the disturbance $w(k)$. To estimate the set $\mathbb{W}$, we run several simulations with randomly chosen initial conditions, and let the LQR controller bring the system back to the origin. The disturbance is then estimated as the difference between the actual state and the linear model, i.e., $w(k)=x(k+1)-(A-BK)x(k)$. This results in the following bounds for the disturbance of the position $|w_{p}|\leq 0.0001\,\mathrm{m}$, velocity $|w_{\dot{p}}|\leq 0.0027\,\mathrm{m/s}$, angle $|w_{\phi}|\leq 0.0003\,\mathrm{rad}$, and angular velocity $|w_{\dot{\phi}}|\leq 0.043\,\mathrm{rad/s}$. To approximate $\mathbb{Z}_{K}$ we use a method described in [15].
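The bound estimation described above amounts to computing one-step residuals along a recorded trajectory and taking their componentwise maximum. A minimal sketch follows; the function name `estimate_disturbance_bounds` and the trajectory layout (one state per row) are assumptions for this example.

```python
import numpy as np

def estimate_disturbance_bounds(X, A_cl):
    """Componentwise bound on w(k) = x(k+1) - A_cl x(k).

    X    : array of shape (T+1, n), closed-loop states of one simulation run
    A_cl : closed-loop matrix A - B K of the linear model
    """
    X = np.asarray(X, dtype=float)
    A_cl = np.asarray(A_cl, dtype=float)
    # Row k of W holds the residual between the actual and the predicted next state.
    W = X[1:] - X[:-1] @ A_cl.T
    return np.max(np.abs(W), axis=0)
```

For several runs with different initial conditions, the overall bound would be the componentwise maximum of the per-run results, for example `np.max([estimate_disturbance_bounds(X, A_cl) for X in runs], axis=0)`.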

TABLE I: Parameters used in the numerical examples.

Parameter	Definition	Value
$I$	Pendulum’s inertia	$0.001\,\mathrm{kg\cdot m^{2}}$
$l$	Length to pendulum center of mass	$0.5\,\mathrm{m}$
$m$	Pendulum’s mass	$0.1\,\mathrm{kg}$
$M$	Cart’s mass	$1.0\,\mathrm{kg}$
$b$	Cart’s coefficient of friction	$0\,\mathrm{N/m/s}$
$g$	Gravity acceleration	$9.8\,\mathrm{m/s^{2}}$
$T_{s}$	Sampling time	$0.02\,\mathrm{s}$

V-B Reference Tracking

Next, we present results for the tracking of a constant reference in position $p$. To do so, the cartpole system is always initialized at the origin, and the reference is set to $r(k)=\begin{bmatrix}0.5&0&0&0\end{bmatrix}^{\top}$. To evaluate the performance, we use the average tracking error $\frac{1}{T+1}\sum_{k=0}^{T}\|x(k)-r(k)\|_{2}$. For the lossy network, we assume a constant packet loss probability of $\varrho$ and investigate $\varrho\in\{0,0.1,\ldots,0.9\}$. In addition, we perform $20$ simulations for each value of $\varrho$ and record the average tracking error to gain better insight into different realizations of the lossy network.
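The evaluation metric and the loss model can be written down compactly. The sketch below assumes that packet losses are independent Bernoulli events with probability $\varrho$ at every time step, which is one possible reading of a constant packet loss probability; the function names are ours.

```python
import numpy as np

def average_tracking_error(X, r):
    """Mean of ||x(k) - r||_2 over k = 0, ..., T for a constant reference r.

    X : array of shape (T+1, n) with one state per row.
    """
    X = np.asarray(X, dtype=float)
    r = np.asarray(r, dtype=float)
    return float(np.mean(np.linalg.norm(X - r, axis=1)))

def sample_packet_arrivals(rho, T, rng):
    """Arrival indicators (1 = received, 0 = lost) for T time steps.

    Assumes i.i.d. losses with probability rho.
    """
    return (rng.random(T) >= rho).astype(int)
```

A call such as `sample_packet_arrivals(0.4, 1000, np.random.default_rng(0))` would give one realization of the network for a single simulation run.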

V-B1 Linear Plant

We begin with a comparison in which the plant is simulated with linear dynamics and the disturbance $w(k)$ is sampled uniformly from the set $\mathbb{W}$ at each time step. The results are presented as a box plot in Figure 3(a).

First, note that both RT-MPC and ERT-MPC outperform R-MPC for every packet loss probability investigated. Second, for a packet loss probability of $\varrho=0.9$, the average tracking error decreases. The reason is that, with such a high packet loss, the cartpole moves less aggressively than when fewer packets are lost. This leads to a smaller tracking error for the velocity, angle, and angular velocity, since their reference values are zero, which lowers the overall tracking error. Third, during our simulations, we encountered infeasibility issues for R-MPC. While [9] proves recursive feasibility for the plant without a disturbance, our simulations show that infeasibility can occur once a disturbance is present. Hence, modelling errors can result in infeasible MPC problems for R-MPC, which we will encounter again when the nonlinear plant is used. Comparing RT-MPC and ERT-MPC, we observe that the performance of ERT-MPC seems almost constant, while the tracking error for RT-MPC increases with the packet loss probability. The ability to reset the nominal trajectory is likely the reason for the constant performance of ERT-MPC.

V-B2 Nonlinear Plant

Next, we compare the controllers on the nonlinear cartpole simulated using PyBullet. To do so, the physics simulator runs at a higher frequency than the controllers ($500\,\mathrm{Hz}$, to be precise), and a zero-order hold keeps the control input constant between controller updates.
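The multi-rate structure described above can be sketched independently of the physics engine. In the following Python example, `step` and `controller` are hypothetical callables standing in for the simulator and the control law; no PyBullet calls are shown, and the loop is only an illustration of the zero-order hold.

```python
def simulate_with_zoh(step, controller, x0, n_control_steps, sim_hz=500, Ts=0.02):
    """Run a simulator at sim_hz while the controller updates every Ts seconds.

    step(x, u, dt) -> next state   (hypothetical simulator interface)
    controller(x)  -> input u      (hypothetical control law)
    """
    substeps = round(sim_hz * Ts)  # 10 physics steps per control update here
    dt = 1.0 / sim_hz
    x = x0
    for _ in range(n_control_steps):
        u = controller(x)          # computed once per sampling period
        for _ in range(substeps):
            x = step(x, u, dt)     # input held constant in between
    return x
```

With a simulator rate of $500\,\mathrm{Hz}$ and $T_{s}=20\,\mathrm{ms}$, each control input is held for ten physics steps.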

Figure 3(b) shows the box plots of the average tracking error for the different packet loss probabilities. We observed that R-MPC struggled with infeasibility issues; notably, for $\varrho\in\{0,0.1,0.2,0.3,0.4\}$ R-MPC is always infeasible in our simulations, and the larger $\varrho$, the fewer infeasible problems were encountered. Therefore, the corresponding box plots only present the results of runs without an infeasible MPC problem. Our approach, on the other hand, is recursively feasible for all simulations performed.

The decrease in infeasibility issues for R-MPC with increasing packet loss was a surprising result, since one would expect the opposite. Our intuition is that the LQR controller $\bar{K}$ used as the steady-state controller is able to handle the nonlinearities of the system better than R-MPC, since it uses direct state feedback, while R-MPC estimates the next state based on the currently received state. Hence, the more packet loss there is, the more often the steady-state controller is used, which brings the plant to a state that R-MPC can actually handle well. Our approach, on the other hand, uses the LQR controller both as the steady-state controller in the MPC and as the tracking controller to track the nominal plant state and, in addition, tightens the constraint set of the MPC by taking the propagation of the modelling error into account. This can be observed in Figure 3(c), where we present one trajectory of the position and angle at a packet loss of $\varrho=0.4$, and the star marks when the infeasibility occurred in R-MPC. R-MPC exhibits an oscillatory behaviour before it becomes infeasible, while RT-MPC has a smoother trajectory, which reaches the desired reference. By including actual state feedback in ERT-MPC, the trajectory becomes even smoother due to the ability to reset the nominal trajectory based on the state $x(k)$.

While our approaches have not shown any infeasibility issues, we noticed that the state is not always in a tube around the nominal state for ERT-MPC. These violations happened at the beginning of the simulation and then stopped. We believe that $\mathbb{W}$ does not capture the model differences well at the beginning of the reference tracking, which leads to these violations. We did not observe such violations for RT-MPC, probably because it is more conservative than ERT-MPC.

In general, we observe that our proposed solution outperforms R-MPC of [9] for all investigated values of $\varrho$. Interestingly, the tracking error seems to peak at $\varrho=0.7$ and then reduces again for R-MPC and RT-MPC, which is due to the same reason as in the linear case.

V-C Execution time of the MPC

Our simulations run on a Windows machine with 24 GB of RAM and a Ryzen 7 8-core CPU. From the 50000 executions of the MPC in Section V-B1, we removed the first execution time, since it represents the cold start of the optimization, and present the histogram of the remaining execution times in Figure 3(d). We observe that the majority of the execution times are below $20\,\mathrm{ms}$, which shows that our MPC can run in real time for the sampling time of $20\,\mathrm{ms}$. Further, the median and the $95\,\%$ quantile of the execution time for RT-MPC were $5.00\,\mathrm{ms}$ and $7.85\,\mathrm{ms}$, respectively. The median and the $95\,\%$ quantile of the execution time for ERT-MPC were $6.21\,\mathrm{ms}$ and $7.06\,\mathrm{ms}$, respectively. The histogram for ERT-MPC has two peaks because it solves two different MPC problems depending on whether a measurement was received or not. While real-time execution is not enforced in our simulations, an optimization problem that is not solved in time can be interpreted as a lost packet in a real scenario. Hence, our approach can deal with overly long execution times of the MPC as well.
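The statistics reported above can be computed from a list of recorded solve times as in the following sketch. The function name and the returned keys are assumptions; the fraction of deadline misses is included to reflect the interpretation of a late solution as a lost packet.

```python
import numpy as np

def execution_time_summary(times_ms, deadline_ms=20.0):
    """Median, 95 % quantile, and deadline-miss fraction of solve times.

    The first entry is dropped because it corresponds to the cold start.
    """
    t = np.asarray(times_ms, dtype=float)[1:]
    return {
        "median_ms": float(np.median(t)),
        "q95_ms": float(np.quantile(t, 0.95)),
        # A solve that exceeds the sampling time would count as a lost packet.
        "missed_fraction": float(np.mean(t > deadline_ms)),
    }
```

In a deployment, `missed_fraction` would add to the network's packet loss probability as seen by the plant.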

VI Conclusions

We presented a novel framework that addresses the problem of controlling systems over lossy network connections. More precisely, we propose a robust tube-based MPC algorithm that allows for the tracking of a piecewise-constant reference signal with guaranteed convergence properties for constant references, recursive feasibility, and safety and input constraint satisfaction. Further, we presented numerical simulation results of the approach applied to a cartpole system, together with comparisons with state-of-the-art algorithms. Lastly, our code is available as open-source.

For future work, we would like to investigate time-varying trajectories and the reasons for the peak of the reference tracking error around a packet loss probability of $80\%$.

References

[1] 5G-ACIA, “Key 5G Use Cases and Requirements,” Frankfurt am Main, Germany, Tech. Rep., May 2020.
[2] 5G-SMART, “5G-SMART Final Report,” Stockholm, Sweden, Tech. Rep. D7.4, 2022.
[3] A. Baxi, M. Eisen, S. Sudhakaran, F. Oboril, G. S. Murthy, V. S. Mageshkumar, M. Paulitsch, and M. Huang, “Towards factory-scale edge robotic systems: Challenges and research directions,” Internet of Things Magazine, vol. 5, no. 3, pp. 26–31, 2022.
[4] P. Park, S. Coleri Ergen, C. Fischione, C. Lu, and K. H. Johansson, “Wireless network design for control systems: A survey,” IEEE Communications Surveys & Tutorials, vol. 20, no. 2, pp. 978–1013, 2018.
[5] J. B. Rawlings, D. Q. Mayne, and M. Diehl, Model Predictive Control: Theory, Computation, and Design. Nob Hill Publishing Madison, WI, 2022, vol. 2.
[6] S. Wildhagen, M. Pezzutto, L. Schenato, and F. Allgöwer, “Self-triggered MPC robust to bounded packet loss via a min-max approach,” in 2022 IEEE 61st Conference on Decision and Control (CDC), 2022, pp. 7670–7675.
[7] G. Pin and T. Parisini, “Networked predictive control of uncertain constrained nonlinear systems: Recursive feasibility and input-to-state stability analysis,” IEEE Transactions on Automatic Control, vol. 56, no. 1, pp. 72–87, 2011.
[8] P. K. Mishra, S. S. Diwale, C. N. Jones, and D. Chatterjee, “Reference tracking stochastic model predictive control over unreliable channels and bounded control actions,” Automatica, vol. 127, p. 109512, 2021.
[9] M. Pezzutto, M. Farina, R. Carli, and L. Schenato, “Remote MPC for tracking over lossy networks,” IEEE Control Systems Letters, vol. 6, pp. 1040–1045, 2022.
[10] D. Limon, I. Alvarado, T. Alamo, and E. Camacho, “Robust tube-based MPC for tracking of constrained linear systems with additive disturbances,” Journal of Process Control, vol. 20, no. 3, pp. 248–260, 2010.
[11] P. Roque, W. S. Cortez, L. Lindemann, and D. V. Dimarogonas, “Corridor MPC: Towards optimal and safe trajectory tracking,” in 2022 American Control Conference (ACC), 2022, pp. 2025–2032.
[12] D. Mayne, M. Seron, and S. Raković, “Robust model predictive control of constrained linear systems with bounded disturbances,” Automatica, vol. 41, no. 2, pp. 219–224, 2005.
[13] I. Kolmanovsky and E. G. Gilbert, “Theory and computation of disturbance invariant sets for discrete-time linear systems,” Mathematical problems in engineering, vol. 4, pp. 317–367, 1998.
[14] S. Rakovic, E. Kerrigan, K. Kouramas, and D. Mayne, “Invariant approximations of the minimal robust positively invariant set,” IEEE Transactions on Automatic Control, vol. 50, no. 3, pp. 406–410, 2005.
[15] M. S. Darup and D. Teichrib, “Efficient computation of RPI sets for tube-based robust MPC,” in 2019 18th European Control Conference (ECC), 2019, pp. 325–330.
