Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Abstract

In this article, we introduce an adaptive online model update algorithm designed for predictive control applications in networked systems, particularly focusing on power distribution systems. Unlike traditional methods that depend on historical data for offline model identification, our approach utilizes real-time data for continuous model updates. This method integrates seamlessly with existing online control and optimization algorithms and provides timely updates in response to real-time changes. This methodology offers significant advantages, including a reduction in the communication network bandwidth requirements by minimizing the data exchanged at each iteration and enabling the model to adapt after disturbances. Furthermore, our algorithm is tailored for non-linear convex models, enhancing its applicability to practical scenarios. The efficacy of the proposed method is validated through a numerical study, demonstrating improved control performance using a synthetic IEEE test case.
keywords: Model-identification, data-driven model predictive control, distributed optimization, online optimization, power grid, networked systems.

Adaptive Online Model Update Algorithm for Predictive Control in Networked Systems Vivek Khatana, Chin-Yao Chang, Wenbo Wang Vivek Khatana is with the Department of Electrical and Computer Engineering, University of Minnesota, Minneapolis, USA (Email: {khata010}@umn.edu). Chin-Yao Chang and Wenbo Wang are with the National Renewable Energy Laboratory, Golden, CO 80401, USA (Email: {chinyao.chang, wenbo.wang}@nrel.gov). This work was authored in part by NREL, operated by Alliance for Sustainable Energy, LLC, for the U.S. Department of Energy (DOE) under Contract No. DE-AC36-08GO28308. Funding provided by DOE Office of Electricity, Advanced Grid Modeling Program, through agreement NO. 33652. The views expressed in the article do not necessarily represent the views of the DOE or the U.S. Government. The U.S. Government retains and the publisher, by accepting the article for publication, acknowledges that the U.S. Government retains a nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this work or allow others to do so, for the U.S. Government purposes.

I Introduction

System models are essential for understanding and controlling complex systems, as they enable accurate predictions, analysis, and optimization of resources. The mathematical models abstract complex, real-world phenomena into manageable representations. With the increase in data availability, scientists and practitioners favor adaptive reconfiguration of the model representations to conform to the latest data measurements. In this article, we focus on identifying the sub-system models capturing the global behavior of a networked system to solve the following predictive control problem

minimizeu(1),u(2),,u(T)𝐝𝐮subscriptminimizeu1u2u𝑇superscriptsubscript𝐝𝐮\displaystyle\operatorname*{minimize}_{\text{u}(1),\text{u}(2),\dots,\text{u}(% T)\in\mathbb{R}^{\mathbf{d_{u}}}}roman_minimize start_POSTSUBSCRIPT u ( 1 ) , u ( 2 ) , … , u ( italic_T ) ∈ blackboard_R start_POSTSUPERSCRIPT bold_d start_POSTSUBSCRIPT bold_u end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT k=1Ti=1Ni(y(k),k)+hi(ui(k),k)superscriptsubscript𝑘1𝑇superscriptsubscript𝑖1𝑁subscript𝑖𝑦𝑘𝑘subscript𝑖subscriptu𝑖𝑘𝑘\displaystyle\sum_{k=1}^{T}\sum_{i=1}^{N}\ell_{i}(y(k),k)+h_{i}(\text{u}_{i}(k% ),k)∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_y ( italic_k ) , italic_k ) + italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_k ) , italic_k ) (1)
subject toui(k)subject tosubscriptu𝑖𝑘\displaystyle\mbox{subject to}\ \text{u}_{i}(k)subject to u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_k ) 𝒰i,for alli=1,2,,N,for allkformulae-sequenceabsentsubscript𝒰𝑖for all𝑖12𝑁for all𝑘\displaystyle\in\mathcal{U}_{i},\ \mbox{for all}\ i=1,2,\dots,N,\ \mbox{for % all}\ k∈ caligraphic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , for all italic_i = 1 , 2 , … , italic_N , for all italic_k

where, u(k)=[u1(k),u2(k),,uN(k)]𝐝𝐮u𝑘subscriptu1𝑘subscriptu2𝑘subscriptu𝑁𝑘superscriptsubscript𝐝𝐮\text{u}(k)=[\text{u}_{1}(k),\text{u}_{2}(k),\dots,\text{u}_{N}(k)]\in\mathbb{% R}^{\mathbf{d_{u}}}u ( italic_k ) = [ u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_k ) , u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_k ) , … , u start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ( italic_k ) ] ∈ blackboard_R start_POSTSUPERSCRIPT bold_d start_POSTSUBSCRIPT bold_u end_POSTSUBSCRIPT end_POSTSUPERSCRIPT is the control decision at time k𝑘kitalic_k with constraint sets 𝒰i𝐝𝐮isubscript𝒰𝑖superscriptsubscriptsubscript𝐝𝐮𝑖\mathcal{U}_{i}\subseteq\mathbb{R}^{\mathbf{d_{u}}_{i}}caligraphic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⊆ blackboard_R start_POSTSUPERSCRIPT bold_d start_POSTSUBSCRIPT bold_u end_POSTSUBSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, i=1N𝐝𝐮i=𝐝𝐮superscriptsubscript𝑖1𝑁subscriptsubscript𝐝𝐮𝑖subscript𝐝𝐮\sum_{i=1}^{N}\mathbf{d_{u}}_{i}=\mathbf{d_{u}}∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT bold_d start_POSTSUBSCRIPT bold_u end_POSTSUBSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = bold_d start_POSTSUBSCRIPT bold_u end_POSTSUBSCRIPT, and y(k)𝐝𝐲𝑦𝑘superscriptsubscript𝐝𝐲y(k)\in\mathbb{R}^{\mathbf{d_{y}}}italic_y ( italic_k ) ∈ blackboard_R start_POSTSUPERSCRIPT bold_d start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT end_POSTSUPERSCRIPT is an observable that involves the physical or behavioral inter-dependencies among the sub-systems. Functions isubscript𝑖\ell_{i}roman_ℓ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and hisubscript𝑖h_{i}italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT in (1) capture the costs due to the output y(t)𝑦𝑡y(t)italic_y ( italic_t ) and the control inputs uisubscriptu𝑖\text{u}_{i}u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT respectively at the sub-system i𝑖iitalic_i. The predictive control problems (1) appear in the context of online optimal control, communication systems, and robotic networks [1, 2] to mention a few. More recently, the problem has also been of interest in the control and operation of power systems [3].

Suppose, the sub-system inter-dependencies are modeled via the parametric description between the observable y𝑦yitalic_y and the inputs u,

(θ):yθ(t)=ϕ(u(t),θ),:𝜃subscript𝑦𝜃𝑡italic-ϕu𝑡𝜃\displaystyle\mathcal{M}(\theta):y_{\theta}(t)=\phi(\text{u}(t),\theta),caligraphic_M ( italic_θ ) : italic_y start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ( italic_t ) = italic_ϕ ( u ( italic_t ) , italic_θ ) , (2)

where θ𝐝θ𝜃superscriptsubscript𝐝𝜃\theta\in\mathbb{R}^{\mathbf{d}_{\theta}}italic_θ ∈ blackboard_R start_POSTSUPERSCRIPT bold_d start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT end_POSTSUPERSCRIPT is the parameter of the map ϕ:𝐝𝐮×𝐝θ𝐝𝐲:italic-ϕsuperscriptsubscript𝐝𝐮superscriptsubscript𝐝𝜃superscriptsubscript𝐝𝐲\phi:\mathbb{R}^{\mathbf{d_{u}}}\times\mathbb{R}^{\mathbf{d}_{\theta}}\to% \mathbb{R}^{\mathbf{d_{y}}}italic_ϕ : blackboard_R start_POSTSUPERSCRIPT bold_d start_POSTSUBSCRIPT bold_u end_POSTSUBSCRIPT end_POSTSUPERSCRIPT × blackboard_R start_POSTSUPERSCRIPT bold_d start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT bold_d start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT end_POSTSUPERSCRIPT that captures the relation between the control inputs, u(t)u𝑡\text{u}(t)u ( italic_t ), and the (parameterized) observable or output of the networked system, yθ(t)subscript𝑦𝜃𝑡y_{\theta}(t)italic_y start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ( italic_t ), at time t𝑡titalic_t. It is assumed that there exists a vector θsuperscript𝜃\theta^{\star}italic_θ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT such that the true system output y(t)𝑦𝑡y(t)italic_y ( italic_t ) is given by y(t)=yθ(t)=ϕ(u(t),θ)𝑦𝑡subscript𝑦superscript𝜃𝑡italic-ϕu𝑡superscript𝜃y(t)=y_{\theta^{\star}}(t)=\phi(\text{u}(t),\theta^{\star})italic_y ( italic_t ) = italic_y start_POSTSUBSCRIPT italic_θ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_t ) = italic_ϕ ( u ( italic_t ) , italic_θ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ). Note that optimization (1) presumes the knowledge of the input-output map (2). The decisions u(k)u𝑘\text{u}(k)u ( italic_k ) are determined based on the postulated output yθ(k)subscript𝑦𝜃𝑘y_{\theta}(k)italic_y start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ( italic_k ) via a model of the form (2) and are sensitive to model mismatches. Under model imperfections, the generated control inputs might drive the network operation to an undesirable state.

Given (1) and (2), the problem addressed in the current article pertains to the development and analysis of an algorithm that enables the update of the parametric input-output map in an online manner based on current data measurements to incorporate real-time variations of the controlled system and generate optimal control inputs. The exact description of the class of parametric maps chosen is given in Section II.

I-A Literature Review

System identification [4] is a broad topic that spans multiple fields. Specialized methodologies, such as learning-based methods [5] and behavioral system theory for non-parametric models [6], can generally be viewed as system identification. As for the applications for power distribution systems, utilities typically maintain feeder models in distribution planning and geographic information system databases [7]. However, operational changes to the grid, such as upgrades and reconfigurations [8], as well as database errors [9], necessitate ongoing maintenance of these models and databases [10]. Voltage control and other operational controls [11, 12], if based on erroneous or outdated models and data, can adversely affect system stability and reliability. Model identification techniques can address these model consistency issues. Some approaches involving machine learning methods [13, 14] require centralized data collection, which raises data privacy concerns and necessitates communication infrastructures. A multi-agent-based distributed approach [15, 16] can mitigate these concerns. Building on this foundation, our previous work, [17], advanced the state-of-the-art in distributed identification methods that protect local data privacy for linear systems, albeit with some communication requirements.

Building on the merits of our prior research’s distributed and localized approach, the current article introduces several advancements. The main contributions are as follows:

  1. 1.

    We develop a distributed algorithm for online model identification in networked nonlinear systems to enable the observable estimate in problem (1) to align with the true output of the system.

  2. 2.

    We establish that the proposed online algorithm has a sublinear regret in identifying the true convex input-output map of the system.

  3. 3.

    The developed algorithm has several desirable properties:

    • it preserves the local input data privacy for every sub-system.

    • it requires only the latest measurements for updates, eliminating the need for storing historical data, and has substantially less communication bandwidth requirement.

We present a numerical simulation study to demonstrate the performance of the proposed algorithm. The predictive control problem (1) is instantiated as a voltage regulation problem in power systems. The numerical results corroborate the efficacy of the proposed algorithm in adaptively updating the input-output map of the test power system utilizing the latest measurements. The results establish that having access to an accurate nonlinear model provided by the proposed algorithm results in superior control performance compared to traditional linear models for power distribution systems, underscoring the practical value of our framework.
At this point, we emphasize that the current work is not related to the body of research on the reconstruction and identification of unknown topology of an interconnected system using time-series measurement data [18, 19, 20].
The rest of the paper is organized as follows: Section I-B introduces some key definitions and notations used throughout the article. Sections II and III delve into the distributed system model identification framework and provide the preliminary analysis to aid the development of the model identification algorithm. Section IV presents the proposed algorithm, its convergence analysis and the distributed implementation details. A predictive control problem with online model updates is presented in Section V. Section VI provides the simulation study and demonstrates the numerical results on the performance of the developed algorithm in solving the predictive control problem with online model updates introduced in Section V. The concluding remarks are provided in Section VII with some directions for future research.

I-B Definition and Notations

In this paper, we denote matrices in boldface. For a vector xn𝑥superscript𝑛x\in\mathbb{R}^{n}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, we denote its 2superscript2\ell^{2}roman_ℓ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT-norm and the norm induced by a matrix 𝐀0succeeds𝐀0\mathbf{A}\succ 0bold_A ≻ 0 by x2subscriptnorm𝑥2\|x\|_{2}∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and x𝐀subscriptnorm𝑥𝐀\|x\|_{\mathbf{A}}∥ italic_x ∥ start_POSTSUBSCRIPT bold_A end_POSTSUBSCRIPT, respectively. The vertical and horizontal concatenation of matrices 𝐀isuperscript𝐀𝑖\mathbf{A}^{i}bold_A start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT are denoted as [𝐀1;𝐀2;;𝐀n]Nm×nsuperscript𝐀1superscript𝐀2superscript𝐀𝑛superscript𝑁𝑚𝑛[\mathbf{A}^{1};\mathbf{A}^{2};\cdots;\mathbf{A}^{n}]\in\mathbb{R}^{Nm\times n}[ bold_A start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ; bold_A start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ; ⋯ ; bold_A start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ] ∈ blackboard_R start_POSTSUPERSCRIPT italic_N italic_m × italic_n end_POSTSUPERSCRIPT and [𝐀1,𝐀2,,𝐀n]m×Nnsuperscript𝐀1superscript𝐀2superscript𝐀𝑛superscript𝑚𝑁𝑛[\mathbf{A}^{1},\mathbf{A}^{2},\cdots,\mathbf{A}^{n}]\in\mathbb{R}^{m\times Nn}[ bold_A start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , bold_A start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , ⋯ , bold_A start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ] ∈ blackboard_R start_POSTSUPERSCRIPT italic_m × italic_N italic_n end_POSTSUPERSCRIPT. The Kronecker product of matrices 𝐀isuperscript𝐀𝑖\mathbf{A}^{i}bold_A start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT and 𝐀jsuperscript𝐀𝑗\mathbf{A}^{j}bold_A start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT is denoted as 𝐀i𝐀jtensor-productsuperscript𝐀𝑖superscript𝐀𝑗\mathbf{A}^{i}\otimes\mathbf{A}^{j}bold_A start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ⊗ bold_A start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT. For a matrix 𝐀m×n𝐀superscript𝑚𝑛\mathbf{A}\in\mathbb{R}^{m\times n}bold_A ∈ blackboard_R start_POSTSUPERSCRIPT italic_m × italic_n end_POSTSUPERSCRIPT, vec(𝐀)mnvec𝐀superscript𝑚𝑛\operatorname*{vec}(\mathbf{A})\in\mathbb{R}^{mn}roman_vec ( bold_A ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_m italic_n end_POSTSUPERSCRIPT is a column vector created by concatenating the column vectors of 𝐀𝐀\mathbf{A}bold_A from left to right. For a matrix 𝐀m×n𝐀superscript𝑚𝑛\mathbf{A}\in\mathbb{R}^{m\times n}bold_A ∈ blackboard_R start_POSTSUPERSCRIPT italic_m × italic_n end_POSTSUPERSCRIPT, null{𝐀}:={xn|𝐀x=0}assignnull𝐀conditional-set𝑥superscript𝑛𝐀𝑥0\operatorname{null}\{\mathbf{A}\}:=\{x\in\mathbb{R}^{n}|\mathbf{A}x=0\}roman_null { bold_A } := { italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT | bold_A italic_x = 0 } denotes the null space of matrix 𝐀𝐀\mathbf{A}bold_A. Given a set S𝑆Sitalic_S of vectors, the linear span of S𝑆Sitalic_S is defined as span{S}:={i=1Nvixi|vi,xiS}assignspan𝑆conditional-setsuperscriptsubscript𝑖1𝑁subscript𝑣𝑖subscript𝑥𝑖formulae-sequencesubscript𝑣𝑖subscript𝑥𝑖𝑆\operatorname{span}\{S\}:=\{\sum_{i=1}^{N}v_{i}x_{i}|v_{i}\in\mathbb{R},x_{i}% \in S\}roman_span { italic_S } := { ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R , italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ italic_S }. The scalar element of the ithsuperscript𝑖𝑡i^{th}italic_i start_POSTSUPERSCRIPT italic_t italic_h end_POSTSUPERSCRIPT row and jthsuperscript𝑗𝑡j^{th}italic_j start_POSTSUPERSCRIPT italic_t italic_h end_POSTSUPERSCRIPT column of 𝐀𝐀\mathbf{A}bold_A is denoted as 𝐀ijsuperscriptsubscript𝐀𝑖𝑗\mathbf{A}_{i}^{j}bold_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT and the jthsuperscript𝑗𝑡j^{th}italic_j start_POSTSUPERSCRIPT italic_t italic_h end_POSTSUPERSCRIPT row and column of the matrix 𝐀𝐀\mathbf{A}bold_A are denoted as 𝐀j,:subscript𝐀𝑗:\mathbf{A}_{j,:}bold_A start_POSTSUBSCRIPT italic_j , : end_POSTSUBSCRIPT and 𝐀:,jsubscript𝐀:𝑗\mathbf{A}_{:,j}bold_A start_POSTSUBSCRIPT : , italic_j end_POSTSUBSCRIPT, respectively. The identity matrix and vector with all entries equal to 1111 of dimension n𝑛nitalic_n are denoted as 𝐈nsubscript𝐈𝑛\mathbf{I}_{n}bold_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT and 1nsubscript1𝑛1_{n}1 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, respectively.

A graph 𝒢𝒢\mathcal{G}caligraphic_G is denoted by a pair (𝒱,)𝒱(\mathcal{V},\mathcal{E})( caligraphic_V , caligraphic_E ) where 𝒱𝒱\mathcal{V}caligraphic_V is a set of vertices (or nodes) and \mathcal{E}caligraphic_E is a set of edges, which are ordered subsets of two distinct elements of 𝒱𝒱\mathcal{V}caligraphic_V. If an edge from j𝒱𝑗𝒱j\in\mathcal{V}italic_j ∈ caligraphic_V to i𝒱𝑖𝒱i\in\mathcal{V}italic_i ∈ caligraphic_V exists then it is denoted as (i,j)𝑖𝑗(i,j)\in\mathcal{E}( italic_i , italic_j ) ∈ caligraphic_E. The set of neighboring sub-systems of node i𝒱𝑖𝒱i\in\mathcal{V}italic_i ∈ caligraphic_V is called the neighborhood of node i𝑖iitalic_i and is denoted by 𝒩i={j|(i,j)}subscript𝒩𝑖conditional-set𝑗𝑖𝑗\operatorname{\mathcal{N}}_{i}=\{j\ |\ (i,j)\in\mathcal{E}\}caligraphic_N start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = { italic_j | ( italic_i , italic_j ) ∈ caligraphic_E }. In the subsequent, we use the terms agents, nodes, and sub-systems interchangeably. A continuous function f:p:𝑓superscript𝑝f:\mathbb{R}^{p}\to\mathbb{R}italic_f : blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT → blackboard_R is called Lipschitz continuous with constant L>0𝐿0L>0italic_L > 0 if the following inequality holds: |f(x)f(y)|Lxy,x,ypformulae-sequence𝑓𝑥𝑓𝑦𝐿norm𝑥𝑦for-all𝑥𝑦superscript𝑝|f(x)-f(y)|\leq L\|x-y\|,\ \forall\ x,y\in\mathbb{R}^{p}| italic_f ( italic_x ) - italic_f ( italic_y ) | ≤ italic_L ∥ italic_x - italic_y ∥ , ∀ italic_x , italic_y ∈ blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT. Given a norm \|\cdot\|∥ ⋅ ∥ and a set Kp𝐾superscript𝑝K\subset\mathbb{R}^{p}italic_K ⊂ blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT, define the diameter of K𝐾Kitalic_K with respect to this norm as Diam(K):=supx,yKxyDiam_{\|\cdot\|}(K):=\sup_{x,y\in K}\|x-y\|italic_D italic_i italic_a italic_m start_POSTSUBSCRIPT ∥ ⋅ ∥ end_POSTSUBSCRIPT ( italic_K ) := roman_sup start_POSTSUBSCRIPT italic_x , italic_y ∈ italic_K end_POSTSUBSCRIPT ∥ italic_x - italic_y ∥. In the subsequent text the O(.)O(.)italic_O ( . ) and o(.)o(.)italic_o ( . ) operations denote the standard Big-O and Little-o notations respectively [21].

II Agent Based System Framework

We consider a networked system represented by a graph 𝒢(𝒱,)𝒢𝒱\mathcal{G}(\mathcal{V},\mathcal{E})caligraphic_G ( caligraphic_V , caligraphic_E ) consisting of |𝒱|:=Nassign𝒱𝑁|\mathcal{V}|:=N| caligraphic_V | := italic_N nodes (or sub-systems). Each node i𝑖iitalic_i has an actuator applying the control decision ui𝐝𝐮isubscriptu𝑖superscriptsubscriptsubscript𝐝𝐮𝑖\text{u}_{i}\in\mathbb{R}^{\mathbf{d_{u}}_{i}}u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT bold_d start_POSTSUBSCRIPT bold_u end_POSTSUBSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT. Assume that 𝐝𝐲subscript𝐝𝐲\mathbf{d_{y}}bold_d start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT number of sensors are deployed in the network 𝒢𝒢\mathcal{G}caligraphic_G. The measurements of all these sensors are sent to a fusion center that collects all the sensor outputs to create a measurement y^(t)𝐝𝐲^𝑦𝑡superscriptsubscript𝐝𝐲\widehat{y}(t)\in\mathbb{R}^{\mathbf{d_{y}}}over^ start_ARG italic_y end_ARG ( italic_t ) ∈ blackboard_R start_POSTSUPERSCRIPT bold_d start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT end_POSTSUPERSCRIPT of the true global observable y(t)𝑦𝑡y(t)italic_y ( italic_t ). Every sub-system i𝑖iitalic_i maintains a local estimate y^θii(t)subscriptsuperscript^𝑦𝑖subscript𝜃𝑖𝑡\widehat{y}^{i}_{\theta_{i}}(t)over^ start_ARG italic_y end_ARG start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_t ) of the global observable y(t)𝑦𝑡y(t)italic_y ( italic_t ) via a model of the kind in (2). In particular,

y^θii(t):=ϕi(ui(t),θi),for alli=1,2,,N,formulae-sequenceassignsubscriptsuperscript^𝑦𝑖subscript𝜃𝑖𝑡subscriptitalic-ϕ𝑖subscriptu𝑖𝑡subscript𝜃𝑖for all𝑖12𝑁\displaystyle\widehat{y}^{i}_{\theta_{i}}(t):=\phi_{i}(\text{u}_{i}(t),\theta_% {i}),\ \mbox{for all}\ i=1,2,\dots,N,over^ start_ARG italic_y end_ARG start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_t ) := italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) , italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , for all italic_i = 1 , 2 , … , italic_N , (3)

where parameter θi𝐝θisubscript𝜃𝑖superscriptsubscript𝐝subscript𝜃𝑖\theta_{i}\in\mathbb{R}^{\mathbf{d}_{\theta_{i}}}italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT bold_d start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, with i=1N𝐝θi=𝐝θsuperscriptsubscript𝑖1𝑁subscript𝐝subscript𝜃𝑖subscript𝐝𝜃\sum_{i=1}^{N}\mathbf{d}_{\theta_{i}}=\mathbf{d}_{\theta}∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT bold_d start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT = bold_d start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT. The output estimate of the network is defined as

y^θ(t)=1Ni=1Ny^θii(t)=1Ni=1Nϕi(ui(t),θi).subscript^𝑦𝜃𝑡1𝑁superscriptsubscript𝑖1𝑁subscriptsuperscript^𝑦𝑖subscript𝜃𝑖𝑡1𝑁superscriptsubscript𝑖1𝑁subscriptitalic-ϕ𝑖subscriptu𝑖𝑡subscript𝜃𝑖\displaystyle\widehat{y}_{\theta}(t)=\frac{1}{N}\sum_{i=1}^{N}\widehat{y}^{i}_% {\theta_{i}}(t)=\frac{1}{N}\sum_{i=1}^{N}\phi_{i}(\text{u}_{i}(t),\theta_{i}).over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ( italic_t ) = divide start_ARG 1 end_ARG start_ARG italic_N end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT over^ start_ARG italic_y end_ARG start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_t ) = divide start_ARG 1 end_ARG start_ARG italic_N end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) , italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) . (4)

Here we assume that the parametric model above is accurate in the sense that there exists a θ:=[θ1;θ2,;θN]𝐝θassignsuperscript𝜃superscriptsubscript𝜃1superscriptsubscript𝜃2superscriptsubscript𝜃𝑁superscriptsubscript𝐝𝜃\theta^{\star}:=[\theta_{1}^{\star};\theta_{2}^{\star},\dots;\theta_{N}^{\star% }]\in\mathbb{R}^{\mathbf{d}_{\theta}}italic_θ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT := [ italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ; italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT , … ; italic_θ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ] ∈ blackboard_R start_POSTSUPERSCRIPT bold_d start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT end_POSTSUPERSCRIPT such that

y^(t)=y^θ(t)=1Ni=1Nϕi(ui(t),θi),for allt.formulae-sequence^𝑦𝑡subscript^𝑦superscript𝜃𝑡1𝑁superscriptsubscript𝑖1𝑁subscriptitalic-ϕ𝑖subscriptu𝑖𝑡subscriptsuperscript𝜃𝑖for all𝑡\displaystyle\widehat{y}(t)=\widehat{y}_{\theta^{\star}}(t)=\frac{1}{N}\sum_{i% =1}^{N}\phi_{i}(\text{u}_{i}(t),\theta^{\star}_{i}),\ \mbox{for all}\ t.over^ start_ARG italic_y end_ARG ( italic_t ) = over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_θ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_t ) = divide start_ARG 1 end_ARG start_ARG italic_N end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) , italic_θ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , for all italic_t . (5)

With (5), we formulate the following predictive control problem,

minimizeu(1),u(2),,u(T)𝐝𝐮subscriptminimizeu1u2u𝑇superscriptsubscript𝐝𝐮\displaystyle\operatorname*{minimize}_{\text{u}(1),\text{u}(2),\dots,\text{u}(% T)\in\mathbb{R}^{\mathbf{d_{u}}}}roman_minimize start_POSTSUBSCRIPT u ( 1 ) , u ( 2 ) , … , u ( italic_T ) ∈ blackboard_R start_POSTSUPERSCRIPT bold_d start_POSTSUBSCRIPT bold_u end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT k=1Ti=1Ni(y^θ(k),k)+hi(ui(k),k)superscriptsubscript𝑘1𝑇superscriptsubscript𝑖1𝑁subscript𝑖subscript^𝑦𝜃𝑘𝑘subscript𝑖subscriptu𝑖𝑘𝑘\displaystyle\sum_{k=1}^{T}\sum_{i=1}^{N}\ell_{i}(\widehat{y}_{\theta}(k),k)+h% _{i}(\text{u}_{i}(k),k)∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ( italic_k ) , italic_k ) + italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_k ) , italic_k ) (6)
subject toui(k)subject tosubscriptu𝑖𝑘\displaystyle\mbox{subject to}\ \text{u}_{i}(k)subject to u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_k ) 𝒰i, for alli=1,2,,N,for allk.formulae-sequenceabsentsubscript𝒰𝑖 for all𝑖12𝑁for all𝑘\displaystyle\in\mathcal{U}_{i},\mbox{ for all}\ i=1,2,\dots,N,\ \ \mbox{for % all}\ k.∈ caligraphic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , for all italic_i = 1 , 2 , … , italic_N , for all italic_k .

Note that (6) is equivalent to problem (1) if y(t)=y^(t)=y^θ(t)𝑦𝑡^𝑦𝑡subscript^𝑦𝜃𝑡y(t)=\widehat{y}(t)=\widehat{y}_{\theta}(t)italic_y ( italic_t ) = over^ start_ARG italic_y end_ARG ( italic_t ) = over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ( italic_t ) hold for all t𝑡titalic_t. This is typically assumed in the state-of-the-art to solve problem (1) (see [11], Assumption 4, [12], Assumption 5, for example). However, when the output estimate y^θsubscript^𝑦𝜃\widehat{y}_{\theta}over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT doesn’t match with the true measurements of the system the control performance is affected adversely. In this article, we take the approach of adaptively improving the parametric output estimate of the sub-systems to maintain the validity of (5). Denote L(θ)=|k=1Ti=1Ni(y^θ(k),k)k=1Ti=1Ni(y^(k),k)|𝐿𝜃superscriptsubscript𝑘1𝑇superscriptsubscript𝑖1𝑁subscript𝑖subscript^𝑦𝜃𝑘𝑘superscriptsubscript𝑘1𝑇superscriptsubscript𝑖1𝑁subscript𝑖^𝑦𝑘𝑘L(\theta)=|\sum_{k=1}^{T}\sum_{i=1}^{N}\ell_{i}(\widehat{y}_{\theta}(k),k)-% \sum_{k=1}^{T}\sum_{i=1}^{N}\ell_{i}(\widehat{y}(k),k)|italic_L ( italic_θ ) = | ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ( italic_k ) , italic_k ) - ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( over^ start_ARG italic_y end_ARG ( italic_k ) , italic_k ) | as the cost of model mismatch with the controller running over a horizon T𝑇Titalic_T. Our goal is to develop algorithms that minimize the model mismatch quantified by L(θ)𝐿𝜃L(\theta)italic_L ( italic_θ ) in real-time so that the performance of the closed-loop controllers in solving (6) is not compromised due to model mismatches.

III Model Update Problem and the Distributed Reformulation

Given the criticality of (5) we aim to reduce the model mismatch by finding the parameter θ𝜃\thetaitalic_θ that solves

minimizeθ𝐫(θ):=12t=1Ty^(t)1Ni=1Nϕi(ui(t),θi)2.assignsubscriptminimize𝜃𝐫𝜃12superscriptsubscript𝑡1𝑇superscriptdelimited-∥∥^𝑦𝑡1𝑁superscriptsubscript𝑖1𝑁subscriptitalic-ϕ𝑖subscriptu𝑖𝑡subscript𝜃𝑖2\displaystyle\operatorname*{minimize}_{\theta}\ \mathbf{r}(\theta):=\frac{1}{2% }\sum_{t=1}^{T}\left\lVert\widehat{y}(t)-\frac{1}{N}\sum_{i=1}^{N}\phi_{i}(% \text{u}_{i}(t),\theta_{i})\right\rVert^{2}.roman_minimize start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT bold_r ( italic_θ ) := divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ∥ over^ start_ARG italic_y end_ARG ( italic_t ) - divide start_ARG 1 end_ARG start_ARG italic_N end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) , italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (7)

For the subsequent development, we make the following assumptions:

Assumption 1.

The control decisions generated via problem (6) ensure the stability of the networked system 𝒮𝒮\mathcal{S}caligraphic_S.

Assumption 2.

Functions ϕisubscriptitalic-ϕ𝑖\phi_{i}italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT in (3) are proper, convex, and Lipschitz continuous with constant Lisubscript𝐿𝑖L_{i}italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for all i𝑖iitalic_i.

Note that under the network model, each sub-system i𝑖iitalic_i creates an estimate y^θii(t)subscriptsuperscript^𝑦𝑖subscript𝜃𝑖𝑡\widehat{y}^{i}_{\theta_{i}}(t)over^ start_ARG italic_y end_ARG start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_t ) to determine the effect of its regional control decisions ui(t)subscriptu𝑖𝑡\text{u}_{i}(t)u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) on the output (reflected in the measurements). Problem (7) can be interpreted as a distributed optimization problem across a network of sub-systems as:

minimizeθ12N2t=1Ti=1N(y^(t)ϕi(ui(t),θi))2.subscriptminimize𝜃12superscript𝑁2superscriptsubscript𝑡1𝑇superscriptdelimited-∥∥superscriptsubscript𝑖1𝑁^𝑦𝑡subscriptitalic-ϕ𝑖subscriptu𝑖𝑡subscript𝜃𝑖2\displaystyle\operatorname*{minimize}_{\theta}\ \frac{1}{2N^{2}}\sum_{t=1}^{T}% \left\lVert\sum_{i=1}^{N}\left(\widehat{y}(t)-\phi_{i}(\text{u}_{i}(t),\theta_% {i})\right)\right\rVert^{2}.roman_minimize start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG 2 italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ∥ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT ( over^ start_ARG italic_y end_ARG ( italic_t ) - italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) , italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (8)

where we rewrite y^^𝑦\widehat{y}over^ start_ARG italic_y end_ARG by 1Ni=1Ny^1𝑁superscriptsubscript𝑖1𝑁^𝑦\frac{1}{N}\sum_{i=1}^{N}\widehat{y}divide start_ARG 1 end_ARG start_ARG italic_N end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT over^ start_ARG italic_y end_ARG in (7) to derive (8). Optimization problem (8) couples the parameters and data for all the agents. We next consider a reformulation described in [22] to set up a formulation for a distributed algorithm, allowing the sub-systems to do local computations and communicate with the neighboring sub-systems in the network 𝒢(𝒱,)𝒢𝒱\mathcal{G}(\mathcal{V},\mathcal{E})caligraphic_G ( caligraphic_V , caligraphic_E ) to determine a solution for problem (8). Assuming the network 𝒢(𝒱,)𝒢𝒱\mathcal{G}(\mathcal{V},\mathcal{E})caligraphic_G ( caligraphic_V , caligraphic_E ) is connected, let 𝐏N×N𝐏superscript𝑁𝑁\mathbf{P}\in\mathbb{R}^{N\times N}bold_P ∈ blackboard_R start_POSTSUPERSCRIPT italic_N × italic_N end_POSTSUPERSCRIPT be a finite weight matrix associated with the graph satisfying the following assumption:

Assumption 3.

null{𝐏}=span{1N}null𝐏spansubscript1𝑁\operatorname{null}\{\mathbf{P}\}=\operatorname{span}\{1_{N}\}roman_null { bold_P } = roman_span { 1 start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT }.

A few examples of matrices 𝐏𝐏\mathbf{P}bold_P that satisfy Assumption 3 are:

  • (i)

    Laplacian matrix: The Laplacian matrix of the graph [23] is defined as:

    𝐏ijsubscript𝐏𝑖𝑗\displaystyle\mathbf{P}_{ij}bold_P start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ={1,if(i,j),|𝒩i|,ifi=j,0,otherwise.absentcases1if𝑖𝑗subscript𝒩𝑖if𝑖𝑗0otherwise\displaystyle=\begin{cases}-1,&\text{if}\ (i,j)\in\mathcal{E},\\ |\operatorname{\mathcal{N}}_{i}|,&\text{if}\ i=j,\\ 0,&\text{otherwise}.\end{cases}= { start_ROW start_CELL - 1 , end_CELL start_CELL if ( italic_i , italic_j ) ∈ caligraphic_E , end_CELL end_ROW start_ROW start_CELL | caligraphic_N start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | , end_CELL start_CELL if italic_i = italic_j , end_CELL end_ROW start_ROW start_CELL 0 , end_CELL start_CELL otherwise . end_CELL end_ROW
  • (ii)

    A matrix created via a column stochastic matrix: 𝐏=𝐈N𝐏~𝐏subscript𝐈𝑁superscript~𝐏top\mathbf{P}=\mathbf{I}_{N}-\tilde{\mathbf{P}}^{\top}bold_P = bold_I start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT - over~ start_ARG bold_P end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT, where, 𝐏~~𝐏\tilde{\mathbf{P}}over~ start_ARG bold_P end_ARG is a column stochastic matrix such that [𝐏~][0,1],1N𝐏~=1Nformulae-sequencedelimited-[]~𝐏01superscriptsubscript1𝑁top~𝐏subscript1𝑁[\tilde{\mathbf{P}}]\in[0,1],1_{N}^{\top}\tilde{\mathbf{P}}=1_{N}[ over~ start_ARG bold_P end_ARG ] ∈ [ 0 , 1 ] , 1 start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over~ start_ARG bold_P end_ARG = 1 start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT.

Let Φ(u(t),θ):=[ϕ1(u1(t),θ1);ϕ2(u2(t),θ2);;\Phi(\text{u}(t),\theta):=[\phi_{1}(\text{u}_{1}(t),\theta_{1});\phi_{2}(\text% {u}_{2}(t),\theta_{2});\dots;roman_Φ ( u ( italic_t ) , italic_θ ) := [ italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) , italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ; italic_ϕ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_t ) , italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ; … ; ϕN(uN(t),θN)]N𝐝𝐲,𝐲^(t):=[y^(t);y^(t);;y^(t)]N𝐝𝐲,𝐏^:=𝐏𝐈𝐝𝐲,𝐱=[θ;w]𝐝θ+N𝐝𝐲\phi_{N}(\text{u}_{N}(t),\theta_{N})]\in\mathbb{R}^{N\mathbf{d_{y}}},\widehat{% \mathbf{y}}(t):=[\widehat{y}(t);\widehat{y}(t);\dots;\widehat{y}(t)]\in\mathbb% {R}^{N\mathbf{d_{y}}},\widehat{\mathbf{P}}:=\mathbf{P}\otimes\mathbf{I}_{% \mathbf{d_{y}}},\mathbf{x}=[\theta;w]\in\mathbb{R}^{\mathbf{d}_{\theta}+N% \mathbf{d_{y}}}italic_ϕ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ( italic_t ) , italic_θ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ) ] ∈ blackboard_R start_POSTSUPERSCRIPT italic_N bold_d start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , over^ start_ARG bold_y end_ARG ( italic_t ) := [ over^ start_ARG italic_y end_ARG ( italic_t ) ; over^ start_ARG italic_y end_ARG ( italic_t ) ; … ; over^ start_ARG italic_y end_ARG ( italic_t ) ] ∈ blackboard_R start_POSTSUPERSCRIPT italic_N bold_d start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , over^ start_ARG bold_P end_ARG := bold_P ⊗ bold_I start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_x = [ italic_θ ; italic_w ] ∈ blackboard_R start_POSTSUPERSCRIPT bold_d start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT + italic_N bold_d start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT end_POSTSUPERSCRIPT. Consider the problem

argmin𝐱=[θ;w]t=1T[ft(𝐱):=12N2Φ(u(t),θ)𝐲^(t)𝐏^w2].subscriptargmin𝐱𝜃𝑤superscriptsubscript𝑡1𝑇delimited-[]assignsubscript𝑓𝑡𝐱12superscript𝑁2superscriptdelimited-∥∥Φu𝑡𝜃^𝐲𝑡^𝐏𝑤2\displaystyle\operatorname*{argmin}_{\mathbf{x}=[\theta;w]}\sum_{t=1}^{T}\left% [f_{t}(\mathbf{x}):=\textstyle\frac{1}{2N^{2}}\left\lVert\Phi(\text{u}(t),% \theta)-\widehat{\mathbf{y}}(t)-\widehat{\mathbf{P}}w\right\rVert^{2}\right].roman_argmin start_POSTSUBSCRIPT bold_x = [ italic_θ ; italic_w ] end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT [ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ) := divide start_ARG 1 end_ARG start_ARG 2 italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ∥ roman_Φ ( u ( italic_t ) , italic_θ ) - over^ start_ARG bold_y end_ARG ( italic_t ) - over^ start_ARG bold_P end_ARG italic_w ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] . (9)

Let F(𝐱):=t=1Tft(𝐱)assign𝐹𝐱superscriptsubscript𝑡1𝑇subscript𝑓𝑡𝐱F(\mathbf{x}):=\sum_{t=1}^{T}f_{t}(\mathbf{x})italic_F ( bold_x ) := ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ) for brevity of notation. Problem (9) allows for the objective function ftsubscript𝑓𝑡f_{t}italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT to be distributed across different sub-systems and allows for the synthesis of a distributed algorithm. In the next result, we establish that solving problem (9) indeed aids towards our objective of solving (7). Specifically, Lemma 1 shows that the solution to (9) is also a solution for (8) and thus provides a solution for (7). We make the following assumption,

Assumption 4.

The set of minimizing solution to problem (9), argmin𝐱F(𝐱)subscriptargmin𝐱𝐹𝐱\operatorname*{argmin}_{\mathbf{x}}F(\mathbf{x})roman_argmin start_POSTSUBSCRIPT bold_x end_POSTSUBSCRIPT italic_F ( bold_x ), is non-empty and bounded.

Lemma 1.

(Optimal solutions of (8) and (9)). Let the matrix 𝐏^^𝐏\widehat{\mathbf{P}}over^ start_ARG bold_P end_ARG in (9) be such that 𝐏𝐏\mathbf{P}bold_P satisfy Assumption 3. If 𝐱=[θ;w]superscript𝐱superscript𝜃superscript𝑤\mathbf{x}^{\star}=[\theta^{\star};w^{\star}]bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT = [ italic_θ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ; italic_w start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ] is a solution to problem (9), then θsuperscript𝜃\theta^{\star}italic_θ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT is also a solution to problem (8).

Proof.

Using the first order optimality conditions for (9) with convex function F𝐹Fitalic_F (sum of composition of convex and increasing functions), we have θF(𝐱)=0,wF(𝐱)=0formulae-sequencesubscript𝜃𝐹superscript𝐱0subscript𝑤𝐹superscript𝐱0\nabla_{\theta}F(\mathbf{x}^{\star})=0,\;\nabla_{w}F(\mathbf{x}^{\star})=0∇ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT italic_F ( bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) = 0 , ∇ start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT italic_F ( bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) = 0. Namely, for all t{1,2,,T}𝑡12𝑇t\in\{1,2,\dots,T\}italic_t ∈ { 1 , 2 , … , italic_T },

θΦ(u(t),θ)(Φ(u(t),θ)𝐲^(t)𝐏^w)subscript𝜃Φsuperscriptu𝑡superscript𝜃topΦu𝑡superscript𝜃^𝐲𝑡^𝐏superscript𝑤\displaystyle\nabla_{\theta}\Phi(\text{u}(t),\theta^{\star})^{\top}(\Phi(\text% {u}(t),\theta^{\star})-\widehat{\mathbf{y}}(t)-\widehat{\mathbf{P}}w^{\star})∇ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT roman_Φ ( u ( italic_t ) , italic_θ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( roman_Φ ( u ( italic_t ) , italic_θ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) - over^ start_ARG bold_y end_ARG ( italic_t ) - over^ start_ARG bold_P end_ARG italic_w start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) =0,absent0\displaystyle=0,= 0 , (10)
𝐏^(Φ(u(t),θ)𝐲^(t)𝐏^w)superscript^𝐏topΦu𝑡superscript𝜃^𝐲𝑡^𝐏superscript𝑤\displaystyle\widehat{\mathbf{P}}^{\top}(\Phi(\text{u}(t),\theta^{\star})-% \widehat{\mathbf{y}}(t)-\widehat{\mathbf{P}}w^{\star})over^ start_ARG bold_P end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( roman_Φ ( u ( italic_t ) , italic_θ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) - over^ start_ARG bold_y end_ARG ( italic_t ) - over^ start_ARG bold_P end_ARG italic_w start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) =0.absent0\displaystyle=0.= 0 . (11)

Since, null{𝐏}=span{1N}null𝐏spansubscript1𝑁\operatorname{null}\{\mathbf{P}\}=\operatorname{span}\{1_{N}\}roman_null { bold_P } = roman_span { 1 start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT }, it follows from (11) that there exists zsuperscript𝑧z^{\star}italic_z start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT such that

1Nz:=(Φ(u(t),θ)𝐲^(t)𝐏^w).assigntensor-productsubscript1𝑁superscript𝑧Φu𝑡superscript𝜃^𝐲𝑡^𝐏superscript𝑤\displaystyle 1_{N}\otimes z^{\star}:=(\Phi(\text{u}(t),\theta^{\star})-% \widehat{\mathbf{y}}(t)-\widehat{\mathbf{P}}w^{\star}).1 start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ⊗ italic_z start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT := ( roman_Φ ( u ( italic_t ) , italic_θ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) - over^ start_ARG bold_y end_ARG ( italic_t ) - over^ start_ARG bold_P end_ARG italic_w start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) . (12)

Multiplying both by 1N𝐈𝐝𝐲tensor-productsuperscriptsubscript1𝑁topsubscript𝐈subscript𝐝𝐲1_{N}^{\top}\otimes\mathbf{I}_{\mathbf{d_{y}}}1 start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ⊗ bold_I start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT we get,

Nz𝑁superscript𝑧\displaystyle Nz^{\star}italic_N italic_z start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT =(1N𝐈𝐝𝐲)1Nzabsenttensor-producttensor-productsuperscriptsubscript1𝑁topsubscript𝐈subscript𝐝𝐲subscript1𝑁superscript𝑧\displaystyle=(1_{N}^{\top}\otimes\mathbf{I}_{\mathbf{d_{y}}})1_{N}\otimes z^{\star}= ( 1 start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ⊗ bold_I start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) 1 start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ⊗ italic_z start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT
=(1N𝐈𝐝𝐲)(Φ(u(t),θ)𝐲^(t)𝐏^w)absenttensor-productsuperscriptsubscript1𝑁topsubscript𝐈subscript𝐝𝐲Φu𝑡superscript𝜃^𝐲𝑡^𝐏superscript𝑤\displaystyle=(1_{N}^{\top}\otimes\mathbf{I}_{\mathbf{d_{y}}})(\Phi(\text{u}(t% ),\theta^{\star})-\widehat{\mathbf{y}}(t)-\widehat{\mathbf{P}}w^{\star})= ( 1 start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ⊗ bold_I start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ( roman_Φ ( u ( italic_t ) , italic_θ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) - over^ start_ARG bold_y end_ARG ( italic_t ) - over^ start_ARG bold_P end_ARG italic_w start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT )
=i=1N(ϕi(ui(t),θi)y^(t))(1N𝐈𝐝𝐲)(𝐏𝐈𝐝𝐲)wabsentsuperscriptsubscript𝑖1𝑁subscriptitalic-ϕ𝑖subscriptu𝑖𝑡superscriptsubscript𝜃𝑖^𝑦𝑡tensor-productsuperscriptsubscript1𝑁topsubscript𝐈subscript𝐝𝐲tensor-product𝐏subscript𝐈subscript𝐝𝐲superscript𝑤\displaystyle\textstyle=\sum_{i=1}^{N}\big{(}\phi_{i}(\text{u}_{i}(t),\theta_{% i}^{\star})-\widehat{y}(t)\big{)}-(1_{N}^{\top}\otimes\mathbf{I}_{\mathbf{d_{y% }}})(\mathbf{P}\otimes\mathbf{I}_{\mathbf{d_{y}}})w^{\star}= ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT ( italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) , italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) - over^ start_ARG italic_y end_ARG ( italic_t ) ) - ( 1 start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ⊗ bold_I start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ( bold_P ⊗ bold_I start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) italic_w start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT
=i=1N(ϕi(ui(t),θi)y^(t))(1N𝐏)(𝐈𝐝𝐲𝐈𝐝𝐲)wabsentsuperscriptsubscript𝑖1𝑁subscriptitalic-ϕ𝑖subscriptu𝑖𝑡superscriptsubscript𝜃𝑖^𝑦𝑡tensor-productsuperscriptsubscript1𝑁top𝐏tensor-productsubscript𝐈subscript𝐝𝐲subscript𝐈subscript𝐝𝐲superscript𝑤\displaystyle\textstyle=\sum_{i=1}^{N}\big{(}\phi_{i}(\text{u}_{i}(t),\theta_{% i}^{\star})-\widehat{y}(t)\big{)}-(1_{N}^{\top}\otimes\mathbf{P})(\mathbf{I}_{% \mathbf{d_{y}}}\otimes\mathbf{I}_{\mathbf{d_{y}}})w^{\star}= ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT ( italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) , italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) - over^ start_ARG italic_y end_ARG ( italic_t ) ) - ( 1 start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ⊗ bold_P ) ( bold_I start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⊗ bold_I start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) italic_w start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT
=i=1N(ϕi(ui(t),θi)y^(t)),absentsuperscriptsubscript𝑖1𝑁subscriptitalic-ϕ𝑖subscriptu𝑖𝑡superscriptsubscript𝜃𝑖^𝑦𝑡\displaystyle\textstyle=\sum_{i=1}^{N}\big{(}\phi_{i}(\text{u}_{i}(t),\theta_{% i}^{\star})-\widehat{y}(t)\big{)},= ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT ( italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) , italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) - over^ start_ARG italic_y end_ARG ( italic_t ) ) ,

where we used Assumption 3 in the last step. Thus,

z=1Ni=1N(ϕi(ui(t),θi)y^(t)).superscript𝑧1𝑁superscriptsubscript𝑖1𝑁subscriptitalic-ϕ𝑖subscriptu𝑖𝑡superscriptsubscript𝜃𝑖^𝑦𝑡\displaystyle\textstyle z^{\star}=\frac{1}{N}\sum_{i=1}^{N}\big{(}\phi_{i}(% \text{u}_{i}(t),\theta_{i}^{\star})-\widehat{y}(t)\big{)}.italic_z start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_N end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT ( italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) , italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) - over^ start_ARG italic_y end_ARG ( italic_t ) ) . (13)

Substituting (12) and (13) in (10), gives: for all t{1,,T}𝑡1𝑇t\in\{1,\dots,T\}italic_t ∈ { 1 , … , italic_T }

θjΦ(uj(t),θj)[i=1N(ϕi(ui(t),θi)\displaystyle\nabla_{\theta_{j}}\textstyle\Phi(\text{u}_{j}(t),\theta^{\star}_% {j})^{\top}\Big{[}\sum_{i=1}^{N}\big{(}\phi_{i}(\text{u}_{i}(t),\theta_{i}^{% \star})∇ start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Φ ( u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ) , italic_θ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT [ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT ( italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) , italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT )
y^(t))]=0,j{1,2,,N}.\displaystyle\hskip 72.26999pt-\widehat{y}(t)\big{)}\Big{]}=0,\;\forall j\in\{% 1,2,\dots,N\}.- over^ start_ARG italic_y end_ARG ( italic_t ) ) ] = 0 , ∀ italic_j ∈ { 1 , 2 , … , italic_N } . (14)

As problem (8) is convex, by the optimality conditions, any θsuperscript𝜃\theta^{{}^{\prime}}italic_θ start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT is a solution of (8) if and only if, for all t{1,,T}𝑡1𝑇t\in\{1,\dots,T\}italic_t ∈ { 1 , … , italic_T },

θjϕj(uj(t),θj)[i=1N(ϕi(ui(t),θi)\displaystyle\nabla_{\theta_{j}}\phi_{j}\textstyle(\text{u}_{j}(t),\theta^{{}^% {\prime}}_{j})^{\top}\Big{[}\sum_{i=1}^{N}\big{(}\phi_{i}(\text{u}_{i}(t),% \theta_{i}^{{}^{\prime}})∇ start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ) , italic_θ start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT [ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT ( italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) , italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT )
y^(t))]=0,j{1,2,,N}.\displaystyle\hskip 72.26999pt\textstyle-\widehat{y}(t)\big{)}\Big{]}=0,\ % \forall j\in\{1,2,\dots,N\}.- over^ start_ARG italic_y end_ARG ( italic_t ) ) ] = 0 , ∀ italic_j ∈ { 1 , 2 , … , italic_N } .

Thus, we conclude θsuperscript𝜃\theta^{\star}italic_θ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT is a solution to (8). ∎

Using Lemma 1, we can concentrate on solving (9), which facilitates the development of distributed solutions, as will be demonstrated in the following section.

IV Online Model Update Algorithm

Recall the problem statement in Section II. Assume a local controller is available at each sub-system i𝑖iitalic_i that solves problem (6). Starting at any time t0subscript𝑡0t_{0}italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT the local controller has access to the model with parameter θ0superscript𝜃0\theta^{0}italic_θ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT that is used to estimate the output y^θ0(t0)subscript^𝑦superscript𝜃0subscript𝑡0\widehat{y}_{\theta^{0}}(t_{0})over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_θ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) for [t0,t0+T1]subscript𝑡0subscript𝑡0𝑇1[t_{0},t_{0}+T-1][ italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_T - 1 ] based on which it generates local control decisions uisubscriptu𝑖\text{u}_{i}u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT that are implemented in the system by the local actuators at some time t1t0+Δtsubscript𝑡1subscript𝑡0Δ𝑡t_{1}\geq t_{0}+\Delta titalic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≥ italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + roman_Δ italic_t, where ΔtΔ𝑡\Delta troman_Δ italic_t is the amount of time it takes to solve problem (6). At time t1subscript𝑡1t_{1}italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT a measurement y^(t1)^𝑦subscript𝑡1\widehat{y}(t_{1})over^ start_ARG italic_y end_ARG ( italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) of the observable is obtained. If the model predicted output, y^θ0(t1)subscript^𝑦superscript𝜃0subscript𝑡1\widehat{y}_{\theta^{0}}(t_{1})over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_θ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ), does not match the measurement, y^(t1)^𝑦subscript𝑡1\widehat{y}(t_{1})over^ start_ARG italic_y end_ARG ( italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ), the model parameter needs to be updated. We meet this objective by developing an online algorithm to solve the system model update problem (9). The function ft(𝐱)subscript𝑓𝑡𝐱f_{t}(\mathbf{x})italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ) in (9) is used to update the parameters to a new value at time t𝑡titalic_t given the measurement y^(t)^𝑦𝑡\widehat{y}(t)over^ start_ARG italic_y end_ARG ( italic_t ) and the control decisions ui(t)subscriptu𝑖𝑡\text{u}_{i}(t)u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ). We aim to minimize the “regret” of the online algorithm compared to a model devised using all the input-output pairs in hindsight. Let {𝐱(t)}t1subscript𝐱𝑡𝑡1\{\mathbf{x}(t)\}_{t\geq 1}{ bold_x ( italic_t ) } start_POSTSUBSCRIPT italic_t ≥ 1 end_POSTSUBSCRIPT denote the solution parameters generated by our algorithm, we formally define regret of our algorithm after any time T𝑇Titalic_T as,

T:=t=1Tft(𝐱(t))min𝐱t=1Tft(𝐱).assignsubscript𝑇superscriptsubscript𝑡1𝑇subscript𝑓𝑡𝐱𝑡subscript𝐱superscriptsubscript𝑡1𝑇subscript𝑓𝑡𝐱\displaystyle\mathcal{R}_{T}:=\sum_{t=1}^{T}f_{t}(\mathbf{x}(t))-\min_{\mathbf% {x}}\sum_{t=1}^{T}f_{t}(\mathbf{x}).caligraphic_R start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT := ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) ) - roman_min start_POSTSUBSCRIPT bold_x end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ) . (15)

Note that if Tsubscript𝑇\mathcal{R}_{T}caligraphic_R start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT is zero, then the solution sequence {𝐱(t)}t1subscript𝐱𝑡𝑡1\{\mathbf{x}(t)\}_{t\geq 1}{ bold_x ( italic_t ) } start_POSTSUBSCRIPT italic_t ≥ 1 end_POSTSUBSCRIPT is such that the total error incurred is equal to the error obtained by minimizing the error objective function in (9) created by using the control decisions and measurement data over the entire time horizon [t0,t0+T1]subscript𝑡0subscript𝑡0𝑇1[t_{0},t_{0}+T-1][ italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_T - 1 ]. We propose Algorithm 1 to update 𝐱(t)=[θ(t);w(t)]𝐱𝑡𝜃𝑡𝑤𝑡\mathbf{x}(t)=[\theta(t);w(t)]bold_x ( italic_t ) = [ italic_θ ( italic_t ) ; italic_w ( italic_t ) ] in an online manner.

For t=0,1,2,𝑡012bold-…t=0,1,2,\dotsitalic_t = 0 , 1 , 2 , bold_…
       - Given 𝐱(t)=[θ(t);w(t)]𝐝θ+N𝐝𝐲,u(t),y^(t)formulae-sequence𝐱𝑡𝜃𝑡𝑤𝑡superscriptsubscript𝐝𝜃𝑁subscript𝐝𝐲u𝑡^𝑦𝑡\mathbf{x}(t)=[\theta(t);w(t)]\in\mathbb{R}^{\mathbf{d}_{\theta}+N\mathbf{d_{y% }}},\text{u}(t),\widehat{y}(t)bold_x ( italic_t ) = [ italic_θ ( italic_t ) ; italic_w ( italic_t ) ] ∈ blackboard_R start_POSTSUPERSCRIPT bold_d start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT + italic_N bold_d start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , u ( italic_t ) , over^ start_ARG italic_y end_ARG ( italic_t )
       - 𝐱(t+1)=𝐱(t)ηtft(𝐱(t))𝐱𝑡1𝐱𝑡subscript𝜂𝑡subscript𝑓𝑡𝐱𝑡\mathbf{x}(t+1)=\mathbf{x}(t)-\eta_{t}\nabla f_{t}(\mathbf{x}(t))bold_x ( italic_t + 1 ) = bold_x ( italic_t ) - italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∇ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) )
Algorithm 1 Online Input-Output Map Update

Lemma 2 establishes the boundedness of the gradient steps involved in Algorithm 1.

Lemma 2.

Let Assumptions 1-4 hold. There exists constants ηt>0subscript𝜂𝑡0\eta_{t}>0italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT > 0 and δ<𝛿\delta<\inftyitalic_δ < ∞ such that ft:=[θftwft]δassignnormsubscript𝑓𝑡normdelimited-[]subscript𝜃subscript𝑓𝑡missing-subexpressionsubscript𝑤subscript𝑓𝑡missing-subexpression𝛿\|\nabla f_{t}\|:=\left\|\left[\begin{array}[]{cc}\nabla_{\theta}f_{t}\\ \nabla_{w}f_{t}\end{array}\right]\right\|\leq\delta∥ ∇ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ := ∥ [ start_ARRAY start_ROW start_CELL ∇ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL ∇ start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_CELL start_CELL end_CELL end_ROW end_ARRAY ] ∥ ≤ italic_δ for all t{1,,T}𝑡1𝑇t\in\{1,\dots,T\}italic_t ∈ { 1 , … , italic_T } with 𝐱(t)𝐱𝑡\mathbf{x}(t)bold_x ( italic_t ) updated by Algorithm 1

Proof.

We start by presenting three supporting claims that we later utilize to prove the desired result.

Claim 1: Any γ𝛾\gammaitalic_γ sub-level set Cγ:={𝐱|ft(𝐱)γ}assignsubscript𝐶𝛾conditional-set𝐱subscript𝑓𝑡𝐱𝛾C_{\gamma}:=\{\mathbf{x}\ |\ f_{t}(\mathbf{x})\leq\gamma\}italic_C start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT := { bold_x | italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ) ≤ italic_γ } of ftsubscript𝑓𝑡f_{t}italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is bounded.

Proof. Given β>0𝛽0\beta>0italic_β > 0, let vargmin𝐱ft(𝐱)superscript𝑣subscriptargmin𝐱subscript𝑓𝑡𝐱v^{\star}\in\operatorname*{argmin}_{\mathbf{x}}f_{t}(\mathbf{x})italic_v start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∈ roman_argmin start_POSTSUBSCRIPT bold_x end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ), with v<normsuperscript𝑣\|v^{\star}\|<\infty∥ italic_v start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ < ∞. Define, Γβ:={𝐱|𝐱v=β}assignsubscriptΓ𝛽conditional-set𝐱norm𝐱superscript𝑣𝛽\Gamma_{\beta}:=\{\mathbf{x}\ |\ \|\mathbf{x}-v^{\star}\|=\beta\}roman_Γ start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT := { bold_x | ∥ bold_x - italic_v start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ = italic_β } and vβ=inf𝐱Γβft(𝐱)subscript𝑣𝛽subscriptinfimum𝐱subscriptΓ𝛽subscript𝑓𝑡𝐱v_{\beta}=\inf_{\mathbf{x}\in\Gamma_{\beta}}f_{t}(\mathbf{x})italic_v start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT = roman_inf start_POSTSUBSCRIPT bold_x ∈ roman_Γ start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ). Note that ΓβsubscriptΓ𝛽\Gamma_{\beta}roman_Γ start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT is non-empty and compact. Since, ftsubscript𝑓𝑡f_{t}italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is continuous, from the Weierstrass’s theorem vβsubscript𝑣𝛽v_{\beta}italic_v start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT is attained at some point of ΓβsubscriptΓ𝛽\Gamma_{\beta}roman_Γ start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT, we have vβ>ft(v)subscript𝑣𝛽subscript𝑓𝑡superscript𝑣v_{\beta}>f_{t}(v^{\star})italic_v start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT > italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_v start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ). For any 𝐱𝐱\mathbf{x}bold_x such that 𝐱v>βnorm𝐱superscript𝑣𝛽\|\mathbf{x}-v^{\star}\|>\beta∥ bold_x - italic_v start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ > italic_β, let α=β𝐱v,𝐱~=(1α)v+α𝐱formulae-sequence𝛼𝛽norm𝐱superscript𝑣~𝐱1𝛼superscript𝑣𝛼𝐱\alpha=\frac{\beta}{\|\mathbf{x}-v^{\star}\|},\tilde{\mathbf{x}}=(1-\alpha)v^{% \star}+\alpha\mathbf{x}italic_α = divide start_ARG italic_β end_ARG start_ARG ∥ bold_x - italic_v start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ end_ARG , over~ start_ARG bold_x end_ARG = ( 1 - italic_α ) italic_v start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT + italic_α bold_x. By convexity of ftsubscript𝑓𝑡f_{t}italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, we have

(1α)ft(v)+αft(𝐱)ft(𝐱~).1𝛼subscript𝑓𝑡superscript𝑣𝛼subscript𝑓𝑡𝐱subscript𝑓𝑡~𝐱\displaystyle(1-\alpha)f_{t}(v^{\star})+\alpha f_{t}(\mathbf{x})\geq f_{t}(% \tilde{\mathbf{x}}).( 1 - italic_α ) italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_v start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) + italic_α italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ) ≥ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( over~ start_ARG bold_x end_ARG ) .

Since 𝐱~v=α𝐱v=βnorm~𝐱superscript𝑣𝛼norm𝐱superscript𝑣𝛽\|\tilde{\mathbf{x}}-v^{\star}\|=\alpha\|\mathbf{x}-v^{\star}\|=\beta∥ over~ start_ARG bold_x end_ARG - italic_v start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ = italic_α ∥ bold_x - italic_v start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ = italic_β, 𝐱~Γβ~𝐱subscriptΓ𝛽\tilde{\mathbf{x}}\in\Gamma_{\beta}over~ start_ARG bold_x end_ARG ∈ roman_Γ start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT and

ft(𝐱~)vβ=inf𝐱Γβft(𝐱).subscript𝑓𝑡~𝐱subscript𝑣𝛽subscriptinfimum𝐱subscriptΓ𝛽subscript𝑓𝑡𝐱\displaystyle f_{t}(\tilde{\mathbf{x}})\geq v_{\beta}=\inf_{\mathbf{x}\in% \Gamma_{\beta}}f_{t}(\mathbf{x}).italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( over~ start_ARG bold_x end_ARG ) ≥ italic_v start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT = roman_inf start_POSTSUBSCRIPT bold_x ∈ roman_Γ start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ) .

Combining the above two relations, we get

ft(𝐱)subscript𝑓𝑡𝐱\displaystyle f_{t}(\mathbf{x})italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ) ft(𝐱~)ft(v)α+ft(v)ft(v)+vβft(v)αabsentsubscript𝑓𝑡~𝐱subscript𝑓𝑡superscript𝑣𝛼subscript𝑓𝑡superscript𝑣subscript𝑓𝑡superscript𝑣subscript𝑣𝛽subscript𝑓𝑡superscript𝑣𝛼\displaystyle\textstyle\geq\frac{f_{t}(\tilde{\mathbf{x}})-f_{t}(v^{\star})}{% \alpha}+f_{t}(v^{\star})\geq f_{t}(v^{\star})+\frac{v_{\beta}-f_{t}(v^{\star})% }{\alpha}≥ divide start_ARG italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( over~ start_ARG bold_x end_ARG ) - italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_v start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_α end_ARG + italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_v start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ≥ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_v start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) + divide start_ARG italic_v start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT - italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_v start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_α end_ARG
=ft(v)+vβft(v)β𝐱v.absentsubscript𝑓𝑡superscript𝑣subscript𝑣𝛽subscript𝑓𝑡superscript𝑣𝛽norm𝐱superscript𝑣\displaystyle=\textstyle f_{t}(v^{\star})+\frac{v_{\beta}-f_{t}(v^{\star})}{% \beta}\|\mathbf{x}-v^{\star}\|.= italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_v start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) + divide start_ARG italic_v start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT - italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_v start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_β end_ARG ∥ bold_x - italic_v start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ .

Because vβ>ft(v)subscript𝑣𝛽subscript𝑓𝑡superscript𝑣v_{\beta}>f_{t}(v^{\star})italic_v start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT > italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_v start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) and ft(𝐱)γsubscript𝑓𝑡𝐱𝛾f_{t}(\mathbf{x})\leq\gammaitalic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ) ≤ italic_γ, we derive

𝐱vβ(γft(v))vβft(v).norm𝐱superscript𝑣𝛽𝛾subscript𝑓𝑡superscript𝑣subscript𝑣𝛽subscript𝑓𝑡superscript𝑣\displaystyle\|\mathbf{x}-v^{\star}\|\leq\textstyle\frac{\beta(\gamma-f_{t}(v^% {\star}))}{v_{\beta}-f_{t}(v^{\star})}.∥ bold_x - italic_v start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ ≤ divide start_ARG italic_β ( italic_γ - italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_v start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ) end_ARG start_ARG italic_v start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT - italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_v start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) end_ARG .

Thus, 𝐱vmax{β,β(γft(v))vβft(v)}.norm𝐱superscript𝑣𝛽𝛽𝛾subscript𝑓𝑡superscript𝑣subscript𝑣𝛽subscript𝑓𝑡superscript𝑣\|\mathbf{x}-v^{\star}\|\leq\max\left\{\beta,\frac{\beta(\gamma-f_{t}(v^{\star% }))}{v_{\beta}-f_{t}(v^{\star})}\right\}.\hskip 25.29494pt\qed∥ bold_x - italic_v start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ ≤ roman_max { italic_β , divide start_ARG italic_β ( italic_γ - italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_v start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ) end_ARG start_ARG italic_v start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT - italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_v start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) end_ARG } . italic_∎

Claim 2: There exists a sufficiently small ηt>0subscript𝜂𝑡0\eta_{t}>0italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT > 0 such that for all t{1,2,,T}𝑡12𝑇t\in\{1,2,\dots,T\}italic_t ∈ { 1 , 2 , … , italic_T }, 𝐱(t+1)𝐱𝑡1\mathbf{x}(t+1)bold_x ( italic_t + 1 ) updated by Algorithm 1 lies in the sub-level set Cft(𝐱(t)):={𝐱|ft(𝐱)ft(𝐱(t))}assignsubscript𝐶subscript𝑓𝑡𝐱𝑡conditional-set𝐱subscript𝑓𝑡𝐱subscript𝑓𝑡𝐱𝑡C_{f_{t}(\mathbf{x}(t))}:=\{\mathbf{x}\ |\ f_{t}(\mathbf{x})\leq f_{t}(\mathbf% {x}(t))\}italic_C start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) ) end_POSTSUBSCRIPT := { bold_x | italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ) ≤ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) ) }.

Proof. By Taylor series expansion and ft0subscript𝑓𝑡0f_{t}\geq 0italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ≥ 0,

ft(𝐱(t+1))subscript𝑓𝑡𝐱𝑡1\displaystyle f_{t}(\mathbf{x}(t+1))italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t + 1 ) ) =ft(𝐱(t)ηtft(𝐱(t)))absentsubscript𝑓𝑡𝐱𝑡subscript𝜂𝑡subscript𝑓𝑡𝐱𝑡\displaystyle=f_{t}\big{(}\mathbf{x}(t)-\eta_{t}\nabla f_{t}(\mathbf{x}(t))% \big{)}= italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) - italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∇ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) ) )
=ft(𝐱(t))ηtft(𝐱(t))2+o(ηtft(𝐱(t)))absentsubscript𝑓𝑡𝐱𝑡subscript𝜂𝑡superscriptnormsubscript𝑓𝑡𝐱𝑡2𝑜subscript𝜂𝑡subscript𝑓𝑡𝐱𝑡\displaystyle=f_{t}(\mathbf{x}(t))-\eta_{t}\|\nabla f_{t}(\mathbf{x}(t))\|^{2}% +o(\eta_{t}\nabla f_{t}(\mathbf{x}(t)))= italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) ) - italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ ∇ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) ) ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_o ( italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∇ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) ) )
=ft(𝐱(t))ηt(ft(𝐱(t))2+o(ηtft(𝐱(t)))ηt)absentsubscript𝑓𝑡𝐱𝑡subscript𝜂𝑡superscriptnormsubscript𝑓𝑡𝐱𝑡2𝑜subscript𝜂𝑡subscript𝑓𝑡𝐱𝑡subscript𝜂𝑡\displaystyle=\textstyle f_{t}(\mathbf{x}(t))\!-\!\eta_{t}\left(\!\|\nabla f_{% t}(\mathbf{x}(t))\|^{2}\!+\!\frac{o(\eta_{t}\nabla f_{t}(\mathbf{x}(t)))}{\eta% _{t}}\!\right)\!= italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) ) - italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( ∥ ∇ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) ) ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + divide start_ARG italic_o ( italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∇ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) ) ) end_ARG start_ARG italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG )
ft(𝐱(t)),absentsubscript𝑓𝑡𝐱𝑡\displaystyle\leq f_{t}(\mathbf{x}(t)),≤ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) ) ,

for sufficiently small ηt>0subscript𝜂𝑡0\eta_{t}>0italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT > 0 by the definition of o(ηt)𝑜subscript𝜂𝑡o(\eta_{t})italic_o ( italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ), which completes the proof.

Claim 3: Let 𝐳(t):=Φ(u(t),θ(t))𝐲^(t)𝐏^w(t)assign𝐳𝑡Φu𝑡𝜃𝑡^𝐲𝑡^𝐏𝑤𝑡\operatorname{\mathbf{z}}(t):=\Phi(\text{u}(t),\theta(t))-\widehat{\mathbf{y}}% (t)-\widehat{\mathbf{P}}w(t)bold_z ( italic_t ) := roman_Φ ( u ( italic_t ) , italic_θ ( italic_t ) ) - over^ start_ARG bold_y end_ARG ( italic_t ) - over^ start_ARG bold_P end_ARG italic_w ( italic_t ). Then, 𝐳¯<¯𝐳\exists\ \bar{\operatorname{\mathbf{z}}}<\infty∃ over¯ start_ARG bold_z end_ARG < ∞ such that 𝐳(t)𝐳¯norm𝐳𝑡¯𝐳\|\operatorname{\mathbf{z}}(t)\|\leq\overline{\operatorname{\mathbf{z}}}∥ bold_z ( italic_t ) ∥ ≤ over¯ start_ARG bold_z end_ARG for all t{1,2,,T}𝑡12𝑇t\in\{1,2,\dots,T\}italic_t ∈ { 1 , 2 , … , italic_T }.

Proof. Under Assumptions 1 and 2,

𝐳(t)𝐳(1)=Φ(u(t),θ(t))𝐲^(t)𝐏^w(t)\displaystyle\|\operatorname{\mathbf{z}}(t)-\operatorname{\mathbf{z}}(1)\|=\|% \Phi(\text{u}(t),\theta(t))-\widehat{\mathbf{y}}(t)-\widehat{\mathbf{P}}w(t)\ -∥ bold_z ( italic_t ) - bold_z ( 1 ) ∥ = ∥ roman_Φ ( u ( italic_t ) , italic_θ ( italic_t ) ) - over^ start_ARG bold_y end_ARG ( italic_t ) - over^ start_ARG bold_P end_ARG italic_w ( italic_t ) -
Φ(u(1),θ(1))+𝐲^(1)+𝐏^w(1)\displaystyle\hskip 86.72377pt\Phi(\text{u}(1),\theta(1))+\widehat{\mathbf{y}}% (1)+\widehat{\mathbf{P}}w(1)\|roman_Φ ( u ( 1 ) , italic_θ ( 1 ) ) + over^ start_ARG bold_y end_ARG ( 1 ) + over^ start_ARG bold_P end_ARG italic_w ( 1 ) ∥
(Φ(u(t),θ(t))𝐏^w(t))(Φ(u(1),θ(1))𝐏^w(1))absentnormΦu𝑡𝜃𝑡^𝐏𝑤𝑡Φu1𝜃1^𝐏𝑤1\displaystyle\leq\|\big{(}\Phi(\text{u}(t),\theta(t))-\widehat{\mathbf{P}}w(t)% \big{)}\ -\big{(}\Phi(\text{u}(1),\theta(1))-\widehat{\mathbf{P}}w(1)\big{)}\|≤ ∥ ( roman_Φ ( u ( italic_t ) , italic_θ ( italic_t ) ) - over^ start_ARG bold_P end_ARG italic_w ( italic_t ) ) - ( roman_Φ ( u ( 1 ) , italic_θ ( 1 ) ) - over^ start_ARG bold_P end_ARG italic_w ( 1 ) ) ∥
+𝐲^(t)𝐲^(1)norm^𝐲𝑡^𝐲1\displaystyle\hskip 14.45377pt+\|\widehat{\mathbf{y}}(t)-\widehat{\mathbf{y}}(% 1)\|+ ∥ over^ start_ARG bold_y end_ARG ( italic_t ) - over^ start_ARG bold_y end_ARG ( 1 ) ∥
=Lm[u(t)𝐱(t)][u(1)𝐱(1)]+𝐲^(t)𝐲^(1).absentsubscript𝐿𝑚normdelimited-[]u𝑡missing-subexpression𝐱𝑡missing-subexpressiondelimited-[]u1missing-subexpression𝐱1missing-subexpressionnorm^𝐲𝑡^𝐲1\displaystyle=L_{m}\left\|\left[\!\!\begin{array}[]{cc}\text{u}(t)\\ \mathbf{x}(t)\end{array}\!\!\right]-\left[\!\!\begin{array}[]{cc}\text{u}(1)\\ \mathbf{x}(1)\end{array}\!\!\right]\right\|+\|\widehat{\mathbf{y}}(t)-\widehat% {\mathbf{y}}(1)\|.= italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ∥ [ start_ARRAY start_ROW start_CELL u ( italic_t ) end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL bold_x ( italic_t ) end_CELL start_CELL end_CELL end_ROW end_ARRAY ] - [ start_ARRAY start_ROW start_CELL u ( 1 ) end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL bold_x ( 1 ) end_CELL start_CELL end_CELL end_ROW end_ARRAY ] ∥ + ∥ over^ start_ARG bold_y end_ARG ( italic_t ) - over^ start_ARG bold_y end_ARG ( 1 ) ∥ .

Therefore, 𝐳(t)𝐳(1)2y¯+2Lm(u¯+Dm)norm𝐳𝑡𝐳12¯y2subscript𝐿𝑚¯usubscript𝐷𝑚\|\operatorname{\mathbf{z}}(t)-\operatorname{\mathbf{z}}(1)\|\leq 2\overline{% \text{y}}+2L_{m}(\overline{\text{u}}+D_{m})∥ bold_z ( italic_t ) - bold_z ( 1 ) ∥ ≤ 2 over¯ start_ARG y end_ARG + 2 italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( over¯ start_ARG u end_ARG + italic_D start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ), where the results of Claims 1 and 2 are applied with Dm:=maxtDiam.(Cft(𝐱(t)))D_{m}:=\max_{t}Diam_{\|.\|}(C_{f_{t}(\mathbf{x}(t))})italic_D start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT := roman_max start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_D italic_i italic_a italic_m start_POSTSUBSCRIPT ∥ . ∥ end_POSTSUBSCRIPT ( italic_C start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) ) end_POSTSUBSCRIPT ) and Lm:=max1iNLiassignsubscript𝐿𝑚subscript1𝑖𝑁subscript𝐿𝑖L_{m}:=\max_{1\leq i\leq N}L_{i}italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT := roman_max start_POSTSUBSCRIPT 1 ≤ italic_i ≤ italic_N end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Therefore, there exists a 𝐳¯<¯𝐳\bar{\operatorname{\mathbf{z}}}<\inftyover¯ start_ARG bold_z end_ARG < ∞ that bounds 𝐳(t)norm𝐳𝑡\|\operatorname{\mathbf{z}}(t)\|∥ bold_z ( italic_t ) ∥ for all t{1,2,,T}.𝑡12𝑇t\in\{1,2,\dots,T\}.\qeditalic_t ∈ { 1 , 2 , … , italic_T } . italic_∎

With all the claims, we circle back to the proof of Lemma 2. At any time index t𝑡titalic_t,

ftsubscript𝑓𝑡\displaystyle\nabla f_{t}∇ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT =[θftwft]=[θΦ(u(t),θ(t))𝐳(t)𝐏^𝐳(t)].absentdelimited-[]subscript𝜃subscript𝑓𝑡missing-subexpressionsubscript𝑤subscript𝑓𝑡missing-subexpressiondelimited-[]subscript𝜃Φsuperscriptu𝑡𝜃𝑡top𝐳𝑡missing-subexpressionsuperscript^𝐏top𝐳𝑡missing-subexpression\displaystyle=\left[\begin{array}[]{cc}\nabla_{\theta}f_{t}\\ \nabla_{w}f_{t}\end{array}\right]=\left[\begin{array}[]{cc}\nabla_{\theta}\Phi% (\text{u}(t),\theta(t))^{\top}\operatorname{\mathbf{z}}(t)\\ -\widehat{\mathbf{P}}^{\top}\operatorname{\mathbf{z}}(t)\end{array}\right].= [ start_ARRAY start_ROW start_CELL ∇ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL ∇ start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_CELL start_CELL end_CELL end_ROW end_ARRAY ] = [ start_ARRAY start_ROW start_CELL ∇ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT roman_Φ ( u ( italic_t ) , italic_θ ( italic_t ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_z ( italic_t ) end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL - over^ start_ARG bold_P end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_z ( italic_t ) end_CELL start_CELL end_CELL end_ROW end_ARRAY ] .

Thus, ft2θΦ(u(t),θ(t))𝐳(t)2+𝐏^𝐳(t)2+2θΦ(u(t),θ(t))𝐳(t)𝐏^𝐳(t)(NLm+𝐏^)2𝐳¯2superscriptnormsubscript𝑓𝑡2superscriptnormsubscript𝜃Φsuperscriptu𝑡𝜃𝑡top𝐳𝑡2superscriptnormsuperscript^𝐏top𝐳𝑡22normsubscript𝜃Φsuperscriptu𝑡𝜃𝑡top𝐳𝑡normsuperscript^𝐏top𝐳𝑡superscript𝑁subscript𝐿𝑚norm^𝐏2superscript¯𝐳2\|\nabla f_{t}\|^{2}\leq\|\nabla_{\theta}\Phi(\text{u}(t),\theta(t))^{\top}% \operatorname{\mathbf{z}}(t)\|^{2}+\|\widehat{\mathbf{P}}^{\top}\operatorname{% \mathbf{z}}(t)\|^{2}+2\|\nabla_{\theta}\Phi(\text{u}(t),\theta(t))^{\top}% \operatorname{\mathbf{z}}(t)\|\|\widehat{\mathbf{P}}^{\top}\operatorname{% \mathbf{z}}(t)\|\leq(NL_{m}+\|\widehat{\mathbf{P}}\|)^{2}\overline{% \operatorname{\mathbf{z}}}^{2}∥ ∇ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ ∥ ∇ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT roman_Φ ( u ( italic_t ) , italic_θ ( italic_t ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_z ( italic_t ) ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ∥ over^ start_ARG bold_P end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_z ( italic_t ) ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 2 ∥ ∇ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT roman_Φ ( u ( italic_t ) , italic_θ ( italic_t ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_z ( italic_t ) ∥ ∥ over^ start_ARG bold_P end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_z ( italic_t ) ∥ ≤ ( italic_N italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT + ∥ over^ start_ARG bold_P end_ARG ∥ ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT over¯ start_ARG bold_z end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, where we used Assumption 2 and claim 3. Hence, ft(NLm+𝐏^)𝐳¯:=δnormsubscript𝑓𝑡𝑁subscript𝐿𝑚norm^𝐏¯𝐳assign𝛿\|\nabla f_{t}\|\leq(NL_{m}+\|\widehat{\mathbf{P}}\|)\overline{\operatorname{% \mathbf{z}}}:=\delta∥ ∇ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ ≤ ( italic_N italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT + ∥ over^ start_ARG bold_P end_ARG ∥ ) over¯ start_ARG bold_z end_ARG := italic_δ. This completes the proof. ∎

Theorem 1.

(Regret of Algorithm 1). Let Assumptions 1-4 hold. Let 𝐱argmin𝐱t=1Tft(𝐱)superscript𝐱subscriptargmin𝐱superscriptsubscript𝑡1𝑇subscript𝑓𝑡𝐱\mathbf{x}^{\star}\in\operatorname*{argmin}_{\mathbf{x}}\sum_{t=1}^{T}f_{t}(% \mathbf{x})bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∈ roman_argmin start_POSTSUBSCRIPT bold_x end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ) and ηt=c1tsubscript𝜂𝑡subscript𝑐1𝑡\eta_{t}=\frac{c_{1}}{\sqrt{t}}italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = divide start_ARG italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG square-root start_ARG italic_t end_ARG end_ARG, c1>0subscript𝑐10c_{1}>0italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT > 0. Then, the regret of Algorithm 1 after any time T𝑇Titalic_T is bounded. In particular,

T=δ1T2δ22=O(T),subscript𝑇subscript𝛿1𝑇2subscript𝛿22𝑂𝑇\displaystyle\mathcal{R}_{T}=\frac{\delta_{1}\sqrt{T}}{2}-\frac{\delta_{2}}{2}% =O(\sqrt{T}),caligraphic_R start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT = divide start_ARG italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT square-root start_ARG italic_T end_ARG end_ARG start_ARG 2 end_ARG - divide start_ARG italic_δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG = italic_O ( square-root start_ARG italic_T end_ARG ) ,

where, δ1subscript𝛿1\delta_{1}italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and δ2subscript𝛿2\delta_{2}italic_δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT are some positive finite constants. Therefore, lim supTT/T0subscriptlimit-supremum𝑇subscript𝑇𝑇0\limsup_{T\to\infty}\mathcal{R}_{T}/T\rightarrow 0lim sup start_POSTSUBSCRIPT italic_T → ∞ end_POSTSUBSCRIPT caligraphic_R start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT / italic_T → 0.

Proof.

Let 𝐱argmin𝐱t=1Tft(𝐱)superscript𝐱subscriptargmin𝐱superscriptsubscript𝑡1𝑇subscript𝑓𝑡𝐱\mathbf{x}^{\star}\in\operatorname*{argmin}_{\mathbf{x}}\sum_{t=1}^{T}f_{t}(% \mathbf{x})bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∈ roman_argmin start_POSTSUBSCRIPT bold_x end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ). Consider the update in Algorithm 1, 𝐱(t+1)𝐱=𝐱(t)ηtft(𝐱(t))𝐱𝐱𝑡1superscript𝐱𝐱𝑡subscript𝜂𝑡subscript𝑓𝑡𝐱𝑡superscript𝐱\mathbf{x}(t+1)-\mathbf{x}^{\star}=\mathbf{x}(t)-\eta_{t}\nabla f_{t}(\mathbf{% x}(t))-\mathbf{x}^{\star}bold_x ( italic_t + 1 ) - bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT = bold_x ( italic_t ) - italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∇ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) ) - bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT, then

𝐱(t+1)𝐱2superscriptnorm𝐱𝑡1superscript𝐱2\displaystyle\|\mathbf{x}(t+1)-\mathbf{x}^{\star}\|^{2}∥ bold_x ( italic_t + 1 ) - bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT 𝐱(t)𝐱2+ηt2ft(𝐱(t))2absentsuperscriptnorm𝐱𝑡superscript𝐱2superscriptsubscript𝜂𝑡2superscriptnormsubscript𝑓𝑡𝐱𝑡2\displaystyle\leq\|\mathbf{x}(t)-\mathbf{x}^{\star}\|^{2}+\eta_{t}^{2}\|\nabla f% _{t}(\mathbf{x}(t))\|^{2}≤ ∥ bold_x ( italic_t ) - bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∥ ∇ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) ) ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
2ηtft(𝐱(t))(𝐱(t)𝐱).2subscript𝜂𝑡subscript𝑓𝑡superscript𝐱𝑡top𝐱𝑡superscript𝐱\displaystyle\hskip 28.90755pt-2\eta_{t}\nabla f_{t}(\mathbf{x}(t))^{\top}(% \mathbf{x}(t)-\mathbf{x}^{\star}).- 2 italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∇ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( bold_x ( italic_t ) - bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) .

From Lemma 2, there exists δ=:suptft(𝐱(t))<\delta=:\sup_{t}\|\nabla f_{t}(\mathbf{x}(t))\|<\inftyitalic_δ = : roman_sup start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ ∇ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) ) ∥ < ∞,

𝐱(t+1)𝐱2𝐱(t)𝐱2+ηt2δ2superscriptnorm𝐱𝑡1superscript𝐱2superscriptnorm𝐱𝑡superscript𝐱2superscriptsubscript𝜂𝑡2superscript𝛿2\displaystyle\|\mathbf{x}(t+1)-\mathbf{x}^{\star}\|^{2}\leq\|\mathbf{x}(t)-% \mathbf{x}^{\star}\|^{2}+\eta_{t}^{2}\delta^{2}∥ bold_x ( italic_t + 1 ) - bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ ∥ bold_x ( italic_t ) - bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
2ηtft(𝐱(t))(𝐱(t)𝐱)2subscript𝜂𝑡subscript𝑓𝑡superscript𝐱𝑡top𝐱𝑡superscript𝐱\displaystyle\hskip 7.22743pt-2\eta_{t}\nabla f_{t}(\mathbf{x}(t))^{\top}(% \mathbf{x}(t)-\mathbf{x}^{\star})- 2 italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∇ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( bold_x ( italic_t ) - bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT )
ft(𝐱(t))(𝐱(t)𝐱)absentsubscript𝑓𝑡superscript𝐱𝑡top𝐱𝑡superscript𝐱\displaystyle\Longrightarrow\nabla f_{t}(\mathbf{x}(t))^{\top}(\mathbf{x}(t)-% \mathbf{x}^{\star})⟹ ∇ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( bold_x ( italic_t ) - bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) (16)
𝐱(t)𝐱2𝐱(t+1)𝐱22ηt+ηtδ22absentsuperscriptnorm𝐱𝑡superscript𝐱2superscriptnorm𝐱𝑡1superscript𝐱22subscript𝜂𝑡subscript𝜂𝑡superscript𝛿22\displaystyle\hskip 21.68121pt\leq\frac{\|\mathbf{x}(t)-\mathbf{x}^{\star}\|^{% 2}-\|\mathbf{x}(t+1)-\mathbf{x}^{\star}\|^{2}}{2\eta_{t}}+\frac{\eta_{t}\delta% ^{2}}{2}≤ divide start_ARG ∥ bold_x ( italic_t ) - bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - ∥ bold_x ( italic_t + 1 ) - bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG + divide start_ARG italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG

By convexity of ftsubscript𝑓𝑡f_{t}italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT,

ft(𝐱(t))ft(𝐱)ft(𝐱(t))(𝐱(t)𝐱).subscript𝑓𝑡𝐱𝑡subscript𝑓𝑡superscript𝐱subscript𝑓𝑡superscript𝐱𝑡top𝐱𝑡superscript𝐱\displaystyle f_{t}(\mathbf{x}(t))-f_{t}(\mathbf{x}^{\star})\leq\nabla f_{t}(% \mathbf{x}(t))^{\top}(\mathbf{x}(t)-\mathbf{x}^{\star}).italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) ) - italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ≤ ∇ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( bold_x ( italic_t ) - bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) . (17)

Combining (16) and (17) gives

ft(𝐱(t))ft(𝐱)subscript𝑓𝑡𝐱𝑡subscript𝑓𝑡superscript𝐱\displaystyle f_{t}(\mathbf{x}(t))\!-\!f_{t}(\mathbf{x}^{\star})italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) ) - italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) 𝐱(t)𝐱2𝐱(t+1)𝐱22ηt+ηtδ22.absentsuperscriptnorm𝐱𝑡superscript𝐱2superscriptnorm𝐱𝑡1superscript𝐱22subscript𝜂𝑡subscript𝜂𝑡superscript𝛿22\displaystyle\leq\frac{\|\mathbf{x}(t)\!-\!\mathbf{x}^{\star}\|^{2}-\|\mathbf{% x}(t\!+\!1)\!-\!\mathbf{x}^{\star}\|^{2}}{2\eta_{t}}+\frac{\eta_{t}\delta^{2}}% {2}.≤ divide start_ARG ∥ bold_x ( italic_t ) - bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - ∥ bold_x ( italic_t + 1 ) - bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG + divide start_ARG italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG .

Summing over t=1𝑡1t=1italic_t = 1 to T𝑇Titalic_T,

Tsubscript𝑇\displaystyle\mathcal{R}_{T}caligraphic_R start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT =t=1Tft(𝐱(t))t=1Tft(𝐱)absentsuperscriptsubscript𝑡1𝑇subscript𝑓𝑡𝐱𝑡superscriptsubscript𝑡1𝑇subscript𝑓𝑡superscript𝐱\displaystyle=\sum_{t=1}^{T}f_{t}(\mathbf{x}(t))-\sum_{t=1}^{T}f_{t}(\mathbf{x% }^{\star})= ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) ) - ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT )
t=1T(𝐱(t)𝐱2𝐱(t+1)𝐱22ηt)+t=1Tηtδ22absentsuperscriptsubscript𝑡1𝑇superscriptnorm𝐱𝑡superscript𝐱2superscriptnorm𝐱𝑡1superscript𝐱22subscript𝜂𝑡superscriptsubscript𝑡1𝑇subscript𝜂𝑡superscript𝛿22\displaystyle\leq\sum_{t=1}^{T}\left(\frac{\|\mathbf{x}(t)-\mathbf{x}^{\star}% \|^{2}-\|\mathbf{x}(t+1)-\mathbf{x}^{\star}\|^{2}}{2\eta_{t}}\right)+\sum_{t=1% }^{T}\frac{\eta_{t}\delta^{2}}{2}≤ ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( divide start_ARG ∥ bold_x ( italic_t ) - bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - ∥ bold_x ( italic_t + 1 ) - bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG ) + ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT divide start_ARG italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG
=𝐱(1)𝐱22η1𝐱(T+1)𝐱22ηT+δ22t=1Tηtabsentsuperscriptnorm𝐱1superscript𝐱22subscript𝜂1superscriptnorm𝐱𝑇1superscript𝐱22subscript𝜂𝑇superscript𝛿22superscriptsubscript𝑡1𝑇subscript𝜂𝑡\displaystyle=\frac{\|\mathbf{x}(1)-\mathbf{x}^{\star}\|^{2}}{2\eta_{1}}-\frac% {\|\mathbf{x}(T+1)-\mathbf{x}^{\star}\|^{2}}{2\eta_{T}}+\frac{\delta^{2}}{2}% \sum_{t=1}^{T}\eta_{t}= divide start_ARG ∥ bold_x ( 1 ) - bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_η start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG - divide start_ARG ∥ bold_x ( italic_T + 1 ) - bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_η start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT end_ARG + divide start_ARG italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT
+12t=2T𝐱(t)𝐱2(1ηt1ηt1).12superscriptsubscript𝑡2𝑇superscriptnorm𝐱𝑡superscript𝐱21subscript𝜂𝑡1subscript𝜂𝑡1\displaystyle+\frac{1}{2}\sum_{t=2}^{T}\|\mathbf{x}(t)-\mathbf{x}^{\star}\|^{2% }\left(\frac{1}{\eta_{t}}-\frac{1}{\eta_{t-1}}\right).+ divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∑ start_POSTSUBSCRIPT italic_t = 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ∥ bold_x ( italic_t ) - bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( divide start_ARG 1 end_ARG start_ARG italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG - divide start_ARG 1 end_ARG start_ARG italic_η start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT end_ARG ) .

From claims 1 and 2 in Lemma 2, there exists Ξ<Ξ\Xi<\inftyroman_Ξ < ∞ such that supt𝐱(t)𝐱supt𝐱(t)+𝐱Ξsubscriptsupremum𝑡norm𝐱𝑡superscript𝐱subscriptsupremum𝑡norm𝐱𝑡normsuperscript𝐱Ξ\sup_{t}\|\mathbf{x}(t)-\mathbf{x}^{\star}\|\leq\sup_{t}\|\mathbf{x}(t)\|+\|% \mathbf{x}^{\star}\|\leq\Xiroman_sup start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ bold_x ( italic_t ) - bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ ≤ roman_sup start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ bold_x ( italic_t ) ∥ + ∥ bold_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ ≤ roman_Ξ. Therefore,

Tsubscript𝑇\displaystyle\mathcal{R}_{T}caligraphic_R start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT Ξ22(1η1+t=2T(1ηt1ηt1))+δ22t=1TηtabsentsuperscriptΞ221subscript𝜂1superscriptsubscript𝑡2𝑇1subscript𝜂𝑡1subscript𝜂𝑡1superscript𝛿22superscriptsubscript𝑡1𝑇subscript𝜂𝑡\displaystyle\textstyle\leq\frac{\Xi^{2}}{2}\left(\frac{1}{\eta_{1}}+\sum_{t=2% }^{T}\left(\frac{1}{\eta_{t}}-\frac{1}{\eta_{t-1}}\right)\right)+\frac{\delta^% {2}}{2}\sum_{t=1}^{T}\eta_{t}≤ divide start_ARG roman_Ξ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG ( divide start_ARG 1 end_ARG start_ARG italic_η start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG + ∑ start_POSTSUBSCRIPT italic_t = 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( divide start_ARG 1 end_ARG start_ARG italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG - divide start_ARG 1 end_ARG start_ARG italic_η start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT end_ARG ) ) + divide start_ARG italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT
=Ξ22ηT+δ22t=1Tηt.absentsuperscriptΞ22subscript𝜂𝑇superscript𝛿22superscriptsubscript𝑡1𝑇subscript𝜂𝑡\displaystyle=\textstyle\frac{\Xi^{2}}{2\eta_{T}}+\frac{\delta^{2}}{2}\sum_{t=% 1}^{T}\eta_{t}.= divide start_ARG roman_Ξ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_η start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT end_ARG + divide start_ARG italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT .

For ηt=c1t,t=1Tηt=t=1Tc1t1+t=1Tc1t𝑑t1+[2c1t]1T2c1T+12c1formulae-sequencesubscript𝜂𝑡subscript𝑐1𝑡superscriptsubscript𝑡1𝑇subscript𝜂𝑡superscriptsubscript𝑡1𝑇subscript𝑐1𝑡1superscriptsubscript𝑡1𝑇subscript𝑐1𝑡differential-d𝑡1superscriptsubscriptdelimited-[]2subscript𝑐1𝑡1𝑇2subscript𝑐1𝑇12subscript𝑐1\eta_{t}=\frac{c_{1}}{\sqrt{t}},\sum_{t=1}^{T}\eta_{t}=\sum_{t=1}^{T}\frac{c_{% 1}}{\sqrt{t}}\leq 1+\int_{t=1}^{T}\frac{c_{1}}{\sqrt{t}}dt\leq 1+[2c_{1}\sqrt{% t}]_{1}^{T}\leq 2c_{1}\sqrt{T}+1-2c_{1}italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = divide start_ARG italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG square-root start_ARG italic_t end_ARG end_ARG , ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT divide start_ARG italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG square-root start_ARG italic_t end_ARG end_ARG ≤ 1 + ∫ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT divide start_ARG italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG square-root start_ARG italic_t end_ARG end_ARG italic_d italic_t ≤ 1 + [ 2 italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT square-root start_ARG italic_t end_ARG ] start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ≤ 2 italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT square-root start_ARG italic_T end_ARG + 1 - 2 italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. Thus,

T(Ξ2/c1+2δ2c1)T2(2c11)δ22.subscript𝑇superscriptΞ2subscript𝑐12superscript𝛿2subscript𝑐1𝑇22subscript𝑐11superscript𝛿22\displaystyle\mathcal{R}_{T}\leq\frac{(\Xi^{2}/c_{1}+2\delta^{2}c_{1})\sqrt{T}% }{2}-\frac{(2c_{1}-1)\delta^{2}}{2}.caligraphic_R start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT ≤ divide start_ARG ( roman_Ξ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 2 italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) square-root start_ARG italic_T end_ARG end_ARG start_ARG 2 end_ARG - divide start_ARG ( 2 italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - 1 ) italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG .

Therefore, lim supTT/T0subscriptlimit-supremum𝑇subscript𝑇𝑇0\limsup_{T\to\infty}\mathcal{R}_{T}/T\rightarrow 0lim sup start_POSTSUBSCRIPT italic_T → ∞ end_POSTSUBSCRIPT caligraphic_R start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT / italic_T → 0. ∎

The result of Theorem 1 establishes that Algorithm 1 provides an estimated output close to the estimated output derived via a best-fixed model in hindsight and thus solves the adaptive model update problem. Next, we elucidate a methodology for implementing the Algorithm 1 within a distributed framework.

Up to this point, we have presented Algorithm 1 to solve the model update problem under online experimental scenarios. In the following, we present how Algorithm 1 can be implemented distributively. Consider a communication network 𝒢c(𝒱c,c)superscript𝒢𝑐superscript𝒱𝑐superscript𝑐\mathcal{G}^{c}(\mathcal{V}^{c},\mathcal{E}^{c})caligraphic_G start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ( caligraphic_V start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT , caligraphic_E start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ) with 𝒱c=𝒱{0},|𝒱c|=N+1formulae-sequencesuperscript𝒱𝑐𝒱0superscript𝒱𝑐𝑁1\mathcal{V}^{c}=\mathcal{V}\cup\{0\},|\mathcal{V}^{c}|=N+1caligraphic_V start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT = caligraphic_V ∪ { 0 } , | caligraphic_V start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT | = italic_N + 1, c={(0,1),(0,2),,(0,N)}(N+1)×(N+1)superscript𝑐01020𝑁𝑁1𝑁1\mathcal{E}^{c}=\mathcal{E}\cup\{(0,1),(0,2),\dots,(0,N)\}\subseteq(N+1)\times% (N+1)caligraphic_E start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT = caligraphic_E ∪ { ( 0 , 1 ) , ( 0 , 2 ) , … , ( 0 , italic_N ) } ⊆ ( italic_N + 1 ) × ( italic_N + 1 ). The node index 00 is the fusion center to which all the sensor measurements are relayed. There are two kinds of communication links in the graph 𝒢csuperscript𝒢𝑐\mathcal{G}^{c}caligraphic_G start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT (a) (i,j)𝑖𝑗(i,j)\in\mathcal{E}( italic_i , italic_j ) ∈ caligraphic_E with i,j{1,2,,N}𝑖𝑗12𝑁i,j\in\{1,2,\dots,N\}italic_i , italic_j ∈ { 1 , 2 , … , italic_N } and (b) (0,j)c0𝑗superscript𝑐(0,j)\in\mathcal{E}^{c}( 0 , italic_j ) ∈ caligraphic_E start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT with j{1,2,,N}𝑗12𝑁j\in\{1,2,\dots,N\}italic_j ∈ { 1 , 2 , … , italic_N }. The sub-systems communicate with each other via the link of kind (a) and the fusion center communicates with all the sub-systems via the communication links of the form (b). The fusion center communicates the measurement y^(t)𝐝𝐲^𝑦𝑡superscriptsubscript𝐝𝐲\widehat{y}(t)\in\mathbb{R}^{\mathbf{d_{y}}}over^ start_ARG italic_y end_ARG ( italic_t ) ∈ blackboard_R start_POSTSUPERSCRIPT bold_d start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT end_POSTSUPERSCRIPT to all the sub-systems i{1,2,,N}𝑖12𝑁i\in\{1,2,\dots,N\}italic_i ∈ { 1 , 2 , … , italic_N }. The updates in Algorithm 1 utilizes ft(𝐱(t))subscript𝑓𝑡𝐱𝑡\nabla f_{t}(\mathbf{x}(t))∇ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) ). From (10) and (11), we have

ft(𝐱(t))=[θΦ(u(t),θ(t))𝐳(t)𝐏^𝐳(t)].subscript𝑓𝑡𝐱𝑡delimited-[]subscript𝜃Φsuperscriptu𝑡𝜃𝑡top𝐳𝑡missing-subexpressionsuperscript^𝐏top𝐳𝑡missing-subexpression\displaystyle\nabla f_{t}(\mathbf{x}(t))=\left[\begin{array}[]{cc}\nabla_{% \theta}\Phi(\text{u}(t),\theta(t))^{\top}\operatorname{\mathbf{z}}(t)\\ -\widehat{\mathbf{P}}^{\top}\operatorname{\mathbf{z}}(t)\end{array}\right].∇ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( bold_x ( italic_t ) ) = [ start_ARRAY start_ROW start_CELL ∇ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT roman_Φ ( u ( italic_t ) , italic_θ ( italic_t ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_z ( italic_t ) end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL - over^ start_ARG bold_P end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_z ( italic_t ) end_CELL start_CELL end_CELL end_ROW end_ARRAY ] . (20)

Note that 𝐳(t)𝐳𝑡\operatorname{\mathbf{z}}(t)bold_z ( italic_t ) can be decomposed as, 𝐳=[𝐳1(t);𝐳2(t);;\operatorname{\mathbf{z}}=[\operatorname{\mathbf{z}}_{1}(t);\operatorname{% \mathbf{z}}_{2}(t);\dots;bold_z = [ bold_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) ; bold_z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_t ) ; … ; 𝐳N(t)]\operatorname{\mathbf{z}}_{N}(t)]bold_z start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ( italic_t ) ], where for i,j{1,2,,N}𝑖𝑗12𝑁i,j\in\{1,2,\dots,N\}italic_i , italic_j ∈ { 1 , 2 , … , italic_N },

𝐳i(t)=ϕi(ui(t),θi(t))y^(t)(𝐏^iiwi(t)+j𝒩i𝐏^ijwj(t)).subscript𝐳𝑖𝑡subscriptitalic-ϕ𝑖subscriptu𝑖𝑡subscript𝜃𝑖𝑡^𝑦𝑡subscript^𝐏𝑖𝑖subscript𝑤𝑖𝑡subscript𝑗subscript𝒩𝑖subscript^𝐏𝑖𝑗subscript𝑤𝑗𝑡\displaystyle\operatorname{\mathbf{z}}_{i}(t)=\phi_{i}(\text{u}_{i}(t),\theta_% {i}(t))-\widehat{y}(t)-(\widehat{\mathbf{P}}_{ii}w_{i}(t)+\sum_{j\in% \operatorname{\mathcal{N}}_{i}}\widehat{\mathbf{P}}_{ij}w_{j}(t)).bold_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) = italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) , italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) ) - over^ start_ARG italic_y end_ARG ( italic_t ) - ( over^ start_ARG bold_P end_ARG start_POSTSUBSCRIPT italic_i italic_i end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) + ∑ start_POSTSUBSCRIPT italic_j ∈ caligraphic_N start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT over^ start_ARG bold_P end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ) ) .

A closer examination of (20) yields that ftsubscript𝑓𝑡\nabla f_{t}∇ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT can be further written as, ft=[(1ft);(2ft);;(Nft)]subscript𝑓𝑡superscriptsubscript1subscript𝑓𝑡topsuperscriptsubscript2subscript𝑓𝑡topsuperscriptsubscript𝑁subscript𝑓𝑡top\nabla f_{t}=[(\nabla_{1}f_{t})^{\top};(\nabla_{2}f_{t})^{\top};\dots;(\nabla_% {N}f_{t})^{\top}]∇ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = [ ( ∇ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ; ( ∇ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ; … ; ( ∇ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ], where

ift=[θiϕ(ui(t),θi(t))𝐳i(t)𝐏^ii𝐳i(t)j𝒩i𝐏^ji𝐳j(t)],subscript𝑖subscript𝑓𝑡delimited-[]subscriptsubscript𝜃𝑖italic-ϕsuperscriptsubscriptu𝑖𝑡subscript𝜃𝑖𝑡topsubscript𝐳𝑖𝑡missing-subexpressionsubscript^𝐏𝑖𝑖subscript𝐳𝑖𝑡subscript𝑗subscript𝒩𝑖subscript^𝐏𝑗𝑖subscript𝐳𝑗𝑡missing-subexpression\displaystyle\nabla_{i}f_{t}=\left[\begin{array}[]{cc}\nabla_{\theta_{i}}\phi(% \text{u}_{i}(t),\theta_{i}(t))^{\top}\operatorname{\mathbf{z}}_{i}(t)\\ -\widehat{\mathbf{P}}_{ii}\operatorname{\mathbf{z}}_{i}(t)-\sum_{j\in% \operatorname{\mathcal{N}}_{i}}\widehat{\mathbf{P}}_{ji}\operatorname{\mathbf{% z}}_{j}(t)\end{array}\right],∇ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = [ start_ARRAY start_ROW start_CELL ∇ start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_ϕ ( u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) , italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL - over^ start_ARG bold_P end_ARG start_POSTSUBSCRIPT italic_i italic_i end_POSTSUBSCRIPT bold_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) - ∑ start_POSTSUBSCRIPT italic_j ∈ caligraphic_N start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT over^ start_ARG bold_P end_ARG start_POSTSUBSCRIPT italic_j italic_i end_POSTSUBSCRIPT bold_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ) end_CELL start_CELL end_CELL end_ROW end_ARRAY ] , (23)

for all i{1,2,,N}.𝑖12𝑁i\in\{1,2,\dots,N\}.italic_i ∈ { 1 , 2 , … , italic_N } . Thus, using (23) the updates in Algorithm 1 can be implemented in a distributed manner at any sub-system i𝑖iitalic_i while maintaining an auxiliary variable 𝐳isubscript𝐳𝑖\operatorname{\mathbf{z}}_{i}bold_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT as shown in Algorithm 2.

For t=0,1,2,𝑡012bold-…t=0,1,2,\dotsitalic_t = 0 , 1 , 2 , bold_…
       - Receive y^(t)^𝑦𝑡\widehat{y}(t)over^ start_ARG italic_y end_ARG ( italic_t ) from the fusion center
       - Given θi(t)θi,wi(t),wj(t)𝐝𝐲,j𝒩iformulae-sequencesubscript𝜃𝑖𝑡superscriptsubscript𝜃𝑖subscript𝑤𝑖𝑡formulae-sequencesubscript𝑤𝑗𝑡superscriptsubscript𝐝𝐲𝑗subscript𝒩𝑖\theta_{i}(t)\in\mathbb{R}^{\theta_{i}},w_{i}(t),w_{j}(t)\in\mathbb{R}^{% \mathbf{d_{y}}},j\in\operatorname{\mathcal{N}}_{i}italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) , italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ) ∈ blackboard_R start_POSTSUPERSCRIPT bold_d start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , italic_j ∈ caligraphic_N start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT
       - 𝐳i(t)=ϕi(ui(t),θi(t))y^(t)j𝒩i{i}𝐏^ijwj(t)subscript𝐳𝑖𝑡subscriptitalic-ϕ𝑖subscriptu𝑖𝑡subscript𝜃𝑖𝑡^𝑦𝑡subscript𝑗subscript𝒩𝑖𝑖subscript^𝐏𝑖𝑗subscript𝑤𝑗𝑡\operatorname{\mathbf{z}}_{i}(t)=\phi_{i}(\text{u}_{i}(t),\theta_{i}(t))-% \widehat{y}(t)-\displaystyle\sum_{j\in\operatorname{\mathcal{N}}_{i}\cup\{i\}}% \widehat{\mathbf{P}}_{ij}w_{j}(t)bold_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) = italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) , italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) ) - over^ start_ARG italic_y end_ARG ( italic_t ) - ∑ start_POSTSUBSCRIPT italic_j ∈ caligraphic_N start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∪ { italic_i } end_POSTSUBSCRIPT over^ start_ARG bold_P end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t )
       - θi(t+1)=θi(t)ηtθiϕi(ui(t),θi(t))𝐳i(t)subscript𝜃𝑖𝑡1subscript𝜃𝑖𝑡subscript𝜂𝑡subscriptsubscript𝜃𝑖subscriptitalic-ϕ𝑖superscriptsubscriptu𝑖𝑡subscript𝜃𝑖𝑡topsubscript𝐳𝑖𝑡\theta_{i}(t+1)=\theta_{i}(t)-\eta_{t}\nabla_{\theta_{i}}\phi_{i}(\text{u}_{i}% (t),\theta_{i}(t))^{\top}\operatorname{\mathbf{z}}_{i}(t)italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t + 1 ) = italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) - italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∇ start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) , italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t )
       - wi(t+1)=wi(t)+ηtj𝒩i{i}𝐏^ji𝐳j(t)subscript𝑤𝑖𝑡1subscript𝑤𝑖𝑡subscript𝜂𝑡subscript𝑗subscript𝒩𝑖𝑖subscript^𝐏𝑗𝑖subscript𝐳𝑗𝑡w_{i}(t+1)=w_{i}(t)+\eta_{t}\displaystyle\sum_{j\in\operatorname{\mathcal{N}}_% {i}\cup\{i\}}\widehat{\mathbf{P}}_{ji}\operatorname{\mathbf{z}}_{j}(t)italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t + 1 ) = italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) + italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_j ∈ caligraphic_N start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∪ { italic_i } end_POSTSUBSCRIPT over^ start_ARG bold_P end_ARG start_POSTSUBSCRIPT italic_j italic_i end_POSTSUBSCRIPT bold_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t )
Algorithm 2 Distributed Online Input-Output Map Update at Sub-system i𝑖iitalic_i

In Algorithm 2, each agent i𝑖iitalic_i engages in two rounds of communication on auxiliary variables 𝐳isubscript𝐳𝑖\operatorname{\mathbf{z}}_{i}bold_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and wisubscript𝑤𝑖w_{i}italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Importantly, the exchange of 𝐳isubscript𝐳𝑖\operatorname{\mathbf{z}}_{i}bold_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and wisubscript𝑤𝑖w_{i}italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT among agents does not allow for the reconstruction of the model parameters θisubscript𝜃𝑖\theta_{i}italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT or the local input data. As a result, the information transmitted across the communication network does not divulge any direct details regarding the sub-system’s parameters or local data, thereby bolstering the privacy and security of the individual sub-systems.

V Controller with Online Model Updates

In this section, we provide an extension of the control problem (6) beyond the fixed input-output map for the entire time horizon T𝑇Titalic_T. We consider the input-output map in (6) to be frequently updated based on the latest control decision and measurement. The following formulation captures the time-varying aspects

minimizeu(1),,u(T)𝐝𝐮subscriptminimizeu1u𝑇superscriptsubscript𝐝𝐮\displaystyle\operatorname*{minimize}_{\text{u}(1),\dots,\text{u}(T)\in\mathbb% {R}^{\mathbf{d_{u}}}}roman_minimize start_POSTSUBSCRIPT u ( 1 ) , … , u ( italic_T ) ∈ blackboard_R start_POSTSUPERSCRIPT bold_d start_POSTSUBSCRIPT bold_u end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT k=1Ti=1Ni(y^θi(t)(k),k)+hi(ui(k),k)superscriptsubscript𝑘1𝑇superscriptsubscript𝑖1𝑁subscript𝑖subscript^𝑦subscript𝜃𝑖𝑡𝑘𝑘subscript𝑖subscriptu𝑖𝑘𝑘\displaystyle\sum_{k=1}^{T}\sum_{i=1}^{N}\ell_{i}(\widehat{y}_{\theta_{i}(t)}(% k),k)+h_{i}(\text{u}_{i}(k),k)∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) end_POSTSUBSCRIPT ( italic_k ) , italic_k ) + italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_k ) , italic_k ) (24)
subject to ui(k)𝒰i, for alli=1,2,,N,k,formulae-sequencesubscriptu𝑖𝑘subscript𝒰𝑖 for all𝑖12𝑁for-all𝑘\displaystyle\text{u}_{i}(k)\in\mathcal{U}_{i},\mbox{ for all}\ i=1,2,\dots,N,% \forall k,u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_k ) ∈ caligraphic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , for all italic_i = 1 , 2 , … , italic_N , ∀ italic_k ,

where, t𝑡titalic_t is the model update counter. Whenever, the input-output map is updated the counter t𝑡titalic_t is increased by 1111. Given t𝑡titalic_t, let ~i(y^θi,T):=k=1Ti(y^θi(t)(k),k),h~i(ui,T):=k=1Thi(ui(k),k)formulae-sequenceassignsubscript~𝑖subscript^𝑦subscript𝜃𝑖𝑇superscriptsubscript𝑘1𝑇subscript𝑖subscript^𝑦subscript𝜃𝑖𝑡𝑘𝑘assignsubscript~𝑖subscriptu𝑖𝑇superscriptsubscript𝑘1𝑇subscript𝑖subscriptu𝑖𝑘𝑘\tilde{\ell}_{i}(\widehat{y}_{\theta_{i}},T):=\sum_{k=1}^{T}\ell_{i}(\widehat{% y}_{\theta_{i}(t)}(k),k),\tilde{h}_{i}(\text{u}_{i},T):=\sum_{k=1}^{T}h_{i}(% \text{u}_{i}(k),k)over~ start_ARG roman_ℓ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_T ) := ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) end_POSTSUBSCRIPT ( italic_k ) , italic_k ) , over~ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_T ) := ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_k ) , italic_k ). While the input-output map is fixed the control decisions in problem (24) can be solved for via a projected gradient iteration given by, for all i{1,2,,N}𝑖12𝑁i\in\{1,2,\dots,N\}italic_i ∈ { 1 , 2 , … , italic_N },

ui(τ+1)=Proj𝒰i{ui(τ)\displaystyle\text{u}_{i}(\tau+1)=\textstyle\operatorname*{Proj}_{\mathcal{U}_% {i}}\big{\{}\text{u}_{i}(\tau)-u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_τ + 1 ) = roman_Proj start_POSTSUBSCRIPT caligraphic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT { u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_τ ) - (25)
α[y^θi~i(y^θi(t)(τ),T)uiy^θi(ui(τ));uih~i(ui(τ),T)]},\displaystyle-\alpha\big{[}\nabla_{\widehat{y}_{\theta_{i}}}\tilde{\ell}_{i}(% \widehat{y}_{\theta_{i}(t)}(\tau),T)^{\top}\nabla_{\text{u}_{i}}\widehat{y}_{% \theta_{i}}(\text{u}_{i}(\tau));\nabla_{\text{u}_{i}}\tilde{h}_{i}(\text{u}_{i% }(\tau),T)\big{]}\big{\}},- italic_α [ ∇ start_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT over~ start_ARG roman_ℓ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) end_POSTSUBSCRIPT ( italic_τ ) , italic_T ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ∇ start_POSTSUBSCRIPT u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_τ ) ) ; ∇ start_POSTSUBSCRIPT u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT over~ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_τ ) , italic_T ) ] } ,

where τ𝜏\tauitalic_τ is the iteration counter of the projected gradient steps and Proj𝒰i{}subscriptProjsubscript𝒰𝑖\operatorname*{Proj}_{\mathcal{U}_{i}}\{\cdot\}roman_Proj start_POSTSUBSCRIPT caligraphic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT { ⋅ } denote the projection operator. The control decisions obtained via (25) are implemented in the system. Further, after Tconsubscript𝑇𝑐𝑜𝑛T_{con}italic_T start_POSTSUBSCRIPT italic_c italic_o italic_n end_POSTSUBSCRIPT consecutive iterations of the application of the control decisions to the system, the model update counter t𝑡titalic_t is incremented by 1111 and the input-output map is updated via Algorithm 2 utilizing the current measurement. Subsequently, the projected gradient iterations are reinitialized and performed with the new input-output map corresponding to the parameters θi(t+1)subscript𝜃𝑖𝑡1\theta_{i}(t+1)italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t + 1 ). We summarize this in Algorithm 3.

Initialize:
      t=τ=1,ui(0),θi(0),wi(0)formulae-sequence𝑡𝜏1subscriptu𝑖0subscript𝜃𝑖0subscript𝑤𝑖0t=\tau=1,\text{u}_{i}(0),\theta_{i}(0),w_{i}(0)italic_t = italic_τ = 1 , u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 0 ) , italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 0 ) , italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 0 ) for all i𝑖iitalic_i
Repeat
       if mod(τ,Tcon)=0\mod(\tau,T_{con})=0roman_mod ( italic_τ , italic_T start_POSTSUBSCRIPT italic_c italic_o italic_n end_POSTSUBSCRIPT ) = 0 then
            
            - Update the input-output map via Algorithm 2
             - t=t+1𝑡𝑡1t=t+1italic_t = italic_t + 1
             - u(τ)=u(0)u𝜏u0\text{u}(\tau)=\text{u}(0)u ( italic_τ ) = u ( 0 )
       else
             - Compute u(τ)u𝜏\text{u}(\tau)u ( italic_τ ) using (25)
             - Apply u(τ)u𝜏\text{u}(\tau)u ( italic_τ ) to the system
            
       - τ=τ+1𝜏𝜏1\tau=\tau+1italic_τ = italic_τ + 1
Algorithm 3 Control with Online Input-Output Map Update for Solving Problem (24)

VI Application Example and Numerical Simulations

In this section, we provide an example of an application of the proposed framework and present numerical results of a numerical simulation of the application example.

VI-A Power System Application Example

Consider the problem of controlling and optimally managing the operation of a distribution power grid with penetration of photovoltaic energy sources (PES). We formulate this as a voltage regulation problem that fits the formulation in problem (24). We assume that there are N𝑁Nitalic_N number of PES bus and they can adjust their power injections for voltage regulation. The control decision at PES bus i𝑖iitalic_i at any time k𝑘kitalic_k is given by ui(k):=[Pi(k);Qi(k)]2assignsubscriptu𝑖𝑘subscript𝑃𝑖𝑘subscript𝑄𝑖𝑘superscript2\text{u}_{i}(k):=[P_{i}(k);Q_{i}(k)]\in\mathbb{R}^{2}u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_k ) := [ italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_k ) ; italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_k ) ] ∈ blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, where, Pi(k)subscript𝑃𝑖𝑘P_{i}(k)italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_k ) and Qi(k)subscript𝑄𝑖𝑘Q_{i}(k)italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_k ) are the net active and reactive power injections at the PES bus i𝑖iitalic_i at time t𝑡titalic_t, respectively. The sets 𝒰i:={[Pi;Qi]:Pi2+Qi2Si,max2,0PiP¯i}assignsubscript𝒰𝑖conditional-setsubscript𝑃𝑖subscript𝑄𝑖formulae-sequencesubscriptsuperscript𝑃2𝑖subscriptsuperscript𝑄2𝑖subscriptsuperscript𝑆2𝑖max0subscript𝑃𝑖subscript¯𝑃𝑖\mathcal{U}_{i}:=\{[P_{i};Q_{i}]:P^{2}_{i}+Q^{2}_{i}\leq S^{2}_{i,\mbox{max}},% 0\leq P_{i}\leq\overline{P}_{i}\}caligraphic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT := { [ italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] : italic_P start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + italic_Q start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_S start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i , max end_POSTSUBSCRIPT , 0 ≤ italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ over¯ start_ARG italic_P end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT }, where Si,max2subscriptsuperscript𝑆2𝑖maxS^{2}_{i,\mbox{max}}italic_S start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i , max end_POSTSUBSCRIPT is the rated apparent power for the of the PES i𝑖iitalic_i and P¯isubscript¯𝑃𝑖\overline{P}_{i}over¯ start_ARG italic_P end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is the maximum real power available with PES i𝑖iitalic_i. The input-output y^θ(t)(k)=1Ni=1Nϕi(ui(k),θi(k))subscript^𝑦𝜃𝑡𝑘1𝑁superscriptsubscript𝑖1𝑁subscriptitalic-ϕ𝑖subscriptu𝑖𝑘subscript𝜃𝑖𝑘\widehat{y}_{\theta(t)}(k)=\frac{1}{N}\sum_{i=1}^{N}\phi_{i}(\text{u}_{i}(k),% \theta_{i}(k))over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_θ ( italic_t ) end_POSTSUBSCRIPT ( italic_k ) = divide start_ARG 1 end_ARG start_ARG italic_N end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_k ) , italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_k ) ) gives the mapping from power injections uisubscriptu𝑖\text{u}_{i}u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT to the magnitudes of the voltages in the entire distribution grid. The functions i(.)\ell_{i}(.)roman_ℓ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( . ) are designed to capture the engineering constraint of keeping the true voltages within the interval [0.95,1.05]0.951.05[0.95,1.05][ 0.95 , 1.05 ]. Quadratic functions ui(k)hi(ui(k),k)subscriptu𝑖𝑘subscript𝑖subscriptu𝑖𝑘𝑘\text{u}_{i}(k)\to h_{i}(\text{u}_{i}(k),k)u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_k ) → italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_k ) , italic_k ) that penalize active power curtailment and reactive power injections at the PES buses at time k𝑘kitalic_k are chosen.

Refer to caption
Figure 1: Schematic of the modified IEEE 37 bus system. The buses highlighted in red triangles are PES buses
Refer to caption
Figure 2: The voltage magnitudes (p.u.formulae-sequence𝑝𝑢p.u.italic_p . italic_u .) over time in the modified IEEE 37373737-bus system without any control.

VI-B Illustrative Numerical Simulations

In this section, we instantiate the voltage regulation problem detailed in the previous section via a modified IEEE-37373737 bus system augmented with additional PES, using the solar irradiance data from Anatolia, CA, USA, and electric loads, having realistic load profiles for 4444 hours with a granularity of one second, introduced at different buses as illustrated in Fig. 1. A total of 18181818 PES buses are considered. We utilize Algorithm 3 with Tcon=1subscript𝑇𝑐𝑜𝑛1T_{con}=1italic_T start_POSTSUBSCRIPT italic_c italic_o italic_n end_POSTSUBSCRIPT = 1 so that the input-output map between the power injections and the voltage magnitudes is updated every subsequent control decision. During the simulation study, PES exchange estimates through a connected communication network with 19191919 nodes (18181818 PES and one fusion center). The graph Laplacian of the graph with 18181818 nodes is used as the weight matrix 𝐏𝐏\mathbf{P}bold_P. We compare two parametric models for capturing the input-output map:

  1. 1.

    A linear model where nodal active and reactive power injections serve as inputs and bus voltages as outputs, described by y^θi(t)(t)=Aiui(t)subscript^𝑦subscript𝜃𝑖𝑡𝑡subscript𝐴𝑖subscriptu𝑖𝑡\widehat{y}_{\theta_{i}(t)}(t)=A_{i}\text{u}_{i}(t)over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) end_POSTSUBSCRIPT ( italic_t ) = italic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) for all t𝑡titalic_t. This model is known as the LinDistFlow model [24], and the goal is to identify matrix Aisubscript𝐴𝑖A_{i}italic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT.

  2. 2.

    A non-linear model that posits a polynomial relationship between local power injections and bus voltage, described by y^θi(t)(t)=BiBi24(Ciu¯i(t))2subscript^𝑦subscript𝜃𝑖𝑡𝑡subscript𝐵𝑖superscriptsubscript𝐵𝑖24subscript𝐶𝑖subscript¯u𝑖𝑡2\widehat{y}_{\theta_{i}(t)}(t)=\frac{B_{i}-\sqrt{B_{i}^{2}-4(C_{i}-\overline{% \text{u}}_{i}(t))}}{2}over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) end_POSTSUBSCRIPT ( italic_t ) = divide start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - square-root start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 4 ( italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - over¯ start_ARG u end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) ) end_ARG end_ARG start_ARG 2 end_ARG for all t𝑡titalic_t, where u¯i=Pi2+Qi2subscript¯u𝑖superscriptsubscript𝑃𝑖2superscriptsubscript𝑄𝑖2\overline{\text{u}}_{i}=\sqrt{P_{i}^{2}+Q_{i}^{2}}over¯ start_ARG u end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = square-root start_ARG italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG is the apparent power magnitude at bus i𝑖iitalic_i. This reflects the quadratic correlation between power injection and voltage magnitude observed in power flow equations also known as the constant power load model. Bisubscript𝐵𝑖B_{i}italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and Cisubscript𝐶𝑖C_{i}italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT are the parameters of the model to be determined.

For Algorithm 2, we adopt a step-size ηt=0.01/tsubscript𝜂𝑡0.01𝑡\eta_{t}=0.01/\sqrt{t}italic_η start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = 0.01 / square-root start_ARG italic_t end_ARG. We plot the voltage levels in the modified IEEE 37373737-bus system during the control process while updating the model with new input-output data becomes available using Algorithm 3. Fig. 2 illustrates potential violations of voltage regulation limits without control measures. The real-time estimated linear model provides a satisfactory control performance in Fig. 3, albeit with some fluctuations during periods of reduced PES generation. Notably, the control performance using the identified non-linear constant power load model, as shown in Fig. 4, surpasses that of the linear model, which aligns with expectations given the non-linear model’s closer representation of actual power flow dynamics. Overall, the models identified through our proposed algorithm demonstrate effective voltage regulation capabilities.

Refer to caption
Figure 3: The voltage magnitudes (p.u.formulae-sequence𝑝𝑢p.u.italic_p . italic_u .) over time in the modified IEEE 37373737-bus system with control decisions derived using the online distributed estimated linear map between power injections and voltage magnitudes.
Refer to caption
Figure 4: The voltage magnitudes (p.u.formulae-sequence𝑝𝑢p.u.italic_p . italic_u .) over time in the modified IEEE 37373737-bus system with control decisions derived using a non-linear map between power injections and voltage magnitudes updated online.

VII Conclusion

In this paper, we developed an online distributed algorithm where each agent updates its estimate of the model via an online gradient descent scheme utilizing the most recent input-output pair. We prove that the developed distributed algorithm has a sub-linear regret and determines the original system model. Further, agents only share non-linear estimates preserving their private information. The numerical simulation study corroborates the efficacy of our developed algorithm with the identification of a more accurate quadratic power flow model, which improves the voltage regulation performance of the control system. Looking ahead, our future endeavors will focus on extensive testing within real-world systems, aiming to further validate the performance and characterize the scalability of our method. with tens of thousands of nodes.

References

  • [1] P. Roque, “Coordination of multi-agent systems: Predictive and vision-based control for aerial and space robotics,” Ph.D. dissertation, KTH Royal Institute of Technology, 2022.
  • [2] V. Spudić, C. Conte, M. Baotić, and M. Morari, “Cooperative distributed model predictive control for wind farms,” Optimal Control Applications and Methods, vol. 36, no. 3, pp. 333–352, 2015.
  • [3] A. Bernstein and E. Dall’Anese, “Real-time feedback-based optimization of distribution grids: A unified approach,” IEEE Transactions on Control of Network Systems, vol. 6, no. 3, pp. 1197–1209, 2019.
  • [4] L. Ljung, “System identification,” in Signal analysis and prediction.   Springer, 1998, pp. 163–173.
  • [5] T. Hastie, R. Tibshirani, J. Friedman, T. Hastie, R. Tibshirani, and J. Friedman, “Overview of supervised learning,” The elements of statistical learning: Data mining, inference, and prediction, pp. 9–41, 2009.
  • [6] I. Markovsky and F. Dörfler, “Behavioral systems theory in data-driven analysis, signal processing, and control,” Annual Reviews in Control, vol. 52, pp. 42–64, 2021.
  • [7] K. Montano-Martinez, S. Thakar, V. Vittal, R. Ayyanar, and C. Rojas, “Detailed primary and secondary distribution system feeder modeling based on ami data,” in 2020 52nd North American Power Symposium (NAPS), 2021, pp. 1–6.
  • [8] W. Wang, S. Jazebi, F. de León, and Z. Li, “Looping radial distribution systems using superconducting fault current limiters: Feasibility and economic analysis,” IEEE Transactions on Power Systems, vol. 33, no. 3, pp. 2486–2495, 2018.
  • [9] J. D. Lankutis, “Verifying data integrity: If you cannot believe the data, how can you believe the analytics?” in 2013 IEEE Rural Electric Power Conference (REPC), 2013, pp. C4–1–C4–4.
  • [10] EPRI, “Distribution modeling guidelines:recommendations for system and asset modeling for distributed energy resource assessments,” Electric Power Research Institute, Palo Alto, California (United States), Tech. Rep., 2016.
  • [11] A. Bernstein, J. Comden, Y. Chen, and J. Wang, “Time-varying feedback optimization for quadratic programs with heterogeneous gradient step sizes,” in 2023 62nd IEEE Conference on Decision and Control (CDC).   IEEE, 2023, pp. 4003–4011.
  • [12] A. Bernstein, E. Dall’Anese, and A. Simonetto, “Online primal-dual methods with measurement feedback for time-varying convex optimization,” IEEE Transactions on Signal Processing, vol. 67, no. 8, pp. 1978–1991, 2019.
  • [13] A. Chiuso and G. Pillonetto, “System identification: A machine learning perspective,” Annual Review of Control, Robotics, and Autonomous Systems, vol. 2, no. 1, pp. 281–304, 2019.
  • [14] O. A. Alimi, K. Ouahada, and A. M. Abu-Mahfouz, “A review of machine learning approaches to power system security and stability,” IEEE Access, vol. 8, pp. 113 512–113 531, 2020.
  • [15] S. D. McArthur, E. M. Davidson, V. M. Catterson, A. L. Dimeas, N. D. Hatziargyriou, F. Ponci, and T. Funabashi, “Multi-agent systems for power engineering applications—Part I: Concepts, approaches, and technical challenges,” IEEE Transactions on Power systems, vol. 22, no. 4, pp. 1743–1752, 2007.
  • [16] O. P. Mahela, M. Khosravy, N. Gupta, B. Khan, H. H. Alhelou, R. Mahla, N. Patel, and P. Siano, “Comprehensive overview of multi-agent systems for controlling smart grids,” CSEE Journal of Power and Energy Systems, vol. 8, no. 1, pp. 115–131, 2020.
  • [17] C.-Y. Chang, “A privacy preserving distributed model identification algorithm for power distribution systems,” in 62nd IEEE Conference on Decision and Control, 2023.
  • [18] D. Materassi and M. V. Salapaka, “On the problem of reconstructing an unknown topology via locality properties of the wiener filter,” IEEE Transactions on Automatic Control, vol. 57, no. 7, pp. 1765–1777, 2012.
  • [19] H. H. Weerts, P. M. Van den Hof, and A. G. Dankers, “Identifiability of linear dynamic networks,” Automatica, vol. 89, pp. 247–258, 2018.
  • [20] M. S. Veedu, H. Doddi, and M. V. Salapaka, “Topology learning of linear dynamical systems with latent nodes using matrix decomposition,” IEEE Transactions on Automatic Control, vol. 67, no. 11, pp. 5746–5761, 2022.
  • [21] D. E. Knuth, The Art of Computer Programming, Volume 1: Fundamental Algorithms.   Addison-Wesley, 1997.
  • [22] Y. Huang, Z. Meng, and J. Sun, “Scalable distributed least square algorithms for large-scale linear equations via an optimization approach,” Automatica, vol. 146, p. 110572, 2022.
  • [23] R. Merris, “Laplacian matrices of graphs: a survey,” Linear algebra and its applications, vol. 197, pp. 143–176, 1994.
  • [24] M. E. Baran and F. F. Wu, “Network reconfiguration in distribution systems for loss reduction and load balancing,” IEEE Transactions on Power delivery, vol. 4, no. 2, pp. 1401–1407, 1989.