Recurrent Neural Network Training with Convex Loss and Regularization Functions by Extended Kalman Filtering

doi:10.48550/arXiv.2111.02673

Recurrent Neural Network Training with Convex Loss and Regularization Functions by Extended Kalman Filtering

Bemporad, Alberto

This paper investigates the use of extended Kalman filtering to train recurrent neural networks with rather general convex loss functions and regularization terms on the network parameters, including $\ell_1$-regularization. We show that the learning method is competitive with respect to stochastic gradient descent in a nonlinear system identification benchmark and in training a linear system with binary outputs. We also explore the use of the algorithm in data-driven nonlinear model predictive control and its relation with disturbance models for offset-free closed-loop tracking.

Publication:

arXiv e-prints

Pub Date:

November 2021

DOI:

10.48550/arXiv.2111.02673

arXiv:

arXiv:2111.02673

Bibcode:

2021arXiv211102673B

Keywords:

Computer Science - Machine Learning;
Electrical Engineering and Systems Science - Systems and Control;
Mathematics - Optimization and Control

E-Print:

21 pages, 3 figures, submitted for publication

NASA/ADS

Recurrent Neural Network Training with Convex Loss and Regularization Functions by Extended Kalman Filtering

Abstract