research-article

Public Access

Design-while-verify: correct-by-construction control learning with verification in the loop

Authors:

Qi ZhuAuthors Info & Claims

DAC '22: Proceedings of the 59th ACM/IEEE Design Automation Conference

Pages 925 - 930

https://doi.org/10.1145/3489517.3530556

Published: 23 August 2022 Publication History

Abstract

In the current control design of safety-critical cyber-physical systems, formal verification techniques are typically applied after the controller is designed to evaluate whether the required properties (e.g., safety) are satisfied. However, due to the increasing system complexity and the fundamental hardness of designing a controller with formal guarantees, such an open-loop process of design-then-verify often results in many iterations and fails to provide the necessary guarantees. In this paper, we propose a correct-by-construction control learning framework that integrates the verification into the control design process in a closed-loop manner, i.e., design-while-verify. Specifically, we leverage the verification results (computed reachable set of the system state) to construct feedback metrics for control learning, which measure how likely the current design of control parameters can meet the required reach-avoid property for safety and goal-reaching. We formulate an optimization problem based on such metrics for tuning the controller parameters, and develop an approximated gradient descent algorithm with a difference method to solve the optimization problem and learn the controller. The learned controller is formally guaranteed to meet the required reach-avoid property. By treating verifiability as a first-class objective and effectively leveraging the verification results during the control learning process, our approach can significantly improve the chance of finding a control design with formal property guarantees, demonstrated in a set of experiments that use model-based or neural network based controllers.

References

[1]

Martin Arjovsky, Soumith Chintala, and Léon Bottou. 2017. Wasserstein generative adversarial networks. In ICML. PMLR.

[2]

Alberto Bemporad, Manfred Morari, Vivek Dua, and Efstratios N Pistikopoulos. 2002. The explicit linear quadratic regulator for constrained systems. Automatica (2002).

[3]

Vincent D Blondel and John N Tsitsiklis. 2000. A survey of computational complexity results in systems and control. Automatica 36 (2000), 1249--1274.

Digital Library

[4]

Xin Chen, Erika Ábrahám, and Sriram Sankaranarayanan. 2013. Flow*: An analyzer for non-linear hybrid systems. In CAV. Springer, 258--263.

[5]

Jerry Ding and Claire J Tomlin. 2010. Robust reach-avoid controller synthesis for switched nonlinear systems. In CDC. IEEE, 6481--6486.

[6]

Tommaso Dreossi, Daniel J Fremont, Shromona Ghosh, Edward Kim, Hadi Ravanbakhsh, Marcell Vazquez-Chanlatte, and Sanjit Seshia. 2019. Verifai: A toolkit for the formal design and analysis of artificial intelligence-based systems. In CAV. Springer, 432--442.

[7]

Souradeep Dutta, Xin Chen, and Sriram Sankaranarayanan. 2019. Reachability analysis for neural feedback systems using regressive polynomial rule inference. In HSCC. 157--168.

[8]

Chuchu Fan, Zengyi Qin, Umang Mathur, Qiang Ning, Sayan Mitra, and Mahesh Viswanathan. 2021. Controller synthesis for linear system with reach-avoid specifications. IEEE Trans. Automat. Control (2021).

[9]

Jiameng Fan, Chao Huang, Xin Chen, Wenchao Li, and Qi Zhu. 2020. ReachNN*: A Tool for Reachability Analysis of Neural-Network Controlled Systems. In ATVA, Dang Van Hung and Oleg Sokolsky (Eds.). Springer International Publishing.

[10]

Nicolas Heess, Gregory Wayne, David Silver, Timothy Lillicrap, Tom Erez, and Yuval Tassa. 2015. Learning Continuous Control Policies by Stochastic Value Gradients. NIPs 28 (2015), 2944--2952.

[11]

Peter Henderson, Riashat Islam, Philip Bachman, Joelle Pineau, Doina Precup, and David Meger. 2018. Deep reinforcement learning that matters. In AAAI.

[12]

Thomas A Henzinger, Peter W Kopke, Anuj Puri, and Pravin Varaiya. 1998. What's decidable about hybrid automata? JCSS 57, 1 (1998), 94--124.

Digital Library

[13]

Kai-Chieh Hsu, Vicenç Rubies-Royo, Claire J Tomlin, and Jaime F Fisac. 2021. Safety and Liveness Guarantees through Reach-Avoid Reinforcement Learning. In Robotics: Science and Systems.

[14]

Chao Huang, Jiameng Fan, Xin Chen, Wenchao Li, and Qi Zhu. 2021. POLAR: A Polynomial Arithmetic Framework for Verifying Neural-Network Controlled Systems. arXiv preprint arXiv:2106.13867 (2021).

[15]

Chao Huang, Jiameng Fan, Wenchao Li, Xin Chen, and Qi Zhu. 2019. Reachnn: Reachability analysis of neural-network controlled systems. TECS 18 (2019).

Digital Library

[16]

Radoslav Ivanov, Taylor J Carpenter, James Weimer, Rajeev Alur, George J Pappas, and Insup Lee. 2020. Case study: verifying the safety of an autonomous racing car with a neural network controller. In HSCC. 1--7.

[17]

Radoslav Ivanov, James Weimer, Rajeev Alur, George J Pappas, and Insup Lee. 2019. Verisig: verifying safety properties of hybrid systems with neural network controllers. In HSCC. 169--178.

[18]

Timothy P Lillicrap, Jonathan J Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. 2016. Continuous control with deep reinforcement learning. In ICLR (Poster).

[19]

Gabriel Peyré, Marco Cuturi, et al. 2019. Computational optimal transport: With applications to data science. Foundations and Trends® in Machine Learning 11, 5--6 (2019), 355--607.

[20]

Xiaowu Sun and Yasser Shoukry. 2021. Provably Correct Training of Neural Network Controllers Using Reachability Analysis. arXiv preprint (2021).

[21]

Yixuan Wang, Chao Huang, Zhilu Wang, Shichao Xu, Zhaoran Wang, and Qi Zhu. 2021. Cocktail: Learn a Better Neural Network Controller from Multiple Experts via Adaptive Mixing and Robust Distillation. DAC (2021).

[22]

Yixuan Wang, Chao Huang, and Qi Zhu. 2020. Energy-efficient control adaptation with safety guarantees for learning-enabled cyber-physical systems. In ICCAD. IEEE, 1--9.

[23]

Webots. [n. d.]. http://www.cyberbotics.com. Mobile Robot Simulation Software.

Cited By

Chakraborty SJha SSamii SMundhenk P(2024)Introduction to the Special Issue on Automotive CPS Safety & Security: Part 2ACM Transactions on Cyber-Physical Systems10.1145/36502108:2(1-17)Online publication date: 15-May-2024
https://dl.acm.org/doi/10.1145/3650210
Wang YZhan SJiao RWang ZJin WYang ZWang ZHuang CZhu QKrause ABrunskill ECho KEngelhardt BSabato SScarlett J(2023)Enforcing hard constraints with soft barriersProceedings of the 40th International Conference on Machine Learning10.5555/3618408.3619930(36593-36604)Online publication date: 23-Jul-2023
https://dl.acm.org/doi/10.5555/3618408.3619930
Wang ZHuang CZhu QElkind E(2023)Efficient global robustness certification of neural networks via interleaving twin-network encoding (extended abstract)Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence10.24963/ijcai.2023/727(6498-6503)Online publication date: 19-Aug-2023
https://dl.acm.org/doi/10.24963/ijcai.2023/727
Show More Cited By

Recommendations

Fractional-order PI λDµ controller design

This paper introduces a new design method of fractional-order proportional-derivative (FOPD) and fractional-order proportional-integral-derivative (FOPID) controllers. A biquadratic approximation of a fractional-order differential operator is used to ...
Control Systems using MATLAB. Control Design, Customization and Design Case Studies
Learning to verify branching time properties

We present a new model checking algorithm for verifying computation tree logic (CTL) properties. Our technique is based on using language inference to learn the fixpoints necessary for checking a CTL formula instead of computing them iteratively as is ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

DAC '22: Proceedings of the 59th ACM/IEEE Design Automation Conference

July 2022

1462 pages

ISBN:9781450391429

DOI:10.1145/3489517

General Chair:
Rob Oshana
NXP

Copyright © 2022 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGDA: ACM Special Interest Group on Design Automation
IEEE CEDA

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 August 2022

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Qualifiers

Research-article

Funding Sources

Conference

DAC '22

Sponsor:

SIGDA

DAC '22: 59th ACM/IEEE Design Automation Conference

July 10 - 14, 2022

California, San Francisco

Acceptance Rates

Overall Acceptance Rate 1,770 of 5,499 submissions, 32%

Upcoming Conference

DAC '25

Sponsor:
sigda

62nd ACM/IEEE Design Automation Conference

June 22 - 26, 2025

San Francisco , CA , USA

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

9
Total Citations
View Citations
268
Total Downloads

Downloads (Last 12 months)128
Downloads (Last 6 weeks)32

Reflects downloads up to 04 Oct 2024

Other Metrics

View Author Metrics

Citations

Cited By

Chakraborty SJha SSamii SMundhenk P(2024)Introduction to the Special Issue on Automotive CPS Safety & Security: Part 2ACM Transactions on Cyber-Physical Systems10.1145/36502108:2(1-17)Online publication date: 15-May-2024
https://dl.acm.org/doi/10.1145/3650210
Wang YZhan SJiao RWang ZJin WYang ZWang ZHuang CZhu QKrause ABrunskill ECho KEngelhardt BSabato SScarlett J(2023)Enforcing hard constraints with soft barriersProceedings of the 40th International Conference on Machine Learning10.5555/3618408.3619930(36593-36604)Online publication date: 23-Jul-2023
https://dl.acm.org/doi/10.5555/3618408.3619930
Wang ZHuang CZhu QElkind E(2023)Efficient global robustness certification of neural networks via interleaving twin-network encoding (extended abstract)Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence10.24963/ijcai.2023/727(6498-6503)Online publication date: 19-Aug-2023
https://dl.acm.org/doi/10.24963/ijcai.2023/727
Chakraborty SJha SSamii SMundhenk P(2023)Introduction to the Special Issue on Automotive CPS Safety & Security: Part 1ACM Transactions on Cyber-Physical Systems10.1145/35799867:1(1-6)Online publication date: 22-Mar-2023
https://dl.acm.org/doi/10.1145/3579986
Wang YZhan SWang ZHuang CWang ZYang ZZhu QMitra SVenkatasubramanian NDubey AFeng LGhasemi MSprinkle J(2023)Joint Differentiable Optimization and Verification for Certified Reinforcement LearningProceedings of the ACM/IEEE 14th International Conference on Cyber-Physical Systems (with CPS-IoT Week 2023)10.1145/3576841.3585919(132-141)Online publication date: 9-May-2023
https://dl.acm.org/doi/10.1145/3576841.3585919
Liu XLuo YGoeckner AChakraborty TJiao RWang NWang YSato TChen QZhu Q(2023)Invited: Waving the Double-Edged Sword: Building Resilient CAVs with Edge and Cloud Computing2023 60th ACM/IEEE Design Automation Conference (DAC)10.1109/DAC56929.2023.10247809(1-4)Online publication date: 9-Jul-2023
https://doi.org/10.1109/DAC56929.2023.10247809
Zhu QLi WHuang CChen XZhou WWang YLi JFu F(2023)Verification and Design of Robust and Safe Neural Network-enabled Autonomous Systems2023 59th Annual Allerton Conference on Communication, Control, and Computing (Allerton)10.1109/Allerton58177.2023.10313451(1-8)Online publication date: 26-Sep-2023
https://doi.org/10.1109/Allerton58177.2023.10313451
Wang YZhu H(2023)Verification-guided Programmatic Controller SynthesisTools and Algorithms for the Construction and Analysis of Systems10.1007/978-3-031-30820-8_16(229-250)Online publication date: 22-Apr-2023
https://dl.acm.org/doi/10.1007/978-3-031-30820-8_16
Chen XFan JHuang CJiao RLi WLiu XWang YWang ZZhou WZhu Q(2023)Safety-Assured Design and Adaptation of Connected and Autonomous VehiclesMachine Learning and Optimization Techniques for Automotive Cyber-Physical Systems10.1007/978-3-031-28016-0_26(735-757)Online publication date: 27-Mar-2023
https://doi.org/10.1007/978-3-031-28016-0_26

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Media

Figures

Other

Tables

View Table of Contents