DOI: 10.1609/aaai.v33i01.33019902

Verifiable and interpretable reinforcement learning through program synthesis

Published: 27 January 2019

Abstract

We study the problem of generating interpretable and verifiable policies for Reinforcement Learning (RL). Unlike the popular Deep Reinforcement Learning (DRL) paradigm, in which the policy is represented by a neural network, the aim of this work is to find policies that can be represented in high-level programming languages. Such programmatic policies have several benefits, including being more easily interpreted than neural networks and being amenable to verification by scalable symbolic methods. The generation methods for programmatic policies also provide a mechanism for systematically using domain knowledge to guide the policy search. The interpretability and verifiability of these policies provide the opportunity to deploy RL-based solutions in safety-critical environments. This thesis draws on, and extends, work from both the machine learning and formal methods communities.
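To make the contrast concrete, here is a minimal illustrative sketch (not the thesis's actual synthesis method) of what a "programmatic policy" looks like: an ordinary, readable program over named state features for a CartPole-style balancing task. The feature layout and the threshold are invented for illustration.

```python
def programmatic_policy(state):
    """Map a state (cart_pos, cart_vel, pole_angle, pole_vel) to an action.

    Returns 1 (push right) or 0 (push left). Because the policy is a short
    program, a human can read it directly, and a symbolic verifier can check
    properties such as "whenever the pole leans right, the action is 1" by
    simple case analysis over the branches.
    """
    cart_pos, cart_vel, pole_angle, pole_vel = state
    # Push in the direction the pole is falling to counteract the fall;
    # the velocity term anticipates where the pole is heading.
    if pole_angle + 0.5 * pole_vel > 0.0:
        return 1  # push right
    return 0      # push left
```

Establishing the same kind of property for a neural-network policy requires specialized solvers (e.g., SMT-based tools for networks with ReLU activations), which is precisely the verification gap that programmatic policies narrow.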


Cited By

  • (2024) Redefining Counterfactual Explanations for Reinforcement Learning: Overview, Challenges and Opportunities. ACM Computing Surveys 56(9):1–33. doi:10.1145/3648472
  • (2021) Program synthesis guided reinforcement learning for partially observed environments. Proceedings of the 35th International Conference on Neural Information Processing Systems, 29669–29683. doi:10.5555/3540261.3542532
  • (2021) Automatic discovery of interpretable planning strategies. Machine Learning 110(9):2641–2683. doi:10.1007/s10994-021-05963-2

Published In

AAAI'19/IAAI'19/EAAI'19: Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence and Thirty-First Innovative Applications of Artificial Intelligence Conference and Ninth AAAI Symposium on Educational Advances in Artificial Intelligence
January 2019
10088 pages
ISBN:978-1-57735-809-1

Sponsors

  • Association for the Advancement of Artificial Intelligence

Publisher

AAAI Press


Qualifiers

  • Research-article
  • Research
  • Refereed limited

