Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

Reinforcement Learning Algorithm for Partially Observable Markov Decision Problems release_nvsm7pcrdfcuxevwx76uka2fjq

by Tommi S. Jaakkola, Satinder P. Singh, Michael I. Jordan

Cited By

References to this release by other works.
Fuzzy reference matching is a work in progress!
Read more about quality, completeness, and caveats in the fatcat guide.
Showing 1 - 30 of 93 references (in 165ms)

via fuzzy
Muesli: Combining Improvements in Policy Optimization [article]
Matteo Hessel, Ivo Danihelka, Fabio Viola, Arthur Guez, Simon Schmitt, Laurent Sifre, Theophane Weber, David Silver (+ more)
2021    pre-print
version:v1  arXiv:2104.06159v1 
web.archive.org [PDF]

via fuzzy
Game-Theoretic and Set-Based Methods for Safe Autonomous Vehicles on Shared Roads
Nan Li, University, My
2021  
doi:10.7302/1419 
web.archive.org [PDF]

via fuzzy
Correlation-aware Cooperative Multigroup Broadcast 360 Video Delivery Network: A Hierarchical Deep Reinforcement Learning Approach [article]
Fenghe Hu and Yansha Deng and A. Hamid Aghvami
2021    pre-print
version:v2  arXiv:2010.11347v2 
web.archive.org [PDF]

via fuzzy
Convergence of Finite Memory Q-Learning for POMDPs and Near Optimality of Learned Policies under Filter Stability [article]
Ali Devran Kara, Serdar Yuksel
2021    pre-print
version:v2  arXiv:2103.12158v2 
web.archive.org [PDF]

via fuzzy
Occlusion-Aware Search for Object Retrieval in Clutter [article]
Wissam Bejjani, Wisdom C. Agboh, Mehmet R. Dogar, Matteo Leonetti
2021    pre-print
version:v4  arXiv:2011.03334v4 
web.archive.org [PDF]

via fuzzy
MDP Playground: A Design and Debug Testbed for Reinforcement Learning [article]
Raghu Rajan, Jessica Lizeth Borja Diaz, Suresh Guttikonda, Fabio Ferreira, André Biedenkapp, Jan Ole von Hartz, Frank Hutter
2021    pre-print
version:v4  arXiv:1909.07750v4 
web.archive.org [PDF]

via fuzzy
Influence-aware Memory Architectures for Deep Reinforcement Learning [article]
Miguel Suau, Jinke He, Elena Congeduti, Rolf A.N. Starre, Aleksander Czechowski, Frans A. Oliehoek
2021    pre-print
version:v4  arXiv:1911.07643v4 
web.archive.org [PDF]

via fuzzy
MUSBO: Model-based Uncertainty Regularized and Sample Efficient Batch Optimization for Deployment Constrained Reinforcement Learning [article]
DiJia Su, Jason D. Lee, John M. Mulvey, H. Vincent Poor
2021    pre-print
version:v1  arXiv:2102.11448v1 
web.archive.org [PDF]

via fuzzy
Safety aware model-based reinforcement learning for optimal control of a class of output-feedback nonlinear systems [article]
S M Nahid Mahmud, Moad Abudia, Scott A Nivison, Zachary I. Bell, Rushikesh Kamalapurkar
2021    pre-print
version:v1  arXiv:2110.00271v1 
web.archive.org [PDF]

via fuzzy
Quantitative Day Trading from Natural Language using Reinforcement Learning
Ramit Sawhney, Arnav Wadhwa, Shivam Agarwal, Rajiv Ratn Shah
2021   Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies  unpublished
doi:10.18653/v1/2021.naacl-main.316 
web.archive.org [PDF]

via fuzzy
Markov Decision Processes with Embedded Agents
Luke Harold Miles
2021  
doi:10.13023/etd.2021.135 
web.archive.org [PDF]

via fuzzy
Queueing Network Controls via Deep Reinforcement Learning [article]
J. G. Dai, Mark Gluzman
2020    pre-print
version:v5  arXiv:2008.01644v5 
web.archive.org [PDF]

via fuzzy
A Visual Communication Map for Multi-Agent Deep Reinforcement Learning [article]
Ngoc Duy Nguyen, Thanh Thi Nguyen, Saeid Nahavandi
2020    pre-print
version:v1  arXiv:2002.11882v1 
web.archive.org [PDF]

via fuzzy
Learning Implicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning [article]
Meng Zhou, Ziyu Liu, Pengwei Sui, Yixuan Li, Yuk Ying Chung
2020    pre-print
version:v2  arXiv:2007.02529v2 
web.archive.org [PDF]

via fuzzy
Transfer Reinforcement Learning under Unobserved Contextual Information [article]
Yan Zhang, Michael M. Zavlanos
2020    pre-print
version:v1  arXiv:2003.04427v1 
web.archive.org [PDF]

via fuzzy
Approximate information state for approximate planning and reinforcement learning in partially observed systems [article]
Jayakumar Subramanian, Amit Sinha, Raihan Seraj, Aditya Mahajan
2020    pre-print
version:v1  arXiv:2010.08843v1 
web.archive.org [PDF]

via fuzzy
Enforcing Almost-Sure Reachability in POMDPs [article]
Sebastian Junges, Nils Jansen, Sanjit A. Seshia
2020    pre-print
version:v2  arXiv:2007.00085v2 
web.archive.org [PDF]

via fuzzy
Reinforcement Learning in Healthcare: A Survey [article]
Chao Yu, Jiming Liu, Shamim Nemati
2020    pre-print
version:v4  arXiv:1908.08796v4 
web.archive.org [PDF]

via fuzzy
Reinforcement Learning for Intelligent Healthcare Applications: A Survey
Antonio Coronato, Muddasar Naeem, Giuseppe De Pietro, Giovanni Paragliola
2020   Artificial Intelligence in Medicine
doi:10.1016/j.artmed.2020.101964  pmid:34756216 

via fuzzy
Domain Concretization from Examples: Addressing Missing Domain Knowledge via Robust Planning [article]
Akshay Sharma, Piyush Rajesh Medikeri, Yu Zhang
2020    pre-print
version:v1  arXiv:2011.09034v1 
web.archive.org [PDF]

via fuzzy
A Traffic Simulation Model with Interactive Drivers and High-fidelity Car Dynamics
Guankun Su, Nan Li, Yildiray Yildiz, Anouck Girard, Ilya Kolmanovsky
2019   IFAC-PapersOnLine
doi:10.1016/j.ifacol.2019.01.010 
web.archive.org [PDF]

via fuzzy
Modeling Cyber-Physical Human Systems via an Interplay Between Reinforcement Learning and Game Theory [article]
Mert Albaba, Yildiray Yildiz
2019    pre-print
version:v1  arXiv:1910.05092v1 
web.archive.org [PDF]

via fuzzy
Stochastic Dynamic Programming in DASH
Koffka Khan, Wayne Goodridge
2019   International journal of advanced networking and applications
doi:10.35444/ijana.2019.11031 
web.archive.org [PDF]

via fuzzy
The Power of d Choices in Scheduling for Data Centers with Heterogeneous Servers [article]
Amir Moaddeli, Iman Nabati Ahmadi, Negin Abhar
2019    pre-print
version:v1  arXiv:1904.00447v1 
web.archive.org [PDF]

via fuzzy
II-Learn—A Novel Metric for Measuring the Intelligence Increase and Evolution of Artificial Learning Systems
László Barna Iantovics, Dimitris K. Iakovidis, Elena Nechita
2019   International Journal of Computational Intelligence Systems
doi:10.2991/ijcis.d.191101.001 
web.archive.org [PDF]

via fuzzy
Robust Federated Learning Through Representation Matching and Adaptive Hyper-parameters [article]
Hesham Mostafa
2019    pre-print
version:v1  arXiv:1912.13075v1 
web.archive.org [PDF]

via fuzzy
Using Deep Q-Learning to Prolong the Lifetime of Correlated Internet of Things Devices
Jernej Hribar, Andrei Marinescu, George A. Ropokis, Luiz A. DaSilva
2019   IEEE International Conference on Communications
doi:10.1109/iccw.2019.8756759  dblp:conf/icc/HribarMRD19 
web.archive.org [PDF]

via fuzzy
Transformer Based Reinforcement Learning For Games [article]
Uddeshya Upadhyay, Nikunj Shah, Sucheta Ravikanti, Mayanka Medhe
2019    pre-print
version:v1  arXiv:1912.03918v1 
web.archive.org [PDF]

via fuzzy
Machine Learning for 5G/B5G Mobile and Wireless Communications: Potential, Limitations, and Future Directions
Manuel Eugenio Morocho-Cayamcela, Haeyoung Lee, Wansu Lim
2019   IEEE Access
doi:10.1109/access.2019.2942390 
web.archive.org [PDF]

via fuzzy
Multiple Landmark Detection using Multi-Agent Reinforcement Learning [article]
Athanasios Vlontzos, Amir Alansary, Konstantinos Kamnitsas, Daniel Rueckert, Bernhard Kainz
2019    pre-print
version:v2  arXiv:1907.00318v2 
web.archive.org [PDF]
Showing 1 - 30 of 93 references  next »