via fuzzy
|
Muesli: Combining Improvements in Policy Optimization
[article]
Matteo Hessel, Ivo Danihelka, Fabio Viola, Arthur Guez, Simon Schmitt, Laurent Sifre, Theophane Weber, David Silver (+ more) 2021
pre-print
version:v1
arXiv:2104.06159v1
|
web.archive.org [PDF]
|
via fuzzy
|
Game-Theoretic and Set-Based Methods for Safe Autonomous Vehicles on Shared Roads
Nan Li, University, My 2021
doi:10.7302/1419
|
web.archive.org [PDF]
|
via fuzzy
|
Correlation-aware Cooperative Multigroup Broadcast 360 Video Delivery Network: A Hierarchical Deep Reinforcement Learning Approach
[article]
Fenghe Hu and Yansha Deng and A. Hamid Aghvami 2021
pre-print
version:v2
arXiv:2010.11347v2
|
web.archive.org [PDF]
|
via fuzzy
|
Convergence of Finite Memory Q-Learning for POMDPs and Near Optimality of Learned Policies under Filter Stability
[article]
Ali Devran Kara, Serdar Yuksel 2021
pre-print
version:v2
arXiv:2103.12158v2
|
web.archive.org [PDF]
|
via fuzzy
|
Occlusion-Aware Search for Object Retrieval in Clutter
[article]
Wissam Bejjani, Wisdom C. Agboh, Mehmet R. Dogar, Matteo Leonetti 2021
pre-print
version:v4
arXiv:2011.03334v4
|
web.archive.org [PDF]
|
via fuzzy
|
MDP Playground: A Design and Debug Testbed for Reinforcement Learning
[article]
Raghu Rajan, Jessica Lizeth Borja Diaz, Suresh Guttikonda, Fabio Ferreira, André Biedenkapp, Jan Ole von Hartz, Frank Hutter 2021
pre-print
version:v4
arXiv:1909.07750v4
|
web.archive.org [PDF]
|
via fuzzy
|
Influence-aware Memory Architectures for Deep Reinforcement Learning
[article]
Miguel Suau, Jinke He, Elena Congeduti, Rolf A.N. Starre, Aleksander Czechowski, Frans A. Oliehoek 2021
pre-print
version:v4
arXiv:1911.07643v4
|
web.archive.org [PDF]
|
via fuzzy
|
MUSBO: Model-based Uncertainty Regularized and Sample Efficient Batch Optimization for Deployment Constrained Reinforcement Learning
[article]
DiJia Su, Jason D. Lee, John M. Mulvey, H. Vincent Poor 2021
pre-print
version:v1
arXiv:2102.11448v1
|
web.archive.org [PDF]
|
via fuzzy
|
Safety aware model-based reinforcement learning for optimal control of a class of output-feedback nonlinear systems
[article]
S M Nahid Mahmud, Moad Abudia, Scott A Nivison, Zachary I. Bell, Rushikesh Kamalapurkar 2021
pre-print
version:v1
arXiv:2110.00271v1
|
web.archive.org [PDF]
|
via fuzzy
|
Quantitative Day Trading from Natural Language using Reinforcement Learning
Ramit Sawhney, Arnav Wadhwa, Shivam Agarwal, Rajiv Ratn Shah 2021
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
unpublished
doi:10.18653/v1/2021.naacl-main.316
|
web.archive.org [PDF]
|
via fuzzy
|
Markov Decision Processes with Embedded Agents
Luke Harold Miles 2021
doi:10.13023/etd.2021.135
|
web.archive.org [PDF]
|
via fuzzy
|
Queueing Network Controls via Deep Reinforcement Learning
[article]
J. G. Dai, Mark Gluzman 2020
pre-print
version:v5
arXiv:2008.01644v5
|
web.archive.org [PDF]
|
via fuzzy
|
A Visual Communication Map for Multi-Agent Deep Reinforcement Learning
[article]
Ngoc Duy Nguyen, Thanh Thi Nguyen, Saeid Nahavandi 2020
pre-print
version:v1
arXiv:2002.11882v1
|
web.archive.org [PDF]
|
via fuzzy
|
Learning Implicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning
[article]
Meng Zhou, Ziyu Liu, Pengwei Sui, Yixuan Li, Yuk Ying Chung 2020
pre-print
version:v2
arXiv:2007.02529v2
|
web.archive.org [PDF]
|
via fuzzy
|
Transfer Reinforcement Learning under Unobserved Contextual Information
[article]
Yan Zhang, Michael M. Zavlanos 2020
pre-print
version:v1
arXiv:2003.04427v1
|
web.archive.org [PDF]
|
via fuzzy
|
Approximate information state for approximate planning and reinforcement learning in partially observed systems
[article]
Jayakumar Subramanian, Amit Sinha, Raihan Seraj, Aditya Mahajan 2020
pre-print
version:v1
arXiv:2010.08843v1
|
web.archive.org [PDF]
|
via fuzzy
|
Enforcing Almost-Sure Reachability in POMDPs
[article]
Sebastian Junges, Nils Jansen, Sanjit A. Seshia 2020
pre-print
version:v2
arXiv:2007.00085v2
|
web.archive.org [PDF]
|
via fuzzy
|
Reinforcement Learning in Healthcare: A Survey
[article]
Chao Yu, Jiming Liu, Shamim Nemati 2020
pre-print
version:v4
arXiv:1908.08796v4
|
web.archive.org [PDF]
|
via fuzzy
|
Reinforcement Learning for Intelligent Healthcare Applications: A Survey
Antonio Coronato, Muddasar Naeem, Giuseppe De Pietro, Giovanni Paragliola 2020
Artificial Intelligence in Medicine
doi:10.1016/j.artmed.2020.101964
pmid:34756216
|
|
via fuzzy
|
Domain Concretization from Examples: Addressing Missing Domain Knowledge via Robust Planning
[article]
Akshay Sharma, Piyush Rajesh Medikeri, Yu Zhang 2020
pre-print
version:v1
arXiv:2011.09034v1
|
web.archive.org [PDF]
|
via fuzzy
|
A Traffic Simulation Model with Interactive Drivers and High-fidelity Car Dynamics
Guankun Su, Nan Li, Yildiray Yildiz, Anouck Girard, Ilya Kolmanovsky 2019
IFAC-PapersOnLine
doi:10.1016/j.ifacol.2019.01.010
|
web.archive.org [PDF]
|
via fuzzy
|
Modeling Cyber-Physical Human Systems via an Interplay Between
Reinforcement Learning and Game Theory
[article]
Mert Albaba, Yildiray Yildiz 2019
pre-print
version:v1
arXiv:1910.05092v1
|
web.archive.org [PDF]
|
via fuzzy
|
Stochastic Dynamic Programming in DASH
Koffka Khan, Wayne Goodridge 2019
International journal of advanced networking and applications
doi:10.35444/ijana.2019.11031
|
web.archive.org [PDF]
|
via fuzzy
|
The Power of d Choices in Scheduling for Data Centers with Heterogeneous
Servers
[article]
Amir Moaddeli, Iman Nabati Ahmadi, Negin Abhar 2019
pre-print
version:v1
arXiv:1904.00447v1
|
web.archive.org [PDF]
|
via fuzzy
|
II-Learn—A Novel Metric for Measuring the Intelligence Increase and Evolution of Artificial Learning Systems
László Barna Iantovics, Dimitris K. Iakovidis, Elena Nechita 2019
International Journal of Computational Intelligence Systems
doi:10.2991/ijcis.d.191101.001
|
web.archive.org [PDF]
|
via fuzzy
|
Robust Federated Learning Through Representation Matching and Adaptive
Hyper-parameters
[article]
Hesham Mostafa 2019
pre-print
version:v1
arXiv:1912.13075v1
|
web.archive.org [PDF]
|
via fuzzy
|
Using Deep Q-Learning to Prolong the Lifetime of Correlated Internet of Things Devices
Jernej Hribar, Andrei Marinescu, George A. Ropokis, Luiz A. DaSilva 2019
IEEE International Conference on Communications
doi:10.1109/iccw.2019.8756759
dblp:conf/icc/HribarMRD19
|
web.archive.org [PDF]
|
via fuzzy
|
Transformer Based Reinforcement Learning For Games
[article]
Uddeshya Upadhyay, Nikunj Shah, Sucheta Ravikanti, Mayanka Medhe 2019
pre-print
version:v1
arXiv:1912.03918v1
|
web.archive.org [PDF]
|
via fuzzy
|
Machine Learning for 5G/B5G Mobile and Wireless Communications: Potential, Limitations, and Future Directions
Manuel Eugenio Morocho-Cayamcela, Haeyoung Lee, Wansu Lim 2019
IEEE Access
doi:10.1109/access.2019.2942390
|
web.archive.org [PDF]
|
via fuzzy
|
Multiple Landmark Detection using Multi-Agent Reinforcement Learning
[article]
Athanasios Vlontzos, Amir Alansary, Konstantinos Kamnitsas, Daniel
Rueckert, Bernhard Kainz 2019
pre-print
version:v2
arXiv:1907.00318v2
|
web.archive.org [PDF]
|