Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

T Haarnoja, A Zhou, P Abbeel, S Levine - arXiv preprint arXiv:1801.01290, 2018
