s2ac: energy-based reinforcement learning with stein soft actor critic

AllNews Videos Images Maps Shopping Books

Search tools

Past month

Any time
Past hour
Past 24 hours
Past week
Past month
Past year

All results

All results
Verbatim

Clear

A Max-Min Entropy Framework for Reinforcement Learning - arxiv-sanity

arxiv-sanity-lite.com › ...

7 days ago · Soft Actor-Critic (SAC) is one of the state-of-the-art off-policy reinforcement learning (RL) algorithms that is within the maximum entropy based RL framework.

similar - arxiv-sanity

arxiv-sanity-lite.com › ...

Jul 19, 2024 · Soft Actor-Critic (SAC) is one of the state-of-the-art off-policy reinforcement learning (RL) algorithms that is within the maximum entropy based RL framework.

In order to show you the most relevant results, we have omitted some entries very similar to the 2 already displayed. If you like, you can repeat the search with the omitted results included.