Cited By
View all- Wei WWang DLi LLiang J(2024)Re-attentive experience replay in off-policy reinforcement learningMachine Language10.1007/s10994-023-06505-8113:5(2327-2349)Online publication date: 1-May-2024
Context: In safety-critical systems, an effective safety analysis produces high-quality safety requirements and ensures a safe product from an early stage. Motivation: In safety-critical industries, safety analysis happens mostly in groups. The ...
As an important approach to solving complex sequential decision problems, reinforcement learning (RL) has been widely studied in the community of artificial intelligence and machine learning. However, the generalization ability of RL is still an open ...
A reinforcement architecture is introduced that consists of three complementary learning systems with different generalization abilities. The ACTOR learns state-action associations, the CRITIC learns a goal-gradient, and the PUNISH system learns what ...
Elsevier Science Publishers Ltd.
United Kingdom