Cited By
View all- Kastner TErdogdu MFarahmand AOh ANaumann TGloberson ASaenko KHardt MLevine S(2023)Distributional model equivalence for risk-sensitive reinforcement learningProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3668589(56531-56552)Online publication date: 10-Dec-2023
- Skalse JFarrugia-Roberts MRussell SAbate AGleave AKrause ABrunskill ECho KEngelhardt BSabato SScarlett J(2023)Invariance in policy optimisation and partial identifiability in reward learningProceedings of the 40th International Conference on Machine Learning10.5555/3618408.3619736(32033-32058)Online publication date: 23-Jul-2023
- Rowland MTang YLyle CMunos RBellemare MDabney WKrause ABrunskill ECho KEngelhardt BSabato SScarlett J(2023)The statistical benefits of quantile temporal-difference learning for value estimationProceedings of the 40th International Conference on Machine Learning10.5555/3618408.3619622(29210-29231)Online publication date: 23-Jul-2023
- Show More Cited By