Cited By
View all- Zhou THairi FYang HLiu JTong TYang FMomma MGao YSalakhutdinov RKolter ZHeller KWeller AOliver NScarlett JBerkenkamp F(2024)Finite-time convergence and sample complexity of actor-critic multi-objective reinforcement learningProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3694632(61913-61933)Online publication date: 21-Jul-2024
- Zhang GWang YChen XQian HZhan KWang BWooldridge MDy JNatarajan S(2024)UNEX-RLProceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial Intelligence10.1609/aaai.v38i8.28783(9305-9313)Online publication date: 20-Feb-2024
- Su SChen XWang YWu YZhang ZZhan KWang BGai K(2024)RPAF: A Reinforcement Prediction-Allocation Framework for Cache Allocation in Large-Scale Recommender SystemsProceedings of the 18th ACM Conference on Recommender Systems10.1145/3640457.3688128(670-679)Online publication date: 8-Oct-2024
- Show More Cited By