Cited By
View all- Shimizu TTanaka KKishimoto RKiyohara HNomura MSaito Y(2024)Effective Off-Policy Evaluation and Learning in Contextual Combinatorial BanditsProceedings of the 18th ACM Conference on Recommender Systems10.1145/3640457.3688099(733-741)Online publication date: 8-Oct-2024
- Saito YAbdollahpouri HAnderton JCarterette BLalmas MChua TNgo CKa-Wei Lee RKumar RLauw H(2024)Long-term Off-Policy Evaluation and LearningProceedings of the ACM Web Conference 202410.1145/3589334.3645446(3432-3443)Online publication date: 13-May-2024
- Kiyohara HNomura MSaito YChua TNgo CKa-Wei Lee RKumar RLauw H(2024)Off-Policy Evaluation of Slate Bandit Policies via Optimizing AbstractionProceedings of the ACM Web Conference 202410.1145/3589334.3645343(3150-3161)Online publication date: 13-May-2024