Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleOctober 2024
CSO: Constraint-Guided Space Optimization for Active Scene Mapping
MM '24: Proceedings of the 32nd ACM International Conference on MultimediaPages 5015–5024https://doi.org/10.1145/3664647.3681066Simultaneously mapping and exploring a complex unknown scene is an NP-hard problem, which is still challenging with the rapid development of deep learning techniques. We present CSO, a deep reinforcement learning-based framework for efficient active ...
- research-articleMay 2024
HiFI: Hierarchical Fairness-aware Integrated Ranking with Constrained Reinforcement Learning
WWW '24: Companion Proceedings of the ACM Web Conference 2024Pages 196–205https://doi.org/10.1145/3589335.3648317Integrated ranking is a critical component in industrial recommendation platforms. It combines candidate lists from different upstream channels or sources and ranks them into an integrated list, which will be exposed to users. During this process, to ...
- research-articleMay 2024
Relaxed Exploration Constrained Reinforcement Learning
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 1727–1735This research introduces a novel setting for reinforcement learning with constraints, termed Relaxed Exploration Constrained Reinforcement Learning (RECRL). Similar to standard constrained reinforcement learning (CRL), the objective in RECRL is to ...
- posterMay 2023
Relaxed Exploration Constrained Reinforcement Learning
AAMAS '23: Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent SystemsPages 2821–2823This extended abstract introduces a novel setting of reinforcement learning with constraints, called Relaxed Exploration Constrained Reinforcement Learning (RECRL). As in standard constrained reinforcement learning (CRL), the aim is to find a policy that ...
- research-articleApril 2023
Two-Stage Constrained Actor-Critic for Short Video Recommendation
- Qingpeng Cai,
- Zhenghai Xue,
- Chi Zhang,
- Wanqi Xue,
- Shuchang Liu,
- Ruohan Zhan,
- Xueliang Wang,
- Tianyou Zuo,
- Wentao Xie,
- Dong Zheng,
- Peng Jiang,
- Kun Gai
WWW '23: Proceedings of the ACM Web Conference 2023Pages 865–875https://doi.org/10.1145/3543507.3583259The wide popularity of short videos on social media poses new opportunities and challenges to optimize recommender systems on the video-sharing platforms. Users sequentially interact with the system and provide complex and multi-faceted responses, ...
- research-articleJuly 2022
SCALES: From Fairness Principles to Constrained Decision-Making
AIES '22: Proceedings of the 2022 AAAI/ACM Conference on AI, Ethics, and SocietyPages 46–55https://doi.org/10.1145/3514094.3534190This paper proposes SCALES, a general framework that translates well-established fairness principles into a common representation based on the Constraint Markov Decision Process (CMDP). With the help of causal language, our framework can place ...
- research-articleMay 2019
Actor-Critic Algorithms for Constrained Multi-agent Reinforcement Learning
AAMAS '19: Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent SystemsPages 1931–1933Multi-agent reinforcement learning has gained lot of popularity primarily owing to the success of deep function approximation architectures. However, many real-life multi-agent applications often impose constraints on the joint action sequence that can ...
- research-articleJuly 2018
Improving Learning & Reducing Time: A Constrained Action-Based Reinforcement Learning Approach
UMAP '18: Proceedings of the 26th Conference on User Modeling, Adaptation and PersonalizationPages 43–51https://doi.org/10.1145/3209219.3209232Constrained action-based decision-making is one of the most challenging decision-making problems. It refers to a scenario where an agent takes action in an environment not only to maximize the expected cumulative reward but where it is subject to ...