Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleJanuary 2025
Truly no-regret learning in constrained MDPs
ICML'24: Proceedings of the 41st International Conference on Machine LearningArticle No.: 1489, Pages 36605–36653Constrained Markov decision processes (CMDPs) are a common way to model safety constraints in reinforcement learning. State-of-the-art methods for efficiently solving CMDPs are based on primal-dual algorithms. For these algorithms, all currently known ...
- research-articleMay 2024
Provably Learning Nash Policies in Constrained Markov Potential Games
Multi-agent reinforcement learning addresses sequential decision-making problems with multiple agents, where each agent optimizes its own objective. In many real-world scenarios, agents not only aim to maximize their goals but also need to ensure safe ...
- research-articleMay 2024
On imitation in mean-field games
NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 1757, Pages 40426–40437We explore the problem of imitation learning (IL) in the context of mean-field games (MFGs), where the goal is to imitate the behavior of a population of agents following a Nash equilibrium policy according to some unknown payoff function. IL in MFGs ...
- research-articleApril 2024
Trust region policy optimization with optimal transport discrepancies: duality and algorithm for continuous actions
NIPS '22: Proceedings of the 36th International Conference on Neural Information Processing SystemsArticle No.: 1438, Pages 19786–19797Policy Optimization (PO) algorithms have been proven particularly suited to handle the high-dimensionality of real-world continuous control tasks. In this context, Trust Region Policy Optimization methods represent a popular approach to stabilize the ...
- research-articleApril 2024
Active exploration for inverse reinforcement learning
NIPS '22: Proceedings of the 36th International Conference on Neural Information Processing SystemsArticle No.: 423, Pages 5843–5853Inverse Reinforcement Learning (IRL) is a powerful paradigm for inferring a reward function from expert demonstrations. Many IRL algorithms require a known transition model and sometimes even a known expert policy, or they at least require access to a ...
- research-articleMay 2022
Learning on streaming graphs with experience replay
SAC '22: Proceedings of the 37th ACM/SIGAPP Symposium on Applied ComputingPages 470–478https://doi.org/10.1145/3477314.3507113Graph Neural Networks (GNNs) have recently achieved good performance in many predictive tasks involving graph-structured data. However, the majority of existing models consider static graphs only and do not support training on graph streams. While ...
- research-articleJune 2024
Learning in non-cooperative configurable Markov decision processes
NIPS '21: Proceedings of the 35th International Conference on Neural Information Processing SystemsArticle No.: 1747, Pages 22808–22821The Configurable Markov Decision Process framework includes two entities: a Reinforcement Learning agent and a configurator that can modify some environmental parameters to improve the agent's performance. This presupposes that the two actors have ...
- research-articleSeptember 2021
Dealing with multiple experts and non-stationarity in inverse reinforcement learning: an application to real-life problems
- Amarildo Likmeta,
- Alberto Maria Metelli,
- Giorgia Ramponi,
- Andrea Tirinzoni,
- Matteo Giuliani,
- Marcello Restelli
Machine Language (MALE), Volume 110, Issue 9Pages 2541–2576https://doi.org/10.1007/s10994-020-05939-8AbstractIn real-world applications, inferring the intentions of expert agents (e.g., human operators) can be fundamental to understand how possibly conflicting objectives are managed, helping to interpret the demonstrated behavior. In this paper, we ...
- research-articleDecember 2020
Inverse reinforcement learning from a gradient-based learner
NIPS '20: Proceedings of the 34th International Conference on Neural Information Processing SystemsArticle No.: 207, Pages 2458–2468Inverse Reinforcement Learning addresses the problem of inferring an expert's reward function from demonstrations. However, in many applications, we not only have access to the expert's near-optimal behaviour, but we also observe part of her learning ...
- short-paperJanuary 2020
Assigning users to domains of interest based on content and network similarity with champion instances
ASONAM '19: Proceedings of the 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and MiningPages 589–592https://doi.org/10.1145/3341161.3343687In this paper, we propose two approaches to the problem of finding similar users to a set of champions representing domains of interest on social media. The first approach is based on the content shared by the users, while the second one relies on the ...
- research-articleApril 2019
Vocabulary-based community detection and characterization
SAC '19: Proceedings of the 34th ACM/SIGAPP Symposium on Applied ComputingPages 1043–1050https://doi.org/10.1145/3297280.3297384With the increase of digital interaction, social networks are becoming an essential ingredient of our life, by progressively becoming the dominant media, e.g. in influencing political choices. Interaction within social networks tends to take place within ...
- research-articleApril 2018
Iterative Knowledge Extraction from Social Networks
WWW '18: Companion Proceedings of the The Web Conference 2018Pages 1359–1364https://doi.org/10.1145/3184558.3191578Knowledge in the world continuously evolves, and ontologies are largely incomplete, especially regarding data belonging to the so-called long tail. We propose a method for discovering emerging knowledge by extracting it from social content. Once ...
- demonstrationOctober 2016
JoyTag: a battery-less videogame controller exploiting RFID backscattering: demo
MobiCom '16: Proceedings of the 22nd Annual International Conference on Mobile Computing and NetworkingPages 515–516https://doi.org/10.1145/2973750.2985628This demo presents our experiences in developing a joystick for videogames that uses RFID backscattering for battery-free operation. Specifically, we develop a system to gather data from a wireless and battery-less joystick, named JoyTag, while it ...