Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–5 of 5 results for author: Marecki, J

.
  1. arXiv:2404.08755  [pdf, other

    cs.LG cs.AI cs.CV cs.HC

    Training a Vision Language Model as Smartphone Assistant

    Authors: Nicolai Dorka, Janusz Marecki, Ammar Anwar

    Abstract: Addressing the challenge of a digital assistant capable of executing a wide array of user tasks, our research focuses on the realm of instruction-based mobile device control. We leverage recent advancements in large language models (LLMs) and present a visual language model (VLM) that can fulfill diverse tasks on mobile devices. Our model functions by interacting solely with the user interface (UI… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: ICLR 2024 workshop on Generative Models for Decision Making

  2. arXiv:2201.01816  [pdf, other

    cs.AI cs.LG cs.MA

    Hidden Agenda: a Social Deduction Game with Diverse Learned Equilibria

    Authors: Kavya Kopparapu, Edgar A. Duéñez-Guzmán, Jayd Matyas, Alexander Sasha Vezhnevets, John P. Agapiou, Kevin R. McKee, Richard Everett, Janusz Marecki, Joel Z. Leibo, Thore Graepel

    Abstract: A key challenge in the study of multiagent cooperation is the need for individual agents not only to cooperate effectively, but to decide with whom to cooperate. This is particularly critical in situations when other agents have hidden, possibly misaligned motivations and goals. Social deduction games offer an avenue to study how individuals might learn to synthesize potentially unreliable informa… ▽ More

    Submitted 5 January, 2022; originally announced January 2022.

  3. arXiv:1702.03037  [pdf, other

    cs.MA cs.AI cs.GT cs.LG

    Multi-agent Reinforcement Learning in Sequential Social Dilemmas

    Authors: Joel Z. Leibo, Vinicius Zambaldi, Marc Lanctot, Janusz Marecki, Thore Graepel

    Abstract: Matrix games like Prisoner's Dilemma have guided research on social dilemmas for decades. However, they necessarily treat the choice to cooperate or defect as an atomic action. In real-world social dilemmas these choices are temporally extended. Cooperativeness is a property that applies to policies, not elementary actions. We introduce sequential social dilemmas that share the mixed incentive str… ▽ More

    Submitted 9 February, 2017; originally announced February 2017.

    Comments: 10 pages, 7 figures

  4. arXiv:1309.6857  [pdf

    cs.AI

    Solution Methods for Constrained Markov Decision Process with Continuous Probability Modulation

    Authors: Marek Petrik, Dharmashankar Subramanian, Janusz Marecki

    Abstract: We propose solution methods for previously-unsolved constrained MDPs in which actions can continuously modify the transition probabilities within some acceptable sets. While many methods have been proposed to solve regular MDPs with large state sets, there are few practical approaches for solving constrained MDPs with large action sets. In particular, we show that the continuous action sets can be… ▽ More

    Submitted 26 September, 2013; originally announced September 2013.

    Comments: Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)

    Report number: UAI-P-2013-PG-518-526

  5. arXiv:1203.3470  [pdf

    cs.AI

    ALARMS: Alerting and Reasoning Management System for Next Generation Aircraft Hazards

    Authors: Alan S. Carlin, Nathan Schurr, Janusz Marecki

    Abstract: The Next Generation Air Transportation System will introduce new, advanced sensor technologies into the cockpit. With the introduction of such systems, the responsibilities of the pilot are expected to dramatically increase. In the ALARMS (ALerting And Reasoning Management System) project for NASA, we focus on a key challenge of this environment, the quick and efficient handling of aircraft sensor… ▽ More

    Submitted 15 March, 2012; originally announced March 2012.

    Comments: Appears in Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence (UAI2010)

    Report number: UAI-P-2010-PG-93-100