Showing 1–2 of 2 results for author: Della Penna, N

Searching in archive stat.

  1. arXiv:1602.02852  [pdf, other]

    stat.ML cs.LG

    Compliance-Aware Bandits

    Authors: Nicolás Della Penna, Mark D. Reid, David Balduzzi

    Abstract: Motivated by clinical trials, we study bandits with observable noncompliance. At each step, the learner chooses an arm; afterwards, instead of observing only the reward, it also observes the action that took place. We show that such noncompliance can be helpful or hurtful to the learner in general. Unfortunately, naively incorporating compliance information into bandit algorithms loses guarantees on s…

    Submitted 8 February, 2016; originally announced February 2016.

  2. arXiv:1112.0076  [pdf, other]

    q-fin.TR cs.GT stat.ML

    Bandit Market Makers

    Authors: Nicolas Della Penna, Mark D. Reid

    Abstract: We introduce a modular framework for market making. It combines cost-function based automated market makers with bandit algorithms. We obtain worst-case profit guarantees relative to the best in hindsight within a class of natural "overround" cost functions. This combination allows us to have distribution-free guarantees on the regret of profits while preserving the bounded worst-case losses and…

    Submitted 1 August, 2013; v1 submitted 30 November, 2011; originally announced December 2011.

    Comments: A previous version of this work appeared in the NIPS 2011 Workshop on Computational Social Science and the Wisdom of the Crowds