---
_id: '21536'
abstract:
- lang: eng
  text: "We consider a resource-aware variant of the classical multi-armed bandit\r\nproblem:
    In each round, the learner selects an arm and determines a resource\r\nlimit.
    It then observes a corresponding (random) reward, provided the (random)\r\namount
    of consumed resources remains below the limit. Otherwise, the\r\nobservation is
    censored, i.e., no reward is obtained. For this problem setting,\r\nwe introduce
    a measure of regret, which incorporates the actual amount of\r\nallocated resources
    of each learning round as well as the optimality of\r\nrealizable rewards. Thus,
    to minimize regret, the learner needs to set a\r\nresource limit and choose an
    arm in such a way that the chance to realize a\r\nhigh reward within the predefined
    resource limit is high, while the resource\r\nlimit itself should be kept as low
    as possible. We derive the theoretical lower\r\nbound on the cumulative regret
    and propose a learning algorithm having a regret\r\nupper bound that matches the
    lower bound. In a simulation study, we show that\r\nour learning algorithm outperforms
    straightforward extensions of standard\r\nmulti-armed bandit algorithms."
author:
- first_name: Viktor
  full_name: Bengs, Viktor
  last_name: Bengs
- first_name: Eyke
  full_name: Hüllermeier, Eyke
  last_name: Hüllermeier
citation:
  ama: Bengs V, Hüllermeier E. Multi-Armed Bandits with Censored Consumption of Resources.
    <i>arXiv:201100813</i>. 2020.
  apa: Bengs, V., &#38; Hüllermeier, E. (2020). Multi-Armed Bandits with Censored
    Consumption of Resources. <i>ArXiv:2011.00813</i>.
  bibtex: '@article{Bengs_Hüllermeier_2020, title={Multi-Armed Bandits with Censored
    Consumption of Resources}, journal={arXiv:2011.00813}, author={Bengs, Viktor and
    Hüllermeier, Eyke}, year={2020} }'
  chicago: Bengs, Viktor, and Eyke Hüllermeier. “Multi-Armed Bandits with Censored
    Consumption of Resources.” <i>ArXiv:2011.00813</i>, 2020.
  ieee: V. Bengs and E. Hüllermeier, “Multi-Armed Bandits with Censored Consumption
    of Resources,” <i>arXiv:2011.00813</i>. 2020.
  mla: Bengs, Viktor, and Eyke Hüllermeier. “Multi-Armed Bandits with Censored Consumption
    of Resources.” <i>ArXiv:2011.00813</i>, 2020.
  short: V. Bengs, E. Hüllermeier, ArXiv:2011.00813 (2020).
date_created: 2021-03-18T11:27:37Z
date_updated: 2022-01-06T06:55:03Z
department:
- _id: '34'
- _id: '7'
- _id: '355'
language:
- iso: eng
project:
- _id: '52'
  name: Computing Resources Provided by the Paderborn Center for Parallel Computing
publication: arXiv:2011.00813
status: public
title: Multi-Armed Bandits with Censored Consumption of Resources
type: preprint
user_id: '76599'
year: '2020'
...
