skip to main content

A Contextual Bandit Bake-off

Bietti, Alberto ; Agarwal, Alekh ; Langford, John

Journal of machine learning research, 2021-01, Vol.22 (133), p.1-49 [Revista revisada por pares]

Ithaca: Cornell University Library, arXiv.org

Texto completo disponible

Citas Citado por
  • Título:
    A Contextual Bandit Bake-off
  • Autor: Bietti, Alberto ; Agarwal, Alekh ; Langford, John
  • Materias: Algorithms ; Computer Science ; Computer Science - Learning ; Machine Learning ; Statistics ; Statistics - Machine Learning
  • Es parte de: Journal of machine learning research, 2021-01, Vol.22 (133), p.1-49
  • Descripción: Contextual bandit algorithms are essential for solving many real-world interactive machine learning problems. Despite multiple recent successes on statistically and computationally efficient methods, the practical behavior of these algorithms is still poorly understood. We leverage the availability of large numbers of supervised learning datasets to empirically evaluate contextual bandit algorithms, focusing on practical methods that learn by relying on optimization oracles from supervised learning. We find that a recent method (Foster et al., 2018) using optimism under uncertainty works the best overall. A surprisingly close second is a simple greedy baseline that only explores implicitly through the diversity of contexts, followed by a variant of Online Cover (Agarwal et al., 2014) which tends to be more conservative but robust to problem specification by design. Along the way, we also evaluate various components of contextual bandit algorithm design such as loss estimators. Overall, this is a thorough study and review of contextual bandit methodology.
  • Editor: Ithaca: Cornell University Library, arXiv.org
  • Idioma: Inglés

Buscando en bases de datos remotas, por favor espere

  • Buscando por
  • enscope:(USP_VIDEOS),scope:("PRIMO"),scope:(USP_FISICO),scope:(USP_EREVISTAS),scope:(USP),scope:(USP_EBOOKS),scope:(USP_PRODUCAO),primo_central_multiple_fe
  • Mostrar lo que tiene hasta ahora