skip to main content
Invitado
Mi portal
Mi Cuenta
Cerrar sesión
Identificarse
This feature requires javascript
Tags
Periódicos Eletrónicos
Libros Eletrónicos
Bases de Datos
Bibliotecas de USP
Ayuda
Ayuda
Idioma:
Inglés
Castellano
Portugués (Brasil)
This feature required javascript
Search in:
Seleccione el índice para buscar
Search For:
Clear Search Box
Search in:
Por título
Por materia
Por autor
Browse vid input
Búsqueda Sencilla
This feature requires javascript
A Contextual Bandit Bake-off
Bietti, Alberto ; Agarwal, Alekh ; Langford, John
Journal of machine learning research, 2021-01, Vol.22 (133), p.1-49
[Revista revisada por pares]
Ithaca: Cornell University Library, arXiv.org
Texto completo disponible
Citas
Citado por
Recurso en línea
Detalles
Comentarios y Etiquetas
Servicios adicionales
Veces citado
This feature requires javascript
Acciones
Agregar a Mi Portal
Eliminar de Mi Portal
Correo Electrónico
Imprimir
Enlae permanente
Cita bibliográfica
EasyBib
EndNote
RefWorks
Delicious
Exportación RIS
Exportar BibTeX
This feature requires javascript
Título:
A Contextual Bandit Bake-off
Autor:
Bietti, Alberto
;
Agarwal, Alekh
;
Langford, John
Materias:
Algorithms
;
Computer
Science
;
Computer
Science
- Learning
;
Machine Learning
;
Statistics
;
Statistics - Machine Learning
Es parte de:
Journal of machine learning research, 2021-01, Vol.22 (133), p.1-49
Descripción:
Contextual bandit algorithms are essential for solving many real-world interactive machine learning problems. Despite multiple recent successes on statistically and computationally efficient methods, the practical behavior of these algorithms is still poorly understood. We leverage the availability of large numbers of supervised learning datasets to empirically evaluate contextual bandit algorithms, focusing on practical methods that learn by relying on optimization oracles from supervised learning. We find that a recent method (Foster et al., 2018) using optimism under uncertainty works the best overall. A surprisingly close second is a simple greedy baseline that only explores implicitly through the diversity of contexts, followed by a variant of Online Cover (Agarwal et al., 2014) which tends to be more conservative but robust to problem specification by design. Along the way, we also evaluate various components of contextual bandit algorithm design such as loss estimators. Overall, this is a thorough study and review of contextual bandit methodology.
Editor:
Ithaca: Cornell University Library, arXiv.org
Idioma:
Inglés
Enlaces
View paper in arXiv
View record in HAL
This feature requires javascript
This feature requires javascript
Volver a la lista de resultados
Anterior
Resultado
7
Siguiente
This feature requires javascript
This feature requires javascript
Buscando en bases de datos remotas, por favor espere
Buscando por
en
scope:(USP_VIDEOS),scope:("PRIMO"),scope:(USP_FISICO),scope:(USP_EREVISTAS),scope:(USP),scope:(USP_EBOOKS),scope:(USP_PRODUCAO),primo_central_multiple_fe
Mostrar lo que tiene hasta ahora
This feature requires javascript
This feature requires javascript