Result Number | Material Type | Add to My Shelf Action | Record Details and Options |
---|---|---|---|
1 | Article | | A Contextual Bandit Bake-off. Bietti, Alberto; Agarwal, Alekh; Langford, John. Journal of machine learning research, 2021-01, Vol. 22 (133), p. 1-49. [Peer-reviewed journal] Ithaca: Cornell University Library, arXiv.org. Full text available |
2 | Article | | On the Approximation of Cooperative Heterogeneous Multi-Agent Reinforcement Learning (MARL) using Mean Field Control (MFC). Washim Uddin Mondal; Agarwal, Mridul; Aggarwal, Vaneet; Ukkusuri, Satish V. arXiv.org, 2022-01. [Peer-reviewed journal] Ithaca: Cornell University Library, arXiv.org. Full text available |