Result Number | Material Type | Record Details |
---|---|---|
1 | Article | IW-GAE: Importance Weighted Group Accuracy Estimation for Improved Calibration and Model Selection in Unsupervised Domain Adaptation. Joo, Taejong; Klabjan, Diego. arXiv.org, 2024-07. Ithaca: Cornell University Library, arXiv.org. Full text available. |
2 | Article | On the Second-Order Convergence of Biased Policy Gradient Algorithms. Mu, Siqiao; Klabjan, Diego. arXiv.org, 2024-05. Ithaca: Cornell University Library, arXiv.org. Full text available. |
3 | Article | Regret Bounds and Reinforcement Learning Exploration of EXP-based Algorithms. Xu, Mengfan; Klabjan, Diego. arXiv.org, 2024-05. Ithaca: Cornell University Library, arXiv.org. Full text available. |
4 | Article | Decentralized Blockchain-based Robust Multi-agent Multi-armed Bandit. Xu, Mengfan; Klabjan, Diego. arXiv.org, 2024-02. Ithaca: Cornell University Library, arXiv.org. Full text available. |
5 | Article | Robust softmax aggregation on blockchain based federated learning with convergence guarantee. Wu, Huiyu; Klabjan, Diego. arXiv.org, 2023-11. Ithaca: Cornell University Library, arXiv.org. Full text available. |
6 | Article | Decentralized Randomly Distributed Multi-agent Multi-armed Bandit with Heterogeneous Rewards. Xu, Mengfan; Klabjan, Diego. arXiv.org, 2023-10. Ithaca: Cornell University Library, arXiv.org. Full text available. |
7 | Article | Semi-supervised 3D Video Information Retrieval with Deep Neural Network and Bi-directional Dynamic-time Warping Algorithm. Ma, Yintai; Klabjan, Diego. arXiv.org, 2023-09. Ithaca: Cornell University Library, arXiv.org. Full text available. |
8 | Article | Regret Lower Bounds in Multi-agent Multi-armed Bandit. Xu, Mengfan; Klabjan, Diego. arXiv.org, 2023-08. Ithaca: Cornell University Library, arXiv.org. Full text available. |
9 | Article | An Ensemble Method of Deep Reinforcement Learning for Automated Cryptocurrency Trading. Wang, Shuyang; Klabjan, Diego. arXiv.org, 2023-07. Ithaca: Cornell University Library, arXiv.org. Full text available. |
10 | Article | Pareto Regret Analyses in Multi-objective Multi-armed Bandit. Xu, Mengfan; Klabjan, Diego. arXiv.org, 2023-05. Ithaca: Cornell University Library, arXiv.org. Full text available. |