skip to main content
Primo Advanced Search
Primo Advanced Search Query Term
Primo Advanced Search Query Term
Primo Advanced Search Query Term
Primo Advanced Search prefilters
Resultados 1 2 3 4 5 next page
Refinado por: Base de dados/Biblioteca: Freely Accessible Journals remover Base de dados/Biblioteca: ROAD: Directory of Open Access Scholarly Resources remover
Result Number Material Type Add to My Shelf Action Record Details and Options
1
IW-GAE: Importance Weighted Group Accuracy Estimation for Improved Calibration and Model Selection in Unsupervised Domain Adaptation
Material Type:
Artigo
Adicionar ao Meu Espaço

IW-GAE: Importance Weighted Group Accuracy Estimation for Improved Calibration and Model Selection in Unsupervised Domain Adaptation

Joo, Taejong ; Klabjan, Diego

arXiv.org, 2024-07

Ithaca: Cornell University Library, arXiv.org

Texto completo disponível

2
On the Second-Order Convergence of Biased Policy Gradient Algorithms
Material Type:
Artigo
Adicionar ao Meu Espaço

On the Second-Order Convergence of Biased Policy Gradient Algorithms

Mu, Siqiao ; Klabjan, Diego

arXiv.org, 2024-05

Ithaca: Cornell University Library, arXiv.org

Texto completo disponível

3
Regret Bounds and Reinforcement Learning Exploration of EXP-based Algorithms
Material Type:
Artigo
Adicionar ao Meu Espaço

Regret Bounds and Reinforcement Learning Exploration of EXP-based Algorithms

Xu, Mengfan ; Klabjan, Diego

arXiv.org, 2024-05

Ithaca: Cornell University Library, arXiv.org

Texto completo disponível

4
Decentralized Blockchain-based Robust Multi-agent Multi-armed Bandit
Material Type:
Artigo
Adicionar ao Meu Espaço

Decentralized Blockchain-based Robust Multi-agent Multi-armed Bandit

Xu, Mengfan ; Klabjan, Diego

arXiv.org, 2024-02

Ithaca: Cornell University Library, arXiv.org

Texto completo disponível

5
Robust softmax aggregation on blockchain based federated learning with convergence guarantee
Material Type:
Artigo
Adicionar ao Meu Espaço

Robust softmax aggregation on blockchain based federated learning with convergence guarantee

Wu, Huiyu ; Klabjan, Diego

arXiv.org, 2023-11

Ithaca: Cornell University Library, arXiv.org

Texto completo disponível

6
Decentralized Randomly Distributed Multi-agent Multi-armed Bandit with Heterogeneous Rewards
Material Type:
Artigo
Adicionar ao Meu Espaço

Decentralized Randomly Distributed Multi-agent Multi-armed Bandit with Heterogeneous Rewards

Xu, Mengfan ; Klabjan, Diego

arXiv.org, 2023-10

Ithaca: Cornell University Library, arXiv.org

Texto completo disponível

7
Semi-supervised 3D Video Information Retrieval with Deep Neural Network and Bi-directional Dynamic-time Warping Algorithm
Material Type:
Artigo
Adicionar ao Meu Espaço

Semi-supervised 3D Video Information Retrieval with Deep Neural Network and Bi-directional Dynamic-time Warping Algorithm

Ma, Yintai ; Klabjan, Diego

arXiv.org, 2023-09

Ithaca: Cornell University Library, arXiv.org

Texto completo disponível

8
Regret Lower Bounds in Multi-agent Multi-armed Bandit
Material Type:
Artigo
Adicionar ao Meu Espaço

Regret Lower Bounds in Multi-agent Multi-armed Bandit

Xu, Mengfan ; Klabjan, Diego

arXiv.org, 2023-08

Ithaca: Cornell University Library, arXiv.org

Texto completo disponível

9
An Ensemble Method of Deep Reinforcement Learning for Automated Cryptocurrency Trading
Material Type:
Artigo
Adicionar ao Meu Espaço

An Ensemble Method of Deep Reinforcement Learning for Automated Cryptocurrency Trading

Wang, Shuyang ; Klabjan, Diego

arXiv.org, 2023-07

Ithaca: Cornell University Library, arXiv.org

Texto completo disponível

10
Pareto Regret Analyses in Multi-objective Multi-armed Bandit
Material Type:
Artigo
Adicionar ao Meu Espaço

Pareto Regret Analyses in Multi-objective Multi-armed Bandit

Xu, Mengfan ; Klabjan, Diego

arXiv.org, 2023-05

Ithaca: Cornell University Library, arXiv.org

Texto completo disponível

Resultados 1 2 3 4 5 next page

Personalize Seus Resultados

  1. Editar

Refine Search Results

Expandir Meus Resultados

  1.   

Buscando em bases de dados remotas. Favor aguardar.