skip to main content

Policy Gradients with Memory-Augmented Critic: Stabilizing Off-policy Policy Gradients via Differentiable Memory

Seno, Takuma ; Imai, Michita

Transactions of the Japanese Society for Artificial Intelligence, 2021-01, Vol.36 (1), p.B-K71_1-8

Texto completo disponível

Citações Citado por

Buscando em bases de dados remotas. Favor aguardar.