Policy Gradients with Memory-Augmented Critic: Stabilizing Off-policy Policy Gradients via Differentiable Memory
Seno, Takuma ; Imai, Michita
Transactions of the Japanese Society for Artificial Intelligence, 2021-01, Vol.36 (1), p.B-K71_1-8
Full text available