Value-Difference Based Exploration: Adaptive Control between Epsilon-Greedy and Softmax
Tokic, Michel ; Palm, Günther Edelkamp, Stefan ; Bach, Joscha
KI 2011: Advances in Artificial Intelligence, p.335-346 [Periódico revisado por pares]Berlin, Heidelberg: Springer Berlin Heidelberg
Sem texto completo