
I recently tried to reproduce the results of double Q-learning, but my results are not satisfying. I compared double Q-learning with standard Q-learning in Taxi-v3, FrozenLake with slipperiness disabled, Roulette-v0, and a few other environments, and Q-learning outperforms double Q-learning in all of them.

I am not sure whether there is something wrong with my implementation, as many materials about double Q-learning actually focus on double DQN. While checking my code, I also wondered: is there any toy example that clearly demonstrates the advantage of double Q-learning?
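For context, here is a minimal sketch of the tabular double Q-learning update as I understand it from van Hasselt (2010); the dict-based tables and the four-action set are just illustrative assumptions, not my actual code:

```python
import random

def double_q_update(Q1, Q2, s, a, r, s_next, done,
                    actions=(0, 1, 2, 3), alpha=0.1, gamma=0.99):
    """One tabular double Q-learning step (van Hasselt, 2010).

    Q1, Q2: dicts mapping (state, action) -> value.
    With probability 0.5 we update Q1, using Q2 to evaluate the
    greedy next action; otherwise the roles are swapped.
    """
    if random.random() < 0.5:
        update, evaluate = Q1, Q2
    else:
        update, evaluate = Q2, Q1
    if done:
        target = r
    else:
        # Greedy action chosen by the table being updated...
        a_star = max(actions, key=lambda b: update.get((s_next, b), 0.0))
        # ...but evaluated by the other table; this decoupling is what
        # removes the maximisation bias of standard Q-learning.
        target = r + gamma * evaluate.get((s_next, a_star), 0.0)
    td_error = target - update.get((s, a), 0.0)
    update[(s, a)] = update.get((s, a), 0.0) + alpha * td_error
```

At decision time the greedy policy is taken with respect to `Q1[(s, a)] + Q2[(s, a)]` (or their average). Does this match the update everyone else is using, or am I missing a detail?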

David
