2

Some people claim that DQN was used to play many Atari games. But what actually happened? Was DQN trained only once (with some data from all games) or was it trained separately for each game? What was common to all those games? Only the architecture of the RL agent? Did the reward function change for each game?

nbro
  • 42,615
  • 12
  • 119
  • 217
mason7663
  • 653
  • 4
  • 12

0 Answers0