5

I have a custom environment for stock trading where an episode can be as long as 2000-3000 steps. I've run several experiments with td3 and sac algorithms, average reward per episode flattens after few episodes. I believe average reward per episode should further improve, so I thought whether my training episode is too long. What is the recommended upper limit on the episode length?

nbro
  • 42,615
  • 12
  • 119
  • 217
Mika
  • 371
  • 2
  • 10

0 Answers0