0

I am new in the field of RL. I am trying to use tabular methods, Q-Learning for solving a problem that takes a lot of time for computation, so I would like to know if there are more efficient methods for it.

Why are tabular methods are not useful in large state spaces? Maybe too many possible combinations? Could Softmax Action Selection be better than epsilon greedy?

Aquila
  • 33
  • 5

1 Answers1

1

Your question contains the answer. Use value function approximation. Tabular methods must compute a value for each state. That becomes unfeasible with large state spaces. Function approximators can genererlize, and perform well even without ever having seen every state.

chessprogrammer
  • 3,050
  • 2
  • 16
  • 26