In reinforcement learning, when we talk about the principle of optimality, do we assume the policy to be deterministic?
Asked
Active
Viewed 37 times
In reinforcement learning, when we talk about the principle of optimality, do we assume the policy to be deterministic?