Do we assume the policy to be deterministic when proving optimality?

Asked Aug 18 '20 at 09:32

Active Aug 18 '20 at 10:18

In reinforcement learning, when we talk about the principle of optimality, do we assume the policy to be deterministic?

edited Aug 18 '20 at 10:18

nbro

asked Aug 18 '20 at 09:32

hakiki_makato

0 Answers