Is there a way to do reinforcement learning in a POMDP?

Asked Oct 06 '19 at 17:09

Active Dec 19 '21 at 18:51

Viewed 390 times

Are there any reinforcement learning algorithms for learning optimal policies in a partially observable Markov decision process (POMDP), i.e., when the state is not perfectly observed? More specifically, how does one update the belief state using Bayes' rule when the update kernel Q is not known?
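
For reference, here is a minimal sketch of the standard Bayes belief update I have in mind, for a finite POMDP where the model is known. The names `belief_update`, `T` (transition kernel) and `O` (observation kernel), and the toy numbers, are my own assumptions for illustration and do not come from any particular library. The difficulty is that this computation needs both kernels, which is exactly what is unavailable in the learning setting.

```python
def belief_update(belief, action, observation, T, O):
    """One step of a discrete Bayes filter (assumes a known model).

    belief[s]      : current probability of hidden state s
    T[a][s][s2]    : assumed transition kernel P(s2 | s, a)
    O[a][s2][o]    : assumed observation kernel P(o | s2, a)
    """
    n = len(belief)
    # Prediction step: push the belief through the transition kernel.
    predicted = [
        sum(belief[s] * T[action][s][s2] for s in range(n))
        for s2 in range(n)
    ]
    # Correction step: weight each state by the likelihood of the observation.
    unnormalized = [O[action][s2][observation] * predicted[s2] for s2 in range(n)]
    total = sum(unnormalized)
    if total == 0.0:
        raise ValueError("observation has zero probability under the model")
    # Normalize so the new belief sums to one (Bayes' rule).
    return [p / total for p in unnormalized]


# Hypothetical 2-state, 1-action, 2-observation model.
T = [[[0.9, 0.1], [0.2, 0.8]]]
O = [[[0.8, 0.2], [0.3, 0.7]]]
new_belief = belief_update([0.5, 0.5], action=0, observation=0, T=T, O=O)
print(new_belief)
```

Every line of the update reads from `T` or `O`, so without them the belief cannot be computed this way.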

edited Dec 19 '21 at 18:51

nbro


asked Oct 06 '19 at 17:09

Deepanshu Vasal


0 Answers