1

In order to model a card game, as an exercise, I was thinking of an elementary setting as a multiarmed bandit, each lever being the distribution of expected rewards of a specific card.

But, of course, the player only has some cards in the hand each round, or, equivalently, for a given round, it has available a number $n$ of arms randomly selected from the total number $N$ of levers.

Is this just a "contextual bandit" or has it some specific, narrower, name that I could use to look up in the literature?

nbro
  • 42,615
  • 12
  • 119
  • 217
arivero
  • 51
  • 7

0 Answers0