For questions about discrete action spaces in the context of reinforcement learning (or other artificial intelligence sub-fields). There is also the tag for continuous action spaces.
Questions tagged [discrete-action-spaces]
9 questions
16
votes
3 answers
How to implement a variable action space in Proximal Policy Optimization?
I'm coding a Proximal Policy Optimization (PPO) agent with the Tensorforce library (which is built on top of TensorFlow).
The first environment was very simple. Now, I'm diving into a more complex environment, where all the actions are not available…
Max
- 163
- 1
- 6
6
votes
1 answer
What techniques are used to make MDP discrete state space manageable?
Generating a discretized state space for an MDP (Markov Decision Process) model seems to suffer from the curse of dimensionality.
Supposed my state has a few simple features:
Feeling: Happy/Neutral/Sad
Feeling: Hungry/Neither/Full
Food left:…
Brendan Hill
- 263
- 1
- 6
4
votes
1 answer
Can a large discrete action space be represented using Gaussian distributions?
I have a large 1D action space, e.g. dim(A)=2000-10000. Can I use continuous action space where I could learn the mean and std of the Gaussian distributions that I would use to sample action from and round the value to the nearest integer? If yes,…
Mika
- 371
- 2
- 10
3
votes
1 answer
Extend the loss function from the single action to the n-action case per time step
My question concerns a side question (which was not answered) asked here:
How can policy gradients be applied in the case of multiple continuous actions?
I am trying to implement a simple policy gradient algorithm for a discrete multi-action…
Andreas Serov
- 41
- 2
2
votes
1 answer
PPO: multiple discrete actions per step, one depends on the other
I have a custom PPO implementation, and it works fine, but I need to add to it the ability to select 2 actions per turn, one different in nature from the other, one dependent on the other.
Imagine that a turn a had 20 possible actions. Now, for each…
Antonis Karvelas
- 85
- 9
2
votes
1 answer
Is there a multi-agent deep reinforcement learning algorithm which is for environments with only discrete action spaces (Not hybrid)?
Is there a multi-agent deep reinforcement learning algorithm which is for environments with only discrete action spaces (Not hybrid) and have centralized training?
I have been looking for algorithms, (A2C, MADDPG etc.) but still havent find any…
Uur Kn
- 21
- 1
1
vote
1 answer
Model-based RL algorithms for continuous state space and finite action space
At the beginning, if I have a complete model $p(s' \mid s, a)$ (an assumed true model that describes the environment well enough) and the reward function $r(s,a,s')$. How can I exploit the model and learn a good policy in this situation? Assume that…
k2pctdn
- 157
- 5
1
vote
0 answers
What adapts an algorithm to continuous or to discrete action spaces?
Some RL algorithms can only be used for environments with continuous action spaces (e.g TD3, SAC), while others only for discrete action spaces (DQN), and some for both
REINFORCE and other policy gradient variants have the choice of using a…
mugoh
- 549
- 4
- 21
0
votes
0 answers
How to implement RL model with increasing dimensions of state space and action space?
I've read in this discussion that "reinforcement learning is a way of finding the value function of a Markov Decision Process".
I want to implement an RL model, whose state space and action space dimensions would increase, as the MDP progresses. But…
brzepkowski
- 141
- 4