Most Popular
1500 questions
5
votes
2 answers
Variable Number of Inputs to Neural Networks
So suppose that you have a real estate appraisal problem. You have some structured data, and some images exterior of home, bedrooms, kitchen, etc. The number of pictures taken is variable per observational unit, i.e. the house.
I understand the…
Ryan
- 161
- 4
5
votes
1 answer
Approaches to poker tournament winner prediction?
I’ve done my research and could not find answer anywhere else. My apologies in advance if same problem is answered in different terms on stack-overflow.
I am trying to solve poker tournament winner prediction problem. I’ve millions of historical…
Tejas Patel
- 51
- 1
5
votes
1 answer
Can you analyse a neural network to determine good states?
I've developed a neural network that can play a card game. I now want to use it to create decks for the game. My first thought would be to run a lot of games with random decks and use some approximation (maybe just a linear approximation with a…
OrangeMan
- 237
- 1
- 5
5
votes
1 answer
Is the discount not needed in a deterministic environment for Reinforcement Learning?
I'm now reading a book titled as "Deep Reinforcement Learning Hands-On" and the author said the following on the chapter about AlphaGo Zero:
Self-play
In AlphaGo Zero, the NN is used to approximate the prior probabilities of the actions and…
Blaszard
- 1,097
- 4
- 11
- 25
5
votes
1 answer
Which other loss functions for hierarchical multi-label classification could I use?
I am looking to try different loss functions for a hierarchical multi-label classification problem. So far, I have been training different models or submodels like multilayer perceptron (MLP) branch inside a bigger model which deals with different…
Skinish
- 163
- 1
- 1
- 9
5
votes
1 answer
Is there a database somewhere of common lists?
I'm looking for a database or some machine readable document that contains common ordered lists or common short sets. e.g:
{January, February, March,...}
{Monday, Tuesday, ....}
{Red, Orange, Yellow,...}
{1,2,3,4,...}
{one, two, three,…
zooby
- 2,260
- 1
- 14
- 22
5
votes
2 answers
Is there a way to translate the concept of batch size into reinforcement learning?
I am using a neural network as my function approximator for reinforcement learning. In order to get it to train well, I need to choose a good learning rate. Hand-picking one is difficult, so I read up on methods of programmatically choosing a…
user10011538
- 51
- 1
- 2
5
votes
1 answer
How is the fitted Q-iteration algorithm related to $Q^*(s, a)$, and how can we use function approximation with this algorithm?
I hope to get some clarifications on Fitted Q-Iteration (FQI).
My Research So Far
I've read Sutton's book (specifically, ch 6 to 10), Ernst et al and this paper.
I know that $Q^*(s, a)$ expresses the expected value of first taking action $a$ from…
NoviceProg
- 153
- 4
5
votes
2 answers
How can I train a neural network for image classification when the dataset is small?
I need to train a convolutional neural network to classify snake images. The problem is that I have only a small number of images available for some snake types.
So, what is the best approach to train a neural network for image classification using…
Lakwin Chandula
- 51
- 3
5
votes
1 answer
What is the difference between policy and action in reinforcement learning?
I'm confused with the two terminology - action and policy - in Reinforcement Learning. As far as I know, the action is:
It is what the agent makes in a given state.
However, the book I'm reading now (Hands-On Reinforcement Learning with Python)…
Blaszard
- 1,097
- 4
- 11
- 25
5
votes
1 answer
How can I apply reinforcement learning to solve this asteroid game?
Introduction
An attractive asteroid game was described in the paper Learning Policies for Embodied Virtual Agents through Demonstration (2017, Jonathan Dinerstein et al.):
In our first experiment, the virtual agent is a spaceship pilot, The pilot's…
GIA
- 568
- 6
- 22
5
votes
1 answer
Can the optimal value of discount factor in Deep Reinforcement Learning be between 0.2 to 0.8?
I'm now reading a book titled as Hands-On Reinforcement Learning with Python, and the author explains the discount factor that is used in Reinforcement Learing to discount the future reward, with the following:
A discount factor of 0 will never…
Blaszard
- 1,097
- 4
- 11
- 25
5
votes
3 answers
Can ML/AI understand incomplete constructs like humans?
We have AI's predicting images, predicting objects in an image. Understanding audio, meaning of the audio if it is a spoken sentence.
In humans when we start seeing a movie halfway through, we still understand the entire movie (although this might…
user9947
5
votes
1 answer
Do we know what the units of neural networks will do before we train them?
I was learning about back-propagation and, looking at the algorithm, there is no particular 'partiality' given to any unit. What I mean by partiality there is that you have no particular characteristic associated with any unit, and this results in…
Htnamus
- 53
- 6
5
votes
1 answer
Is Christopher Bishop's "Pattern Recognition and Machine Learning" out of date in 2018?
I recently came across a reference to a book that was highly regarded: "Pattern Recognition and Machine Learning" by Christopher Bishop.
I am a beginner working my way through some machine learning courses on my own.
I'm curious if this book is…
Hanzy
- 519
- 3
- 11