Most Popular

1500 questions
5
votes
2 answers

Variable Number of Inputs to Neural Networks

So suppose that you have a real estate appraisal problem. You have some structured data, and some images exterior of home, bedrooms, kitchen, etc. The number of pictures taken is variable per observational unit, i.e. the house. I understand the…
Ryan
  • 161
  • 4
5
votes
1 answer

Approaches to poker tournament winner prediction?

I’ve done my research and could not find answer anywhere else. My apologies in advance if same problem is answered in different terms on stack-overflow. I am trying to solve poker tournament winner prediction problem. I’ve millions of historical…
5
votes
1 answer

Can you analyse a neural network to determine good states?

I've developed a neural network that can play a card game. I now want to use it to create decks for the game. My first thought would be to run a lot of games with random decks and use some approximation (maybe just a linear approximation with a…
OrangeMan
  • 237
  • 1
  • 5
5
votes
1 answer

Is the discount not needed in a deterministic environment for Reinforcement Learning?

I'm now reading a book titled as "Deep Reinforcement Learning Hands-On" and the author said the following on the chapter about AlphaGo Zero: Self-play In AlphaGo Zero, the NN is used to approximate the prior probabilities of the actions and…
Blaszard
  • 1,097
  • 4
  • 11
  • 25
5
votes
1 answer

Which other loss functions for hierarchical multi-label classification could I use?

I am looking to try different loss functions for a hierarchical multi-label classification problem. So far, I have been training different models or submodels like multilayer perceptron (MLP) branch inside a bigger model which deals with different…
5
votes
1 answer

Is there a database somewhere of common lists?

I'm looking for a database or some machine readable document that contains common ordered lists or common short sets. e.g: {January, February, March,...} {Monday, Tuesday, ....} {Red, Orange, Yellow,...} {1,2,3,4,...} {one, two, three,…
zooby
  • 2,260
  • 1
  • 14
  • 22
5
votes
2 answers

Is there a way to translate the concept of batch size into reinforcement learning?

I am using a neural network as my function approximator for reinforcement learning. In order to get it to train well, I need to choose a good learning rate. Hand-picking one is difficult, so I read up on methods of programmatically choosing a…
5
votes
1 answer

How is the fitted Q-iteration algorithm related to $Q^*(s, a)$, and how can we use function approximation with this algorithm?

I hope to get some clarifications on Fitted Q-Iteration (FQI). My Research So Far I've read Sutton's book (specifically, ch 6 to 10), Ernst et al and this paper. I know that $Q^*(s, a)$ expresses the expected value of first taking action $a$ from…
5
votes
2 answers

How can I train a neural network for image classification when the dataset is small?

I need to train a convolutional neural network to classify snake images. The problem is that I have only a small number of images available for some snake types. So, what is the best approach to train a neural network for image classification using…
5
votes
1 answer

What is the difference between policy and action in reinforcement learning?

I'm confused with the two terminology - action and policy - in Reinforcement Learning. As far as I know, the action is: It is what the agent makes in a given state. However, the book I'm reading now (Hands-On Reinforcement Learning with Python)…
Blaszard
  • 1,097
  • 4
  • 11
  • 25
5
votes
1 answer

How can I apply reinforcement learning to solve this asteroid game?

Introduction An attractive asteroid game was described in the paper Learning Policies for Embodied Virtual Agents through Demonstration (2017, Jonathan Dinerstein et al.): In our first experiment, the virtual agent is a spaceship pilot, The pilot's…
GIA
  • 568
  • 6
  • 22
5
votes
1 answer

Can the optimal value of discount factor in Deep Reinforcement Learning be between 0.2 to 0.8?

I'm now reading a book titled as Hands-On Reinforcement Learning with Python, and the author explains the discount factor that is used in Reinforcement Learing to discount the future reward, with the following: A discount factor of 0 will never…
Blaszard
  • 1,097
  • 4
  • 11
  • 25
5
votes
3 answers

Can ML/AI understand incomplete constructs like humans?

We have AI's predicting images, predicting objects in an image. Understanding audio, meaning of the audio if it is a spoken sentence. In humans when we start seeing a movie halfway through, we still understand the entire movie (although this might…
user9947
5
votes
1 answer

Do we know what the units of neural networks will do before we train them?

I was learning about back-propagation and, looking at the algorithm, there is no particular 'partiality' given to any unit. What I mean by partiality there is that you have no particular characteristic associated with any unit, and this results in…
5
votes
1 answer

Is Christopher Bishop's "Pattern Recognition and Machine Learning" out of date in 2018?

I recently came across a reference to a book that was highly regarded: "Pattern Recognition and Machine Learning" by Christopher Bishop. I am a beginner working my way through some machine learning courses on my own. I'm curious if this book is…