Most Popular
1500 questions
7
votes
1 answer
Why is the target $r + \gamma \max_{a'} Q(s', a'; \theta_i^-)$ in the loss function of the DQN architecture?
In the paper Human-level control through deep reinforcement learning, the DQN architecture is presented, where the loss function is as follows
$$
L_i(\theta_i) = \mathbb{E}_{(s, a, r, s') \sim U(D)} \left[ \left( r + \gamma \max_{a'} Q(s', a';…
echo
7
votes
1 answer
Why does 'loss' change depending on the number of epochs chosen?
I am using Keras to train different NNs. I would like to know why, if I increase the number of epochs by 1, the results up to the new epoch are not the same. I am using shuffle=False and np.random.seed(2017), and I have checked that if I repeat with the same…
Pablo Ruiz Ruiz
7
votes
2 answers
How to combine backpropagation in neural nets and reinforcement learning?
I have followed a course on machine learning, where we learned about the gradient descent (GD) and back-propagation (BP) algorithms, which can be used to update the weights of neural networks, and reinforcement learning, in particular, Q-learning. I…
Yadeses
7
votes
2 answers
Effective algorithms for OCR
I am using Google's OCR to extract text from images, like receipts and invoices.
What are examples of techniques used to make sense of the text? For example, I would like to extract the date, name of the business, address, total amount, etc.
Before…
Abhay Naik
7
votes
1 answer
How to turn a ternary constraint into three binary constraints?
I'm trying to solve problem 6.6 from the book Artificial Intelligence: A Modern Approach, by Peter Norvig and Stuart Russell.
This is in the context of constraint satisfaction problems and how you can re-formulate some problems with the constraints…
Cristóbal Alcázar
7
votes
4 answers
Why is my test error lower than the training error?
I am trying to train a CNN regression model using the ADAM optimizer, dropout and weight decay.
My test accuracy is better than training accuracy. But, as far as I know, usually, the training accuracy is better than test accuracy.
So I wonder how…
이희준
7
votes
3 answers
Are there strictly deterministic LLMs?
LLMs are understood to generate non-deterministic outputs.
Are there LLMs out there that are capable of producing deterministic outputs for any given input, given fixed parameters (e.g., temperature)?
I heard that llama.cpp - if run on a CPU…
user599464
7
votes
2 answers
Does data skew matter in classification problem?
I'm working on an image classification problem using a neural network. In the training data set, 90% of the samples fall into 10% of all categories, while 10% of the samples fall into the other 90% of categories. So an example is not evenly distributed…
James Gan
7
votes
4 answers
What is the purpose of the hidden layers?
Why would anybody want to use "hidden layers"? How do they enhance the learning ability of the network in comparison to a network that doesn't have them (linear models)?
kenorb
7
votes
4 answers
What does "e" do in the Sigmoid Activation Function?
Within the Sigmoid Squishification function,
f(x) = 1/(1 + e^(-x))
"e" is unnecessary, as it can be replaced by any other value that is not 0 or 1. Why is "e" used here?
As shown below, the function is working well without that, and in replacement,…
Jake
7
votes
2 answers
How to determine if an Amazon review is likely to be fake using text classification
I'm currently in the research stage of building a web app in ASP.NET where the user can input a URL to an Amazon product, then the app would determine how likely its reviews are to be genuine. I need help figuring out what algorithm to use in…
user8709398
7
votes
2 answers
In this implementation of the Information Set Monte Carlo Tree Search, why can't the players see the cards of each other?
After reading this paper about Monte Carlo methods for imperfect information games with elements of uncertainty, I couldn't understand the application of the determinization step in the author's implementation of the algorithm for the Knockout…
tamirok
7
votes
4 answers
What are the reasons to believe AGI will not be dangerous?
We are in the middle of an ongoing debate about the safety of AGI and our current approach towards this technology. As a summary, some quotes from a recent article from Time magazine:
Many researchers[...] expect that the most likely result of…
Martin
7
votes
5 answers
How is GPT 4 able to solve math?
How can GPT 4 solve complex calculus and other math problems? I believe these problems require analytical reasoning and the ability to compute numbers. Does it still use an LLM to complete this process, or does it add on to this?
Here is the link to the…
desert_ranger
7
votes
2 answers
Is there a term to denote AI algorithms that are not trained?
Before the advent of neural architectures, many AI domains (e.g. speech recognition and computer vision) used algorithms that consisted of a series of hand-crafted transformations for feature extraction.
In speech recognition everything to do with…
Mew