Highest Voted Questions - Artificial Intelligence Stack Exchange

7

votes

1 answer

Why is the target $r + \gamma \max_{a'} Q(s', a'; \theta_i^-)$ in the loss function of the DQN architecture?

In the paper Human-level control through deep reinforcement learning, the DQN architecture is presented, where the loss function is as follows $$ L_i(\theta_i) = \mathbb{E}_{(s, a, r, s') \sim U(D)} \left[ \left( r + \gamma \max_{a'} Q(s', a';…
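As a rough illustration of the target named in the question title, here is a minimal sketch in plain Python. The function names `dqn_target` and `squared_td_error` are hypothetical, and the `done` flag for terminal transitions is an assumption that does not appear in the excerpt. The list `next_q_values` stands in for the target network's outputs $Q(s', a'; \theta_i^-)$ over all actions $a'$.

```python
from typing import Sequence


def dqn_target(reward: float,
               next_q_values: Sequence[float],
               gamma: float,
               done: bool = False) -> float:
    """Return r + gamma * max_a' Q(s', a'; theta^-).

    next_q_values: target-network Q-values for every action in s'.
    done: assumption, not in the excerpt; if True, no bootstrapping.
    """
    if done:
        return reward
    return reward + gamma * max(next_q_values)


def squared_td_error(q_sa: float, target: float) -> float:
    """Squared difference between the target and the online estimate Q(s, a; theta)."""
    return (target - q_sa) ** 2


# One sampled transition (s, a, r, s') with made-up numbers.
y = dqn_target(reward=1.0, next_q_values=[0.5, 2.0, 1.0], gamma=0.9)
loss = squared_td_error(q_sa=2.0, target=y)
```

In a real agent the loss would be averaged over a minibatch sampled from the replay memory, and the target would be treated as a constant when differentiating with respect to the online parameters.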

deep-learning reinforcement-learning q-learning

asked Dec 13 '17 at 19:25

echo

713
1
6
12

7

votes

1 answer

Why does 'loss' change depending on the number of epochs chosen?

I am using Keras to train different NNs. I would like to know why, if I increment the number of epochs by 1, the result up to the new epoch is not the same. I am using shuffle=False and np.random.seed(2017), and I have checked that if I repeat with the same…

neural-networks training optimization keras

asked Dec 07 '17 at 14:32

Pablo Ruiz Ruiz

179
3

7

votes

2 answers

How to combine backpropagation in neural nets and reinforcement learning?

I have followed a course on machine learning, where we learned about the gradient descent (GD) and back-propagation (BP) algorithms, which can be used to update the weights of neural networks, and reinforcement learning, in particular, Q-learning. I…

neural-networks reinforcement-learning dqn deep-rl backpropagation

asked Dec 04 '17 at 23:12

Yadeses

231
2
5

7

votes

2 answers

Effective algorithms for OCR

I am using Google's OCR to extract text from images, like receipts and invoices. What are examples of techniques used to make sense of the text? For example, I would like to extract the date, name of the business, address, total amount, etc. Before…

machine-learning reference-request optical-character-recognition

asked Nov 21 '17 at 21:34

Abhay Naik

179
2

7

votes

1 answer

How to turn a ternary constraint into three binary constraints?

I'm trying to solve problem 6.6 from the book Artificial Intelligence: A Modern Approach, by Peter Norvig and Stuart Russell. This is in the context of constraint satisfaction problems and how you can re-formulate some problems with the constraints…

ai-design constraint-satisfaction-problems norvig-russell

asked Oct 31 '17 at 01:26

Cristóbal Alcázar

183
1
8

7

votes

4 answers

Why is my test error lower than the training error?

I am trying to train a CNN regression model using the ADAM optimizer, dropout and weight decay. My test accuracy is better than training accuracy. But, as far as I know, usually, the training accuracy is better than test accuracy. So I wonder how…

convolutional-neural-networks training accuracy dropout testing

asked Oct 29 '17 at 10:54

이희준

73
1
4

7

votes

3 answers

Are there strictly deterministic LLMs?

LLMs are understood to generate non-deterministic outputs. Are there LLMs out there that are capable of producing deterministic outputs for any given input given fixed parameters (e.g., temperature)? I heard that llama.cpp - if run on a CPU…

natural-language-processing large-language-models model-request

asked Dec 06 '23 at 15:34

user599464

171
1
3

7

votes

2 answers

Does data skew matter in classification problem?

I'm working on an image classification problem using a neural network. In the training data set, 90% of the samples fall into 10% of all categories, while 10% of the samples fall into the other 90% of categories. So the examples are not evenly distributed…

neural-networks image-recognition data-preprocessing imbalanced-datasets

asked Oct 16 '17 at 20:28

James Gan

173
1
6

7

votes

4 answers

What is the purpose of the hidden layers?

Why would anybody want to use "hidden layers"? How do they enhance the learning ability of the network in comparison to the network which doesn't have them (linear models)?
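One way to see the difference is XOR, which no linear model can represent but a single hidden layer can. The sketch below uses hand-picked weights (an assumption for illustration, not learned values) and a ReLU activation; the name `xor_net` is hypothetical.

```python
def relu(z: float) -> float:
    """Rectified linear unit: max(0, z)."""
    return max(0.0, z)


def xor_net(x1: float, x2: float) -> float:
    """Tiny network with one hidden layer of two ReLU units.

    Weights are chosen by hand so that the output equals XOR(x1, x2)
    on binary inputs.
    """
    h1 = relu(x1 + x2)        # counts how many inputs are on
    h2 = relu(x1 + x2 - 1.0)  # fires only when both inputs are on
    return h1 - 2.0 * h2      # cancels the (1, 1) case


outputs = [xor_net(a, b) for a in (0.0, 1.0) for b in (0.0, 1.0)]
```

Removing the `relu` calls turns the whole network into a single linear function of `x1` and `x2`, which is why the nonlinearity in the hidden layer matters.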

neural-networks deep-learning deep-neural-networks hidden-layers

asked Aug 02 '16 at 16:09

kenorb

10,525
6
45
95

7

votes

4 answers

What does "e" do in the Sigmoid Activation Function?

Within the Sigmoid Squishification function, f(x) = 1/(1 + e^(-x)), "e" is unnecessary, as it can be replaced by any other value that is not 0 or 1. Why is "e" used here? As shown below, the function is working well without that, and in replacement,…
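A small numerical check of the claim, assuming a positive base b other than 1: since b^(-x) = e^(-x ln b), swapping the base only rescales the input by ln b. The function names below are hypothetical.

```python
import math


def sigmoid(x: float) -> float:
    """Standard logistic sigmoid with base e."""
    return 1.0 / (1.0 + math.exp(-x))


def sigmoid_base(x: float, base: float) -> float:
    """Sigmoid-shaped function with an arbitrary positive base (assumed != 1)."""
    return 1.0 / (1.0 + base ** (-x))


# Largest gap between the base-b curve and the base-e curve evaluated at x * ln(b).
max_gap = max(
    abs(sigmoid_base(x, b) - sigmoid(x * math.log(b)))
    for b in (2.0, 10.0)
    for x in (-2.0, 0.0, 0.5, 3.0)
)
```

The gap is zero up to floating-point rounding, so a different base gives the same family of curves with a different horizontal scale, which a learned weight on x can absorb.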

sigmoid

asked Aug 13 '23 at 17:20

Jake

181
4

7

votes

2 answers

How to determine if an Amazon review is likely to be fake using text classification

I'm currently in the research stage of building a web app in ASP.NET where the user can input a URL to an Amazon product, then the app would determine how likely its reviews are to be genuine. I need help figuring out what algorithm to use in…

machine-learning natural-language-processing algorithm classification

asked Oct 02 '17 at 16:51

user8709398

71
2

7

votes

2 answers

In this implementation of the Information Set Monte Carlo Tree Search, why can't the players see the cards of each other?

After reading this paper about Monte Carlo methods for imperfect information games with elements of uncertainty, I couldn't understand the application of the determinization step in the author's implementation of the algorithm for the Knockout…

monte-carlo-tree-search combinatorial-games imperfect-information

asked Sep 23 '17 at 16:32

tamirok

71
1

7

votes

4 answers

What are the reasons to believe AGI will not be dangerous?

We are in the middle of an ongoing debate about the safety of AGI and our current approach towards this technology. As a summary, some quotes from a recent article from Time magazine: Many researchers[...] expect that the most likely result of…

agi superintelligence ai-safety value-alignment

asked Apr 03 '23 at 10:36

Martin

178
5

7

votes

5 answers

How is GPT 4 able to solve math?

How can GPT 4 solve complex calculus and other math problems? I believe these problems require analytical reasoning and the ability to compute numbers. Does it still use an LLM to complete this process, or does it add on to this? Here is the link to the…

chatgpt gpt gpt-4

asked Mar 22 '23 at 23:49

desert_ranger

672
1
6
21

7

votes

2 answers

Is there a term to denote AI algorithms that are not trained?

Before the advent of neural architectures, many AI domains (e.g. speech recognition and computer vision) used algorithms that consisted of a series of hand-crafted transformations for feature extraction. In speech recognition everything to do with…

computer-vision terminology algorithm speech-recognition

asked Mar 22 '23 at 11:09

Mew

216
1
4
