Highest Voted Questions - Artificial Intelligence Stack Exchange

5

votes

1 answer

Do we use validation and test sets for training a reinforcement learning agent?

I am pretty new to reinforcement learning and was working with some code for the PPO and DQN algorithms. After looking at the code, I noticed that the authors did not include any code to setup a validation or testing dataloader. In most other…

reinforcement-learning training dqn proximal-policy-optimization

asked Nov 22 '21 at 15:24

krishnab

207
2
8

5

votes

1 answer

How can I estimate how many photos I need to train ResNet-50 for image classification?

I am working on a project where I have to classify around 1000 unique objects. I'm trying to plan how much training data I will need to collect. I was planning on using ResNet-50. Is there anyway I can estimate the amount of photos I should plan to…

computer-vision computational-learning-theory training-datasets sample-complexity

asked Nov 16 '21 at 14:56

Tyler Hilbert

157
7

5

votes

2 answers

Why does the activation function for a hidden layer in a MLP have to be non-polynomial?

Across multiple pieces of literature describing MLPs or while describing the universal approximation theorem, the statement is very specific on the activation function being non-polynomial. Is there a reason why it cannot be a higher-order…

neural-networks activation-functions universal-approximation-theorems

asked Nov 15 '21 at 11:53

niil87

53
3

5

votes

1 answer

In TD(0) with linear function approximation, why is the gradient of $\hat v(S^{\prime}, \mathbf w)$ wrt parameters $\mathbf w$ not considered?

I am reading these slides. On page 38, the update for the parameters for the linear function approximation of TD(0) is given. I have a doubt regarding this. The cost function (RMSE) is given on page 37. My doubt is: why is the gradient of $\hat…

reinforcement-learning value-functions function-approximation weights temporal-difference-methods

asked Nov 14 '21 at 10:17

A Yoghes

53
4

5

votes

3 answers

Use of machine learning for analyzing companies enlisted in stock market

Can current trends and tools, in the field of machine learning, replicate the complexity of financial market? If yes, then what are the tools available in this domain. Q. I am trying to build a model to infer results from stock market using the…

neural-networks machine-learning deep-learning algorithm

asked Apr 28 '17 at 11:42

parth

161
5

5

votes

1 answer

Why isn't a target network used for the critic in on-policy actor-critic methods?

Based on my research, I've seen so many on-policy AC approaches that utilise a critic network to estimate the value function $V$. The Bellman equation for the value function is as bellow: $$ V_\pi(s_t) = \sum_a \pi(a|s_t)\sum_{r,…

reinforcement-learning deep-rl dqn actor-critic-methods target-network

asked Nov 07 '21 at 16:12

Green Falcon

170
2
10

5

votes

1 answer

Do Vision Transformers handle arbitrary sequence lengths the same way as normal Transformers?

Does ViT do handle arbitrary sequence lengths using masking the same way the normal Transformer does? The ViT paper doesn't mention anything about it, so I assume it uses masking like the normal Transformer.

neural-networks computer-vision transformer vision-transformer

asked Nov 03 '21 at 21:10

Dean R

53
4

5

votes

1 answer

Image comparison algorithm, trying to figure out how similar two "binary" forms are

I'm a student I'm completely new to this technology maybe my approach could be completely wrong, I want to create an algorithm that compares the similarity between two binarized images. I'll explain: I have 2 pictures as input. The RGB colors of…

algorithm image-recognition computer-vision

asked Apr 26 '17 at 16:21

user6867

51
1

5

votes

3 answers

What is the Intermediate (dense) layer in between attention-output and encoder-output dense layers within transformer block in PyTorch implementation?

In PyTorch, transformer (BERT) models have an intermediate dense layer in between attention and output layers whereas the BERT and Transformer papers just mention the attention connected directly to output fully connected layer for the encoder just…

natural-language-processing pytorch transformer bert

asked Oct 25 '21 at 20:05

mohammad ali Humayun

51
3

5

votes

2 answers

Do we need automatic hyper-parameter tuning when we have a large enough dataset?

Hyperparameter tuning is the process of selecting the optimal hyperparameters for an ANN. Now, my guess is that, if we have sufficient data (say, 1.4 million for, say, 6 features), the model can be optimally trained and we don't need a…

neural-networks hyperparameter-optimization training-datasets

asked Oct 17 '21 at 18:10

user366312

341
1
13

5

votes

3 answers

Is it possible to tell the Reinforcement Learning agent some rules directly without any constraints?

I try to apply RL for a control problem, and I intend to either use Deep Q-Learning or SARSA. I have two heating storage systems with one heating device, and the RL agent is only allowed to heat up 1 for every time slot. How can I do that? I have…

reinforcement-learning

asked Oct 13 '21 at 11:59

PeterBe

276
3
14

5

votes

1 answer

How to decide a train-test split?

In almost every ML model, a train-test (or train-test-val split) is critical to assess the model's performance. However, I have always wondered what the rationale is to decide a particular train-test split. I've seen that some people like an 80-20…

machine-learning training cross-validation testing

asked Oct 12 '21 at 14:13

user48670

5

votes

1 answer

Why is the change in cost wrt bias in neural network equal to error in the neuron?

While reading the book on neural networks by Michael Nielson, I had a problem understanding equation (BP3), which is $$ \frac{\partial C}{\partial b_{j}^{l}}=\delta_{j}^{l} \tag{BP3}\label{BP3}, $$ which can be translated to plain English as…

neural-networks deep-learning backpropagation math calculus

asked Apr 21 '17 at 10:51

Madhusoodan P

151
1
4

5

votes

2 answers

Should I repeat lengthy deep learning experiments to average results ? How to decide how many times to repeat?

I am doing my MSc thesis on deep learning. My model takes many hours to train. Part of what I do is trying different parameters and settings hoping that they will achieve different results. But I often notice that the result differences are too…

deep-learning ai-design research

asked Sep 18 '21 at 18:25

Manveru

221
1
4

5

votes

2 answers

What is the difference between edge computing and federated learning?

I recently read about federated learning introduced by Google, but it seems to be like edge computing. What is the difference between edge computing and federated learning?

machine-learning comparison federated-learning

asked Apr 17 '17 at 10:06

Karan Nadagoudar

285
1
5

Most Popular