Most Popular

1500 questions
5
votes
1 answer

Do we use validation and test sets for training a reinforcement learning agent?

I am pretty new to reinforcement learning and was working with some code for the PPO and DQN algorithms. After looking at the code, I noticed that the authors did not include any code to setup a validation or testing dataloader. In most other…
5
votes
1 answer

How can I estimate how many photos I need to train ResNet-50 for image classification?

I am working on a project where I have to classify around 1000 unique objects. I'm trying to plan how much training data I will need to collect. I was planning on using ResNet-50. Is there anyway I can estimate the amount of photos I should plan to…
5
votes
2 answers

Why does the activation function for a hidden layer in a MLP have to be non-polynomial?

Across multiple pieces of literature describing MLPs or while describing the universal approximation theorem, the statement is very specific on the activation function being non-polynomial. Is there a reason why it cannot be a higher-order…
5
votes
1 answer

In TD(0) with linear function approximation, why is the gradient of $\hat v(S^{\prime}, \mathbf w)$ wrt parameters $\mathbf w$ not considered?

I am reading these slides. On page 38, the update for the parameters for the linear function approximation of TD(0) is given. I have a doubt regarding this. The cost function (RMSE) is given on page 37. My doubt is: why is the gradient of $\hat…
5
votes
3 answers

Use of machine learning for analyzing companies enlisted in stock market

Can current trends and tools, in the field of machine learning, replicate the complexity of financial market? If yes, then what are the tools available in this domain. Q. I am trying to build a model to infer results from stock market using the…
5
votes
1 answer

Why isn't a target network used for the critic in on-policy actor-critic methods?

Based on my research, I've seen so many on-policy AC approaches that utilise a critic network to estimate the value function $V$. The Bellman equation for the value function is as bellow: $$ V_\pi(s_t) = \sum_a \pi(a|s_t)\sum_{r,…
5
votes
1 answer

Do Vision Transformers handle arbitrary sequence lengths the same way as normal Transformers?

Does ViT do handle arbitrary sequence lengths using masking the same way the normal Transformer does? The ViT paper doesn't mention anything about it, so I assume it uses masking like the normal Transformer.
5
votes
1 answer

Image comparison algorithm, trying to figure out how similar two "binary" forms are

I'm a student I'm completely new to this technology maybe my approach could be completely wrong, I want to create an algorithm that compares the similarity between two binarized images. I'll explain: I have 2 pictures as input. The RGB colors of…
user6867
  • 51
  • 1
5
votes
3 answers

What is the Intermediate (dense) layer in between attention-output and encoder-output dense layers within transformer block in PyTorch implementation?

In PyTorch, transformer (BERT) models have an intermediate dense layer in between attention and output layers whereas the BERT and Transformer papers just mention the attention connected directly to output fully connected layer for the encoder just…
5
votes
2 answers

Do we need automatic hyper-parameter tuning when we have a large enough dataset?

Hyperparameter tuning is the process of selecting the optimal hyperparameters for an ANN. Now, my guess is that, if we have sufficient data (say, 1.4 million for, say, 6 features), the model can be optimally trained and we don't need a…
5
votes
3 answers

Is it possible to tell the Reinforcement Learning agent some rules directly without any constraints?

I try to apply RL for a control problem, and I intend to either use Deep Q-Learning or SARSA. I have two heating storage systems with one heating device, and the RL agent is only allowed to heat up 1 for every time slot. How can I do that? I have…
PeterBe
  • 276
  • 3
  • 14
5
votes
1 answer

How to decide a train-test split?

In almost every ML model, a train-test (or train-test-val split) is critical to assess the model's performance. However, I have always wondered what the rationale is to decide a particular train-test split. I've seen that some people like an 80-20…
user48670
5
votes
1 answer

Why is the change in cost wrt bias in neural network equal to error in the neuron?

While reading the book on neural networks by Michael Nielson, I had a problem understanding equation (BP3), which is $$ \frac{\partial C}{\partial b_{j}^{l}}=\delta_{j}^{l} \tag{BP3}\label{BP3}, $$ which can be translated to plain English as…
5
votes
2 answers

Should I repeat lengthy deep learning experiments to average results ? How to decide how many times to repeat?

I am doing my MSc thesis on deep learning. My model takes many hours to train. Part of what I do is trying different parameters and settings hoping that they will achieve different results. But I often notice that the result differences are too…
Manveru
  • 221
  • 1
  • 4
5
votes
2 answers

What is the difference between edge computing and federated learning?

I recently read about federated learning introduced by Google, but it seems to be like edge computing. What is the difference between edge computing and federated learning?