Most Popular
1500 questions
10
votes
2 answers
How do evolutionary algorithms have advantages over the conventional backpropagation methods?
How does employing evolutionary algorithms to design and train artificial neural networks have advantages over using the conventional backpropagation algorithms?
kenorb
- 10,525
- 6
- 45
- 95
10
votes
2 answers
Can autoencoders be used for supervised learning?
Can autoencoders be used for supervised learning without adding an output layer? Can we simply feed it with a concatenated input-output vector for training, and reconstruct the output part from the input part when doing inference? The output part…
rcpinto
- 2,148
- 1
- 17
- 31
10
votes
1 answer
How can the emotional quotient of an AI program be measured?
Can an AI program have an EQ (emotional intelligence or emotional quotient)?
In other words, can the EQ of an AI program be measured?
If EQ is more problematic to measure than IQ (at least with a standard applicable to both humans and AI programs),…
1010101 Lifestyle
- 203
- 1
- 4
10
votes
3 answers
Are there any textual CAPTCHA challenges which can fool AI, but not human?
Are there any modern techniques of generating textual CAPTCHA (so person needs to type the right text) challenges which can easily fool AI with some visual obfuscation methods, but at the same time human can solve them without any struggle?
For…
kenorb
- 10,525
- 6
- 45
- 95
10
votes
2 answers
Is neural networks training done one-by-one?
I'm trying to learn neural networks by watching this series of videos and implementing a simple neural network in Python.
Here's one of the things I'm wondering about: I'm training the neural network on sample data, and I've got 1,000 samples. The…
Ram Rachum
- 260
- 1
- 11
10
votes
1 answer
A mathematical explanation of Attention Mechanism
I am trying to understand why attention models are different than just using neural networks. Essentially the optimization of weights or using gates for protecting and controlling cell state (in recurrent networks), should eventually lead to the…
PixelPioneer
- 203
- 2
- 5
10
votes
1 answer
How to deal with a small amount of labeled samples?
I'm trying to develop skills to deal with very small amounts of labeled samples (250 labeled/20000 total, 200 features) by practicing on Kaggle "Don't Overfit" dataset (Traget_Practice have provided all 20,000 Targets). I've read a ton of papers and…
FirePower
- 201
- 1
- 3
10
votes
2 answers
What is the difference between search and learning?
I came across an article, The Bitter Truth, via the Two Minute Papers YouTube Channel. Rich Sutton says...
One thing that should be learned from the bitter lesson is the great power of general purpose methods, of methods that continue to scale with…
nvi
- 103
- 5
10
votes
1 answer
Is back-propagation applied for each data point or for a batch of data points?
I am new to deep learning and trying to understand the concept of back-propagation. I have a doubt about when the back-propagation is applied. Assume that I have a training data set of 1000 images for handwritten letters,
Is back-propagation…
Maanu
- 245
- 1
- 2
- 7
10
votes
1 answer
Can Q-learning be used in a POMDP?
Can Q-learning (and SARSA) be directly used in a Partially Observable Markov Decision Process (POMDP)? If not, why not? My intuition is that the policies learned will be terrible because of partial observability. Are there ways to transform these…
drerD
- 298
- 2
- 6
10
votes
1 answer
Are decision tree learning algorithms deterministic?
Are decision tree learning algorithms deterministic? Given a fixed dataset, do they always produce a tree with the same structure?
What about the random forest?
mallea
- 203
- 2
- 5
10
votes
2 answers
What's the difference between hyperbolic tangent and sigmoid neurons?
Two common activation functions used in deep learning are the hyperbolic tangent function and the sigmoid activation function. I understand that the hyperbolic tangent is just a rescaling and translation of the sigmoid function:
$\tanh(z) =…
bpachev
- 410
- 4
- 11
10
votes
1 answer
How do I use GPT-2 to summarise text?
In section 3.6 of the OpenAI GPT-2 paper it mentions summarising text based relates to this, but the method is described in very high-level terms:
To induce summarization behavior we add the text TL;DR: after the article and generate 100 tokens…
Tom Hale
- 384
- 3
- 13
10
votes
4 answers
How is it that AI can become biased, and what are the proposals to mitigate this?
This is not meant to be negative or a joke but rather looking for a productive solution on AI development, engineering and its impact on human life:
Lately with my Google searches, the AI model keeps auto filling the ending of my searches…
P.S.
- 203
- 1
- 6
10
votes
1 answer
What is the difference between an episode, a trajectory and a rollout?
I often see the terms episode, trajectory, and rollout to refer to basically the same thing, a list of (state, action, rewards). Are there any concrete differences between the terms or can they be used interchangeably?
In the following paragraphs,…
Paula Vega
- 438
- 4
- 9