Most Popular
1500 questions
13
votes
2 answers
Input/output encoding for a neural network to learn a grid-based game
I am writing a simple toy game with the intent of training a deep neural network on top of it. The games rules are roughly the following:
The game has a board made up of hexagonal cells.
Both players have the same collection of pieces that they can…
Totem
- 381
- 2
- 6
13
votes
2 answers
Is there a fundamental difference between an environment being stochastic and being partially observable?
In AI literature, deterministic vs stochastic and being fully-observable vs partially observable are usually considered two distinct properties of the environment.
I'm confused about this because what appears random can be described by hidden…
martinkunev
- 255
- 1
- 8
13
votes
6 answers
What are good alternatives to the expression "Artificial Intelligence"?
I read a really interesting article titled "Stop Calling it Artificial Intelligence" that made a compelling critique of the name "Artificial Intelligence".
The word intelligence is so broad that it's hard to say whether "Artificial Intelligence" is…
user6698
13
votes
1 answer
When should you not use the bias in a layer?
I'm not really that experienced with deep learning, and I've been looking at research code (mostly PyTorch) for deep neural networks, specifically GANs, and, in many cases, I see the authors setting bias=False in some layers without much…
Nikos Tsakas
- 143
- 1
- 8
13
votes
2 answers
Are the shortcomings of neural networks diminishing?
Having worked with neural networks for about half a year, I have experienced first-hand what are often claimed as their main disadvantages, i.e. overfitting and getting stuck in local minima. However, through hyperparameter optimization and some…
user4747
13
votes
1 answer
What are the fundamental differences between VAE and GAN for image generation?
Starting from my own understanding, and scoped to the purpose of image generation, I'm well aware of the major architectural differences:
A GAN's generator samples from a relatively low dimensional random variable and produces an image. Then the…
Alexander Soare
- 1,379
- 3
- 12
- 28
13
votes
2 answers
What is different in each head of a multi-head attention mechanism?
I have a difficult time understanding the "multi-head" notion in the original transformer paper. What makes the learning in each head unique? Why doesn't the neural network learn the same set of parameters for each attention head? Is it because we…
mhsnk
- 253
- 2
- 5
13
votes
3 answers
Why is the reward in reinforcement learning always a scalar?
I'm reading Reinforcement Learning by Sutton & Barto, and in section 3.2 they state that the reward in a Markov decision process is always a scalar real number. At the same time, I've heard about the problem of assigning credit to an action for a…
user40138
13
votes
2 answers
How are generative adversarial networks trained?
I am reading about generative adversarial networks (GANs) and I have some doubts regarding it. So far, I understand that in a GAN there are two different types of neural networks: one is generative ($G$) and the other discriminative ($D$). The…
Eka
- 1,106
- 8
- 24
13
votes
1 answer
What is the difference between one-shot learning, transfer learning and fine tuning?
Lately, there are lots of posts on one-shot learning. I tried to figure out what it is by reading some articles. To me, it looks like similar to transfer learning, in which we can use pre-trained model weights to create our own model. Fine-tuning…
Hiren Namera
- 785
- 6
- 20
13
votes
1 answer
How would DeepMind's new differentiable neural computer scale?
DeepMind just published a paper about a differentiable neural computer, which basically combines a neural network with a memory.
The idea is to teach the neural network to create and recall useful explicit memories for a certain task. This…
BlindKungFuMaster
- 4,265
- 13
- 23
13
votes
1 answer
What is the intuition behind the attention mechanism?
Attention idea is one of the most influential ideas in deep learning. The main idea behind attention technique is that it allows the decoder to "look back” at the complete input and extracts significant information that is useful in decoding.
I am…
Pluviophile
- 1,293
- 7
- 20
- 40
13
votes
13 answers
Is AI living or non-living?
I'm a bit confused about the definition of life. Can AI systems be called 'living'? Because they can do most of the things that we can. They can even communicate with one another.
They are not formed of what we call cells. But, you see, cells are…
Soham
- 399
- 1
- 2
- 11
13
votes
4 answers
Could an AI think laterally while avoiding "ethically suboptimal" choices?
In the recent PC game The Turing Test, the AI ("TOM") needs help from Ava to get through some puzzle rooms. TOM says he is unable to solve the puzzles because he is not allowed to "think laterally." Specifically, he says he would not have thought to…
Ben N
- 2,589
- 2
- 21
- 35
13
votes
1 answer
Can an AI make a constructed (natural) language?
According to Wikipedia
An artificial or constructed language (sometimes called a conlang) is a language that has been created by a person or small group, instead of being formed naturally as part of a culture.
My question is, could an AI…
Kevin
- 233
- 1
- 6