Most Popular
1500 questions
11
votes
1 answer
Are there neural networks with very few nodes that decently solve non-trivial problems?
I'm interested in knowing whether there exist any neural network, that solves (with >=80% accuracy) any nontrivial problem, that uses very few nodes (where 20 nodes is not a hard limit). I want to develop an intuition on sizes of neural networks.
Guillermo Mosse
- 327
- 1
- 10
11
votes
3 answers
Should I choose a model with the smallest loss or highest accuracy?
I have two Machine Learning models (I use LSTM) that have a different result on the validation set (~100 samples data):
Model A: Accuracy: ~91%, Loss: ~0.01
Model B: Accuracy: ~83%, Loss: ~0.003
The size and the speed of both models are almost the…
malioboro
- 2,859
- 3
- 23
- 47
11
votes
2 answers
What are the segment embeddings and position embeddings in BERT?
They only reference in the paper that the position embeddings are learned, which is different from what was done in ELMo.
ELMo paper - https://arxiv.org/pdf/1802.05365.pdf
BERT paper - https://arxiv.org/pdf/1810.04805.pdf
Skinish
- 163
- 1
- 1
- 9
10
votes
2 answers
Can machine learning algorithms be used to differentiate between small differences in details between images?
I was wondering if machine learning algorithms (CNNs?) can be used/trained to differentiate between small differences in details between images (such as slight differences in shades of red or other colours, or the presence of small objects between…
The Pointer
- 611
- 5
- 22
10
votes
1 answer
Is AlphaZero an example of an AGI?
From DeepMind's research paper on arxiv.org:
In this paper, we apply a similar but fully generic algorithm, which
we call AlphaZero, to the games of chess and shogi as well as Go,
without any additional domain knowledge except the rules of the…
Siddhartha
- 413
- 2
- 11
10
votes
2 answers
What are the limitations of the hill climbing algorithm and how to overcome them?
What are the limitations of the hill climbing algorithm? How can we overcome these limitations?
Abbas Ali
- 576
- 4
- 10
- 17
10
votes
2 answers
What is the difference between goal-based and utility-based agents?
What is the difference between goal-based and utility-based agents? Please, provide a real-world example.
RashkRizwan
- 145
- 1
- 2
- 8
10
votes
4 answers
Has the spontaneous emergence of replicators been modeled in Artificial Life?
One of the cornerstones of The Selfish Gene (Dawkins) is the spontaneous emergence of replicators, i.e. molecules capable of replicating themselves.
Has this been modeled in silico in open-ended evolutionary/artificial life simulations?
Systems like…
sihubumi
- 101
- 3
10
votes
2 answers
How to handle rectangular images in convolutional neural networks?
Almost all the convolutional neural network architecture I have come across have a square input size of an image, like $32 \times 32$, $64 \times 64$ or $128 \times 128$. Ideally, we might not have a square image for all kinds of scenarios. For…
Santhosh Dhaipule Chandrakanth
- 257
- 1
- 2
- 7
10
votes
1 answer
Loss jumps abruptly when I decay the learning rate with Adam optimizer in PyTorch
I'm training an auto-encoder network with Adam optimizer (with amsgrad=True) and MSE loss for Single channel Audio Source Separation task. Whenever I decay the learning rate by a factor, the network loss jumps abruptly and then decreases until the…
imflash217
- 499
- 5
- 15
10
votes
2 answers
Are there dynamic neural networks?
Are there neural networks that can decide to add/delete neurons (or change the neuron models/activation functions or change the assigned meaning for neurons), links or even complete layers during execution time?
I guess that such neural networks…
TomR
- 903
- 6
- 18
10
votes
2 answers
What benefits can be got by applying Graph Convolutional Neural Network instead of ordinary CNN?
What benefits can we got by applying Graph Convolutional Neural Network instead of ordinary CNN? I mean if we can solve a problem by CNN, what is the reason should we convert to Graph Convolutional Neural Network to solve it? Are there any examples…
piratesailor
- 245
- 2
- 8
10
votes
2 answers
Is Nassim Taleb right about AI not being able to accurately predict certain types of distributions?
So Taleb has two heuristics to generally describe data distributions. One is Mediocristan, which basically means things that are on a Gaussian distribution such as height and/or weight of people.
The other is called Extremistan, which describes a…
Josiah Swaim
- 171
- 1
- 6
10
votes
5 answers
How do I keep track of already visited states in breadth-first search?
I was trying to implement the breadth-first search (BFS) algorithm for the sliding blocks puzzle (number type). Now, the main thing I noticed is that, if you have a $4 \times 4$ board, the number of states can be as large as $16!$, so I cannot…
user9947
10
votes
2 answers
Do self-driving cars resort to randomness to make decisions?
I recently heard someone make a statement that when you're designing a self-driving car, you're not building a car but really a computerized driver, so you're trying to model a human mind -- at least the part of the human mind that can drive.
Since…
guillaume31
- 203
- 1
- 8