Highest Voted 'mnist' Questions - Artificial Intelligence Stack Exchange

7

votes

2 answers

How can a neural network distinguish a rotated 6 and 9 digits?

Rotated MNIST is a popular dataset for benchmarking models equivariant to rotations on $\mathbb{R}^2$, described by the $SO(2)$ group or its discrete subgroups like $\mathbb{Z}_n$: Group equivariant convolutional networks; Harmonic networks. It…

asked Dec 09 '21 at 09:37

spiridon_the_sun_rotator

2,852
12
17

4

votes

1 answer

How can I make an MNIST digit recognizer that rejects out-of-distribution data?

I've made an MNIST digit recognition neural network. When you put images in that are completely unlike its training data, it still tries to classify them as digits. Sometimes it strongly classifies nonsense data as being a specific digit. I am…

probability-distribution mnist

asked Apr 24 '23 at 09:39

river

143
6

3

votes

2 answers

Implementing a GAN with control over the output class

I am trying to accomplish the reverse of the typical MNIST in machine learning using a GAN - instead of predicting a number from an image of a digit, I want to reconstruct an image of a digit from a number. The traditional GAN, however, isn't…

neural-networks generative-adversarial-networks mnist

asked Jul 31 '23 at 11:39

JS4137

143
4

3

votes

4 answers

How is MNIST only providing the training and the test sets? What about the validation?

I was taught that, usually, a dataset has to be divided into three parts: Training set - for learning purposes; Validation set - for picking the model which minimizes the loss on this set; Test set - for testing the performance of the model picked…

datasets training-datasets mnist validation-datasets

asked Oct 22 '22 at 08:45

tail

167
7
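
For the validation question above: since MNIST provides only training and test sets, one common approach is to hold out part of the training set yourself. A minimal sketch, assuming 60,000 training samples and a 50,000/10,000 split; `train_val_split` is a hypothetical helper that only produces index lists, and loading the images is left out.

```python
import random

def train_val_split(n_samples, n_val, seed=0):
    """Return (train_indices, val_indices) as disjoint, shuffled index lists.

    Hypothetical helper: n_val samples are held out for validation,
    the remaining n_samples - n_val are kept for training.
    """
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)  # fixed seed for a reproducible split
    return indices[n_val:], indices[:n_val]

# Assumed sizes: 60,000 MNIST training samples, 10,000 held out for validation.
train_idx, val_idx = train_val_split(60_000, 10_000)
```

The test set stays untouched until the model has been picked on the validation indices.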

3

votes

1 answer

Can neural networks learn noise?

I'm interested in the following graphs. A neural network was trained to recognise digits from the MNIST dataset and then the labels were randomly shuffled and the following behaviour was observed. How can this behaviour be explained? What explains…

neural-networks machine-learning overfitting mnist

asked Dec 01 '21 at 09:47

Featherball

131
3

2

votes

1 answer

Are the "artifacts" in select Keras MNIST training images really there or is my download corrupt?

I'm having fun with a ludicrously well known and used dataset: mnist. I am doing it with a huge and well known tool: keras. Please excuse the red dots, something else I was doing. I have otherwise not modified the image at all except via the…

datasets keras image-processing mnist

asked Nov 17 '22 at 18:32

EngrStudent

371
3
12

2

votes

2 answers

Why do we subtract logsumexp from the outputs of this neural network?

I'm trying to understand this tutorial for Jax. Here's an excerpt. It's for a neural net that is designed to classify MNIST images: from jax.scipy.special import logsumexp def relu(x): return jnp.maximum(0, x) def predict(params, image): #…

neural-networks machine-learning loss mnist

asked Jun 25 '22 at 02:06

Foobar

153
6
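
For the logsumexp question above: subtracting logsumexp from the raw outputs (logits) is one way to compute a log-softmax, so the results are log-probabilities whose exponentials sum to 1. A minimal sketch in plain Python rather than JAX; the helper names here are illustrative and are not taken from the tutorial.

```python
import math

def logsumexp(xs):
    """Numerically stable log(sum(exp(x))) for a list of floats."""
    m = max(xs)  # shift by the maximum so exp() cannot overflow
    return m + math.log(sum(math.exp(x - m) for x in xs))

def log_softmax(logits):
    """Subtract logsumexp from each logit to get log-probabilities."""
    lse = logsumexp(logits)
    return [x - lse for x in logits]

logits = [2.0, 1.0, 0.1]  # illustrative values, not real network outputs
log_probs = log_softmax(logits)
probs = [math.exp(lp) for lp in log_probs]  # these sum to 1
```

Working in log space this way avoids exponentiating large logits directly, which is the usual reason for preferring it over computing softmax and then taking the log.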

2

votes

0 answers

Statistical method for selecting features for classification

I'm working on a classifier for the famous MNIST handwritten data set. I want to create a few features on my own, and I want to be able to estimate which feature might perform better before actually training the classifier. Let's say that I create…

machine-learning data-preprocessing feature-extraction mnist

asked Nov 26 '20 at 23:43

IsolatedSushi

21
1

1

vote

3 answers

Can I use 4 neurons for the output layer to classify handwritten digits?

The "hello world" of ANNs usually uses the MNIST handwritten digit data. There are 10 classes, one for each handwritten digit from 0 to 9, so the output layer takes 10 neurons. If in the end there is only one active neuron in the output…

neural-networks mnist handwritten-characters

asked Feb 04 '23 at 18:41

Muhammad Ikhwan Perwira

800
3
10
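
For the 4-neuron question above: ten classes fit into four binary outputs because 2^4 = 16 >= 10. A minimal sketch of such an encoding, assuming each output is thresholded at 0.5; `digit_to_bits` and `bits_to_digit` are hypothetical helpers, and this says nothing about whether such a network trains as well as one with a 10-way output.

```python
def digit_to_bits(d):
    """Encode a digit 0-9 as a 4-element binary target, most significant bit first."""
    return [(d >> i) & 1 for i in (3, 2, 1, 0)]

def bits_to_digit(outputs, threshold=0.5):
    """Decode four output activations; return None for the unused codes 10-15."""
    value = 0
    for o in outputs:
        value = (value << 1) | (1 if o >= threshold else 0)
    return value if value <= 9 else None

targets = [digit_to_bits(d) for d in range(10)]
```

Six of the sixteen codes correspond to no digit, so the decoder has to handle outputs that land on them.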

1

vote

1 answer

Training and validation loss are almost the same (perfect fit?)

I am developing an ANN from scratch which classifies MNIST digits. These are the curves I get using only one hidden layer composed of 100 neurons activated by the ReLU function. The output neurons are activated by the softmax function: Is it correct…

neural-networks loss mnist

asked Nov 03 '22 at 10:20

tail

167
7

1

vote

1 answer

How can I use my neural network model, trained on the MNIST database, on "real world" digits such as my handwritten digits?

I have developed a feed-forward ANN from scratch trained (and evaluated) on the MNIST database, which contains 60,000 + 10,000 handwritten digit samples. Can I test my model on other digits, for example I write the digit 7 on paper with my pen and…

neural-networks mnist

asked Oct 24 '22 at 14:43

tail

167
7

1

vote

0 answers

Why does the VAE using a KL-divergence with a non-standard mean not produce good images?

I know I can make a VAE do generation with a mean of 0 and std-dev of 1. I tested it with the following loss function: def loss(self, data, reconst, mu, sig): rl = self.reconLoss(reconst, data) #dl = self.divergenceLoss(mu, sig) std =…

pytorch variational-autoencoder kl-divergence mnist

asked Feb 14 '21 at 21:56

axon

53
5

0

votes

0 answers

Why Does My Neural Network Have High Accuracy on Training Data but Only 10% Accuracy on Test Data?

I watched this 3Blue1Brown video series explaining how backpropagation works, then I decided to implement it by creating the neural network described in the series (I used the MNIST dataset for training and testing). Here it is: import…

neural-networks deep-learning backpropagation mnist

asked Jan 13 '25 at 15:48

Djebbar Abderrahmene

1

0

votes

1 answer

How can a convnet learn with a 3x3 output layer?

While studying the "Deep Learning with Python" book, I came across this MNIST example, and this is what the last conv2d layer looks like: conv2d_2 (Conv2D) (None, 3, 3, 64) …

neural-networks mnist

asked May 28 '22 at 20:30

Abcd

101
1

0

votes

0 answers

Why would my neural network have either an accuracy of 90% or 10% on the validation data, given a random initialization?

I'm making a custom neural network framework (in C++, if that is of any help). When I train the model on MNIST, depending on how happy the network is feeling, it'll give me either 90%+ accuracy, or get stuck at 9-10% (on the validation set). I shuffle…

neural-networks accuracy weights-initialization mnist validation

asked Mar 04 '21 at 21:53

Ilknur Mustafa

115
4

Questions tagged [mnist]