For questions about the backpropagation through time (BPTT) algorithm, which is often used to find the gradient of the objective function with respect to the parameters of a recurrent neural network (RNN) when training the RNN with gradient descent.
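As a rough illustration of what the tag covers, here is a minimal sketch of BPTT for a toy scalar RNN, assuming the recurrence $h_t = \tanh(w h_{t-1} + u x_t)$ and a squared-error loss summed over time steps. The names `forward`, `loss`, and `bptt` and the toy model are assumptions made for this example and do not come from any particular library.

```python
import math

def forward(w, u, xs, h0=0.0):
    """Run the toy RNN h_t = tanh(w*h_{t-1} + u*x_t); returns [h_0, h_1, ..., h_T]."""
    hs = [h0]
    for x in xs:
        hs.append(math.tanh(w * hs[-1] + u * x))
    return hs

def loss(w, u, xs, ys):
    """Squared-error loss summed over all time steps."""
    hs = forward(w, u, xs)
    return 0.5 * sum((h - y) ** 2 for h, y in zip(hs[1:], ys))

def bptt(w, u, xs, ys):
    """Gradient of the loss w.r.t. the shared parameters w and u."""
    hs = forward(w, u, xs)
    dw = du = 0.0
    dh_next = 0.0  # gradient reaching h_t from time step t+1
    for t in range(len(xs), 0, -1):
        dh = (hs[t] - ys[t - 1]) + dh_next  # local loss term plus future contribution
        da = dh * (1.0 - hs[t] ** 2)        # back through tanh
        dw += da * hs[t - 1]                # shared weights accumulate over time steps
        du += da * xs[t - 1]
        dh_next = da * w                    # hand the gradient back to h_{t-1}
    return dw, du
```

Because the same `w` and `u` are reused at every step, their gradients are sums over all time steps, which is what distinguishes BPTT from ordinary backpropagation in a feed-forward network.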
Questions tagged [bptt]
2 questions
3 votes, 0 answers
What is the time complexity for training a gated recurrent unit (GRU) neural network using back-propagation through time?
Let us assume we have a GRU network with $H$ layers and $H_i$ nodes in layer $i$, trained on a dataset with $K$ tuples and $I$ features.
I have a pretty basic idea of how the complexity of algorithms is calculated; however, with the…
rahul tomar
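One way to make the setup in the question above concrete is to count the multiply-accumulate operations in a single forward time step. The sketch below assumes a standard GRU with three weight blocks per layer (update gate, reset gate, candidate state) and one bias vector per block; some libraries use a different bias layout. The function names and the per-step framing are assumptions for illustration, not something stated in the question.

```python
def gru_layer_params(n_in, n_hidden):
    """Parameter count for one GRU layer, assuming one bias vector per block."""
    # Three blocks: update gate, reset gate, candidate state.
    return 3 * (n_hidden * n_in + n_hidden * n_hidden + n_hidden)

def gru_macs_per_step(n_features, hidden_sizes):
    """Multiply-accumulates for one forward time step through stacked GRU layers."""
    macs = 0
    n_in = n_features  # I in the question
    for n_h in hidden_sizes:  # H_i for each of the H layers
        macs += 3 * n_h * (n_in + n_h)  # input and recurrent matrix-vector products
        n_in = n_h  # the next layer consumes this layer's hidden state
    return macs
```

Under these assumptions, one pass over $K$ sequences of length $T$ (a symbol introduced here, not in the question) costs on the order of $K \cdot T$ times the per-step count, and the backward pass of BPTT adds a constant factor on top of the forward cost.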
0 votes, 0 answers
Why isn't my CNN-RNN model learning despite the CNN performing well?
I'm working on a model that combines a CNN with an LSTM to process sequences of spectrograms and make per-time-step predictions. The CNN alone performs well on the task, but after adding an LSTM for temporal modeling, the model's performance doesn't…