Questions tagged [bptt]

For questions about the backpropagation through time (BPTT) algorithm, which is often used to find the gradient of the objective function with respect to the parameters of a recurrent neural network (RNN) when training the RNN with gradient descent.
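As a rough illustration of what the tag covers, here is a minimal sketch of BPTT. It assumes a scalar vanilla RNN, $h_t = \tanh(w\,h_{t-1} + u\,x_t)$, with a squared-error loss summed over time steps. The function name `bptt_scalar_rnn`, the parameter names, and the loss are hypothetical choices made for this example; the same unroll-then-backpropagate idea carries over to GRUs and LSTMs, though their gate equations are not shown here.

```python
import math

def bptt_scalar_rnn(w, u, xs, ys, h0=0.0):
    """Return (loss, dL/dw, dL/du) for h_t = tanh(w*h_{t-1} + u*x_t).

    Assumed loss for this sketch: L = 0.5 * sum_t (h_t - y_t)^2.
    """
    # Forward pass: unroll the recurrence and store every hidden state.
    hs = [h0]
    for x in xs:
        hs.append(math.tanh(w * hs[-1] + u * x))
    loss = 0.5 * sum((h - y) ** 2 for h, y in zip(hs[1:], ys))

    # Backward pass: walk the unrolled graph from the last step to the first.
    grad_w = 0.0
    grad_u = 0.0
    dh_next = 0.0  # gradient reaching h_t from later time steps
    for t in range(len(xs), 0, -1):
        dh = (hs[t] - ys[t - 1]) + dh_next  # local loss term plus future terms
        da = dh * (1.0 - hs[t] ** 2)        # backprop through tanh
        grad_w += da * hs[t - 1]            # shared weight: sum over time steps
        grad_u += da * xs[t - 1]
        dh_next = da * w                    # pass the gradient on to h_{t-1}
    return loss, grad_w, grad_u
```

Calling `bptt_scalar_rnn(0.7, -0.4, [0.5, -1.0], [0.1, -0.2])` returns the loss together with both gradients. Because `w` and `u` are reused at every step, each gradient is a sum of per-step contributions, and both passes cost time linear in the sequence length.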

2 questions
3 votes, 0 answers

What is the time complexity for training a gated recurrent unit (GRU) neural network using back-propagation through time?

Let us assume we have a GRU network with $H$ layers and $H_i$ nodes in each layer, processing a training dataset with $K$ tuples and $I$ features. I have a pretty basic idea of how the complexity of algorithms is calculated; however, with the…
0 votes, 0 answers

Why isn't my CNN-RNN model learning despite the CNN performing well?

I'm working on a model that combines a CNN with an LSTM to process sequences of spectrograms and make per-time-step predictions. The CNN alone performs well on the task, but after adding an LSTM for temporal modeling, the model's performance doesn't…