As the title says, my GNN with three GAT (graph attention) layers only shows any change in its metrics when the learning rate is 1. Since the learning rate is usually chosen in (0, 1), should I be worried?
Also, here it says that if the learning rate is larger than 1, the update focuses solely on the gradient instead of the model parameters. I don't know whether this is a good or a bad thing. Why is the learning rate generally below 1?
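
To make the question concrete, here is a minimal sketch of the plain gradient-descent update as I understand it, run on a toy quadratic rather than my actual GAT model. The function name `gradient_descent` and the curvature values are made up for illustration. The loss is assumed to be f(theta) = 0.5 * curvature * theta**2, so each step multiplies theta by (1 - lr * curvature).

```python
def gradient_descent(lr, curvature, theta0=1.0, steps=50):
    """Plain gradient descent on the toy loss f(theta) = 0.5 * curvature * theta**2.

    Hypothetical helper for illustration only; not taken from any library.
    """
    theta = theta0
    for _ in range(steps):
        grad = curvature * theta   # df/dtheta for the toy loss
        theta = theta - lr * grad  # standard update: parameters minus lr * gradient
    return theta


learning_rates = (0.001, 0.1, 1.0)

# Nearly flat loss surface (tiny gradients): small learning rates barely move theta.
flat = {lr: gradient_descent(lr, curvature=0.01) for lr in learning_rates}

# Steep loss surface (large gradients): lr = 1.0 overshoots and blows up.
steep = {lr: gradient_descent(lr, curvature=4.0) for lr in learning_rates}

for lr in learning_rates:
    print(f"lr={lr}: flat -> {flat[lr]:.4g}, steep -> {steep[lr]:.4g}")
```

In this toy setting, lr = 1 makes the most progress when the gradients are tiny and diverges when they are large, so the usable range seems to depend on the scale of the gradients rather than on the value 1 itself. Whether that is what is happening in my model (for example, very small gradients through the attention layers) is only a guess.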