I have a seq2seq model with LSTM or GRU layers with Adam optimizer. Is there any recommendation to use a specific learning rate scheduler?
I have a seq2seq model with LSTM or GRU layers with Adam optimizer. Is there any recommendation to use a specific learning rate scheduler?